
ISO/IEC 8859-14:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic)'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1998. It is informally referred to as Latin-8 or ''Celtic''. It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton. ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604 a.k.a. Windows-28604 to ISO-8859-14.
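As a rough illustration of what the charset name means in practice, the sketch below round-trips a few words through the `iso8859_14` codec that ships with Python's standard library (`iso-8859-14` is an accepted alias). The sample words and the helper name `roundtrip` are assumptions chosen for illustration; they do not come from the standard.

```python
# Round-trip Celtic-language text through ISO-8859-14 with Python's
# built-in codec. Sample words are illustrative; code points are spelled
# out so that only precomposed characters are used.
SAMPLES = [
    "d\u0175r",      # Welsh "dŵr" (w with circumflex)
    "t\u0177",       # Welsh "tŷ" (y with circumflex)
    "\u1E03\u00ED",  # Irish "ḃí" (b with dot above, i with acute)
]

def roundtrip(text: str) -> bytes:
    """Encode text as ISO-8859-14 and check that decoding restores it."""
    raw = text.encode("iso-8859-14")
    # Single-byte encoding: exactly one byte per character.
    assert len(raw) == len(text)
    assert raw.decode("iso-8859-14") == text
    return raw

for word in SAMPLES:
    print(word, roundtrip(word).hex(" "))
```

Because the encoding is single-byte, the encoded length always equals the number of characters, which is what the helper checks.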


History

ISO-8859-14 was originally proposed for the Sami languages. ISO 8859-12 was proposed for Celtic. Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to ISO 8859-15, but it was rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197. The original proposal used a different arrangement of points 0xA1–0xBF. At the committee draft stage of the specification, a dotless i (ı) was included at 0xAE, which was changed to a registered trademark sign (®), matching ISO-8859-1, in the final publication.

ISO-IR-182, an earlier modification of ISO-8859-1 registered in 1994, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for Welsh language use. The final published version of ISO-8859-14 includes these letters in the same positions at which they appear in ISO-IR-182.
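The claim about the Welsh letters can be spot-checked with Python's bundled codecs. This is only a sketch: Python has no ISO-IR-182 codec, so it cannot confirm the shared positions. It only shows that all fourteen letters are encodable in ISO-8859-14, while ISO-8859-1 covers ÿ alone. The names `UPPER`, `WELSH` and `encodable` are hypothetical helpers.

```python
# The seven uppercase Welsh letters named above, written as code points
# to avoid ambiguity between precomposed and combining forms.
UPPER = "\u1E80\u1E82\u1E84\u1EF2\u0178\u0174\u0176"  # Ẁ Ẃ Ẅ Ỳ Ÿ Ŵ Ŷ
WELSH = UPPER + UPPER.lower()                          # 14 letters in total

def encodable(ch: str, codec: str) -> bool:
    """Return True if ch can be encoded in the given codec."""
    try:
        ch.encode(codec)
        return True
    except UnicodeEncodeError:
        return False

in_latin8 = [c for c in WELSH if encodable(c, "iso8859_14")]
in_latin1 = [c for c in WELSH if encodable(c, "latin_1")]

print(len(in_latin8), "of", len(WELSH), "letters fit in ISO-8859-14")
print("Already present in ISO-8859-1:", in_latin1)
```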


Codepage layout

Differences from ISO-8859-1 have the Unicode code point number below the character.
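One way to list the positions that differ from ISO-8859-1 is to decode every byte in the upper half with both codecs and keep the mismatches. The sketch below does this with Python's standard-library codecs `iso8859_14` and `latin_1`; the function name `differences` is an assumption, and the output reflects Python's mapping tables rather than the text of the standard itself.

```python
def differences(codec_a: str = "iso8859_14", codec_b: str = "latin_1") -> dict:
    """Map each byte in 0xA0..0xFF that decodes differently to (char_a, char_b)."""
    diffs = {}
    for byte in range(0xA0, 0x100):
        a = bytes([byte]).decode(codec_a)
        b = bytes([byte]).decode(codec_b)
        if a != b:
            diffs[byte] = (a, b)
    return diffs

# Print each differing position with its character and Unicode code point.
for byte, (a, b) in differences().items():
    print(f"0x{byte:02X}  {a}  U+{ord(a):04X}  (ISO-8859-1 has {b})")
```

Positions that keep their ISO-8859-1 character, such as 0xAE (®) and 0xB6 (¶), do not appear in the result.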


Draft layout

In the first draft, positions 0xA0–0xBF were arranged differently. The draft did not include the pilcrow sign (¶); it kept the cent sign (¢) at its Latin-1 position instead. Later, it was ruled that the pilcrow sign was more common, so in the final standard the pilcrow sign remains at its Latin-1 position and the cent sign was removed instead. Differences from ISO-8859-14 have the Unicode code point below them.
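The outcome of that decision is visible in the final encoding. The sketch below uses Python's `iso8859_14` and `latin_1` codecs to look up the byte position of each sign. The helper `position` is a hypothetical name, and the draft layout itself is not available as a Python codec, so only the published standard is checked.

```python
PILCROW = "\u00B6"  # ¶
CENT = "\u00A2"     # ¢

def position(ch: str, codec: str = "iso8859_14"):
    """Return the byte value of ch in codec, or None if it is not encodable."""
    try:
        return ch.encode(codec)[0]
    except UnicodeEncodeError:
        return None

# The pilcrow keeps its Latin-1 position; the cent sign has no position at all.
print("pilcrow in ISO-8859-14:", position(PILCROW))
print("cent in ISO-8859-14:   ", position(CENT))
print("cent in ISO-8859-1:    ", position(CENT, "latin_1"))
```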



External links


ISO/IEC 8859-14:1998
ISO-IR 199 Celtic Supplementary Latin Set ''(May 1, 1998, submitted by Irish body NSAI/AGITS/WG6)''