ISO-IR-209
   HOME

TheInfoList



OR:

ISO-IR-197 (known by the
ISO-IR ISO/IEC 2022 ''Information technology—Character code structure and extension techniques'', is an ISO/IEC standard (equivalent to the ECMA standard ECMA-35, the ANSI standard ANSI X3.41 and the Japanese Industrial Standard JIS X 0202) in the f ...
registration number of its GR set) is an
8-bit In computer architecture, 8-bit Integer (computer science), integers or other Data (computing), data units are those that are 8 bits wide (1 octet (computing), octet). Also, 8-bit central processing unit (CPU) and arithmetic logic unit (ALU) arc ...
, single-byte
character encoding Character encoding is the process of assigning numbers to Graphics, graphical character (computing), characters, especially the written characters of Language, human language, allowing them to be Data storage, stored, Data communication, transmi ...
which was designed for the
Sámi languages Sámi languages ( ), in English also rendered as Sami and Saami, are a group of Uralic languages spoken by the Sámi people in Northern Europe (in parts of northern Finland, Norway, Sweden, and extreme northwestern Russia). There are, dependin ...
. It is a modification of
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1 ...
, replacing certain punctuation and symbol characters with additional letters used in certain Sámi orthographies. ISO-IR-197 was proposed for establishment as a part of
ISO/IEC 8859 ISO/IEC 8859 is a joint ISO and IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC 8859-1, ISO/IEC 8859-2, etc. There are 15 parts, excluding the abandoned ISO/IEC 8859-12. ...
in 1996 (as part 14 and, later,
part 15 Code of Federal Regulations, 'Title 47, Part 15(47 CFR 15) is an oft-quoted part of Federal Communications Commission (FCC) rules and regulations regarding unlicensed transmissions. It is a part of Title 47 of the Code of Federal Regulations ...
), but was not accepted for this. However, ISO-IR-197 is referenced in an informative ISO/IEC 8859 annex, which lists it as an encoding which provides a more adequate coverage of the orthography of certain Sámi languages such as
Skolt Sámi Skolt Sámi ( , "the Sámi language", or , "the Eastern Sámi language", if a distinction needs to be made between it and the other Sámi languages) is a Uralic, Sámi language that is spoken by the Skolts, with approximately 300 speakers in ...
than
ISO-8859-4 ISO/IEC 8859-4:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1988. I ...
or
ISO-8859-10 ISO/IEC 8859-10:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 10: Latin alphabet No. 6'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1992. ...
, unless the latter is combined with ISO-IR-158.


Code page layout

Differences from
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1 ...
have their Unicode code point.


Windows extension

As documented by
Evertype Michael Everson (born January 9, 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over a hundred books since 2006. His ...
, some Windows implementations use a variant which adds graphical characters to the C1 area ( 0x80-9F), including some of the other characters from the Mac OS Sámi repertoire. This was intended to be analogous to the Windows version of
Latin-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1 ...
(i.e.
Windows-1252 Windows-1252 or CP-1252 ( code page 1252) is a single-byte character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows for English and many European languages including Spanish, French, and German. It ...
), and follows its layout where possible. Differences from Windows-1252 have their Unicode code point:


ISO-IR-209

ISO-IR-209 is an update that replaced the guillemets at 0xAB and 0xBB with the letter H with caron to add
Finnish Romani Finnish Kalo () is a language of the Romani language, Romani language family (a subgroup of Indo-European languages, Indo-European) spoken by Finnish Kale. The language is related to but not mutually intelligible with Scandoromani language, Scand ...
support.


References

{{character encoding ISO/IEC 8859