ISO/IEC 8859
ISO/IEC 8859 is a joint International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC) series of standards for 8-bit character encodings. The series consists of numbered parts, such as ISO/IEC 8859-1, ISO/IEC 8859-2, etc. There are 15 parts, excluding the abandoned ISO/IEC 8859-12. The ISO working group maintaining this series of standards has been disbanded. ISO/IEC 8859 parts 1, 2, 3, and 4 were originally Ecma International standard ECMA-94. Introduction: While the bit patterns of the 95 printable ASCII characters are sufficient to exchange information in modern English, most other languages that use Latin alphabets need additional symbols not covered by ASCII. ISO/IEC 8859 sought to remedy this problem by using the eighth bit in an 8-bit byte to allow positions for another 96 printable characters. Early encodings were limited to 7 bits because of restrictions of som ...
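The counts quoted above (95 printable ASCII characters plus another 96 positions opened up by the eighth bit) can be checked with a short sketch. The byte ranges 0x20-0x7E and 0xA0-0xFF are not stated in the excerpt and are filled in here as assumptions about the usual layout; the codec names `iso8859_1` and `iso8859_2` are those of Python's standard library.

```python
# Printable ASCII occupies 0x20-0x7E (space through tilde).
printable_ascii = range(0x20, 0x7F)

# Assumed layout: with the eighth bit set, the parts of the series
# place their additional graphic characters in 0xA0-0xFF.
high_graphic = range(0xA0, 0x100)

print(len(printable_ascii))  # 95
print(len(high_graphic))     # 96

# Every byte in the upper range has the most significant (eighth) bit set.
assert all(b & 0x80 for b in high_graphic)

# The same byte decodes to different characters under different parts.
sample = bytes([0xE6])
print(sample.decode("iso8859_1"), sample.decode("iso8859_2"))
```

The last line illustrates why the part number has to travel with the data: the byte alone does not identify the character.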


ISO 9
ISO 9 is an international standard establishing a system for the transliteration into Latin characters of Cyrillic characters constituting the alphabets of many Slavic and non-Slavic languages. It was published on February 23, 1995 by the International Organization for Standardization. Its major advantage over competing systems is its univocal system of one-character-for-one-character equivalents (by the use of diacritics), which faithfully represents the original spelling and allows for reverse transliteration, even if the language is unknown. Earlier versions of the standard, ISO/R 9:1954, ISO/R 9:1968 and ISO 9:1986, were more closely based on the international scholarly system for linguistics (scientific transliteration), but later editions diverged in favour of unambiguous transliteration over phonemic representation. The edition of 1995 supersedes the edition of 1986. ISO 9:1995: The standard features three mapping tables: the first covers contemporary Slavic languages, ...
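The one-character-for-one-character property is what makes reverse transliteration mechanical. The sketch below uses a small, illustrative subset of lowercase mappings believed to match ISO 9:1995; it is not the full table, and the names `ISO9_SUBSET`, `to_latin` and `to_cyrillic` are hypothetical.

```python
# Partial, illustrative subset of ISO 9:1995 lowercase mappings
# (an assumption for demonstration, not the complete standard table).
ISO9_SUBSET = {
    "\u0430": "a",       # а
    "\u043a": "k",       # к
    "\u0443": "u",       # у
    "\u0436": "\u017e",  # ж -> ž
    "\u0447": "\u010d",  # ч -> č
    "\u0448": "\u0161",  # ш -> š
    "\u0449": "\u015d",  # щ -> ŝ
    "\u044e": "\u00fb",  # ю -> û
    "\u044f": "\u00e2",  # я -> â
}

# One source character maps to exactly one target character,
# so the table can be inverted without ambiguity.
assert len(set(ISO9_SUBSET.values())) == len(ISO9_SUBSET)

TO_LATIN = str.maketrans(ISO9_SUBSET)
TO_CYRILLIC = str.maketrans({lat: cyr for cyr, lat in ISO9_SUBSET.items()})


def to_latin(text: str) -> str:
    """Transliterate Cyrillic to Latin using the subset table."""
    return text.translate(TO_LATIN)


def to_cyrillic(text: str) -> str:
    """Reverse the transliteration using the inverted table."""
    return text.translate(TO_CYRILLIC)


word = "\u0449\u0443\u043a\u0430"  # щука
latin = to_latin(word)
restored = to_cyrillic(latin)
print(latin, restored)
```

Because no digraphs are involved, the reverse step needs no knowledge of the source language, which is the advantage the excerpt describes.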


Kurmanji Alphabet
Kurdish is written using either of two alphabets: the Latin-based Bedirxan or Hawar alphabet, introduced by Celadet Alî Bedirxan in 1932 and popularized through the Hawar magazine, and the Kurdo-Arabic alphabet. The Kurdistan Region has agreed upon a standard for Central Kurdish, implemented in Unicode for computation purposes. The Hawar alphabet is primarily used in Syria and Turkey, while the Kurdo-Arabic alphabet is commonly used in Iraq and Iran. The Hawar alphabet is also used to some extent in Iraqi Kurdistan. Two additional alphabets, based on the Armenian and Cyrillic scripts, were once used by Kurds in the Soviet Union, most notably in the Armenian Soviet Socialist Republic and Kurdistansky Uyezd. As of 2024, Southern Kurdish lacks a standard orthography. Hawar alphabet: Usually it is the northern languages spoken by Kurds, Zazaki and Kurmanji, that are written in the extended Latin alphabet consisting of the 26 letters of the ISO basic Latin alphabet with 5 le ...


Circumflex
The circumflex is a diacritic in the Latin and Greek scripts that is also used in the written forms of many languages and in various romanization and transcription schemes. It received its English name from Latin circumflexus, "bent around", a translation of the Greek περισπωμένη (perispōménē). The circumflex in the Latin script is chevron-shaped, while the Greek circumflex may be displayed either like a tilde or like an inverted breve. For the most commonly encountered uses of the accent in the Latin alphabet, precomposed characters are available. In English, the circumflex, like other diacritics, is sometimes retained on loanwords that used it in the original language (for example, entrepôt and crème brûlée). In mathematics and statistics, the circumflex diacritic is sometimes used to denote a function and is called a hat operator. A free-standing version of the circumflex symbol, ^, is encoded in ASCII and Unicode and has become known as the caret and has acquired special uses, particularly i ...
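The relationship between a base letter plus a combining circumflex and its precomposed form can be sketched with Python's standard `unicodedata` module. The specific code points used (U+0302 for the combining circumflex, U+00EA for ê, U+005E for the free-standing caret) are not given in the excerpt and are supplied here as known Unicode assignments.

```python
import unicodedata

# "e" followed by the combining circumflex accent (U+0302): two code points.
decomposed = "e\u0302"

# NFC normalization composes the pair into the precomposed letter ê (U+00EA).
composed = unicodedata.normalize("NFC", decomposed)
print(len(decomposed), len(composed))

# NFD goes the other way, splitting the precomposed letter again.
assert unicodedata.normalize("NFD", composed) == decomposed

# The free-standing ASCII character (the caret) is a separate code point.
caret = "^"
print(hex(ord(caret)))  # 0x5e
```

Comparing text that may mix both forms is usually done after normalizing both sides to the same form.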


ISO/IEC 8859-4
ISO/IEC 8859-4:1998, "Information technology — 8-bit single-byte coded graphic character sets — Part 4: Latin alphabet No. 4", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1988. It is informally referred to as Latin-4 or "North European". It was designed to cover Estonian, Latvian, Lithuanian, Greenlandic, and Sámi. It has been largely superseded by ISO/IEC 8859-10 and Unicode. Microsoft has assigned code page 28594, a.k.a. Windows-28594, to ISO-8859-4 in Windows. IBM has assigned code page 914 (CCSID 914) to ISO 8859-4. ISO-8859-4 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. ISO-IR 205 (called code page 58258 by FreeDOS) replaces the generic Currency Sign at 0xA4 with the Euro Sign. Codepage layout: Differences from ISO-8859-1: ISO/IEC 8859-1:1998, "Information technology—8-bit single-byte coded graphi ...
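A minimal sketch of what Latin-4 coverage means in practice, using Python's standard `iso8859_4` codec. The sample word is an assumption chosen for illustration: the Latvian place name "Rīga", whose "ī" (U+012B) is absent from ISO/IEC 8859-1.

```python
text = "R\u012bga"  # "Rīga"; the letter ī is U+012B

# ISO-8859-4 is a single-byte encoding: one byte per character.
encoded = text.encode("iso8859_4")
print(len(encoded))  # 4
assert encoded.decode("iso8859_4") == text

# The same text cannot be represented in ISO-8859-1.
try:
    text.encode("iso8859_1")
    latin1_ok = True
except UnicodeEncodeError:
    latin1_ok = False
print(latin1_ok)  # False

# Byte 0xA4 is the generic currency sign (U+00A4) in plain ISO-8859-4.
currency = b"\xa4".decode("iso8859_4")
print(hex(ord(currency)))  # 0xa4
```

The ISO-IR 205 variant mentioned above, which puts the Euro Sign at 0xA4, is not among the codecs used here.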


ISO/IEC 8859-9
ISO/IEC 8859-9:1999, "Information technology — 8-bit single-byte coded graphic character sets — Part 9: Latin alphabet No. 5", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1989. It is designated ECMA-128 by Ecma International and TS 5881 as a Turkish standard. It is informally referred to as Latin-5 or "Turkish". It was designed to cover the Turkish language and to be more useful for it than the ISO/IEC 8859-3 encoding; the vast majority of users use it for Turkish, although it can also be used for some other languages. It is identical to ISO/IEC 8859-1 except for the replacement of six Icelandic characters (Ðð, Ýý, Þþ) with characters unique to the Turkish alphabet (Ğğ, İ, ı, Şş). In Turkish, the uppercase of i is İ and the lowercase of I is ı. ISO-8859-9 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. ...
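The "identical except for six characters" claim can be checked by decoding every byte value under both encodings and collecting the mismatches. This is a sketch using Python's standard `iso8859_1` and `iso8859_9` codecs; the variable names are illustrative.

```python
# Map each differing byte to (Latin-1 character, Latin-5 character).
diffs = {}
for b in range(0x100):
    latin1 = bytes([b]).decode("iso8859_1")
    latin5 = bytes([b]).decode("iso8859_9")
    if latin1 != latin5:
        diffs[b] = (latin1, latin5)

# Print one line per replaced position.
for b, (l1, l5) in diffs.items():
    print(f"0x{b:02X}: {l1} -> {l5}")

print(len(diffs))  # 6
```

A consequence is that text mislabeled between the two encodings only goes wrong at those six positions, which can make such errors easy to overlook in mostly Western European content.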




ISO/IEC 8859-3
ISO/IEC 8859-3:1999, "Information technology — 8-bit single-byte coded graphic character sets — Part 3: Latin alphabet No. 3", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1988. It is informally referred to as Latin-3 or "South European". It was designed to cover Turkish, Maltese and Esperanto, though the introduction of ISO/IEC 8859-9 superseded it for Turkish. The encoding was popular among users of Esperanto, but fell out of use as application support for Unicode became more common. ISO-8859-3 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. Microsoft has assigned code page 28593, a.k.a. Windows-28593, to ISO-8859-3 in Windows. IBM has assigned code page 913 (CCSID 913) to ISO 8859-3. Codepage layout: Differences from ISO-8859-1: ISO/IEC 8859-1:1998, "Information technology—8-bit single-byte coded graphic char ...


ISO/IEC 8859-1
ISO/IEC 8859-1:1998, "Information technology—8-bit single-byte coded graphic character sets—Part 1: Latin alphabet No. 1", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1987. ISO/IEC 8859-1 encodes what it refers to as "Latin alphabet no. 1", consisting of 191 characters from the Latin script. This character-encoding scheme is used throughout the Americas, Western Europe, Oceania, and much of Africa. It is the basis for some popular 8-bit character sets and the first two blocks of characters in Unicode. 1.1% of all web sites use ISO-8859-1. It is the most declared single-byte character encoding, but as Web browsers and the HTML5 standard interpret such declarations as the superset Windows-1252, these documents may include characters from that set. Some countries show higher usage than the global average: in 2025, website usage was 2.9% in Brazil and 2.3% in Germany. ISO-8859-1 was (ac ...
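Two points from this excerpt lend themselves to a quick check: that ISO-8859-1 lines up with the start of Unicode, and that Windows-1252 reads some bytes differently. The sketch below uses Python's standard `iso8859_1` and `cp1252` codecs; the graphic ranges 0x20-0x7E and 0xA0-0xFF are assumptions filled in to reproduce the figure of 191 characters.

```python
# Each byte value in ISO-8859-1 equals the Unicode code point it decodes to.
assert all(ord(bytes([b]).decode("iso8859_1")) == b for b in range(0x100))

# Assumed graphic ranges: printable ASCII plus the upper 0xA0-0xFF block.
graphic = list(range(0x20, 0x7F)) + list(range(0xA0, 0x100))
print(len(graphic))  # 191

# A byte in 0x80-0x9F is a C1 control under the IANA charset ISO-8859-1,
# but Windows-1252 assigns printable characters to most of that range.
byte = b"\x80"
as_latin1 = byte.decode("iso8859_1")  # U+0080, a control character
as_cp1252 = byte.decode("cp1252")     # U+20AC, the euro sign
print(hex(ord(as_latin1)), hex(ord(as_cp1252)))
```

This is why a page declared as ISO-8859-1 can still display a euro sign in a browser: the declaration is treated as Windows-1252.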


ISO/IEC 8859-10
ISO/IEC 8859-10:1998, "Information technology — 8-bit single-byte coded graphic character sets — Part 10: Latin alphabet No. 6", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1992. It is informally referred to as Latin-6. It was designed to cover the Nordic languages and was considered more useful for them than ISO 8859-4. ISO-8859-10 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. Microsoft has assigned code page 28600, a.k.a. Windows-28600, to ISO-8859-10 in Windows. IBM has assigned code page 919 to ISO-8859-10. It is published by Ecma International as ECMA-144. Codepage layout: Differences from ISO-8859-1 have the Unicode code point number below the character. ISO-IR 158 codepage layout: ISO-IR 158 is a supplementary ISO 2022 graphical set, containing characters which are absent in ISO-8859-10, but which are required for writing Skolt Sami or hi ...


Afrikaans
Afrikaans is a West Germanic language spoken in South Africa, Namibia and, to a lesser extent, Botswana, Zambia, Zimbabwe and also Argentina, where a group in Sarmiento speaks the Patagonian dialect. It evolved from the Dutch vernacular of South Holland (Hollandic dialect) spoken by the predominantly Dutch settlers and enslaved population of the Dutch Cape Colony, where it gradually began to develop distinguishing characteristics in the 17th and 18th centuries. Although Afrikaans has adopted words from other languages, including German, Malay and Khoisan languages, an estimated 90 to 95% of the vocabulary of Afrikaans is of Dutch origin. Differences between Afrikaans and Dutch often lie in the more analytic morphology and grammar of Afrikaans, and differ ...


ISO/IEC 8859-14
ISO/IEC 8859-14:1998, "Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic)", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings; its first edition was published in 1998. It is informally referred to as Latin-8 or "Celtic". It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton. ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604, a.k.a. Windows-28604, to ISO-8859-14. FreeDOS assigned code page 58163 to ISO-8859-14. History: ISO-8859-14 was originally proposed for the Sami languages. ISO 8859-12 was proposed for Celtic. Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed t ...


EBCDIC
Extended Binary Coded Decimal Interchange Code (EBCDIC) is an eight-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding six-bit binary-coded decimal code used with most of IBM's computer peripherals of the late 1950s and early 1960s. It is supported by various non-IBM platforms, such as Fujitsu-Siemens' BS2000/OSD, OS-IV, MSP, and MSP-EX, the SDS Sigma series, Unisys VS/9, Unisys MCP and ICL VME. History: EBCDIC was devised in 1963 and 1964 by IBM and was announced with the release of the IBM System/360 line of mainframe computers. It is an eight-bit character encoding, developed separately from the seven-bit ASCII encoding scheme. It was created to extend the existing Binary-Coded Decimal (BCD) Interchange Code, or BCDIC, which itself was devised as an efficient means of encoding the two "zone" and "number" punches on punched cards into six ...
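To make the separation from ASCII concrete, the sketch below encodes the same text both ways. EBCDIC exists in many code pages; `cp037` (a US/Canada variant shipped with Python's standard codecs) is used here purely as an assumption for illustration, and the sample string is arbitrary.

```python
text = "HELLO 360"

ascii_bytes = text.encode("ascii")
ebcdic_bytes = text.encode("cp037")  # one EBCDIC code page among many

# Same characters, entirely different byte values.
print(ascii_bytes.hex(" "))
print(ebcdic_bytes.hex(" "))
assert ebcdic_bytes.decode("cp037") == text

# Unlike ASCII, the Latin letters are not contiguous in EBCDIC:
# the step from "I" to "J" skips several code positions.
gap_ebcdic = "J".encode("cp037")[0] - "I".encode("cp037")[0]
gap_ascii = ord("J") - ord("I")
print(gap_ebcdic, gap_ascii)  # 8 1
```

Code that assumes letters form one unbroken range, such as a check written as `"A" <= c <= "Z"` on raw bytes, therefore does not carry over to EBCDIC data.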