Latin Script In Unicode
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters with diacritics, which can equivalently be encoded as a base letter plus combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages (including click symbols in Latin Extended-B) and in the Vietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription. As of version of the Unicode Standard, 1,487 characters in the following 19 blocks are classified as belonging to the Latin script.
* Basic Latin, 0000–007F. This block correspond ...
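The equivalence between precomposed letters and combining sequences described in this entry can be checked with Unicode normalization. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `canonically_equal` and the choice of the letter é are illustrative assumptions, not part of the source.

```python
import unicodedata

# Two encodings of the same letter:
precomposed = "\u00e9"   # LATIN SMALL LETTER E WITH ACUTE, one code point
combining = "e\u0301"    # "e" followed by COMBINING ACUTE ACCENT, two code points


def canonically_equal(a: str, b: str) -> bool:
    """Hypothetical helper: compare two strings after canonical decomposition (NFD)."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


# The raw strings differ code point by code point...
print(precomposed == combining)
# ...but they are canonically equivalent once normalized.
print(canonically_equal(precomposed, combining))
# NFC composes the combining sequence back into the precomposed letter.
print(unicodedata.normalize("NFC", combining) == precomposed)
```

Comparing normalized forms rather than raw strings is the usual way to treat the two encodings as the same text.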
Latin Script
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and their alphabet was subsequently altered by the ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet. The Latin script is the basis of the International Phonetic Alphabet (IPA), and the 26 most widespread letters are those contained in the ISO basic Latin alphabet, which are the same letters as the English alphabet. Latin script is the basis for the largest number of alphabets of any writing system and is the most widely adopted writing system in the world. Latin script is used as the standard method of writing the languages of Western and ...
Latin Extended-B
Latin Extended-B is the fourth block (0180–024F) of the Unicode Standard. It has been included since version 1.0, where it was allocated only the code points 0180–01FF and contained 113 characters. During unification with ISO 10646 for version 1.1, the block range was extended by 80 code points and another 35 characters were assigned. In version 3.0 and later, the last 60 available code points in the block were assigned. Its block name in Unicode 1.0 was Extended Latin. The Latin Extended-B block contains ten subheadings for groups of characters: Non-European and historic Latin, African letters for clicks, Croatian digraphs matching Serbian Cyrillic letters, Pinyin diacritic-vowel combinations, Phonetic and historic letters, Additions for Slovenian and Croatian, Additions for Romanian, Miscellaneous additions, Additions for Livonian, and Additions for Sinology. The Non-European and historic, African clicks, Croatian digraphs, Pinyin, and the first p ...
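The code point counts quoted for Latin Extended-B are mutually consistent, which a short calculation makes explicit. This is only an arithmetic sketch over the figures given in the entry; the variable names are illustrative.

```python
# Block ranges as stated in the entry (inclusive bounds).
v1_0_first, v1_0_last = 0x0180, 0x01FF   # allocation in Unicode 1.0
block_first, block_last = 0x0180, 0x024F  # allocation after the 1.1 extension

v1_0_size = v1_0_last - v1_0_first + 1    # code points in the 1.0 range
block_size = block_last - block_first + 1  # code points in the full block

extension = block_size - v1_0_size         # code points added for version 1.1
assigned_after_1_1 = 113 + 35              # characters in 1.0 plus those added in 1.1
unassigned_after_1_1 = block_size - assigned_after_1_1

print(v1_0_size, block_size, extension, unassigned_after_1_1)
```

The extension comes out at 80 code points and the remainder at 60, matching the figures in the text for version 1.1 and for version 3.0 and later.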
Halfwidth And Fullwidth Forms (Unicode Block)
Halfwidth and Fullwidth Forms is a Unicode block (U+FF00–FFEF), provided so that older encodings containing both halfwidth and fullwidth characters can be translated to and from Unicode without loss. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants. The range U+FF01–FF5E reproduces the characters of ASCII 21 to 7E (hexadecimal) as fullwidth forms. U+FF00 does not correspond to a fullwidth ASCII 20 (space character), since that role is already fulfilled by U+3000 "ideographic space". The range U+FF61–FF9F encodes halfwidth forms of katakana and related punctuation in a transposition of A1 to DF in the JIS X 0201 encoding (see half-width kana). The range U+FFA0–FFDC encodes halfwidth forms of compatibility jamo characters for Hangul, in a transposition of their 1974 standard layout. It is used in the mapping of some IBM encodings for K ...
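Because U+FF01–FF5E mirrors ASCII 21–7E in the same order, converting between the two is a fixed offset. The sketch below assumes that offset (0xFF01 minus 0x21) and maps the space to U+3000 as described in the entry; the function names `to_fullwidth` and `to_halfwidth` are hypothetical.

```python
# Distance between an ASCII character and its fullwidth counterpart.
FULLWIDTH_OFFSET = 0xFF01 - 0x21  # 0xFEE0
IDEOGRAPHIC_SPACE = "\u3000"      # plays the role of a fullwidth space


def to_fullwidth(text: str) -> str:
    """Map printable ASCII (0x21-0x7E) and the space to fullwidth forms."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0x21 <= cp <= 0x7E:
            out.append(chr(cp + FULLWIDTH_OFFSET))
        elif cp == 0x20:
            out.append(IDEOGRAPHIC_SPACE)
        else:
            out.append(ch)  # leave everything else untouched
    return "".join(out)


def to_halfwidth(text: str) -> str:
    """Inverse mapping for U+FF01-FF5E and U+3000."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xFF01 <= cp <= 0xFF5E:
            out.append(chr(cp - FULLWIDTH_OFFSET))
        elif ch == IDEOGRAPHIC_SPACE:
            out.append(" ")
        else:
            out.append(ch)
    return "".join(out)


print(to_fullwidth("Unicode 1.0"))
print(to_halfwidth(to_fullwidth("Unicode 1.0")))
```

The sketch deliberately ignores the halfwidth katakana and Hangul ranges, which are transpositions of other encodings and do not follow this single offset.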
Alphabetic Presentation Forms
Alphabetic Presentation Forms is a Unicode block containing standard ligatures for the Latin, Armenian, and Hebrew scripts. Unicode-related documents record the purpose and process of defining specific characters in the Alphabetic Presentation Forms block. See also: Armenian (Unicode block); Latin alphabet in Unicode; Hebrew alphabet in Unicode; Precomposed character; Arabic Presentation Forms-A; Arabic Presentation Forms-B. A precomposed character (alternatively composite character or decomposable character) is a Unicode entity that can also be defined as a sequence of one or more other characters. A precomposed character may typically represent a letter with a diac ...
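Ligatures in this block carry compatibility decompositions, so compatibility normalization (NFKC) expands them into ordinary letters. The following sketch uses Python's `unicodedata`; the three sample characters were picked for illustration, one per script named in the entry.

```python
import unicodedata

# One ligature from each script covered by the block (illustrative picks).
samples = {
    "latin": "\ufb01",     # LATIN SMALL LIGATURE FI
    "armenian": "\ufb13",  # ARMENIAN SMALL LIGATURE MEN NOW
    "hebrew": "\ufb4f",    # HEBREW LIGATURE ALEF LAMED
}

expanded = {}
for script, ligature in samples.items():
    # NFKC replaces the ligature with its constituent letters.
    expanded[script] = unicodedata.normalize("NFKC", ligature)
    print(script, unicodedata.name(ligature), [f"U+{ord(c):04X}" for c in expanded[script]])
```

Canonical normalization (NFC or NFD) leaves these ligatures unchanged, so search and comparison code that should treat "ﬁ" as "fi" needs the compatibility forms.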
Latin Extended-E
Latin Extended-E is a Unicode block containing Latin script characters used in German dialectology (Teuthonista), the Anthropos alphabet, Sakha and Americanist usage. It was introduced in 2014. Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-E block. See also: Greek alphabet in Unicode.
Latin Extended-D
Latin Extended-D is a Unicode block containing Latin characters for phonetic, Mayanist, and medieval transcription and notation systems. Eighty-nine of the characters in this block are medieval characters proposed by the Medieval Unicode Font Initiative, many of which represent scribal abbreviations used in medieval manuscript texts. Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-D block.
Latin Extended-C
Latin Extended-C is a Unicode block containing Latin characters for Uighur New Script, the Uralic Phonetic Alphabet, Shona, Claudian Latin and the Swedish Dialect Alphabet. Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended-C block. See also: Phonetic symbols in Unicode. Unicode supports several phonetic scripts and notation systems through its existing scripts and the addition of extra blocks with phonetic characters. These phonetic characters are derived from an existing script, usually Latin, Gr ...
Number Forms
Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the characters in the Number Forms block, three fractions (¼, ½, and ¾) were inherited from ISO 8859-1, which was incorporated whole as the Latin-1 Supplement block. The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) to FF (U+00FF). C1 Controls (0080–009F) are not graphic. Unicode-related documents record the purpose and process of defining specific characters in the Number Forms block. See also: Latin script in Unicode; Unicode symbols.
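The "specific meaning as numbers" carried by these characters is exposed through the Unicode numeric value property, and their construction from other characters shows up under compatibility normalization. A minimal sketch with Python's `unicodedata`, using a few sample characters chosen for illustration:

```python
import unicodedata

one_third = "\u2153"      # VULGAR FRACTION ONE THIRD (Number Forms)
roman_twelve = "\u216b"   # ROMAN NUMERAL TWELVE (Number Forms)
one_half = "\u00bd"       # VULGAR FRACTION ONE HALF (Latin-1 Supplement)

# Numeric value of each character, returned as a float.
values = {ch: unicodedata.numeric(ch) for ch in (one_third, roman_twelve, one_half)}
for ch, value in values.items():
    print(unicodedata.name(ch), value)

# Compatibility normalization rebuilds the character from simpler ones.
roman_expanded = unicodedata.normalize("NFKC", roman_twelve)
half_expanded = unicodedata.normalize("NFKC", one_half)
print(roman_expanded, [f"U+{ord(c):04X}" for c in half_expanded])
```

Note that the expanded fraction uses U+2044 FRACTION SLASH between the digits rather than the ASCII solidus.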
Letterlike Symbols (Unicode Block)
Letterlike Symbols is a Unicode block containing 80 characters which are constructed mainly from the glyphs of one or more letters. In addition to this block, Unicode includes full styled mathematical alphabets, although Unicode does not explicitly categorize these characters as being "letterlike". Variation selectors may be used to specify chancery (U+FE00) vs roundhand (U+FE01) forms, if the font supports them. The remainder of the set is at Mathematical Alphanumeric Symbols. The Letterlike Symbols block contains two emoji: U+2122 and U+2139. The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation. Unicode-related documents record the purpose and process of defining specific characters in the Letterlike Symbols block. See also: Greek in Unicode; Latin script in Un ...
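The four standardized variants mentioned above are simply each of the two emoji followed by one of the two variation selectors. The sketch below builds those sequences in Python; the helper name `with_presentation` is hypothetical, and whether a renderer honors the selector depends on the font and platform.

```python
VS15 = "\ufe0e"  # VARIATION SELECTOR-15: request text presentation
VS16 = "\ufe0f"  # VARIATION SELECTOR-16: request emoji presentation

# The two emoji in the Letterlike Symbols block, as listed in the entry.
LETTERLIKE_EMOJI = ["\u2122", "\u2139"]


def with_presentation(ch: str, emoji: bool) -> str:
    """Hypothetical helper: append the selector for the requested presentation."""
    return ch + (VS16 if emoji else VS15)


# Two characters times two selectors gives the four variation sequences.
variants = [with_presentation(ch, emoji) for ch in LETTERLIKE_EMOJI for emoji in (False, True)]
for seq in variants:
    print(" ".join(f"U+{ord(c):04X}" for c in seq))
```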
Superscripts And Subscripts
Superscripts and Subscripts is a Unicode block containing superscript and subscript numerals, mathematical operators, and letters used in mathematics and phonetics. The use of subscripts and superscripts in Unicode allows any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX. Other superscript letters can be found in the Spacing Modifier Letters, Phonetic Extensions and Phonetic Extensions Supplement blocks, while the superscript 1, 2, and 3, inherited from ISO 8859-1, were included in the Latin-1 Supplement block. ISO/IEC 8859-1:1998, "Information technology – 8-bit single-byte coded graphic character sets – Part 1: Latin alphabet No. 1", is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 19 ... The following Unicode-related documents record the purpose and process of defining specific characters in the Supers ...
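Writing a polynomial or chemical formula in plain text amounts to mapping ordinary digits to their superscript or subscript code points. The sketch below shows one way to do this in Python; the function names are hypothetical. It reflects the split described in the entry: superscript 1, 2, and 3 live in the Latin-1 Supplement block, while the remaining superscript digits and all subscript digits are in Superscripts and Subscripts.

```python
# Superscript digits: 1, 2, 3 come from Latin-1 Supplement, the rest from U+2070-2079.
SUPERSCRIPT_DIGITS = {
    "0": "\u2070", "1": "\u00b9", "2": "\u00b2", "3": "\u00b3", "4": "\u2074",
    "5": "\u2075", "6": "\u2076", "7": "\u2077", "8": "\u2078", "9": "\u2079",
}
# Subscript digits are contiguous at U+2080-2089.
SUBSCRIPT_DIGITS = {str(d): chr(0x2080 + d) for d in range(10)}


def superscript(n: int) -> str:
    """Render a non-negative integer with superscript digits."""
    return "".join(SUPERSCRIPT_DIGITS[d] for d in str(n))


def subscript(n: int) -> str:
    """Render a non-negative integer with subscript digits."""
    return "".join(SUBSCRIPT_DIGITS[d] for d in str(n))


polynomial = f"x{superscript(12)} + 3x{superscript(2)} + 1"
water = f"H{subscript(2)}O"
print(polynomial)
print(water)
```

A lookup table is used for the superscripts because their code points are not contiguous, whereas the subscripts can be computed from a base offset.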
Latin Extended Additional
Latin Extended Additional is a Unicode block. The characters in this block are mostly precomposed combinations of Latin letters with one or more general diacritical marks. Ninety of the characters are used in the Vietnamese alphabet. There are also a few medievalist characters. Unicode-related documents record the purpose and process of defining specific characters in the Latin Extended Additional block. See also: Vietnamese language and computers. The Vietnamese alphabet is the modern writing script for the Vietnamese language. It uses the Latin script based on Romance languages like French, originally developed by Francisco de Pina (1585–1625), a missionary from P ...
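A letter carrying more than one diacritical mark decomposes into a base letter followed by several combining marks. The sketch below illustrates this with the Vietnamese letter ệ (U+1EC7), chosen here as an example; it relies only on Python's `unicodedata` module.

```python
import unicodedata

letter = "\u1ec7"  # LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOW

# Canonical decomposition splits the letter into base + combining marks.
decomposed = unicodedata.normalize("NFD", letter)
base, marks = decomposed[0], decomposed[1:]

print(unicodedata.name(letter))
print("base:", base)
for mark in marks:
    # combining() returns the canonical combining class used to order the marks.
    print(f"U+{ord(mark):04X}", unicodedata.name(mark), unicodedata.combining(mark))

# Recomposition restores the single precomposed code point.
recomposed = unicodedata.normalize("NFC", decomposed)
print(recomposed == letter)
```

The dot below is ordered before the circumflex in the decomposed form because marks are sorted by combining class, not by the order in which they appear in the character name.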
Phonetic Extensions Supplement
Phonetic Extensions Supplement is a Unicode block containing characters for specialized and deprecated forms of the International Phonetic Alphabet. Unicode-related documents record the purpose and process of defining specific characters in the Phonetic Extensions Supplement block. See also: Greek alphabet in Unicode; Latin script in Unicode. The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...