Latin Extended
   HOME

TheInfoList



OR:

Over a thousand characters from the Latin script are encoded in the
Unicode Standard Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, whic ...
, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages (including
click Click, Klick and Klik may refer to: Airlines * Click Airways, a UAE airline * Clickair, a Spanish airline * MexicanaClick, a Mexican airline Art, entertainment, and media Fictional characters * Klick (fictional species), an alien race in the g ...
symbols in Latin Extended-B) and the Vietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions for Uighur and the
Claudian letters The Claudian letters were developed by the Roman emperor Claudius (reigned 41–54). He introduced three new letters to the Latin alphabet: *Ↄ or ↃϹ/X (''antisigma'') to replace BS and PS, much as X stood in for CS and GS. The shape o ...
. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (
Teuthonista Teuthonista is a phonetic transcription system used predominantly for the transcription of (High) German dialects. It is very similar to other Central European transcription systems from the early 20th century. The base characters are mostly bas ...
). Latin Extended-F and -G contain characters for phonetic transcription.


Blocks

As of version 15.0 of the Unicode Standard, 1,481 characters in the following 19 blocks are classified as belonging to the Latin script. * Basic Latin, 0000–007F. This block corresponds to ASCII. * Latin-1 Supplement, 0080–00FF * Latin Extended-A, 0100–017F *
Latin Extended-B Latin Extended-B is the fourth block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points 0180-01FF and contained 113 characters. During unification with ISO 10646 for version ...
, 0180–024F * IPA Extensions, 0250–02AF * Spacing Modifier Letters, 02B0–02FF * Phonetic Extensions, 1D00–1D7F * Phonetic Extensions Supplement, 1D80–1DBF * Latin Extended Additional, 1E00–1EFF * Superscripts and Subscripts, 2070–209F * Letterlike Symbols, 2100–214F *
Number Forms Number Forms is a Unicode block containing Unicode compatibility characters that have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and Roman numerals. In addition to the cha ...
, 2150–218F * Latin Extended-C, 2C60–2C7F * Latin Extended-D, A720–A7FF * Latin Extended-E, AB30–AB6F * Alphabetic Presentation Forms (Latin ligatures) FB00–FB4F *
Halfwidth and Fullwidth Forms In CJK (Chinese, Japanese and Korean) computing, graphic characters are traditionally classed into fullwidth (in Taiwan and Hong Kong: 全形; in CJK: 全角) and halfwidth (in Taiwan and Hong Kong: 半形; in CJK: 半角) characters. Unlike ...
, FF00–FFEF * Latin Extended-F, 10780–107BF * Latin Extended-G, 1DF00–1DFFF In addition, a number of Latin-like characters are encoded in the Currency Symbols, Control Pictures, CJK Compatibility, Enclosed Alphanumerics,
Enclosed CJK Letters and Months Enclosed CJK Letters and Months is a Unicode block containing circled and parenthesized Katakana, Hangul, and CJK ideographs. Also included in the block are miscellaneous glyphs that would more likely fit in CJK Compatibility or Enclosed Alpha ...
, Mathematical Alphanumeric Symbols, and
Enclosed Alphanumeric Supplement Enclosed Alphanumeric Supplement is a Unicode block consisting of Latin alphabet characters and Arabic numerals enclosed in circles, ovals or boxes, used for a variety of purposes. It is encoded in the range U+1F100–U+1F1FF in the Supplem ...
blocks, but, although they are Latin letters graphically, they have the script property ''
common Common may refer to: Places * Common, a townland in County Tyrone, Northern Ireland * Boston Common, a central public park in Boston, Massachusetts * Cambridge Common, common land area in Cambridge, Massachusetts * Clapham Common, originally com ...
'', and, so, do not belong to the Latin script in Unicode terms. Lisu also consists almost entirely of Latin forms, but uses its own script property.


Table of characters

In this table those characters with the Unicode script property of Latin are highlighted in colour, indicating the version of Unicode they were introduced in. Reserved code points (which may be assigned as characters at a future date) have a grey background. All characters that do not belong to the Latin script have a white background (and the version of Unicode they were introduced in is therefore not indicated).


See also

* Universal Character Set characters * Letterlike Symbols (Unicode block) *
List of Latin-script letters This is a list of letters of the Latin script. The definition of a Latin-script letter for this list is a character encoded in the Unicode Standard that has a script property of 'Latin' and the general category of 'Letter'. An overview of the ...
* List of Latin letters by shape * Mathematical Alphanumeric Symbols * European Latin Unicode subset (DIN 91379)


References

{{DEFAULTSORT:Latin Characters in Unicode Unicode *