The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second
Unicode block
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the ad ...
in the
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
standard. It encodes the upper range of
ISO 8859-1: 80 (U+0080) - FF (U+00FF).
C1 Controls (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the
C1 controls,
Latin-1
ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1 ...
punctuation
Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
and
symbols
A symbol is a mark, sign, or word that indicates, signifies, or is understood as representing an idea, object, or relationship. Symbols allow people to go beyond what is known or seen by creating linkages between otherwise very different conc ...
, 30 pairs of
majuscule
Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
and
minuscule
Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
accented
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
Latin characters
The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern Italy ...
and 2 mathematical operators.
The C1 controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the
Unicode Standard
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, whic ...
. Its block name in Unicode 1.0 was simply Latin1.
Character table
Subheadings
The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical operator(s).
C1 controls
The C1 controls subheading contains 32 supplementary control codes inherited from
ISO/IEC 8859-1
ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in ...
and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from
ISO/IEC 6429:1992.
Latin-1 punctuation and symbols
The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as inverted exclamation and question marks, and a middle dot; and symbols like currency signs, spacing diacritic marks, vulgar fraction, and superscript numbers.
Letters
The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.
Mathematical operator
The Mathematical operator subheading is used for the multiplication and division signs.
Number of symbols, letters and control codes
The table below shows the number of each letters, symbols and control codes in each subheadings in the C1 Controls and Latin-1 Supplement block.
Compact table
Emoji
The Latin-1 Supplement block contains two
emoji
An emoji ( ; plural emoji or emojis) is a pictogram, logogram, ideogram or smiley embedded in text and used in electronic messages and web pages. The primary function of emoji is to fill in emotional cues otherwise missing from typed conversat ...
:
U+00A9 and U+00AE.
The block has four
standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the
two emoji, both of which default to a text presentation.
History
The following Unicode-related documents record the purpose and process of defining specific characters in the Latin-1 Supplement block:
See also
*
Phonetic symbols in Unicode
Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived from an existing script, usually Latin, Greek or Cyrillic. A ...
References
{{Unicode navigation
Latin-script Unicode blocks
Unicode blocks