As of Unicode version 15.0, there are 149,186 characters with code points, covering 161 modern and historical scripts, as well as multiple symbol sets. This article includes the 1062 characters in the Multilingual European Character Set 2 (MES-2) subset, and some additional related characters.
Character reference overview
HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should not be used. A ''numeric character reference'' refers to a character by its Universal Character Set/Unicode ''code point'', and a ''character entity reference'' refers to a character by a predefined name.
A ''numeric character reference'' uses the format
: &#''nnnn'';
or
: &#x''hhhh'';
where ''nnnn'' is the code point in decimal form, and ''hhhh'' is the code point in hexadecimal form. The ''x'' must be lowercase in XML documents. The ''nnnn'' or ''hhhh'' may be any number of digits and may include leading zeros. The ''hhhh'' may mix uppercase and lowercase, though uppercase is the usual style.
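The two numeric forms can be illustrated with a short Python sketch. The helper name `numeric_reference` is hypothetical and chosen only for this example; `html.unescape` is the standard-library decoder and follows HTML parsing rules, so it is used here only to show that the variants are equivalent.

```python
import html


def numeric_reference(char: str, hexadecimal: bool = False) -> str:
    """Return a numeric character reference for a single character (illustrative helper)."""
    code_point = ord(char)
    if hexadecimal:
        # A lowercase "x" is accepted by both HTML and XML.
        return f"&#x{code_point:X};"
    return f"&#{code_point};"


print(numeric_reference("©"))                    # &#169;
print(numeric_reference("©", hexadecimal=True))  # &#xA9;

# Leading zeros and mixed-case hex digits all decode to the same character.
for reference in ("&#169;", "&#0169;", "&#xA9;", "&#xa9;", "&#x00A9;"):
    print(reference, "->", html.unescape(reference))
```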
In contrast, a ''character entity reference'' refers to a character by the name of an ''entity'' which has the desired character as its ''replacement text''. The entity must either be predefined (built into the markup language) or explicitly declared in a Document Type Definition (DTD). The format is the same as for any entity reference:
: &''name'';
where ''name'' is the case-sensitive name of the entity. The semicolon is required.
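As a small illustration, Python's standard library ships a table of the entity names predefined by HTML 4 (`html.entities.name2codepoint`). The sketch below looks up two names and shows that case matters; it says nothing about entities declared in a custom DTD, which this table does not cover.

```python
import html
from html.entities import name2codepoint

# name2codepoint maps HTML 4 entity names (without "&" and ";") to code points.
copy_code_point = name2codepoint["copy"]
print(copy_code_point, chr(copy_code_point))  # 169 ©

# Names are case-sensitive: "Agrave" and "agrave" are different entities.
print(name2codepoint["Agrave"], name2codepoint["agrave"])  # 192 224
print(html.unescape("&Agrave; &agrave;"))                  # À à
```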
Because numbers are harder for humans to remember than names, character entity references are most often written by humans, while numeric character references are most often produced by computer programs.
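One way a program might produce numeric character references is Python's built-in `xmlcharrefreplace` error handler, which substitutes a decimal reference for every character the target encoding cannot represent. This is a sketch of one approach, not a description of how any particular tool works; the function name `to_ascii_with_references` is made up for the example.

```python
def to_ascii_with_references(text: str) -> str:
    """Replace every non-ASCII character with a decimal numeric character reference."""
    # The "xmlcharrefreplace" handler emits &#nnnn; for unencodable characters.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


print(to_ascii_with_references("Crème brûlée costs £5"))
# Cr&#232;me br&#251;l&#233;e costs &#163;5
```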
Control codes
65 characters, including DEL. All belong to the common script.
Footnotes:
: 1 Control-C has typically been used as a "break" or "interrupt" key.
: 2 Control-D has been used to signal "end of file" for text typed in at the terminal on Unix / Linux systems. Windows, DOS, and older minicomputers used Control-Z for this purpose.
: 3 Control-G is an artifact of the days when teletypes were in use. Important messages could be signalled by striking the bell on the teletype. This was carried over on PCs by generating a buzz sound.
: 4 Line feed is used for "end of line" in text files on Unix / Linux systems.
: 5 Carriage Return (accompanied by line feed) is used as the "end of line" character by Windows, DOS, and most minicomputers other than Unix- / Linux-based systems.
: 6 Control-O has been the "discard output" key on minicomputers. Output is not sent to the terminal, but discarded, until another Control-O is typed.
: 7 Control-Q has been used to tell a host computer to resume sending output after it was stopped by Control-S.
: 8 Control-S has been used to tell a host computer to postpone sending output to the terminal. Output is suspended until restarted by the Control-Q key.
: 9 Control-U was originally used by Digital Equipment Corporation computers to cancel a line of typed-in text. Other manufacturers used Control-X for this purpose.
: 10 Control-X was commonly used to cancel a line of input typed in at the terminal.
: 11 Control-Z has commonly been used on minicomputers, Windows and DOS systems to indicate "end of file" either on a terminal or in a text file. Unix / Linux systems use Control-D to indicate end-of-file at a terminal.
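The count of 65 control characters can be checked against Python's `unicodedata` module, whose data reflects whichever Unicode version the interpreter was built with. The second part of the sketch relies on an assumption not stated above: the common terminal convention that Control plus a letter sends the letter's code point with the upper bits masked off. The helper name `control_key` is hypothetical.

```python
import unicodedata

# General category "Cc" covers the control characters:
# U+0000..U+001F, U+007F (DEL) and U+0080..U+009F.
control_chars = [
    chr(cp) for cp in range(0x110000) if unicodedata.category(chr(cp)) == "Cc"
]
print(len(control_chars))  # 65


def control_key(letter: str) -> int:
    """Code point conventionally sent by Control plus a letter (assumed convention)."""
    return ord(letter.upper()) & 0x1F


for letter in "CDGZ":
    print(f"Control-{letter} -> U+{control_key(letter):04X}")
```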
Latin script
The Unicode Standard (version 15.0) classifies 1,481 characters as belonging to the Latin script.
Basic Latin
95 characters; the 52 alphabet characters belong to the Latin script. The remaining 43 belong to the common script.
The 33 characters classified as ASCII Punctuation & Symbols are also sometimes referred to as ASCII special characters. See § Latin-1 Supplement and § Unicode symbols for additional "special characters". Certain special characters can be used in passwords; some organizations require their use. See the List of Special Characters for Passwords.
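The counts above, and the Code, Decimal and Octal columns of the Basic Latin table, can be reproduced with a few lines of Python. This is only a sketch: `table_row` is a made-up helper, and the octal value is written with a leading zero to match the style used in the table.

```python
def table_row(cp: int) -> tuple[str, str, int, str]:
    """Return (Code, Glyph, Decimal, Octal) for one code point (illustrative helper)."""
    return (f"U+{cp:04X}", chr(cp), cp, f"0{cp:o}")


basic_latin = range(0x20, 0x7F)  # U+0020 (space) through U+007E (tilde)
letters = [cp for cp in basic_latin if chr(cp).isalpha()]
digits = [cp for cp in basic_latin if chr(cp).isdigit()]
special = len(basic_latin) - len(letters) - len(digits)

print(len(basic_latin), len(letters), len(digits), special)  # 95 52 10 33
print(table_row(0x40))  # ('U+0040', '@', 64, '0100')
```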
{, class="wikitable sortable collapsible" id="Table_Basic_Latin"
!
!Code
!Glyph
!Decimal
!Octal
!Description
!#
, -
, align="center" rowspan=16 style="background: #dcffdc;", ASCII
Punctuation
& Symbols, , U+0020
, align="center",
, 32
, 040
,
Space
, 0001
, -
, U+0021
, align="center", !
, 33
, 041
,
Exclamation mark
, 0002
, -
, U+0022
, align="center", "
, 34
, 042
,
Quotation mark
, 0003
, -
, U+0023
, align="center", #
, 35
, 043
,
Number sign, Hash, Octothorpe, Sharp
, 0004
, -
, U+0024
, align="center", $
, 36
, 044
,
Dollar sign
, 0005
, -
, U+0025
, align="center", %
, 37
, 045
,
Percent sign
, 0006
, -
, U+0026
, align="center", &
, 38
, 046
,
Ampersand
, 0007
, -
, U+0027
, align="center", '
, 39
, 047
,
Apostrophe
, 0008
, -
, U+0028
, align="center", (
, 40
, 050
,
Left parenthesis
, 0009
, -
, U+0029
, align="center", )
, 41
, 051
,
Right parenthesis
, 0010
, -
, U+002A
, align="center", *
, 42
, 052
,
Asterisk
, 0011
, -
, U+002B
, align="center", +
, 43
, 053
,
Plus sign
, 0012
, -
, U+002C
, align="center", ,
, 44
, 054
,
Comma
, 0013
, -
, U+002D
, align="center", -
, 45
, 055
,
Hyphen-minus
, 0014
, -
, U+002E
, align="center", .
, 46
, 056
,
Full stop
, 0015
, -
, U+002F
, align="center", /
, 47
, 057
,
Slash (Solidus)
, 0016
, -
, align="center" rowspan=10 style="background: #eaeaff;", ASCII
Digits, , U+0030
, align="center", 0
, 48
, 060
,
Digit Zero
, 0017
, -
, U+0031
, align="center", 1
, 49
, 061
,
Digit One
, 0018
, -
, U+0032
, align="center", 2
, 50
, 062
,
Digit Two
, 0019
, -
, U+0033
, align="center", 3
, 51
, 063
,
Digit Three
, 0020
, -
, U+0034
, align="center", 4
, 52
, 064
,
Digit Four
, 0021
, -
, U+0035
, align="center", 5
, 53
, 065
,
Digit Five
, 0022
, -
, U+0036
, align="center", 6
, 54
, 066
,
Digit Six
, 0023
, -
, U+0037
, align="center", 7
, 55
, 067
,
Digit Seven
, 0024
, -
, U+0038
, align="center", 8
, 56
, 070
,
Digit Eight
, 0025
, -
, U+0039
, align="center", 9
, 57
, 071
,
Digit Nine
, 0026
, -
, align="center" rowspan=7 style="background: #dcffdc;", ASCII
Punctuation
& Symbols, , U+003A
, align="center", :
, 58
, 072
,
Colon
, 0027
, -
, U+003B
, align="center", ;
, 59
, 073
,
Semicolon
, 0028
, -
, U+003C
, align="center", <
, 60
, 074
,
Less-than sign
, 0029
, -
, U+003D
, align="center", =
, 61
, 075
,
Equal sign
, 0030
, -
, U+003E
, align="center", >
, 62
, 076
,
Greater-than sign
, 0031
, -
, U+003F
, align="center", ?
, 63
, 077
,
Question mark
, 0032
, -
, U+0040
, align="center", @
, 64
, 0100
,
At sign
, 0033
, -
, align="center" rowspan=26 style="background: #ffcccc;", Latin
Alphabet:
Uppercase, , U+0041
, align="center", A
, 65
, 0101
,
Latin Capital letter A
, 0034
, -
, U+0042
, align="center", B
, 66
, 0102
,
Latin Capital letter B
, 0035
, -
, U+0043
, align="center", C
, 67
, 0103
,
Latin Capital letter C
, 0036
, -
, U+0044
, align="center", D
, 68
, 0104
,
Latin Capital letter D
, 0037
, -
, U+0045
, align="center", E
, 69
, 0105
,
Latin Capital letter E
, 0038
, -
, U+0046
, align="center", F
, 70
, 0106
,
Latin Capital letter F
, 0039
, -
, U+0047
, align="center", G
, 71
, 0107
,
Latin Capital letter G
, 0040
, -
, U+0048
, align="center", H
, 72
, 0110
,
Latin Capital letter H
, 0041
, -
, U+0049
, align="center", I
, 73
, 0111
,
Latin Capital letter I
, 0042
, -
, U+004A
, align="center", J
, 74
, 0112
,
Latin Capital letter J
, 0043
, -
, U+004B
, align="center", K
, 75
, 0113
,
Latin Capital letter K
, 0044
, -
, U+004C
, align="center", L
, 76
, 0114
,
Latin Capital letter L
, 0045
, -
, U+004D
, align="center", M
, 77
, 0115
,
Latin Capital letter M
, 0046
, -
, U+004E
, align="center", N
, 78
, 0116
,
Latin Capital letter N
, 0047
, -
, U+004F
, align="center", O
, 79
, 0117
,
Latin Capital letter O
, 0048
, -
, U+0050
, align="center", P
, 80
, 0120
,
Latin Capital letter P
, 0049
, -
, U+0051
, align="center", Q
, 81
, 0121
,
Latin Capital letter Q
, 0050
, -
, U+0052
, align="center", R
, 82
, 0122
,
Latin Capital letter R
, 0051
, -
, U+0053
, align="center", S
, 83
, 0123
,
Latin Capital letter S
, 0052
, -
, U+0054
, align="center", T
, 84
, 0124
,
Latin Capital letter T
, 0053
, -
, U+0055
, align="center", U
, 85
, 0125
,
Latin Capital letter U
, 0054
, -
, U+0056
, align="center", V
, 86
, 0126
,
Latin Capital letter V
, 0055
, -
, U+0057
, align="center", W
, 87
, 0127
,
Latin Capital letter W
, 0056
, -
, U+0058
, align="center", X
, 88
, 0130
,
Latin Capital letter X
, 0057
, -
, U+0059
, align="center", Y
, 89
, 0131
,
Latin Capital letter Y
, 0058
, -
, U+005A
, align="center", Z
, 90
, 0132
,
Latin Capital letter Z
, 0059
, -
, align="center" rowspan=6 style="background: #dcffdc;", ASCII
Punctuation
& Symbols, , U+005B
, align="center", [
, 91
, 0133
,
Left Square Bracket
, 0060
, -
, U+005C
, align="center", \
, 92
, 0134
,
Backslash
, 0061
, -
, U+005D
, align="center", ]
, 93
, 0135
,
Right Square Bracket
, 0062
, -
, U+005E
, align="center", ^
, 94
, 0136
,
Circumflex accent
, 0063
, -
, U+005F
, align="center", _
, 95
, 0137
,
Low line
, 0064
, -
, U+0060
, align="center", `
, 96
, 0140
,
Grave accent
, 0065
, -
, align="center" rowspan=26 style="background: #ffebeb;", Latin
Alphabet:
Lowercase, , U+0061
, align="center", a
, 97
, 0141
, Latin Small Letter A
, 0066
, -
, U+0062
, align="center", b
, 98
, 0142
, Latin Small Letter B
, 0067
, -
, U+0063
, align="center", c
, 99
, 0143
, Latin Small Letter C
, 0068
, -
, U+0064
, align="center", d
, 100
, 0144
, Latin Small Letter D
, 0069
, -
, U+0065
, align="center", e
, 101
, 0145
, Latin Small Letter E
, 0070
, -
, U+0066
, align="center", f
, 102
, 0146
, Latin Small Letter F
, 0071
, -
, U+0067
, align="center", g
, 103
, 0147
, Latin Small Letter G
, 0072
, -
, U+0068
, align="center", h
, 104
, 0150
, Latin Small Letter H
, 0073
, -
, U+0069
, align="center", i
, 105
, 0151
, Latin Small Letter I
, 0074
, -
, U+006A
, align="center", j
, 106
, 0152
, Latin Small Letter J
, 0075
, -
, U+006B
, align="center", k
, 107
, 0153
, Latin Small Letter K
, 0076
, -
, U+006C
, align="center", l
, 108
, 0154
, Latin Small Letter L
, 0077
, -
, U+006D
, align="center", m
, 109
, 0155
, Latin Small Letter M
, 0078
, -
, U+006E
, align="center", n
, 110
, 0156
, Latin Small Letter N
, 0079
, -
, U+006F
, align="center", o
, 111
, 0157
, Latin Small Letter O
, 0080
, -
, U+0070
, align="center", p
, 112
, 0160
, Latin Small Letter P
, 0081
, -
, U+0071
, align="center", q
, 113
, 0161
, Latin Small Letter Q
, 0082
, -
, U+0072
, align="center", r
, 114
, 0162
, Latin Small Letter R
, 0083
, -
, U+0073
, align="center", s
, 115
, 0163
, Latin Small Letter S
, 0084
, -
, U+0074
, align="center", t
, 116
, 0164
, Latin Small Letter T
, 0085
, -
, U+0075
, align="center", u
, 117
, 0165
, Latin Small Letter U
, 0086
, -
, U+0076
, align="center", v
, 118
, 0166
, Latin Small Letter V
, 0087
, -
, U+0077
, align="center", w
, 119
, 0167
, Latin Small Letter W
, 0088
, -
, U+0078
, align="center", x
, 120
, 0170
, Latin Small Letter X
, 0089
, -
, U+0079
, align="center", y
, 121
, 0171
, Latin Small Letter Y
, 0090
, -
, U+007A
, align="center", z
, 122
, 0172
, Latin Small Letter Z
, 0091
, -
, align="center" rowspan=4 style="background: #dcffdc;", ASCII
Punctuation
& Symbols, , U+007B
, align="center", {
, 123
, 0173
,
Left Curly Bracket
, 0092
, -
, U+007C
, align="center", |
, 124
, 0174
,
Vertical bar
, 0093
, -
, U+007D
, align="center", }
, 125
, 0175
,
Right Curly Bracket
, 0094
, -
, U+007E
, align="center", ~
, 126
, 0176
,
Tilde
, 0095
, - class="nosort"
!
!Code
!Glyph
!Decimal
!Octal
!Description
!#
Latin-1 Supplement
96 characters; the 62 letters and two ordinal indicators belong to the Latin script. The remaining 32 belong to the common script.
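The two-part values in the Octal column of the Latin-1 Supplement table (for example, 0302 0240 for U+00A0) appear to be the character's UTF-8 bytes written in octal. Under that reading, the column can be reproduced with the sketch below; `utf8_octal` is a hypothetical helper name.

```python
def utf8_octal(cp: int) -> str:
    """Return the UTF-8 bytes of a code point as space-separated octal numbers."""
    return " ".join(f"0{byte:o}" for byte in chr(cp).encode("utf-8"))


print(utf8_octal(0x00A0))  # 0302 0240
print(utf8_octal(0x00C0))  # 0303 0200
```

Code points in this block need two bytes in UTF-8, which is why each entry has two octal numbers, whereas Basic Latin characters need only one.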
{, class="wikitable sortable collapsible" id="Table_Latin-1_Supplement"
!
!Code
!Glyph
!Decimal
!Octal
!HTML
!Description
! #
, -
, align="center" rowspan=32 style="background: #dcffdc;", Latin-1
Punctuation
& Symbols, , U+00A0
, align="center",
, 160
,
0302 0240
, &nbsp;
,
Non-breaking space
, 0096
, -
, U+00A1
, align="center", ¡
, 161
,
0302 0241
, &iexcl;
,
Inverted Exclamation Mark
, 0097
, -
, U+00A2
, align="center", ¢
, 162
,
0302 0242
, &cent;
,
Cent sign
, 0098
, -
, U+00A3
, align="center", £
, 163
,
0302 0243
, &pound;
,
Pound sign
, 0099
, -
, U+00A4
, align="center", ¤
, 164
,
0302 0244
, &curren;
,
Currency sign
, 0100
, -
, U+00A5
, align="center", ¥
, 165
,
0302 0245
, &yen;
,
Yen sign
, 0101
, -
, U+00A6
, align="center", ¦
, 166
,
0302 0246
, &brvbar;
,
Broken bar
, 0102
, -
, U+00A7
, align="center", §
, 167
,
0302 0247
, &sect;
,
Section sign
, 0103
, -
, U+00A8
, align="center", ¨
, 168
,
0302 0250
, &uml;
,
Diaeresis (Umlaut)
, 0104
, -
, U+00A9
, align="center", ©
, 169
,
0302 0251
, &copy;
,
Copyright sign
, 0105
, -
, U+00AA
, align="center", ª
, 170
,
0302 0252
, &ordf;
, style="background: #ffebeb;",
Feminine Ordinal Indicator
, 0106
, -
, U+00AB
, align="center", «
, 171
,
0302 0253
, &laquo;
,
Left-pointing double angle quotation mark
, 0107
, -
, U+00AC
, align="center", ¬
, 172
,
0302 0254
, &not;
,
Not sign
, 0108
, -
, U+00AD
, align="center",
, 173
,
0302 0255
, &shy;
,
Soft hyphen
, 0109
, -
, U+00AE
, align="center", ®
, 174
,
0302 0256
, &reg;
,
Registered sign
, 0110
, -
, U+00AF
, align="center", ¯
, 175
,
0302 0257
, &macr;
,
Macron
, 0111
, -
, U+00B0
, align="center", °
, 176
,
0302 0260
, &deg;
,
Degree sign
, 0112
, -
, U+00B1
, align="center", ±
, 177
,
0302 0261
, &plusmn;
,
Plus–minus sign
, 0113
, -
, U+00B2
, align="center", ²
, 178
,
0302 0262
, &sup2;
,
Superscript two
, 0114
, -
, U+00B3
, align="center", ³
, 179
,
0302 0263
, &sup3;
,
Superscript three
, 0115
, -
, U+00B4
, align="center", ´
, 180
,
0302 0264
, &acute;
,
Acute accent
, 0116
, -
, U+00B5
, align="center", µ
, 181
,
0302 0265
, &micro;
,
Micro sign
, 0117
, -
, U+00B6
, align="center", ¶
, 182
,
0302 0266
, &para;
,
Pilcrow sign
, 0118
, -
, U+00B7
, align="center", ·
, 183
,
0302 0267
, &middot;
,
Middle dot
, 0119
, -
, U+00B8
, align="center", ¸
, 184
,
0302 0270
, &cedil;
,
Cedilla
, 0120
, -
, U+00B9
, align="center", ¹
, 185
,
0302 0271
, &sup1;
,
Superscript one
, 0121
, -
, U+00BA
, align="center", º
, 186
,
0302 0272
, &ordm;
, style="background: #ffebeb;",
Masculine ordinal indicator
, 0122
, -
, U+00BB
, align="center", »
, 187
,
0302 0273
, &raquo;
,
Right-pointing double angle quotation mark
, 0123
, -
, U+00BC
, align="center", ¼
, 188
,
0302 0274
, &frac14;
, Vulgar fraction one quarter
, 0124
, -
, U+00BD
, align="center", ½
, 189
,
0302 0275
, &frac12;
, Vulgar fraction one half
, 0125
, -
, U+00BE
, align="center", ¾
, 190
,
0302 0276
, &frac34;
, Vulgar fraction three quarters
, 0126
, -
, U+00BF
, align="center", ¿
, 191
,
0302 0277
, &iquest;
,
Inverted Question Mark
, 0127
, -
, align="center" rowspan=23 style="background: #ffcccc;", Letters:
Uppercase
, U+00C0
, align="center", À
, 192
,
0303 0200
, À
,
Latin Capital Letter A with grave
, 0128
, -
, U+00C1
, align="center", Á
, 193
,
0303 0201
, Á
,
Latin Capital Letter A with acute
, 0129
, -
, U+00C2
, align="center", Â
, 194
,
0303 0202
, Â
,
Latin Capital Letter A with circumflex
, 0130
, -
, U+00C3
, align="center", Ã
, 195
,
0303 0203
, Ã
,
Latin Capital Letter A with tilde
, 0131
, -
, U+00C4
, align="center", Ä
, 196
,
0303 0204
, Ä
,
Latin Capital Letter A with diaeresis
, 0132
, -
, U+00C5
, align="center", Å
, 197
,
0303 0205
, Å
,
Latin Capital Letter A with ring above
, 0133
, -
, U+00C6
, align="center", Æ
, 198
,
0303 0206
, Æ
,
Latin Capital Letter Æ
, 0134
, -
, U+00C7
, align="center", Ç
, 199
,
0303 0207
, Ç
,
Latin Capital Letter C with cedilla
, 0135
, -
, U+00C8
, align="center", È
, 200
,
0303 0210
, È
,
Latin Capital Letter E with grave
, 0136
, -
, U+00C9
, align="center", É
, 201
,
0303 0211
, É
,
Latin Capital Letter E with acute
, 0137
, -
, U+00CA
, align="center", Ê
, 202
,
0303 0212
, Ê
,
Latin Capital Letter E with circumflex
, 0138
, -
, U+00CB
, align="center", Ë
, 203
,
0303 0213
, Ë
,
Latin Capital Letter E with diaeresis
, 0139
, -
, U+00CC
, align="center", Ì
, 204
,
0303 0214
, Ì
,
Latin Capital Letter I with grave
, 0140
, -
, U+00CD
, align="center", Í
, 205
,
0303 0215
, Í
,
Latin Capital Letter I with acute
, 0141
, -
, U+00CE
, align="center", Î
, 206
,
0303 0216
, Î
,
Latin Capital Letter I with circumflex
, 0142
, -
, U+00CF
, align="center", Ï
, 207
,
0303 0217
, Ï
,
Latin Capital Letter I with diaeresis
, 0143
, -
, U+00D0
, align="center", Ð
, 208
,
0303 0220
, Ð
,
Latin Capital Letter Eth
, 0144
, -
, U+00D1
, align="center", Ñ
, 209
,
0303 0221
, Ñ
,
Latin Capital Letter N with tilde
, 0145
, -
, U+00D2
, align="center", Ò
, 210
,
0303 0222
, Ò
,
Latin Capital Letter O with grave
, 0146
, -
, U+00D3
, align="center", Ó
, 211
,
0303 0223
, Ó
,
Latin Capital Letter O with acute
, 0147
, -
, U+00D4
, align="center", Ô
, 212
,
0303 0224
, Ô
,
Latin Capital Letter O with circumflex
, 0148
, -
, U+00D5
, align="center", Õ
, 213
,
0303 0225
, Õ
,
Latin Capital Letter O with tilde
, 0149
, -
, U+00D6
, align="center", Ö
, 214
,
0303 0226
, Ö
,
Latin Capital Letter O with diaeresis
, 0150
, -
, align="center" rowspan=1 style="background: #eaeaff;", Math, , U+00D7
, align="center", ×
, 215
,
0303 0227
, ×
,
Multiplication sign
, 0151
, -
, align="center" rowspan=7 style="background: #ffcccc;", Letters:
Uppercase
, U+00D8
, align="center", Ø
, 216
,
0303 0230
, Ø
,
Latin Capital Letter O with stroke
, 0152
, -
, U+00D9
, align="center", Ù
, 217
,
0303 0231
, Ù
,
Latin Capital Letter U with grave
, 0153
, -
, U+00DA
, align="center", Ú
, 218
,
0303 0232
, Ú
,
Latin Capital Letter U with acute
, 0154
, -
, U+00DB
, align="center", Û
, 219
,
0303 0233
, Û
,
Latin Capital Letter U with circumflex
, 0155
, -
, U+00DC
, align="center", Ü
, 220
,
0303 0234
, Ü
,
Latin Capital Letter U with diaeresis
, 0156
, -
, U+00DD
, align="center", Ý
, 221
,
0303 0235
, Ý
,
Latin Capital Letter Y with acute
, 0157
, -
, U+00DE
, align="center", Þ
, 222
,
0303 0236
, Þ
,
Latin Capital Letter Thorn
, 0158
, -
, align="center" rowspan=24 style="background: #ffebeb;", Letters:
Lowercase
, U+00DF
, align="center", ß
, 223
,
0303 0237
, ß
,
Latin Small Letter sharp S
, 0159
, -
, U+00E0
, align="center", à
, 224
,
0303 0240
, à
, Latin Small Letter A with grave
, 0160
, -
, U+00E1
, align="center", á
, 225
,
0303 0241
, á
, Latin Small Letter A with acute
, 0161
, -
, U+00E2
, align="center", â
, 226
,
0303 0242
, â
, Latin Small Letter A with circumflex
, 0162
, -
, U+00E3
, align="center", ã
, 227
,
0303 0243
, ã
, Latin Small Letter A with tilde
, 0163
, -
, U+00E4
, align="center", ä
, 228
,
0303 0244
, ä
, Latin Small Letter A with diaeresis
, 0164
, -
, U+00E5
, align="center", å
, 229
,
0303 0245
, å
, Latin Small Letter A with ring above
, 0165
, -
, U+00E6
, align="center", æ
, 230
,
0303 0246
, æ
, Latin Small Letter Æ
, 0166
, -
, U+00E7
, align="center", ç
, 231
,
0303 0247
, ç
, Latin Small Letter C with cedilla
, 0167
, -
, U+00E8
, align="center", è
, 232
,
0303 0250
, è
, Latin Small Letter E with grave
, 0168
, -
, U+00E9
, align="center", é
, 233
,
0303 0251
, é
, Latin Small Letter E with acute
, 0169
, -
, U+00EA
, align="center", ê
, 234
,
0303 0252
, ê
, Latin Small Letter E with circumflex
, 0170
, -
, U+00EB
, align="center", ë
, 235
,
0303 0253
, ë
, Latin Small Letter E with diaeresis
, 0171
, -
, U+00EC
, align="center", ì
, 236
,
0303 0254
, ì
, Latin Small Letter I with grave
, 0172
, -
, U+00ED
, align="center", í
, 237
,
0303 0255
, í
, Latin Small Letter I with acute
, 0173
, -
, U+00EE
, align="center", î
, 238
,
0303 0256
, î
, Latin Small Letter I with circumflex
, 0174
, -
, U+00EF
, align="center", ï
, 239
,
0303 0257
, ï
, Latin Small Letter I with diaeresis
, 0175
, -
, U+00F0
, align="center", ð
, 240
,
0303 0260
, ð
, Latin Small Letter Eth
, 0176
, -
, U+00F1
, align="center", ñ
, 241
,
0303 0261
, ñ
, Latin Small Letter N with tilde
, 0177
, -
, U+00F2
, align="center", ò
, 242
,
0303 0262
, ò
, Latin Small Letter O with grave
, 0178
, -
, U+00F3
, align="center", ó
, 243
,
0303 0263
, ó
, Latin Small Letter O with acute
, 0179
, -
, U+00F4
, align="center", ô
, 244
,
0303 0264
, ô
, Latin Small Letter O with circumflex
, 0180
, -
, U+00F5
, align="center", õ
, 245
,
0303 0265
, õ
, Latin Small Letter O with tilde
, 0181
, -
, U+00F6
, align="center", ö
, 246
,
0303 0266
, ö
, Latin Small Letter O with diaeresis
, 0182
, -
, align="center" rowspan=1 style="background: #eaeaff;", Math, , U+00F7
, align="center", ÷
, 247
,
0303 0267
, ÷
,
Division sign
, 0183
, -
, align="center" rowspan=8 style="background: #ffebeb;", Letters:
Lowercase
, U+00F8
, align="center", ø
, 248
,
0303 0270
, ø
, Latin Small Letter O with stroke
, 0184
, -
, U+00F9
, align="center", ù
, 249
,
0303 0271
, ù
, Latin Small Letter U with grave
, 0185
, -
, U+00FA
, align="center", ú
, 250
,
0303 0272
, ú
, Latin Small Letter U with acute
, 0186
, -
, U+00FB
, align="center", û
, 251
,
0303 0273
, û
, Latin Small Letter U with circumflex
, 0187
, -
, U+00FC
, align="center", ü
, 252
,
0303 0274
, ü
, Latin Small Letter U with diaeresis
, 0188
, -
, U+00FD
, align="center", ý
, 253
,
0303 0275
, ý
, Latin Small Letter Y with acute
, 0189
, -
, U+00FE
, align="center", þ
, 254
,
0303 0276
, þ
, Latin Small Letter Thorn
, 0190
, -
, U+00FF
, align="center", ÿ
, 255
,
0303 0277
, ÿ
, Latin Small Letter Y with diaeresis
, 0191
, - class="nosort"
!
!Code
!Glyph
!Decimal
!Octal
!HTML
!Description
! #
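The Decimal, Octal, and HTML columns of the table above can be derived from the code point alone. The following is a minimal sketch, assuming that the Octal column holds the UTF-8 bytes of the character written in octal (this matches rows such as U+00B6, listed as 0302 0266) and that the HTML column holds the named character reference. The helper name `table_row` is hypothetical, and the lookup relies on Python's standard `html.entities.codepoint2name` table, which covers only the HTML 4.01 entity names.

```python
import html.entities


def table_row(code_point: int) -> dict:
    """Derive the table columns for one code point (illustrative helper)."""
    glyph = chr(code_point)
    # Assumption: the Octal column is the UTF-8 encoding, one octal number per byte.
    octal = " ".join(f"0{byte:03o}" for byte in glyph.encode("utf-8"))
    # codepoint2name maps a code point to its HTML 4.01 entity name, if one exists.
    name = html.entities.codepoint2name.get(code_point)
    return {
        "code": f"U+{code_point:04X}",
        "glyph": glyph,
        "decimal": code_point,
        "octal": octal,
        "html": f"&{name};" if name else "",
    }


if __name__ == "__main__":
    for cp in (0x00B6, 0x00D7, 0x00FF):
        print(table_row(cp))
```

Characters without an HTML 4.01 name come back with an empty `html` field, which mirrors the blank cells in the tables.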
Latin Extended-A
128 characters; all belong to the Latin script.
{, class="wikitable sortable collapsible" id="Table_Latin_Extended-A"
!
!Code
!Glyph
!Decimal
!HTML
!Description
! #
, -
, align="center" rowspan=73 style="background: #ffcccc;", European
Latin, , U+0100
, align="center", Ā
, 256
, Ā
,
Latin Capital Letter A with macron
, 0192
, -
, U+0101
, align="center", ā
, 257
, ā
, Latin Small Letter A with macron
, 0193
, -
, U+0102
, align="center", Ă
, 258
, Ă
,
Latin Capital Letter A with breve
, 0194
, -
, U+0103
, align="center", ă
, 259
, ă
, Latin Small Letter A with breve
, 0195
, -
, U+0104
, align="center", Ą
, 260
, Ą
,
Latin Capital Letter A with ogonek
, 0196
, -
, U+0105
, align="center", ą
, 261
, ą
, Latin Small Letter A with ogonek
, 0197
, -
, U+0106
, align="center", Ć
, 262
, Ć
,
Latin Capital Letter C with acute
, 0198
, -
, U+0107
, align="center", ć
, 263
, ć
, Latin Small Letter C with acute
, 0199
, -
, U+0108
, align="center", Ĉ
, 264
, Ĉ
,
Latin Capital Letter C with circumflex
, 0200
, -
, U+0109
, align="center", ĉ
, 265
, ĉ
, Latin Small Letter C with circumflex
, 0201
, -
, U+010A
, align="center", Ċ
, 266
, Ċ
,
Latin Capital Letter C with dot above
, 0202
, -
, U+010B
, align="center", ċ
, 267
, ċ
, Latin Small Letter C with dot above
, 0203
, -
, U+010C
, align="center", Č
, 268
, Č
,
Latin Capital Letter C with caron
, 0204
, -
, U+010D
, align="center", č
, 269
, č
, Latin Small Letter C with caron
, 0205
, -
, U+010E
, align="center", Ď
, 270
, Ď
,
Latin Capital Letter D with caron
, 0206
, -
, U+010F
, align="center", ď
, 271
, ď
, Latin Small Letter D with caron
, 0207
, -
, U+0110
, align="center", Đ
, 272
, Đ
,
Latin Capital Letter D with stroke
, 0208
, -
, U+0111
, align="center", đ
, 273
, đ
, Latin Small Letter D with stroke
, 0209
, -
, U+0112
, align="center", Ē
, 274
, Ē
, Latin Capital Letter E with macron
, 0210
, -
, U+0113
, align="center", ē
, 275
, ē
, Latin Small Letter E with macron
, 0211
, -
, U+0114
, align="center", Ĕ
, 276
,
, Latin Capital Letter E with breve
, 0212
, -
, U+0115
, align="center", ĕ
, 277
,
, Latin Small Letter E with breve
, 0213
, -
, U+0116
, align="center", Ė
, 278
, Ė
,
Latin Capital Letter E with dot above
, 0214
, -
, U+0117
, align="center", ė
, 279
, ė
, Latin Small Letter E with dot above
, 0215
, -
, U+0118
, align="center", Ę
, 280
, Ę
,
Latin Capital Letter E with ogonek
, 0216
, -
, U+0119
, align="center", ę
, 281
, ę
, Latin Small Letter E with ogonek
, 0217
, -
, U+011A
, align="center", Ě
, 282
, Ě
,
Latin Capital Letter E with caron
, 0218
, -
, U+011B
, align="center", ě
, 283
, ě
, Latin Small Letter E with caron
, 0219
, -
, U+011C
, align="center", Ĝ
, 284
, Ĝ
,
Latin Capital Letter G with circumflex
, 0220
, -
, U+011D
, align="center", ĝ
, 285
, ĝ
, Latin Small Letter G with circumflex
, 0221
, -
, U+011E
, align="center", Ğ
, 286
, Ğ
,
Latin Capital Letter G with breve
, 0222
, -
, U+011F
, align="center", ğ
, 287
, ğ
, Latin Small Letter G with breve
, 0223
, -
, U+0120
, align="center", Ġ
, 288
, Ġ
,
Latin Capital Letter G with dot above
, 0224
, -
, U+0121
, align="center", ġ
, 289
, ġ
, Latin Small Letter G with dot above
, 0225
, -
, U+0122
, align="center", Ģ
, 290
, Ģ
,
Latin Capital Letter G with cedilla
, 0226
, -
, U+0123
, align="center", ģ
, 291
,
, Latin Small Letter G with cedilla
, 0227
, -
, U+0124
, align="center", Ĥ
, 292
, Ĥ
,
Latin Capital Letter H with circumflex
, 0228
, -
, U+0125
, align="center", ĥ
, 293
, ĥ
, Latin Small Letter H with circumflex
, 0229
, -
, U+0126
, align="center", Ħ
, 294
, Ħ
,
Latin Capital Letter H with stroke
, 0230
, -
, U+0127
, align="center", ħ
, 295
, ħ
, Latin Small Letter H with stroke
, 0231
, -
, U+0128
, align="center", Ĩ
, 296
, Ĩ
, Latin Capital Letter I with tilde
, 0232
, -
, U+0129
, align="center", ĩ
, 297
, ĩ
, Latin Small Letter I with tilde
, 0233
, -
, U+012A
, align="center", Ī
, 298
, Ī
, Latin Capital Letter I with macron
, 0234
, -
, U+012B
, align="center", ī
, 299
, ī
, Latin Small Letter I with macron
, 0235
, -
, U+012C
, align="center", Ĭ
, 300
,
, Latin Capital Letter I with breve
, 0236
, -
, U+012D
, align="center", ĭ
, 301
,
, Latin Small Letter I with breve
, 0237
, -
, U+012E
, align="center", Į
, 302
, Į
, Latin Capital Letter I with ogonek
, 0238
, -
, U+012F
, align="center", į
, 303
, į
, Latin Small Letter I with ogonek
, 0239
, -
, U+0130
, align="center", İ
, 304
, İ
,
Latin Capital Letter I with dot above
, 0240
, -
, U+0131
, align="center", ı
, 305
, ı
,
Latin Small Letter dotless I
, 0241
, -
, U+0132
, align="center", IJ
, 306
, IJ
,
Latin Capital Ligature IJ
, 0242
, -
, U+0133
, align="center", ij
, 307
, ij
, Latin Small Ligature IJ
, 0243
, -
, U+0134
, align="center", Ĵ
, 308
, Ĵ
,
Latin Capital Letter J with circumflex
, 0244
, -
, U+0135
, align="center", ĵ
, 309
, ĵ
, Latin Small Letter J with circumflex
, 0245
, -
, U+0136
, align="center", Ķ
, 310
, Ķ
, Latin Capital Letter K with cedilla
, 0246
, -
, U+0137
, align="center", ķ
, 311
, ķ
, Latin Small Letter K with cedilla
, 0247
, -
, U+0138
, align="center", ĸ
, 312
,
,
Latin Small Letter Kra
, 0248
, -
, U+0139
, align="center", Ĺ
, 313
, Ĺ
,
Latin Capital Letter L with acute
, 0249
, -
, U+013A
, align="center", ĺ
, 314
, ĺ
, Latin Small Letter L with acute
, 0250
, -
, U+013B
, align="center", Ļ
, 315
, Ļ
, Latin Capital Letter L with cedilla
, 0251
, -
, U+013C
, align="center", ļ
, 316
, ļ
, Latin Small Letter L with cedilla
, 0252
, -
, U+013D
, align="center", Ľ
, 317
, Ľ
,
Latin Capital Letter L with caron
, 0253
, -
, U+013E
, align="center", ľ
, 318
, ľ
, Latin Small Letter L with caron
, 0254
, -
, U+013F
, align="center", Ŀ
, 319
, Ŀ
, Latin Capital Letter L with middle dot
, 0255
, -
, U+0140
, align="center", ŀ
, 320
, ŀ
, Latin Small Letter L with middle dot
, 0256
, -
, U+0141
, align="center", Ł
, 321
, Ł
,
Latin Capital Letter L with stroke
, 0257
, -
, U+0142
, align="center", ł
, 322
, ł
, Latin Small Letter L with stroke
, 0258
, -
, U+0143
, align="center", Ń
, 323
, Ń
,
Latin Capital Letter N with acute
, 0259
, -
, U+0144
, align="center", ń
, 324
, ń
, Latin Small Letter N with acute
, 0260
, -
, U+0145
, align="center", Ņ
, 325
, Ņ
, Latin Capital Letter N with cedilla
, 0261
, -
, U+0146
, align="center", ņ
, 326
, ņ
, Latin Small Letter N with cedilla
, 0262
, -
, U+0147
, align="center", Ň
, 327
, Ň
,
Latin Capital Letter N with caron
, 0263
, -
, U+0148
, align="center", ň
, 328
, ň
, Latin Small Letter N with caron
, 0264
, -
, align="center" rowspan=1 style="background: #ffebeb;", Deprecated, , U+0149
, align="center", ʼn
, 329
,
,
Latin Small Letter N preceded by apostrophe (deprecated as of Unicode version 5.2). "U+0149 Latin small letter n preceded by apostrophe was encoded for use in Afrikaans. The character is deprecated, and its use is strongly discouraged. In nearly all cases it is better represented by a sequence of an apostrophe followed by “n”." (pg. 208)
, 0265
, -
, align="center" rowspan=54 style="background: #ffcccc;", European
Latin, , U+014A
, align="center", Ŋ
, 330
, Ŋ
,
Latin Capital Letter Eng
, 0266
, -
, U+014B
, align="center", ŋ
, 331
, ŋ
, Latin Small Letter Eng
, 0267
, -
, U+014C
, align="center", Ō
, 332
, Ō
, Latin Capital Letter O with macron
, 0268
, -
, U+014D
, align="center", ō
, 333
, ō
, Latin Small Letter O with macron
, 0269
, -
, U+014E
, align="center", Ŏ
, 334
,
, Latin Capital Letter O with breve
, 0270
, -
, U+014F
, align="center", ŏ
, 335
,
, Latin Small Letter O with breve
, 0271
, -
, U+0150
, align="center", Ő
, 336
, Ő
, Latin Capital Letter O with double acute
, 0272
, -
, U+0151
, align="center", ő
, 337
, ő
, Latin Small Letter O with double acute
, 0273
, -
, U+0152
, align="center", Œ
, 338
, Œ
,
Latin Capital Ligature OE
, 0274
, -
, U+0153
, align="center", œ
, 339
, œ
, Latin Small Ligature OE
, 0275
, -
, U+0154
, align="center", Ŕ
, 340
, Ŕ
,
Latin Capital Letter R with acute
, 0276
, -
, U+0155
, align="center", ŕ
, 341
, ŕ
, Latin Small Letter R with acute
, 0277
, -
, U+0156
, align="center", Ŗ
, 342
, Ŗ
, Latin Capital Letter R with cedilla
, 0278
, -
, U+0157
, align="center", ŗ
, 343
, ŗ
, Latin Small Letter R with cedilla
, 0279
, -
, U+0158
, align="center", Ř
, 344
, Ř
,
Latin Capital Letter R with caron
, 0280
, -
, U+0159
, align="center", ř
, 345
, ř
, Latin Small Letter R with caron
, 0281
, -
, U+015A
, align="center", Ś
, 346
, Ś
,
Latin Capital Letter S with acute
, 0282
, -
, U+015B
, align="center", ś
, 347
, ś
, Latin Small Letter S with acute
, 0283
, -
, U+015C
, align="center", Ŝ
, 348
, Ŝ
, Latin Capital Letter S with circumflex
, 0284
, -
, U+015D
, align="center", ŝ
, 349
, ŝ
, Latin Small Letter S with circumflex
, 0285
, -
, U+015E
, align="center", Ş
, 350
, Ş
,
Latin Capital Letter S with cedilla
, 0286
, -
, U+015F
, align="center", ş
, 351
, ş
, Latin Small Letter S with cedilla
, 0287
, -
, U+0160
, align="center", Š
, 352
, Š
,
Latin Capital Letter S with caron
, 0288
, -
, U+0161
, align="center", š
, 353
, š
, Latin Small Letter S with caron
, 0289
, -
, U+0162
, align="center", Ţ
, 354
, Ţ
, Latin Capital Letter T with cedilla
, 0290
, -
, U+0163
, align="center", ţ
, 355
, ţ
, Latin Small Letter T with cedilla
, 0291
, -
, U+0164
, align="center", Ť
, 356
, Ť
,
Latin Capital Letter T with caron
, 0292
, -
, U+0165
, align="center", ť
, 357
, ť
, Latin Small Letter T with caron
, 0293
, -
, U+0166
, align="center", Ŧ
, 358
, Ŧ
,
Latin Capital Letter T with stroke
, 0294
, -
, U+0167
, align="center", ŧ
, 359
, ŧ
, Latin Small Letter T with stroke
, 0295
, -
, U+0168
, align="center", Ũ
, 360
, Ũ
, Latin Capital Letter U with tilde
, 0296
, -
, U+0169
, align="center", ũ
, 361
, ũ
, Latin Small Letter U with tilde
, 0297
, -
, U+016A
, align="center", Ū
, 362
, Ū
, Latin Capital Letter U with macron
, 0298
, -
, U+016B
, align="center", ū
, 363
, ū
, Latin Small Letter U with macron
, 0299
, -
, U+016C
, align="center", Ŭ
, 364
, Ŭ
,
Latin Capital Letter U with breve
, 0300
, -
, U+016D
, align="center", ŭ
, 365
, ŭ
, Latin Small Letter U with breve
, 0301
, -
, U+016E
, align="center", Ů
, 366
, Ů
, Latin Capital Letter U with ring above
, 0302
, -
, U+016F
, align="center", ů
, 367
, ů
, Latin Small Letter U with ring above
, 0303
, -
, U+0170
, align="center", Ű
, 368
, Ű
, Latin Capital Letter U with double acute
, 0304
, -
, U+0171
, align="center", ű
, 369
, ű
, Latin Small Letter U with double acute
, 0305
, -
, U+0172
, align="center", Ų
, 370
, Ų
, Latin Capital Letter U with ogonek
, 0306
, -
, U+0173
, align="center", ų
, 371
, ų
, Latin Small Letter U with ogonek
, 0307
, -
, U+0174
, align="center", Ŵ
, 372
, Ŵ
, Latin Capital Letter W with circumflex
, 0308
, -
, U+0175
, align="center", ŵ
, 373
, ŵ
, Latin Small Letter W with circumflex
, 0309
, -
, U+0176
, align="center", Ŷ
, 374
, Ŷ
, Latin Capital Letter Y with circumflex
, 0310
, -
, U+0177
, align="center", ŷ
, 375
, ŷ
, Latin Small Letter Y with circumflex
, 0311
, -
, U+0178
, align="center", Ÿ
, 376
, Ÿ
, Latin Capital Letter Y with diaeresis
, 0312
, -
, U+0179
, align="center", Ź
, 377
, Ź
,
Latin Capital Letter Z with acute
, 0313
, -
, U+017A
, align="center", ź
, 378
, ź
, Latin Small Letter Z with acute
, 0314
, -
, U+017B
, align="center", Ż
, 379
, Ż
,
Latin Capital Letter Z with dot above
, 0315
, -
, U+017C
, align="center", ż
, 380
, ż
, Latin Small Letter Z with dot above
, 0316
, -
, U+017D
, align="center", Ž
, 381
, Ž
,
Latin Capital Letter Z with caron
, 0317
, -
, U+017E
, align="center", ž
, 382
, ž
, Latin Small Letter Z with caron
, 0318
, -
, U+017F
, align="center", ſ
, 383
,
, Latin Small Letter long S
, 0319
, - class="nosort"
!
!Code
!Glyph
!Decimal
!HTML
!Description
! #
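The deprecation note for U+0149 in the table above recommends an apostrophe followed by "n". A minimal sketch using Python's standard `unicodedata` module suggests how such a replacement could be automated through compatibility decomposition (NFKD). Note that the Unicode decomposition yields U+02BC MODIFIER LETTER APOSTROPHE rather than the ASCII apostrophe, and that `replace_deprecated_n` is a hypothetical helper name chosen for illustration.

```python
import unicodedata

DEPRECATED_N = "\u0149"  # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE


def replace_deprecated_n(text: str) -> str:
    """Replace U+0149 with its two-character compatibility decomposition."""
    replacement = unicodedata.normalize("NFKD", DEPRECATED_N)
    return text.replace(DEPRECATED_N, replacement)


if __name__ == "__main__":
    fixed = replace_deprecated_n("\u0149 Boer")
    # Show the code points of the first two characters after replacement.
    print([f"U+{ord(ch):04X}" for ch in fixed[:2]])
```

Only U+0149 is touched here; applying NFKD to the whole string would also decompose every other composed letter, which is usually not wanted when cleaning up a single deprecated character.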
Latin Extended-B
208 characters; all belong to the Latin script; 33 in the MES-2 subset.
{, class="wikitable sortable collapsible" id="Table_Latin_Extended-B"
!
!Code
!Glyph
!Decimal
!Description
! #
! MES-2 Rationale
, -
, align="center" rowspan=64 style="background: #ffcccc;", Non-European
& historic Latin, , U+0180
,
, 384
,
Latin Small Letter B with stroke
, rowspan=15 colspan=2, ·
, -
, U+0181
,
, 385
,
Latin Capital Letter B with hook
, -
, U+0182
,
, 386
,
Latin Capital Letter B with top bar
, -
, U+0183
,
, 387
, Latin Small Letter B with top bar
, -
, U+0184
,
, 388
,
Latin Capital Letter Tone Six
, -
, U+0185
,
, 389
, Latin Small Letter Tone Six
, -
, U+0186
,
, 390
,
Latin Capital Letter Open O
, -
, U+0187
,
, 391
,
Latin Capital Letter C with hook
, -
, U+0188
,
, 392
, Latin Small Letter C with hook
, -
, U+0189
,
, 393
,
Latin Capital Letter African D
, -
, U+018A
,
, 394
,
Latin Capital Letter D with hook
, -
, U+018B
,
, 395
,
Latin Capital Letter D with top bar
, -
, U+018C
,
, 396
, Latin Small Letter D with top bar
, -
, U+018D
,
, 397
, Latin Small Letter Turned Delta
, -
, U+018E
,
, 398
,
Latin Capital Letter Reversed E
, -
, U+018F
,
, 399
,
Latin Capital Letter Schwa
, 0320
, for
Azerbaijani
, -
, U+0190
,
, 400
,
Latin Capital Letter Open E
, rowspan=2 colspan=2, ·
, -
, U+0191
,
, 401
, Latin Capital Letter F with hook
, -
, U+0192
,
, 402
, Latin Small Letter F with hook
, 0321
, in
WGL4
, -
, U+0193
,
, 403
,
Latin Capital Letter G with hook
, rowspan=36 colspan=2, ·
, -
, U+0194
,
, 404
,
Latin Capital Letter Gamma
, -
, U+0195
,
, 405
,
Latin Small Letter HV
, -
, U+0196
,
, 406
,
Latin Capital Letter Iota
, -
, U+0197
,
, 407
,
Latin Capital Letter I with stroke
, -
, U+0198
,
, 408
,
Latin Capital Letter K with hook
, -
, U+0199
,
, 409
, Latin Small Letter K with hook
, -
, U+019A
,
, 410
,
Latin Small Letter L with bar
, -
, U+019B
,
, 411
,
Latin Small Letter Lambda with stroke
, -
, U+019C
,
, 412
,
Latin Capital Letter Turned M
, -
, U+019D
,
, 413
,
Latin Capital Letter N with left hook
, -
, U+019E
,
, 414
,
Latin Small Letter N with long right leg
, -
, U+019F
,
, 415
,
Latin Capital Letter O with middle tilde
, -
, U+01A0
,
, 416
,
Latin Capital Letter O with horn
, -
, U+01A1
,
, 417
, Latin Small Letter O with horn
, -
, U+01A2
,
, 418
,
Latin Capital Letter OI (= Latin Capital Letter Gha)
, -
, U+01A3
,
, 419
, Latin Small Letter OI (= Latin Small Letter Gha)
, -
, U+01A4
,
, 420
,
Latin Capital Letter P with hook
, -
, U+01A5
,
, 421
, Latin Small Letter P with hook
, -
, U+01A6
,
, 422
,
Latin Letter YR
, -
, U+01A7
,
, 423
,
Latin Capital Letter Tone Two
, -
, U+01A8
,
, 424
, Latin Small Letter Tone Two
, -
, U+01A9
,
, 425
,
Latin Capital Letter Esh
, -
, U+01AA
,
, 426
, Latin Letter Reversed Esh Loop
, -
, U+01AB
,
, 427
, Latin Small Letter T with palatal hook
, -
, U+01AC
,
, 428
,
Latin Capital Letter T with hook
, -
, U+01AD
,
, 429
, Latin Small Letter T with hook
, -
, U+01AE
,
, 430
,
Latin Capital Letter T with retroflex hook
, -
, U+01AF
,
, 431
,
Latin Capital Letter U with horn
, -
, U+01B0
,
, 432
, Latin Small Letter U with horn
, -
, U+01B1
,
, 433
,
Latin Capital Letter Upsilon
, -
, U+01B2
,
, 434
,
Latin Capital Letter V with hook
, -
, U+01B3
,
, 435
,
Latin Capital Letter Y with hook
, -
, U+01B4
,
, 436
, Latin Small Letter Y with hook
, -
, U+01B5
,
, 437
,
Latin Capital Letter Z with stroke
, -
, U+01B6
,
, 438
, Latin Small Letter Z with stroke
, -
, U+01B7
,
, 439
,
Latin Capital Letter Ezh
, 0322
, for
Sami
, -
, U+01B8
,
, 440
,
Latin Capital Letter Ezh reversed
, rowspan=38 colspan=2, ·
, -
, U+01B9
,
, 441
, Latin Small Letter Ezh reversed
, -
, U+01BA
,
, 442
, Latin Small Letter Ezh with tail
, -
, U+01BB
,
, 443
, Latin Letter Two with stroke
, -
, U+01BC
,
, 444
,
Latin Capital Letter Tone Five
, -
, U+01BD
,
, 445
, Latin Small Letter Tone Five
, -
, U+01BE
,
, 446
, Latin Letter Inverted Glottal Stop with stroke
, -
, U+01BF
,
, 447
,
Latin Letter Wynn
, -
, align="center" rowspan="4" , African
clicks, , U+01C0
,
, 448
,
Latin Letter Dental Click
, -
, U+01C1
,
, 449
,
Latin Letter Lateral Click
, -
, U+01C2
,
, 450
,
Latin Letter Alveolar Click
, -
, U+01C3
,
, 451
,
Latin Letter Retroflex Click
, -
, align="center" rowspan="9" , Croatian, , U+01C4
,
, 452
,
Latin Capital Letter DZ with caron
, -
, U+01C5
,
, 453
, Latin Capital Letter D with Small Letter Z with caron
, -
, U+01C6
,
, 454
, Latin Small Letter DZ with caron
, -
, U+01C7
,
, 455
,
Latin Capital Letter LJ
, -
, U+01C8
,
, 456
, Latin Capital Letter L with Small Letter J
, -
, U+01C9
,
, 457
, Latin Small Letter LJ
, -
, U+01CA
,
, 458
, Latin Capital Letter NJ
, -
, U+01CB
,
, 459
, Latin Capital Letter N with Small Letter J
, -
, U+01CC
,
, 460
, Latin Small Letter NJ
, -
, align="center" rowspan="16" , Pinyin, , U+01CD
,
, 461
, Latin Capital Letter A with caron
, -
, U+01CE
,
, 462
, Latin Small Letter A with caron
, -
, U+01CF
,
, 463
, Latin Capital Letter I with caron
, -
, U+01D0
,
, 464
, Latin Small Letter I with caron
, -
, U+01D1
,
, 465
, Latin Capital Letter O with caron
, -
, U+01D2
,
, 466
, Latin Small Letter O with caron
, -
, U+01D3
,
, 467
, Latin Capital Letter U with caron
, -
, U+01D4
,
, 468
, Latin Small Letter U with caron
, -
, U+01D5
,
, 469
, Latin Capital Letter U with diaeresis and macron
, -
, U+01D6
,
, 470
, Latin Small Letter U with diaeresis and macron
, -
, U+01D7
,
, 471
, Latin Capital Letter U with diaeresis and acute
, -
, U+01D8
,
, 472
, Latin Small Letter U with diaeresis and acute
, -
, U+01D9
,
, 473
, Latin Capital Letter U with diaeresis and caron
, -
, U+01DA
,
, 474
, Latin Small Letter U with diaeresis and caron
, -
, U+01DB
,
, 475
, Latin Capital Letter U with diaeresis and grave
, -
, U+01DC
,
, 476
, Latin Small Letter U with diaeresis and grave
, -
, align="center" rowspan=35 style="background: #ffcccc;", Phonetic &
historic letters, , U+01DD
,
, 477
, Latin Small Letter Turned E
, -
, U+01DE
,
, 478
, Latin Capital Letter A with diaeresis and macron
, 0323
, rowspan=18, for
Sami
, -
, U+01DF
,
, 479
, Latin Small Letter A with diaeresis and macron
, 0324
, -
, U+01E0
,
, 480
, Latin Capital Letter A with dot above and macron
, 0325
, -
, U+01E1
,
, 481
, Latin Small Letter A with dot above and macron
, 0326
, -
, U+01E2
,
, 482
, Latin Capital Letter Æ with macron
, 0327
, -
, U+01E3
,
, 483
, Latin Small Letter Æ with macron
, 0328
, -
, U+01E4
,
, 484
, Latin Capital Letter G with stroke
, 0329
, -
, U+01E5
,
, 485
, Latin Small Letter G with stroke
, 0330
, -
, U+01E6
,
, 486
, Latin Capital Letter G with caron
, 0331
, -
, U+01E7
,
, 487
, Latin Small Letter G with caron
, 0332
, -
, U+01E8
,
, 488
, Latin Capital Letter K with caron
, 0333
, -
, U+01E9
,
, 489
, Latin Small Letter K with caron
, 0334
, -
, U+01EA
,
, 490
, Latin Capital Letter O with ogonek
, 0335
, -
, U+01EB
,
, 491
, Latin Small Letter O with ogonek
, 0336
, -
, U+01EC
,
, 492
, Latin Capital Letter O with ogonek and macron
, 0337
, -
, U+01ED
,
, 493
, Latin Small Letter O with ogonek and macron
, 0338
, -
, U+01EE
,
, 494
, Latin Capital Letter Ezh with caron
, 0339
, -
, U+01EF
,
, 495
, Latin Small Letter Ezh with caron
, 0340
, -
, U+01F0
,
, 496
, Latin Small Letter J with caron
, rowspan=10 colspan=2, ·
, -
, U+01F1
,
, 497
, Latin Capital Letter DZ
, -
, U+01F2
,
, 498
, Latin Capital Letter D with Small Letter Z
, -
, U+01F3
,
, 499
, Latin Small Letter DZ
, -
, U+01F4
,
, 500
, Latin Capital Letter G with acute
, -
, U+01F5
,
, 501
, Latin Small Letter G with acute
, -
, U+01F6
,
, 502
, Latin Capital Letter Hwair
, -
, U+01F7
,
, 503
, Latin Capital Letter Wynn
, -
, U+01F8
,
, 504
, Latin Capital Letter N with grave
, -
, U+01F9
,
, 505
, Latin Small Letter N with grave
, -
, U+01FA
,
, 506
, Latin Capital Letter A with ring above and acute
, 0341
, rowspan=6, in
WGL4
, -
, U+01FB
,
, 507
, Latin Small Letter A with ring above and acute
, 0342
, -
, U+01FC
,
, 508
, Latin Capital Letter Æ with acute
, 0343
, -
, U+01FD
,
, 509
, Latin Small Letter Æ with acute
, 0344
, -
, U+01FE
,
, 510
, Latin Capital Letter O with stroke and acute
, 0345
, -
, U+01FF
,
, 511
, Latin Small Letter O with stroke and acute
, 0346
, -style="border-top: 2px solid grey;"
, align="center" rowspan="24" , Slovenian
& Croatian, , U+0200
,
, 512
, Latin Capital Letter A with double grave
, rowspan=24 colspan=2, ·
, -
, U+0201
,
, 513
, Latin Small Letter A with double grave
, -
, U+0202
,
, 514
, Latin Capital Letter A with inverted breve
, -
, U+0203
,
, 515
, Latin Small Letter A with inverted breve
, -
, U+0204
,
, 516
, Latin Capital Letter E with double grave
, -
, U+0205
,
, 517
, Latin Small Letter E with double grave
, -
, U+0206
,
, 518
, Latin Capital Letter E with inverted breve
, -
, U+0207
,
, 519
, Latin Small Letter E with inverted breve
, -
, U+0208
,
, 520
, Latin Capital Letter I with double grave
, -
, U+0209
,
, 521
, Latin Small Letter I with double grave
, -
, U+020A
,
, 522
, Latin Capital Letter I with inverted breve
, -
, U+020B
,
, 523
, Latin Small Letter I with inverted breve
, -
, U+020C
,
, 524
, Latin Capital Letter O with double grave
, -
, U+020D
,
, 525
, Latin Small Letter O with double grave
, -
, U+020E
,
, 526
, Latin Capital Letter O with inverted breve
, -
, U+020F
,
, 527
, Latin Small Letter O with inverted breve
, -
, U+0210
,
, 528
, Latin Capital Letter R with double grave
, -
, U+0211
,
, 529
, Latin Small Letter R with double grave
, -
, U+0212
,
, 530
, Latin Capital Letter R with inverted breve
, -
, U+0213
,
, 531
, Latin Small Letter R with inverted breve
, -
, U+0214
,
, 532
, Latin Capital Letter U with double grave
, -
, U+0215
,
, 533
, Latin Small Letter U with double grave
, -
, U+0216
,
, 534
, Latin Capital Letter U with inverted breve
, -
, U+0217
,
, 535
, Latin Small Letter U with inverted breve
, -
, align="center" rowspan=4 style="background: #ffcccc;", Romanian, , U+0218
,
, 536
,
Latin Capital Letter S with comma below
, 0347
, rowspan=4, for
Romanian
, -
, U+0219
,
, 537
,
Latin Small Letter S with comma below
, 0348
, -
, U+021A
,
, 538
,
Latin Capital Letter T with comma below
, 0349
, -
, U+021B
,
, 539
, Latin Small Letter T with comma below
, 0350
, -
, align="center" rowspan="14" , Miscellaneous, , U+021C
,
, 540
, Latin Capital Letter Yogh
, rowspan=2 colspan=2, ·
, -
, U+021D
,
, 541
, Latin Small Letter Yogh
, -
, U+021E
,
, 542
, Latin Capital Letter H with caron
, 0351
, rowspan=2, for Finnish Romani, Scots
, -
, U+021F
,
, 543
, Latin Small Letter H with caron
, 0352
, -
, U+0220
,
, 544
, Latin Capital Letter N with long right leg
, colspan="2" rowspan="48" , ·
, -
, U+0221
,
, 545
, Latin Small Letter D with curl
, -
, U+0222
,
, 546
, Latin Capital Letter OU
, -
, U+0223
,
, 547
, Latin Small Letter OU
, -
, U+0224
,
, 548
, Latin Capital Letter Z with hook
, -
, U+0225
,
, 549
, Latin Small Letter Z with hook
, -
, U+0226
,
, 550
, Latin Capital Letter A with dot above
, -
, U+0227
,
, 551
, Latin Small Letter A with dot above
, -
, U+0228
,
, 552
, Latin Capital Letter E with cedilla
, -
, U+0229
,
, 553
, Latin Small Letter E with cedilla
, -
, align="center" rowspan="10" , Livonian, , U+022A
,
, 554
, Latin Capital Letter O with diaeresis and macron
, -
, U+022B
,
, 555
, Latin Small Letter O with diaeresis and macron
, -
, U+022C
,
, 556
, Latin Capital Letter O with tilde and macron
, -
, U+022D
,
, 557
, Latin Small Letter O with tilde and macron
, -
, U+022E
,
, 558
, Latin Capital Letter O with dot above
, -
, U+022F
,
, 559
, Latin Small Letter O with dot above
, -
, U+0230
,
, 560
, Latin Capital Letter O with dot above and macron
, -
, U+0231
,
, 561
, Latin Small Letter O with dot above and macron
, -
, U+0232
,
, 562
, Latin Capital Letter Y with macron
, -
, U+0233
,
, 563
, Latin Small Letter Y with macron
, -
, align="center" rowspan="3" , Sinology, , U+0234
,
, 564
, Latin Small Letter L with curl
, -
, U+0235
,
, 565
, Latin Small Letter N with curl
, -
, U+0236
,
, 566
, Latin Small Letter T with curl
, -
, rowspan="25" align="center" , Miscellaneous, , U+0237
,
, 567
, Latin Small Letter Dotless J
, -
, U+0238
,
, 568
, Latin Small Letter DB Digraph
, -
, U+0239
,
, 569
, Latin Small Letter QP Digraph
, -
, U+023A
,
, 570
, Latin Capital Letter A with stroke
, -
, U+023B
,
, 571
, Latin Capital Letter C with stroke
, -
, U+023C
,
, 572
, Latin Small Letter C with stroke
, -
, U+023D
,
, 573
, Latin Capital Letter L with bar
, -
, U+023E
,
, 574
, Latin Capital Letter T with diagonal stroke
, -
, U+023F
,
, 575
, Latin Small Letter S with swash tail
, -
, U+0240
,
, 576
, Latin Small Letter Z with swash tail
, -
, U+0241
,
, 577
, Latin Capital Letter Glottal Stop
, -
, U+0242
,
, 578
, Latin Small Letter Glottal Stop
, -
, U+0243
,
, 579
, Latin Capital Letter B with stroke
, -
, U+0244
,
, 580
, Latin Capital Letter U bar
, -
, U+0245
,
, 581
, Latin Capital Letter Turned V
, -
, U+0246
,
, 582
, Latin Capital Letter E with stroke
, -
, U+0247
,
, 583
, Latin Small Letter E with stroke
, -
, U+0248
,
, 584
, Latin Capital Letter J with stroke
, -
, U+0249
,
, 585
, Latin Small Letter J with stroke
, -
, U+024A
,
, 586
, Latin Capital Letter Small Q with hook tail
, -
, U+024B
,
, 587
, Latin Small Letter Q with hook tail
, -
, U+024C
,
, 588
, Latin Capital Letter R with stroke
, -
, U+024D
,
, 589
, Latin Small Letter R with stroke
, -
, U+024E
,
, 590
, Latin Capital Letter Y with stroke
, -
, U+024F
,
, 591
, Latin Small Letter Y with stroke
, - class="nosort"
!
!Code
!Glyph
!Decimal
!Description
! #
! MES-2 Rationale
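The Code and Decimal columns above give the same code point in two notations: the hexadecimal digits after `U+` and their decimal value. As a minimal sketch, assuming Python 3 and its standard `unicodedata` module (the helper name `describe` is made up for illustration and is not part of this list), one row can be expanded into its decimal value, formal name, and HTML/XML numeric character references:

```python
import unicodedata


def describe(code):
    """Expand a 'U+XXXX' code into the other notations used in the tables.

    `describe` is a hypothetical helper written for this illustration.
    """
    if not code.upper().startswith("U+"):
        raise ValueError("expected a code of the form U+XXXX")
    cp = int(code[2:], 16)          # hexadecimal digits -> integer code point
    ch = chr(cp)                    # the character itself (the Glyph column)
    return {
        "code": "U+{:04X}".format(cp),
        "glyph": ch,
        "decimal": cp,                         # the Decimal column
        "name": unicodedata.name(ch),          # formal Unicode character name
        "ref_decimal": "&#{};".format(cp),     # decimal character reference
        "ref_hex": "&#x{:X};".format(cp),      # hexadecimal character reference
    }


row = describe("U+0218")
print(row["decimal"], row["name"], row["ref_decimal"], row["ref_hex"])
# 536 LATIN CAPITAL LETTER S WITH COMMA BELOW &#536; &#x218;
```

The formal names returned by `unicodedata.name` are upper case, while the tables here use title case for readability.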
Latin Extended Additional
256 characters; all belong to the Latin script; 23 in the MES-2 subset.
{, class="wikitable sortable collapsible" id="Table_Latin_Extended_Additional"
!
!Code
!Glyph
!Description
! #
! MES-2 Rationale
, -
, align="center" rowspan="156" , General Use Extensions, , U+1E00
, align="center", Ḁ
, Latin Capital Letter A with ring below
, rowspan=2 colspan=2, ·
, -
, U+1E01
, align="center", ḁ
, Latin Small Letter A with ring below
, -
, U+1E02
, align="center", Ḃ
, Latin Capital Letter B with dot above
, 0647
, rowspan=2, ISO 8859-14
, -
, U+1E03
, align="center", ḃ
, Latin Small Letter B with dot above
, 0648
, -
, U+1E04
, align="center", Ḅ
, Latin Capital Letter B with dot below
, -
, U+1E05
, align="center", ḅ
, Latin Small Letter B with dot below
, -
, U+1E06
, align="center", Ḇ
, Latin Capital Letter B with line below
, -
, U+1E07
, align="center", ḇ
, Latin Small Letter B with line below
, -
, U+1E08
, align="center", Ḉ
, Latin Capital Letter C with cedilla and acute
, -
, U+1E09
, align="center", ḉ
, Latin Small Letter C with cedilla and acute
, -
, U+1E0A
, align="center", Ḋ
, Latin Capital Letter D with dot above
, 0649
, rowspan=2, ISO 8859-14
, -
, U+1E0B
, align="center", ḋ
, Latin Small Letter D with dot above
, 0650
, -
, U+1E0C
, align="center", Ḍ
, Latin Capital Letter D with dot below
, -
, U+1E0D
, align="center", ḍ
, Latin Small Letter D with dot below
, -
, U+1E0E
, align="center", Ḏ
, Latin Capital Letter D with line below
, -
, U+1E0F
, align="center", ḏ
, Latin Small Letter D with line below
, -
, U+1E10
, align="center", Ḑ
, Latin Capital Letter D with cedilla
, -
, U+1E11
, align="center", ḑ
, Latin Small Letter D with cedilla
, -
, U+1E12
, align="center", Ḓ
, Latin Capital Letter D with circumflex below
, -
, U+1E13
, align="center", ḓ
, Latin Small Letter D with circumflex below
, -
, U+1E14
, align="center", Ḕ
, Latin Capital Letter E with macron and grave
, -
, U+1E15
, align="center", ḕ
, Latin Small Letter E with macron and grave
, -
, U+1E16
, align="center", Ḗ
, Latin Capital Letter E with macron and acute
, -
, U+1E17
, align="center", ḗ
, Latin Small Letter E with macron and acute
, -
, U+1E18
, align="center", Ḙ
, Latin Capital Letter E with circumflex below
, -
, U+1E19
, align="center", ḙ
, Latin Small Letter E with circumflex below
, -
, U+1E1A
, align="center", Ḛ
, Latin Capital Letter E with tilde below
, -
, U+1E1B
, align="center", ḛ
, Latin Small Letter E with tilde below
, -
, U+1E1C
, align="center", Ḝ
, Latin Capital Letter E with cedilla and breve
, -
, U+1E1D
, align="center", ḝ
, Latin Small Letter E with cedilla and breve
, -
, U+1E1E
, align="center", Ḟ
, Latin Capital Letter F with dot above
, 0651
, rowspan=2, ISO 8859-14
, -
, U+1E1F
, align="center", ḟ
, Latin Small Letter F with dot above
, 0652
, -
, U+1E20
, align="center", Ḡ
, Latin Capital Letter G with macron
, -
, U+1E21
, align="center", ḡ
, Latin Small Letter G with macron
, -
, U+1E22
, align="center", Ḣ
, Latin Capital Letter H with dot above
, -
, U+1E23
, align="center", ḣ
, Latin Small Letter H with dot above
, -
, U+1E24
, align="center", Ḥ
, Latin Capital Letter H with dot below
, -
, U+1E25
, align="center", ḥ
, Latin Small Letter H with dot below
, -
, U+1E26
, align="center", Ḧ
, Latin Capital Letter H with diaeresis
, -
, U+1E27
, align="center", ḧ
, Latin Small Letter H with diaeresis
, -
, U+1E28
, align="center", Ḩ
, Latin Capital Letter H with cedilla
, -
, U+1E29
, align="center", ḩ
, Latin Small Letter H with cedilla
, -
, U+1E2A
, align="center", Ḫ
, Latin Capital Letter H with breve below
, -
, U+1E2B
, align="center", ḫ
, Latin Small Letter H with breve below
, -
, U+1E2C
, align="center", Ḭ
, Latin Capital Letter I with tilde below
, -
, U+1E2D
, align="center", ḭ
, Latin Small Letter I with tilde below
, -
, U+1E2E
, align="center", Ḯ
, Latin Capital Letter I with diaeresis and acute
, -
, U+1E2F
, align="center", ḯ
, Latin Small Letter I with diaeresis and acute
, -
, U+1E30
, align="center", Ḱ
, Latin Capital Letter K with acute
, -
, U+1E31
, align="center", ḱ
, Latin Small Letter K with acute
, -
, U+1E32
, align="center", Ḳ
, Latin Capital Letter K with dot below
, -
, U+1E33
, align="center", ḳ
, Latin Small Letter K with dot below
, -
, U+1E34
, align="center", Ḵ
, Latin Capital Letter K with line below
, -
, U+1E35
, align="center", ḵ
, Latin Small Letter K with line below
, -
, U+1E36
, align="center", Ḷ
, Latin Capital Letter L with dot below
, -
, U+1E37
, align="center", ḷ
, Latin Small Letter L with dot below
, -
, U+1E38
, align="center", Ḹ
, Latin Capital Letter L with dot below and macron
, -
, U+1E39
, align="center", ḹ
, Latin Small Letter L with dot below and macron
, -
, U+1E3A
, align="center", Ḻ
, Latin Capital Letter L with line below
, -
, U+1E3B
, align="center", ḻ
, Latin Small Letter L with line below
, -
, U+1E3C
, align="center", Ḽ
, Latin Capital Letter L with circumflex below
, -
, U+1E3D
, align="center", ḽ
, Latin Small Letter L with circumflex below
, -
, U+1E3E
, align="center", Ḿ
, Latin Capital Letter M with acute
, -
, U+1E3F
, align="center", ḿ
, Latin Small Letter M with acute
, -
, U+1E40
, align="center", Ṁ
, Latin Capital Letter M with dot above
, 0653
, rowspan=2, ISO 8859-14
, -
, U+1E41
, align="center", ṁ
, Latin Small Letter M with dot above
, 0654
, -
, U+1E42
, align="center", Ṃ
, Latin Capital Letter M with dot below
, -
, U+1E43
, align="center", ṃ
, Latin Small Letter M with dot below
, -
, U+1E44
, align="center", Ṅ
, Latin Capital Letter N with dot above
, -
, U+1E45
, align="center", ṅ
, Latin Small Letter N with dot above
, -
, U+1E46
, align="center", Ṇ
, Latin Capital Letter N with dot below
, -
, U+1E47
, align="center", ṇ
, Latin Small Letter N with dot below
, -
, U+1E48
, align="center", Ṉ
, Latin Capital Letter N with line below
, -
, U+1E49
, align="center", ṉ
, Latin Small Letter N with line below
, -
, U+1E4A
, align="center", Ṋ
, Latin Capital Letter N with circumflex below
, -
, U+1E4B
, align="center", ṋ
, Latin Small Letter N with circumflex below
, -
, U+1E4C
, align="center", Ṍ
, Latin Capital Letter O with tilde and acute
, -
, U+1E4D
, align="center", ṍ
, Latin Small Letter O with tilde and acute
, -
, U+1E4E
, align="center", Ṏ
, Latin Capital Letter O with tilde and diaeresis
, -
, U+1E4F
, align="center", ṏ
, Latin Small Letter O with tilde and diaeresis
, -
, U+1E50
, align="center", Ṑ
, Latin Capital Letter O with macron and grave
, -
, U+1E51
, align="center", ṑ
, Latin Small Letter O with macron and grave
, -
, U+1E52
, align="center", Ṓ
, Latin Capital Letter O with macron and acute
, -
, U+1E53
, align="center", ṓ
, Latin Small Letter O with macron and acute
, -
, U+1E54
, align="center", Ṕ
, Latin Capital Letter P with acute
, -
, U+1E55
, align="center", ṕ
, Latin Small Letter P with acute
, -
, U+1E56
, align="center", Ṗ
, Latin Capital Letter P with dot above
, 0655
, rowspan=2, ISO 8859-14
, -
, U+1E57
, align="center", ṗ
, Latin Small Letter P with dot above
, 0656
, -
, U+1E58
, align="center", Ṙ
, Latin Capital Letter R with dot above
, -
, U+1E59
, align="center", ṙ
, Latin Small Letter R with dot above
, -
, U+1E5A
, align="center", Ṛ
, Latin Capital Letter R with dot below
, -
, U+1E5B
, align="center", ṛ
, Latin Small Letter R with dot below
, -
, U+1E5C
, align="center", Ṝ
, Latin Capital Letter R with dot below and macron
, -
, U+1E5D
, align="center", ṝ
, Latin Small Letter R with dot below and macron
, -
, U+1E5E
, align="center", Ṟ
, Latin Capital Letter R with line below
, -
, U+1E5F
, align="center", ṟ
, Latin Small Letter R with line below
, -
, U+1E60
, align="center", Ṡ
, Latin Capital Letter S with dot above
, 0657
, rowspan=2, ISO 8859-14
, -
, U+1E61
, align="center", ṡ
, Latin Small Letter S with dot above
, 0658
, -
, U+1E62
, align="center", Ṣ
, Latin Capital Letter S with dot below
, -
, U+1E63
, align="center", ṣ
, Latin Small Letter S with dot below
, -
, U+1E64
, align="center", Ṥ
, Latin Capital Letter S with acute and dot above
, -
, U+1E65
, align="center", ṥ
, Latin Small Letter S with acute and dot above
, -
, U+1E66
, align="center", Ṧ
, Latin Capital Letter S with caron and dot above
, -
, U+1E67
, align="center", ṧ
, Latin Small Letter S with caron and dot above
, -
, U+1E68
, align="center", Ṩ
, Latin Capital Letter S with dot below and dot above
, -
, U+1E69
, align="center", ṩ
, Latin Small Letter S with dot below and dot above
, -
, U+1E6A
, align="center", Ṫ
, Latin Capital Letter T with dot above
, 0659
, rowspan=2, ISO 8859-14
, -
, U+1E6B
, align="center", ṫ
, Latin Small Letter T with dot above
, 0660
, -
, U+1E6C
, align="center", Ṭ
, Latin Capital Letter T with dot below
, -
, U+1E6D
, align="center", ṭ
, Latin Small Letter T with dot below
, -
, U+1E6E
, align="center", Ṯ
, Latin Capital Letter T with line below
, -
, U+1E6F
, align="center", ṯ
, Latin Small Letter T with line below
, -
, U+1E70
, align="center", Ṱ
, Latin Capital Letter T with circumflex below
, -
, U+1E71
, align="center", ṱ
, Latin Small Letter T with circumflex below
, -
, U+1E72
, align="center", Ṳ
, Latin Capital Letter U with diaeresis below
, -
, U+1E73
, align="center", ṳ
, Latin Small Letter U with diaeresis below
, -
, U+1E74
, align="center", Ṵ
, Latin Capital Letter U with tilde below
, -
, U+1E75
, align="center", ṵ
, Latin Small Letter U with tilde below
, -
, U+1E76
, align="center", Ṷ
, Latin Capital Letter U with circumflex below
, -
, U+1E77
, align="center", ṷ
, Latin Small Letter U with circumflex below
, -
, U+1E78
, align="center", Ṹ
, Latin Capital Letter U with tilde and acute
, -
, U+1E79
, align="center", ṹ
, Latin Small Letter U with tilde and acute
, -
, U+1E7A
, align="center", Ṻ
, Latin Capital Letter U with macron and diaeresis
, -
, U+1E7B
, align="center", ṻ
, Latin Small Letter U with macron and diaeresis
, -
, U+1E7C
, align="center", Ṽ
, Latin Capital Letter V with tilde
, -
, U+1E7D
, align="center", ṽ
, Latin Small Letter V with tilde
, -
, U+1E7E
, align="center", Ṿ
, Latin Capital Letter V with dot below
, -
, U+1E7F
, align="center", ṿ
, Latin Small Letter V with dot below
, -
, U+1E80
, align="center", Ẁ
, Latin Capital Letter W with grave
, 0661
, rowspan=6, in WGL4
, -
, U+1E81
, align="center", ẁ
, Latin Small Letter W with grave
, 0662
, -
, U+1E82
, align="center", Ẃ
, Latin Capital Letter W with acute
, 0663
, -
, U+1E83
, align="center", ẃ
, Latin Small Letter W with acute
, 0664
, -
, U+1E84
, align="center", Ẅ
, Latin Capital Letter W with diaeresis
, 0665
, -
, U+1E85
, align="center", ẅ
, Latin Small Letter W with diaeresis
, 0666
, -
, U+1E86
, align="center", Ẇ
, Latin Capital Letter W with dot above
, -
, U+1E87
, align="center", ẇ
, Latin Small Letter W with dot above
, -
, U+1E88
, align="center", Ẉ
, Latin Capital Letter W with dot below
, -
, U+1E89
, align="center", ẉ
, Latin Small Letter W with dot below
, -
, U+1E8A
, align="center", Ẋ
, Latin Capital Letter X with dot above
, -
, U+1E8B
, align="center", ẋ
, Latin Small Letter X with dot above
, -
, U+1E8C
, align="center", Ẍ
, Latin Capital Letter X with diaeresis
, -
, U+1E8D
, align="center", ẍ
, Latin Small Letter X with diaeresis
, -
, U+1E8E
, align="center", Ẏ
, Latin Capital Letter Y with dot above
, -
, U+1E8F
, align="center", ẏ
, Latin Small Letter Y with dot above
, -
, U+1E90
, align="center", Ẑ
, Latin Capital Letter Z with circumflex
, -
, U+1E91
, align="center", ẑ
, Latin Small Letter Z with circumflex
, -
, U+1E92
, align="center", Ẓ
, Latin Capital Letter Z with dot below
, -
, U+1E93
, align="center", ẓ
, Latin Small Letter Z with dot below
, -
, U+1E94
, align="center", Ẕ
, Latin Capital Letter Z with line below
, -
, U+1E95
, align="center", ẕ
, Latin Small Letter Z with line below
, -
, U+1E96
, align="center", ẖ
, Latin Small Letter H with line below
, -
, U+1E97
, align="center", ẗ
, Latin Small Letter T with diaeresis
, -
, U+1E98
, align="center", ẘ
, Latin Small Letter W with ring above
, -
, U+1E99
, align="center", ẙ
, Latin Small Letter Y with ring above
, -
, U+1E9A
, align="center", ẚ
, Latin Small Letter A with right half ring
, -
, U+1E9B
, align="center", ẛ
, Latin Small Letter Long S with dot above
, 0667
, for Fraktur, Irish Gaelic, Old English
, -
, align="center" rowspan="2" , Medievalist, , U+1E9C
, align="center", ẜ
, Latin Small Letter Long S with diagonal stroke
, -
, U+1E9D
, align="center", ẝ
, Latin Small Letter Long S with high stroke
, -
, align="center" rowspan="1" , German typography, , U+1E9E
, align="center", ẞ
, Latin Capital Letter Sharp S
, -
, align="center" rowspan="1" , Medievalist, , U+1E9F
, align="center", ẟ
, Latin Small Letter Delta
, -
, align="center" rowspan="90" , Vietnamese, , U+1EA0
, align="center", Ạ
, Latin Capital Letter A with dot below
, -
, U+1EA1
, align="center", ạ
, Latin Small Letter A with dot below
, -
, U+1EA2
, align="center", Ả
, Latin Capital Letter A with hook above
, -
, U+1EA3
, align="center", ả
, Latin Small Letter A with hook above
, -
, U+1EA4
, align="center", Ấ
, Latin Capital Letter A with circumflex and acute
, -
, U+1EA5
, align="center", ấ
, Latin Small Letter A with circumflex and acute
, -
, U+1EA6
, align="center", Ầ
, Latin Capital Letter A with circumflex and grave
, -
, U+1EA7
, align="center", ầ
, Latin Small Letter A with circumflex and grave
, -
, U+1EA8
, align="center", Ẩ
, Latin Capital Letter A with circumflex and hook above
, -
, U+1EA9
, align="center", ẩ
, Latin Small Letter A with circumflex and hook above
, -
, U+1EAA
, align="center", Ẫ
, Latin Capital Letter A with circumflex and tilde
, -
, U+1EAB
, align="center", ẫ
, Latin Small Letter A with circumflex and tilde
, -
, U+1EAC
, align="center", Ậ
, Latin Capital Letter A with circumflex and dot below
, -
, U+1EAD
, align="center", ậ
, Latin Small Letter A with circumflex and dot below
, -
, U+1EAE
, align="center", Ắ
, Latin Capital Letter A with breve and acute
, -
, U+1EAF
, align="center", ắ
, Latin Small Letter A with breve and acute
, -
, U+1EB0
, align="center", Ằ
, Latin Capital Letter A with breve and grave
, -
, U+1EB1
, align="center", ằ
, Latin Small Letter A with breve and grave
, -
, U+1EB2
, align="center", Ẳ
, Latin Capital Letter A with breve and hook above
, -
, U+1EB3
, align="center", ẳ
, Latin Small Letter A with breve and hook above
, -
, U+1EB4
, align="center", Ẵ
, Latin Capital Letter A with breve and tilde
, -
, U+1EB5
, align="center", ẵ
, Latin Small Letter A with breve and tilde
, -
, U+1EB6
, align="center", Ặ
, Latin Capital Letter A with breve and dot below
, -
, U+1EB7
, align="center", ặ
, Latin Small Letter A with breve and dot below
, -
, U+1EB8
, align="center", Ẹ
, Latin Capital Letter E with dot below
, -
, U+1EB9
, align="center", ẹ
, Latin Small Letter E with dot below
, -
, U+1EBA
, align="center", Ẻ
, Latin Capital Letter E with hook above
, -
, U+1EBB
, align="center", ẻ
, Latin Small Letter E with hook above
, -
, U+1EBC
, align="center", Ẽ
, Latin Capital Letter E with tilde
, -
, U+1EBD
, align="center", ẽ
, Latin Small Letter E with tilde
, -
, U+1EBE
, align="center", Ế
, Latin Capital Letter E with circumflex and acute
, -
, U+1EBF
, align="center", ế
, Latin Small Letter E with circumflex and acute
, -
, U+1EC0
, align="center", Ề
, Latin Capital Letter E with circumflex and grave
, -
, U+1EC1
, align="center", ề
, Latin Small Letter E with circumflex and grave
, -
, U+1EC2
, align="center", Ể
, Latin Capital Letter E with circumflex and hook above
, -
, U+1EC3
, align="center", ể
, Latin Small Letter E with circumflex and hook above
, -
, U+1EC4
, align="center", Ễ
, Latin Capital Letter E with circumflex and tilde
, -
, U+1EC5
, align="center", ễ
, Latin Small Letter E with circumflex and tilde
, -
, U+1EC6
, align="center", Ệ
, Latin Capital Letter E with circumflex and dot below
, -
, U+1EC7
, align="center", ệ
, Latin Small Letter E with circumflex and dot below
, -
, U+1EC8
, align="center", Ỉ
, Latin Capital Letter I with hook above
, -
, U+1EC9
, align="center", ỉ
, Latin Small Letter I with hook above
, -
, U+1ECA
, align="center", Ị
, Latin Capital Letter I with dot below
, -
, U+1ECB
, align="center", ị
, Latin Small Letter I with dot below
, -
, U+1ECC
, align="center", Ọ
, Latin Capital Letter O with dot below
, -
, U+1ECD
, align="center", ọ
, Latin Small Letter O with dot below
, -
, U+1ECE
, align="center", Ỏ
, Latin Capital Letter O with hook above
, -
, U+1ECF
, align="center", ỏ
, Latin Small Letter O with hook above
, -
, U+1ED0
, align="center", Ố
, Latin Capital Letter O with circumflex and acute
, -
, U+1ED1
, align="center", ố
, Latin Small Letter O with circumflex and acute
, -
, U+1ED2
, align="center", Ồ
, Latin Capital Letter O with circumflex and grave
, -
, U+1ED3
, align="center", ồ
, Latin Small Letter O with circumflex and grave
, -
, U+1ED4
, align="center", Ổ
, Latin Capital Letter O with circumflex and hook above
, -
, U+1ED5
, align="center", ổ
, Latin Small Letter O with circumflex and hook above
, -
, U+1ED6
, align="center", Ỗ
, Latin Capital Letter O with circumflex and tilde
, -
, U+1ED7
, align="center", ỗ
, Latin Small Letter O with circumflex and tilde
, -
, U+1ED8
, align="center", Ộ
, Latin Capital Letter O with circumflex and dot below
, -
, U+1ED9
, align="center", ộ
, Latin Small Letter O with circumflex and dot below
, -
, U+1EDA
, align="center", Ớ
, Latin Capital Letter O with horn and acute
, -
, U+1EDB
, align="center", ớ
, Latin Small Letter O with horn and acute
, -
, U+1EDC
, align="center", Ờ
, Latin Capital Letter O with horn and grave
, -
, U+1EDD
, align="center", ờ
, Latin Small Letter O with horn and grave
, -
, U+1EDE
, align="center", Ở
, Latin Capital Letter O with horn and hook above
, -
, U+1EDF
, align="center", ở
, Latin Small Letter O with horn and hook above
, -
, U+1EE0
, align="center", Ỡ
, Latin Capital Letter O with horn and tilde
, -
, U+1EE1
, align="center", ỡ
, Latin Small Letter O with horn and tilde
, -
, U+1EE2
, align="center", Ợ
, Latin Capital Letter O with horn and dot below
, -
, U+1EE3
, align="center", ợ
, Latin Small Letter O with horn and dot below
, -
, U+1EE4
, align="center", Ụ
, Latin Capital Letter U with dot below
, -
, U+1EE5
, align="center", ụ
, Latin Small Letter U with dot below
, -
, U+1EE6
, align="center", Ủ
, Latin Capital Letter U with hook above
, -
, U+1EE7
, align="center", ủ
, Latin Small Letter U with hook above
, -
, U+1EE8
, align="center", Ứ
, Latin Capital Letter U with horn and acute
, -
, U+1EE9
, align="center", ứ
, Latin Small Letter U with horn and acute
, -
, U+1EEA
, align="center", Ừ
, Latin Capital Letter U with horn and grave
, -
, U+1EEB
, align="center", ừ
, Latin Small Letter U with horn and grave
, -
, U+1EEC
, align="center", Ử
, Latin Capital Letter U with horn and hook above
, -
, U+1EED
, align="center", ử
, Latin Small Letter U with horn and hook above
, -
, U+1EEE
, align="center", Ữ
, Latin Capital Letter U with horn and tilde
, -
, U+1EEF
, align="center", ữ
, Latin Small Letter U with horn and tilde
, -
, U+1EF0
, align="center", Ự
, Latin Capital Letter U with horn and dot below
, -
, U+1EF1
, align="center", ự
, Latin Small Letter U with horn and dot below
, -
, U+1EF2
, align="center", Ỳ
, Latin Capital Letter Y with grave
, 0668
, rowspan=2, in WGL4
, -
, U+1EF3
, align="center", ỳ
, Latin Small Letter Y with grave
, 0669
, -
, U+1EF4
, align="center", Ỵ
, Latin Capital Letter Y with dot below
, -
, U+1EF5
, align="center", ỵ
, Latin Small Letter Y with dot below
, -
, U+1EF6
, align="center", Ỷ
, Latin Capital Letter Y with hook above
, -
, U+1EF7
, align="center", ỷ
, Latin Small Letter Y with hook above
, -
, U+1EF8
, align="center", Ỹ
, Latin Capital Letter Y with tilde
, -
, U+1EF9
, align="center", ỹ
, Latin Small Letter Y with tilde
, -
, align="center" rowspan="6" , Medievalist, , U+1EFA
, align="center", Ỻ
, Latin Capital Letter Middle-Welsh LL
, -
, U+1EFB
, align="center", ỻ
, Latin Small Letter Middle-Welsh LL
, -
, U+1EFC
, align="center", Ỽ
, Latin Capital Letter Middle-Welsh V
, -
, U+1EFD
, align="center", ỽ
, Latin Small Letter Middle-Welsh V
, -
, U+1EFE
, align="center", Ỿ
, Latin Capital Letter Y with loop
, -
, U+1EFF
, align="center", ỿ
, Latin Small Letter Y with loop
, -
!
!Code
!Glyph
!Description
! #
! MES-2 Rationale
Additional Latin Extended
* Latin Extended-C (Unicode block)
* Latin Extended-D (Unicode block)
* Latin Extended-E (Unicode block)
* Latin Extended-F (Unicode block)
* Latin Extended-G (Unicode block)
Phonetic scripts
IPA Extensions
96 characters; all belong to the Latin script; three in the MES-2 subset.
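The character count quoted for a block follows from its code point range. A rough check, assuming Python 3 with the standard `unicodedata` module, and taking IPA Extensions as U+0250 to U+02AF and Latin Extended Additional as U+1E00 to U+1EFF (the helper `block_counts` is hypothetical, written only for this illustration):

```python
import unicodedata


def block_counts(first, last):
    """Return (range size, assigned characters) for a 'U+XXXX' range.

    The assigned count depends on the Unicode database bundled with
    the running Python, so it is an observation, not a definition.
    """
    lo, hi = int(first[2:], 16), int(last[2:], 16)
    size = hi - lo + 1
    # A code point counts as assigned here if it has a character name.
    assigned = sum(1 for cp in range(lo, hi + 1)
                   if unicodedata.name(chr(cp), ""))
    return size, assigned


print(block_counts("U+0250", "U+02AF"))  # range size is 96
print(block_counts("U+1E00", "U+1EFF"))  # range size is 256
```

Both blocks are fully assigned in recent Unicode versions, so the two numbers in each pair should agree; on an older database the second number could be smaller.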
{, class="wikitable sortable collapsible" id="Table_IPA_Extensions"
!Code
!Glyph
!Decimal
!Description
!#
!MES-2 Rationale
, -
, U+0250
,
, 592
, Latin Small Letter Turned A
, -
, U+0251
,
, 593
, Latin Small Letter Alpha
, -
, U+0252
,
, 594
, Latin Small Letter Turned Alpha
, -
, U+0253
,
, 595
, Latin Small Letter B with Hook
, -
, U+0254
,
, 596
, Latin Small Letter Open O
, -
, U+0255
,
, 597
, Latin Small Letter C with Curl
, -
, U+0256
,
, 598
, Latin Small Letter D with Tail
, -
, U+0257
,
, 599
, Latin Small Letter D with Hook
, -
, U+0258
,
, 600
, Latin Small Letter Reversed E
, -
, U+0259
,
, 601
, Latin Small Letter Schwa
, 0353
, for Azerbaijani
, -
, U+025A
,
, 602
, Latin Small Letter Schwa with Hook
, -
, U+025B
,
, 603
, Latin Small Letter Open E
, -
, U+025C
,
, 604
, Latin Small Letter Reversed Open E
, -
, U+025D
,
, 605
, Latin Small Letter Reversed Open E with Hook
, -
, U+025E
,
, 606
, Latin Small Letter Closed Reversed Open E
, -
, U+025F
,
, 607
, Latin Small Letter Dotless J with Stroke
, -
, U+0260
,
, 608
, Latin Small Letter G with Hook
, -
, U+0261
,
, 609
, Latin Small Letter Script G
, -
, U+0262
,
, 610
, Latin Letter Small Capital G
, -
, U+0263
,
, 611
, Latin Small Letter Gamma
, -
, U+0264
,
, 612
, Latin Small Letter Rams Horn
, -
, U+0265
,
, 613
, Latin Small Letter Turned H
, -
, U+0266
,
, 614
, Latin Small Letter H with Hook
, -
, U+0267
,
, 615
, Latin Small Letter Heng with Hook
, -
, U+0268
,
, 616
, Latin Small Letter I with Stroke
, -
, U+0269
,
, 617
, Latin Small Letter Iota
, -
, U+026A
,
, 618
, Latin Letter Small Capital I
, -
, U+026B
,
, 619
, Latin Small Letter L with Middle Tilde
, -
, U+026C
,
, 620
, Latin Small Letter L with Belt
, -
, U+026D
,
, 621
, Latin Small Letter L with Retroflex Hook
, -
, U+026E
,
, 622
, Latin Small Letter Lezh
, -
, U+026F
,
, 623
, Latin Small Letter Turned M
, -
, U+0270
,
, 624
, Latin Small Letter Turned M with Long Leg
, -
, U+0271
,
, 625
, Latin Small Letter M with Hook
, -
, U+0272
,
, 626
, Latin Small Letter N with Left Hook
, -
, U+0273
,
, 627
, Latin Small Letter N with Retroflex Hook
, -
, U+0274
,
, 628
, Latin Letter Small Capital N
, -
, U+0275
,
, 629
, Latin Small Letter Barred O
, -
, U+0276
,
, 630
, Latin Letter Small Capital OE
, -
, U+0277
,
, 631
, Latin Small Letter Closed Omega
, -
, U+0278
,
, 632
, Latin Small Letter Phi
, -
, U+0279
,
, 633
, Latin Small Letter Turned R
, -
, U+027A
,
, 634
, Latin Small Letter Turned R with Long Leg
, -
, U+027B
,
, 635
, Latin Small Letter Turned R with Hook
, -
, U+027C
,
, 636
, Latin Small Letter R with Long Leg
, 0354
, for Irish Gaelic
, -
, U+027D
,
, 637
, Latin Small Letter R with Tail
, -
, U+027E
,
, 638
, Latin Small Letter R with Fishhook
, -
, U+027F
,
, 639
, Latin Small Letter Reversed R with Fishhook
, -
, U+0280
,
, 640
, Latin Letter Small Capital R
, -
, U+0281
,
, 641
, Latin Letter Small Capital Inverted R
, -
, U+0282
,
, 642
, Latin Small Letter S with Hook
, -
, U+0283
,
, 643
, Latin Small Letter Esh
, -
, U+0284
,
, 644
, Latin Small Letter Dotless J with Stroke and Hook
, -
, U+0285
,
, 645
, Latin Small Letter Squat Reversed Esh
, -
, U+0286
,
, 646
, Latin Small Letter Esh with Curl
, -
, U+0287
,
, 647
, Latin Small Letter Turned T
, -
, U+0288
,
, 648
, Latin Small Letter T with Retroflex Hook
, -
, U+0289
,
, 649
, Latin Small Letter U Bar
, -
, U+028A
,
, 650
, Latin Small Letter Upsilon
, -
, U+028B
,
, 651
, Latin Small Letter V with Hook
, -
, U+028C
,
, 652
, Latin Small Letter Turned V
, -
, U+028D
,
, 653
, Latin Small Letter Turned W
, -
, U+028E
,
, 654
, Latin Small Letter Turned Y
, -
, U+028F
,
, 655
, Latin Letter Small Capital Y
, -
, U+0290
,
, 656
, Latin Small Letter Z with Retroflex Hook
, -
, U+0291
,
, 657
, Latin Small Letter Z with Curl
, -
, U+0292
,
, 658
, Latin Small Letter Ezh
, 0355
, for Sami
, -
, U+0293
,
, 659
, Latin Small Letter Ezh with Curl
, -
, U+0294
,
, 660
, Latin Letter Glottal Stop
, -
, U+0295
,
, 661
, Latin Letter Pharyngeal Voiced Fricative
, -
, U+0296
,
, 662
, Latin Letter Inverted Glottal Stop
, -
, U+0297
,
, 663
, Latin Letter Stretched C
, -
, U+0298
,
, 664
, Latin Letter Bilabial Click
, -
, U+0299
,
, 665
, Latin Letter Small Capital B
, -
, U+029A
,
, 666
, Latin Small Letter Closed Open E
, -
, U+029B
,
, 667
, Latin Letter Small Capital G with Hook
, -
, U+029C
,
, 668
, Latin Letter Small Capital H
, -
, U+029D
,
, 669
, Latin Small Letter J with Crossed Tail
, -
, U+029E
,
, 670
, Latin Small Letter Turned K
, -
, U+029F
,
, 671
, Latin Letter Small Capital L
, -
, U+02A0
,
, 672
, Latin Small Letter Q with Hook
, -
, U+02A1
,
, 673
, Latin Letter Glottal Stop with Stroke
, -
, U+02A2
,
, 674
, Latin Letter Reversed Glottal Stop with Stroke
, -
, U+02A3
,
, 675
, Latin Small Letter DZ Digraph
, -
, U+02A4
,
, 676
, Latin Small Letter Dezh Digraph
, -
, U+02A5
,
, 677
, Latin Small Letter DZ Digraph with Curl
, -
, U+02A6
,
, 678
, Latin Small Letter TS Digraph
, -
, U+02A7
,
, 679
, Latin Small Letter Tesh Digraph
, -
, U+02A8
,
, 680
, Latin Small Letter TC Digraph with Curl
, -
, U+02A9
,
, 681
, Latin Small Letter Feng Digraph
, -
, U+02AA
,
, 682
, Latin Small Letter LS Digraph
, -
, U+02AB
,
, 683
, Latin Small Letter LZ Digraph
, -
, U+02AC
,
, 684
, Latin Letter Bilabial Percussive
, -
, U+02AD
,
, 685
, Latin Letter Bidental Percussive
, -
, U+02AE
,
, 686
, Latin Small Letter Turned H with Fishhook
, -
, U+02AF
,
, 687
, Latin Small Letter Turned H with Fishhook and Tail
, - class="nosort"
!Code
!Glyph
!Decimal
!Description
!#
!MES-2 Rationale
Spacing modifier letters
80 characters; 15 in the MES-2 subset.
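The Code and Decimal columns in the table that follows give the same code point twice, once in hexadecimal and once in decimal. The following is a minimal Python sketch of that relationship and of the two numeric character reference forms usable in HTML and XML; the helper name `numeric_references` is hypothetical and chosen only for illustration.

```python
import html


def numeric_references(code):
    # Hypothetical helper: turn a "U+XXXX" label into its decimal value
    # and the decimal/hexadecimal numeric character references.
    if not code.upper().startswith("U+"):
        raise ValueError("expected a label of the form U+XXXX")
    value = int(code[2:], 16)
    return {
        "decimal": value,
        "char": chr(value),
        "dec_ref": "&#%d;" % value,
        "hex_ref": "&#x%X;" % value,
    }


info = numeric_references("U+02B0")
print(info["decimal"], info["dec_ref"], info["hex_ref"])

# Both reference forms decode to the same character.
assert html.unescape(info["dec_ref"]) == info["char"]
assert html.unescape(info["hex_ref"]) == info["char"]
```

Either reference form may be used interchangeably; the hexadecimal form matches the Code column more directly.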
{, class="wikitable sortable collapsible" id="Table_Spacing_Modifier_Letters"
!Code
!Glyph
!Decimal
!Description
!#
!MES-2 Rationale
, -
, U+02B0
, ʰ
, 688
, Modifier Letter Small H
, rowspan=11 colspan=2, ·
, -
, U+02B1
, ʱ
, 689
, Modifier Letter Small H with Hook
, -
, U+02B2
, ʲ
, 690
, Modifier Letter Small J
, -
, U+02B3
, ʳ
, 691
, Modifier Letter Small R
, -
, U+02B4
, ʴ
, 692
, Modifier Letter Small Turned R
, -
, U+02B5
, ʵ
, 693
, Modifier Letter Small Turned R with Hook
, -
, U+02B6
, ʶ
, 694
, Modifier Letter Small Capital Inverted R
, -
, U+02B7
, ʷ
, 695
, Modifier Letter Small W
, -
, U+02B8
, ʸ
, 696
, Modifier Letter Small Y
, -
, U+02B9
, ʹ
, 697
, Modifier Letter Prime
, -
, U+02BA
, ʺ
, 698
, Modifier Letter Double Prime
, -
, U+02BB
, ʻ
, 699
, Modifier Letter Turned Comma
, 0356
, in Sami
, -
, U+02BC
, ʼ
, 700
, Modifier Letter Apostrophe
, 0357
, rowspan=2, in ISO/IEC 8859-7
, -
, U+02BD
, ʽ
, 701
, Modifier Letter Reversed Comma
, 0358
, -
, U+02BE
, ʾ
, 702
, Modifier Letter Right Half Ring
, rowspan=8 colspan=2, ·
, -
, U+02BF
, ʿ
, 703
, Modifier Letter Left Half Ring
, -
, U+02C0
, ˀ
, 704
, Modifier Letter Glottal Stop
, -
, U+02C1
, ˁ
, 705
, Modifier Letter Reversed Glottal Stop
, -
, U+02C2
, ˂
, 706
, Modifier Letter Left Arrowhead
, -
, U+02C3
, ˃
, 707
, Modifier Letter Right Arrowhead
, -
, U+02C4
, ˄
, 708
, Modifier Letter Up Arrowhead
, -
, U+02C5
, ˅
, 709
, Modifier Letter Down Arrowhead
, -
, U+02C6
, ˆ
, 710
, Modifier Letter Circumflex Accent
, 0359
, rowspan=2, in WGL4
, -
, U+02C7
, ˇ
, 711
, Caron
, 0360
, -
, U+02C8
, ˈ
, 712
, Modifier Letter Vertical Line
, colspan=2, ·
, -
, U+02C9
, ˉ
, 713
, Modifier Letter Macron
, 0361
, in WGL4
, -
, U+02CA
, ˊ
, 714
, Modifier Letter Acute Accent
, rowspan=12 colspan=2, ·
, -
, U+02CB
, ˋ
, 715
, Modifier Letter Grave Accent
, -
, U+02CC
, ˌ
, 716
, Modifier Letter Low Vertical Line
, -
, U+02CD
, ˍ
, 717
, Modifier Letter Low Macron
, -
, U+02CE
, ˎ
, 718
, Modifier Letter Low Grave Accent
, -
, U+02CF
, ˏ
, 719
, Modifier Letter Low Acute Accent
, -
, U+02D0
, ː
, 720
, Modifier Letter Triangular Colon
, -
, U+02D1
, ˑ
, 721
, Modifier Letter Half Triangular Colon
, -
, U+02D2
, ˒
, 722
, Modifier Letter Centred Right Half Ring
, -
, U+02D3
, ˓
, 723
, Modifier Letter Centred Left Half Ring
, -
, U+02D4
, ˔
, 724
, Modifier Letter Up Tack
, -
, U+02D5
, ˕
, 725
, Modifier Letter Down Tack
, -
, U+02D6
, ˖
, 726
, Modifier Letter Plus Sign
, 0362
, in WGL4 (?)
, -
, U+02D7
, ˗
, 727
, Modifier Letter Minus Sign
, colspan=2, ·
, -
, U+02D8
, ˘
, 728
, Breve
, 0363
, rowspan=6, in WGL4
, -
, U+02D9
, ˙
, 729
, Dot Above
, 0364
, -
, U+02DA
, ˚
, 730
, Ring Above
, 0365
, -
, U+02DB
, ˛
, 731
, Ogonek
, 0366
, -
, U+02DC
, ˜
, 732
, Small Tilde
, 0367
, -
, U+02DD
, ˝
, 733
, Double Acute Accent
, 0368
, -
, U+02DE
, ˞
, 734
, Modifier Letter Rhotic Hook
, colspan=2, ·
, -
, U+02DF
, ˟
, 735
, Modifier Letter Cross Accent
, 0369
, for Swedish dictionary use
, -
, U+02E0
, ˠ
, 736
, Modifier Letter Small Gamma
, rowspan=14 colspan=2, ·
, -
, U+02E1
, ˡ
, 737
, Modifier Letter Small L
, -
, U+02E2
, ˢ
, 738
, Modifier Letter Small S
, -
, U+02E3
, ˣ
, 739
, Modifier Letter Small X
, -
, U+02E4
, ˤ
, 740
, Modifier Letter Small Reversed Glottal Stop
, -
, U+02E5
, ˥
, 741
, Modifier Letter Extra-High Tone Bar
, -
, U+02E6
, ˦
, 742
, Modifier Letter High Tone Bar
, -
, U+02E7
, ˧
, 743
, Modifier Letter Mid Tone Bar
, -
, U+02E8
, ˨
, 744
, Modifier Letter Low Tone Bar
, -
, U+02E9
, ˩
, 745
, Modifier Letter Extra-Low Tone Bar
, -
, U+02EA
, ˪
, 746
, Modifier Letter Yin Departing Tone Mark
, -
, U+02EB
, ˫
, 747
, Modifier Letter Yang Departing Tone Mark
, -
, U+02EC
, ˬ
, 748
, Modifier Letter Voicing
, -
, U+02ED
, ˭
, 749
, Modifier Letter Unaspirated
, -
, U+02EE
, ˮ
, 750
, Modifier Letter Double Apostrophe
, 0370
, for Nenets
, -
, U+02EF
, ˯
, 751
, Modifier Letter Low Down Arrowhead
, rowspan=17 colspan=2, ·
, -
, U+02F0
, ˰
, 752
, Modifier Letter Low Up Arrowhead
, -
, U+02F1
, ˱
, 753
, Modifier Letter Low Left Arrowhead
, -
, U+02F2
, ˲
, 754
, Modifier Letter Low Right Arrowhead
, -
, U+02F3
, ˳
, 755
, Modifier Letter Low Ring
, -
, U+02F4
, ˴
, 756
, Modifier Letter Middle Grave Accent
, -
, U+02F5
, ˵
, 757
, Modifier Letter Middle Double Grave Accent
, -
, U+02F6
, ˶
, 758
, Modifier Letter Middle Double Acute Accent
, -
, U+02F7
, ˷
, 759
, Modifier Letter Low Tilde
, -
, U+02F8
, ˸
, 760
, Modifier Letter Raised Colon
, -
, U+02F9
, ˹
, 761
, Modifier Letter Begin High Tone
, -
, U+02FA
, ˺
, 762
, Modifier Letter End High Tone
, -
, U+02FB
, ˻
, 763
, Modifier Letter Begin Low Tone
, -
, U+02FC
, ˼
, 764
, Modifier Letter End Low Tone
, -
, U+02FD
, ˽
, 765
, Modifier Letter Shelf
, -
, U+02FE
, ˾
, 766
, Modifier Letter Open Shelf
, -
, U+02FF
, ˿
, 767
, Modifier Letter Low Left Arrow
, - class="nosort"
!Code
!Glyph
!Decimal
!Description
!#
!MES-2 Rationale
Phonetic Extensions
* Phonetic Extensions (Unicode block)
* Phonetic Extensions Supplement (Unicode block)
Combining Marks
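The characters in this block attach to the base character that precedes them rather than standing alone. As a minimal sketch using Python's standard `unicodedata` module (the choice of `e` plus U+0301 is only an illustrative example), a base letter followed by a combining mark can be composed into a single precomposed code point and decomposed again:

```python
import unicodedata

base = "e"
acute = "\u0301"  # Combining Acute Accent, decimal 769

sequence = base + acute  # two code points, rendered as one accented letter
composed = unicodedata.normalize("NFC", sequence)    # precomposed form
decomposed = unicodedata.normalize("NFD", composed)  # back to base + mark

print(len(sequence), len(composed))
print(unicodedata.name(acute), unicodedata.combining(acute))

# A non-zero combining class marks the character as a combining mark.
assert unicodedata.combining(acute) != 0
assert decomposed == sequence
```

Not every base and mark pair has a precomposed form; where none exists, NFC normalization leaves the sequence as separate code points.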
{, class="wikitable sortable collapsible"
!Code
!Glyph
!Decimal
!Description
, -
, U+0300 , , ̀ , , 768 , , Combining Grave Accent
, -
, U+0301 , , ́ , , 769 , , Combining Acute Accent
, -
, U+0302 , , ̂ , , 770 , , Combining Circumflex Accent
, -
, U+0303 , , ̃ , , 771 , , Combining Tilde
, -
, U+0304 , , ̄ , , 772 , , Combining Macron
, -
, U+0305 , , ̅ , , 773 , , Combining Overline
, -
, U+0306 , , ̆ , , 774 , , Combining Breve
, -
, U+0307 , , ̇ , , 775 , , Combining Dot Above
, -
, U+0308 , , ̈ , , 776 , , Combining Diaeresis
, -
, U+0309 , , ̉ , , 777 , , Combining Hook Above
, -
, U+030A , , ̊ , , 778 , , Combining Ring Above
, -
, U+030B , , ̋ , , 779 , , Combining Double Acute Accent
, -
, U+030C , , ̌ , , 780 , , Combining Caron
, -
, U+030D , , ̍ , , 781 , , Combining Vertical Line Above
, -
, U+030E , , ̎ , , 782 , , Combining Double Vertical Line Above
, -
, U+030F , , ̏ , , 783 , , Combining Double Grave Accent
, -
, U+0310 , , ̐ , , 784 , , Combining Candrabindu
, -
, U+0311 , , ̑ , , 785 , , Combining Inverted Breve
, -
, U+0312 , , ̒ , , 786 , , Combining Turned Comma Above
, -
, U+0313 , , ̓ , , 787 , , Combining Comma Above
, -
, U+0314 , , ̔ , , 788 , , Combining Reversed Comma Above
, -
, U+0315 , , ̕ , , 789 , , Combining Comma Above Right
, -
, U+0316 , , ̖ , , 790 , , Combining Grave Accent Below
, -
, U+0317 , , ̗ , , 791 , , Combining Acute Accent Below
, -
, U+0318 , , ̘ , , 792 , , Combining Left Tack Below
, -
, U+0319 , , ̙ , , 793 , , Combining Right Tack Below
, -
, U+031A , , ̚ , , 794 , , Combining Left Angle Above
, -
, U+031B , , ̛ , , 795 , , Combining Horn
, -
, U+031C , , ̜ , , 796 , , Combining Left Half Ring Below
, -
, U+031D , , ̝ , , 797 , , Combining Up Tack Below
, -
, U+031E , , ̞ , , 798 , , Combining Down Tack Below
, -
, U+031F , , ̟ , , 799 , , Combining Plus Sign Below
, -
, U+0320 , , ̠ , , 800 , , Combining Minus Sign Below
, -
, U+0321 , , ̡ , , 801 , , Combining Palatalized Hook Below
, -
, U+0322 , , ̢ , , 802 , , Combining Retroflex Hook Below
, -
, U+0323 , , ̣ , , 803 , , Combining Dot Below
, -
, U+0324 , , ̤ , , 804 , , Combining Diaeresis Below
, -
, U+0325 , , ̥ , , 805 , , Combining Ring Below
, -
, U+0326 , , ̦ , , 806 , , Combining Comma Below
, -
, U+0327 , , ̧ , , 807 , , Combining Cedilla
, -
, U+0328 , , ̨ , , 808 , , Combining Ogonek
, -
, U+0329 , , ̩ , , 809 , , Combining Vertical Line Below
, -
, U+032A , , ̪ , , 810 , , Combining Bridge Below
, -
, U+032B , , ̫ , , 811 , , Combining Inverted Double Arch Below
, -
, U+032C , , ̬ , , 812 , , Combining Caron Below
, -
, U+032D , , ̭ , , 813 , , Combining Circumflex Accent Below
, -
, U+032E , , ̮ , , 814 , , Combining Breve Below
, -
, U+032F , , ̯ , , 815 , , Combining Inverted Breve Below
, -
, U+0330 , , ̰ , , 816 , , Combining Tilde Below
, -
, U+0331 , , ̱ , , 817 , , Combining Macron Below
, -
, U+0332 , , ̲ , , 818 , , Combining Low Line
, -
, U+0333 , , ̳ , , 819 , , Combining Double Low Line
, -
, U+0334 , , ̴ , , 820 , , Combining Tilde Overlay
, -
, U+0335 , , ̵ , , 821 , , Combining Short Stroke Overlay
, -
, U+0336 , , ̶ , , 822 , , Combining Long Stroke Overlay
, -
, U+0337 , , ̷ , , 823 , , Combining Short Solidus Overlay
, -
, U+0338 , , ̸ , , 824 , , Combining Long Solidus Overlay
, -
, U+0339 , , ̹ , , 825 , , Combining Right Half Ring Below
, -
, U+033A , , ̺ , , 826 , , Combining Inverted Bridge Below
, -
, U+033B , , ̻ , , 827 , , Combining Square Below
, -
, U+033C , , ̼ , , 828 , , Combining Seagull Below
, -
, U+033D , , ̽ , , 829 , , Combining X Above
, -
, U+033E , , ̾ , , 830 , , Combining Vertical Tilde
, -
, U+033F , , ̿ , , 831 , , Combining Double Overline
, -
, U+0340 , , ̀ , , 832 , , Combining Grave Tone Mark
, -
, U+0341 , , ́ , , 833 , , Combining Acute Tone Mark
, -
, U+0342 , , ͂ , , 834 , , Combining Greek Perispomeni
, -
, U+0343 , , ̓ , , 835 , , Combining Greek Koronis
, -
, U+0344 , , ̈́ , , 836 , , Combining Greek Dialytika Tonos
, -
, U+0345 , , ͅ , , 837 , , Combining Greek Ypogegrammeni
, -
, U+0346 , , ͆ , , 838 , , Combining Bridge Above
, -
, U+0347 , , ͇ , , 839 , , Combining Equals Sign Below
, -
, U+0348 , , ͈ , , 840 , , Combining Double Vertical Line Below
, -
, U+0349 , , ͉ , , 841 , , Combining Left Angle Below
, -
, U+034A , , ͊ , , 842 , , Combining Not Tilde Above
, -
, U+034B , , ͋ , , 843 , , Combining Homothetic Above
, -
, U+034C , , ͌ , , 844 , , Combining Almost Equal To Above
, -
, U+034D , , ͍ , , 845 , , Combining Left Right Arrow Below
, -
, U+034E , , ͎ , , 846 , , Combining Upwards Arrow Below
, -
, U+034F , , ͏ , , 847 , , Combining Grapheme Joiner
, -
, U+0350 , , ͐ , , 848 , , Combining Right Arrowhead Above
, -
, U+0351 , , ͑ , , 849 , , Combining Left Half Ring Above
, -
, U+0352 , , ͒ , , 850 , , Combining Fermata
, -
, U+0353 , , ͓ , , 851 , , Combining X Below
, -
, U+0354 , , ͔ , , 852 , , Combining Left Arrowhead Below
, -
, U+0355 , , ͕ , , 853 , , Combining Right Arrowhead Below
, -
, U+0356 , , ͖ , , 854 , , Combining Right Arrowhead And Up Arrowhead Below
, -
, U+0357 , , ͗ , , 855 , , Combining Right Half Ring Above
, -
, U+0358 , , ͘ , , 856 , , Combining Dot Above Right
, -
, U+0359 , , ͙ , , 857 , , Combining Asterisk Below
, -
, U+035A , , ͚ , , 858 , , Combining Double Ring Below
, -
, U+035B , , ͛ , , 859 , , Combining Zigzag Above
, -
, U+035C , , ͜ , , 860 , , Combining Double Breve Below
, -
, U+035D , , ͝ , , 861 , , Combining Double Breve
, -
, U+035E , , ͞ , , 862 , , Combining Double Macron
, -
, U+035F , , ͟ , , 863 , , Combining Double Macron Below
, -
, U+0360 , , ͠ , , 864 , , Combining Double Tilde
, -
, U+0361 , , ͡ , , 865 , , Combining Double Inverted Breve
, -
, U+0362 , , ͢ , , 866 , , Combining Double Rightwards Arrow Below
, -
, U+0363 , , ͣ , , 867 , , Combining Latin Small Letter A
, -
, U+0364 , , ͤ , , 868 , , Combining Latin Small Letter E
, -
, U+0365 , , ͥ , , 869 , , Combining Latin Small Letter I
, -
, U+0366 , , ͦ , , 870 , , Combining Latin Small Letter O
, -
, U+0367 , , ͧ , , 871 , , Combining Latin Small Letter U
, -
, U+0368 , , ͨ , , 872 , , Combining Latin Small Letter C
, -
, U+0369 , , ͩ , , 873 , , Combining Latin Small Letter D
, -
, U+036A , , ͪ , , 874 , , Combining Latin Small Letter H
, -
, U+036B , , ͫ , , 875 , , Combining Latin Small Letter M
, -
, U+036C , , ͬ , , 876 , , Combining Latin Small Letter R
, -
, U+036D , , ͭ , , 877 , , Combining Latin Small Letter T
, -
, U+036E , , ͮ , , 878 , , Combining Latin Small Letter V
, -
, U+036F , , ͯ , , 879 , , Combining Latin Small Letter X
, -
Greek and Coptic
144 code points; 135 assigned characters; 85 in the MES-2 subset.
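The difference between code points and assigned characters can be checked programmatically. The sketch below assumes only Python's standard `unicodedata` module and treats general category `Cn` as "unassigned"; the result depends on the Unicode version bundled with the interpreter, so older or newer interpreters may report a different count from the one stated above.

```python
import unicodedata

BLOCK = range(0x0370, 0x0400)  # Greek and Coptic: U+0370..U+03FF

# General category "Cn" means the code point has no assigned character.
assigned = [cp for cp in BLOCK if unicodedata.category(chr(cp)) != "Cn"]
unassigned = [cp for cp in BLOCK if unicodedata.category(chr(cp)) == "Cn"]

print(len(BLOCK), "code points,", len(assigned), "assigned")
print("unassigned:", ["U+%04X" % cp for cp in unassigned])
```

The unassigned positions are why the table below skips some codes, for example between U+0377 and U+037A.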
{, class="wikitable sortable collapsible" id="Table_Greek_and_Coptic"
!Code
!Glyph
!Decimal
!Description
!#
, -
, U+0370
, Ͱ
, 880
, Greek Capital Letter Heta
, rowspan=4, ·
, -
, U+0371
, ͱ
, 881
, Greek Small Letter Heta
, -
, U+0372
, Ͳ
, 882
, Greek Capital Letter Archaic Sampi
, -
, U+0373
, ͳ
, 883
, Greek Small Letter Archaic Sampi
, -
, U+0374
, ʹ
, 884
, Greek Numeral Sign
, 0371
, -
, U+0375
, ͵
, 885
, Greek Lower Numeral Sign
, 0372
, -
, U+0376
, Ͷ
, 886
, Greek Capital Letter Pamphylian Digamma
, rowspan=2, ·
, -
, U+0377
, ͷ
, 887
, Greek Small Letter Pamphylian Digamma
, -
, U+037A
, ͺ
, 890
, Greek Ypogegrammeni
, 0373
, -
, U+037B
, ͻ
, 891
, Greek Small Reversed Lunate Sigma Symbol
, rowspan=3, ·
, -
, U+037C
, ͼ
, 892
, Greek Small Dotted Lunate Sigma Symbol
, -
, U+037D
, ͽ
, 893
, Greek Small Reversed Dotted Lunate Sigma Symbol
, -
, U+037E
, ;
, 894
, Greek Question Mark
, 0374
, -
, U+037F
, Ϳ
, 895
, Greek Capital Letter Yot
, ·
, -
, U+0384
, ΄
, 900
, Greek acute accent (tonos)
, 0375
, -
, U+0385
, ΅
, 901
, Greek diaeresis with acute accent
, 0376
, -
, U+0386
, Ά
, 902
, Greek Capital Letter Alpha with acute accent
, 0377
, -
, U+0387
, ·
, 903
, Greek Ano Teleia
, 0378
, -
, U+0388
, Έ
, 904
, Greek Capital Letter Epsilon with acute accent
, 0379
, -
, U+0389
, Ή
, 905
, Greek Capital Letter Eta with acute accent
, 0380
, -
, U+038A
, Ί
, 906
, Greek Capital Letter Iota with acute accent
, 0381
, -
, U+038C
, Ό
, 908
, Greek Capital Letter Omicron with acute accent
, 0382
, -
, U+038E
, Ύ
, 910
, Greek Capital Letter Upsilon with acute accent
, 0383
, -
, U+038F
, Ώ
, 911
, Greek Capital Letter Omega with acute accent
, 0384
, -
, U+0390
, ΐ
, 912
, Greek Small Letter Iota with diaeresis and acute accent
, 0385
, -
, U+0391
, Α
, 913
, Greek Capital Letter Alpha
, 0386
, -
, U+0392
, Β
, 914
, Greek Capital Letter Beta
, 0387
, -
, U+0393
, Γ
, 915
, Greek Capital Letter Gamma
, 0388
, -
, U+0394
, Δ
, 916
, Greek Capital Letter Delta
, 0389
, -
, U+0395
, Ε
, 917
, Greek Capital Letter Epsilon
, 0390
, -
, U+0396
, Ζ
, 918
, Greek Capital Letter Zeta
, 0391
, -
, U+0397
, Η
, 919
, Greek Capital Letter Eta
, 0392
, -
, U+0398
, Θ
, 920
, Greek Capital Letter Theta
, 0393
, -
, U+0399
, Ι
, 921
, Greek Capital Letter Iota
, 0394
, -
, U+039A
, Κ
, 922
, Greek Capital Letter Kappa
, 0395
, -
, U+039B
, Λ
, 923
, Greek Capital Letter Lambda
, 0396
, -
, U+039C
, Μ
, 924
, Greek Capital Letter Mu
, 0397
, -
, U+039D
, Ν
, 925
, Greek Capital Letter Nu
, 0398
, -
, U+039E
, Ξ
, 926
, Greek Capital Letter Xi
, 0399
, -
, U+039F
, Ο
, 927
, Greek Capital Letter Omicron
, 0400
, -
, U+03A0
, Π
, 928
, Greek Capital Letter Pi
, 0401
, -
, U+03A1
, Ρ
, 929
, Greek Capital Letter Rho
, 0402
, -
, U+03A3
, Σ
, 931
, Greek Capital Letter Sigma
, 0403
, -
, U+03A4
, Τ
, 932
, Greek Capital Letter Tau
, 0404
, -
, U+03A5
, Υ
, 933
, Greek Capital Letter Upsilon
, 0405
, -
, U+03A6
, Φ
, 934
, Greek Capital Letter Phi
, 0406
, -
, U+03A7
, Χ
, 935
, Greek Capital Letter Chi
, 0407
, -
, U+03A8
, Ψ
, 936
, Greek Capital Letter Psi
, 0408
, -
, U+03A9
, Ω
, 937
, Greek Capital Letter Omega
, 0409
, -
, U+03AA
, Ϊ
, 938
, Greek Capital Letter Iota with diaeresis
, 0410
, -
, U+03AB
, Ϋ
, 939
, Greek Capital Letter Upsilon with diaeresis
, 0411
, -
, U+03AC
, ά
, 940
, Greek Small Letter Alpha with acute accent
, 0412
, -
, U+03AD
, έ
, 941
, Greek Small Letter Epsilon with acute accent
, 0413
, -
, U+03AE
, ή
, 942
, Greek Small Letter Eta with acute accent
, 0414
, -
, U+03AF
, ί
, 943
, Greek Small Letter Iota with acute accent
, 0415
, -
, U+03B0
, ΰ
, 944
, Greek Small Letter Upsilon with diaeresis and acute accent
, 0416
, -
, U+03B1
, α
, 945
, Greek Small Letter Alpha
, 0417
, -
, U+03B2
, β
, 946
, Greek Small Letter Beta
, 0418
, -
, U+03B3
, γ
, 947
, Greek Small Letter Gamma
, 0419
, -
, U+03B4
, δ
, 948
, Greek Small Letter Delta
, 0420
, -
, U+03B5
, ε
, 949
, Greek Small Letter Epsilon
, 0421
, -
, U+03B6
, ζ
, 950
, Greek Small Letter Zeta
, 0422
, -
, U+03B7
, η
, 951
, Greek Small Letter Eta
, 0423
, -
, U+03B8
, θ
, 952
, Greek Small Letter Theta
, 0424
, -
, U+03B9
, ι
, 953
, Greek Small Letter Iota
, 0425
, -
, U+03BA
, κ
, 954
, Greek Small Letter Kappa
, 0426
, -
, U+03BB
, λ
, 955
, Greek Small Letter Lambda
, 0427
, -
, U+03BC
, μ
, 956
, Greek Small Letter Mu
, 0428
, -
, U+03BD
, ν
, 957
, Greek Small Letter Nu
, 0429
, -
, U+03BE
, ξ
, 958
, Greek Small Letter Xi
, 0430
, -
, U+03BF
, ο
, 959
, Greek Small Letter Omicron
, 0431
, -
, U+03C0
, π
, 960
, Greek Small Letter Pi
, 0432
, -
, U+03C1
, ρ
, 961
, Greek Small Letter Rho
, 0433
, -
, U+03C2
, ς
, 962
, Greek Small Letter Final Sigma
, 0434
, -
, U+03C3
, σ
, 963
, Greek Small Letter Sigma
, 0435
, -
, U+03C4
, τ
, 964
, Greek Small Letter Tau
, 0436
, -
, U+03C5
, υ
, 965
, Greek Small Letter Upsilon
, 0437
, -
, U+03C6
, φ
, 966
, Greek Small Letter Phi
, 0438
, -
, U+03C7
, χ
, 967
, Greek Small Letter Chi
, 0439
, -
, U+03C8
, ψ
, 968
, Greek Small Letter Psi
, 0440
, -
, U+03C9
, ω
, 969
, Greek Small Letter Omega
, 0441
, -
, U+03CA
, ϊ
, 970
, Greek Small Letter Iota with diaeresis
, 0442
, -
, U+03CB
, ϋ
, 971
, Greek Small Letter Upsilon with diaeresis
, 0443
, -
, U+03CC
, ό
, 972
, Greek Small Letter Omicron with acute accent
, 0444
, -
, U+03CD
, ύ
, 973
, Greek Small Letter Upsilon with acute accent
, 0445
, -
, U+03CE
, ώ
, 974
, Greek Small Letter Omega with acute accent
, 0446
, -
, U+03CF
, Ϗ
, 975
, Greek Capital Kai Symbol
, ·
, -
, U+03D0
, ϐ
, 976
, Greek Beta Symbol
, rowspan=7, ·
, -
, U+03D1
, ϑ
, 977
, Greek Theta Symbol
, -
, U+03D2
, ϒ
, 978
, Greek Upsilon with hook Symbol
, -
, U+03D3
, ϓ
, 979
, Greek Upsilon with acute and hook Symbol
, -
, U+03D4
, ϔ
, 980
, Greek Upsilon with diaeresis and hook Symbol
, -
, U+03D5
, ϕ
, 981
, Greek Phi Symbol
, -
, U+03D6
, ϖ
, 982
, Greek Pi Symbol
, -
, U+03D7
, ϗ
, 983
, Greek Kai Symbol
, 0447
, -
, U+03D8
, Ϙ
, 984
, Greek Letter Qoppa
, rowspan=2, ·
, -
, U+03D9
, ϙ
, 985
, Greek Small Letter Qoppa
, -
, U+03DA
, Ϛ
, 986
, Greek Letter Stigma
, 0448
, -
, U+03DB
, ϛ
, 987
, Greek Small Letter Stigma
, 0449
, -
, U+03DC
, Ϝ
, 988
, Greek Letter Digamma
, 0450
, -
, U+03DD
, ϝ
, 989
, Greek Small Letter Digamma
, 0451
, -
, U+03DE
, Ϟ
, 990
, Greek Letter Koppa
, 0452
, -
, U+03DF
, ϟ
, 991
, Greek Small Letter Koppa
, 0453
, -
, U+03E0
, Ϡ
, 992
, Greek Letter Sampi
, 0454
, -
, U+03E1
, ϡ
, 993
, Greek Small Letter Sampi
, 0455
, -
, U+03E2
, Ϣ
, 994
, Coptic Capital Letter Shei
, rowspan=30, ·
, -
, U+03E3
, ϣ
, 995
, Coptic Small Letter Shei
, -
, U+03E4
, Ϥ
, 996
, Coptic Capital Letter Fei
, -
, U+03E5
, ϥ
, 997
, Coptic Small Letter Fei
, -
, U+03E6
, Ϧ
, 998
, Coptic Capital Letter Khei
, -
, U+03E7
, ϧ
, 999
, Coptic Small Letter Khei
, -
, U+03E8
, Ϩ
, 1000
, Coptic Capital Letter Hori
, -
, U+03E9
, ϩ
, 1001
, Coptic Small Letter Hori
, -
, U+03EA
, Ϫ
, 1002
, Coptic Capital Letter Gangia
, -
, U+03EB
, ϫ
, 1003
, Coptic Small Letter Gangia
, -
, U+03EC
, Ϭ
, 1004
, Coptic Capital Letter Shima
, -
, U+03ED
, ϭ
, 1005
, Coptic Small Letter Shima
, -
, U+03EE
, Ϯ
, 1006
, Coptic Capital Letter Dei
, -
, U+03EF
, ϯ
, 1007
, Coptic Small Letter Dei
, -
, U+03F0
, ϰ
, 1008
, Greek Kappa Symbol
, -
, U+03F1
, ϱ
, 1009
, Greek Rho Symbol
, -
, U+03F2
, ϲ
, 1010
, Greek Lunate Sigma Symbol
, -
, U+03F3
, ϳ
, 1011
, Greek Letter Yot
, -
, U+03F4
, ϴ
, 1012
, Greek Capital Theta Symbol
, -
, U+03F5
, ϵ
, 1013
, Greek Lunate Epsilon Symbol
, -
, U+03F6
, ϶
, 1014
, Greek Reversed Lunate Epsilon Symbol
, -
, U+03F7
, Ϸ
, 1015
, Greek Capital Letter Sho
, -
, U+03F8
, ϸ
, 1016
, Greek Small Letter Sho
, -
, U+03F9
, Ϲ
, 1017
, Greek Capital Lunate Sigma Symbol
, -
, U+03FA
, Ϻ
, 1018
, Greek Capital Letter San
, -
, U+03FB
, ϻ
, 1019
, Greek Small Letter San
, -
, U+03FC
, ϼ
, 1020
, Greek Rho with stroke Symbol
, -
, U+03FD
, Ͻ
, 1021
, Greek Capital Reversed Lunate Sigma Symbol
, -
, U+03FE
, Ͼ
, 1022
, Greek Capital Dotted Lunate Sigma Symbol
, -
, U+03FF
, Ͽ
, 1023
, Greek Capital Reversed Dotted Lunate Sigma Symbol
, - class="nosort"
!Code
!Glyph
!Decimal
!Description
!#
Greek Extended
For polytonic orthography. 256 code points; 233 assigned characters, all in the MES-2 subset (#670–902).
Cyrillic
256 characters; 191 in the MES-2 subset.
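In the first rows of the table below, each capital letter sits at a fixed distance from its small counterpart: U+0400 to U+040F pair with U+0450 to U+045F (offset 0x50), and U+0410 to U+042F pair with U+0430 to U+044F (offset 0x20). The following Python sketch illustrates that layout; the helper `cyrillic_lower_by_offset` is hypothetical and deliberately limited to those two ranges, since later letters in the block alternate capital and small forms instead.

```python
def cyrillic_lower_by_offset(ch):
    # Hypothetical helper based only on the layout of U+0400..U+042F.
    cp = ord(ch)
    if 0x0400 <= cp <= 0x040F:
        return chr(cp + 0x50)  # e.g. U+0401 Io -> U+0451
    if 0x0410 <= cp <= 0x042F:
        return chr(cp + 0x20)  # e.g. U+0410 A -> U+0430
    return ch                  # outside the two ranges: leave unchanged


# The offsets agree with Python's built-in case mapping for these ranges.
for cp in range(0x0400, 0x0430):
    assert cyrillic_lower_by_offset(chr(cp)) == chr(cp).lower()
```

For general text, `str.lower()` is the safer choice, because it covers the whole block rather than only the two ranges handled here.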
{, class="wikitable sortable collapsible" id="Table_Cyrillic_script_in_Unicode"
!Code
!Glyph
!Description
!#
, -
, U+0400
, Ѐ
, Cyrillic Capital Letter Ie with grave
, 0456
, -
, U+0401
, Ё
, Cyrillic Capital Letter Io
, 0457
, -
, U+0402
, Ђ
, Cyrillic Capital Letter Dje
, 0458
, -
, U+0403
, Ѓ
, Cyrillic Capital Letter Gje
, 0459
, -
, U+0404
, Є
, Cyrillic Capital Letter Ukrainian Ie
, 0460
, -
, U+0405
, Ѕ
, Cyrillic Capital Letter Dze
, 0461
, -
, U+0406
, І
, Cyrillic Capital Letter Byelorussian-Ukrainian I
, 0462
, -
, U+0407
, Ї
, Cyrillic Capital Letter Yi
, 0463
, -
, U+0408
, Ј
, Cyrillic Capital Letter Je
, 0464
, -
, U+0409
, Љ
, Cyrillic Capital Letter Lje
, 0465
, -
, U+040A
, Њ
, Cyrillic Capital Letter Nje
, 0466
, -
, U+040B
, Ћ
, Cyrillic Capital Letter Tshe
, 0467
, -
, U+040C
, Ќ
, Cyrillic Capital Letter Kje
, 0468
, -
, U+040D
, Ѝ
, Cyrillic Capital Letter I with grave
, 0469
, -
, U+040E
, Ў
, Cyrillic Capital Letter Short U
, 0470
, -
, U+040F
, Џ
, Cyrillic Capital Letter Dzhe
, 0471
, -
, U+0410
, А
, Cyrillic Capital Letter A
, 0472
, -
, U+0411
, Б
, Cyrillic Capital Letter Be
, 0473
, -
, U+0412
, В
, Cyrillic Capital Letter Ve
, 0474
, -
, U+0413
, Г
, Cyrillic Capital Letter Ghe
, 0475
, -
, U+0414
, Д
, Cyrillic Capital Letter De
, 0476
, -
, U+0415
, Е
, Cyrillic Capital Letter Ie
, 0477
, -
, U+0416
, Ж
, Cyrillic Capital Letter Zhe
, 0478
, -
, U+0417
, З
, Cyrillic Capital Letter Ze
, 0479
, -
, U+0418
, И
, Cyrillic Capital Letter I
, 0480
, -
, U+0419
, Й
, Cyrillic Capital Letter Short I
, 0481
, -
, U+041A
, К
, Cyrillic Capital Letter Ka
, 0482
, -
, U+041B
, Л
, Cyrillic Capital Letter El
, 0483
, -
, U+041C
, М
, Cyrillic Capital Letter Em
, 0484
, -
, U+041D
, Н
, Cyrillic Capital Letter En
, 0485
, -
, U+041E
, О
, Cyrillic Capital Letter O
, 0486
, -
, U+041F
, П
, Cyrillic Capital Letter Pe
, 0487
, -
, U+0420
, Р
, Cyrillic Capital Letter Er
, 0488
, -
, U+0421
, С
, Cyrillic Capital Letter Es
, 0489
, -
, U+0422
, Т
, Cyrillic Capital Letter Te
, 0490
, -
, U+0423
, У
, Cyrillic Capital Letter U
, 0491
, -
, U+0424
, Ф
, Cyrillic Capital Letter Ef
, 0492
, -
, U+0425
, Х
, Cyrillic Capital Letter Ha
, 0493
, -
, U+0426
, Ц
, Cyrillic Capital Letter Tse
, 0494
, -
, U+0427
, Ч
, Cyrillic Capital Letter Che
, 0495
, -
, U+0428
, Ш
, Cyrillic Capital Letter Sha
, 0496
, -
, U+0429
, Щ
, Cyrillic Capital Letter Shcha
, 0497
, -
, U+042A
, Ъ
, Cyrillic Capital Letter Hard Sign
, 0498
, -
, U+042B
, Ы
, Cyrillic Capital Letter Yeru
, 0499
, -
, U+042C
, Ь
, Cyrillic Capital Letter Soft Sign
, 0500
, -
, U+042D
, Э
, Cyrillic Capital Letter E
, 0501
, -
, U+042E
, Ю
, Cyrillic Capital Letter Yu
, 0502
, -
, U+042F
, Я
, Cyrillic Capital Letter Ya
, 0503
, -
, U+0430
, а
, Cyrillic Small Letter A
, 0504
, -
, U+0431
, б
, Cyrillic Small Letter Be
, 0505
, -
, U+0432
, в
, Cyrillic Small Letter Ve
, 0506
, -
, U+0433
, г
, Cyrillic Small Letter Ghe
, 0507
, -
, U+0434
, д
, Cyrillic Small Letter De
, 0508
, -
, U+0435
, е
, Cyrillic Small Letter Ie
, 0509
, -
, U+0436
, ж
, Cyrillic Small Letter Zhe
, 0510
, -
, U+0437
, з
, Cyrillic Small Letter Ze
, 0511
, -
, U+0438
, и
, Cyrillic Small Letter I
, 0512
, -
, U+0439
, й
, Cyrillic Small Letter Short I
, 0513
, -
, U+043A
, к
, Cyrillic Small Letter Ka
, 0514
, -
, U+043B
, л
, Cyrillic Small Letter El
, 0515
, -
, U+043C
, м
, Cyrillic Small Letter Em
, 0516
, -
, U+043D
, н
, Cyrillic Small Letter En
, 0517
, -
, U+043E
, о
, Cyrillic Small Letter O
, 0518
, -
, U+043F
, п
, Cyrillic Small Letter Pe
, 0519
, -
, U+0440
, р
, Cyrillic Small Letter Er
, 0520
, -
, U+0441
, с
, Cyrillic Small Letter Es
, 0521
, -
, U+0442
, т
, Cyrillic Small Letter Te
, 0522
, -
, U+0443
, у
, Cyrillic Small Letter U
, 0523
, -
, U+0444
, ф
, Cyrillic Small Letter Ef
, 0524
, -
, U+0445
, х
, Cyrillic Small Letter Ha
, 0525
, -
, U+0446
, ц
, Cyrillic Small Letter Tse
, 0526
, -
, U+0447
, ч
, Cyrillic Small Letter Che
, 0527
, -
, U+0448
, ш
, Cyrillic Small Letter Sha
, 0528
, -
, U+0449
, щ
, Cyrillic Small Letter Shcha
, 0529
, -
, U+044A
, ъ
, Cyrillic Small Letter Hard Sign
, 0530
, -
, U+044B
, ы
, Cyrillic Small Letter Yeru
, 0531
, -
, U+044C
, ь
, Cyrillic Small Letter Soft Sign
, 0532
, -
, U+044D
, э
, Cyrillic Small Letter E
, 0533
, -
, U+044E
, ю
, Cyrillic Small Letter Yu
, 0534
, -
, U+044F
, я
, Cyrillic Small Letter Ya
, 0535
, -
, U+0450
, ѐ
, Cyrillic Small Letter Ie with grave
, 0536
, -
, U+0451
, ё
, Cyrillic Small Letter Io
, 0537
, -
, U+0452
, ђ
, Cyrillic Small Letter Dje
, 0538
, -
, U+0453
, ѓ
, Cyrillic Small Letter Gje
, 0539
, -
, U+0454
, є
, Cyrillic Small Letter Ukrainian Ie
, 0540
, -
, U+0455
, ѕ
, Cyrillic Small Letter Dze
, 0541
, -
, U+0456
, і
, Cyrillic Small Letter Byelorussian-Ukrainian I
, 0542
, -
, U+0457
, ї
, Cyrillic Small Letter Yi
, 0543
, -
, U+0458
, ј
, Cyrillic Small Letter Je
, 0544
, -
, U+0459
, љ
, Cyrillic Small Letter Lje
, 0545
, -
, U+045A
, њ
, Cyrillic Small Letter Nje
, 0546
, -
, U+045B
, ћ
, Cyrillic Small Letter Tshe
, 0547
, -
, U+045C
, ќ
, Cyrillic Small Letter Kje
, 0548
, -
, U+045D
, ѝ
, Cyrillic Small Letter I with grave
, 0549
, -
, U+045E
, ў
, Cyrillic Small Letter Short U
, 0550
, -
, U+045F
, џ
, Cyrillic Small Letter Dzhe
, 0551
, -
, U+0460
, Ѡ
, Cyrillic Capital Letter Omega
, rowspan=48, ·
, -
, U+0461
, ѡ
, Cyrillic Small Letter Omega
, -
, U+0462
, Ѣ
, Cyrillic Capital Letter Yat
, -
, U+0463
, ѣ
, Cyrillic Small Letter Yat
, -
, U+0464
, Ѥ
, Cyrillic Capital Letter Iotified E
, -
, U+0465
, ѥ
, Cyrillic Small Letter Iotified E
, -
, U+0466
, Ѧ
, Cyrillic Capital Letter Little Yus
, -
, U+0467
, ѧ
, Cyrillic Small Letter Little Yus
, -
, U+0468
, Ѩ
, Cyrillic Capital Letter Iotified Little Yus
, -
, U+0469
, ѩ
, Cyrillic Small Letter Iotified Little Yus
, -
, U+046A
, Ѫ
, Cyrillic Capital Letter Big Yus
, -
, U+046B
, ѫ
, Cyrillic Small Letter Big Yus
, -
, U+046C
, Ѭ
, Cyrillic Capital Letter Iotified Big Yus
, -
, U+046D
, ѭ
, Cyrillic Small Letter Iotified Big Yus
, -
, U+046E
, Ѯ
, Cyrillic Capital Letter Ksi
, -
, U+046F
, ѯ
, Cyrillic Small Letter Ksi
, -
, U+0470
, Ѱ
, Cyrillic Capital Letter Psi
, -
, U+0471
, ѱ
, Cyrillic Small Letter Psi
, -
, U+0472
, Ѳ
, Cyrillic Capital Letter Fita
, -
, U+0473
, ѳ
, Cyrillic Small Letter Fita
, -
, U+0474
, Ѵ
, Cyrillic Capital Letter Izhitsa
, -
, U+0475
, ѵ
, Cyrillic Small Letter Izhitsa
, -
, U+0476
, Ѷ
, Cyrillic Capital Letter Izhitsa with double grave accent
, -
, U+0477
, ѷ
, Cyrillic Small Letter Izhitsa with double grave accent
, -
, U+0478
, Ѹ
, Cyrillic Capital Letter Uk
, -
, U+0479
, ѹ
, Cyrillic Small Letter Uk
, -
, U+047A
, Ѻ
, Cyrillic Capital Letter Round Omega
, -
, U+047B
, ѻ
, Cyrillic Small Letter Round Omega
, -
, U+047C
, Ѽ
, Cyrillic Capital Letter Omega with Titlo
, -
, U+047D
, ѽ
, Cyrillic Small Letter Omega with Titlo
, -
, U+047E
, Ѿ
, Cyrillic Capital Letter Ot
, -
, U+047F
, ѿ
, Cyrillic Small Letter Ot
, -
, U+0480
, Ҁ
, Cyrillic Capital Letter Koppa
, -
, U+0481
, ҁ
, Cyrillic Small Letter Koppa
, -
, U+0482
, ҂
, Cyrillic Thousands Sign
, -
, U+0483
, ҃
, Combining Cyrillic Titlo
, -
, U+0484
, ҄
, Combining Cyrillic Palatalization
, -
, U+0485
, ҅
, Combining Cyrillic Dasia Pneumata
, -
, U+0486
, ҆
, Combining Cyrillic Psili Pneumata
, -
, U+0487
, ҇
, Combining Cyrillic Pokrytie
, -
, U+0488
, ҈
, Combining Cyrillic Hundred Thousands Sign
, -
, U+0489
, ҉
, Combining Cyrillic Millions Sign
, -
, U+048A
, Ҋ
, Cyrillic Capital Letter Short I with tail
, -
, U+048B
, ҋ
, Cyrillic Small Letter Short I with tail
, -
, U+048C
, Ҍ
, Cyrillic Capital Letter Semisoft Sign
, -
, U+048D
, ҍ
, Cyrillic Small Letter Semisoft Sign
, -
, U+048E
, Ҏ
, Cyrillic Capital Letter Er with tick
, -
, U+048F
, ҏ
, Cyrillic Small Letter Er with tick
, -
, U+0490
, Ґ
, Cyrillic Capital Letter Ghe with upturn
, 0552
, -
, U+0491
, ґ
, Cyrillic Small Letter Ghe with upturn
, 0553
, -
, U+0492
, Ғ
, Cyrillic Capital Letter Ghe with stroke
, 0554
, -
, U+0493
, ғ
, Cyrillic Small Letter Ghe with stroke
, 0555
, -
, U+0494
, Ҕ
, Cyrillic Capital Letter Ghe with middle hook
, 0556
, -
, U+0495
, ҕ
, Cyrillic Small Letter Ghe with middle hook
, 0557
, -
, U+0496
, Җ
, Cyrillic Capital Letter Zhe with descender
, 0558
, -
, U+0497
, җ
, Cyrillic Small Letter Zhe with descender
, 0559
, -
, U+0498
, Ҙ
, Cyrillic Capital Letter Ze with descender
, 0560
, -
, U+0499
, ҙ
, Cyrillic Small Letter Ze with descender
, 0561
, -
, U+049A
, Қ
, Cyrillic Capital Letter Ka with descender
, 0562
, -
, U+049B
, қ
, Cyrillic Small Letter Ka with descender
, 0563
, -
, U+049C
, Ҝ
, Cyrillic Capital Letter Ka with vertical stroke
, 0564
, -
, U+049D
, ҝ
, Cyrillic Small Letter Ka with vertical stroke
, 0565
, -
, U+049E
, Ҟ
, Cyrillic Capital Letter Ka with stroke
, 0566
, -
, U+049F
, ҟ
, Cyrillic Small Letter Ka with stroke
, 0567
, -
, U+04A0
, Ҡ
, Cyrillic Capital Letter Bashkir Ka
, 0568
, -
, U+04A1
, ҡ
, Cyrillic Small Letter Bashkir Ka
, 0569
, -
, U+04A2
, Ң
, Cyrillic Capital Letter En with descender
, 0570
, -
, U+04A3
, ң
, Cyrillic Small Letter En with descender
, 0571
, -
, U+04A4
, Ҥ
, Cyrillic Capital Ligature En Ghe
, 0572
, -
, U+04A5
, ҥ
, Cyrillic Small Ligature En Ghe
, 0573
, -
, U+04A6
, Ҧ
, Cyrillic Capital Letter Pe with middle hook
, 0574
, -
, U+04A7
, ҧ
, Cyrillic Small Letter Pe with middle hook
, 0575
, -
, U+04A8
, Ҩ
, Cyrillic Capital Letter Abkhazian Ha
, 0576
, -
, U+04A9
, ҩ
, Cyrillic Small Letter Abkhazian Ha
, 0577
, -
, U+04AA
, Ҫ
, Cyrillic Capital Letter Es with descender
, 0578
, -
, U+04AB
, ҫ
, Cyrillic Small Letter Es with descender
, 0579
, -
, U+04AC
, Ҭ
, Cyrillic Capital Letter Te with descender
, 0580
, -
, U+04AD
, ҭ
, Cyrillic Small Letter Te with descender
, 0581
, -
, U+04AE
, Ү
, Cyrillic Capital Letter Straight U
, 0582
, -
, U+04AF
, ү
, Cyrillic Small Letter Straight U
, 0583
, -
, U+04B0
, Ұ
, Cyrillic Capital Letter Straight U with stroke
, 0584
, -
, U+04B1
, ұ
, Cyrillic Small Letter Straight U with stroke
, 0585
, -
, U+04B2
, Ҳ
, Cyrillic Capital Letter Ha with descender
, 0586
, -
, U+04B3
, ҳ
, Cyrillic Small Letter Ha with descender
, 0587
, -
, U+04B4
, Ҵ
, Cyrillic Capital Ligature Te Tse
, 0588
, -
, U+04B5
, ҵ
, Cyrillic Small Ligature Te Tse
, 0589
, -
, U+04B6
, Ҷ
, Cyrillic Capital Letter Che with descender
, 0590
, -
, U+04B7
, ҷ
, Cyrillic Small Letter Che with descender
, 0591
, -
, U+04B8
, Ҹ
, Cyrillic Capital Letter Che with vertical stroke
, 0592
, -
, U+04B9
, ҹ
, Cyrillic Small Letter Che with vertical stroke
, 0593
, -
, U+04BA
, Һ
, Cyrillic Capital Letter Shha
, 0594
, -
, U+04BB
, һ
, Cyrillic Small Letter Shha
, 0595
, -
, U+04BC
, Ҽ
, Cyrillic Capital Letter Abkhazian Che
, 0596
, -
, U+04BD
, ҽ
, Cyrillic Small Letter Abkhazian Che
, 0597
, -
, U+04BE
, Ҿ
, Cyrillic Capital Letter Abkhazian Che with descender
, 0598
, -
, U+04BF
, ҿ
, Cyrillic Small Letter Abkhazian Che with descender
, 0599
, -
, U+04C0
, Ӏ
, Cyrillic Letter Palochka
, 0600
, -
, U+04C1
, Ӂ
, Cyrillic Capital Letter Zhe with breve
, 0601
, -
, U+04C2
, ӂ
, Cyrillic Small Letter Zhe with breve
, 0602
, -
, U+04C3
, Ӄ
, Cyrillic Capital Letter Ka with hook
, 0603
, -
, U+04C4
, ӄ
, Cyrillic Small Letter Ka with hook
, 0604
, -
, U+04C5
, Ӆ
, Cyrillic Capital Letter El with tail
, rowspan=2, ·
, -
, U+04C6
, ӆ
, Cyrillic Small Letter El with tail
, -
, U+04C7
, Ӈ
, Cyrillic Capital Letter En with hook
, 0605
, -
, U+04C8
, ӈ
, Cyrillic Small Letter En with hook
, 0606
, -
, U+04C9
, Ӊ
, Cyrillic Capital Letter En with tail
, rowspan=2, ·
, -
, U+04CA
, ӊ
, Cyrillic Small Letter En with tail
, -
, U+04CB
, Ӌ
, Cyrillic Capital Letter Khakassian Che
, 0607
, -
, U+04CC
, ӌ
, Cyrillic Small Letter Khakassian Che
, 0608
, -
, U+04CD
, Ӎ
, Cyrillic Capital Letter Em with tail
, rowspan=3, ·
, -
, U+04CE
, ӎ
, Cyrillic Small Letter Em with tail
, -
, U+04CF
, ӏ
, Cyrillic Small Letter Palochka
, -
, U+04D0
, Ӑ
, Cyrillic Capital Letter A with breve
, 0609
, -
, U+04D1
, ӑ
, Cyrillic Small Letter A with breve
, 0610
, -
, U+04D2
, Ӓ
, Cyrillic Capital Letter A with diaeresis
, 0611
, -
, U+04D3
, ӓ
, Cyrillic Small Letter A with diaeresis
, 0612
, -
, U+04D4
, Ӕ
, Cyrillic Capital Ligature A Ie
, 0613
, -
, U+04D5
, ӕ
, Cyrillic Small Ligature A Ie
, 0614
, -
, U+04D6
, Ӗ
, Cyrillic Capital Letter Ie with breve
, 0615
, -
, U+04D7
, ӗ
, Cyrillic Small Letter Ie with breve
, 0616
, -
, U+04D8
, Ә
, Cyrillic Capital Letter Schwa
, 0617
, -
, U+04D9
, ә
, Cyrillic Small Letter Schwa
, 0618
, -
, U+04DA
, Ӛ
, Cyrillic Capital Letter Schwa with diaeresis
, 0619
, -
, U+04DB
, ӛ
, Cyrillic Small Letter Schwa with diaeresis
, 0620
, -
, U+04DC
, Ӝ
, Cyrillic Capital Letter Zhe with diaeresis
, 0621
, -
, U+04DD
, ӝ
, Cyrillic Small Letter Zhe with diaeresis
, 0622
, -
, U+04DE
, Ӟ
, Cyrillic Capital Letter Ze with diaeresis
, 0623
, -
, U+04DF
, ӟ
, Cyrillic Small Letter Ze with diaeresis
, 0624
, -
, U+04E0
, Ӡ
, Cyrillic Capital Letter Abkhazian Dze
, 0625
, -
, U+04E1
, ӡ
, Cyrillic Small Letter Abkhazian Dze
, 0626
, -
, U+04E2
, Ӣ
, Cyrillic Capital Letter I with macron
, 0627
, -
, U+04E3
, ӣ
, Cyrillic Small Letter I with macron
, 0628
, -
, U+04E4
, Ӥ
, Cyrillic Capital Letter I with diaeresis
, 0629
, -
, U+04E5
, ӥ
, Cyrillic Small Letter I with diaeresis
, 0630
, -
, U+04E6
, Ӧ
, Cyrillic Capital Letter O with diaeresis
, 0631
, -
, U+04E7
, ӧ
, Cyrillic Small Letter O with diaeresis
, 0632
, -
, U+04E8
, Ө
, Cyrillic Capital Letter Barred O
, 0633
, -
, U+04E9
, ө
, Cyrillic Small Letter Barred O
, 0634
, -
, U+04EA
, Ӫ
, Cyrillic Capital Letter Barred O with diaeresis
, 0635
, -
, U+04EB
, ӫ
, Cyrillic Small Letter Barred O with diaeresis
, 0636
, -
, U+04EC
, Ӭ
, Cyrillic Capital Letter E with diaeresis
, rowspan=2, ·
, -
, U+04ED
, ӭ
, Cyrillic Small Letter E with diaeresis
, -
, U+04EE
, Ӯ
, Cyrillic Capital Letter U with macron
, 0637
, -
, U+04EF
, ӯ
, Cyrillic Small Letter U with macron
, 0638
, -
, U+04F0
, Ӱ
, Cyrillic Capital Letter U with diaeresis
, 0639
, -
, U+04F1
, ӱ
, Cyrillic Small Letter U with diaeresis
, 0640
, -
, U+04F2
, Ӳ
, Cyrillic Capital Letter U with double acute
, 0641
, -
, U+04F3
, ӳ
, Cyrillic Small Letter U with double acute
, 0642
, -
, U+04F4
, Ӵ
, Cyrillic Capital Letter Che with diaeresis
, 0643
, -
, U+04F5
, ӵ
, Cyrillic Small Letter Che with diaeresis
, 0644
, -
, U+04F6
, Ӷ
, Cyrillic Capital Letter Ghe with descender
, rowspan=2, ·
, -
, U+04F7
, ӷ
, Cyrillic Small Letter Ghe with descender
, -
, U+04F8
, Ӹ
, Cyrillic Capital Letter Yeru with diaeresis
, 0645
, -
, U+04F9
, ӹ
, Cyrillic Small Letter Yeru with diaeresis
, 0646
, -
, U+04FA
, Ӻ
, Cyrillic Capital Letter Ghe with stroke and hook
, rowspan=6, ·
, -
, U+04FB
, ӻ
, Cyrillic Small Letter Ghe with stroke and hook
, -
, U+04FC
, Ӽ
, Cyrillic Capital Letter Ha with hook
, -
, U+04FD
, ӽ
, Cyrillic Small Letter Ha with hook
, -
, U+04FE
, Ӿ
, Cyrillic Capital Letter Ha with stroke
, -
, U+04FF
, ӿ
, Cyrillic Small Letter Ha with stroke
, - class="nosort"
!Code
!Glyph
!Description
!#
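Each row of the table above pairs a code point with its glyph and character name. As a minimal sketch, assuming Python's standard `unicodedata` and `html` modules, the same columns can be derived from the code point alone. The helper name `describe` is hypothetical, and the names come back in the uppercase form the Unicode Character Database uses rather than the title case of the table.

```python
import html
import unicodedata


def describe(code_point: int) -> tuple[str, str, str, str]:
    """Return (U+ notation, glyph, character name, HTML hex reference).

    `describe` is an illustrative helper, not part of any library.
    """
    glyph = chr(code_point)
    return (
        f"U+{code_point:04X}",                 # "Code" column
        glyph,                                 # "Glyph" column
        unicodedata.name(glyph, "<unnamed>"),  # "Description" column
        f"&#x{code_point:X};",                 # numeric character reference
    )


for cp in (0x041A, 0x0436, 0x0491):
    code, glyph, name, ref = describe(cp)
    # html.unescape resolves the numeric reference back to the character.
    assert html.unescape(ref) == glyph
    print(code, glyph, name, ref)
```

The hexadecimal reference form (`&#x41A;`) is used here because it matches the `U+` notation digit for digit; a decimal reference would work equally well in HTML and XML.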
Cyrillic supplements
*
Cyrillic Supplement (Unicode block)
*
Cyrillic Extended-A (Unicode block)
*
Cyrillic Extended-B (Unicode block)
*
Cyrillic Extended-C (Unicode block)
*
Cyrillic Extended-D (Unicode block)
Armenian
Semitic languages
Arabic
Hebrew
Syriac
Mandaic
*
Mandaic (Unicode block)
Samaritan
*
Samaritan (Unicode block)
Thaana
Brahmic (Indic) scripts
The range from U+0900 to U+0DFF includes Devanagari, Bengali script, Gurmukhi, Gujarati script, Odia alphabet, Tamil script, Telugu script, Kannada script, Malayalam script, and Sinhala script.
Devanagari
Bengali
Gurmukhi
Gujarati
Oriya
Tamil
Telugu
Kannada
Malayalam
Sinhala
Other Brahmic scripts
Other Brahmic and Indic scripts in Unicode include:
*
Ahom (Unicode block)
*
Balinese (Unicode block)
*
Batak (Unicode block)
*
Bhaiksuki (Unicode block)
*
Buhid (Unicode block)
*
Buginese (Unicode block)
*
Chakma (Unicode block)
*
Cham (Unicode block)
*
Common Indic Number Forms (Unicode block)
*
Dives Akuru (Unicode block)
*
Dogra (Unicode block)
*
Grantha (Unicode block)
*
Hanunoo (Unicode block)
*
Javanese (Unicode block)
*
Kaithi (Unicode block)
*
Kawi (Unicode block)
*
Khmer (Unicode block)
*
Khmer Symbols (Unicode block)
*
Khojki (Unicode block)
*
Khudawadi (Unicode block)
*
Lao (Unicode block)
*
Lepcha (Unicode block)
*
Limbu (Unicode block)
*
Mahajani (Unicode block)
*
Makasar (Unicode block)
*
Marchen (Unicode block)
*
Meetei Mayek (Unicode block)
*
Meetei Mayek Extensions (Unicode block)
*
Modi (Unicode block)
*
Multani (Unicode block)
*
Myanmar (Unicode block)
*
Myanmar Extended-A (Unicode block)
*
Myanmar Extended-B (Unicode block)
*
New Tai Lue (Unicode block)
*
Newa (Unicode block)
*
Phags-pa (Unicode block)
*
Rejang (Unicode block)
*
Saurashtra (Unicode block)
*
Sharada (Unicode block)
*
Siddham (Unicode block)
*
Sundanese (Unicode block)
*
Sundanese Supplement (Unicode block)
*
Syloti Nagri (Unicode block)
*
Tagalog (Unicode block)
*
Tagbanwa (Unicode block)
*
Tai Le (Unicode block)
*
Tai Tham (Unicode block)
*
Tai Viet (Unicode block)
*
Takri (Unicode block)
*
Thai (Unicode block)
*
Tibetan (Unicode block)
*
Tirhuta (Unicode block)
Other South and Central Asian writing systems
*
Gunjala Gondi (Unicode block)
*
Masaram Gondi (Unicode block)
*
Mro (Unicode block)
*
Nag Mundari (Unicode block)
*
Ol Chiki (Unicode block)
*
Sora Sompeng (Unicode block)
*
Tangsa (Unicode block)
*
Toto (Unicode block)
*
Warang Citi (Unicode block)
Southeast Asian writing systems
*
Hanifi Rohingya (Unicode block)
*
Kayah Li (Unicode block)
*
Pahawh Hmong (Unicode block)
*
Pau Cin Hau (Unicode block)
Georgian
African scripts
Ge'ez/Ethiopic script
Other African scripts
*
Adlam (Unicode block)
*
Bamum (Unicode block)
*
Bamum Supplement (Unicode block)
*
Bassa Vah (Unicode block)
*
Medefaidrin (Unicode block)
*
Mende Kikakui (Unicode block)
*
NKo (Unicode block)
*
Osmanya (Unicode block)
*
Ottoman Siyaq Numbers
Ottoman Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in Ottoman Turkish documents.
Block
History
The following Unicode-related documents record the ...
*
Tifinagh (Unicode block)
Tifinagh is a Unicode block containing characters of the Neo-Tifinagh alphabet, used for writing Northern Berber and Tuareg Berber in North Africa
*
Vai (Unicode block)
Vai is a Unicode block containing characters of the Vai syllabary used for writing the Vai language of Sierra Leone and Liberia
American scripts
Unified Canadian Aboriginal Syllabics
Other American scripts
*
Cherokee (Unicode block)
Cherokee is a Unicode block containing the syllabic characters for writing the Cherokee language.
When Cherokee was first added to Unicode in version 3.0 it was treated as a unicameral alphabet, but in version 8.0 it was redefined as a bicamera ...
*
Cherokee Supplement (Unicode block)
*
Kaktovik Numerals (Unicode block)
Kaktovik Numerals is a Unicode block for the Kaktovik numerals, a base-20 system of numerical digits created by Alaskan Iñupiat. It was first encoded in Unicode version 15.0 in 2022. It contains 20 characters for representing each of the digi ...
*
Osage (Unicode block)
Osage is a Unicode block containing characters from the Osage alphabet, which was devised in 2006 for writing the Osage language spoken by the Osage people of Oklahoma
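Because the Kaktovik numerals listed above form a positional base-20 system, writing an integer with them amounts to repeated division by 20. The following Python sketch illustrates this; it assumes that the 20 digits occupy consecutive code points starting at U+1D2C0 (an offset not stated in this list), and the helper name `to_kaktovik` is made up for illustration.

```python
KAKTOVIK_ZERO = 0x1D2C0  # assumed code point of the Kaktovik digit zero


def to_kaktovik(n: int) -> str:
    """Return n written with Kaktovik digits (base 20, most significant first)."""
    if n < 0:
        raise ValueError("only non-negative integers are supported")
    digits = []
    while True:
        # Peel off the least significant base-20 digit on each pass.
        n, d = divmod(n, 20)
        digits.append(chr(KAKTOVIK_ZERO + d))
        if n == 0:
            break
    return "".join(reversed(digits))
```

Whether the result displays correctly depends on a font that covers the Kaktovik Numerals block.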
Mongolian
Unicode symbols
{, class="wikitable sortable collapsible" id="Table_Unicode_symbols"
!Code
!Glyph
!Description
!#
, -
, U+2013
, –
, En dash
, 0903
, -
, U+2014
, —
, Em dash
, 0904
, -
, U+2015
, ―
, Horizontal bar
, 0905
, -
, U+2017
, ‗
, Double low line
, 0906
, -
, U+2018
, ‘
, Left single quotation mark
, 0907
, -
, U+2019
, ’
, Right single quotation mark
, 0908
, -
, U+201A
, ‚
, Single low-9 quotation mark
, 0909
, -
, U+201B
, ‛
, Single high-reversed-9 quotation mark
, 0910
, -
, U+201C
, “
, Left double quotation mark
, 0911
, -
, U+201D
, ”
, Right double quotation mark
, 0912
, -
, U+201E
, „
, Double low-9 quotation mark
, 0913
, -
, U+2020
, †
, Dagger
, 0914
, -
, U+2021
, ‡
, Double dagger
, 0915
, -
, U+2022
, •
, Bullet
, 0916
, -
, U+2026
, …
, Horizontal ellipsis
, 0917
, -
, U+2030
, ‰
, Per mille sign
, 0918
, -
, U+2032
, ′
, Prime
, 0919
, -
, U+2033
, ″
, Double prime
, 0920
, -
, U+2039
, ‹
, Single left-pointing angle quotation mark
, 0921
, -
, U+203A
, ›
, Single right-pointing angle quotation mark
, 0922
, -
, U+203C
, ‼
, Double exclamation mark
, 0923
, -
, U+203E
, ‾
, Overline
, 0924
, -
, U+2044
, ⁄
, Fraction slash
, 0925
, -
, U+204A
, ⁊
, Tironian et sign
, 0926
, - class="nosort"
!Code
!Glyph
!Description
!#
General Punctuation
112 code points; 111 assigned characters; 24 in the MES-2 subset.
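In HTML and XML, any of these characters can be written as a numeric character reference built from its code point. The Python sketch below shows the decimal and hexadecimal forms and checks the block size quoted above; it assumes the General Punctuation block spans U+2000 to U+206F, and the helper name `char_refs` is illustrative only.

```python
import html


def char_refs(code_point: int) -> tuple[str, str]:
    """Return the decimal and hexadecimal numeric character references."""
    return f"&#{code_point};", f"&#x{code_point:X};"


# Size of the General Punctuation block, assuming the range U+2000..U+206F.
BLOCK_SIZE = 0x206F - 0x2000 + 1

dec_ref, hex_ref = char_refs(0x2013)  # en dash
# html.unescape resolves both forms back to the same character.
assert html.unescape(dec_ref) == html.unescape(hex_ref) == "\u2013"
```

Both reference forms are equivalent; the hexadecimal one matches the U+ notation used in the table more directly.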
Superscripts and Subscripts
Currency Symbols
Letterlike Symbols
Number Forms
Arrows
*
Miscellaneous Symbols and Arrows (Unicode block)
Miscellaneous Symbols and Arrows is a Unicode block containing arrows and geometric shapes with various fills, astrological symbols, technical symbols, intonation marks, and others.
Block
Emoji
The Miscellaneous Symbols and Arrows block con ...
*
Supplemental Arrows-A (Unicode block)
Supplemental Arrows-A is a Unicode block containing various arrow symbols.
Block
History
The following Unicode-related documents record the purpose and process of defining specific characters in the Supplemental Arrows-A block:
See also ...
*
Supplemental Arrows-B (Unicode block)
Supplemental Arrows-B is a Unicode block containing miscellaneous arrows, arrow tails, crossing arrows used in knot descriptions, curved arrows, and harpoons.
Block
Emoji
The Supplemental Arrows-B block contains two emoji:
U+2934–U+2935.
...
*
Supplemental Arrows-C (Unicode block)
Supplemental Arrows-C is a Unicode block
Mathematical symbols
*
Supplemental Mathematical Operators (Unicode block)
Supplemental Mathematical Operators is a Unicode block containing various mathematical symbols, including N-ary operators, summations and integrals, intersections and unions, logical and relational operators, and subset/superset relations.
Block
...
*
Miscellaneous Mathematical Symbols-A (Unicode block)
Miscellaneous Mathematical Symbols-A is a Unicode block containing characters for mathematical, logical, and database notation.
Character table
Compact table
History
The following Unicode-related documents record the purpose and process o ...
*
Miscellaneous Mathematical Symbols-B (Unicode block)
Miscellaneous Mathematical Symbols-B is a Unicode block containing miscellaneous mathematical symbols, including brackets, angles, and circle symbols.
Block
Some of these symbols are used in Z notation. Specifically
*
*
*
*
*
*
The last two s ...
* Mathematical Alphanumeric Symbols:
Mathematical Alphanumeric Symbols (Unicode block)
Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles. The letters in various fonts ofte ...
Miscellaneous Technical
Control Pictures
Optical Character Recognition
Enclosed Alphanumerics
Box Drawing
Block Elements
{, class="wikitable sortable collapsible" id="Table_Block_Elements"
!Code
!Glyph
!Description
, -
, U+2580
, ▀
, Upper half block
, -
, U+2581
, ▁
, Lower one eighth block
, -
, U+2582
, ▂
, Lower one quarter block
, -
, U+2583
, ▃
, Lower three eighths block
, -
, U+2584
, ▄
, Lower half block
, -
, U+2585
, ▅
, Lower five eighths block
, -
, U+2586
, ▆
, Lower three quarters block
, -
, U+2587
, ▇
, Lower seven eighths block
, -
, U+2588
, █
, Full block
, -
, U+2589
, ▉
, Left seven eighths block
, -
, U+258A
, ▊
, Left three quarters block
, -
, U+258B
, ▋
, Left five eighths block
, -
, U+258C
, ▌
, Left half block
, -
, U+258D
, ▍
, Left three eighths block
, -
, U+258E
, ▎
, Left one quarter block
, -
, U+258F
, ▏
, Left one eighth block
, -
, U+2590
, ▐
, Right half block
, -
, U+2591
, ░
, Light shade
, -
, U+2592
, ▒
, Medium shade
, -
, U+2593
, ▓
, Dark shade
, -
, U+2594
, ▔
, Upper one eighth block
, -
, U+2595
, ▕
, Right one eighth block
, -
, U+2596
, ▖
, Quadrant lower left
, -
, U+2597
, ▗
, Quadrant lower right
, -
, U+2598
, ▘
, Quadrant upper left
, -
, U+2599
, ▙
, Quadrant upper left and lower left and lower right
, -
, U+259A
, ▚
, Quadrant upper left and lower right
, -
, U+259B
, ▛
, Quadrant upper left and upper right and lower left
, -
, U+259C
, ▜
, Quadrant upper left and upper right and lower right
, -
, U+259D
, ▝
, Quadrant upper right
, -
, U+259E
, ▞
, Quadrant upper right and lower left
, -
, U+259F
, ▟
, Quadrant upper right and lower left and lower right
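The eight lower-block characters U+2581 to U+2588 in this table grow in steps of one eighth, which makes them usable as bar heights for small plain-text charts. The Python sketch below is one possible way to do this, not a standard algorithm; the function name `sparkline` is hypothetical and the input is assumed to be a non-empty sequence of numbers.

```python
# The eight bar glyphs, from "lower one eighth block" to "full block".
BARS = [chr(cp) for cp in range(0x2581, 0x2589)]


def sparkline(values):
    """Map each value to the nearest of eight bar heights, scaled to the data range."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        # All values equal: draw the lowest bar for each of them.
        return BARS[0] * len(values)
    return "".join(BARS[round((v - lo) / span * 7)] for v in values)
```

For example, `sparkline([0, 1, 2, 3, 4, 5, 6, 7])` uses each of the eight glyphs once, in ascending order.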
Geometric Shapes
{, class="wikitable sortable collapsible" id="Table_Geometric_Shapes"
!Code
!Glyph
!Description
, -
, U+25A0
, ■
, Black square
, -
, U+25A1
, □
, White square
, -
, U+25A2
, ▢
, White square with rounded corners
, -
, U+25A3
, ▣
, White square containing small black square
, -
, U+25A4
, ▤
, Square with horizontal fill
, -
, U+25A5
, ▥
, Square with vertical fill
, -
, U+25A6
, ▦
, Square with orthogonal crosshatch fill
, -
, U+25A7
, ▧
, Square with upper left to lower right fill
, -
, U+25A8
, ▨
, Square with upper right to lower left fill
, -
, U+25A9
, ▩
, Square with diagonal crosshatch fill
, -
, U+25AA
, ▪
, Black small square
, -
, U+25AB
, ▫
, White small square
, -
, U+25AC
, ▬
, Black rectangle
, -
, U+25AD
, ▭
, White rectangle
, -
, U+25AE
, ▮
, Black vertical rectangle
, -
, U+25AF
, ▯
, White vertical rectangle
, -
, U+25B0
, ▰
, Black parallelogram
, -
, U+25B1
, ▱
, White parallelogram
, -
, U+25B2
, ▲
, Black up-pointing triangle
, -
, U+25B3
, △
, White up-pointing triangle
, -
, U+25B4
, ▴
, Black up-pointing small triangle
, -
, U+25B5
, ▵
, White up-pointing small triangle
, -
, U+25B6
, ▶
, Black right-pointing triangle
, -
, U+25B7
, ▷
, White right-pointing triangle
, -
, U+25B8
, ▸
, Black right-pointing small triangle
, -
, U+25B9
, ▹
, White right-pointing small triangle
, -
, U+25BA
, ►
, Black right-pointing pointer
, -
, U+25BB
, ▻
, White right-pointing pointer
, -
, U+25BC
, ▼
, Black down-pointing triangle
, -
, U+25BD
, ▽
, White down-pointing triangle
, -
, U+25BE
, ▾
, Black down-pointing small triangle
, -
, U+25BF
, ▿
, White down-pointing small triangle
, -
, U+25C0
, ◀
, Black left-pointing triangle
, -
, U+25C1
, ◁
, White left-pointing triangle
, -
, U+25C2
, ◂
, Black left-pointing small triangle
, -
, U+25C3
, ◃
, White left-pointing small triangle
, -
, U+25C4
, ◄
, Black left-pointing pointer
, -
, U+25C5
, ◅
, White left-pointing pointer
, -
, U+25C6
, ◆
, Black diamond
, -
, U+25C7
, ◇
, White diamond
, -
, U+25C8
, ◈
, White diamond containing small black diamond
, -
, U+25C9
, ◉
, Fisheye
, -
, U+25CA
, ◊
, Lozenge
, -
, U+25CB
, ○
, White circle
, -
, U+25CC
, ◌
, Dotted circle
, -
, U+25CD
, ◍
, Circle with vertical fill
, -
, U+25CE
, ◎
, Bullseye
, -
, U+25CF
, ●
, Black circle
, -
, U+25D0
, ◐
, Circle with left half black
, -
, U+25D1
, ◑
, Circle with right half black
, -
, U+25D2
, ◒
, Circle with lower half black
, -
, U+25D3
, ◓
, Circle with upper half black
, -
, U+25D4
, ◔
, Circle with upper right quadrant black
, -
, U+25D5
, ◕
, Circle with all but upper left quadrant black
, -
, U+25D6
, ◖
, Left half black circle
, -
, U+25D7
, ◗
, Right half black circle
, -
, U+25D8
, ◘
, Inverse bullet
, -
, U+25D9
, ◙
, Inverse white circle
, -
, U+25DA
, ◚
, Upper half inverse white circle
, -
, U+25DB
, ◛
, Lower half inverse white circle
, -
, U+25DC
, ◜
, Upper left quadrant circular arc
, -
, U+25DD
, ◝
, Upper right quadrant circular arc
, -
, U+25DE
, ◞
, Lower right quadrant circular arc
, -
, U+25DF
, ◟
, Lower left quadrant circular arc
, -
, U+25E0
, ◠
, Upper half circle
, -
, U+25E1
, ◡
, Lower half circle
, -
, U+25E2
, ◢
, Black lower right triangle
, -
, U+25E3
, ◣
, Black lower left triangle
, -
, U+25E4
, ◤
, Black upper left triangle
, -
, U+25E5
, ◥
, Black upper right triangle
, -
, U+25E6
, ◦
, White bullet
, -
, U+25E7
, ◧
, Square with left half black
, -
, U+25E8
, ◨
, Square with right half black
, -
, U+25E9
, ◩
, Square with upper left diagonal half black
, -
, U+25EA
, ◪
, Square with lower right diagonal half black
, -
, U+25EB
, ◫
, White square with vertical bisecting line
, -
, U+25EC
, ◬
, White up-pointing triangle with dot
, -
, U+25ED
, ◭
, Up-pointing triangle with left half black
, -
, U+25EE
, ◮
, Up-pointing triangle with right half black
, -
, U+25EF
, ◯
, Large circle
, -
, U+25F0
, ◰
, White square with upper left quadrant
, -
, U+25F1
, ◱
, White square with lower left quadrant
, -
, U+25F2
, ◲
, White square with lower right quadrant
, -
, U+25F3
, ◳
, White square with upper right quadrant
, -
, U+25F4
, ◴
, White circle with upper left quadrant
, -
, U+25F5
, ◵
, White circle with lower left quadrant
, -
, U+25F6
, ◶
, White circle with lower right quadrant
, -
, U+25F7
, ◷
, White circle with upper right quadrant
, -
, U+25F8
, ◸
, Upper left triangle
, -
, U+25F9
, ◹
, Upper right triangle
, -
, U+25FA
, ◺
, Lower left triangle
, -
, U+25FB
, ◻
, White medium square
, -
, U+25FC
, ◼
, Black medium square
, -
, U+25FD
, ◽
, White medium small square
, -
, U+25FE
, ◾
, Black medium small square
, -
, U+25FF
, ◿
, Lower right triangle
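The descriptions in this table largely mirror the formal Unicode character names, which Python exposes through the standard `unicodedata` module. The sketch below formats one table row from a code point; the function name `table_row` is illustrative, and the names returned depend on the Unicode data shipped with the interpreter.

```python
import unicodedata


def table_row(code_point: int) -> str:
    """Format 'U+XXXX glyph Description' using the formal character name."""
    ch = chr(code_point)
    # Formal names are upper case; capitalize() gives sentence case.
    name = unicodedata.name(ch).capitalize()
    return f"U+{code_point:04X} {ch} {name}"
```

Comparing such generated rows against a hand-written table is a quick way to spot transposed words in a description.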
Miscellaneous Symbols
Symbols for Legacy Computing
Dingbats
{, class="wikitable sortable"
, -
!Code
!Result
!Description
, -
, U+2700
, ✀
, Black safety scissors
, -
, U+2701
, ✁
, Upper blade scissors
, -
, U+2702
, ✂
, Black scissors
, -
, U+2703
, ✃
, Lower blade scissors
, -
, U+2704
, ✄
, White scissors
, -
, U+2705
, ✅
, White heavy check mark
, -
, U+2706
, ✆
, Telephone location sign
, -
, U+2707
, ✇
, Tape drive
, -
, U+2708
, ✈
, Airplane
, -
, U+2709
, ✉
, Envelope
, -
, U+270A
, ✊
, Raised fist
, -
, U+270B
, ✋
, Raised hand
, -
, U+270C
, ✌
, Victory hand
, -
, U+270D
, ✍
, Writing hand
, -
, U+270E
, ✎
, Lower right pencil
, -
, U+270F
, ✏
, Pencil
, -
, U+2710
, ✐
, Upper right pencil
, -
, U+2711
, ✑
, White nib
, -
, U+2712
, ✒
, Black nib
, -
, U+2713
, ✓
, Check mark
, -
, U+2714
, ✔
, Heavy check mark
, -
, U+2715
, ✕
, Multiplication X
, -
, U+2716
, ✖
, Heavy multiplication X
, -
, U+2717
, ✗
, Ballot X
, -
, U+2718
, ✘
, Heavy ballot X
, -
, U+2719
, ✙
, Outlined Greek cross
, -
, U+271A
, ✚
, Heavy Greek cross
, -
, U+271B
, ✛
, Open center cross
, -
, U+271C
, ✜
, Heavy open center cross
, -
, U+271D
, ✝
, Latin cross
, -
, U+271E
, ✞
, Shadowed white Latin cross
, -
, U+271F
, ✟
, Outlined Latin cross
, -
, U+2720
, ✠
, Maltese cross
, -
, U+2721
, ✡
, Star of David
, -
, U+2722
, ✢
, Four teardrop-spoked asterisk
, -
, U+2723
, ✣
, Four balloon-spoked asterisk
, -
, U+2724
, ✤
, Heavy four balloon-spoked asterisk
, -
, U+2725
, ✥
, Four club-spoked asterisk
, -
, U+2726
, ✦
, Black four-pointed star
, -
, U+2727
, ✧
, White four-pointed star
, -
, U+2728
, ✨
, Sparkles
, -
, U+2729
, ✩
, Stress outlined white star
, -
, U+272A
, ✪
, Circled white star
, -
, U+272B
, ✫
, Open center black star
, -
, U+272C
, ✬
, Black center white star
, -
, U+272D
, ✭
, Outlined black star
, -
, U+272E
, ✮
, Heavy outlined black star
, -
, U+272F
, ✯
, Pinwheel star
, -
, U+2730
, ✰
, Shadowed white star
, -
, U+2731
, ✱
, Heavy asterisk
, -
, U+2732
, ✲
, Open center asterisk
, -
, U+2733
, ✳
, Eight spoked asterisk
, -
, U+2734
, ✴
, Eight pointed black star
, -
, U+2735
, ✵
, Eight pointed pinwheel star
, -
, U+2736
, ✶
, Six pointed black star
, -
, U+2737
, ✷
, Eight pointed rectilinear black star
, -
, U+2738
, ✸
, Heavy eight pointed rectilinear black star
, -
, U+2739
, ✹
, Twelve pointed black star
, -
, U+273A
, ✺
, Sixteen pointed asterisk
, -
, U+273B
, ✻
, Teardrop spoked asterisk
, -
, U+273C
, ✼
, Open center teardrop spoked asterisk
, -
, U+273D
, ✽
, Heavy teardrop spoked asterisk
, -
, U+273E
, ✾
, Six petalled black and white florette
, -
, U+273F
, ✿
, Black florette
, -
, U+2740
, ❀
, White florette
, -
, U+2741
, ❁
, Eight petalled outlined black florette
, -
, U+2742
, ❂
, Circled open center eight pointed star
, -
, U+2743
, ❃
, Heavy teardrop spoked pinwheel asterisk
, -
, U+2744
, ❄
, Snowflake
, -
, U+2745
, ❅
, Tight trifoliate snowflake
, -
, U+2746
, ❆
, Heavy chevron snowflake
, -
, U+2747
, ❇
, Sparkle
, -
, U+2748
, ❈
, Heavy sparkle
, -
, U+2749
, ❉
, Balloon spoked asterisk
, -
, U+274A
, ❊
, Eight teardrop spoked propeller asterisk
, -
, U+274B
, ❋
, Heavy eight teardrop spoked propeller asterisk
, -
, U+274C
, ❌
, Cross mark
, -
, U+274D
, ❍
, Shadowed white circle
, -
, U+274E
, ❎
, Negative squared cross mark
, -
, U+274F
, ❏
, Lower right drop-shadowed white square
, -
, U+2750
, ❐
, Upper right drop-shadowed white square
, -
, U+2751
, ❑
, Lower right shadowed white square
, -
, U+2752
, ❒
, Upper right shadowed white square
, -
, U+2753
, ❓
, Black question mark ornament
, -
, U+2754
, ❔
, White question mark ornament
, -
, U+2755
, ❕
, White exclamation mark ornament
, -
, U+2756
, ❖
, Black diamond minus white X
, -
, U+2757
, ❗
, Heavy exclamation mark symbol
, -
, U+2758
, ❘
, Light vertical bar
, -
, U+2759
, ❙
, Medium vertical bar
, -
, U+275A
, ❚
, Heavy vertical bar
, -
, U+275B
, ❛
, Heavy single turned comma quotation mark ornament
, -
, U+275C
, ❜
, Heavy single comma quotation mark ornament
, -
, U+275D
, ❝
, Heavy double turned comma quotation mark ornament
, -
, U+275E
, ❞
, Heavy double comma quotation mark ornament
, -
, U+275F
, ❟
, Heavy low single comma quotation mark ornament
, -
, U+2760
, ❠
, Heavy low double comma quotation mark ornament
, -
, U+2761
, ❡
, Curved stem paragraph sign ornament
, -
, U+2762
, ❢
, Heavy exclamation mark ornament
, -
, U+2763
, ❣
, Heavy heart exclamation mark ornament
, -
, U+2764
, ❤
, Heavy black heart
, -
, U+2765
, ❥
, Rotated heavy black heart bullet
, -
, U+2766
, ❦
, Floral heart
, -
, U+2767
, ❧
, Rotated floral heart bullet
, -
, U+2768
, ❨
, Medium left parenthesis ornament
, -
, U+2769
, ❩
, Medium right parenthesis ornament
, -
, U+276A
, ❪
, Medium flattened left parenthesis ornament
, -
, U+276B
, ❫
, Medium flattened right parenthesis ornament
, -
, U+276C
, ❬
, Medium left-pointing angle bracket ornament
, -
, U+276D
, ❭
, Medium right-pointing angle bracket ornament
, -
, U+276E
, ❮
, Heavy left-pointing angle quotation mark ornament
, -
, U+276F
, ❯
, Heavy right-pointing angle quotation mark ornament
, -
, U+2770
, ❰
, Heavy left-pointing angle bracket ornament
, -
, U+2771
, ❱
, Heavy right-pointing angle bracket ornament
, -
, U+2772
, ❲
, Light left tortoise shell bracket ornament
, -
, U+2773
, ❳
, Light right tortoise shell bracket ornament
, -
, U+2774
, ❴
, Medium left curly bracket ornament
, -
, U+2775
, ❵
, Medium right curly bracket ornament
, -
, U+2776
, ❶
, Dingbat negative circled digit one
, -
, U+2777
, ❷
, Dingbat negative circled digit two
, -
, U+2778
, ❸
, Dingbat negative circled digit three
, -
, U+2779
, ❹
, Dingbat negative circled digit four
, -
, U+277A
, ❺
, Dingbat negative circled digit five
, -
, U+277B
, ❻
, Dingbat negative circled digit six
, -
, U+277C
, ❼
, Dingbat negative circled digit seven
, -
, U+277D
, ❽
, Dingbat negative circled digit eight
, -
, U+277E
, ❾
, Dingbat negative circled digit nine
, -
, U+277F
, ❿
, Dingbat negative circled digit ten
, -
, U+2780
, ➀
, Dingbat circled sans-serif digit one
, -
, U+2781
, ➁
, Dingbat circled sans-serif digit two
, -
, U+2782
, ➂
, Dingbat circled sans-serif digit three
, -
, U+2783
, ➃
, Dingbat circled sans-serif digit four
, -
, U+2784
, ➄
, Dingbat circled sans-serif digit five
, -
, U+2785
, ➅
, Dingbat circled sans-serif digit six
, -
, U+2786
, ➆
, Dingbat circled sans-serif digit seven
, -
, U+2787
, ➇
, Dingbat circled sans-serif digit eight
, -
, U+2788
, ➈
, Dingbat circled sans-serif digit nine
, -
, U+2789
, ➉
, Dingbat circled sans-serif digit ten
, -
, U+278A
, ➊
, Dingbat negative circled sans-serif digit one
, -
, U+278B
, ➋
, Dingbat negative circled sans-serif digit two
, -
, U+278C
, ➌
, Dingbat negative circled sans-serif digit three
, -
, U+278D
, ➍
, Dingbat negative circled sans-serif digit four
, -
, U+278E
, ➎
, Dingbat negative circled sans-serif digit five
, -
, U+278F
, ➏
, Dingbat negative circled sans-serif digit six
, -
, U+2790
, ➐
, Dingbat negative circled sans-serif digit seven
, -
, U+2791
, ➑
, Dingbat negative circled sans-serif digit eight
, -
, U+2792
, ➒
, Dingbat negative circled sans-serif digit nine
, -
, U+2793
, ➓
, Dingbat negative circled sans-serif digit ten
, -
, U+2794
, ➔
, Heavy wide-headed rightward arrow
, -
, U+2795
, ➕
, Heavy plus sign
, -
, U+2796
, ➖
, Heavy minus sign
, -
, U+2797
, ➗
, Heavy division sign
, -
, U+2798
, ➘
, Heavy south east arrow
, -
, U+2799
, ➙
, Heavy rightward arrow
, -
, U+279A
, ➚
, Heavy north east arrow
, -
, U+279B
, ➛
, Drafting point rightward arrow
, -
, U+279C
, ➜
, Heavy round-tipped rightward arrow
, -
, U+279D
, ➝
, Triangle-headed rightward arrow
, -
, U+279E
, ➞
, Heavy triangle-headed rightward arrow
, -
, U+279F
, ➟
, Dashed triangle-headed rightward arrow
, -
, U+27A0
, ➠
, Heavy dashed triangle-headed rightward arrow
, -
, U+27A1
, ➡
, Black rightward arrow
, -
, U+27A2
, ➢
, Three-D top-lighted rightward arrowhead
, -
, U+27A3
, ➣
, Three-D bottom-lighted rightward arrowhead
, -
, U+27A4
, ➤
, Black rightward arrowhead
, -
, U+27A5
, ➥
, Heavy black curved downward and rightward arrow
, -
, U+27A6
, ➦
, Heavy black curved upward and rightward arrow
, -
, U+27A7
, ➧
, Squat black rightward arrow
, -
, U+27A8
, ➨
, Heavy concave-pointed black rightward arrow
, -
, U+27A9
, ➩
, Right-shaded white rightward arrow
, -
, U+27AA
, ➪
, Left-shaded white rightward arrow
, -
, U+27AB
, ➫
, Back-tilted shadowed white rightward arrow
, -
, U+27AC
, ➬
, Front-tilted shadowed white rightward arrow
, -
, U+27AD
, ➭
, Heavy lower right-shadowed white rightward arrow
, -
, U+27AE
, ➮
, Heavy upper right-shadowed white rightward arrow
, -
, U+27AF
, ➯
, Notched lower right-shadowed white rightward arrow
, -
, U+27B0
, ➰
, Curly loop
, -
, U+27B1
, ➱
, Notched upper right-shadowed white rightward arrow
, -
, U+27B2
, ➲
, Circled heavy white rightward arrow
, -
, U+27B3
, ➳
, White-feathered rightward arrow
, -
, U+27B4
, ➴
, Black-feathered south east arrow
, -
, U+27B5
, ➵
, Black-feathered rightward arrow
, -
, U+27B6
, ➶
, Black-feathered north east arrow
, -
, U+27B7
, ➷
, Heavy black-feathered south east arrow
, -
, U+27B8
, ➸
, Heavy black-feathered rightward arrow
, -
, U+27B9
, ➹
, Heavy black-feathered north east arrow
, -
, U+27BA
, ➺
, Teardrop-barbed rightward arrow
, -
, U+27BB
, ➻
, Heavy teardrop-shanked rightward arrow
, -
, U+27BC
, ➼
, Wedge-tailed rightward arrow
, -
, U+27BD
, ➽
, Heavy wedge-tailed rightward arrow
, -
, U+27BE
, ➾
, Open-outlined rightward arrow
, -
, U+27BF
, ➿
, Double curly loop
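The three runs of circled digits in this table (U+2776 to U+277F, U+2780 to U+2789, and U+278A to U+2793) each encode the values one to ten in order, so a digit can be picked by adding an offset to the first code point of its run. A minimal Python sketch, with the function name `circled` and the style labels chosen only for illustration:

```python
# First code point of each run of circled digits in the Dingbats block.
STYLES = {
    "negative": 0x2776,             # negative circled digits
    "sans-serif": 0x2780,           # circled sans-serif digits
    "negative sans-serif": 0x278A,  # negative circled sans-serif digits
}


def circled(n: int, style: str = "negative") -> str:
    """Return the dingbat circled digit for n (1 to 10) in the given style."""
    if not 1 <= n <= 10:
        raise ValueError("dingbat circled digits cover 1 to 10 only")
    return chr(STYLES[style] + n - 1)
```

Values outside one to ten have no counterpart in this block, which is why the sketch rejects them.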
East Asian writing systems
CJK Symbols and Punctuation
Hiragana
Katakana
*
Kana Extended-A (Unicode block)
Kana Extended-A is a Unicode block containing hentaigana (non-standard hiragana) and historic kana characters. Additional hentaigana characters are encoded in the Kana Supplement block.
Block
History
The following Unicode-related documents re ...
*
Kana Extended-B (Unicode block)
*
Kana Supplement (Unicode block)
Kana Supplement is a Unicode block containing one archaic katakana character and 255 hentaigana (non-standard Hiragana) characters. Additional hentaigana characters are encoded in the Kana Extended-A block.
Block
History
The following Unicode-r ...
*
Katakana Phonetic Extensions (Unicode block)
*
Small Kana Extension (Unicode block)
Bopomofo
Hangul Jamo and Compatibility Jamo
Kanbun
Enclosed CJK Letters and Months
CJK Compatibility
CJK Compatibility Forms
CJK Unified Ideographs
*
CJK Unified Ideographs
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. In the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode ...
CJK Radicals
*
CJK Radicals Supplement (Unicode block)
CJK Radicals Supplement is a Unicode block containing alternative, often positional, forms of the Kangxi radicals. They are used as headers in dictionary indices and other CJK ideograph collections organized by radical-stroke.
Block
History
T ...
*
CJK Strokes (Unicode block)
CJK Strokes is a Unicode block
*
Kangxi Radicals (Unicode block)
Kangxi Radicals is a Unicode block. In version 3.0 (1999), this separate Kangxi Radicals block was introduced which encodes the 214 radicals in sequence, at U+2F00–2FD5. These are specific code points intended to represent the radical ''qu ...
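The range quoted for the Kangxi Radicals can be checked arithmetically, and the link between a radical code point and the corresponding ordinary ideograph can be observed through compatibility normalization. The Python sketch below uses the standard `unicodedata` module; the helper name `radical_to_ideograph` is hypothetical, and the result depends on the Unicode data shipped with the interpreter.

```python
import unicodedata

KANGXI_FIRST, KANGXI_LAST = 0x2F00, 0x2FD5
# Inclusive range, so add one to the difference.
radical_count = KANGXI_LAST - KANGXI_FIRST + 1


def radical_to_ideograph(radical: str) -> str:
    """Fold a Kangxi radical to its unified ideograph via NFKC normalization."""
    return unicodedata.normalize("NFKC", radical)
```

Because NFKC discards the radical-versus-ideograph distinction, this folding suits searching and comparison rather than round-trip storage.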
Other East Asian writing systems
*
Counting Rod Numerals (Unicode block)
Counting Rod Numerals is a Unicode block containing traditional Chinese counting rod symbols, which mathematicians used for calculation in ancient China, Japan, Korea, and Vietnam. The orientation of the Unicode characters follows Song dynasty co ...
*
Halfwidth and Fullwidth Forms (Unicode block)
Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the second-to-last block of t ...
*
Ideographic Description Characters (Unicode block)
Ideographic Description Characters is a Unicode block containing graphic characters used for describing CJK ideographs. They are used in Ideographic ...
*
Khitan Small Script (Unicode block)
Khitan Small Script is a Unicode block containing characters from the Khitan small script, which was used for writing the Khitan language spoken by the Khitan people in northern China during the Liao dynasty
*
Lisu (Unicode block)
Lisu is a Unicode block containing characters of the Fraser alphabet, which is used to write the Lisu language. This alphabet (and by extension the block) consists of glyphs resembling capital letters in the basic Latin alphabet in their standard ...
*
Lisu Supplement (Unicode block)
*
Miao (Unicode block)
Miao is a Unicode block containing characters of the Pollard script, used for writing the Hmong Daw and A-Hmao language
*
Modifier Tone Letters (Unicode block)
*
Nushu (Unicode block)
Nushu is a Unicode block containing characters from the Nüshu script, which is a syllabary derived from Chinese characters that was used exclusively among women in Jiangyong County in Hunan province of southern China.
An iteration mark for Nü ...
*
Nyiakeng Puachue Hmong (Unicode block)
Nyiakeng Puachue Hmong is a Unicode block
*
Small Form Variants (Unicode block)
*
Tai Xuan Jing Symbols (Unicode block)
The text ''Tài Xuán Jīng'' ("Canon of Supreme Mystery", ) is a guide for divination composed by the Confucian writer Yang Xiong (53 BCE – 18 CE). The first draft of this work was completed in 2 BCE (in the decade before the fall of the Wes ...
*
Tangut (Unicode block)
Tangut is a Unicode block containing characters from the Tangut script, which was used for writing the Tangut language spoken by the Tangut people in the Western Xia Empire, and in China during the Yuan dynasty and early Ming dynasty.
Tangut cha ...
*
Tangut Components (Unicode block)
Tangut Components is a Unicode block containing components and radicals used in the modern study of the Tangut script.
Block
History
The following Unicode-related documents record the purpose and process of defining specific characters in the ...
*
Tangut Supplement (Unicode block)
*
Vertical Forms (Unicode block)
*
Wancho (Unicode block)
Wancho is a Unicode block containing the characters of the script used to write the Wancho language
*
Yi Syllables (Unicode block)
Yi Syllables is a Unicode block
*
Yi Radicals (Unicode block)
Yi Radicals is a Unicode block
*
Yijing Hexagram Symbols (Unicode block)
Alphabetic Presentation Forms
Ancient and historic scripts
*
Aegean Numbers (Unicode block)
Aegean Numbers is a Unicode block containing punctuation, number, and unit characters for Linear A, Linear B, and the Cypriot syllabary, together known as Aegean numerals.
History
The following Unicode-related documents record the purpose and process ...
*
Anatolian Hieroglyphs (Unicode block)
Anatolian Hieroglyphs is a Unicode block containing Anatolian hieroglyphs
*
Ancient Greek Numbers (Unicode block)
Ancient Greek Numbers is a Unicode block containing acrophonic numerals used in ancient Greece, including ligatures
*
Ancient Symbols (Unicode block)
Ancient Symbols is a Unicode block containing Roman characters for currency, weights, and measures.
*
Avestan (Unicode block)
Avestan is a Unicode block containing characters devised for recording the Zoroastrian religious texts, Avesta, and was used to write the Middle Persian
Middle Persian or Pahlavi, also known by its endonym Pārsīk or Pārsīg in its later ...
*
Brahmi (Unicode block)
*
Carian (Unicode block)
Carian is a Unicode block
*
Caucasian Albanian (Unicode block)
Caucasian Albanian is a Unicode block containing characters used by the Caucasian Albanian peoples of Azerbaijan and Dagestan for writing Northeast Caucasian languages
The Northeast Caucasian languages, also called East Caucasian, Nakh-Dagh ...
*
Chorasmian (Unicode block)
Chorasmian is a Unicode block containing characters from the Chorasmian script, which was used for writing the Khwarezmian language in Transoxiana
Transoxiana or Transoxania (Land beyond the Oxus) is the Latin name for a region and civilizat ...
*
Cuneiform (Unicode block)
In Unicode, the Sumero-Akkadian Cuneiform script is covered in three blocks in the Supplementary Multilingual Plane (SMP):
* U+12000–U+123FF Cuneiform
* U+12400–U+1247F Cuneiform Numbers and Punctuation
* U+12480– ...
*
Cuneiform Numbers and Punctuation (Unicode block)
*
Cypriot Syllabary (Unicode block)
Cypriot Syllabary is the Unicode block encoding the Cypriot syllabary, a writing system for Greek used in Cyprus
Cyprus (Turkish: Kıbrıs), officially the Republic of Cyprus, is an island country located sou ...
*
Cypro-Minoan (Unicode block)
Cypro-Minoan is a Unicode block
*
Early Dynastic Cuneiform (Unicode block)
Early Dynastic Cuneiform is the name of a Unicode block of the Supplementary Multilingual Plane (SMP), at U+12480–U+1254F, introduced in version 8.0 (June 2015).
It is a supplement to the earlier encoding of the cuneiform script in the t ...
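The three cuneiform ranges quoted in the entries above can be turned into a small lookup. The following Python sketch is illustrative only: the table `CUNEIFORM_BLOCKS` and the function `cuneiform_block` are hypothetical names, and the bounds are copied from the entries above rather than read from the Unicode Character Database.

```python
from typing import Optional

# Inclusive code point ranges, as quoted in the block entries above.
CUNEIFORM_BLOCKS = [
    (0x12000, 0x123FF, "Cuneiform"),
    (0x12400, 0x1247F, "Cuneiform Numbers and Punctuation"),
    (0x12480, 0x1254F, "Early Dynastic Cuneiform"),
]


def cuneiform_block(ch: str) -> Optional[str]:
    """Return the name of the cuneiform block containing ch, or None."""
    cp = ord(ch)
    for start, end, name in CUNEIFORM_BLOCKS:
        if start <= cp <= end:
            return name
    return None


if __name__ == "__main__":
    for cp in (0x12000, 0x12400, 0x12480, 0x41):
        print(f"U+{cp:04X}", cuneiform_block(chr(cp)))
```

A linear scan is enough for three ranges; a table covering every Unicode block would more likely use a sorted list with binary search.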
*
Egyptian Hieroglyph Format Controls (Unicode block)
*
Egyptian Hieroglyphs (Unicode block)
Egyptian Hieroglyphs is a Unicode block containing the Gardiner's sign list of Egyptian hieroglyphs.
The Egyptian Hieroglyphs Unicode block has 94 standardized variants defined to specify rotated signs:
* Variation selector-1 (VS1) (U+ ...
*
Elbasan (Unicode block)
Elbasan is a Unicode block containing the historic Elbasan characters for writing the Albanian language
Albanian is an Indo-European language and an independent branch of that family of languages. It is spoken by the Al ...
*
Elymaic (Unicode block)
Elymaic is a Unicode block containing characters for the Elymaic alphabet, used in the ancient state of Elymais.
*
Glagolitic (Unicode block)
Glagolitic is a Unicode block
*
Glagolitic Supplement (Unicode block)
Glagolitic Supplement is a Unicode block containing supplementary characters used in the Glagolitic script
The Glagolitic script (''glagolitsa'') is the oldest known Slavic alphabet. It is generally agreed to have been created in the 9th ...
*
Gothic (Unicode block)
Gothic is a Unicode block containing characters for writing the East Germanic Gothic language
Gothic is an extinct East Germanic language that was spoken by the Goths. It is known primarily from the ''Codex Argenteus'', a 6th-century copy o ...
*
Hatran (Unicode block)
Hatran is a Unicode block containing characters used on inscriptions discovered at Hatra in Iraq, which are written in the Hatran alphabet and represent a form of the Aramaic language
The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, ...
*
Imperial Aramaic (Unicode block)
Imperial Aramaic is a Unicode block containing characters for writing Aramaic during the Assyrian and Achaemenid Persian Empire
The Achaemenid Empire or Achaemenian Empire (Old Persian: 𐎧𐏁𐏂), also called the First Persian Empire, ...
*
Indic Siyaq Numbers
Indic Siyaq Numbers is a Unicode block containing a specialized subset of the Arabic script that was used for accounting in India under the Mughals
The Mughal Empire was an early-modern empire that controlled much of South Asia between ...
*
Inscriptional Pahlavi (Unicode block)
Inscriptional Pahlavi is a Unicode block containing monumental inscription characters for writing Middle Persian
*
Inscriptional Parthian (Unicode block)
Inscriptional Parthian is a Unicode block
*
Kharoshthi (Unicode block)
*
Linear A (Unicode block)
Linear A is a Unicode block containing the characters of the ancient, undeciphered Linear A
Linear A is a writing system that was used by the Minoans of Crete from 1800 to 1450 BC to write the hypothesized Minoan language or languages ...
*
Linear B Ideograms (Unicode block)
Linear B Ideograms is a Unicode block containing ideographic characters for writing Mycenaean Greek
Mycenaean Greek is the most ancient attested form of the Greek language, on the Greek mainland and Crete in Mycenaean Greece (16th to 12th ...
*
Linear B Syllabary (Unicode block)
Linear B Syllabary is a Unicode block containing characters for the syllabic writing of Mycenaean Greek
*
Lycian (Unicode block)
Lycian is a Unicode block containing characters for writing the ancient Lycian language
The Lycian language was the language of the ancient Lycians who occupied the Anatolian region known during the Iron Age as Lyci ...
*
Lydian (Unicode block)
Lydian is a Unicode block containing characters for writing the Lydian language of ancient Anatolia
Anatolia (Turkish: Anadolu Yarımadası), or the Anatolian plateau, also known as Asia Minor, is a large peninsula in Western Asia and the wes ...
*
Manichaean (Unicode block)
Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars
Dialects of Pars ''(Persia)'' are a group of southwestern and northwestern Persian dialects spoken in the central Pars ...
*
Mayan Numerals (Unicode block)
Mayan Numerals is a Unicode block
*
Meroitic Cursive (Unicode block)
Meroitic Cursive is a Unicode block
*
Meroitic Hieroglyphs (Unicode block)
Meroitic Hieroglyphs is a Unicode block containing formal hieroglyphic characters for writing Meroitic Egyptian.
*
Nabataean (Unicode block)
Nabataean is a Unicode block containing characters for writing the ancient Nabataean language.
*
Nandinagari (Unicode block)
Nandinagari is a Unicode block containing characters for Nandinagari
Nandinagari is a Brahmic script derived from the Nāgarī script which appeared in the 7th century AD. ...
*
Ogham (Unicode block)
Ogham is a Unicode block containing characters for representing Primitive Irish language inscriptions as codified in the Ogham script.
*
Old Hungarian (Unicode block)
Old Hungarian is a Unicode block containing characters used for writing the Old Hungarian alphabet
*
Old Italic (Unicode block)
Old Italic is a Unicode block containing a unified repertoire of several Old Italic scripts used in various parts of Italy starting about 700 BCE, including the Etruscan alphabet and others that were derived from it (or cognate with it). All thos ...
*
Old North Arabian (Unicode block)
Old North Arabian is a Unicode block containing characters for writing the Ancient North Arabian
Ancient North Arabian (ANA) ...
*
Old Permic (Unicode block)
Old Permic is a Unicode block containing Old Permic characters for writing the Komi language
The Komi language (Komi: коми кыв, ''komi kyv''), also known as Zyryan, Zyrian or Komi-Zyryan (Komi: коми-зырян кыв, komi-zyrjan ky ...
*
Old Persian (Unicode block)
Old Persian is a Unicode block containing cuneiform characters for writing the Old Persian language of the Achaemenid Empire
*
Old Sogdian (Unicode block)
Old Sogdian is a Unicode block
*
Old South Arabian (Unicode block)
Old South Arabian is a Unicode block containing characters for writing the Minean, Sabaean, Qatabanian, Hadramite, and Himyaritic languages of Yemen
Yemen (Arabic: ٱلْيَمَن, al-Yaman), officially the Republic of Yemen, is a coun ...
*
Old Turkic (Unicode block)
In Unicode, the block Old Turkic is located from U+10C00 to U+10C4F. It is used to display the Old Turkic alphabet
The Old Turkic script (also known variously as the Göktürk script, Orkhon script, Orkhon-Yenisey script, or Turkic runes) was the ...
*
Old Uyghur (Unicode block)
Old Uyghur is a Unicode block containing characters of the Old Uyghur alphabet
The Old Uyghur alphabet was a Turkic script used for writing Old Uyghur, a variety of Old Turkic spoken in Turpan and Gansu that is the ancestor of the moder ...
*
Palmyrene (Unicode block)
Palmyrene is a Unicode block
*
Phaistos Disc (Unicode block)
Phaistos Disc is a Unicode block containing the characters found on the undeciphered Phaistos Disc
The Phaistos Disc (also spelled Phaistos Disk, Phaestos Disc) is a disk of fired clay from the Minoan palace of Phaistos on the island of Crete ...
*
Phoenician (Unicode block)
Phoenician is a Unicode block containing characters used across the Mediterranean world from the 12th century BCE to the 3rd century CE. The Phoenician alphabet was added to the Unicode Standard in July 2006 with the release of version 5.0. An alt ...
*
Psalter Pahlavi (Unicode block)
Psalter Pahlavi is a Unicode block containing characters for writing Middle Persian. The script derives its name from the "Pahlavi Psalter", a 6th- or 7th-century translation of a Syriac ...
*
Runic (Unicode block)
Runic is a Unicode block containing runic characters.
It was introduced in Unicode 3.0 (1999), with eight additional characters introduced in Unicode 7.0 (2014).
The original encoding of runes in UCS was based on the recommendations of the "ISO ...
*
Sogdian (Unicode block)
Sogdian is a Unicode block containing characters used to write the Sogdian language
The Sogdian language was an Eastern Iranian language spoken mainly in the Central Asian region of Sogdia (capital: Samarkand; other ...
*
Soyombo (Unicode block)
Soyombo is a Unicode block containing characters from the Soyombo alphabet, which is an abugida developed by the monk and scholar Zanabazar (1635–1723) in 1686 to write Mongolian. It can also be used to write Tibetan and Sanskrit
Sanskr ...
*
Ugaritic (Unicode block)
Ugaritic is a Unicode block
*
Vithkuqi (Unicode block)
Vithkuqi is a Unicode block
*
Yezidi (Unicode block)
*
Zanabazar Square (Unicode block)
Zanabazar Square is a Unicode block containing characters from the Zanabazar Square script (also known as "Horizontal square script"), which is an abugida developed by the monk and scholar Zanabazar (1635–1723) to write Mongolian, Tibetan an ...
Shavian
Notational systems
Braille
*
Braille Patterns (Unicode block)
The Unicode block Braille Patterns (U+2800..U+28FF) contains all 256 possible patterns of an 8-dot braille cell, thereby including the complete 6-dot cell range.
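The reason the block holds exactly 256 characters is that each of the 8 dots is either raised or not, giving 2^8 combinations. The sketch below illustrates this in Python. It relies on the block's layout, in which dot n corresponds to bit n-1 of the offset from U+2800; the function name `braille_cell` is a hypothetical helper, not part of any library.

```python
BRAILLE_BASE = 0x2800  # first code point of the Braille Patterns block


def braille_cell(dots):
    """Return the Braille Patterns character with the given dots (1-8) raised."""
    mask = 0
    for d in dots:
        if not 1 <= d <= 8:
            raise ValueError(f"dot number out of range: {d}")
        mask |= 1 << (d - 1)  # dot n sets bit n-1 of the offset
    return chr(BRAILLE_BASE + mask)


if __name__ == "__main__":
    print(hex(ord(braille_cell([]))))           # blank cell, start of the block
    print(hex(ord(braille_cell([1, 2]))))       # dots 1 and 2
    print(hex(ord(braille_cell(range(1, 9)))))  # all eight dots, end of the block
```

Because dots 7 and 8 occupy the two highest bits, every 6-dot pattern falls in the first 64 code points of the block, which is how the 8-dot range includes the complete 6-dot range.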
Music
* Western
Musical Symbols (Unicode block)
Musical Symbols is a Unicode block containing characters for representing modern musical notation. Fonts that support it include ''Bravura'', ''Euterpe'', ''FreeSerif'', ''Musica'' and ''Symbola''. The Standard Music Font Layout (SMuFL), which i ...
*
Byzantine Musical Symbols (Unicode block)
Byzantine Musical Symbols is a Unicode block containing characters for representing Byzantine-era musical notation.
*
Ancient Greek Musical Notation (Unicode block)
Ancient Greek Musical Notation is a Unicode block
*
Znamenny Musical Notation (Unicode block)
Znamenny Musical Notation is a Unicode block containing characters for Znamenny musical notation from Russia.
Few fonts support this block as of 2021. Ones that do and are free for personal use include '' Symbola'' 14.0 and Slavonic' 1.00 (non-c ...
Shorthand
*
Duployan (Unicode block)
Duployan is a Unicode block containing characters for various Duployan shorthands, including French Duployéan, Chinook Writing, Romanian shorthand, and the English Sloan-Duployan, Pernin, and Perrault shorthands. It is the first block of shorth ...
*
Shorthand Format Controls (Unicode block)
Shorthand Format Controls is a Unicode block containing four formatting characters for representing shorthands in Unicode.
Being invisible controls, they have no visible glyph but can have a representation.
Sutton SignWriting
* Sutton SignWriting:
Sutton SignWriting (Unicode block)
Emoji
*
Emoji in Unicode
Alchemical symbols
Game symbols
Mahjong Tiles
Domino Tiles
Playing Cards
Chess Symbols
Special areas and format characters
*
Private Use Areas
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane (), and one each in, and nearl ...
**
Private Use Area (Unicode block)
**
Supplementary Private Use Area-A (Unicode block)
**
Supplementary Private Use Area-B (Unicode block)
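Since private-use code points are never assigned characters by the Unicode Consortium, software often needs to detect them before trying to interpret text. A minimal Python sketch, assuming the standard library's `unicodedata` module, whose general category `Co` marks private-use code points; the function name `is_private_use` and the sample code points are illustrative choices.

```python
import unicodedata


def is_private_use(ch: str) -> bool:
    """True if ch has general category Co (private use)."""
    return unicodedata.category(ch) == "Co"


if __name__ == "__main__":
    # Sample code points: one from the Basic Multilingual Plane area and
    # one from each supplementary area, plus an ordinary letter.
    for cp in (0xE000, 0xF0000, 0x100000, 0x41):
        print(f"U+{cp:04X}", is_private_use(chr(cp)))
```

Checking the general category avoids hard-coding the three ranges, at the cost of depending on the Unicode data bundled with the Python build.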
*
Specials (Unicode block)
Specials is a short Unicode block of characters allocated at the very end of the Basic Multilingual Plane, at U+FFF0–FFFF. Of these 16 code points, five have been assigned since Unicode 3.0:
*, marks start of annotated text
*, marks start ...
*
Surrogates
**
Low Surrogates (Unicode block)
The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set (UCS, officia ...
**
High Surrogates (Unicode block)
**
High Private Use Surrogates (Unicode block)
*
Tags (Unicode block)
Tags is a Unicode block containing formatting tag characters. The block is designed to mirror ASCII. It was originally intended for language tags, but has now been repurposed as emoji modifiers, specifically for region flags.
Legacy use
U+E000 ...
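To make the "mirror ASCII" design concrete, the following Python sketch maps printable ASCII characters to tag characters by adding a fixed offset of 0xE0000, then builds a flag sequence from a black flag base, a tagged region code, and a closing cancel tag. The helper names `to_tag` and `subdivision_flag` are hypothetical, and the code "gbeng" (England) is used only as a sample input; whether the result renders as a flag depends on the font and platform.

```python
TAG_OFFSET = 0xE0000        # tag character = ASCII code + this offset
CANCEL_TAG = "\U000E007F"   # terminates a tag sequence
BLACK_FLAG = "\U0001F3F4"   # base character for tag-based flag sequences


def to_tag(text: str) -> str:
    """Map printable ASCII characters to their counterparts in the Tags block."""
    for c in text:
        if not 0x20 <= ord(c) <= 0x7E:
            raise ValueError(f"not printable ASCII: {c!r}")
    return "".join(chr(TAG_OFFSET + ord(c)) for c in text)


def subdivision_flag(code: str) -> str:
    """Build a flag sequence: base flag, tagged lowercase code, cancel tag."""
    return BLACK_FLAG + to_tag(code.lower()) + CANCEL_TAG


if __name__ == "__main__":
    seq = subdivision_flag("gbeng")
    print(" ".join(f"U+{ord(c):04X}" for c in seq))
```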
*
Variation Selectors
Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors. Each variation selector is used to specify a specific glyph variant for a preceding character. They are currently used to specify standardize ...
**
Variation Selectors (Unicode block)
**
Variation Selectors Supplement (Unicode block)
Variation Selectors Supplement is a Unicode block containing additional Variation Selectors beyond those found in the Variation Selectors
Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors. ...
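Because a variation selector applies to the character that precedes it, requesting a variant is a matter of appending the selector to the base character. The Python sketch below covers only the 16 selectors of the Variation Selectors block, which occupy U+FE00 to U+FE0F; the function names are hypothetical, and whether a given pair is a defined variation sequence is not checked here.

```python
VS_FIRST = 0xFE00  # VARIATION SELECTOR-1
VS_LAST = 0xFE0F   # VARIATION SELECTOR-16


def variation_selector(n: int) -> str:
    """Return variation selector n, for n from 1 to 16."""
    if not 1 <= n <= 16:
        raise ValueError("this sketch only covers selectors 1 to 16")
    return chr(VS_FIRST + n - 1)


def with_variant(base: str, n: int) -> str:
    """Append variation selector n to a single base character."""
    if len(base) != 1:
        raise ValueError("base must be a single character")
    return base + variation_selector(n)


def is_variation_selector(ch: str) -> bool:
    """True if ch lies in the Variation Selectors block."""
    return VS_FIRST <= ord(ch) <= VS_LAST


if __name__ == "__main__":
    # HEAVY BLACK HEART followed by VS16, commonly used to request emoji style.
    seq = with_variant("\u2764", 16)
    print([f"U+{ord(c):04X}" for c in seq])
```

The selectors in the Variation Selectors Supplement block are deliberately left out of this sketch.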
See also
*
Comparison of Unicode encodings
*
Open-source Unicode typefaces
There are Unicode typefaces which are open-source and designed to contain glyphs of all Unicode characters, or at least a broad selection of Unicode scripts. There are also numerous projects aimed at providing only a certain script, such as the A ...
*
GNU Unifont
GNU Unifont is a free Unicode bitmap font using an intermediate bitmapped font format created by Roman Czyborra. The main Unifont covers all of the Basic Multilingual Plane (BMP). The "upper" companion covers significant parts of the Supplementa ...
*
List of radicals in Unicode
The List of Unicode radicals comprises those Unicode characters that represent radical components of CJK characters, Tangut characters or Yi syllables. These are used primarily for indexing characters in dictionaries.
There are two CJK radical ...
*
List of Unicode fonts
*
List of typefaces
This is a list of typefaces, which are separated into groups by distinct artistic differences. The list includes typefaces that have articles or that are referenced. Superfamilies that fall under more than one category have an asterisk (*) after t ...
*
Typographic unit
Typographic units are the units of measurement used in typography or typesetting. Traditional typometry units are different from familiar metric units because they were established in the early days of printing. Though most printing is digital n ...
*
Unicode Consortium
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the intenti ...
*
Fallback font
A fallback font is a reserve typeface containing symbols for as many Unicode characters as possible. When a display system encounters a character that is not part of the repertoire of any of the other available fonts, a symbol from a fallback font ...
*
Unicode font
A Unicode font is a computer font that maps glyphs to code points defined in the Unicode Standard. The vast majority of modern computer fonts use Unicode mappings, even those fonts which only include glyphs for a single writing system, or even onl ...
*
Universal Character Set characters
The Unicode Consortium and the ISO/IEC JTC 1/SC 2/WG 2 jointly collaborate on the list of the characters in the Universal Coded Character Set. The Universal Coded Character Set, most commonly called the Universal Character Set (UCS, officia ...
References
Unicode 7.0 Character Code Charts Unicode, Inc.
CWA 13873:2000 – Multilingual European Subsets in ISO/IEC 10646-1 CEN Workshop Agreement 13873
Multilingual European Character Set 2 (MES-2) Rationale Markus Kuhn, 1998
External links
Official web site of the Unicode Consortium (English)