This is a list of Unicode characters which can be used to meet requirements for the use of special characters in user account passwords. Each system may have its own requirements and limitations. This article refers to the general class of characters that are easily accessible to most users.
Character reference overview
An HTML or XML ''numeric character reference'' refers to a character by its Universal Character Set/Unicode ''code point'', and uses the format
: &#''nnnn'';
or
: &#x''hhhh'';
where ''nnnn'' is the code point in decimal form, and ''hhhh'' is the code point in hexadecimal form. The ''x'' must be lowercase in XML documents. The ''nnnn'' or ''hhhh'' may be any number of digits and may include leading zeros. The ''hhhh'' may mix uppercase and lowercase, though uppercase is the usual style.
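As an illustration of the numeric character reference format described above, the following Python sketch builds and decodes such references. The function names and the regular expression are hypothetical helpers written for this example, not part of any standard API. The decoder accepts an uppercase ''X'' as HTML parsers do, so it is more lenient than XML.

```python
import re

def to_numeric_reference(char: str, hexadecimal: bool = False) -> str:
    """Return the numeric character reference for one character (hypothetical helper)."""
    code_point = ord(char)
    if hexadecimal:
        # Lowercase 'x' prefix (required by XML), uppercase hex digits (the usual style).
        return f"&#x{code_point:X};"
    return f"&#{code_point};"

# Hypothetical pattern: '&#' + decimal digits, or '&#x' + hex digits, then ';'.
# Any number of digits is allowed, including leading zeros.
_NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9A-Fa-f]+)|([0-9]+));")

def decode_numeric_references(text: str) -> str:
    """Replace every numeric character reference in text with its character."""
    def _replace(match: re.Match) -> str:
        hex_digits, dec_digits = match.groups()
        code_point = int(hex_digits, 16) if hex_digits else int(dec_digits)
        # chr() raises ValueError for values above U+10FFFF.
        return chr(code_point)
    return _NUMERIC_REF.sub(_replace, text)

print(to_numeric_reference("@"))                    # decimal form
print(to_numeric_reference("@", hexadecimal=True))  # hexadecimal form
print(decode_numeric_references("&#64; &#x40; &#x0040; &#0064;"))
```

All four references in the last line name the same code point, which shows that leading zeros and the choice of decimal or hexadecimal form do not change the character referred to.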
In contrast, a ''character entity reference'' refers to a character by the name of an ''entity'' which has the desired character as its ''replacement text''. The entity must either be predefined (built into the markup language) or explicitly declared in a Document Type Definition (DTD). The format is the same as for any entity reference:
: &''name'';
where ''name'' is the case-sensitive name of the entity. The semicolon is required.
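The entity reference format can be sketched in Python with the standard library's `html` module, which knows the entities predefined in HTML. This is only an illustration: `entity_reference` is a hypothetical helper, and `html.unescape` follows HTML parsing rules, so it is more tolerant than an XML parser would be.

```python
import html
from html.entities import name2codepoint

def entity_reference(name: str) -> str:
    """Build '&name;' for an entity predefined in HTML (hypothetical helper)."""
    if name not in name2codepoint:
        raise KeyError(f"not a predefined HTML entity name: {name!r}")
    return f"&{name};"

# Entity names are case-sensitive: 'alpha' and 'Alpha' are different entities.
for name in ("amp", "lt", "alpha", "Alpha"):
    reference = entity_reference(name)
    character = html.unescape(reference)
    print(f"{reference:10} U+{ord(character):04X} {character}")
```

Looking the name up in `name2codepoint` before building the reference mirrors the rule that an entity must be predefined or declared before it can be used.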
Table of Special Characters
These characters are part of the Basic Latin Unicode block.
There are 33 characters classified as ASCII Punctuation & Symbols, which are also sometimes referred to as ASCII special characters. The table has columns for many of the most common password system solutions.
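The 33 characters can be enumerated with a short Python sketch. It assumes that the count of 33 covers the space character (U+0020) together with the 32 printable ASCII punctuation marks and symbols. The code point ranges and the `has_special_character` helper are written for this example and do not describe the rules of any particular password system.

```python
import string

# Assumed code point ranges (inclusive) for ASCII punctuation and symbols
# in the Basic Latin block, including the space character.
ASCII_SPECIAL_RANGES = [(0x20, 0x2F), (0x3A, 0x40), (0x5B, 0x60), (0x7B, 0x7E)]

ascii_special = [
    chr(code_point)
    for start, end in ASCII_SPECIAL_RANGES
    for code_point in range(start, end + 1)
]

def has_special_character(password: str) -> bool:
    """Return True if the password contains at least one of the 33 characters."""
    return any(char in ascii_special for char in password)

# One row per character: the character, its code point, and both reference forms.
for char in ascii_special:
    code_point = ord(char)
    print(f"{char!r:6} U+{code_point:04X}  &#{code_point};  &#x{code_point:X};")

print(len(ascii_special))
```

Under this assumption the list matches Python's `string.punctuation` plus the space character. A real system may accept only a subset, so a check like `has_special_character` would need that system's own allowed set.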
See also
* Comparison of Unicode encodings
* Open-source Unicode typefaces
* GNU Unifont
* List of Unicode radicals
* List of Unicode fonts
* List of typefaces
* Typographic unit
* Unicode Consortium
* Unicode fallback font
* Unicode typeface
* Universal Character Set characters
External links
Official web site of the Unicode Consortium (English)
decodeunicode.org: Unicode wiki with images of all 98,884 graphical Unicode characters (German/English, full text search)
Letters with diacritical marks, grouped alphabetically, Pinyin.info
UTF-8 encoding table and Unicode characters. From The Text String.
Unicode character table