Windows-1254 is a
code page
In computing, a code page is a character encoding and as such it is a specific association of a set of printable characters and control characters with unique numbers. Typically each number represents the binary value in a single byte. (In some ...
used under
Microsoft Windows (and for the web), to write
Turkish that it was designed for (which is its dominant user, even though it can be used for some other languages too). Characters with codepoints A0 through FF are compatible with
ISO 8859-9, but the
CR range, which is reserved for
C1 control codes
The C0 and C1 control code or control character sets define control codes for use in text by computer systems that use ASCII and derivatives of ASCII. The codes represent additional information about the text, such as the position of a cursor, ...
in ISO 8859, is instead used for additional characters (analogous to the relationship between
ISO-8859-1 and
Windows-1252).
The
WHATWG Encoding Standard, which specifies the character encodings which are permitted in
HTML5 and which compliant browsers must support, includes Windows-1254, which is used for both the Windows-1254 and ISO-8859-9 labels.
Unicode is preferred for modern applications; authors of new pages and the designers of new protocols are instructed to use
UTF-8 instead.
, less than 0.05% of all web pages use Windows-1254, and less than 0.06% use ISO-8859-9, which the WHATWG also requires web browsers to handle as Windows-1254.
Since 1.9% of all websites located in Turkey use ISO-8859-9, plus the 1.3% that actually declare Windows-1254 used, in effect, 3.2% of websites there use Windows-1254.
IBM uses code page 1254 (
CCSID
A CCSID (coded character set identifier) is a 16-bit number that represents a particular encoding of a specific code page. For example, Unicode is a code page that has several encoding (so called "transformation") forms, like UTF-8, UTF-16 and UTF- ...
1254 and
euro sign extended CCSID 5350) for Windows-1254.
Character set
The following table shows Windows-1254. Each character is shown with its
Unicode equivalent.
See also
*
Latin script in Unicode
*
LMBCS-8
*
Unicode
*
Universal Character Set
**
European Unicode subset (DIN 91379)
*
UTF-8
*
Western Latin character sets (computing)
Several binary representations of 8-bit character sets for common Western European languages are compared in this article. These encodings were designed for representation of Italian, Spanish, Portuguese, French, German, Dutch, English, Dani ...
*
Windows-1250
Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use Latin script, such as Czech (which is its main user with half its use, though Czech has 96.6% use of UTF-8, and ...
*
Windows code pages
*
ISO/IEC JTC 1/SC 2
References
External links
Windows 1254 reference chartIANA Charset Name Registration of windows-1254
{{character encoding
Windows code pages