




Halfwidth And Fullwidth Forms (Unicode Block)
Halfwidth and Fullwidth Forms is a Unicode block (U+FF00–FFEF) provided so that older encodings containing both halfwidth and fullwidth characters can be translated to and from Unicode without loss. It is the second-to-last block of the Basic Multilingual Plane, followed only by the short Specials block at U+FFF0–FFFF. Its block name in Unicode 1.0 was Halfwidth and Fullwidth Variants. The range U+FF01–FF5E reproduces the ASCII characters 21 to 7E (hexadecimal) as fullwidth forms. U+FF00 does not correspond to a fullwidth ASCII 20 (space character), since that role is already fulfilled by U+3000 "ideographic space". The range U+FF61–FF9F encodes halfwidth forms of katakana and related punctuation in a transposition of A1 to DF in the JIS X 0201 encoding (see half-width kana). The range U+FFA0–FFDC encodes halfwidth forms of compatibility jamo characters for Hangul, in a transposition of their 1974 standard layout. It is used in the mapping of some IBM encodings for K ...
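Because the fullwidth range U+FF01–FF5E mirrors ASCII 21 to 7E, and the halfwidth katakana range U+FF61–FF9F mirrors JIS X 0201 bytes A1 to DF, both conversions reduce to a fixed offset. The following is a minimal sketch of that arithmetic; the function names are hypothetical, and mapping U+3000 to an ASCII space is an illustrative choice rather than something the block definition requires.

```python
# Offset between a fullwidth form and its ASCII counterpart (0xFF01 - 0x21).
FULLWIDTH_OFFSET = 0xFEE0


def fullwidth_to_ascii(text: str) -> str:
    """Replace fullwidth ASCII variants (U+FF01-FF5E) with plain ASCII."""
    out = []
    for ch in text:
        cp = ord(ch)
        if 0xFF01 <= cp <= 0xFF5E:
            out.append(chr(cp - FULLWIDTH_OFFSET))
        elif cp == 0x3000:
            # Assumption: treat the ideographic space as the fullwidth space.
            out.append(" ")
        else:
            out.append(ch)
    return "".join(out)


def halfwidth_kana_to_jisx0201(ch: str) -> int:
    """Return the JIS X 0201 byte (0xA1-0xDF) for a halfwidth katakana form."""
    cp = ord(ch)
    if not 0xFF61 <= cp <= 0xFF9F:
        raise ValueError("not a halfwidth katakana form")
    return cp - 0xFF61 + 0xA1
```

For general-purpose text processing, Unicode compatibility normalization (`unicodedata.normalize("NFKC", text)` in Python) also folds these width variants, along with many other compatibility characters.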



Hangul
The Korean alphabet is the modern writing system for the Korean language. In North Korea, the alphabet is known as Chosŏn'gŭl, and in South Korea, it is known as Hangul. The letters for the five basic consonants reflect the shape of the speech organs used to pronounce them. They are systematically modified to indicate phonetic features. The vowel letters are systematically modified for related sounds, making Hangul a featural writing system. It has been described as a syllabic alphabet, as it combines the features of alphabetic and syllabic writing systems. Hangul was created in 1443 by Sejong the Great, the fourth king of the Joseon dynasty. The alphabet was made as an attempt to increase literacy by serving as a complement to Hanja, which were Chinese characters used to write Literary Chinese in Korea by the 2nd century BCE, and had been adapted to write Korean by the 6th century CE. Modern Hangul orthography uses 24 basic letters: 14 consona ...



Shift Out And Shift In Characters
Shift Out (SO) and Shift In (SI) are ASCII control characters 14 and 15, respectively (0x0E and 0x0F). These are sometimes also called "Control-N" and "Control-O". The original purpose of these characters was to provide a way to shift a coloured ribbon, split longitudinally usually with red and black, up and down to the other colour in an electro-mechanical typewriter or teleprinter, such as the Teletype Model 38, to automate the same function of manual typewriters. Black was the conventional ambient default colour and so was shifted "in" or "out" with the other colour on the ribbon. Later advancements in technology instigated use of this function for switching to a different font or character set and back. This was used, for instance, in the Russian character set known as KOI7-switched, where SO starts printing Russian letters, and SI starts printing Latin letters again. Similarly, they are used for switching between Katakana and Roman letters in the 7-bit version of the Japa ...
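The "Control-N" and "Control-O" names follow from how control codes relate to letters: masking a letter's ASCII code to its low five bits yields the control code. The sketch below shows that relation, plus a toy splitter that tags runs of bytes by shift state. The function names are hypothetical, and starting in the shifted-in state is an assumption made for the example.

```python
SO = 0x0E  # Shift Out
SI = 0x0F  # Shift In


def control_code(letter: str) -> int:
    """Return the control code produced by Control plus the given letter."""
    return ord(letter.upper()) & 0x1F


def split_by_shift(data: bytes) -> list[tuple[str, bytes]]:
    """Split a byte stream into ("in" | "out", payload) runs at SO/SI bytes."""
    segments = []
    state = "in"  # assumption: the stream starts shifted in
    current = bytearray()
    for b in data:
        if b in (SO, SI):
            if current:
                segments.append((state, bytes(current)))
                current = bytearray()
            state = "out" if b == SO else "in"
        else:
            current.append(b)
    if current:
        segments.append((state, bytes(current)))
    return segments
```

A decoder for a switched 7-bit set would then interpret each "out" run with the alternate character table and each "in" run with the default one.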



Latin-script Unicode Blocks
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and subsequently their alphabet was altered by the Ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet. The Latin script is the basis of the International Phonetic Alphabet (IPA), and the 26 most widespread letters are the letters contained in the ISO basic Latin alphabet, which are the same letters as the English alphabet. Latin script is the basis for the largest number of alphabets of any writing system and is the most widely adopted writing system in the world. Latin script is used as the standard method of writing the languages of Western and Central Europe, most of sub-Saharan Africa, the America ...


Unicode Blocks
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole. Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, social forums, etc.
Design and implementation
Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in English, such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "suppleme ...
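The block-name comparison rule described above (ignore case, whitespace, hyphens, and underbars) can be sketched as a small key function. The names `block_name_key` and `same_block_name` are hypothetical, chosen only for this illustration.

```python
import re


def block_name_key(name: str) -> str:
    """Reduce a block name to a key for loose comparison."""
    # Drop whitespace, hyphens, and underbars, then fold case.
    return re.sub(r"[\s\-_]+", "", name).lower()


def same_block_name(a: str, b: str) -> bool:
    """Return True if two spellings refer to the same block name."""
    return block_name_key(a) == block_name_key(b)
```

Under this sketch, `same_block_name("Supplemental Arrows-A", "supplemental_arrows a")` returns `True`.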


Enclosed Alphanumerics
Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop. It is currently fully allocated. Within the Basic Multilingual Plane, a few additional enclosed numerals are in the Dingbats and the Enclosed CJK Letters and Months blocks. There is also a block with more of these characters in the Supplementary Multilingual Plane named Enclosed Alphanumeric Supplement (U+1F100–U+1F1FF), as of Unicode 6.0.
Purpose
Many of these characters were originally intended for use as bullets for lists (The Unicode Standard, 6.0.1). The parenthesized forms are historically based on typewriter approximations of the circled versions. Although these roles have been supplanted by styles and other markup in "rich text" contexts, the characters are included in the Unicode standard "for interoperability with the legacy East Asian character sets and for t ...


Latin Script In Unicode
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages (including click symbols in Latin Extended-B) and the Vietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista). Latin Extended-F and -G contain characters for phonetic transcription. Blocks As of version of the Unicode Standard, 1,487 characters in the following 19 blocks are classified as belonging to the Latin script. * Basic Latin, 0000–007F. This block correspond ...




Katakana (Unicode Block)
Katakana is a Unicode block containing katakana characters for the Japanese and Ainu languages.
See also
* Katakana Phonetic Extensions (Unicode block)
* Kana Extended-A (Unicode block)
* Kana Extended-B (Unicode block)
* Kana Supplement (Unicode block)
* Small Kana Extension (Unicode block)
* Hiragana (Unicode block)
* CJK Compatibility (Unicode block)
* Enclosed CJK Letters and Months (Unicode block)
* Halfwidth and Fullwidth Forms (Unicode block)



Hangul Jamo (Unicode Block)
Hangul Jamo is a Unicode block containing positional (choseong, jungseong, and jongseong) forms of the Hangul consonant and vowel clusters. While the Hangul Syllables Unicode block contains precomposed syllables used in standard modern Korean, the Hangul Jamo block can be used to compose arbitrary syllables dynamically, including those not included in the Hangul Syllables block.
See also
* Hangul Jamo Extended-A
* Hangul Jamo Extended-B
* CJK Symbols and Punctuation (Unicode block)
* Enclosed CJK Letters and Months (Unicode block)
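For the modern syllables that do have a precomposed form, the relation between positional jamo and the Hangul Syllables block is purely arithmetic. The sketch below uses the base values and counts from the Unicode Standard's Hangul syllable composition algorithm (they are not stated in the text above); `compose_syllable` is a hypothetical helper name.

```python
import unicodedata

# Constants from the Unicode Standard's Hangul composition algorithm.
S_BASE = 0xAC00  # first precomposed syllable
L_BASE = 0x1100  # first leading consonant (choseong)
V_BASE = 0x1161  # first vowel (jungseong)
T_BASE = 0x11A7  # one before the first trailing consonant (jongseong)
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28


def compose_syllable(lead: str, vowel: str, tail: str = "") -> str:
    """Compose modern positional jamo into one precomposed syllable."""
    l_index = ord(lead) - L_BASE
    v_index = ord(vowel) - V_BASE
    t_index = ord(tail) - T_BASE if tail else 0
    if not (0 <= l_index < L_COUNT and 0 <= v_index < V_COUNT):
        raise ValueError("lead or vowel is outside the modern jamo range")
    if tail and not (1 <= t_index < T_COUNT):
        raise ValueError("tail is outside the modern jamo range")
    return chr(S_BASE + (l_index * V_COUNT + v_index) * T_COUNT + t_index)
```

Python's `unicodedata.normalize("NFC", ...)` performs the same composition on a jamo sequence, so it can serve as a cross-check. Jamo sequences outside the modern ranges have no precomposed form and stay as separate jamo characters.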



CJK Symbols And Punctuation (Unicode Block)
CJK Symbols and Punctuation is a Unicode block containing symbols and punctuation used for writing the Chinese, Japanese and Korean languages. It also contains one Chinese character.
Block
The block has variation sequences defined for East Asian punctuation positional variants. They use U+FE00 (VS01) and U+FE01 (VS02).
Orientation
Quotation marks and other punctuation have expected differences in behaviour in vertical and horizontal text. The quotation marks 「...」, 『...』 and 〝...〟 rotate 90 degrees. See also General Punctuation, for variation selectors and CJK behaviour of the Latin quotation marks ‘...’ and “...”.
Chinese character
The CJK Symbols and Punctuation block contains one Chinese character: U+3007 〇. Although it is not covered under "Unified Ideographs", it is treated as a CJK character for all other intents and purposes.
Emoji
The CJK Symbols and Punctuation block contains two emoji: U+3030 and U+303D. The block has four standardized vari ...


ISO/IEC JTC 1/SC 2
ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that develops and facilitates standards within the field of coded character sets. The international secretariat of ISO/IEC JTC 1/SC 2 is the Japanese Industrial Standards Committee (JISC), located in Japan. SC 2 is responsible for the development of the Universal Coded Character Set standard (ISO/IEC 10646), which is the international standard corresponding to the Unicode Standard.
History
The subcommittee was established in 1987 under ISO/TC 97 as ISO/TC 97/SC 2, originally with the title "Character Sets and Information Coding", with the area of work being, "the standardization of bit and byte coded representation of information for interchange including among others, sets of graphic characters, of control functions, of picture elements and audi ...


International Committee For Information Technology Standards
The InterNational Committee for Information Technology Standards (INCITS, pronounced "insights") is an ANSI-accredited standards development organization composed of information technology developers. It was formerly known as X3 and NCITS. INCITS is the central U.S. forum dedicated to creating technology standards. INCITS is accredited by the American National Standards Institute (ANSI) and is affiliated with the Information Technology Industry Council, a global policy advocacy organization that represents U.S. and global innovation companies. INCITS coordinates technical standards activity between ANSI in the US and joint ISO/IEC committees worldwide. This provides a mechanism to create standards that will be implemen ...



Unicode
Unicode (also called The Unicode Standard, or TUS) is a character encoding standard maintained by the Unicode Consortium, designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 characters and 168 scripts used in various ordinary, literary, academic, and technical contexts. Unicode has largely supplanted the previous environment of a myriad of incompatible character sets used within different locales and on different computer architectures. The entire repertoire of these sets, plus many additional characters, was merged into the single Unicode set. Unicode is used to encode the vast majority of text on the Internet, including most web pages, and relevant Unicode support has become a common consideration in contemporary software development. Unicode is ultimately capable of encoding more than 1.1 million characters. The Unicode character repertoire is synchronized with Univers ...
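The "more than 1.1 million" figure follows from the size of the Unicode code space: 17 planes of 65,536 code points each, less the surrogate range reserved for UTF-16. A short worked calculation, with constant names chosen only for this illustration:

```python
PLANES = 17                      # plane 0 (the BMP) through plane 16
CODE_POINTS_PER_PLANE = 0x10000  # 65,536 code points per plane
SURROGATES = 0xDFFF - 0xD800 + 1 # 2,048 code points reserved for UTF-16

# Every code point from U+0000 through U+10FFFF.
TOTAL_CODE_POINTS = PLANES * CODE_POINTS_PER_PLANE

# Code points that can stand for a character on their own.
SCALAR_VALUES = TOTAL_CODE_POINTS - SURROGATES

print(TOTAL_CODE_POINTS)  # 1114112
print(SCALAR_VALUES)      # 1112064
```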