Kangxi Radicals (Unicode Block)
Kangxi Radicals is a Unicode block introduced in version 3.0 (1999). It encodes the 214 Kangxi radicals in sequence at U+2F00–2FD5. These code points are intended to represent the radical ''qua'' radical, as opposed to the character consisting of the unaugmented radical; thus, U+2F00 represents radical 1 while U+4E00 represents the character ''yī'' meaning "one". In addition, the CJK Radicals Supplement block (U+2E80–2EFF) was introduced, encoding alternative (often positional) forms taken by Kangxi radicals as they appear within specific characters. For example, ⺁ "CJK RADICAL CLIFF" (U+2E81) is a variant of ⼚ radical 27 (U+2F1A), itself identical in shape to the character consisting of unaugmented radical 27, 厂 "cliff" (U+5382). Unicode version 1.0.1 (1992) encoded 20,902 characters in the CJK Unified Ideographs block (U+4E00–9FFF). This standard followed the Kangxi order of radicals (radical 1 a ...
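The relationship between a radical number, its Kangxi Radicals code point, and the corresponding unified ideograph can be sketched in Python. This is a minimal illustration, not part of any standard API: the helper name `kangxi_radical` is hypothetical, and the sketch assumes that the Unicode Character Database gives each Kangxi radical a compatibility decomposition to its unified ideograph, so that NFKC normalization folds the radical into the ordinary character.

```python
import unicodedata

KANGXI_FIRST = 0x2F00   # radical 1, as stated in the text
KANGXI_COUNT = 214      # radicals are encoded in sequence


def kangxi_radical(number):
    """Return the Kangxi Radicals block character for radical `number` (1..214).

    Hypothetical helper for illustration only.
    """
    if not 1 <= number <= KANGXI_COUNT:
        raise ValueError("Kangxi radical numbers run from 1 to 214")
    return chr(KANGXI_FIRST + number - 1)


radical_27 = kangxi_radical(27)                          # U+2F1A
unified_27 = unicodedata.normalize("NFKC", radical_27)   # folds to U+5382

print(f"U+{ord(radical_27):04X}", unicodedata.name(radical_27))
print(f"U+{ord(unified_27):04X}", unicodedata.name(unified_27))
```

Because the radical and the ideograph are distinct code points, a plain string comparison treats them as different; text that should match regardless of which one was typed would need a normalization step like the one above.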


Chinese Characters
Chinese characters are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji''. Chinese characters in South Korea, which are known as ''hanja'', retain significant use in Korean academia to study its documents, history, literature and records. Vietnam once used the ''chữ Hán'' and developed chữ Nôm to write Vietnamese before turning to a romanized alphabet. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users. The total number of Chinese characters ever to appear in a dictionary is in the tens of thousands, though most are graph ...


CJK Unified Ideographs Extension B
CJK Unified Ideographs Extension B is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has dozens of variation sequences defined for standardized variants. It also has thousands of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. It was the only CJK Unified Ideographs Extension block with a UCS2003 source identifier. Since Extension B contained too many characters, the original code charts were produced with a single glyph for all regions. The glyphs were designed by Beijing Zhongyi Electronic Ltd. After the introduction of multi-col ...


ISO/IEC JTC 1/SC 2
ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that develops and facilitates standards within the field of coded character sets. The international secretariat of ISO/IEC JTC 1/SC 2 is the Japanese Industrial Standards Committee (JISC), located in Japan. SC 2 is responsible for the development of the Universal Coded Character Set (ISO/IEC 10646) which is the international standard corresponding to the Unicode Standard. History ISO/IEC JTC 1/SC 2 was established in 1987, originally with the title “Character Sets and Information Coding,” with the area of work being, “the standardization of bit and byte coded representation of information for interchange including among others, sets of graphic characters, of control functions, of picture elements and audio information coding of text for proc ...


International Committee For Information Technology Standards
The InterNational Committee for Information Technology Standards (INCITS, pronounced "insights") is an ANSI-accredited standards development organization composed of information technology developers. It was formerly known as X3 and NCITS. INCITS is the central U.S. forum dedicated to creating technology standards. INCITS is accredited by the American National Standards Institute (ANSI) and is affiliated with the Information Technology Industry Council, a global policy advocacy organization that represents U.S. and global innovation companies. INCITS coordinates technical standards activity between ANSI in the US and joint ISO/IEC committees worldwide. This provides a mechanism to create standards that will be implemented in many nations. As such, INCITS' Executive Board also serves as ANSI's Technical Advisory Group for ISO/IEC Joint Technical Committee 1. JTC 1 is responsible for international standardization in the field of information technology. INCITS operates th ...


JIS X 0208
JIS X 0208 is a 2-byte character set specified as a Japanese Industrial Standard, containing 6879 graphic characters suitable for writing text, place names, personal names, and so forth in the Japanese language. The official title of the current standard is . It was originally established as JIS C 6226 in 1978, and was revised in 1983, 1990, and 1997. It is also called Code page 952 by IBM. The 1978 version is also called Code page 955 by IBM. Scope of use and compatibility The character set JIS X 0208 establishes is primarily for the purpose of information interchange between data processing systems and the devices connected to them, or mutually between data communication systems. This character set can be used for data processing and text processing. Partial implementations of the character set are not considered compatible. Because there are places where such things have happened as the original drafting committee of the first standard taking care to separate c ...




Unicode Collation
The Unicode collation algorithm (UCA) is an algorithm defined in Unicode Technical Report #10. It is a customizable method for producing binary keys from strings representing text in any writing system and language that can be represented with Unicode. These keys can then be efficiently compared byte by byte in order to collate or sort the strings according to the rules of the language, with options for ignoring case, accents, etc. Unicode Technical Report #10 also specifies the ''Default Unicode Collation Element Table'' (DUCET). This data file specifies a default collation ordering. The DUCET is customizable for different languages. Some such customizations can be found in the Unicode Common Locale Data Repository (CLDR). An open source implementation of UCA is included with the International Components for Unicode (ICU). ICU supports tailoring, and the collation tailorings from CLDR are included in ICU. The effects of tailoring and many language-specific tailorings are displayed in the ...
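The idea of turning strings into binary keys that sort correctly under plain byte comparison can be illustrated with a toy sketch in Python. This is not the UCA and does not use the DUCET: the function `toy_sort_key` and its three levels (base letters, then accents, then case) are assumptions made purely for illustration, loosely modelled on the multi-level comparison described above. A real application would use an actual UCA implementation such as ICU.

```python
import unicodedata


def toy_sort_key(text):
    """Build a byte-string sort key with three levels (toy model, not DUCET).

    Level 1: base letters, case folded, accents removed.
    Level 2: the combining marks (accents) in order of appearance.
    Level 3: a case pattern, "1" for uppercase base letters, else "0".
    Levels are joined with a zero byte, which sorts below any other byte here.
    """
    decomposed = unicodedata.normalize("NFD", text)
    base = [ch for ch in decomposed if not unicodedata.combining(ch)]
    marks = [ch for ch in decomposed if unicodedata.combining(ch)]

    primary = "".join(base).casefold()
    secondary = "".join(marks)
    tertiary = "".join("1" if ch.isupper() else "0" for ch in base)

    return b"\x00".join(level.encode("utf-8")
                        for level in (primary, secondary, tertiary))


words = ["r\u00e9sum\u00e9", "rope", "Resume", "resume"]

by_code_point = sorted(words)                 # raw code point order
by_toy_key = sorted(words, key=toy_sort_key)  # byte comparison of the keys

print(by_code_point)
print(by_toy_key)
```

Sorting by raw code points places the accented word after "rope" and the capitalized word first, whereas the key-based sort groups all spellings of "resume" together and only then breaks ties by accent and case. Dropping the second or third level from the key would correspond to the "ignore accents" and "ignore case" options mentioned above.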


Common Locale Data Repository
The Common Locale Data Repository Project, often abbreviated as CLDR, is a project of the Unicode Consortium to provide locale data in XML format for use in computer applications. CLDR contains locale-specific information that an operating system will typically provide to applications. Among the types of data that CLDR includes are the following:
* Translations for language names
* Translations for territory and country names
* Translations for currency names, including singular/plural modifications
* Translations for weekday, month, era, period of day, in full and abbreviated forms
* Translations for time zones and example cities (or similar) for time zones
* Translations for calendar fields
* Patterns for formatting/parsing dates or times of day
* Exemplar sets of characters used for writing the language
* Patterns for formatting/parsing numbers
* Rules for language-adapted collation
* Rules for spelling out numbers as words
* Rules for formatting numbers in traditional n ...


Unihan Database
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are shared by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese (chữ Hán). Modern Chinese, Japanese and Korean typefaces typically use regional or historical variants of a given Han character. In the formulation of Unicode, an attempt was made to unify these variants by considering them different glyphs representing the same "grapheme", or orthographic unit, hence, "Han unification", with the resulting character repertoire sometimes contracted to Unihan. Nevertheless, many characters have regional variants assigned to different code points, such as Traditional 個 (U+500B) versus Simplified 个 (U+4E2A). Unihan can also refer to the Unihan Database maintained by the Unicode Consortium, which provides informa ...
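The point that some variants remain separate code points can be checked with a short Python sketch. It uses the two code points quoted above and assumes only the standard `unicodedata` module; the variable and function names are illustrative.

```python
import unicodedata

traditional = "\u500B"  # Traditional form, U+500B
simplified = "\u4E2A"   # Simplified form, U+4E2A


def code_point_label(ch):
    """Format a single character as a U+XXXX label."""
    return f"U+{ord(ch):04X}"


# Compatibility normalization does not merge the two forms: they are
# separately encoded characters, not glyph variants of one code point.
same_after_nfkc = (unicodedata.normalize("NFKC", traditional)
                   == unicodedata.normalize("NFKC", simplified))

print(code_point_label(traditional), code_point_label(simplified))
print("equal after NFKC:", same_after_nfkc)
```

Relating such pairs to each other therefore needs explicit variant data of the kind a character database provides; the standard library module used here does not expose such mappings.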


CJK Unified Ideographs Extension F
CJK Unified Ideographs Extension F is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese, as well as more than a thousand Sawndip characters for writing the Zhuang language. The block has 194 ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension F block: ...


CJK Unified Ideographs Extension E
CJK Unified Ideographs Extension E is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension E block: ...




CJK Unified Ideographs Extension D
CJK Unified Ideographs Extension D is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has hundreds of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension D block: ...


CJK Unified Ideographs Extension C
CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Note that the Katakana ligature 𪜈 (U+2A708) has been erroneously encoded in this block as a Han character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension C block: ...
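How character data treats U+2A708 can be inspected with a brief Python sketch. The block boundaries below (U+2A700 to U+2B73F) are an assumption supplied for the example, since the text above does not state the range, and the variable names are illustrative.

```python
import unicodedata

# Assumed range of the Extension C block (not stated in the text above).
EXT_C_FIRST, EXT_C_LAST = 0x2A700, 0x2B73F

ch = "\U0002A708"  # the character discussed above

in_ext_c = EXT_C_FIRST <= ord(ch) <= EXT_C_LAST
name = unicodedata.name(ch)          # algorithmic ideograph name
category = unicodedata.category(ch)  # general category

print(f"U+{ord(ch):04X}", name, category, "in Extension C:", in_ext_c)
```

The character carries an algorithmically derived ideograph name and the general category used for ideographs, so software relying on these properties alone will handle it as a Han character rather than as kana.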