CJK Unified Ideographs Extension I
__FORCETOC__ CJK Unified Ideographs Extension I is a Unicode block comprising CJK Unified Ideographs included in drafts of an amendment to China's GB 18030 standard circulated in 2022 and 2023, which were fast-tracked into Unicode in 2023. Background Unlike most other sets of CJK unified ideographs, Extension I was not prepared and submitted by the Ideographic Research Group (IRG). GB 18030 is a mandatory national standard of the People's Republic of China (PRC). It defines a Unicode Transformation Format which retains compatibility with existing data in the earlier GBK (character encoding), GBK and EUC-CN character encodings, and specifies particular Unicode characters which devices sold in China must support. Its 2022 edition, , changed a number of required characters to map to standard Unicode code points, rather than to Private Use Areas, private use area code points. In late 2022, the PRC made a draft of a further amendment to be made to GB 18030 available for public consul ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Chinese Characters
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as '' kanji''. Chinese characters in South Korea, which are known as '' hanja'', retain significant use in Korean academia to study its documents, history, literature and records. Vietnam once used the ''chữ Hán'' and developed chữ Nôm to write Vietnamese before turning to a romanized alphabet. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users. The total number of Chinese characters ever to appear in a dictionary is in the tens of thousands, though most are g ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
ISO/IEC 10646
ISO/IEC JTC 1, entitled "Information technology", is a joint technical committee (JTC) of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). Its purpose is to develop, maintain and promote standards in the fields of information and communications technology (ICT). JTC 1 has been responsible for many critical IT standards, ranging from the Joint Photographic Experts Group (JPEG) image formats and Moving Picture Experts Group (MPEG) audio and video formats to the C and C++ programming languages. History ISO/IEC JTC 1 was formed in 1987 as a merger between ISO/TC 97 (Information Technology) and IEC/TC 83, with IEC/SC 47B joining later. The intent was to bring together, in a single committee, the IT standardization activities of the two parent organizations in order to avoid duplicative or possibly incompatible standards. At the time of its formation, the mandate of JTC 1 was to develop base standards in information tec ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Taiwan
Taiwan, officially the Republic of China (ROC), is a country in East Asia, at the junction of the East and South China Seas in the northwestern Pacific Ocean, with the People's Republic of China (PRC) to the northwest, Japan to the northeast, and the Philippines to the south. The territories controlled by the ROC consist of 168 islands, with a combined area of . The main island of Taiwan, also known as ''Formosa'', has an area of , with mountain ranges dominating the eastern two-thirds and plains in the western third, where its highly urbanised population is concentrated. The capital, Taipei, forms along with New Taipei City and Keelung the largest metropolitan area of Taiwan. Other major cities include Taoyuan, Taichung, Tainan, and Kaohsiung. With around 23.9 million inhabitants, Taiwan is among the most densely populated countries in the world. Taiwan has been settled for at least 25,000 years. Ancestors of Taiwanese indigenous peoples settled the island a ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Grass Radical
Radical 140 or radical grass () meaning "grass" is one of 29 of the 214 Kangxi radicals that are composed of 6 strokes. It transforms into when appearing at the top of a character or component. In the ''Kangxi Dictionary'' and in modern standard Traditional Chinese as used in Taiwan, Hong Kong and Macau, (with two horizontal strokes) consists of four strokes, while in Simplified Chinese and modern Japanese, (with a continuous horizontal stroke) consists of three strokes. In the ''Kangxi Dictionary'' there are 1902 characters (out of 40,000) found under this radical, making it the most commonly used radical. , the upper component form of , is the 30th indexing component in the ''Table of Indexing Chinese Character Components'' predominantly adopted by Simplified Chinese dictionaries published in mainland China, while is listed as its associated indexing component. Evolution File:艸-seal.svg, Small seal script character Derived characters Variant forms This radical cha ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
CJK Unified Ideographs Extension H
__FORCETOC__ CJK Unified Ideographs Extension H is a Unicode block containing rare and historic CJK Unified Ideographs for Chinese, Japanese, Korean, Sawndip Zhuang characters or ''Sawndip'' (Sawndip: ; ) are logograms derived from Chinese characters and used by the Zhuang people of Guangxi and Yunnan provinces in China to write the Zhuang languages for more than one thousand years. The script is used ..., and Vietnamese. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension H block: References {{Unicode CJK Unified Ideographs Unicode blocks ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Tertiary Ideographic Plane
In the Unicode standard, a plane is a continuous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+''hhhhhh''). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version , five of the planes have assigned code points (characters), and seven are named. The limit of 17 planes is due to UTF-16, which can encode 220 code points (16 planes) as pairs of words, plus the BMP as a single word. UTF-8 was designed with a much larger limit of 231 (2,147,483,648) code points (32,768 planes), and would still be able to encode 221 (2,097,152) code points (32 planes) even under the current limit of 4 bytes. The 17 planes can accommodate 1,114,1 ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
CJK Unified Ideographs Extension F
CJK Unified Ideographs Extension F is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese, as well as more than a thousand Sawndip characters for writing the Zhuang language The Zhuang languages (; autonym: , pre-1982: , Sawndip: 話僮, from ''vah'', 'language' and ''Cuengh'', 'Zhuang'; ) are any of more than a dozen Tai languages spoken by the Zhuang people of Southern China in the province of Guangxi and adja .... The block has 194 ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension F block: References {{Unicode CJK Unified Ideographs Unicode blocks ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Supplementary Ideographic Plane
In the Unicode standard, a plane is a continuous group of 65,536 (216) code points. There are 17 planes, identified by the numbers 0 to 16, which corresponds with the possible values 00–1016 of the first two positions in six position hexadecimal format (U+''hhhhhh''). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version , five of the planes have assigned code points (characters), and seven are named. The limit of 17 planes is due to UTF-16, which can encode 220 code points (16 planes) as pairs of words, plus the BMP as a single word. UTF-8 was designed with a much larger limit of 231 (2,147,483,648) code points (32,768 planes), and would still be able to encode 221 (2,097,152) code points (32 planes) even under the current limit of 4 bytes. The 17 planes can accommodate 1,114,1 ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
CJK Unified Ideographs Extension E
CJK Unified Ideographs Extension E is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension E block: References {{Unicode CJK Unified Ideographs Unicode blocks ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
CJK Unified Ideographs Extension D
CJK Unified Ideographs Extension D is a Unicode block A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the ad ... containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese. The block has hundreds of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension D block: References {{CJK ideographs in Unicode Unicode blocks ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Unicode Consortium
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the intention of replacing existing character encoding schemes which are limited in size and scope, and are incompatible with multilingual environments. The consortium describes its overall purpose as: Unicode's success at unifying character sets has led to its widespread adoption in the internationalization and localization of software. The standard has been implemented in many technologies, including XML, the Java programming language, Swift, and modern operating systems. Voting members include computer software and hardware companies with an interest in text-processing standards, including Adobe, Apple, the Bangladesh Computer Council, Emojipedia, Facebook, Google, IBM, Microsoft, the Omani Ministry of Endowments and Religious Affairs, ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Unihan
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the Han characters of the so-called CJK languages into a single set of unified characters. Han characters are a feature shared in common by written Chinese (hanzi), Japanese (kanji), Korean (hanja) and Vietnamese (chữ Hán). Modern Chinese, Japanese and Korean typefaces typically use regional or historical variants of a given Han character. In the formulation of Unicode, an attempt was made to unify these variants by considering them different glyphs representing the same "grapheme", or orthographic unit, hence, "Han unification", with the resulting character repertoire sometimes contracted to Unihan. Nevertheless, many characters have regional variants assigned to different code points, such as Traditional (U+500B) versus Simplified (U+4E2A). Unihan can also refer to the Unihan Database maintained by the Unicode Consortium, which provides informatio ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |