CJK Compatibility Ideographs Supplement
   HOME
*





CJK Compatibility Ideographs Supplement
CJK Compatibility Ideographs Supplement is a Unicode block containing Han characters used only for Round-trip format conversion, roundtrip compatibility mapping with planes 3, 4, 5, 6, 7, and 15 of CNS 11643-1992. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Compatibility Ideographs Supplement block: See also *CJK Unified Ideographs *CJK Compatibility Ideographs References

{{CJK ideographs in Unicode Unicode blocks ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Chinese Characters
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji''. Chinese characters in South Korea, which are known as ''hanja'', retain significant use in Korean academia to study its documents, history, literature and records. Vietnam once used the '' chữ Hán'' and developed chữ Nôm to write Vietnamese before turning to a romanized alphabet. Chinese characters are the oldest continuously used system of writing in the world. By virtue of their widespread current use throughout East Asia and Southeast Asia, as well as their profound historic use throughout the Sinosphere, Chinese characters are among the most widely adopted writing systems in the world by number of users. The total number of Chinese characters ever to appear in a dictionary is in the tens of thousands, though most are graph ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


CNS 11643-1992
CNS may refer to: Science and medicine * Central nervous system * Clinical nurse specialist * Coagulase-negative staphylococcus * Connectedness to nature scale * Conserved non-coding sequence of DNA * Crigler–Najjar syndrome * Crystallography and NMR system, a software library * Color Naming System * CNS (DNS server), Caching Name Server, a DNS server software product Military * CNS (chemical weapon), a mixture of chloroacetophenone, chloropicrin and chloroform * Chief of the Naval Staff (other), in several countries * Former Taiwanese navy ship prefix Education * Cicero-North Syracuse High School, New York, US * City of Norwich School, England * Computation and Neural Systems, a Caltech program Organisations * Canadian Nuclear Society * Congress of Neurological Surgeons * US Corporation for National Service, later Corporation for National and Community Service * Council for National Security, 2006 military of Thailand * Szekler National Council (), Romania * ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Unicode Block
A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole. Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, social forums, etc. Design and implementation Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in English; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Round-trip Format Conversion
{{noref, date=January 2019 The term round-trip is used in document conversion particularly involving markup languages such as XML and SGML. A successful round-trip consists of converting a document in format A (docA) to one in format B (docB) and then back again to format A (docA′). If docA and docA′ are identical then there has been no information loss and the round-trip has been successful. More generally it means converting from any data representation and back again, including from one data structure to another. Information loss When a document in one format is converted to another there is likely to be information loss. For example, suppose an HTML document is saved as plain text (*.txt). Then all the markup (structure, formatting, superscripts, …) will be lost. Compound documents will frequently lose information on images and other embedded objects. If the text file is converted back to the original format, information will necessarily be missing. A similar effect h ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




CNS 11643
The CNS 11643 character set (Chinese National Standard 11643), also officially known as the Chinese Standard Interchange Code or CSIC ( zh, tr=, t=中文標準交換碼), is officially the standard character set of Taiwan (Republic of China). In practice, variants of the related Big5 character set are ''de facto'' standard. CNS 11643 is designed to conform to ISO 2022. It contains 16 planes, so the maximum possible number of encodable characters is 16×94×94 = 141376. Planes 1 through 7 are defined by the standard; since 2007, planes 10 through 15 have also been defined by the standard. Prior to this, planes 12 to 15 (35344 code points) were specifically designated for user-defined characters. Unlike CCCII, the encoding of variant characters in CNS 11643 is not related. EUC-TW is an encoded representation of CNS 11643 and ASCII in Extended Unix Code (EUC) form. Other encodings capable of representing certain CSIC planes include ISO-2022-CN (planes 1 and 2) and ISO-2022-CN-EXT ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines as of the current version (15.0) 149,186 characters covering 161 modern and historic scripts, as well as symbols, emoji (including in colors), and non-visual control and formatting codes. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including modern operating systems, XML, and most modern programming languages. The Unicode character repertoire is synchronized with ISO/IEC 10646, each being code-for-code identical with the other. ''The Unicode Standard'', however, includes more than just the base code. Along ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


International Committee For Information Technology Standards
The InterNational Committee for Information Technology Standards (INCITS), (pronounced "insights"), is an ANSI-accredited standards development organization composed of Information technology developers. It was formerly known as the X3 and NCITS. INCITS is the central U.S. forum dedicated to creating technology standards. INCITS is accredited by the American National Standards Institute (ANSI) and is affiliated with the Information Technology Industry Council, a global policy advocacy organization that represents U.S. and global innovation companies. INCITS coordinates technical standards activity between ANSI in the US and joint ISO/IEC committees worldwide. This provides a mechanism to create standards that will be implemented in many nations. As such, INCITS' Executive Board also serves as ANSI's Technical Advisory Group for ISO/IEC Joint Technical Committee 1. JTC 1 is responsible for International standardization in the field of information technology. INCITS operates th ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO/IEC JTC 1/SC 2
ISO/IEC JTC 1/SC 2 Coded character sets is a standardization subcommittee of the Joint Technical Committee ISO/IEC JTC 1 of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), that develops and facilitates standards within the field of coded character sets. The international secretariat of ISO/IEC JTC 1/SC 2 is the Japanese Industrial Standards Committee (JISC), located in Japan. SC 2 is responsible for the development of the Universal Coded Character Set (ISO/IEC 10646) which is the international standard corresponding to the Unicode Standard. History ISO/IEC JTC 1/SC 2 was established in 1987, originally with the title “Character Sets and Information Coding,” with the area of work being, “the standardization of bit and byte coded representation of information for interchange including among others, sets of graphic characters, of control functions, of picture elements and audio information coding of text for proc ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Ideographic Rapporteur Group
The Ideographic Research Group (IRG), formerly called the Ideographic Rapporteur Group, is a subgroup of Working Group 2 (WG2) of ISO/IEC JTC 1/SC 2 (SC 2), the subcommittee of the Joint Technical Committee of ISO and IEC which is responsible for developing standards within the field of coded character sets. IRG is composed of experts from China, Japan, South Korea, Vietnam and other countries and regions that use Han characters, as well as experts representing the Unicode Consortium. The group is responsible for coordinating the addition of new CJK unified ideographs to the Universal Multiple-Octet Coded Character Set (ISO/IEC 10646) and the Unicode Standard. The group meets twice a year for 4-5 days each time, and reports its activity to the subsequent meeting of WG2. History The precursor to the Ideographic Rapporteur Group was the CJK Joint Research Group (CJK-JRG), which was established in 1990. In October 1993 this group was established as a subgroup of WG2 under SC2 with ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

CJK Unified Ideographs
The Chinese, Japanese and Korean (CJK) scripts share a common background, collectively known as CJK characters. In the process called Han unification, the common (shared) characters were identified and named CJK Unified Ideographs. As of Unicode 15.0, Unicode defines a total of 97,058 CJK Unified Ideographs. The term ''ideographs'' is a misnomer, as the Chinese script is not ideographic but rather logographic. Historically, Vietnam used Chinese characters too, so sometimes the abbreviation CJKV is used. Vietnamese use was replaced by the Latin-based Vietnamese alphabet in the 1920s. Sources The Ideographic Research Group (IRG) is responsible for developing extensions to the encoded repertoires of CJK unified ideographs. IRG processes proposals for new CJK unified ideographs submitted by its member bodies, and after undergoing several rounds of expert review, IRG submits a consolidated set of characters to ISO/IEC JTC 1/SC 2 Working Group 2 (WG2) and the Unicode Technical Commit ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




CJK Compatibility Ideographs
CJK Compatibility Ideographs is a Unicode block created to contain Han characters that were encoded in multiple locations in other established character encodings, in addition to their CJK Unified Ideographs assignments, in order to retain round-trip compatibility between Unicode and those encodings. Such encodings include the South Korean KS X 1001:1998 (U+F900–U+FA0B, 268 characters), Taiwanese Big5 (U+FA0C–U+FA0D, 2 characters), Japanese IBM 32 ( CP932 variant; U+FA0E–U+FA2D, 32 characters), South Korean KS X 1001:2004 (U+FA2E–U+FA2F, 2 character), Japanese JIS X 0213 (U+FA30–U+FA6A, 59 characters), Japanese ARIB STD-B24 (U+FA6B–U+FA6D, 3 characters) and the North Korean KPS 10721-2000 (U+FA70–U+FAD9, 106 characters) source standards. In ensuing versions of the standard, more characters have been added to the block. These even include a few regular ideographs (with the Unified_Ideograph property) that do not have duplicates (U+FA ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]