Code Page 54936
GB 18030 is a Chinese government standard, titled "Information Technology — Chinese coded character set", that defines the language and character support required of software in China. GB18030 is also the registered Internet name for the official character set of the People's Republic of China (PRC), superseding GB2312. As a Unicode Transformation Format (i.e. an encoding of all Unicode code points), GB18030 supports both simplified and traditional Chinese characters. It is also compatible with the legacy encodings GB2312, CP936, and GBK 1.0. In addition to the GB18030 character encoding itself, the standard contains requirements about which scripts must be supported, font support, etc. As of 2022, in terms of font implementations, "only the Simplified Chinese fonts of the Noto Sans CJK (Google), Source Han Mono (Adobe), and Source Han Sans (Adobe) typeface families are already compliant with GB 18030-2022 Implementation Level 2." Microsoft ...
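
The claims above (full Unicode coverage, compatibility with GB2312) can be checked with a small sketch. It assumes Python's built-in "gb18030" and "gb2312" codecs, which may not track the latest revision of the standard; the helper name gb18030_byte_lengths and the sample string are illustrative only.

```python
def gb18030_byte_lengths(text: str) -> list[int]:
    """Return how many GB18030 bytes each character of `text` occupies."""
    return [len(ch.encode("gb18030")) for ch in text]


# An ASCII letter, a common hanzi, and a supplementary-plane ideograph (U+20000).
sample = "A汉\U00020000"
print(gb18030_byte_lengths(sample))

# Every Unicode code point survives a round trip through GB18030.
assert sample.encode("gb18030").decode("gb18030") == sample

# A character that exists in GB2312 keeps the same bytes in GB18030.
assert "汉".encode("gb2312") == "汉".encode("gb18030")
```

The byte lengths show why GB18030 is described as variable-length: ASCII stays at one byte, the legacy double-byte range is preserved, and everything else is reached through four-byte sequences.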


Code Page 936 (Microsoft Windows)
Windows code page 936 (abbreviated MS936, Windows-936 or, ambiguously, CP936) is Microsoft's character encoding for Simplified Chinese, one of the four double-byte character sets (DBCSs) for East Asian languages. Originally, Windows-936 covered GB 2312 (in its EUC-CN form), but it was expanded to cover most of GBK with the release of Windows 95. IBM's Code page 936 is a different encoding for Simplified Chinese, although International Components for Unicode does not include an IBM-936 codec and uses the Windows code page for the "cp936" label. IBM's code page for GBK coverage is Code page 1386 (CP1386 or IBM-1386), which is defined as a combination of the single-byte Code page 1114 and the double-byte Code page 1385. It was superseded by code page 54936 (GB 18030), but remained in widespread use. The Windows command prompt uses CP936 as the default code page for Simplified Chinese installations, although part of GB 18030 was made mandatory for all software products sold in China. In 2002, the IAN ...
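
To illustrate the difference in coverage between code page 936 and its successor, here is a minimal sketch. It assumes Python's "cp936" label, which is an alias for Python's own GBK codec and is not guaranteed to match Microsoft's table byte for byte; the helper name encodable is hypothetical.

```python
def encodable(ch: str, codec: str) -> bool:
    """Report whether `ch` can be encoded with the named codec."""
    try:
        ch.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


common = "汉"                # covered by GB 2312, GBK and GB 18030
rare = "\U00020000"         # supplementary-plane ideograph, outside GBK

for ch in (common, rare):
    print(f"U+{ord(ch):04X}", encodable(ch, "cp936"), encodable(ch, "gb18030"))
```

A double-byte code page has no room for characters beyond its table, whereas GB 18030 reaches them through four-byte sequences.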


UTF-16
UTF-16 (16-bit Unicode Transformation Format) is a character encoding capable of encoding all 1,112,064 valid code points of Unicode (in fact, this number of code points is dictated by the design of UTF-16). The encoding is variable-length, as code points are encoded with one or two 16-bit code units. UTF-16 arose from an earlier, now obsolete fixed-width 16-bit encoding known as UCS-2 (for 2-byte Universal Character Set), once it became clear that more than 2^16 (65,536) code points were needed. UTF-16 is used by systems such as the Microsoft Windows API, the Java programming language and JavaScript/ECMAScript. It is also sometimes used for plain text and word-processing data files on Microsoft Windows. It is rarely used for files on Unix-like systems. UTF-16 is often claimed to be more space-efficient than UTF-8 for East Asian languages, since it uses two bytes for characters that take 3 bytes in UTF-8. Since real text contains many s ...
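
The "one or two 16-bit code units" rule can be sketched as follows. This is a minimal illustration of the surrogate-pair arithmetic, not a production encoder; the function name utf16_code_units is an assumption made for this example.

```python
def utf16_code_units(cp: int) -> list[int]:
    """Encode one Unicode code point as a list of 16-bit UTF-16 code units."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("outside the Unicode codespace")
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("surrogate code points are not valid scalar values")
    if cp < 0x10000:
        return [cp]                      # BMP: a single code unit
    offset = cp - 0x10000                # 20-bit value for the supplementary planes
    high = 0xD800 + (offset >> 10)       # top 10 bits go into the high surrogate
    low = 0xDC00 + (offset & 0x3FF)      # bottom 10 bits go into the low surrogate
    return [high, low]


print([hex(u) for u in utf16_code_units(0x6C49)])    # a BMP character
print([hex(u) for u in utf16_code_units(0x1F600)])   # a supplementary character

# The 2,048 surrogate values are reserved, which is where 1,112,064 comes from.
assert 0x110000 - 2048 == 1_112_064
```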


UTF-8
UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from "Unicode (or Universal Coded Character Set) Transformation Format 8-bit". UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units. Code points with lower numerical values, which tend to occur more frequently, are encoded using fewer bytes. It was designed for backward compatibility with ASCII: the first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single byte with the same binary value as ASCII, so that valid ASCII text is valid UTF-8-encoded Unicode as well. UTF-8 was designed as a superior alternative to UTF-1, a proposed variable-length encoding with partial ASCII compatibility which lacked some features including self-synchronization and fully ASCII-compatible handling ...
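
The relationship between a code point's value and its UTF-8 length, and the ASCII compatibility described above, can be sketched like this. The function name utf8_length is illustrative; the comparison relies on Python's built-in "utf-8" and "ascii" codecs.

```python
def utf8_length(cp: int) -> int:
    """Return the number of bytes UTF-8 uses for the code point `cp`."""
    if cp < 0x80:
        return 1        # ASCII range
    if cp < 0x800:
        return 2
    if cp < 0x10000:
        return 3        # rest of the Basic Multilingual Plane
    return 4            # supplementary planes, up to U+10FFFF


for ch in "Aé汉😀":
    # The computed length should agree with the real encoder.
    print(f"U+{ord(ch):04X}", utf8_length(ord(ch)), len(ch.encode("utf-8")))

# Valid ASCII text is byte-for-byte identical when encoded as UTF-8.
assert "plain ASCII".encode("ascii") == "plain ASCII".encode("utf-8")
```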




UCS-2
The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, "Information technology — Universal Coded Character Set (UCS)" (plus amendments to that standard), which is the basis of many character encodings and continues to grow as characters from previously unrepresented writing systems are added. The UCS has over 1.1 million possible code points available for use or allocation, but only the first 65,536, which make up the Basic Multilingual Plane (BMP), had entered into common use before 2000. This situation began changing when the People's Republic of China (PRC) ruled in 2006 that all software sold in its jurisdiction would have to support GB 18030. This required software intended for sale in the PRC to move beyond the BMP. The system deliberately leaves many code points not assigned to characters, even in the BMP. It does this to allow for future expansion or to minimise conflicts with other encoding forms. The ...


Code Points
In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme (usually a letter, digit, punctuation mark, or whitespace) but sometimes represent symbols, control characters, or formatting. The set of all possible code points within a given encoding or character set makes up that encoding's codespace. For example, the character encoding scheme ASCII comprises 128 code points in the range 0 to 7F (hexadecimal), Extended ASCII comprises 256 code points in the range 0 to FF, and Unicode comprises 1,114,112 code points in the range 0 to 10FFFF. The Unicode code space is divided into seventeen planes (the Basic Multilingual Plane and 16 supplementary planes), each with 65,536 (= 2^16) code points. Thus the total size of the Unicode code space is 17 × 65,536 = 1,114,112.

Definition
The notion of a code point is used for abstraction, to distinguish both: * the n ...
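
The codespace sizes quoted in this entry follow from simple arithmetic, which the sketch below reproduces. The constant names and the helper code_point_label are made up for this illustration; ord and chr are Python built-ins that convert between a character and its code point.

```python
ASCII_CODESPACE = 0x7F + 1            # 128 code points, 0 to 7F
EXTENDED_ASCII_CODESPACE = 0xFF + 1   # 256 code points, 0 to FF

PLANES = 17
PLANE_SIZE = 2 ** 16                  # 65,536 code points per plane
UNICODE_CODESPACE = PLANES * PLANE_SIZE


def code_point_label(ch: str) -> str:
    """Format a character's code point in the usual U+hhhh notation."""
    return f"U+{ord(ch):04X}"


print(ASCII_CODESPACE, EXTENDED_ASCII_CODESPACE, UNICODE_CODESPACE)
print(code_point_label("A"), code_point_label("汉"), code_point_label("😀"))

# The last code point, 10FFFF, is one less than the size of the codespace.
assert UNICODE_CODESPACE == 0x10FFFF + 1
```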


Basic Multilingual Plane
In the Unicode standard, a plane is a contiguous group of 65,536 (2^16) code points. There are 17 planes, identified by the numbers 0 to 16, which correspond to the possible values 00–10 (hexadecimal) of the first two positions in the six-position hexadecimal format (U+hhhhhh). Plane 0 is the Basic Multilingual Plane (BMP), which contains most commonly used characters. The higher planes 1 through 16 are called "supplementary planes". The last code point in Unicode is the last code point in plane 16, U+10FFFF. As of Unicode version , five of the planes have assigned code points (characters), and seven are named. The limit of 17 planes is due to UTF-16, which can encode 2^20 code points (16 planes) as pairs of words, plus the BMP as a single word. UTF-8 was designed with a much larger limit of 2^31 (2,147,483,648) code points (32,768 planes), and would still be able to encode 2^21 (2,097,152) code points (32 planes) even under the current limit of 4 bytes. The 17 planes can accommodate 1,114,1 ...
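
Because each plane holds 2^16 code points, the plane number is simply the code point with its low 16 bits shifted away. The sketch below shows this together with the plane counts mentioned above; the function name plane_of is an assumption for the example.

```python
PLANE_BITS = 16   # each plane holds 2**16 code points


def plane_of(cp: int) -> int:
    """Return the plane number (0 to 16) that contains the code point `cp`."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("outside the Unicode codespace")
    return cp >> PLANE_BITS


for cp in (0x0041, 0x6C49, 0x1F600, 0x20000, 0x10FFFF):
    kind = "BMP" if plane_of(cp) == 0 else "supplementary"
    print(f"U+{cp:04X} is in plane {plane_of(cp)} ({kind})")

# UTF-16: 2**20 code points via surrogate pairs (16 planes) plus the BMP.
assert (2 ** 20 + 2 ** 16) >> PLANE_BITS == 17
# UTF-8 as originally designed, and under the current 4-byte limit.
assert 2 ** 31 >> PLANE_BITS == 32_768
assert 2 ** 21 >> PLANE_BITS == 32
```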


Private Use Area
In Unicode, a Private Use Area (PUA) is a range of code points that, by definition, will not be assigned characters by the Unicode Consortium. Three private use areas are defined: one in the Basic Multilingual Plane (U+E000–U+F8FF), and one each in, and nearly covering, planes 15 and 16 (U+F0000–U+FFFFD, U+100000–U+10FFFD). The code points in these areas cannot be considered as standardized characters in Unicode itself. They are intentionally left undefined so that third parties may define their own characters without conflicting with Unicode Consortium assignments. Under the Unicode Stability Policy, the Private Use Areas will remain allocated for that purpose in all future Unicode versions. Assignments to Private Use Area characters need not be private in the sense of strictly internal to an organisation; a number of assignment schemes have been published by several organisations. Such publication may include a font that supports the definition (showing the glyphs), and software making use of the private-use characters (e ...
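
A membership test for the three private use areas can be sketched as below. The ranges U+E000–U+F8FF, U+F0000–U+FFFFD and U+100000–U+10FFFD are those defined by the Unicode Standard; the names PRIVATE_USE_AREAS and is_private_use are made up for this example, and the cross-check relies on the general category "Co" reported by Python's unicodedata module.

```python
import unicodedata

# (first, last) code point of each Private Use Area, inclusive.
PRIVATE_USE_AREAS = (
    (0xE000, 0xF8FF),       # in the Basic Multilingual Plane
    (0xF0000, 0xFFFFD),     # nearly all of plane 15
    (0x100000, 0x10FFFD),   # nearly all of plane 16
)


def is_private_use(cp: int) -> bool:
    """Return True if the code point lies in one of the three PUAs."""
    return any(first <= cp <= last for first, last in PRIVATE_USE_AREAS)


for cp in (0xE000, 0xF8FF, 0xF900, 0xFFFFD, 0xFFFFE, 0x10FFFD):
    # unicodedata assigns general category "Co" to private-use code points.
    agrees = (unicodedata.category(chr(cp)) == "Co") == is_private_use(cp)
    print(f"U+{cp:04X}", is_private_use(cp), agrees)
```

The last two code points of planes 15 and 16 are excluded because they are noncharacters, which is why the areas only "nearly" cover those planes.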


GB/T 20542
The National Standards of the People's Republic of China, coded as GB, are the standards issued by the Standardization Administration of China under the authorization of Article 10 of the Standardization Law of the People's Republic of China. According to Article 2, national standards are divided into mandatory national standards and recommended national standards. Mandatory national standards are prefixed "GB". Recommended national standards are prefixed "GB/T". Guidance technical documents are prefixed "GB/Z", but are not legally part of the national standard system. Mandatory national standards are the basis for the product testing which products must undergo during China Compulsory Certificate (CCC or 3C) certification. If there is no corresponding mandatory national standard, CCC is not required.

List
A non-exhaustive list of National Standards of the People's Republic of China follows, accompanied by similar international standards of ISO, marked as i ...


GB 16959
GB, or Gb, may refer to:

Places
* United Kingdom (ISO 3166-1 code), a sovereign country situated off the north-western coast of continental Europe
** Great Britain, an island situated off the north-western coast of continental Europe
** Kingdom of Great Britain (1707–1800), a predecessor country of the United Kingdom
* Gilgit-Baltistan, a region in northern Pakistan
* Guinea-Bissau, a sovereign state in West Africa
* Green Bay, Wisconsin, United States
* Great Barrington, Massachusetts, United States

Businesses and organisations
* GB Airways, a British airline
* Gardner Bender, a manufacturer of professional electrician's tools and supplies
* Girls' Brigade, a Christian organization for girls
* Grande Bibliothèque, a large public library in Montreal
* University of Wisconsin–Green Bay, an American university
* ABX Air (IATA airline designator GB), a cargo airline
* GB Glace, a Swedish ice cream company
* Griesedieck Brothers beer, an American beer brand
* GB Supermarkets, a B ...


Tibetan Script
The Tibetan script is a segmental writing system (abugida) of Indic origin used to write certain Tibetic languages, including Tibetan, Dzongkha, Sikkimese, Ladakhi, Jirel and Balti. It has also been used for some non-Tibetic languages in close cultural contact with Tibet, such as Thakali. The printed form is called uchen script, while the hand-written cursive form used in everyday writing is called umê script. This writing system is used across the Himalayas and Tibet. The script is closely linked to a broad ethnic Tibetan identity, spanning areas in India, Nepal, Bhutan and Tibet. The Tibetan script is of Brahmic origin, derived from the Gupta script, and is ancestral to scripts such as Meitei, Lepcha, [Daniels, Peter T. and William Bright. The World's Writing Systems. New York: Oxford University Press, 1996.] ...


Mongolian Script
The classical or traditional Mongolian script, also known as the Hudum Mongol bichig, was the first writing system created specifically for the Mongolian language, and was the most widespread until the introduction of Cyrillic in 1946. It is traditionally written in vertical lines. Derived from the Old Uyghur alphabet, Mongolian is a true alphabet, with separate letters for consonants and vowels. The Mongolian script has been adapted to write languages such as Oirat and Manchu. Alphabets based on this classical vertical script are used in Mongolia and Inner Mongolia to this day to write Mongolian, Xibe and, experimentally, Evenki. Computer operating systems have been slow to adopt support for the Mongolian script, and almost all have incomplete support or other text rendering difficulties.

History
The Mongolian vertical script developed as an adaptation of the Old Uyghur alphabet for the Mongolian language. From the seventh and eighth to the fifteenth and sixteenth centuries, the Mongolia ...