Block (Unicode)
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole. Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, or social forums.

Design and implementation

Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive, in English, of the nature of the symbols, such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters and ignore any whitespace, hyphens, and underscores, so the last name is equivalent to "supplemental_arrows__a ...
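The loose comparison rule for block names described above can be sketched in a few lines of Python. This is a minimal illustration, not an official implementation; the helper names `block_name_key` and `same_block_name` are hypothetical.

```python
import re


def block_name_key(name: str) -> str:
    """Reduce a block name to a key for loose comparison (hypothetical helper)."""
    # Drop whitespace, hyphens and underscores, then fold case.
    return re.sub(r"[\s\-_]+", "", name).lower()


def same_block_name(a: str, b: str) -> bool:
    """Return True when two spellings name the same block under the loose rule."""
    return block_name_key(a) == block_name_key(b)


print(same_block_name("Supplemental Arrows-A", "supplemental_arrows__a"))  # True
print(same_block_name("Tibetan", "Myanmar"))  # False
```

Comparing normalized keys rather than the raw strings keeps the rule in one place, so every lookup treats "Supplemental Arrows-A" and "SUPPLEMENTALARROWSA" alike.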


Code Point
In character encoding terminology, a code point, codepoint or code position is a numerical value that maps to a specific character. Code points usually represent a single grapheme (usually a letter, digit, punctuation mark, or whitespace) but sometimes represent symbols, control characters, or formatting. The set of all possible code points within a given encoding or character set makes up that encoding's codespace. For example, the character encoding scheme ASCII comprises 128 code points in the range 0 to 7F (hexadecimal), Extended ASCII comprises 256 code points in the range 0 to FF, and Unicode comprises 1,114,112 code points in the range 0 to 10FFFF. The Unicode code space is divided into seventeen planes (the Basic Multilingual Plane and 16 supplementary planes), each with 65,536 (= 2^16) code points. Thus the total size of the Unicode code space is 17 × 65,536 = 1,114,112.

Definition

The notion of a code point is used for abstraction, to distinguish both: * the num ...
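The arithmetic behind the size of the Unicode code space can be checked directly. The following is a small sketch; the constant names and the `plane_of` helper are chosen here for illustration only.

```python
PLANES = 17                    # Basic Multilingual Plane + 16 supplementary planes
POINTS_PER_PLANE = 2 ** 16     # 65,536 code points in each plane
CODESPACE_SIZE = PLANES * POINTS_PER_PLANE
MAX_CODE_POINT = 0x10FFFF      # highest Unicode code point


def plane_of(code_point: int) -> int:
    """Return the plane number (0 to 16) that contains a code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {code_point:#x}")
    # Each plane spans 2**16 code points, so the plane is the value above bit 15.
    return code_point >> 16


print(CODESPACE_SIZE)                        # 1114112
print(CODESPACE_SIZE == MAX_CODE_POINT + 1)  # True
print(plane_of(ord("A")))                    # 0
print(plane_of(0x1F600))                     # 1
```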


Miscellaneous Symbols
Miscellaneous Symbols is a Unicode block (U+2600–U+26FF) containing glyphs representing concepts from a variety of categories: astrological, astronomical, chess, dice, musical notation, political symbols, recycling, religious symbols, Bagua trigrams, warning signs, and weather, among others.

Emoji

The Miscellaneous Symbols block contains 83 emoji: U+2600–U+2604, U+260E, U+2611, U+2614–U+2615, U+2618, U+261D, U+2620, U+2622–U+2623, U+2626, U+262A, U+262E–U+262F, U+2638–U+263A, U+2640, U+2642, U+2648–U+2653, U+265F–U+2660, U+2663, U+2665–U+2666, U+2668, U+267B, U+267E–U+267F, U+2692–U+2697, U+2699, U+269B–U+269C, U+26A0–U+26A1, U+26A7, U+26AA–U+26AB, U+26B0–U+26B1, U+26BD–U+26BE, U+26C4–U+26C5, U+26C8, U+26CE–U+26CF, U+26D1, U+26D3–U+26D4, U+26E9–U+26EA, U+26F0–U+26F5, U+26F7–U+26FA and U+26FD. The block has 164 standardized variants defined to specify emoji-style (U+FE0F VS1 ...
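As a cross-check, the emoji list above can be expanded into individual code points and counted. This is a sketch only: the range string is copied from the text (with the final "and" written as a comma), and `parse_ranges` is a hypothetical helper, not part of any Unicode library.

```python
EMOJI_RANGES = (
    "U+2600–U+2604, U+260E, U+2611, U+2614–U+2615, U+2618, U+261D, U+2620, "
    "U+2622–U+2623, U+2626, U+262A, U+262E–U+262F, U+2638–U+263A, U+2640, "
    "U+2642, U+2648–U+2653, U+265F–U+2660, U+2663, U+2665–U+2666, U+2668, "
    "U+267B, U+267E–U+267F, U+2692–U+2697, U+2699, U+269B–U+269C, "
    "U+26A0–U+26A1, U+26A7, U+26AA–U+26AB, U+26B0–U+26B1, U+26BD–U+26BE, "
    "U+26C4–U+26C5, U+26C8, U+26CE–U+26CF, U+26D1, U+26D3–U+26D4, "
    "U+26E9–U+26EA, U+26F0–U+26F5, U+26F7–U+26FA, U+26FD"
)

BLOCK_START, BLOCK_END = 0x2600, 0x26FF  # Miscellaneous Symbols


def parse_ranges(text: str) -> set[int]:
    """Expand 'U+XXXX' and 'U+XXXX–U+YYYY' items into a set of code points."""
    points = set()
    for item in text.split(","):
        first, _, last = item.strip().partition("–")  # en dash separates ends
        start = int(first[2:], 16)                    # strip the 'U+' prefix
        end = int(last[2:], 16) if last else start
        points.update(range(start, end + 1))
    return points


emoji = parse_ranges(EMOJI_RANGES)
print(len(emoji))                                          # 83
print(all(BLOCK_START <= cp <= BLOCK_END for cp in emoji))  # True
```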

Hangul
The Korean alphabet, known as Hangul (also romanized as Hangeul, following South Korea's standard romanization) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The letters for the five basic consonants reflect the shape of the speech organs used to pronounce them, and they are systematically modified to indicate phonetic features; similarly, the vowel letters are systematically modified for related sounds, making Hangul a featural writing system. It has been described as a syllabic alphabet as it combines the features of alphabetic and syllabic writing systems, although it is not necessarily an abugida. Hangul was created in 1443 CE by King Sejong the Great in an attempt to increase literacy by serving as a complement (or alternative) to the logographic Sino-Korean Hanja, which had been used by Koreans as their primary script to write the Korean language since as early as the Gojoseon period (spanni ...


Hangul Syllables
Hangul Syllables is a Unicode block containing precomposed Hangul syllable blocks for modern Korean. The syllables can be directly mapped by algorithm to sequences of two or three characters in the Hangul Jamo Unicode block:
* one of U+1100–U+1112: the 19 modern Hangul leading consonant jamos;
* one of U+1161–U+1175: the 21 modern Hangul vowel jamos;
* none, or one of U+11A8–U+11C2: the 27 modern Hangul trailing consonant jamos.
This block is encoded according to the canonically equivalent order of these two or three jamos (one from each subrange above) composing each syllable. Note that a full Hangul syllable may include one of these characters yet be preceded by one or more leading consonant jamos and followed by one or more trailing jamos (possibly preceded by one or more vowel jamos, if the encoded syllable is composed of two jamos and does not include any trailing consonant jamo). Also, some Hangul syllables may not include any one of these precomposed char ...
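The algorithmic mapping between jamo sequences and precomposed syllables can be sketched as index arithmetic over the three jamo ranges listed above. The constant and function names below are chosen for illustration; the trailing base is set one below U+11A8 so that index 0 can stand for "no trailing consonant", giving 27 + 1 = 28 trailing slots. Starting the block at U+AC00 follows the range given for Hangul Syllables elsewhere on this page.

```python
import unicodedata

S_BASE = 0xAC00                # first code point of the Hangul Syllables block
L_BASE, L_COUNT = 0x1100, 19   # leading consonants U+1100–U+1112
V_BASE, V_COUNT = 0x1161, 21   # vowels U+1161–U+1175
T_BASE, T_COUNT = 0x11A7, 28   # index 0 = none; 1..27 map to U+11A8–U+11C2
S_COUNT = L_COUNT * V_COUNT * T_COUNT  # 11,172 precomposed syllables


def compose(lead: str, vowel: str, trail: str = "") -> str:
    """Combine two or three modern jamos into one precomposed syllable."""
    l = ord(lead) - L_BASE
    v = ord(vowel) - V_BASE
    t = ord(trail) - T_BASE if trail else 0
    valid_trail = (1 <= t < T_COUNT) if trail else True
    if not (0 <= l < L_COUNT and 0 <= v < V_COUNT and valid_trail):
        raise ValueError("not a modern leading/vowel/trailing jamo sequence")
    return chr(S_BASE + (l * V_COUNT + v) * T_COUNT + t)


def decompose(syllable: str) -> str:
    """Split a precomposed syllable back into its two or three jamos."""
    s = ord(syllable) - S_BASE
    if not 0 <= s < S_COUNT:
        raise ValueError("not a precomposed Hangul syllable")
    l, rest = divmod(s, V_COUNT * T_COUNT)
    v, t = divmod(rest, T_COUNT)
    jamos = chr(L_BASE + l) + chr(V_BASE + v)
    return jamos + chr(T_BASE + t) if t else jamos


han = compose("\u1112", "\u1161", "\u11AB")
print(f"U+{ord(han):04X}")                                   # U+D55C
print(decompose(han) == unicodedata.normalize("NFD", han))   # True
print(S_COUNT)                                               # 11172
```

Comparing `decompose` against `unicodedata.normalize("NFD", ...)` from the standard library is a convenient sanity check, since canonical decomposition yields the same jamo sequence.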




CJK Unified Ideographs Extension A
CJK Unified Ideographs Extension A is a Unicode block containing rare Han ideographs. The block has dozens of variation sequences defined for standardized variants. It also has thousands of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD). These sequences specify the desired glyph variant for a given Unicode character.

History

The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension A block: ...


Hangul (obsolete Unicode Block)
Hangul, Hangul Supplementary-A, and Hangul Supplementary-B were character blocks that existed in Unicode 1.0 and 1.1, and ISO/IEC 10646-1:1993. These blocks encoded precomposed modern Hangul syllables. These three Unicode 1.x blocks were deleted and superseded by the new Hangul Syllables block (U+AC00–U+D7AF) in Unicode 2.0 (July 1996) and ISO/IEC 10646-1:1993 Amd. 5 (1998); their former ranges are now occupied by CJK Unified Ideographs Extension A and Yijing Hexagram Symbols. Moving or removing existing characters has been prohibited by the Unicode Stability Policy for all versions following Unicode 2.0, so the Hangul Syllables block introduced in Unicode 2.0 is immutable.

Documentation

The Unicode 1.0.0 code chart is still available online, including the Korean Hangul Syllables block, but not the supplements added in Unicode 1.1. Full code charts for Unicode 1.1 were "never created", since Unicode 1.1 was published only as a report amending Unicode 1.0 due to the urgency of releasing it; how ...

Tibetan Script
The Tibetan script is a segmental writing system (abugida) of Indic origin used to write certain Tibetic languages, including Tibetan, Dzongkha, Sikkimese, Ladakhi, Jirel and Balti. It has also been used for some non-Tibetic languages in close cultural contact with Tibet, such as Thakali. The printed form is called uchen script, while the hand-written cursive form used in everyday writing is called umê script. This writing system is used across the Himalayas and Tibet. The script is closely linked to a broad ethnic Tibetan identity, spanning areas in India, Nepal, Bhutan and Tibet. The Tibetan script is of Brahmic origin from the Gupta script and is ancestral to scripts such as Meitei, Lepcha, (Daniels, Peter T. and William Bright. The World's Writing Systems. New York: Oxford University Press, 1996.) ...


Tibetan (Unicode Block)
Tibetan is a Unicode block containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.

Former Tibetan block

The Tibetan Unicode block is unique for having been allocated in version 1.0.0 with a virama-based encoding that was unable to distinguish visible and conjunct consonants correctly. This encoding was removed from the Unicode Standard in version 1.0.1 in the process of unifying with ISO 10646 for version 1.1, then reintroduced as an explicit root/subjoined encoding, with a larger block size, in version 2.0. Moving or removing existing characters has been prohibited by the Unicode Stability Policy for all versions following Unicode 2.0, so the Tibetan characters encoded in Unicode 2.0 and all subsequent versions are immutable. The range of the former Unicode 1.0.0 Tibetan block has been occupied by the Myanmar block since Unicode 3.0. In Microsoft Windows, collation data refer ...


Myanmar (Unicode Block)
Myanmar is a Unicode block containing characters for the Burmese, Mon, Shan, Palaung, and the Karen languages of Myanmar, as well as the Aiton and Phake languages of Northeast India. It is also used to write Pali and Sanskrit in Myanmar.

Block

The block has sixteen variation sequences defined for standardized variants. They use U+FE00 VARIATION SELECTOR-1 (VS01) to denote the dotted letters used for the Khamti, Aiton, and Phake languages. (Note that this is font dependent. For example, the Padauk font supports some of the dotted forms.)

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Myanmar block: ...

Historic and nonstandard uses of range

In Unicode 1.0.0, part of the current Myanmar block was used for Tibetan. In Microsoft Windows, collation data referring to the old Tibetan block was retained as late as Windows XP, and removed in Windows 2003. In Myanmar, devices and software localisation often use Zawgyi fon ...





Script (Unicode)
In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems. Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, Turkish was written in the Arabic script before the 20th century but transitioned to Latin in the early part of the 20th century. For a list of languages supported by each script, see the list of languages by writing system. More or less co ...

Hexadecimal
In mathematics and computing, the hexadecimal (also base-16 or simply hex) numeral system is a positional numeral system that represents numbers using a radix (base) of 16. Unlike the decimal system representing numbers using 10 symbols, hexadecimal uses 16 distinct symbols, most often the symbols "0"–"9" to represent values 0 to 9, and "A"–"F" (or alternatively "a"–"f") to represent values from 10 to 15. Software developers and system designers widely use hexadecimal numbers because they provide a human-friendly representation of binary-coded values. Each hexadecimal digit represents four bits (binary digits), also known as a nibble (or nybble). For example, an 8-bit byte can have values ranging from 00000000 to 11111111 in binary form, which can be conveniently represented as 00 to FF in hexadecimal. In mathematics, a subscript is typically used to specify the base. For example, the decimal value would be expressed in hexadecimal as . In programming, a number of ...
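The correspondence between one hexadecimal digit and one nibble (four bits) can be illustrated with a short sketch. The helper name `byte_to_hex` is hypothetical, and the sample values are chosen here for illustration; only built-in Python functions are used.

```python
def byte_to_hex(bits: str) -> str:
    """Convert an 8-bit binary string to two hexadecimal digits, nibble by nibble."""
    if len(bits) != 8 or set(bits) - {"0", "1"}:
        raise ValueError("expected exactly eight binary digits")
    high, low = bits[:4], bits[4:]  # each nibble maps to exactly one hex digit
    return format(int(high, 2), "X") + format(int(low, 2), "X")


print(byte_to_hex("00000000"))  # 00
print(byte_to_hex("11111111"))  # FF
print(byte_to_hex("10100101"))  # A5
print(int("FF", 16))            # 255
print(format(255, "08b"))       # 11111111
```

Working one nibble at a time mirrors how hexadecimal is usually read by hand: the byte never has to be converted to decimal first.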