Variation Selectors
   HOME
*





Variation Selectors
Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors. Each variation selector is used to specify a specific glyph variant for a preceding character. They are currently used to specify standardized variation sequences for mathematical symbols, emoji symbols, 'Phags-pa letters, and CJK unified ideographs corresponding to CJK compatibility ideographs. At present only standardized variation sequences with VS1, VS2, VS3, VS15 and VS16 have been defined; VS15 and VS16 are reserved to request that a character should be displayed as text or as an emoji respectively. These combining characters are named ''variation selector-1'' (for U+FE00) through to ''variation selector-16'' (U+FE0F), and are abbreviated VS1 – VS16. Each applies to the immediately preceding character. As of Unicode 13.0: * CJK compatibility ideograph variation sequences contain VS1–VS3 (U+FE00–U+FE02) * CJK Unified Ideographs Extension A and B variati ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Script (Unicode)
In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems. Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. For a list of languages supported by each script, see the list of languages by writing system. More or less complementary to scripts are symbols and Unicode control characters. The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual s ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Manichaean (Unicode Block)
Manichaean is a Unicode block containing characters historically used for writing Sogdian, Parthian, and the dialects of Fars Dialects of Pars ''(Persia)'' are a group of southwestern and northwestern Persian dialects spoken in the central Pars province. The southwestern dialects can be divided into three families of dialects according to geographical distribution and lo .... Block The block has five variation sequences defined for standardized variants. They use (VS1) to denote alternate letter forms: History The following Unicode-related documents record the purpose and process of defining specific characters in the Manichaean block: References {{Manichaeism footer Unicode blocks ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


International Committee For Information Technology Standards
The InterNational Committee for Information Technology Standards (INCITS), (pronounced "insights"), is an ANSI-accredited standards development organization composed of Information technology developers. It was formerly known as the X3 and NCITS. INCITS is the central U.S. forum dedicated to creating technology standards. INCITS is accredited by the American National Standards Institute (ANSI) and is affiliated with the Information Technology Industry Council, a global policy advocacy organization that represents U.S. and global innovation companies. INCITS coordinates technical standards activity between ANSI in the US and joint ISO/IEC committees worldwide. This provides a mechanism to create standards that will be implemented in many nations. As such, INCITS' Executive Board also serves as ANSI's Technical Advisory Group for ISO/IEC Joint Technical Committee 1. JTC 1 is responsible for International standardization in the field of information technology. INCITS operates th ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines as of the current version (15.0) 149,186 characters covering 161 modern and historic scripts, as well as symbols, emoji (including in colors), and non-visual control and formatting codes. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including modern operating systems, XML, and most modern programming languages. The Unicode character repertoire is synchronized with ISO/IEC 10646, each being code-for-code identical with the other. ''The Unicode Standard'', however, includes more than just the base code. Along ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Variant Form (Unicode)
A variant form is a different glyph for a character, encoded in Unicode through the mechanism of variation sequences: sequences in Unicode that consist of a base character followed by a variation selector character. A variant form usually has a very similar appearance and meaning as its base form. The mechanism is intended for variant forms where, generally, if the variant form is unavailable, displaying the base character does not change the meaning of the text, and may not even be noticeable by many readers. Unicode defines two types of variation sequences: * ''Standardized variation sequences'' defined in StandardizedVariants.txt * ''Ideographic variation sequences'' defined in the Ideographic Variation Database (IVD) Variation selector characters reside in several Unicode blocks: * Variation Selectors (16 characters abbreviated VS1–VS16) * Variation Selectors Supplement (240 characters abbreviated VS17–VS256) * Mongolian (3 characters abbreviated FVS1–FVS3) ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Variation Selectors Supplement
Variation Selectors Supplement is a Unicode block containing additional Variation Selectors beyond those found in the Variation Selectors Variation Selectors is the block name of a Unicode code point block containing 16 variation selectors. Each variation selector is used to specify a specific glyph variant for a preceding character. They are currently used to specify standardize ... block. These combining characters are named ''variation selector-17'' (for U+E0100) through to ''variation selector-256'' (U+E01EF), abbreviated VS17 – VS256. , VS17 (U+E0100) to VS48 (U+E011F) are used in ideographic variation sequences in the Unicode Ideographic Variation Database (IVD). These selectors are known as Ideographic Variation Selectors (IVS). They are not listed in the list of standardized variation sequence, instead they are listed in anotheIdeographic Variation Database History The following Unicode-related documents record the purpose and process of defining specific char ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Phags-pa (Unicode Block)
Phags-pa is a Unicode block containing characters from the 'Phags-pa script promulgated as a national script by Kublai Khan Kublai ; Mongolian script: ; (23 September 1215 – 18 February 1294), also known by his temple name as the Emperor Shizu of Yuan and his regnal name Setsen Khan, was the founder of the Yuan dynasty of China and the fifth khagan-emperor of ..., the founder of the Yuan dynasty. It was used primarily in writing Mongolian and Chinese, although it was intended for the use of all written languages of the Mongol Empire. Block The block has six variation sequences defined for standardized variants. They use (VS01): Note that four vowel letters have positional variants: History The following Unicode-related documents record the purpose and process of defining specific characters in the Phags-pa block: References {{reflist Unicode blocks ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Myanmar Extended-A
Myanmar Extended-A is a Unicode block containing Myanmar characters for writing the Khamti Shan The Tai Khamti, ( Khamti: တဲး ၵံးတီႈ, ( th, ชาวไทคำตี่, my, ခန္တီးရှမ်းလူမျိုး, Hkamti Shan) or simply Khamti as they are also known, are a Tai ethnic group native t ... and Aiton languages. Block The block has eleven variation sequences defined for standardized variants. They use (VS01) to denote the dotted letters used for the Khamti, Aiton, and Phake languages. (Note that this is font dependent. For example, the Padauk font supports some of the dotted forms.) History The following Unicode-related documents record the purpose and process of defining specific characters in the Myanmar Extended-A block: References {{reflist Unicode blocks ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Myanmar (Unicode Block)
Myanmar is a Unicode block containing characters for the Burmese, Mon, Shan, Palaung, and the Karen languages of Myanmar, as well as the Aiton and Phake languages of Northeast India. It is also used to write Pali and Sanskrit in Myanmar. Block The block has sixteen variation sequences defined for standardized variants. They use (VS01) to denote the dotted letters used for the Khamti, Aiton, and Phake languages. (Note that this is font dependent. For example, the Padauk font supports some of the dotted forms.) History The following Unicode-related documents record the purpose and process of defining specific characters in the Myanmar block: Historic and nonstandard uses of range In Unicode 1.0.0, part of the current Myanmar block was used for Tibetan. In Microsoft Windows, collation data referring to the old Tibetan block was retained as late as Windows XP, and removed in Windows 2003. In Myanmar, devices and software localisation often use Zawgyi font ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Halfwidth And Fullwidth Forms
In CJK (Chinese, Japanese and Korean) computing, graphic characters are traditionally classed into fullwidth (in Taiwan and Hong Kong: 全形; in CJK: 全角) and halfwidth (in Taiwan and Hong Kong: 半形; in CJK: 半角) characters. Unlike monospaced fonts, a halfwidth character occupies half the width of a fullwidth character, hence the name. ''Halfwidth and Fullwidth Forms'' is also the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. Rationale In the days of text mode computing, Western characters were normally laid out in a grid on the screen, often 80 columns by 24 or 25 lines. Each character was displayed as a small dot matrix, often about 8 pixels wide, and a SBCS (single-byte character set) was generally used to encode characters of Western languages. For aesthetic reasons and readability, it is preferable for Han characters to be approx ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Unicode Block
A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole. Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, social forums, etc. Design and implementation Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in English; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Basic Latin (Unicode Block)
The Basic Latin or C0 Controls and Basic Latin Unicode block is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding. It ranges from U+0000 to U+007F, contains 128 characters and includes the C0 controls, ASCII punctuation and symbols, ASCII digits, both the uppercase and lowercase of the English alphabet and a control character. The Basic Latin block was included in its present form from version 1.0.0 of the Unicode Standard, without addition or alteration of the character repertoire. Its block name in Unicode 1.0 was ASCII. Table of characters : The letter U+005C (\) may show up as a Yen(¥) or Won(₩) sign in Japanese/Korean fonts mistaking Unicode (especially UTF-8) as a legacy character set which replaced the backslash with these signs. Subheadings The C0 Controls and Basic Latin block contains six subheadings. C0 controls The C0 Controls, ref ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]