Thai (Unicode Block)

	Thai (Unicode Block) Thai is a Unicode block containing characters for the Thai, Lanna Tai, and Pali languages. It is based on the Thai Industrial Standard 620-2533. Block History The following Unicode-related documents record the purpose and process of defining specific characters in the Thai block: ...
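A minimal sketch of what "belonging to the Thai block" means in practice. The range U+0E00..U+0E7F is an assumption taken from the published Unicode block listing, not from the entry above, and the helper names are hypothetical.

```python
# Assumption: the Thai block covers U+0E00..U+0E7F (from the Unicode block
# listing, not stated in the entry above).
THAI_BLOCK_START = 0x0E00
THAI_BLOCK_END = 0x0E7F


def in_thai_block(ch: str) -> bool:
    """Return True if the single character `ch` falls inside the Thai block."""
    return THAI_BLOCK_START <= ord(ch) <= THAI_BLOCK_END


def thai_block_chars(text: str) -> list[str]:
    """Return the characters of `text` that fall inside the Thai block."""
    return [ch for ch in text if in_thai_block(ch)]


if __name__ == "__main__":
    sample = "Thai: \u0e2d\u0e31\u0e01\u0e29\u0e23\u0e44\u0e17\u0e22"
    print(len(thai_block_chars(sample)), "of", len(sample), "characters are in the Thai block")
```

Block membership says nothing about whether a code point is actually assigned; the block also contains reserved positions.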
	Thai Script The Thai script (Thai: อักษรไทย) is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai alphabet itself (as used to write Thai) has 44 consonant symbols (Thai: พยัญชนะ, phayanchana), 16 vowel symbols (Thai: สระ, sara) that combine into at least 32 vowel forms, and four tone diacritics (Thai: วรรณยุกต์ or วรรณยุต) to create characters mostly representing syllables. Although commonly referred to as the "Thai alphabet", the script is not a true alphabet but an abugida, a writing system in which the full characters represent consonants, with diacritical marks for vowels; the absence of a vowel diacritic gives an implied 'a' or 'o'. Consonants are written horizontally from left to right, and vowels following a consonant in speech are written above, below, to the left or to the right of it, or a combination of those. History The Thai alphabet is de ...
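The consonant-plus-diacritic structure described above is visible in the Unicode character properties. The sketch below uses Python's standard `unicodedata` module to inspect an illustrative three-code-point sequence (a consonant, an above-base vowel sign, and a tone mark); the sequence and the helper names are chosen for illustration only.

```python
import unicodedata


def describe(ch: str) -> tuple[str, str, str]:
    """Return (code point label, Unicode name, general category) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))


def split_base_and_marks(text: str) -> tuple[list[str], list[str]]:
    """Separate base characters from nonspacing marks (general category Mn)."""
    bases = [ch for ch in text if unicodedata.category(ch) != "Mn"]
    marks = [ch for ch in text if unicodedata.category(ch) == "Mn"]
    return bases, marks


if __name__ == "__main__":
    # Illustrative sequence: KO KAI (consonant) + SARA I (vowel) + MAI EK (tone mark)
    sequence = "\u0e01\u0e34\u0e48"
    for ch in sequence:
        print(describe(ch))
    print(split_base_and_marks(sequence))
```

The consonant is a letter (category Lo), while the vowel sign and tone mark written above it are nonspacing marks (category Mn). Vowels written to the left or right of the consonant are encoded as ordinary letters, so this split does not capture every vowel.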
	Script (Unicode) In Unicode, a script is a collection of letters and other written signs used to represent textual information in one or more writing systems. Some scripts support one and only one writing system and language, for example, Armenian. Other scripts support many different writing systems; for example, the Latin script supports English, French, German, Italian, Vietnamese, Latin itself, and several other languages. Some languages make use of multiple alternate writing systems and thus also use several scripts; for example, in Turkish, the Arabic script was used before the 20th century but transitioned to Latin in the early part of the 20th century. For a list of languages supported by each script, see the list of languages by writing system. More or less complementary to scripts are symbols and Unicode control characters. The unified diacritical characters and unified punctuation characters frequently have the "common" or "inherited" script property. However, the individual s ...
	Northern Thai Language Kam Mueang (Northern Thai: กำเมือง) or Northern Thai language (Thai: ภาษาไทยถิ่นเหนือ) is the language of the Northern Thai people of Lanna, Thailand. It is a Southwestern Tai language that is closely related to Lao. Kam Mueang has approximately six million speakers, most of whom live in Northern Thailand, with a smaller community of Lanna speakers in northwestern Laos. Speakers of this language generally consider the name "Tai Yuan" to be pejorative. They refer to themselves as คนเมือง (literally "people of Mueang", meaning "city dwellers"), Lanna, or Northern Thai. The language is also sometimes referred to as พายัพ, "Northwestern (speech)". The term Yuan is still sometimes used for Northern Thai's distinctive Tai Tham alphabet, which is closely related to the old Tai Lue alphabet and the Lao religious alphabets. The use of the traditional alphabet is now largel ...
	Pali Pali is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist Pāli Canon or Tipiṭaka as well as the sacred language of Theravāda Buddhism (Stargardt, Janice. Tracing Thoughts Through Things: The Oldest Pali Texts and the Early Buddhist Archaeology of India and Burma, Royal Netherlands Academy of Arts and Sciences, 2000, page 25). Early in the language's history, it was written in the Brahmi script. Origin and development Etymology The word 'Pali' is used as a name for the language of the Theravada canon. The word seems to have its origins in commentarial traditions, wherein the (in the sense of the line of original text quoted) was distinguished from the commentary or vernacular translation that followed it in the manuscript. K. R. Norman suggests that its emergence was based on a misunderstanding of the compound , with being interpreted as the name of a particu ...
	Universal Coded Character Set The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings, improving as characters from previously unrepresented writing systems are added. The UCS has over 1.1 million possible code points available for use or allocation, but only the first 65,536, which form the Basic Multilingual Plane (BMP), had entered into common use before 2000. This situation began changing when the People's Republic of China (PRC) ruled in 2006 that all software sold in its jurisdiction would have to support GB 18030. This required software intended for sale in the PRC to move beyond the BMP. The system deliberately leaves many code points not assigned to characters, even in the BMP. It does this to allow for future expansion or to minimise conflicts with other encoding forms. ...
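The figures quoted above (over 1.1 million code points, 65,536 in the BMP) follow from simple arithmetic on the code space. The sketch below assumes the usual upper limit of U+10FFFF, which the entry does not state explicitly; the helper names are hypothetical.

```python
# Assumption: the code space runs from U+0000 to U+10FFFF (limit not stated
# in the entry above).
PLANE_SIZE = 0x10000          # 65,536 code points per plane
MAX_CODE_POINT = 0x10FFFF
TOTAL_CODE_POINTS = MAX_CODE_POINT + 1
PLANE_COUNT = TOTAL_CODE_POINTS // PLANE_SIZE


def plane_of(ch: str) -> int:
    """Return the plane number of a character (0 is the Basic Multilingual Plane)."""
    return ord(ch) >> 16


def in_bmp(ch: str) -> bool:
    """Return True if the character lies in the Basic Multilingual Plane."""
    return plane_of(ch) == 0


if __name__ == "__main__":
    print(TOTAL_CODE_POINTS, "code points in", PLANE_COUNT, "planes")
    print(in_bmp("\u0e01"), in_bmp("\U0001F600"))
```

Thai characters sit in the BMP, so they were unaffected by the move beyond the BMP described above.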
	Unicode Block A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole. Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics, surveying, decorative typesetting, social forums, etc. Design and implementation Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of the nature of the symbols, in English; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" a ...
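The block-name comparison rule described above (ignore case, whitespace, hyphens, and underbars) can be sketched as a small normalization step. The function names are hypothetical, and this is one straightforward reading of the rule rather than a reference implementation.

```python
import re

# Characters ignored when comparing block names: whitespace, hyphens, underbars.
_IGNORED = re.compile(r"[\s\-_]")


def normalize_block_name(name: str) -> str:
    """Reduce a block name to a form suitable for loose comparison."""
    return _IGNORED.sub("", name).lower()


def same_block_name(a: str, b: str) -> bool:
    """Return True if two block names are equivalent under the loose rule."""
    return normalize_block_name(a) == normalize_block_name(b)


if __name__ == "__main__":
    print(same_block_name("Supplemental Arrows-A", "supplemental_arrows__a"))
```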
	Thai Industrial Standard 620-2533 Thai Industrial Standard 620-2533, commonly referred to as TIS-620, is the most common character set and character encoding for the Thai language. The standard is published by the Thai Industrial Standards Institute (TISI), an organ of the Ministry of Industry under the Royal Thai Government, and is the sole official standard for encoding Thai in Thailand. The descriptive name of the standard is "Standard for Thai Character Codes for Computers" (Thai: รหัสสำหรับอักขระไทยที่ใช้กับคอมพิวเตอร์). "2533" refers to year 2533 of the Buddhist Era (1990), the year the present version of the standard was published; a previous revision, TIS 620-2529 (1986), is now obsolete. The code page layout is the same between the two editions. TIS-620 is the IANA preferred charset name for TIS-620, and that charset name is used also for ISO/IEC 8859-11 (which adds a no-break space character at 0xA0, which is unassigned in ...
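Because the Thai block is based on TIS-620, the Thai characters keep their relative order, and a TIS-620 byte can be mapped to its Unicode code point by adding a constant. The sketch below assumes the assigned TIS-620 ranges 0xA1..0xDA and 0xDF..0xFB and the offset 0x0D60; both come from the published code charts rather than from the entry above. The function name is hypothetical, and the result is cross-checked against Python's built-in `tis_620` codec.

```python
# Assumption: TIS-620 byte 0xA1 corresponds to U+0E01, giving a constant offset.
TIS620_TO_UNICODE_OFFSET = 0x0E01 - 0xA1  # 0x0D60


def tis620_byte_to_code_point(byte: int) -> int:
    """Map one TIS-620 byte to a Unicode code point (stricter than most codecs)."""
    if byte < 0x80:
        return byte  # ASCII range is unchanged
    if 0xA1 <= byte <= 0xDA or 0xDF <= byte <= 0xFB:
        return byte + TIS620_TO_UNICODE_OFFSET
    raise ValueError(f"byte 0x{byte:02X} is not treated as assigned in this sketch")


if __name__ == "__main__":
    raw = "\u0e2a\u0e27\u0e31\u0e2a\u0e14\u0e35".encode("tis_620")
    manual = "".join(chr(tis620_byte_to_code_point(b)) for b in raw)
    print(manual == raw.decode("tis_620"))
```

Real codecs differ in how they treat the unassigned positions, so the hand-written mapping is only useful for illustrating the layout; production code should rely on an established codec.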
	Unicode Unicode, formally The Unicode Standard, is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, which is maintained by the Unicode Consortium, defines as of the current version (15.0) 149,186 characters covering 161 modern and historic scripts, as well as symbols, emoji (including in colors), and non-visual control and formatting codes. Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software. The standard has been implemented in many recent technologies, including modern operating systems, XML, and most modern programming languages. The Unicode character repertoire is synchronized with Universal Coded Character Set, ISO/IEC 10646, each being code-for-code identical with the other. The Unicode Standard, however, includes more th ...
	Unicode Consortium The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the intention of replacing existing character encoding schemes which are limited in size and scope, and are incompatible with multilingual environments. The consortium describes its overall purpose as: Unicode's success at unifying character sets has led to its widespread adoption in the internationalization and localization of software. The standard has been implemented in many technologies, including XML, the Java programming language, Swift, and modern operating systems. Voting members include computer software and hardware companies with an interest in text-processing standards, including Adobe, Apple, the Bangladesh Computer Council, Emojipedia, Facebook, Google, IBM, Microsoft, the Omani Ministry of Endowments and Religious Affairs, ...
	International Committee For Information Technology Standards The InterNational Committee for Information Technology Standards (INCITS, pronounced "insights") is an ANSI-accredited standards development organization composed of information technology developers. It was formerly known as X3 and NCITS. INCITS is the central U.S. forum dedicated to creating technology standards. INCITS is accredited by the American National Standards Institute (ANSI) and is affiliated with the Information Technology Industry Council, a global policy advocacy organization that represents U.S. and global innovation companies. INCITS coordinates technical standards activity between ANSI in the US and joint ISO/IEC committees worldwide. This provides a mechanism to create standards that will be implemented in many nations. As such, INCITS' Executive Board also serves as ANSI's Technical Advisory Group for ISO/IEC Joint Technical Committee 1. JTC 1 is responsible for international standardization in the field of information technology. INCITS operates ...