TIS-620
Thai Industrial Standard 620-2533, commonly referred to as TIS-620, is the most common single-byte character encoding for the Thai language. The standard is published by the Thai Industrial Standards Institute (TISI), an organ of the Ministry of Industry under the Royal Thai Government, and is the sole official standard for encoding Thai in Thailand. The descriptive name of the standard is "Standard for Thai Character Codes for Computers" (Thai: รหัสสำหรับอักขระไทยที่ใช้กับคอมพิวเตอร์). "2533" refers to year 2533 of the Buddhist Era (1990), the year the present version of the standard was published; a previous revision, TIS 620-2529 (1986), is now obsolete. The code page layout is the same between the two editions. TIS-620 is the IANA preferred charset name for TIS-620, and that charset name is also used for ISO/IEC 8859-11 (which adds a no-break space character at 0xA0, which is unassigned in TIS ...
ISO/IEC 8859-11
ISO/IEC 8859-11:2001, Information technology — 8-bit single-byte coded graphic character sets — Part 11: Latin/Thai alphabet, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 2001. It is informally referred to as Latin/Thai. It is nearly identical to the national Thai standard TIS-620 (1990). The sole difference is that ISO/IEC 8859-11 allocates non-breaking space to code 0xA0, while TIS-620 leaves it undefined. (In practice, this small distinction is usually ignored.) ISO-8859-11 is not a main registered IANA charset name despite following the normal pattern for IANA charsets based on the ISO 8859 series. However, it is defined as an alias of the close equivalent TIS-620 (which lacks the non-breaking space), which can be used without problems for ISO/IEC 8859-11, since the no-break space has a code that was unallocated in TIS-620. Microsoft has assigned code page 28601 a.k.a. Windows-28601 to ISO-8859-11 in ...
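The one-byte difference between the two encodings can be probed with a short Python sketch. It assumes the `tis_620` and `iso8859_11` codecs that ship with CPython; the helper `decode_byte` is a hypothetical name introduced here for illustration, and how a given codec implementation treats 0xA0 under TIS-620 may vary, so the script prints the result rather than assuming it.

```python
def decode_byte(value, codec):
    """Decode one byte with the given codec.

    Returns the resulting character, or None when the codec
    leaves that byte value undefined.
    """
    try:
        return bytes([value]).decode(codec)
    except UnicodeDecodeError:
        return None


if __name__ == "__main__":
    # 0xA0 is the only position where the two standards differ;
    # 0xA1 (THAI CHARACTER KO KAI) is shared by both.
    for codec in ("tis_620", "iso8859_11"):
        for value in (0xA0, 0xA1):
            print(codec, hex(value), repr(decode_byte(value, codec)))
```

Returning `None` instead of raising keeps the comparison loop simple; production code would more likely pass an explicit `errors=` policy to `bytes.decode`.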
Unicode
Unicode (also The Unicode Standard, or TUS) is a character encoding standard maintained by the Unicode Consortium, designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 characters and 168 scripts used in various ordinary, literary, academic, and technical contexts. Unicode has largely supplanted the earlier environment of many incompatible character sets used within different locales and on different computer architectures. The entire repertoire of these sets, plus many additional characters, was merged into the single Unicode set. Unicode is used to encode the vast majority of text on the Internet, including most web pages, and Unicode support has become a common consideration in contemporary software development. Unicode is ultimately capable of encoding more than 1.1 million characters. The Unicode character repertoire is synchronized with Univers ...
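The "more than 1.1 million" figure can be reproduced with a little arithmetic. The sketch below assumes the code space layout defined by the Unicode Standard (17 planes of 65,536 code points, with the range U+D800 to U+DFFF reserved for surrogates); those numbers come from the standard itself, not from the summary above.

```python
import sys

PLANES = 17                      # plane 0 (BMP) plus 16 supplementary planes
CODE_POINTS_PER_PLANE = 0x10000  # 65,536 code points per plane

# Size of the whole code space, U+0000 through U+10FFFF.
total_code_points = PLANES * CODE_POINTS_PER_PLANE

# Surrogate code points are reserved for UTF-16 and never encode characters.
surrogates = 0xDFFF - 0xD800 + 1

# Code points that can in principle be assigned to characters.
scalar_values = total_code_points - surrogates

if __name__ == "__main__":
    print(total_code_points)  # 1114112
    print(scalar_values)      # 1112064
    # Python's own upper bound for code points matches the code space.
    print(sys.maxunicode + 1 == total_code_points)
```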
Thai Alphabet
The Thai script is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai script itself (as used to write Thai) has 44 consonant symbols, 16 vowel symbols that combine into at least 32 vowel forms, four tone diacritics, and other diacritics. Although commonly referred to as the Thai alphabet, the script is not a true alphabet but an abugida, a writing system in which the full characters represent consonants with diacritical marks for vowels; the absence of a vowel diacritic gives an implied 'a' or 'o'. Consonants are written horizontally from left to right, and vowels following a consonant in speech are written above, below, to the left or to the right of it, or a combination of those.
History
The Thai script is derived from the Sukhothai script, which itself is derived from the Old Khmer script (akson khom), which is a southern Brahmic style of writing derived from the south Indian Pa ...
Character Encoding
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical values that make up a character encoding are known as code points and collectively comprise a code space or a code page. Early character encodings that originated with optical or electrical telegraphy and in early computers could only represent a subset of the characters used in written languages, sometimes restricted to upper case letters, numerals and some punctuation only. Over time, character encodings capable of representing more characters were created, such as ASCII, the ISO/IEC 8859 encodings, various computer vendor encodings, and Unicode encodings such as UTF-8 and UTF-16. The most popular character encoding on the World Wide Web is UTF-8, which is used in 98.2% of surve ...
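As a small illustration of the distinction between a code point and its encoded bytes, the following Python sketch takes one Thai letter and serializes it under several encodings. The choice of character (ก, THAI CHARACTER KO KAI) and the helper name `encodings_of` are assumptions made for this example only.

```python
def encodings_of(char, codecs=("utf-8", "utf-16-be", "tis_620")):
    """Map each codec name to the bytes it produces for a single character."""
    return {codec: char.encode(codec) for codec in codecs}


if __name__ == "__main__":
    ko_kai = "\u0e01"  # ก, the first Thai consonant

    # The code point is a number assigned by Unicode, independent of storage.
    print(hex(ord(ko_kai)))

    # The same character becomes different byte sequences per encoding.
    for codec, data in encodings_of(ko_kai).items():
        print(codec, data.hex(" "))
```

The single-byte Thai encoding needs one byte, UTF-16 needs two, and UTF-8 needs three, which is the kind of trade-off that distinguishes legacy code pages from the Unicode encodings.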
Percent Sign
The percent sign (sometimes per cent sign in British English) is the symbol used to indicate a percentage, a number or ratio as a fraction of 100. Related signs include the permille (per thousand) sign and the permyriad (per ten thousand) sign (also known as a basis point), which indicate that a number is divided by one thousand or ten thousand, respectively. Higher proportions use parts-per notation.
Correct style
Form and spacing
English style guides prescribe writing the percent sign following the number without any space between (e.g. 50%). However, the International System of Units and ISO 31-0 standard prescribe a space between the number and percent sign, in line with the general practice of using a non-breaking space between a numerical value and its corresponding unit of measurement. Other languages have other rules for spacing in front of the percent sign:
* In Czech and in Slovak ...
Quotation Mark
Quotation marks are punctuation marks used in pairs in various writing systems to identify direct speech, a quotation, or a phrase. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the same glyph. Quotation marks have a variety of forms in different languages and in different media.
History
The single quotation mark is traced to Ancient Greek practice, adopted and adapted by monastic copyists. Isidore of Seville, in his seventh-century encyclopedia, described their use of the Greek diplé (a chevron). The double quotation mark derives from a marginal notation used in fifteenth-century manuscript annotations to indicate a passage of particular importance (not necessarily a quotation); the notation was placed in the outside margin of the page and was repeated alongside each line of the passage. In his edition of the works of Aristotle, which appeared in 1483 or 1484, the Milanese Renaissance humanis ...
Number Sign
The symbol # is known as the number sign, hash, or (in North America) the pound sign. The symbol has historically been used for a wide range of purposes, including the designation of an ordinal number and as a ligatured abbreviation for pounds avoirdupois, having been derived from the now-rare ℔. Since 2007, widespread usage of the symbol to introduce metadata tags on social media platforms has led to such tags being known as "hashtags", and from that, the symbol itself is sometimes called a hashtag. The symbol is distinguished from similar symbols by its combination of level horizontal strokes and right-tilting vertical strokes.
History
It is believed that the symbol traces its origins to the symbol ℔, an abbreviation of the Roman term libra pondo, which translates as "pound weight". The abbreviation "lb" was printed as a dedicated ligature including a horizontal line across (which indicated abbreviation ...
Dollar Sign
The dollar sign, also known as the peso sign, is a currency symbol consisting of a capital S crossed with one or two vertical strokes (depending on typeface), used to indicate the unit of various currencies around the world, including most currencies denominated "dollar" or "peso". The explicitly double-barred sign is called cifrão in the Portuguese language. The sign is also used in several compound currency symbols, such as the Brazilian real (R$) and the United States dollar (US$): in local use, the nationality prefix is usually omitted. In countries that have other currency symbols, the US dollar is often assumed and the "US" prefix omitted. The one- and two-stroke versions are often considered mere stylistic (typeface) variants, although in some places and epochs one of them may have been specifically assigned, by law or custom, to a specific currency. The Unicode computer encoding standard defines a single code for both. In most English l ...
Apostrophe
The apostrophe is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes:
* The marking of the omission of one or more letters, e.g. the contraction of "do not" to "don't"
* The marking of the possessive case of nouns (as in "the eagle's feathers", "in one month's time", "the twins' coats")
It is also used in a few exceptional cases for the marking of plurals, e.g. "p's and q's" or Oakland A's. The same mark is used as a single quotation mark. It is also substituted informally for other marks, for example instead of the prime symbol to indicate the units of foot or minutes of arc. The word apostrophe comes from the Greek (hē apóstrophos [prosōidía], '[the accent of] turning away or elision'), through Latin and French ...
Ampersand
The ampersand, also known as the and sign, is the logogram &, representing the conjunction "and". It originated as a ligature of the letters of the word et (Latin for "and").
Etymology
Traditionally in English, when spelling aloud, any letter that could also be used as a word in itself ("A", "I", and "O") was referred to by the Latin expression per se ('by itself'), as in "per se A" or "A per se A". The character &, when used by itself as opposed to more extended forms such as &c., was similarly referred to as "and per se and". This last phrase was routinely slurred to "ampersand", and the term had entered common English usage by 1837. It has been falsely claimed that André-Marie Ampère used the symbol in his widely read publications and that people began calling the new shape "Ampère's and".
History
The ampersand can be traced back to the 1st century AD and the old Roma ...
Space Character
A whitespace character is a character data element that represents white space when text is rendered for display by a computer. For example, a space character (ASCII 32) represents blank space such as a word divider in a Western script. A printable character results in output when rendered, but a whitespace character does not. Instead, whitespace characters define the layout of text to a limited degree, interrupting the normal sequence of rendering characters next to each other. The output of subsequent characters is typically shifted to the right (or to the left for right-to-left script) or to the start of the next line. The effect of multiple sequential whitespace characters is cumulative such that the next printable character is rendered at a location based on the accumulated effect of preceding whitespace characters. The origin of the term whitespace is rooted in the common practice of rendering text on white paper. Normally, a whitespace character is n ...
Parenthesis
A bracket is either of two tall fore- or back-facing punctuation marks commonly used to isolate a segment of text or data from its surroundings. They come in four main pairs of shapes, as given in the box to the right, which also gives their names, that vary between British and American English. "Brackets", without further qualification, are in British English the ... marks and in American English the ... marks. Other symbols are repurposed as brackets in specialist contexts, such as those used by linguists. Brackets are typically deployed in symmetric pairs, and an individual bracket may be identified as a "left" or "right" bracket or, alternatively, an "opening bracket" or "closing bracket", respectively, depending on the directionality of the context. In casual writing and in technical fields such as computing or linguistic analysis of grammar, brackets ne ...