Tibetan is a
Unicode block
A Unicode block is one of several contiguous ranges of numeric character codes (code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the ad ...
containing characters for the Tibetan, Dzongkha, and other languages of China, Bhutan, Nepal, Mongolia, northern India, eastern Pakistan and Russia.
Block
Former Tibetan block
The Tibetan Unicode block is unique for having been allocated in version 1.0.0 with a
virama
Virama ( ्) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either
# halanta, hasanta or explicit virā ...
-based encoding that was
unable to distinguish visible and
conjunct consonant
Conjunct consonants are a type of letters, used for example in Brahmi or Brahmi derived modern scripts such as Balinese, Bengali, Devanagari, Gujarati, etc to write consonant clusters such as or . Although most of the time, letters are formed ...
correctly. This encoding was removed from the
Unicode Standard
Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, whic ...
in version 1.0.1 in the process of unifying with
ISO 10646
ISO is the most common abbreviation for the International Organization for Standardization.
ISO or Iso may also refer to: Business and finance
* Iso (supermarket), a chain of Danish supermarkets incorporated into the SuperBest chain in 2007
* Iso ...
for version 1.1,
then reintroduced as an explicit root/subjoined encoding, with a larger block size, in version 2.0. Moving or removing existing characters has been prohibited by the Unicode Stability Policy for all versions following Unicode 2.0, so the Tibetan characters encoded in Unicode 2.0 and all subsequent versions are immutable.
The range of the former Unicode 1.0.0 Tibetan block has been occupied by the
Myanmar block since Unicode 3.0. In
Microsoft Windows
Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for serv ...
,
collation
Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fili ...
data referring to the old Tibetan block was retained as late as
Windows XP
Windows XP is a major release of Microsoft's Windows NT operating system. It was released to manufacturing on August 24, 2001, and later to retail on October 25, 2001. It is a direct upgrade to its predecessors, Windows 2000 for high-end and ...
, and removed in
Windows 2003
Windows Server 2003 is the sixth version of Windows Server operating system produced by Microsoft. It is part of the Windows NT family of operating systems and was released to manufacturing on March 28, 2003 and generally available on April 24, 2 ...
.
History
The following Unicode-related documents record the purpose and process of defining specific characters in the Tibetan block:
Footnotes
References
A Chinese concern posted to the Unicode Consortium citing the conjunct character "སྐྤྵྴྍྐ" (EWTS s+k+p+Sh+sh+x+ka; IAST {{IAST, skpṣśxka), showing the complexity of encoding.(Devanagari encoding never allowed "
ᳵ" to be conjuncted, i.e. "स्क्प्ष्श्ᳵ्क"does not exist.)
Unicode blocks