Khmer script ( km, អក្សរខ្មែរ, )
[Huffman, Franklin. 1970. ''Cambodian System of Writing and Beginning Reader''. Yale University Press. .] is an
abugida
An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
(alphasyllabary) script used to write the
Khmer language
Khmer (; , ) is an Austroasiatic languages, Austroasiatic language spoken by the Khmer people, and the Official language, official and national language of Cambodia. Khmer has been influenced considerably by Sanskrit and Pāli, Pali, especiall ...
, the official language of
Cambodia
Cambodia (; also Kampuchea ; km, កម្ពុជា, UNGEGN: ), officially the Kingdom of Cambodia, is a country located in the southern portion of the Indochinese Peninsula in Southeast Asia, spanning an area of , bordered by Thailand t ...
. It is also used to write
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
in the Buddhist liturgy of Cambodia and Thailand.
Khmer is written from
left to right
A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form ...
. Words within the same sentence or phrase are generally run together with no
space
Space is the boundless three-dimensional extent in which objects and events have relative position and direction. In classical physics, physical space is often conceived in three linear dimensions, although modern physicists usually consider ...
s between them.
Consonant cluster
In linguistics, a consonant cluster, consonant sequence or consonant compound, is a group of consonants which have no intervening vowel. In English, for example, the groups and are consonant clusters in the word ''splits''. In the education fie ...
s within a word are "stacked", with the second (and occasionally third) consonant being written in reduced form under the main consonant. Originally there were 35 consonant characters, but modern Khmer uses only 33. Each character represents a consonant sound together with an
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. For example, if the Latin alphabet used 'i' as an inherent vowel, "Wikipedia" could be rendered as "W ...
, either ''â'' or ''ô''; in many cases, in the absence of another vowel mark, the inherent vowel is to be pronounced after the consonant.
There are some independent
vowel
A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
characters, but vowel sounds are more commonly represented as dependent vowels, additional marks accompanying a consonant character, and indicating what vowel sound is to be pronounced after that consonant (or consonant cluster). Most dependent vowels have two different pronunciations, depending in most cases on the inherent vowel of the consonant to which they are added. There are also a number of
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s used to indicate further modifications in pronunciation. The script also includes its own
numerals
A numeral is a figure, symbol, or group of figures or symbols denoting a number. It may refer to:
* Numeral system used in mathematics
* Numeral (linguistics), a part of speech denoting numbers (e.g. ''one'' and ''first'' in English)
* Numerical d ...
and
punctuation mark
Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
s.
Origin
The Khmer script was adapted from the
Pallava script
The Pallava script or Pallava Grantha, is a Brahmic scripts, Brahmic script, named after the Pallava dynasty of South India, attested since the 4th century AD. As epigrapher Arlo Griffiths makes clear, however, the term is misleading as not all o ...
, used in southern India and South East Asia during the 5th and 6th centuries AD, which ultimately descended from the
Tamil-Brahmi
Tamil-Brahmi, also known as Tamizhi or Damili, was a variant of the Brahmi script in southern India. It was used to write inscriptions in the early form of Old Tamil.Richard Salomon (1998) ''Indian Epigraphy: A Guide to the Study of Inscription ...
script,. The oldest dated inscription in Khmer was found at
Angkor Borei District
Angkor Borei ( km, អង្គរបូរី, ) is a district located in Takéo Province, in southern Cambodia. According to the 1998 census of Cambodia, it had a population of 44,980.
Administration
The district has 6 communes, 34 villages ( ...
in
Takéo Province south of Phnom Penh and dates from 611. Stelae of the Pre-Angkorean and Angkorean periods, featuring the Khmer script, have been found throughout the former
Khmer Empire, from the
Mekong Delta
The Mekong Delta ( vi, Đồng bằng Sông Cửu Long, lit=Nine Dragon River Delta or simply vi, Đồng Bằng Sông Mê Kông, lit=Mekong River Delta, label=none), also known as the Western Region ( vi, Miền Tây, links=no) or South-weste ...
to what is now southern
Laos
Laos (, ''Lāo'' )), officially the Lao People's Democratic Republic ( Lao: ສາທາລະນະລັດ ປະຊາທິປະໄຕ ປະຊາຊົນລາວ, French: République démocratique populaire lao), is a socialist ...
,
Northeast Thailand
Northeast Thailand or Isan (Isan/ th, อีสาน, ; lo, ອີສານ; also written as Isaan, Isarn, Issarn, Issan, Esan, or Esarn; from Pali ''īsānna'' or Sanskrit ईशान्य ''īśānya'' "northeast") consists of 20 provin ...
, and
Central Thailand
Central Thailand (Central plain) or more specifically Siam (also known as Suvarnabhumi and Dvaravati) is one of the regions of Thailand, covering the broad alluvial plain of the Chao Phraya River. It is separated from northeast Thailand (Isan) by ...
.
The modern Khmer script differs somewhat from precedent forms seen on the inscriptions of the ruins of
Angkor
Angkor ( km, អង្គរ , 'Capital city'), also known as Yasodharapura ( km, យសោធរបុរៈ; sa, यशोधरपुर),Headly, Robert K.; Chhor, Kylin; Lim, Lam Kheng; Kheang, Lim Hak; Chun, Chen. 1977. ''Cambodian-Engl ...
. The
Thai
Thai or THAI may refer to:
* Of or from Thailand, a country in Southeast Asia
** Thai people, the dominant ethnic group of Thailand
** Thai language, a Tai-Kadai language spoken mainly in and around Thailand
*** Thai script
*** Thai (Unicode block ...
and
Lao scripts are descendants of an older cursive form of the Khmer script, through the
Sukhothai script
The Sukhothai script, also known as the ''proto-Thai script'' and ''Ram Khamhaeng alphabet'', is a Brahmic script which originated in the Sukhothai Kingdom. The script is found on the Ram Khamhaeng Inscription and the ''Lö Thai inscription''.
...
.
Consonants
There are 35 Khmer
consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wit ...
symbols, although modern Khmer only uses 33, two having become obsolete. Each consonant has an
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. For example, if the Latin alphabet used 'i' as an inherent vowel, "Wikipedia" could be rendered as "W ...
: ''â'' or ''ô'' ; equivalently, each consonant is said to belong to the ''a''-series or ''o''-series. A consonant's series determines the pronunciation of the
dependent vowel
A dependant is a person who relies on another as a primary source of income. A common-law spouse who is financially supported by their partner may also be included in this definition. In some jurisdictions, supporting a dependant may enabl ...
symbols which may be attached to it, and in some positions the sound of the inherent vowel is itself pronounced.
The two series originally represented
voiceless
In linguistics, voicelessness is the property of sounds being pronounced without the larynx vibrating. Phonologically, it is a type of phonation, which contrasts with other states of the larynx, but some object that the word phonation implies v ...
and
voiced
Voice or voicing is a term used in phonetics and phonology to characterize speech sounds (usually consonants). Speech sounds can be described as either voiceless (otherwise known as ''unvoiced'') or voiced.
The term, however, is used to refer ...
consonants respectively (and are still referred to as such in Khmer).
Sound change
A sound change, in historical linguistics, is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic chang ...
s during the
Middle Khmer
Middle Khmer is the historical stage of the Khmer language as it existed between the 14th and 18th centuries, spanning the period between Old Khmer and the modern language. The beginning of the Middle Khmer period roughly coincides with the fall ...
period affected vowels following voiceless consonants, and these changes were preserved even though the distinctive voicing was lost (see
phonation in Khmer).
Each consonant, with one exception, also has a subscript form. These may also be called "sub-consonants"; the Khmer phrase is ', meaning "foot of a letter". Most subscript consonants resemble the corresponding consonant symbol, but in a smaller and possibly simplified form, although in a few cases there is no obvious resemblance. Most subscript consonants are written directly below other consonants, although subscript ' appears to the left, while a few others have ascending elements which appear to the right.
Subscripts are used in writing
consonant cluster
In linguistics, a consonant cluster, consonant sequence or consonant compound, is a group of consonants which have no intervening vowel. In English, for example, the groups and are consonant clusters in the word ''splits''. In the education fie ...
s (consonants pronounced consecutively in a word with no vowel sound between them). Clusters in Khmer normally consist of two consonants, although occasionally in the middle of a word there will be three. The first consonant in a cluster is written using the main consonant symbol, with the second (and third, if present) attached to it in subscript form. Subscripts were previously also used to write final consonants; in modern Khmer this may be done, optionally, in some words ending ''-ng'' or ''-y'', such as ' ("give").
The consonants and their subscript forms are listed in the following table. Usual phonetic values are given using the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic transcription, phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standa ...
(IPA); variations are described below the table. The sound system is described in detail at
Khmer phonology
Khmer (; , ) is an Austroasiatic language spoken by the Khmer people, and the official and national language of Cambodia. Khmer has been influenced considerably by Sanskrit and Pali, especially in the royal and religious registers, through Hi ...
. The spoken
name
A name is a term used for identification by an external observer. They can identify a class or category of things, or a single thing, either uniquely, or within a given context. The entity identified by a name is called its referent. A personal ...
of each consonant letter is its value together with its inherent vowel. Transliterations are given using the transcription system of the ''Geographic Department of the Cambodian Ministry of Land Management and Urban Planning'' used by the Cambodian government and the
UNGEGN
The United Nations Group of Experts on Geographical Names (UNGEGN) is one of the nine expert groups of the United Nations Economic and Social Council (ECOSOC) and deals with the national and international standardization of geographical names. Ev ...
system;
[Report on the Current Status of United Nations Romanization Systems for Geographical Names – Khmer]
UNGEGN Working Group on Romanization Systems, September 2013 (linked fro
WGRS website
. for other systems see
Romanization of Khmer
The romanization of Khmer is a representation of the Khmer (Cambodian) language using letters of the Latin alphabet. This is most commonly done with Khmer proper nouns, such as names of people and geographical names, as in a gazetteer.
Romanizat ...
.
The letter appears in somewhat modified form (e.g. ) when combined with certain dependent vowels (see
Ligatures).
The letter ''nhô'' is written without the lower curve when a subscript is added. When it is subscripted to itself, the subscript is a smaller form of the entire letter: ''-nhnh-''.
Note that ' and ' have the same subscript form. In initial clusters this subscript is always pronounced , but in medial positions it is in some words and in others.
The series ', ', ', ', ' originally represented
retroflex consonant
A retroflex ( /ˈɹɛtʃɹoːflɛks/), apico-domal ( /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated between the alveolar ridge and the har ...
s in the Indic parent scripts. The second, third and fourth of these are rare, and occur only for etymological reasons in a few Pali and Sanskrit loanwords. Because the sound /n/ is common, and often grammatically productive, in Mon-Khmer languages, the fifth of this group, , was adapted as an a-series counterpart of ' for convenience (all other nasal consonants are o-series).
Variation in pronunciation
The aspirated consonant letters (''kh-'', ''chh-'', ''th-'', ''ph-'') are pronounced with aspiration only before a vowel. There is also slight aspiration with ''k'', ''ch'', ''t'' and ''p'' sounds before
certain consonants, but this is regardless of whether they are spelt with a letter that indicates aspiration.
A Khmer word cannot end with more than one consonant sound, so subscript consonants at the end of words (which appear for etymological reasons) are not pronounced, although they may come to be pronounced when the same word begins a compound.
In some words, a single medial consonant symbol represents both the final consonant of one syllable and the initial consonant of the next.
The letter ''bâ'' represents only before a vowel. When final or followed by a subscript consonant, it is pronounced (and in the case where it is followed by a subscript consonant, it is also romanized as ''p'' in the UN system). For modification to ''p'' by means of a diacritic, see
Supplementary consonants. The letter, which represented /p/ in Indic scripts, also often maintains the sound in certain words borrowed from Sanskrit and Pali.
The letters ''dâ'' and ''dô'' are pronounced when final. The letter ''tâ'' is pronounced in initial position in a weak syllable ending with a nasal.
In final position, letters representing a sound (''k-'', ''kh-'') are pronounced as a glottal stop after the vowels , , , , , , , , . The letter ' is silent when final (in most dialects; see
Northern Khmer). The letter ' when final is pronounced (which in this position approaches ).
Supplementary consonants
The Khmer writing system includes supplementary consonants, used in certain
loanword
A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because th ...
s, particularly from
French
French (french: français(e), link=no) may refer to:
* Something of, from, or related to France
** French language, which originated in France, and its various dialects and accents
** French people, a nation and ethnic group identified with Franc ...
and
Thai
Thai or THAI may refer to:
* Of or from Thailand, a country in Southeast Asia
** Thai people, the dominant ethnic group of Thailand
** Thai language, a Tai-Kadai language spoken mainly in and around Thailand
*** Thai script
*** Thai (Unicode block ...
. These mostly represent sounds which do not occur in native words, or for which the native letters are restricted to one of the two vowel series. Most of them are
digraphs, formed by stacking a subscript under the letter ''hâ'', with an additional ''treisăpt''
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
if required to change the inherent vowel to ''ô''. The character for ''pâ'', however, is formed by placing the ''musĕkâtônd'' ("mouse teeth") diacritic over the character ''bâ''.
Dependent vowels
Most Khmer vowel sounds are written using dependent, or
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
al, vowel symbols, known in Khmer as or ("connecting vowel"). These can only be written in combination with a consonant (or consonant cluster). The vowel is pronounced after the consonant (or cluster), even though some of the symbols have graphical elements which appear above, below or to the left of the consonant character.
Most of the vowel symbols have two possible pronunciations, depending on the inherent vowel of the consonant to which it is added. Their pronunciations may also be different in
weak syllables, and when they are shortened (e.g. by means of a diacritic).
Absence of a dependent vowel (or diacritic) often implies that a syllable-initial consonant is followed by the sound of its inherent vowel.
In determining the inherent vowel of a consonant cluster (i.e. how a following dependent vowel will be pronounced),
stops
Stop may refer to:
Places
*Stop, Kentucky, an unincorporated community in the United States
* Stop (Rogatica), a village in Rogatica, Republika Srpska, Bosnia and Herzegovina
Facilities
* Bus stop
* Truck stop, a type of rest stop for truck dri ...
and
fricatives
A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in t ...
are dominant over
sonorant
In phonetics and phonology, a sonorant or resonant is a speech sound that is produced with continuous, non-turbulent airflow in the vocal tract; these are the manners of articulation that are most often voiced in the world's languages. Vowels are ...
s. For any consonant cluster including a combination of these sounds, a following dependent vowel is pronounced according to the dominant consonant, regardless of its position in the cluster. When both members of a cluster are dominant, the subscript consonant determines the pronunciation of a following dependent vowel.
A non-dominant consonant (and in some words also ''hâ'') will also have its inherent vowel changed by a preceding dominant consonant in the same word, even when there is a vowel between them, although some words (especially among those with more than two syllables) do not obey this rule.
The dependent vowels are listed below, in conventional form with a dotted circle as a dummy consonant symbol, and in combination with the a-series letter ''’â''. The IPA values given are representative of dialects from the northwest and central plains regions, specifically from the
Battambang
Battambang ( km, បាត់ដំបង, UNGEGN: ) is the capital of Battambang Province and the third largest city in Cambodia.
Founded in the 11th century by the Khmer Empire, Battambang is the leading rice-producing province of the coun ...
area, upon which
Standard Standard may refer to:
Symbols
* Colours, standards and guidons, kinds of military signs
* Standard (emblem), a type of a large symbol or emblem used for identification
Norms, conventions or requirements
* Standard (metrology), an object th ...
Khmer is based. Vowel pronunciation varies widely in other dialects such as
Northern Khmer, where diphthongs are leveled, and
Western Khmer, in which
breathy voice
Breathy voice (also called murmured voice, whispery voice, soughing and susurration) is a phonation in which the vocal folds vibrate, as they do in normal (modal) voicing, but are adjusted to let more air escape which produces a sighing-like ...
and
modal voice
Modal voice is the vocal register used most frequently in speech and singing in most languages. It is also the term used in linguistics for the most common phonation of vowels. The term "modal" refers to the resonant mode of vocal folds; that is ...
phonation
The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the defini ...
s are still contrastive.
The spoken name of each dependent vowel consists of the word ''srăk'' ("vowel") followed by the vowel's a-series value preceded by a glottal stop (and also followed by a glottal stop in the case of short vowels).
Modification by diacritics
The addition of some of the
Khmer diacritics can modify the length and value of inherent or dependent vowels.
The following table shows combinations with the ' and ' diacritics, representing final and . They are shown with the a-series consonant ''’â''.
The first four configurations listed here are treated as dependent vowels in their own right, and have names constructed in the same way as for the other dependent vowels (described in the previous section).
Other rarer configurations with the ' are (or ), pronounced , and , pronounced . The word "yes" (used by women) is pronounced
aːand rarely .
The ''bânták'' (a small vertical line written over the final consonant of a syllable) has the following effects:
*in a syllable with inherent ''â'', the vowel is shortened to , UN transcription ''á''
*in a syllable with inherent ''ô'', the vowel is modified to before a final
labial
The term ''labial'' originates from '' Labium'' (Latin for "lip"), and is the adjective that describes anything of or related to lips, such as lip-like structures. Thus, it may refer to:
* the lips
** In linguistics, a labial consonant
** In zoolog ...
, otherwise usually to ; UN transcription ''ó''
*in a syllable with the ''a'' dependent vowel symbol () in the a-series, the vowel is shortened to , UN transcription ''ă''
*in a syllable with that vowel symbol in the o-series, the vowel is modified to , UN transcription ''oă'', or to ''eă'' before ''k'', ''ng'', ''h''
The ' is equivalent to the ''a'' dependent vowel with the '. However, its o-series pronunciation becomes before final ''y'', and before final (silent) ''r''.
The ''yŭkôlpĭntŭ'' (pair of dots) represents (a-series) or (o-series), followed by a glottal stop.
Consonants with no dependent vowel
There are three environments where a consonant may appear without a dependent vowel. The rules governing the inherent vowel differ for all three environments. Consonants may be written with no dependent vowel as an initial consonant of a
weak syllable, an initial consonant of a strong syllable or as the final letter of a written word.
In careful speech, initial consonants without a dependent vowel in weak initial syllables are pronounced with their inherent vowel shortened as if modified by the ''bânták'' diacritic (see previous section). For example the first-series letter "" in "" ("torch") is pronounced with the short vowel . The second-series letter "" in "" ("light") is pronounced with the short diphthong . In casual speech, these are most often reduced to for both series.
Initial consonants in strong syllables without written vowels are pronounced with their inherent vowels. The word ("to tie") is pronounced , ("weak", "to sink") is pronounced . In some words, however, the inherent vowel is pronounced in its reduced form, as if modified by a ''bântăk'' diacritic, even though the diacritic is not written (e.g. "corpse"). Such reduction regularly takes place in words ending with a consonant with a silent subscript (such as "every"), although in most such words it is the ''bânták''-reduced form of the vowel ''a'' that is heard, as in "noise". The word "you, person" has the highly irregular pronunciation .
Consonants written as the final letter of a word usually represent a word-final sound and are pronounced without any following vowel and, in the case of stops, with
no audible release
A stop with no audible release, also known as an unreleased stop or an applosive, is a stop consonant with no release burst: no audible indication of the end of its occlusion (hold). In the International Phonetic Alphabet, lack of an audible relea ...
as in the examples above. However, in some words adopted from
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
and
Sanskrit
Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
, what would appear to be a final consonant under normal rules can actually be the initial consonant of a following syllable and pronounced with a short vowel as if followed by . For example, according to rules for native Khmer words, ("good", "clean", "beautiful") would appear to be a single syllable, but, being derived from Pali ''subha'', it is pronounced .
Ligatures
Most consonants, including a few of the subscripts, form
ligatures with the vowel (ា) and with all other dependent vowels that contain the same cane-like symbol. Most of these ligatures are easily recognizable, but a few may not be, particularly those involving the letter . This combines with the a vowel in the form , created to differentiate it from the consonant symbol and also from the ligature for with ().
Some more examples of ligatured symbols follow:
: Another example with , forming a similar ligature to that described above. Here the vowel is not a itself, but another vowel (au) which contains the cane-like stroke of that vowel as a graphical element.
: An example of the vowel a forming a connection with the
serif
In typography, a serif () is a small line or stroke regularly attached to the end of a larger stroke in a letter or symbol within a particular font or family of fonts. A typeface or "font family" making use of serifs is called a serif typeface ...
of a consonant.
: Subscript consonants with ascending strokes above the baseline also form ligatures with the vowel symbol.
: Another example of a subscript consonant forming a ligature, this time with the vowel .
: The subscript for is written to the left of the main consonant, in this case , which here forms a ligature with .
Independent vowels
Independent vowels are non-diacritical vowel characters that stand alone (i.e. without being attached to a consonant symbol). In Khmer they are called ''sră pénh tuŏ'', which means "complete vowels". They are used in some words to represent certain combinations of a vowel with an initial
glottal stop
The glottal plosive or stop is a type of consonantal sound used in many spoken languages, produced by obstructing airflow in the vocal tract or, more precisely, the glottis. The symbol in the International Phonetic Alphabet that represents thi ...
or
liquid
A liquid is a nearly incompressible fluid that conforms to the shape of its container but retains a (nearly) constant volume independent of pressure. As such, it is one of the four fundamental states of matter (the others being solid, gas, a ...
. The independent vowels are used in a small number of words, mostly of Indic origin, and consequently there is some inconsistency in their use and pronunciations.
[ However, a few words in which they occur are used quite frequently; these include: "now", "father", "or", "hear", "give, let", "oneself, I, you", "where".
Independent vowel letters are named similarly to the dependent vowels, with the word ''sră'' ("vowel") followed by the principal sound of the letter (the pronunciation or first of the pronunciations listed above), followed by an additional glottal stop after a short vowel. However the letter ឥ is called ''sră ĕ'' .
]
Diacritics
The Khmer writing system contains several diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s (, , ), used to indicate further modifications in pronunciation.
Dictionary order
For the purpose of dictionary ordering of words, main consonants, subscript consonants and dependent vowels are all significant; and when they appear in combination, they are considered in the order in which they would be spoken (main consonant, subscript, vowel). The order of the consonants
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wit ...
and of the dependent vowels is the order in which they appear in the above tables. A syllable written without any dependent vowel is treated as if it contained a vowel character that precedes all the visible dependent vowels.
As mentioned above, the four configurations with diacritics exemplified in the syllables are treated as dependent vowels in their own right, and come in that order at the end of the list of dependent vowels. Other configurations with the ''reăhmŭkh'' diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
are ordered as if that diacritic were a final consonant coming after all other consonants. Words with the ''bânták'' and ''sâmyoŭk sânhnhéa'' diacritics are ordered directly after identically spelled words without the diacritics.
Vowels precede consonants in the ordering, so a combination of main and subscript consonants comes after any instance in which the same main consonant appears unsubscripted before a vowel.
Words spelled with an independent vowel whose sound begins with a glottal stop follow after words spelled with the equivalent combination of ''’â'' plus dependent vowel. Words spelled with an independent vowel whose sound begins or follow after all words beginning with the consonants ''rô'' and ''lô'' respectively.
Words spelled with a consonant modified by a diacritic follow words spelled with the same consonant and dependent vowel symbol but without the diacritic. However, words spelled with (a ''bâ'' converted to a ''p'' sound by a diacritic) follow all words with unmodified ''bâ'' (without diacritic and without subscript). Sometimes words in which is pronounced ''p'' are ordered as if the letter were written .
Numerals
The numerals of the Khmer script, similar to that used by other civilizations in Southeast Asia, are also derived from the southern Indian script. Western-style Arabic numerals
Arabic numerals are the ten numerical digits: , , , , , , , , and . They are the most commonly used symbols to write Decimal, decimal numbers. They are also used for writing numbers in other systems such as octal, and for writing identifiers ...
are also used, but to a lesser extent.
In large numbers, groups of three digits are delimited with Western-style periods. The decimal point
A decimal separator is a symbol used to separate the integer part from the fractional part of a number written in decimal form (e.g., "." in 12.45). Different countries officially designate different symbols for use as the separator. The choi ...
is represented by a comma. The Cambodian currency, the riel Riel may refer to:
Places
*Riel, Netherlands, a town in the Netherlands
*Riel (electoral district), a provincial electoral district in Manitoba, Canada, named after Louis Riel
* Riel, Winnipeg, a community committee comprising three city wards
Peo ...
, is abbreviated using the symbol or simply the letter ''rô''.
Spacing and punctuation
Spaces Spaces may refer to:
* Google Spaces (app), a cross-platform application for group messaging and sharing
* Windows Live Spaces, the next generation of MSN Spaces
* Spaces (software), a virtual desktop manager implemented in Mac OS X Leopard
* Spac ...
are not used between all words in written Khmer. Spaces are used within sentences in roughly the same places as comma
The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline ...
s might be in English, although they may also serve to set off certain items such as numbers and proper names.
Western-style punctuation mark
Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
s are quite commonly used in modern Khmer writing, including French-style guillemet
Guillemets (, also , , ) are a pair of punctuation marks in the form of sideways double chevrons, and , used as quotation marks in a number of languages. In some of these languages "single" guillemets, and , are used for a quotation inside ano ...
s for quotation marks
Quotation marks (also known as quotes, quote marks, speech marks, inverted commas, or talking marks) are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase. The pair consists of an ...
. However, traditional Khmer punctuation marks are also used; some of these are described in the following table.
A hyphen
The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes (figure d ...
( ''sâhâ sânhnhéa'') is commonly used between components of personal names, and also as in English when a word is divided between lines of text. It can also be used between numbers to denote ranges or dates. Particular uses of Western-style periods include grouping of digits in large numbers (see ''Numerals
A numeral is a figure, symbol, or group of figures or symbols denoting a number. It may refer to:
* Numeral system used in mathematics
* Numeral (linguistics), a part of speech denoting numbers (e.g. ''one'' and ''first'' in English)
* Numerical d ...
'' hereinbefore) and denotation of abbreviation
An abbreviation (from Latin ''brevis'', meaning ''short'') is a shortened form of a word or phrase, by any method. It may consist of a group of letters or words taken from the full version of the word or phrase; for example, the word ''abbrevia ...
s.
Styles
Several styles of Khmer writing are used for varying purposes. The two main styles are (literally "slanted script") and ("round script").
* () refers to oblique
Oblique may refer to:
* an alternative name for the character usually called a slash (punctuation) ( / )
* Oblique angle, in geometry
*Oblique triangle, in geometry
*Oblique lattice, in geometry
* Oblique leaf base, a characteristic shape of the b ...
letters. Entire bodies of text such as novels and other publications may be produced in ''âksâr chriĕng''. Unlike in written English
English orthography is the writing system used to represent spoken English, allowing readers to connect the graphemes to sound and to meaning. It includes English's norms of spelling, hyphenation, capitalisation, word breaks, emphasis, and p ...
, oblique lettering does not represent any grammatical differences such as emphasis or quotation. Handwritten Khmer is often written in the oblique style.
* () or () refers to upright or 'standing' letters, as opposed to oblique letters. Most modern Khmer typeface
A typeface (or font family) is the design of lettering that can include variations in size, weight (e.g. bold), slope (e.g. italic), width (e.g. condensed), and so on. Each of these variations of the typeface is a font.
There are list of type ...
s are designed in this manner instead of being oblique, as text can be italicized by way of word processor commands and other computer applications to represent the oblique manner of ''âksâr chriĕng''.
* (), also known as the Khom Thai script
The Khom script ( th, อักษรขอม, akson khom, or later th, อักษรขอมไทย, akson khom thai; lo, ອັກສອນຂອມ, Aksone Khom; km, អក្សរខម, âksâr khâm) is a Brahmic script and a vari ...
, is a style used in Pali palm-leaf manuscript
Palm-leaf manuscripts are manuscript
A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand – or, once practical typewriters became available, typewritten – as opposed ...
s. It is characterized by sharper serifs and angles and retainment of some antique characteristics, notably in the consonant ''kâ'' (). This style is also for yantra tattoos and yantra
Yantra () (literally "machine, contraption") is a geometrical diagram, mainly from the Tantric traditions of the Indian religions. Yantras are used for the worship of deities in temples or at home; as an aid in meditation; used for the benefits ...
s on cloth, paper, or engravings on brass plates in Cambodia as well as in Thailand.[This particular style of Khmer shall not be confused with another script with the same name, described by ]Paul Sidwell
Paul James Sidwell is an Australian linguist based in Canberra, Australia who has held research and lecturing positions at the Australian National University. Sidwell, who is also an expert and consultant in forensic linguistics, is most notable ...
(see Khom script (Ong Kommadam)
The Khom script is a writing system used in Laos. The term "Khom" is also used to refer to the Ancient Khmer lettering used in Thailand's Buddhist temples to inscribe sacred Buddhist mantras and prayers, but that is an entirely different script.
...
).
* () is calligraphical
Calligraphy (from el, link=y, καλλιγραφία) is a Visual arts, visual art related to writing. It is the design and execution of lettering with a pen, ink brush, or other writing instrument. Contemporary calligraphic practice can be ...
style similar to ''âksâr khâm'' as it also retains some characters reminiscent of antique Khmer script. Its name in Khmer means literally 'round script' and it refers to the bold and thick lettering style. It is used for titles and headings in Cambodian documents, on books, banknotes, shop signs and banners. It is sometimes used to emphasize royal names or other important names.
Unicode
The basic Khmer block was added to the Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
Standard in version 3.0, released in September 1999. It then contained 103 defined code points; this was extended to 114 in version 4.0, released in April 2003. Version 4.0 also introduced an additional block, called Khmer Symbols
Khmer Symbols is a Unicode block
A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purpose ...
, containing 32 signs used for writing lunar dates.
The Unicode block for basic Khmer characters is U+1780–U+17FF:
The first 35 characters are the consonant letters (including two obsolete). The symbols at U+17A3 and U+17A4 are deprecated (they were intended for use in Pali and Sanskrit transliteration, but are identical in appearance to the consonant , written alone or with the ''a'' vowel). These are followed by the 15 independent vowels (including one obsolete and one variant form). The code points U+17B4 and U+17B5 are invisible combining marks for inherent vowels, intended for use only in special applications.
Next come the 16 dependent vowel signs and the 12 diacritics
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacritic ...
(excluding the ''kbiĕh kraôm'', which is identical in form to the ''ŏ'' dependent vowel); these are represented together with a dotted circle, but should be displayed appropriately in combination with a preceding Khmer letter.
The code point U+17D2, called ', meaning "foot", is used to indicate that a following consonant is to be written in subscript form. It is not normally visibly rendered as a character. U+17D3 was originally intended for use in writing lunar dates, but its use is now discouraged (see the Khmer Symbols block hereafter). The next seven characters are the punctuation marks
Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
listed hereinbefore; these are followed by the riel Riel may refer to:
Places
*Riel, Netherlands, a town in the Netherlands
*Riel (electoral district), a provincial electoral district in Manitoba, Canada, named after Louis Riel
* Riel, Winnipeg, a community committee comprising three city wards
Peo ...
currency symbol, a rare sign corresponding to the Sanskrit avagraha
Avagraha () is a symbol used to indicate prodelision of an ''()'' in many Indian languages like Sanskrit as shown below. It is usually transliterated with an apostrophe in Roman script and, in case of Devanagari, as in the Sanskrit philosophical e ...
, and a mostly obsolete version of the ''vĭréam'' diacritic. The U+17Ex series contains the Khmer numerals
Khmer numerals are the numerals used in the Khmer language. They have been in use since at least the early 7th century, with the earliest known use being on a stele dated to AD 604 found in Prasat Bayang, near Angkor Borei, Cambodia.
Numera ...
, and the U+17Fx series contains variants of the numerals used in divination
Divination (from Latin ''divinare'', 'to foresee, to foretell, to predict, to prophesy') is the attempt to gain insight into a question or situation by way of an occultic, standardized process or ritual. Used in various forms throughout histor ...
lore.
The block with additional lunar date symbols is U+19E0–U+19FF:
The symbols at U+19E0 and U+19F0 represent the first and second "eighth month" in a lunar year containing a leap-month (see Khmer calendar
Khmer(s) may refer to:
Cambodia
*''Srok Khmer'' (lit. "Khmer land" or "Land of the Khmer(s)"), a colloquial exonym used to refer to Cambodia by Cambodians; see
*
*Khmer people, the ethnic group to which the great majority of Cambodians belong
**K ...
). The remaining symbols in this block denote the days of a lunar month: those in the U+19Ex series for waxing days, and those in the U+19Fx series for waning days.
See also
* Khmer Braille
Braille is the braille alphabet of the Khmer language of Cambodia.[World Braille Usage< ...](_blank)
* Romanization of Khmer
The romanization of Khmer is a representation of the Khmer (Cambodian) language using letters of the Latin alphabet. This is most commonly done with Khmer proper nouns, such as names of people and geographical names, as in a gazetteer.
Romanizat ...
* Khom Thai script
The Khom script ( th, อักษรขอม, akson khom, or later th, อักษรขอมไทย, akson khom thai; lo, ອັກສອນຂອມ, Aksone Khom; km, អក្សរខម, âksâr khâm) is a Brahmic script and a vari ...
Notes
References
* ''Dictionnaire Cambodgien'', Vol I & II, 1967, L'institut Bouddhique (Khmer Language)
* Jacob, Judith. 1974. ''A Concise Cambodian-English Dictionary''. London, Oxford University Press.
External links
Omniglot entry on Khmer
Khmer Romanization Table
(PDF)
{{DEFAULTSORT:Khmer Script
Khmer language
Writing systems of Asia
Writing systems without word boundaries