HOME

TheInfoList



OR:

The Sinhala script ( si, සිංහල අක්ෂර මාලාව, Siṁhala Akṣara Mālāva), also known as Sinhalese script, is a
writing system A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form ...
used by the
Sinhalese people Sinhalese people ( si, සිංහල ජනතාව, Sinhala Janathāva) are an Indo-Aryan ethnolinguistic group native to the island of Sri Lanka. They were historically known as Hela people ( si, හෙළ). They constitute about 75% of t ...
and most
Sri Lankans This is a demography of the population of Sri Lanka including population density, ethnicity, education level, health of the populace, economic status, religious affiliations and other aspects of the population. Sri Lanka is an island in the ...
in
Sri Lanka Sri Lanka (, ; si, ශ්‍රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
and elsewhere to write the
Sinhala language Sinhala ( ; , ''siṁhala'', ), sometimes called Sinhalese (), is an Indo-Aryan languages, Indo-Aryan language primarily spoken by the Sinhalese people of Sri Lanka, who make up the largest ethnic group on the island, numbering about 16 milli ...
as well as the
liturgical language A sacred language, holy language or liturgical language is any language that is cultivated and used primarily in church service or for other religious reasons by people who speak another, primary language in their daily lives. Concept A sacre ...
s
Pali Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
and
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
.Daniels (1996), p. 408. The Sinhalese Akṣara Mālāva, one of the
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
, is a descendant of the
Ancient India According to consensus in modern genetics, anatomically modern humans first arrived on the Indian subcontinent from Africa between 73,000 and 55,000 years ago. Quote: "Y-Chromosome and Mt-DNA data support the colonization of South Asia by m ...
n
Brahmi script Brahmi (; ; ISO: ''Brāhmī'') is a writing system of ancient South Asia. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as 'lath' o ...
. It is also related to the
Grantha script The Grantha script ( ta, கிரந்த எழுத்து, Granta eḻuttu; ml, ഗ്രന്ഥലിപി, granthalipi) is a South Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, t ...
. The Sinhala script is an
abugida An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
written from left to right. Sinhala letters are classified in two sets. The core set of letters forms the ' alphabet (Pure Sinhala, ), which is a subset of the ' alphabet (Mixed Sinhala, ).


History

The Sinhala script is a Brahmi derivate and was imported from Northern India around the 3rd century BCE. It developed in a complex manner, partly independently but also strongly influenced by South Indian scripts at various stages, manifestly influenced by the early
Grantha script The Grantha script ( ta, கிரந்த எழுத்து, Granta eḻuttu; ml, ഗ്രന്ഥലിപി, granthalipi) is a South Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, t ...
. Pottery from the 6th century BCE has been found in
Anuradhapura Anuradhapura ( si, අනුරාධපුරය, translit=Anurādhapuraya; ta, அனுராதபுரம், translit=Aṉurātapuram) is a major city located in north central plain of Sri Lanka. It is the capital city of North Central ...
with lithic inscriptions dating from the 2nd century BCE written in
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
. Medieval Sinhalese, which emerged around 750 AD, is marked by very strong influence from the
Grantha script The Grantha script ( ta, கிரந்த எழுத்து, Granta eḻuttu; ml, ഗ്രന്ഥലിപി, granthalipi) is a South Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, t ...
. Subsequently, Medieval (and modern) Sinhalese resemble the South Indian scripts. By the 9th century CE,
literature Literature is any collection of written work, but it is also used more narrowly for writings specifically considered to be an art form, especially prose fiction, drama, and poetry. In recent centuries, the definition has expanded to include ...
written in the Sinhala script had emerged and the script began to be used in other contexts. For instance, the
Buddhist literature Buddhist texts are those religious texts which belong to the Buddhist tradition. The earliest Buddhist texts were not committed to writing until some centuries after the death of Gautama Buddha. The oldest surviving Buddhist manuscripts a ...
of the
Theravada ''Theravāda'' () ( si, ථේරවාදය, my, ထေရဝါဒ, th, เถรวาท, km, ថេរវាទ, lo, ເຖຣະວາດ, pi, , ) is the most commonly accepted name of Buddhism's oldest existing school. The school' ...
-
Buddhists Buddhism ( , ), also known as Buddha Dharma and Dharmavinaya (), is an Indian religion or philosophical tradition based on teachings attributed to the Buddha. It originated in northern India as a -movement in the 5th century BCE, and gra ...
of Sri Lanka, written in
Pali Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
, used Sinhala script. Modern Sinhalese emerged in the 13th century and is marked by the composition of the grammar book ''Sidat Sangara''. In 1736, the Dutch were the first to print with Sinhala type on the island. The resulting type followed the features of the native Sinhala script used on palm leaves. The Dutch created type was monolinear and geometric in fashion, with no separation between words in early documents. During the second half of the 19th century, during the colonial period, a new style of Sinhala letterforms emerged in opposition to the monolinear and geometric form that used high contrast and had varied thicknesses. This high contrast type gradually replaced the monolinear type as the preferred style and continues to be used in the present day. The high contrast style is still preferred for text typesetting in printed newspapers, books, and magazines in Sri Lanka. Today, the alphabet is used by over 16 million people to write Sinhala in very diverse contexts, such as
newspapers A newspaper is a periodical publication containing written information about current events and is often typed in black ink with a white or gray background. Newspapers can cover a wide variety of fields such as politics, business, sports ...
,
TV commercials A television advertisement (also called a television commercial, TV commercial, commercial, spot, television spot, TV spot, advert, television advert, TV advert, television ad, TV ad or simply an ad) is a span of television programming produce ...
,
government A government is the system or group of people governing an organized community, generally a state. In the case of its broad associative definition, government normally consists of legislature, executive, and judiciary. Government is a ...
announcements,
graffiti Graffiti (plural; singular ''graffiti'' or ''graffito'', the latter rarely used except in archeology) is art that is written, painted or drawn on a wall or other surface, usually without permission and within public view. Graffiti ranges from s ...
, and
schoolbooks A textbook is a book containing a comprehensive compilation of content in a branch of study with the intention of explaining it. Textbooks are produced to meet the needs of educators, usually at educational institutions. Schoolbooks are textboo ...
. Sinhala is the main language written in this script, but rare instances of its use for writing Sri Lanka Malay have been recorded.


Structure

Sinhala script is an
abugida An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
written from left to right. It uses
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wit ...
s as the basic unit for word construction as each consonant has an
inherent vowel An inherent vowel is part of an abugida (or alphasyllabary) script. It is a vowel sound which is used with each unmarked or basic consonant symbol. For example, if the Latin alphabet used 'i' as an inherent vowel, "Wikipedia" could be rendered as "W ...
(), which can be changed with a different vowel stroke. To represent different sounds it is necessary to add vowel strokes, or diacritics called (Pili), that can be used before, after, above or below the base-consonant. Most of the Sinhala letters are
curlicue A curlicue, or alternatively curlycue, in the visual arts, is a fancy twist, or curl, composed usually from a series of concentric circles. It is a recurring motif in architecture (as decoration to the lintel/ architrave above a door), in callig ...
s; straight lines are almost completely absent from the alphabet, and it does not have joining characters. This is because Sinhala used to be written on dried palm leaves, which would split along the veins on writing straight lines. This was undesirable, and therefore, the round shapes were preferred. Upper and lower cases do not exist in Sinhala. Sinhala letters are ordered into two sets. The core set of letters forms the ' alphabet (Pure Sinhala, ), which is a subset of the ' alphabet (Mixed Sinhala, ). This "pure" alphabet contains all the graphemes necessary to write Eḷu (classical Sinhala) as described in the classical grammar Sidatsan̆garā (1300 AD).Gair and Paolillo 1997. This is the reason why this set is also called ''Eḷu hōdiya'' ("Eḷu alphabet" ). The definition of the two sets is thus a historic one. Out of pure coincidence, the phoneme inventory of present-day colloquial Sinhala is such that yet again the ''śuddha'' alphabet suffices as a good representation of the sounds. All native
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
s of the Sinhala spoken today can be represented in ', while in order to render special Sanskrit and Pali sounds, one can fall back on '. This is most notably necessary for the
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemics' ...
s for the
Middle Indic The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OIA; ...
phonemes that the
Sinhala language Sinhala ( ; , ''siṁhala'', ), sometimes called Sinhalese (), is an Indo-Aryan languages, Indo-Aryan language primarily spoken by the Sinhalese people of Sri Lanka, who make up the largest ethnic group on the island, numbering about 16 milli ...
lost during its history, such as aspirates. Most phonemes of Sinhala can be represented by a ''śuddha'' letter or by a ''miśra'' letter, but normally only one of them is considered correct. This one-to-many mapping of
phonemes In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
onto
graphemes In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemics' ...
is a frequent source of
misspelling Spelling is a set of conventions that regulate the way of using graphemes (writing system) to represent a language in its written form. In other words, spelling is the rendering of speech sound (phoneme) into writing (grapheme). Spelling is one ...
s.Matzel (1983) p. 15, 17, 18 While a phoneme can be represented by more than one grapheme, each grapheme can be pronounced in only one way, with the exceptions of the inherent vowel sound, which can be either (stressed) or (unstressed), and "ව" where the consonant is either or depending on the word. This means that the actual
pronunciation Pronunciation is the way in which a word or a language is spoken. This may refer to generally agreed-upon sequences of sounds used in speaking a given word or language in a specific dialect ("correct pronunciation") or simply the way a particular ...
of a word is almost always clear from its orthographic form. Stress is almost always predictable; only words with or (which are both allophones of "ව"), and a very few other words need to be learnt individually. Some pronunciation exceptions in Sinhala: * කරනවා – to do – (not ) * හතලිහ – forty – (not )


Diacritics

In Sinhala the diacritics are called පිලි ''pili'' (vowel strokes). දිග ''diga'' means "long" because the vowel is sounded for longer and දෙක ''deka'' means "two" because the stroke is doubled when written.


Non-vocalic diacritics

The
anusvara Anusvara (Sanskrit: ') is a symbol used in many Indic scripts to mark a type of nasal sound, typically transliterated . Depending on its location in the word and the language for which it is used, its exact pronunciation can vary. In the context ...
(often called ''binduva'' 'zero' ) is represented by one small circle ◌ං (Unicode 0D82),Karunatillake (2004), p. xxxii and the
visarga Visarga ( sa, विसर्गः, translit=visargaḥ) means "sending forth, discharge". In Sanskrit phonology ('' ''), ' (also called, equivalently, ' by earlier grammarians) is the name of a phone voiceless glottal fricative, , written as: ...
(technically part of the ''miśra'' alphabet) by two ◌ඃ (Unicode 0D83). The inherent vowel can be removed by a special
virama Virama ( ्) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either # halanta, hasanta or explicit virā ...
diacritic, the ''hal kirīma'' (◌්), which has two shapes depending on which consonant it attaches to. Both are represented in the image on the right side. The first one is the most common one, while the second one is used for letters ending at the top left corner.


Letters


Śuddha set

The ''śuddha'' graphemes are the mainstay of Sinhala script and are used on an everyday basis. Every sequence of sounds of Sinhala of today can be represented by these graphemes. Additionally, the ''śuddha'' set comprises graphemes for
retroflex A retroflex (Help:IPA/English, /ˈɹɛtʃɹoːflɛks/), apico-domal (Help:IPA/English, /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated betw ...
and , which are no longer phonemic in modern Sinhala. These two letters were needed for the representation of Eḷu, but are now obsolete from a purely phonemic view. However, words which historically contain these two phonemes are still often written with the graphemes representing the retroflex sounds.


Vowels

Vowels come in two shapes: independent and
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
. The independent shape is used when a vowel does not follow a consonant, e.g. at the beginning of a word. The diacritic shape is used when a vowel follows a consonant. Depending on the vowel, the diacritic can attach at several places (see diacritics section above) While most diacritics are regular, the diacritic for takes a different shape according to the consonant it attaches to. The most common one is the one used for the consonant ප (p): පු (pu) and පූ (pū). Some consonants ending at the lower right corner (ක (k),ග (g), ත(t), but not න(n) or හ(h)) use this diacritic: කු (ku) and කූ (kuu). Combinations of ර(r) or ළ(ḷ) with have idiosyncratic shapes, viz රු (ru) රූ (rū) ළු (ḷu) ළූ (ḷū).Jayawardena-Moser (2004) p. 11 Note that the diacritic used for රු (ru) and රූ (rū) is what is normally used for the , and therefore there are idiosyncratic forms for ræ and rǣ, viz රැ and රෑ ifference may not be visible depending on how unicode is rendered in your browser


Consonants

The ''śuddha'' alphabet comprises 8
plosive In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lips ...
s, 2
fricative A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in t ...
s, 2
affricate An affricate is a consonant that begins as a stop and releases as a fricative, generally with the same place of articulation (most often coronal). It is often difficult to decide if a stop and fricative form a single phoneme or a consonant pair. ...
s, 2
nasals In phonetics, a nasal, also called a nasal occlusive or nasal stop in contrast with an oral stop or nasalized consonant, is an occlusive consonant produced with a lowered velum, allowing air to escape freely through the nose. The vast majorit ...
, 2
liquids A liquid is a nearly incompressible fluid that conforms to the shape of its container but retains a (nearly) constant volume independent of pressure. As such, it is one of the four fundamental states of matter (the others being solid, gas, an ...
and 2 glides. Additionally, there are the two graphemes for the retroflex sounds and , which are not phonemic in modern Sinhala, but which still form part of the set. These are shaded in the table. The voiceless affricate (ච ) is not included in the ''śuddha'' set by purists since it does not occur in the main text of the Sidatsan̆garā. The Sidatsan̆garā does use it in examples though, so this sound did exist in Eḷu. In any case, it is needed for the representation of modern Sinhala. The basic shapes of these consonants carry an inherent unless this is replaced by another vowel or removed by the ''hal kirīma''.


Prenasalized consonants

The
prenasalized consonant Prenasalized consonants are phonetic sequences of a nasal and an obstruent (or occasionally a non-nasal sonorant such as ) that behave phonologically like single consonants. The primary reason for considering them to be single consonants, rathe ...
s resemble their plain counterparts. is made up by the left half of and the right half of , while the other three are just like the grapheme for the plosive with a little stroke attached to their left.Fairbanks et al. (1968), p. 126 Vowel diacritics attach in the same way as they would to the corresponding plain plosive.


Miśra set

The ''miśra'' alphabet is a
superset In mathematics, set ''A'' is a subset of a set ''B'' if all elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are unequal, then ''A'' is a proper subset of ...
of ''śuddha''. It adds letters for aspirates,
retroflex A retroflex (Help:IPA/English, /ˈɹɛtʃɹoːflɛks/), apico-domal (Help:IPA/English, /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated betw ...
es and
sibilant Sibilants are fricative consonants of higher amplitude and pitch, made by directing a stream of air with the tongue towards the teeth. Examples of sibilants are the consonants at the beginning of the English words ''sip'', ''zip'', ''ship'', and ...
s, which are not phonemic in today's Sinhala, but which are necessary to represent non-native words, like
loanword A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because th ...
s from
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
, Pali or
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ide ...
. The use of the extra letters is mainly a question of prestige. From a purely phonemic point of view, there is no benefit in using them, and they can be replaced by a (sequence of) ''śuddha'' letters as follows: For the ''miśra'' aspirates, the replacement is the
plain In geography, a plain is a flat expanse of land that generally does not change much in elevation, and is primarily treeless. Plains occur as lowlands along valleys or at the base of mountains, as coastal plains, and as plateaus or uplands ...
''śuddha'' counterpart, for the ''miśra''
retroflex A retroflex (Help:IPA/English, /ˈɹɛtʃɹoːflɛks/), apico-domal (Help:IPA/English, /əpɪkoːˈdɔmɪnəl/), or cacuminal () consonant is a coronal consonant where the tongue has a flat, concave, or even curled shape, and is articulated betw ...
liquids A liquid is a nearly incompressible fluid that conforms to the shape of its container but retains a (nearly) constant volume independent of pressure. As such, it is one of the four fundamental states of matter (the others being solid, gas, an ...
the corresponding ''śuddha'' coronal liquid,Karunatillake (2004), p. xxxi for the
sibilant Sibilants are fricative consonants of higher amplitude and pitch, made by directing a stream of air with the tongue towards the teeth. Examples of sibilants are the consonants at the beginning of the English words ''sip'', ''zip'', ''ship'', and ...
s, . ඤ (ñ) and ඥ (gn) cannot be represented by ''śuddha'' graphemes but are found only in fewer than 10 words each. ෆ fa can be represented by ප pa with a Latin inscribed in the cup.


Vowels

There are six additional vocalic diacritics in the ''miśra'' alphabet. The two
diphthong A diphthong ( ; , ), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech o ...
s are quite common, while the "syllabic" ṛ is much rarer, and the "syllabic" ḷ is all but obsolete. The latter are almost exclusively found in loanwords from Sanskrit.Matzel (1983), p. 8 The ''miśra'' can also be written with ''śuddha'' + or +, which corresponds to the actual
pronunciation Pronunciation is the way in which a word or a language is spoken. This may refer to generally agreed-upon sequences of sounds used in speaking a given word or language in a specific dialect ("correct pronunciation") or simply the way a particular ...
. The ''miśra'' syllabic is obsolete, but can be rendered by ''śuddha'' +.Matzel (1983), p. 14 Miśra is rendered as ''śuddha'' , ''miśra'' as ''śuddha'' . Note that the transliteration of both ළ් and ෟ is . This is not very problematic as the second one is extremely scarce.


Consonants


Consonant conjuncts

Certain combinations of graphemes trigger special ligatures. Special signs exist for an ර (r) following a consonant (inverted arch underneath), a ර (r) preceding a consonant (loop above) and a ය (y) following a consonant (half a ය on the right). Fairbanks et al. (1968), p. 109 Jayawardena-Moser (2004), p. 12 Furthermore, very frequent combinations are often written in one stroke, like ''ddh'', ''kv'' or ''kś''. If this is the case, the first consonant is not marked with a ''hal kirīma''. The image on the right shows the
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
for '' śrī'', which is composed of the letter ''ś'' with a ligature indicating the ''r'' below and the vowel ''ī'' marked above. Most other conjunct consonants are made with an explicit virama, called ''al-lakuna'' or ''hal kirīma'', and the
zero-width joiner The zero-width joiner (ZWJ, ) is a non-printing character used in the computerized typesetting of writing systems in which the shape or positioning of a grapheme depends on its relation to other graphemes ( complex scripts), such as the Arabic s ...
as shown in the following table, some of which may not display correctly due to limitations of your system. Some of the more common are displayed in the following table. Note that although modern Sinhala sounds are not aspirated, aspiration is marked in the sound where it was historically present to highlight the differences in modern spelling. Also note that all of the combinations are encoded with the ''al-lakuna'' (Unicode U+0DCA) first, followed by the zero-width joiner (Unicode U+200D) except for touching letters which have the zero-width joiner (Unicode U+200D) first followed by the ''al-lakuna'' (Unicode U+0DCA). Touching letters were used in ancient scriptures but are not used in modern Sinhala. Vowels may be attached to any of the ligatures formed, attaching to the rightmost part of the glyph except for vowels that use the ''kombuva'', where the ''kombuva'' is written before the ligature or cluster and the remainder of the vowel, if any, is attached to the rightmost part. In the table below, appending "o" (''kombuva saha ælepilla'' – ''kombuva'' with ''ælepilla'') to the cluster "ky" only adds a single code point, but adds two vowel strokes, one each to the left and right of the consonant cluster.


Letter names

The Sinhala ''śuddha'' graphemes are named in a uniform way adding ''-yanna'' to the sound produced by the letter, including vocalic diacritics.Fairbanks et al. (1968), p. 366 The name for the letter අ is thus ''ayanna'', for the letter ආ ''āyanna'', for the letter ක ''kayanna'', for the letter කා ''kāyanna'', for the letter කෙ ''keyanna'' and so forth. For letters with ''hal kirīma'', an
epenthetic In phonology, epenthesis (; Greek language, Greek ) means the addition of one or more sounds to a word, especially in the beginning syllable (''prothesis (linguistics), prothesis'') or in the ending syllable (''paragoge'') or in-between two syll ...
''a'' is added for easier pronunciation: the name for the letter ක් is ''akyanna''. Another naming convention is to use ''al-'' before a letter with suppressed vowel, thus ''alkayanna''. Since the extra ''miśra'' letters are phonetically not distinguishable from the ''śuddha'' letters, proceeding in the same way would lead to confusion. Names of ''miśra'' letters are normally made up of the names of two ''śuddha'' letters pronounced as one word. The first one indicates the sound, the second one the shape. For example, the aspirated ඛ (kh) is called ''bayanu kayanna''. ''kayanna'' indicates the sound, while ''bayanu'' indicates the shape: ඛ (kh) is similar in shape to බ (b) (''bayunu = like bayanna''). Another method is to qualify the ''miśra'' aspirates by ''mahāprāna'' (ඛ: ''mahāprāna kayanna'') and the ''miśra'' retroflexes by ''mūrdhaja'' (ළ: ''mūrdhaja layanna'').


Numerals

Sinhala had special symbols to represent numerals, which were in use until the beginning of the 19th century. This system is now superseded by
Hindu–Arabic numeral system The Hindu–Arabic numeral system or Indo-Arabic numeral system Audun HolmeGeometry: Our Cultural Heritage 2000 (also called the Hindu numeral system or Arabic numeral system) is a positional decimal numeral system, and is the most common syste ...
. ;Sinhala Illakkam (
Sinhala Archaic Numbers Sinhala Archaic Numbers is a Unicode block A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentati ...
) Sinhala Illakkam were used for writing numbers prior to the fall of
Kandyan Kingdom The Kingdom of Kandy was a monarchy on the island of Sri Lanka, located in the central and eastern portion of the island. It was founded in the late 15th century and endured until the early 19th century. Initially a client kingdom of the Kin ...
in 1815. These digits did not have a zero instead the numbers had signs for 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 1000. These digits and numbers can be seen primarily in Royal documents and artefacts. ;Sinhala Lith Illakkam ( Sinhala Astrological Numbers) Prior to the fall of Kandyan Kingdom all calculations were carried out using Lith digits. After the fall of the
Kandyan Kingdom The Kingdom of Kandy was a monarchy on the island of Sri Lanka, located in the central and eastern portion of the island. It was founded in the late 15th century and endured until the early 19th century. Initially a client kingdom of the Kin ...
, Sinhala Lith Illakkam were primarily used for writing horoscopes. However, there is evidence that they were used for other purposes such as writing page numbers etc. The tradition of writing degrees and minutes of zodiac signs in horoscopes continued into the 20th century using different versions of Lith Digits. Unlike the Sinhala Illakkam, Sinhala Lith Illakkam included a 0. Neither the
Sinhala numerals Sinhala numerals, are the units of the numeral system, originating from the Indian subcontinent, used in Sinhala language in modern-day Sri Lanka. Numerals or numerations around Kandyan Kingdom It had been found that five different types of nu ...
nor U+0DF4 ෴ Sinhala punctuatio
kunddaliya
is in general use today, but some use it in social media, Internet messaging and blogs. The kunddaliya was formerly used as a full stop.


Transliteration

Sinhala
transliteration Transliteration is a type of conversion of a text from one writing system, script to another that involves swapping Letter (alphabet), letters (thus ''wikt:trans-#Prefix, trans-'' + ''wikt:littera#Latin, liter-'') in predictable ways, such as ...
(Sinhala: රෝම අකුරින් ලිවීම ''rōma akurin livīma'', literally "Roman letter writing") can be done in analogy to Devanāgarī transliteration. Layman's transliterations in Sri Lanka normally follow neither of these. Vowels are transliterated according to English spelling equivalences, which can yield a variety of spellings for a number of phonemes. for instance can be , , , , etc. A transliteration pattern peculiar to Sinhala, and facilitated by the absence of phonemic aspirates, is the use of for the
voiceless dental plosive The voiceless alveolar, dental and postalveolar plosives (or stops) are types of consonantal sounds used in almost all spoken languages. The symbol in the International Phonetic Alphabet that represents voiceless dental, alveolar, and postalv ...
, and the use of for the
voiceless retroflex plosive The voiceless retroflex plosive or stop is a type of consonantal sound, used in some spoken languages. This consonant is found as a phoneme mostly (though not exclusively) in two areas: South Asia and Australia. Transcription The symbol that r ...
. This is presumably because the retroflex plosive is perceived the same as the English
alveolar plosive In phonetics and phonology, an alveolar stop is a type of consonantal sound, made with the tongue in contact with the alveolar ridge located just behind the teeth (hence alveolar), held tightly enough to block the passage of air (hence a stop cons ...
, and the Sinhala dental plosive is equated with the English
voiceless dental fricative The voiceless dental non-sibilant fricative is a type of consonantal sound used in some spoken languages. It is familiar to English speakers as the 'th' in ''think''. Though rather rare as a phoneme in the world's inventory of languages, it is en ...
.Matzel (1983), p. 16 Dental and retroflex voiced plosives are always rendered as , though, presumably because is not found as a representation of in English orthography.


Use for the Pali language

Many of the oldest manuscripts in the Pali language are written in the Sinhala script. ''Miśra'' consonants are used to represent Pali phonemes that have no Sinhala counterpart. The following table lays out the Sinhala representations of Pali consonants with their standard academic Romanizations: The vowels are a subset of those for writing Sinhala: The is represented with the sign ං. Consonant sequences may be combined in ligatures in a manner identical to that described above for Sinhala. As an example, below is the first verse from the
Dhammapada The Dhammapada (Pāli; sa, धर्मपद, Dharmapada) is a collection of sayings of the Buddha in verse form and one of the most widely read and best known Buddhist scriptures. The original version of the Dhammapada is in the Khuddaka ...
in Pali in Sinhala script, followed by Romanization:


Relation to other scripts

;Similarities Sinhala is one of the
Brahmic scripts The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
, and thus shares many similarities with other members of the family, such as the
Kannada Kannada (; ಕನ್ನಡ, ), originally romanised Canarese, is a Dravidian language spoken predominantly by the people of Karnataka in southwestern India, with minorities in all neighbouring states. It has around 47 million native s ...
,
Malayalam Malayalam (; , ) is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry (Mahé district) by the Malayali people. It is one of 22 scheduled languages of India. Malayalam was des ...
,
Telugu Telugu may refer to: * Telugu language, a major Dravidian language of India *Telugu people, an ethno-linguistic group of India * Telugu script, used to write the Telugu language ** Telugu (Unicode block), a block of Telugu characters in Unicode S ...
,
Tamil script The Tamil script ( , ) is an abugida script that is used by Tamils and Tamil language, Tamil speakers in India, Sri Lanka, Malaysia, Singapore, Indonesia and elsewhere to write the Tamil language. Certain minority languages such as Saurasht ...
and
Devanāgarī Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the a ...
. As a general example, is the inherent vowel in all these scripts. Other similarities include the diacritic for , which resembles a doubled in all scripts and the diacritic for which is composed of preceding and following . Likewise, the combination of the diacritics for and yields in all these scripts. ;Differences Sinhala alphabet differs from other Indo-Aryan alphabets in that it contains a pair of vowel sounds (U+0DD0 and U+0DD1 in the proposed Unicode Standard) that are unique to it. These are the two vowel sounds that are similar to the two vowel sounds that occur at the beginning of the English words ''at'' (ඇ) and ''ant'' (ඈ). Another feature that distinguishes Sinhala from its sister Indo-Aryan languages is the presence of a set of five nasal sounds known as half-nasal or prenasalized stops.


Computer encoding

Generally speaking, Sinhala support is less developed than support for Devanāgarī, for instance. A recurring problem is the rendering of diacritics which precede the consonant and diacritic signs which come in different shapes, like the one for . Sinhala support did not come built in with
Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washing ...
Windows XP Windows XP is a major release of Microsoft's Windows NT operating system. It was released to manufacturing on August 24, 2001, and later to retail on October 25, 2001. It is a direct upgrade to its predecessors, Windows 2000 for high-end and ...
, unlike
Tamil Tamil may refer to: * Tamils, an ethnic group native to India and some other parts of Asia **Sri Lankan Tamils, Tamil people native to Sri Lanka also called ilankai tamils **Tamil Malaysians, Tamil people native to Malaysia * Tamil language, nativ ...
and
Hindi Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been de ...
, but was supported by third-party means such as Keyman by
SIL International SIL International (formerly known as the Summer Institute of Linguistics) is an evangelical Christian non-profit organization whose main purpose is to study, develop and document languages, especially those that are lesser-known, in order to ex ...
. Thereafter, all versions of
Windows Vista Windows Vista is a major release of the Windows NT operating system developed by Microsoft. It was the direct successor to Windows XP, which was released five years before, at the time being the longest time span between successive releases of ...
and above, including
Windows 10 Windows 10 is a major release of Microsoft's Windows NT operating system. It is the direct successor to Windows 8.1, which was released nearly two years earlier. It was released to manufacturing on July 15, 2015, and later to retail on J ...
come with Sinhala support by default, and do not require external
font In metal typesetting, a font is a particular size, weight and style of a typeface. Each font is a matched set of type, with a piece (a "sort") for each glyph. A typeface consists of a range of such fonts that shared an overall design. In mod ...
s to be installed to read Sinhala script. ''
Nirmala UI Nirmala UI is an Indic scripts typeface created by Tiro Typeworks and commissioned by Microsoft. It was first released with Windows 8 in 2012 as a UI font and currently supports languages using Bengali–Assamese, Devanagari, Kannada, Gujarati, ...
'' is the default Sinhala font in Windows 10. The latest versions of Windows 10 have added support for
Sinhala Archaic Numbers Sinhala Archaic Numbers is a Unicode block A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentati ...
that were not supported by default in previous versions. For
macOS macOS (; previously OS X and originally Mac OS X) is a Unix operating system developed and marketed by Apple Inc. since 2001. It is the primary operating system for Apple's Mac computers. Within the market of desktop and lapt ...
,
Apple Inc. Apple Inc. is an American multinational technology company headquartered in Cupertino, California, United States. Apple is the largest technology company by revenue (totaling in 2021) and, as of June 2022, is the world's biggest company ...
has provided Sinhala font support for versions of macOS that are Catalina and above through
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
integration. Keyboard support is available by third-party means such as Helakuru an
Keyman
In
Mac OS X macOS (; previously OS X and originally Mac OS X) is a Unix operating system developed and marketed by Apple Inc. since 2001. It is the primary operating system for Apple's Mac (computer), Mac computers. Within the market of ...
, Sinhala font and keyboard support were provided b
Nickshanks
an

For
Linux Linux ( or ) is a family of open-source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991, by Linus Torvalds. Linux is typically packaged as a Linux distribution, which ...
, the
IBus When drinking beer, there are many factors to be considered. Principal among them are bitterness, the variety of flavours present in the beverage and their intensity, alcohol content, and colour. Standards for those characteristics allow a more o ...
, and SCIM input methods allow the use Sinhala script in applications with support for a number of key maps and techniques such as traditional, phonetic and assisted techniques.A screenshot showing some of the options
/ref> In addition, newer versions of the Android mobile operating system also support both rendering and input of Sinhala script by default and applications like Helakuru serve as dedicated keyboard integrators.


Unicode

Sinhala script was added to the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
Standard in September 1999 with the release of version 3.0. This character allocation has been adopted in Sri Lanka as the
Standard Standard may refer to: Symbols * Colours, standards and guidons, kinds of military signs * Standard (emblem), a type of a large symbol or emblem used for identification Norms, conventions or requirements * Standard (metrology), an object th ...
SLS1134. The main Unicode block for Sinhala is U+0D80–U+0DFF. Another block,
Sinhala Archaic Numbers Sinhala Archaic Numbers is a Unicode block A Unicode block is one of several contiguous ranges of numeric character codes ( code points) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentati ...
, was added to Unicode in version 7.0.0 in June 2014. Its range is U+111E0–U+111FF.


See also

* Sinhala Braille * History of Sinhala software * Loanwords **
Dutch loanwords in Sinhala This is a list of Sinhala words of Dutch origin. ''Note: For information on the transcription used, see National Library at Calcutta romanization. An exception from the standard is the romanization of Sinhala long "ä" () as "ää".'' Sinhala w ...
** English loanwords in Sinhala ** Portuguese loanwords in Sinhala ** Tamil loanwords in Sinhala


References


Further reading

* Coperahewa, Sandagomi. ''Sinhala Akuru Puranaya'' 'Chronicle of Sinhala Letters''Nugegoda: Sarasavi, 2018. * * * * * * *


External links


Scripts (ISO 15924) "Sinhala"Sinhala Unicode CharactersSinhala Unicode CharactersSinhala Unicode Character Code ChartSinhala Archaic Numbers Unicode Character Code Chart


Online resources * Sinhala guide of the Sinhala Wikipedia (in English)
Online Sinhala Unicode Writer

Sinhala English Dictionary and Sinhala To Hindi Language Translator

Sinhala Unicode Support Group

Online Unicode Converter
{{DEFAULTSORT:Sinhala script Brahmic scripts