Sinhala ( ; , ''siṁhala'', ), sometimes called Sinhalese (), is an
Indo-Aryan language
The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in India, Pa ...
primarily spoken by the
Sinhalese people
Sinhalese people ( si, සිංහල ජනතාව, Sinhala Janathāva) are an Indo-Aryan ethnolinguistic group native to the island of Sri Lanka. They were historically known as Hela people ( si, හෙළ). They constitute about 75% of t ...
of
Sri Lanka
Sri Lanka (, ; si, ශ්රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
, who make up the largest ethnic group on the island, numbering about 16 million.
Sinhala is also spoken as the first language by other ethnic groups in Sri Lanka, totalling about 2 million people as of 2001. It is written using the
Sinhala script
The Sinhala script ( si, සිංහල අක්ෂර මාලාව, Siṁhala Akṣara Mālāva), also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write ...
, which is a
Brahmic script
The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
closely related to the
Grantha script
The Grantha script ( ta, கிரந்த எழுத்து, Granta eḻuttu; ml, ഗ്രന്ഥലിപി, granthalipi) is a South Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, t ...
of South India.
Sinhala is one of the official and national languages of Sri Lanka. Along with
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
, it played a major role in the development of
Theravada Buddhist
''Theravāda'' () ( si, ථේරවාදය, my, ထေရဝါဒ, th, เถรวาท, km, ថេរវាទ, lo, ເຖຣະວາດ, pi, , ) is the most commonly accepted name of Buddhism's oldest existing school. The school' ...
literature.
The early form of the Sinhala language, is attested as early as the 3rd century BCE. The language of these inscriptions with long vowels and aspirated consonants is a Prakrit similar to Magadhi, a regional associate of the Middle Indian Prakrits that has been used during the time of the Buddha.
The closest relatives are the
Vedda language
Vedda is an endangered language that is used by the indigenous Vedda people of Sri Lanka. Additionally, communities such as Coast Veddas and Anuradhapura Veddas who do not strictly identify as Veddas also use words from the Vedda language in par ...
(an endangered, indigenous creole still spoken by a minority of Sri Lankans, mixing Sinhala with an isolate of unknown origin and from which Old Sinhala borrowed various aspects into its main Indo-Aryan substrate), and the
Maldivian language
Maldivian, also known by its endonym Dhivehi or Divehi ( ; '' dv, links=no, ދިވެހި'', ), is an Indo-Aryan language spoken in the South Asian island country of Maldives and on Minicoy Island, Lakshadweep, union territory of India.
The M ...
.
It has two main varieties, written and spoken, and is a conspicuous example of the linguistic phenomenon known as
diglossia
In linguistics, diglossia () is a situation in which two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled " ...
.
Etymology
''Sinhala'' () is a
Sanskrit
Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
term; the corresponding
Middle Indo-Aryan
The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OIA; ...
(
Eḷu) word is ''Sīhala''.
The name is a derivation from ', the Sanskrit word for "lion". The name is sometimes glossed as "abode of lions", and attributed to a supposed former abundance of
lion
The lion (''Panthera leo'') is a large Felidae, cat of the genus ''Panthera'' native to Africa and India. It has a muscular, broad-chested body; short, rounded head; round ears; and a hairy tuft at the end of its tail. It is sexually dimorphi ...
s on the island.
History
According to the chronicle ''
Mahavansa'', written in
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
, Vanga kingdom's
Prince Vijaya
According to the ''Mahāvaṃsa'' chronicle, Prince Vijaya (c. 543–505 BCE) was the first Sinhalese monarchy, Sinhalese king. Legends and records from both Indian and Sri Lanka sources say that he along with several hundred followers came to ...
and his entourage merged with the
Yakkha Yakkha may refer to:
* Yakkha people, an ethnic group of South Asia
* Yakkha language, a Sino-Tibetan language
* Yaksha
The yakshas ( sa, यक्ष ; pi, yakkha, i=yes) are a broad class of nature-spirits, usually benevolent, but sometime ...
and later settlers from the
Pandya kingdom. In the following centuries, there was substantial immigration from Eastern India (
Vanga Kingdom
Vanga was an ancient kingdom and geopolitical division within the Ganges delta in the Indian subcontinent. The kingdom is one of the namesakes of the Bengal region. It was located in southern Bengal, with the core region including present-day ...
(Bengal),
Kalinga Kalinga may refer to:
Geography, linguistics and/or ethnology
* Kalinga (historical region), a historical region of India
** Kalinga (Mahabharata), an apocryphal kingdom mentioned in classical Indian literature
** Kalinga script, an ancient writ ...
,
Magadha
Magadha was a region and one of the sixteen sa, script=Latn, Mahajanapadas, label=none, lit=Great Kingdoms of the Second Urbanization (600–200 BCE) in what is now south Bihar (before expansion) at the eastern Ganges Plain. Magadha was ruled ...
) which led to an admixture of features of Eastern Prakrits.
Stages of historical development
The development of Sinhala is divided into four epochs:
* Sinhala Prakrit (3rd c BCE to 4th c CE)
* Proto-Sinhala (4th c CE to 8th c CE)
* Medieval Sinhala (8th c CE to 13th c CE)
* Modern Sinhala (13th c CE to the present)
Phonetic development
The most important phonetic developments of Sinhala include:
* the loss of the
aspiration distinction (e.g. ''kanavā'' "to eat" corresponds to Sanskrit ''khādati'', Hindustani ''khānā'')
* the loss of a
vowel length
In linguistics, vowel length is the perceived length of a vowel sound: the corresponding physical measurement is duration. In some languages vowel length is an important phonemic factor, meaning vowel length can change the meaning of the word, f ...
distinction; long vowels in the modern language are due to
loanword
A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because th ...
s (e.g. ''vibāgaya'' "exam" < Sanskrit ''vibhāga'') and
sandhi
Sandhi ( sa, सन्धि ' , "joining") is a cover term for a wide variety of sound changes that occur at morpheme or word boundaries. Examples include fusion of sounds across word boundaries and the alteration of one sound depending on near ...
, either after
elision
In linguistics, an elision or deletion is the omission of one or more sounds (such as a vowel, a consonant, or a whole syllable) in a word or phrase. However, these terms are also used to refer more narrowly to cases where two words are run toget ...
of
Intervocalic consonant
In phonetics and phonology, an intervocalic consonant is a consonant that occurs between two vowels. Intervocalic consonants are often associated with lenition, a phonetic process that causes consonants to weaken and eventually disappear entirel ...
s (e.g. ''dānavā'' "to put" < ''damanavā'') or in originally
compound
Compound may refer to:
Architecture and built environments
* Compound (enclosure), a cluster of buildings having a shared purpose, usually inside a fence or wall
** Compound (fortification), a version of the above fortified with defensive struct ...
words.
* the simplification of
consonant cluster
In linguistics, a consonant cluster, consonant sequence or consonant compound, is a group of consonants which have no intervening vowel. In English, for example, the groups and are consonant clusters in the word ''splits''. In the education fie ...
s and
geminate consonants into geminates and single consonants respectively (e.g. Sanskrit ''viṣṭā'' "time" > Sinhalese Prakrit ''viṭṭa'' > Modern Sinhala ''viṭa'')
* development of to (e.g. san̆da/han̆da "moon" corresponds to Sanskrit ''candra'') and development of to (e.g. ''däla'' "web" corresponds to Sanskrit ''jāla'')
Western vs. Eastern Prakrit features
According to
Wilhelm Geiger
Wilhelm Ludwig Geiger (; ; 21 July 1856 – 2 September 1943) was a German Orientalist in the fields of Indo-Iranian languages and the history of Iran and Sri Lanka. He was known as a specialist in Pali, Sinhala language and the Dhivehi language ...
, an example of a possible Western feature in Sinhala is the retention of initial which developed into in the Eastern languages (e.g. Sanskrit ''viṃśati'' "twenty", Sinhala ''visi-'', Hindi ''bīs''). This is disputed by
Muhammad Shahidullah
Muhammad Shahidullah ( bn, মুহম্মদ শহীদুল্লাহ; 10 July 1885 – 13 July 1969) was a Bengali linguist, philologist, educationist, and writer.
In 2004, he was ranked number 16 in BBC's poll of the Greatest Benga ...
who says that Sinhala Prakrit branched off from the Eastern Prakrits prior to this change. He cites the inscriptions of Asoka, none of which show this sound change.
An example of an Eastern feature is the ending -e for masculine nominative singular (instead of Western -o) in Sinhalese Prakrit. There are several cases of vocabulary
doublets, e.g. the words ''mässā'' ("fly") and ''mäkkā'' ("flea"), which both correspond to Sanskrit ''makṣikā'' but stem from two regionally different Prakrit words ''macchiā'' (Western prakrits) and ''makkhikā'' (as in Eastern prakrits like
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
).
Pre-1815 Sinhalese literature
In 1815 the island of Ceylon came under
British rule
The British Raj (; from Hindi ''rāj'': kingdom, realm, state, or empire) was the rule of the British Crown on the Indian subcontinent;
*
* it is also called Crown rule in India,
*
*
*
*
or Direct rule in India,
* Quote: "Mill, who was hims ...
. During the career of
Christopher Reynolds (1922–2015) as a Sinhalese lecturer at the
SOAS, University of London
SOAS University of London (; the School of Oriental and African Studies) is a public research university in London, England, and a member institution of the federal University of London. Founded in 1916, SOAS is located in the Bloomsbury are ...
, he extensively researched the Sinhalese language and its pre-1815 literature: the Sri Lankan government awarded him the
Sri Lanka Ranjana The Sri Lanka Ranjana ( si, ශ්රී ලංකා රන්ජන) is a national honour of Sri Lanka for foreigners or non nationals, awarded "for distinguished service of highly meritorious nature".
Awardees
Awardees include:
;2005
* Baik ...
medal for this. He wrote the 377-page ''An anthology of Sinhalese literature up to 1815'', selected by the
UNESCO
The United Nations Educational, Scientific and Cultural Organization is a specialized agency of the United Nations (UN) aimed at promoting world peace and security through international cooperation in education, arts, sciences and culture. It ...
National Commission of Ceylon
Substratum influence in Sinhala
According to
Wilhelm Geiger
Wilhelm Ludwig Geiger (; ; 21 July 1856 – 2 September 1943) was a German Orientalist in the fields of Indo-Iranian languages and the history of Iran and Sri Lanka. He was known as a specialist in Pali, Sinhala language and the Dhivehi language ...
, Sinhala has features that set it apart from other Indo-Aryan languages. Some of the differences can be explained by the substrate influence of the parent stock of the
Vedda language
Vedda is an endangered language that is used by the indigenous Vedda people of Sri Lanka. Additionally, communities such as Coast Veddas and Anuradhapura Veddas who do not strictly identify as Veddas also use words from the Vedda language in par ...
. Sinhala has many words that are only found in Sinhala, or shared between Sinhala and Vedda and not etymologically derivable from Middle or Old Indo-Aryan. Possible examples include ''kola'' for leaf in Sinhala and Vedda (although others suggest a Dravidian origin for this word.), ''dola'' for pig in Vedda and offering in Sinhala. Other common words are ''rera'' for wild duck, and ''gala'' for stones (in
toponyms
Toponymy, toponymics, or toponomastics is the study of '' toponyms'' (proper names of places, also known as place names and geographic names), including their origins, meanings, usage and types. Toponym is the general term for a proper name of ...
used throughout the island, although others have also suggested a Dravidian origin).
There are also high frequency words denoting body parts in Sinhala, such as ''olluva'' for head, ''kakula'' for leg, ''bella'' for neck and ''kalava'' for thighs, that are derived from pre-Sinhalese languages of Sri Lanka.
The author of the oldest Sinhala grammar, ''Sidatsangarava'', written in the 13th century CE, recognised a category of words that exclusively belonged to early Sinhala. The grammar lists ''naramba'' (to see) and ''kolamba'' (fort or harbour) as belonging to an indigenous source. ''Kolamba'' is the source of the name of the commercial capital
Colombo
Colombo ( ; si, කොළඹ, translit=Koḷam̆ba, ; ta, கொழும்பு, translit=Koḻumpu, ) is the executive and judicial capital and largest city of Sri Lanka by population. According to the Brookings Institution, Colombo me ...
.
South Dravidian substratum influence
The loss of aspirated stops and the consistent left branching syntax in Sinhala is attributed to a probable
South Dravidian
The Dravidian languages (or sometimes Dravidic) are a family of languages spoken by 250 million people, mainly in southern India, north-east Sri Lanka, and south-west Pakistan. Since the colonial era, there have been small but significant i ...
substratum effect. This has been explained by a period of prior bilingualism:
Influences from neighbouring languages
In addition to many
Tamil loanwords, several phonetic and grammatical features present in neighbouring
Dravidian languages
The Dravidian languages (or sometimes Dravidic) are a family of languages spoken by 250 million people, mainly in southern India, north-east Sri Lanka, and south-west Pakistan. Since the colonial era, there have been small but significant ...
, setting today's spoken Sinhala apart from its Northern Indo-Aryan siblings, bear witness to the close interactions with Dravidian speakers. Some of the features that may be traced to Dravidian influence are –
* the loss of aspiration
* the use of the
attributive verb An attributive verb is a verb that modifies (expresses an attribute of) a noun in the manner of an attributive adjective, rather than express an independent idea as a predicate.
In English (and in most European languages), verb forms that can be u ...
of ''kiyana'' "to say" as a subordinating
conjunction
Conjunction may refer to:
* Conjunction (grammar), a part of speech
* Logical conjunction, a mathematical operator
** Conjunction introduction, a rule of inference of propositional logic
* Conjunction (astronomy), in which two astronomical bodies ...
with the meanings "that" and "if", e.g.:
European influence
As a result of centuries of colonial rule, interaction, settlement, intermarriage and assimilation, modern Sinhala contains many
Portuguese
Portuguese may refer to:
* anything of, from, or related to the country and nation of Portugal
** Portuguese cuisine, traditional foods
** Portuguese language, a Romance language
*** Portuguese dialects, variants of the Portuguese language
** Portu ...
,
Dutch
Dutch commonly refers to:
* Something of, from, or related to the Netherlands
* Dutch people ()
* Dutch language ()
Dutch may also refer to:
Places
* Dutch, West Virginia, a community in the United States
* Pennsylvania Dutch Country
People E ...
and
English loanwords.
Influences on other languages
Macanese Patois
Macanese patois, known as to its speakers, is a Portuguese-based creole language with a substrate from Cantonese, Malay and Sinhala, which was originally spoken by the Macanese community of the Portuguese colony of Macau. It is now spoken by ...
or Macau Creole (known as ''Patuá'' to its speakers) is a
creole language
A creole language, or simply creole, is a stable natural language that develops from the simplifying and mixing of different languages into a new one within a fairly brief period of time: often, a pidgin evolved into a full-fledged language. ...
derived mainly from
Malay
Malay may refer to:
Languages
* Malay language or Bahasa Melayu, a major Austronesian language spoken in Indonesia, Malaysia, Brunei and Singapore
** History of the Malay language, the Malay language from the 4th to the 14th century
** Indonesi ...
, Sinhala,
Cantonese
Cantonese ( zh, t=廣東話, s=广东话, first=t, cy=Gwóngdūng wá) is a language within the Chinese (Sinitic) branch of the Sino-Tibetan languages originating from the city of Guangzhou (historically known as Canton) and its surrounding are ...
, and
Portuguese
Portuguese may refer to:
* anything of, from, or related to the country and nation of Portugal
** Portuguese cuisine, traditional foods
** Portuguese language, a Romance language
*** Portuguese dialects, variants of the Portuguese language
** Portu ...
, which was originally spoken by the
Macanese people
The Macanese people ( pt, Macaense) are an East Asian ethnic group that originated in Macau in the 16th century, consisting of people of predominantly mixed Cantonese and Portuguese as well as Malay, Japanese, English, Sinhalese, and Indian anc ...
of the
Portuguese
Portuguese may refer to:
* anything of, from, or related to the country and nation of Portugal
** Portuguese cuisine, traditional foods
** Portuguese language, a Romance language
*** Portuguese dialects, variants of the Portuguese language
** Portu ...
colony of
Macau
Macau or Macao (; ; ; ), officially the Macao Special Administrative Region of the People's Republic of China (MSAR), is a city and special administrative region of China in the western Pearl River Delta by the South China Sea. With a pop ...
. It is now spoken by a few families in Macau and in the Macanese
diaspora
A diaspora ( ) is a population that is scattered across regions which are separate from its geographic place of origin. Historically, the word was used first in reference to the dispersion of Greeks in the Hellenic world, and later Jews after ...
.
The language developed first mainly among the descendants of Portuguese settlers who often married women from
Malacca
Malacca ( ms, Melaka) is a state in Malaysia located in the southern region of the Malay Peninsula, next to the Strait of Malacca. Its capital is Malacca City, dubbed the Historic City, which has been listed as a UNESCO World Heritage Site si ...
and
Sri Lanka
Sri Lanka (, ; si, ශ්රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
rather than from neighbouring
China
China, officially the People's Republic of China (PRC), is a country in East Asia. It is the world's most populous country, with a population exceeding 1.4 billion, slightly ahead of India. China spans the equivalent of five time zones and ...
, so the language had strong Malay and Sinhala influence from the beginning.
Accents and dialects
The Sinhala language has different types of variations which are commonly identified as 'dialects and accents'. Among those variations,
'regional variations
' are prominent. Some of the well-known regional variations of Sinhala language are:
# The
Uva Province
Uva Province ( si, ඌව පළාත, Uva Paḷāta, ta, ஊவா மாகாணம், Uvā Mākāṇam) is Sri Lanka's second least populated Provinces of Sri Lanka, province, with 1,259,880 people, created in 1896. It consists of two dis ...
variation (Monaragala, Badulla).
# The southern variation (Matara, Galle).
# The up-country variation (Kandy, Matale).
# The Sabaragamu variation (Kegalle, Balangoda).
Uva regional variation in relation to grammar
People from Uva province also have a unique linguistic variation in relation to the pronunciation of words. In general, Sinhala singular words are pluralized by adding suffixes like O, hu, wal or waru. But when it comes to Monaragala, the situation is somewhat different as when nouns are pluralized a nasal sound is added.
Southern variation
The Kamath language (an indigenous language of paddy culture) used by the Southerners is somewhat different from the ‘Kamath language’ used in other parts (Uva, Kandy) of Sri Lanka as it is marked with a systematic variation; ‘boya’ at the end of the majority of nouns as the examples below show.
Crops: ‘Kurakkan boya’ (bran)
‘Rambakan boya’ (banana)
Tools: ‘Thattu boya’ (bucket)
Other words: ‘Nivahan boya’ (home)
Here the particular word ‘boya’ means ‘a little’ in the Southern region and at the end of most of nouns, 'boya' is added regularly. This particular word 'boya' is added to most words by the Southern villages as a token of respect towards the things (those things can be crops, tools etc.) they are referring to.
Kandy, Kegalle and Galle people
Even though the Kandy, Kegalle and Galle people pronounce words with slight differences, the Sinhalese can understand the majority of the sentences.
Diglossia
In Sinhala there is distinctive
diglossia
In linguistics, diglossia () is a situation in which two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled " ...
, as in many languages of South Asia. The
literary language
A literary language is the form (register) of a language used in written literature, which can be either a nonstandard dialect or a standardized variety of the language. Literary language sometimes is noticeably different from the spoken langu ...
and the
spoken language
A spoken language is a language produced by articulate sounds or (depending on one's definition) manual gestures, as opposed to a written language. An oral language or vocal language is a language produced with the vocal tract in contrast with a si ...
differ from each other in many aspects. The written language is used for all forms of
literary texts but also orally at formal occasions (public speeches, TV and radio news broadcasts, etc.), whereas the spoken language is used as the language of communication in everyday life (see also
Sinhala slang
Sinhala slang is used by speakers of the Sinhala language in Sri Lanka, as well as many other Sinhala-speaking individuals.
Sinhala dialects
Sinhala dialects are the various minor variations of Sinhalese language which are based on the locale ...
and
colloquialism
Colloquialism (), also called colloquial language, everyday language or general parlance, is the style (sociolinguistics), linguistic style used for casual (informal) communication. It is the most common functional style of speech, the idiom norm ...
). As a rule, the literary language uses more
Sanskrit
Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
-based words.
Sinhala diglossia can also be described in terms of informal and formal varieties. The variety used for formal purposes is closer to the written/literary variety, whereas the variety used for informal purposes is closer to the spoken variety. It is also used in some modern literature (e.g. Liyanage Amarakeerthi's ''Kurulu Hadawatha'').
The most important difference between the two varieties is the lack of
inflected
In linguistic morphology, inflection (or inflexion) is a process of word formation in which a word is modified to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, mood, animacy, and defini ...
verb
A verb () is a word (part of speech) that in syntax generally conveys an action (''bring'', ''read'', ''walk'', ''run'', ''learn''), an occurrence (''happen'', ''become''), or a state of being (''be'', ''exist'', ''stand''). In the usual descri ...
forms in the spoken language.
The children are taught the written language at school almost like a
foreign language
A foreign language is a language that is not an official language of, nor typically spoken in, a given country, and that native speakers from that country must usually acquire through conscious learning - be this through language lessons at schoo ...
.
Sinhala also has diverse
slang
Slang is vocabulary (words, phrases, and linguistic usages) of an informal register, common in spoken conversation but avoided in formal writing. It also sometimes refers to the language generally exclusive to the members of particular in-gro ...
. Most slang words and terms were regarded as taboo, and most were frowned upon as non-scholarly. However, nowadays Sinhala slang words and terms, even the ones with sexual references, are commonly used among younger Sri Lankans.
Sinhala script
The Sinhala script ( si, සිංහල අක්ෂර මාලාව, Siṁhala Akṣara Mālāva), also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write ...
, ''Sinhala hodiya'', is based on the ancient
Brahmi script
Brahmi (; ; ISO: ''Brāhmī'') is a writing system of ancient South Asia. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as 'lath' o ...
, as are most Indian scripts. Sinhala script is closely related to
South India
South India, also known as Dakshina Bharata or Peninsular India, consists of the peninsular southern part of India. It encompasses the Indian states of Andhra Pradesh, Karnataka, Kerala, Tamil Nadu, and Telangana, as well as the union territo ...
n
Grantha script
The Grantha script ( ta, கிரந்த எழுத்து, Granta eḻuttu; ml, ഗ്രന്ഥലിപി, granthalipi) is a South Indian script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, t ...
and
Khmer script
Khmer script ( km, អក្សរខ្មែរ, )Huffman, Franklin. 1970. ''Cambodian System of Writing and Beginning Reader''. Yale University Press. . is an abugida (alphasyllabary) script used to write the Khmer language, the official la ...
taken the elements from the related
Kadamba script
The Kadamba script is the first writing system devised specifically for writing Kannada and Telugu. It is a descendant of the Brahmi script. The Kadamba script is also known as ''Pre-Old-Kannada script''.
The Kadamba script is one of the oldes ...
.
The writing system for Sinhala is an
abugida
An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
, where the consonants are written with letters while the vowels are indicated with
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s (''pilla'') on those consonants, unlike English where both consonants and vowels are full letters, or Urdu where vowels need not be written at all. Also, when a diacritic is not used, an "inherent vowel", either or , is understood, depending on the position of the consonant within the word. For example, the letter ක ''k'' on its own indicates ''ka'', either or . The various vowels are written කා , කැ , කෑ (after the consonant), කි , කී (above the consonant), කු , කූ (below the consonant), කෙ , කේ (before the consonant), කො , කෝ (surrounding the consonant). There are also a few diacritics for consonants, such as in special circumstances, although the tendency nowadays is to spell words with the full letter ර , plus either a preceding or following ''hal kirima''. One word that is still spelt with an "r" diacritic is ශ්රී, as in ශ්රී ලංකාව (Sri Lankāwa). The "r" diacritic is the curved line under the first letter ("ශ": "ශ්ර"). A second diacritic, this time for the vowel sound completes the word ("ශ්ර": "ශ්රීී"). For simple without a vowel, a vowel-cancelling diacritic (
virama
Virama ( ्) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either
# halanta, hasanta or explicit virā ...
) called හල් කිරීම is used: ක් . Several of these diacritics occur in two forms, which depend on the shape of the consonant letter. Vowels also have independent letters, but these are only used at the beginning of words where there is no preceding consonant to add a diacritic to.
The complete script consists of about 60 letters, 18 for vowels and 42 for consonants. However, only 57 (16 vowels and 41 consonants) are required for writing colloquial spoken Sinhala (''suddha Sinhala''). The rest indicate sounds that have been merged in the course of linguistic change, such as the aspirates, and are restricted to
Sanskrit
Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
and
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
loan words. One letter (ඦ), representing the sound /ⁿd͡ʒa/, is attested although no words using this letter are attested.
Sinhala is written from left to right and Sinhala script is mainly used for Sinhala, as well as the liturgical languages
Pali
Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist ''Pāli Canon'' or ''Tipiṭaka'' as well as the sacred language of ''Theravāda'' Buddhism ...
and
Sanskrit
Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
. The alphabetic sequence is similar to those of other Brahmic scripts:
Phonology
Sinhala has so-called
prenasalized consonant
Prenasalized consonants are phonetic sequences of a nasal and an obstruent (or occasionally a non-nasal sonorant such as ) that behave phonologically like single consonants. The primary reason for considering them to be single consonants, rathe ...
s, or 'half nasal' consonants. A short
homorganic nasal occurs before a voiced stop, it is shorter than a sequence of nasal plus stop. The nasal is syllabified with the onset of the following syllable, which means that the
moraic weight of the preceding syllable is left unchanged. For example, ''tam̆ba'' 'copper' contrasts with ''tamba'' 'boil'.
/f~ɸ/ and /ʃ/ are restricted to loans, typically English or Sanskrit. They are commonly replaced by /p/ and /s/ respectively in colloquial speech. Some speakers use the
voiceless labiodental fricative
The voiceless labiodental fricative is a type of consonantal sound used in a number of spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is .
Some scholars also posit the voiceless labiodental approx ...
as in English, and some use the
voiceless bilabial fricative
The voiceless bilabial fricative is a type of consonantal sound, used in some spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is .
Features
Features of the voiceless bilabial fricative:
Occ ...
due to its similarity to the native
voiceless bilabial stop
The voiceless bilabial plosive or stop is a type of consonantal sound used in most spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , and the equivalent X-SAMPA symbol is p.
Features
Features of ...
/p/.
Long /əː/ is restricted to English loans. /a/ and /ə/ are allophones in Sinhala and contrast with each other in stressed and unstressed syllables respectively. In writing, /a/ and /ə/ are both spelt without a vowel sign attached to the consonant letter, so the patterns of stress in the language must be used to determine the correct pronunciation. Most Sinhala syllables are of the form CV. The first syllable of each word is stressed, with the exception of the verb කරනවා /kərənəˈwaː/ ("to do") and all of its inflected forms where the first syllable is unstressed. Syllables using long vowels are always stressed. The remainder of the syllables are unstressed if they use a short vowel, unless they are immediately followed by one of: a CCV syllable, final /j(i)/ (-යි), final /wu/ (-වු), or a final consonant without a following vowel. The sound /ha/ is always stressed in nouns, adjectives, and adverbs, and so is not pronounced /hə/ except in the word හතලිහ /ˈhat̪əlihə/ ("forty"), where the initial /ha/ is stressed and the final /hə/ is unstressed.
Morphology
Nominal morphology
The main features marked on Sinhala nouns are case, number, definiteness and
animacy
Animacy (antonym: inanimacy) is a grammatical and semantic feature, existing in some languages, expressing how sentient or alive the referent of a noun is. Widely expressed, animacy is one of the most elementary principles in languages around th ...
.
Cases
Sinhala distinguishes several cases. The five primary cases are the
nominative
In grammar, the nominative case (abbreviated ), subjective case, straight case or upright case is one of the grammatical cases of a noun or other part of speech, which generally marks the subject of a verb or (in Latin and formal variants of Engl ...
,
accusative
The accusative case (abbreviated ) of a noun is the grammatical case used to mark the direct object of a transitive verb.
In the English language, the only words that occur in the accusative case are pronouns: 'me,' 'him,' 'her,' 'us,' and ‘the ...
,
dative
In grammar, the dative case (abbreviated , or sometimes when it is a core argument) is a grammatical case used in some languages to indicate the recipient or beneficiary of an action, as in "Maria Jacobo potum dedit", Latin for "Maria gave Jacob a ...
,
genitive
In grammar, the genitive case (abbreviated ) is the grammatical case that marks a word, usually a noun, as modifying another word, also usually a noun—thus indicating an attributive relationship of one noun to the other noun. A genitive can al ...
, and
ablative
In grammar, the ablative case (pronounced ; sometimes abbreviated ) is a grammatical case for nouns, pronouns, and adjectives in the grammars of various languages; it is sometimes used to express motion away from something, among other uses. T ...
. Some scholars also suggest that it has a
locative
In grammar, the locative case (abbreviated ) is a grammatical case which indicates a location. It corresponds vaguely to the English prepositions "in", "on", "at", and "by". The locative case belongs to the general local cases, together with the ...
and
instrumental
An instrumental is a recording normally without any vocals, although it might include some inarticulate vocals, such as shouted backup vocals in a big band setting. Through semantic widening, a broader sense of the word song may refer to instru ...
case. However, for inanimate nouns the locative and genitive, and instrumental and ablative, are identical. In addition, for animate nouns these cases formed by placing ''atiŋ'' ("with the hand") and ''laᵑgə'' ("near") directly after the nominative.
The brackets with most of the vowel length symbols indicate the optional shortening of long vowels in certain
unstressed
In linguistics, and particularly phonology, stress or accent is the relative emphasis or prominence given to a certain syllable in a word or to a certain word in a phrase or sentence. That emphasis is typically caused by such properties as i ...
syllables.
Number marking
In Sinhala animate nouns, the plural is marked with ''-o(ː)'', a
long consonant
In phonetics and phonology, gemination (), or consonant lengthening (from Latin 'doubling', itself from '' gemini'' 'twins'), is an articulation of a consonant for a longer period of time than that of a singleton consonant. It is distinct from ...
plus ''-u'', or with ''-la(ː)''. Most inanimates mark the plural through
disfix
In linguistic morphology, a disfix is a subtractive morpheme, a morpheme manifest through the subtraction of segments from a root or stem. Although other forms of disfixation exist, the element subtracted is usually the final segment of the stem ...
ation.
Loanword
A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because th ...
s from English mark the singular with ''ekə'', and do not mark the plural. This can be interpreted as a
singulative number
In linguistics, singulative number and collective number (abbreviated and ) are terms used when the grammatical number for multiple items is the unmarked form of a noun, and the noun is specially marked to indicate a single item.
This is the ...
.
On the left hand side of the table, plurals are longer than singulars. On the right hand side, it is the other way round, with the exception of paːrə "street". Note that
animate
Animation is a method by which still figures are manipulated to appear as moving images. In traditional animation, images are drawn or painted by hand on transparent celluloid sheets to be photographed and exhibited on film. Today, most anima ...
lexemes are mostly in the classes on the left-hand side, while
animate
Animation is a method by which still figures are manipulated to appear as moving images. In traditional animation, images are drawn or painted by hand on transparent celluloid sheets to be photographed and exhibited on film. Today, most anima ...
lexemes are most often in the classes on the right hand.
Indefinite article
The indefinite article is ''-ek'' for animates and ''-ak'' for inanimates. The indefinite article exists only in the singular, where its absence marks definiteness. In the plural, (in)definiteness does not receive special marking.
Verbal morphology
Sinhala distinguishes three conjugation classes.
Spoken Sinhala does not mark person, number or gender on the verb (literary Sinhala does). In other words, there is no subject–verb agreement.
Syntax
* Left-branching language (see
branching), which means that determining elements are usually put in front of what they determine (see example below).
* An exception to this is formed by statements of quantity which usually stand behind what they define. Example: "the four flowers" translates to , literally "flowers four". On the other hand, it can be argued that the numeral is the head in this construction, and the flowers the modifier, so that a more literal English rendering would be "a floral foursome"
* SOV (
subject–object–verb
Subject ( la, subiectus "lying beneath") may refer to:
Philosophy
*'' Hypokeimenon'', or ''subiectum'', in metaphysics, the "internal", non-objective being of a thing
**Subject (philosophy), a being that has subjective experiences, subjective con ...
)
word order
In linguistics, word order (also known as linear order) is the order of the syntactic constituents of a language. Word order typology studies it from a cross-linguistic perspective, and examines how different languages employ different orders. C ...
, common to most left-branching languages.
* As is common in left-branching languages, it has no prepositions, only postpositions (see
Adposition
Prepositions and postpositions, together called adpositions (or broadly, in traditional grammar, simply prepositions), are a class of words used to express spatial or temporal relations (''in'', ''under'', ''towards'', ''before'') or mark various ...
). Example: "under the book" translates to , literally "book under".
* Sinhala has no
copula: "I am rich" translates to , literally "I rich". There are two
existential verbs, which are used for
locative predications, but these verbs are not used for predications of class-membership or property-assignment, unlike English ''is''.
* There are almost no
conjunctions as English ''that'' or ''whether'', but only
non-finite clauses that are formed by the means of
participle
In linguistics, a participle () (from Latin ' a "sharing, partaking") is a nonfinite verb form that has some of the characteristics and functions of both verbs and adjectives. More narrowly, ''participle'' has been defined as "a word derived from ...
s and
verbal adjectives. Example: "The man who writes books" translates to , literally "books writing man".
Semantics
There is a four-way
deictic
In linguistics, deixis (, ) is the use of general words and phrases to refer to a specific time, place, or person in context, e.g., the words ''tomorrow'', ''there'', and ''they''. Words are deictic if their semantic meaning is fixed but their de ...
system (which is rare): There are four demonstrative
stems (see
demonstrative pronoun
Demonstratives (abbreviated ) are words, such as ''this'' and ''that'', used to indicate which entities are being referred to and to distinguish those entities from others. They are typically deictic; their meaning depending on a particular frame ...
s) "here, close to the speaker", "there, close to the person addressed", "there, close to a third person, visible" and "there, close to a third person, not visible".
Use of (''thuma'')
Sinhalese has an all-purpose odd suffix (''thuma)'' which when suffixed to a pronoun creates a formal and respectful tone in reference to a person. This is usually used in referring to politicians, nobles, and priests.
e.g. oba thuma (ඔබ තුමා) - you (vocative, when addressing a minister, high-ranking official, or generally showing respect in public etc.)
janadhipathi thuma () - the president (third person)
Discourse
Sinhala is a
pro-drop language
A pro-drop language (from "pronoun-dropping") is a language where certain classes of pronouns may be omitted when they can be pragmatically or grammatically inferable. The precise conditions vary from language to language, and can be quite int ...
: Arguments of a sentence can be omitted when they can be inferred from context. This is true for
subject—as in Italian, for instance—but also objects and other parts of the sentence can be "dropped" in Sinhala if they can be inferred. In that sense, Sinhala can be called a "super pro-drop language", like
Japanese
Japanese may refer to:
* Something from or related to Japan, an island country in East Asia
* Japanese language, spoken mainly in Japan
* Japanese people, the ethnic group that identifies with Japan through ancestry or culture
** Japanese diaspor ...
.
Example: The sentence , literally "where went", can mean "where did I/you/he/she/we... go".
See also
*
Sinhala honorifics
*
Sinhala idioms and proverbs
Sinhala idioms ( si, රූඩි, ''rūḍi'') and colloquial expressions that are widely used to communicate figuratively, as with any other developed language. This page also contains a list of old and popular Sinhala proverb
A proverb (f ...
*
Sinhala keyboard
*
Sinhala numerals
Sinhala numerals, are the units of the numeral system, originating from the Indian subcontinent, used in Sinhala language in modern-day Sri Lanka.
Numerals or numerations around Kandyan Kingdom
It had been found that five different types of nu ...
*
Sinhala slang
Sinhala slang is used by speakers of the Sinhala language in Sri Lanka, as well as many other Sinhala-speaking individuals.
Sinhala dialects
Sinhala dialects are the various minor variations of Sinhalese language which are based on the locale ...
*
Madura English–Sinhala Dictionary
References
Bibliography
* Gair, James: ''Sinhala and Other South Asian Languages'', New York 1998.
*
*
*
Further reading
*
*
*
*
*
everal new editions
* (''Article on the use of slang amongst Sinhalese Raggers.'')
External links
*
Charles Henry CarterA Sinhalese-English dictionary.Colombo: The "Ceylon Observer" Printing Works; London: Probsthain & Co., 1924.
* Simhala Sabdakosa Karyamsaya
Sanksipta Simhala Sabdakosaya.Kolamba : Samskrtika Katayutu Pilibanda Departamentuva, 2007–2009.
Sinhala Dictionary and Language Translator– Madura Online English
Kapruka Sinhala dictionary
{{DEFAULTSORT:Sinhala Language
Southern Indo-Aryan languages
Fusional languages
Languages of Sri Lanka
Subject–object–verb languages