The Indo-Aryan languages (or sometimes Indic languages) are a branch of the
Indo-Iranian languages
in the
Indo-European language family
. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in
India
,
Pakistan
,
Bangladesh
,
Nepal
,
Sri Lanka
, and
Maldives.
Moreover, apart from the
Indian subcontinent
, large immigrant and expatriate Indo-Aryan–speaking communities live in
Northwestern Europe,
Western Asia
,
North America, the
Caribbean,
Southeast Africa,
Polynesia
and
Australia, along with several million speakers of
Romani language
s primarily concentrated in
Southeastern Europe. There are over 200 known Indo-Aryan languages.
Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early
Vedic Sanskrit
, through
Middle Indo-Aryan languages (or
Prakrit
s).
The largest such languages in terms of
first-language speakers are
Hindi–Urdu (),
[Standard Hindi first language: 260.3 million (2001), as second language: 120 million (1999). Urdu L1: 68.9 million (2001–2014), L2: 94 million (1999): ''Ethnologue'' 19.] Bengali (242 million),
Punjabi
(about 120 million),
Marathi
(112 million),
Gujarati (60 million),
Rajasthani (58 million),
Bhojpuri (51 million),
Odia (35 million),
Maithili (about 34 million),
Sindhi (25 million),
Nepali
(16 million),
Assamese
(15 million),
Chhattisgarhi
(18 million),
Sinhala (17 million), and
Romani
(). A 2005 estimate placed the total number of native speakers of the Indo-Aryan languages at nearly 900 million people.
Classification
Theories

The Indo-Aryan family as a whole is thought to represent a
dialect continuum
, where languages are often transitional towards neighboring varieties. Because of this, the division into languages vs. dialects is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a
tree model
is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the
wave model.
Subgroups
The following table of proposals is expanded from . Note that the table only lists some modern Indo-Aryan languages.
Anton I. Kogan
, in 2016, conducted a
lexicostatistical study of the New Indo-Aryan languages based on a 100-word
Swadesh list, using techniques developed by the glottochronologist and comparative linguist
Sergei Starostin.
That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies, which showed that its lexical similarity to Indo-Aryan (43.5%) is low and differs negligibly from its similarity to Iranian (39.3%). He also calculated Sinhala–Dhivehi to be the most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features.
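The percentages quoted above are lexicostatistical similarity scores: the share of basic-vocabulary concepts for which two languages use cognate words. The following is a much-simplified sketch of that arithmetic, not the procedure used in the study, which also has to deal with issues such as loanwords and synonyms. The function name, the concept lists and the cognate-class labels are all hypothetical and chosen only for illustration.

```python
from typing import Mapping, Optional


def lexical_similarity(
    a: Mapping[str, Optional[str]],
    b: Mapping[str, Optional[str]],
) -> float:
    """Return the percentage of comparable concepts with matching cognate classes.

    Each mapping goes from a concept (e.g. an item of a Swadesh-style list)
    to a cognate-class label, or None if the word is unattested.
    """
    # Only compare concepts attested in both languages.
    shared = [c for c in a if c in b and a[c] is not None and b[c] is not None]
    if not shared:
        raise ValueError("no comparable concepts")
    matches = sum(1 for c in shared if a[c] == b[c])
    return 100.0 * matches / len(shared)


# Hypothetical cognate-class codings for two unnamed languages (not real data).
lang_x = {"water": "A", "fire": "A", "hand": "B", "eye": "A", "stone": None}
lang_y = {"water": "A", "fire": "C", "hand": "B", "eye": "A", "stone": "A"}

print(lexical_similarity(lang_x, lang_y))  # 3 of 4 comparable concepts match: 75.0
```

With a real 100-word list the same ratio would be computed over up to 100 concepts, so a score such as 43.5% corresponds to fewer than half of the comparable items being judged cognate.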
Inner–Outer hypothesis
The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan (generally including Eastern and Southern Indo-Aryan, and sometimes Northwestern Indo-Aryan,
Dardic and
Pahari) representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with the newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by
Rudolf Hoernlé
in 1880 and refinement by
George Grierson it has undergone numerous revisions and a great deal of debate, with the most recent iteration by
Franklin Southworth and
Claus Peter Zoller based on robust linguistic evidence (particularly an Outer past tense in ''-l-''). Some of the theory's skeptics include
Suniti Kumar Chatterji and
Colin P. Masica
.
Groups
The below classification follows , and .
Dardic
The Dardic languages (also Dardu or Pisaca) are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by
George Abraham Grierson in his
Linguistic Survey of India but he did not consider it to be a subfamily of Indo-Aryan. The Dardic group as a genetic grouping (rather than areal) has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among
[Dardic languages] may result from subsequent convergence".
The Dardic languages are thought to be transitional with Punjabi and Pahari (e.g. Zoller describes Kashmiri as "an interlink between Dardic and West Pahāṛī"),
as well as non-Indo-Aryan Nuristani, and are renowned for their relatively conservative features in the context of
Proto-Indo-Aryan.
* Kashmiri:
Kashmiri
,
Kishtwari,
Poguli;
* Shina:
Brokskad,
Kundal Shahi,
Shina,
Ushojo
,
Kalkoti,
Palula,
Savi;
* Chitrali:
Kalasha,
Khowar;
* Kohistani:
Bateri,
Chilisso,
Gowro,
Indus Kohistani,
Kalami,
Tirahi,
Torwali,
Wotapuri-Katarqalai
;
*
Pashayi
* Kunar:
Dameli,
Gawar-Bati
,
Nangalami
,
Shumashti
.
Northern Zone
The Northern Indo-Aryan languages, also known as the Pahari ('hill') languages, are spoken throughout the Himalayan regions of the subcontinent.
* Eastern Pahari:
Nepali
,
Jumli
,
Doteli;
* Central Pahari:
Garhwali,
Kumaoni;
*
Western Pahari (Himachali):
Dogri,
Kangri,
Bhadarwahi,
Churahi
,
Bhateali,
Bilaspuri,
Chambeali,
Gaddi,
Pangwali,
Mandeali,
Mahasu Pahari,
Jaunsari,
Kullui,
Pahari Kinnauri,
Hinduri,
Sarazi
,
Sirmauri.
Northwestern Zone
Northwestern Indo-Aryan languages are spoken in northwestern India and eastern Pakistan.
Punjabi
is spoken predominantly in the
Punjab region and is the official language of
the northern Indian state of Punjab, in addition to being the most widely spoken language in Pakistan. To the south,
Sindhi and its variants are spoken, primarily in
Sindh
. Northwestern languages are ultimately thought to be descended from
Shauraseni Prakrit.
*
Punjabi
** Eastern Punjabi:
Punjabi
,
Doabi,
Majhi,
Malwai,
Puadhi,
Sansi;
** Western Punjabi (
Lahnda
):
Saraiki,
Hindko,
Pahari-Pothwari,
Inku†;
*
Sindhi:
Sindhi,
Jadgali,
Kutchi,
Luwati,
Memoni,
Khetrani
,
Kholosi
.
Western Zone
Western Indo-Aryan languages are spoken in the central and western areas within India, such as
Madhya Pradesh
and
Rajasthan
, in addition to contiguous regions in Pakistan. Gujarati is the official language of
Gujarat
, and is spoken by over 50 million people. In Europe, various
Romani languages are spoken by the
Romani people
, an itinerant community who historically migrated from India. The Western Indo-Aryan languages are thought to have diverged from their northwestern counterparts, although they have a common antecedent in
Shauraseni Prakrit.
*
Rajasthani: Standard Rajasthani,
Bagri,
Marwari,
Mewati,
Dhundari,
Harauti,
Mewari,
Shekhawati
,
Dhatki,
Malvi,
Nimadi,
Gujari
,
Goaria,
Loarki
,
Bhoyari,
Kanjari,
Od;
*
Gujarati:
Gujarati,
Jandavra
,
Saurashtra,
Aer,
Vaghri,
Parkari Koli,
Kachi Koli
,
Wadiyara Koli;
*
Bhil:
Kalto,
Vasavi,
Wagdi
,
Gamit,
Vaagri Booli;
** Northern Bhil:
Bauria,
Bhilori,
Magari;
** Central Bhil:
Bhili proper,
Bhilali,
Chodri,
Dhodia,
Dhanki,
Dubli;
** Bareli:
Palya Bareli,
Pauri Bareli,
Rathwi Bareli,
Pardhi
;
*
Khandeshi
*
Lambadi
*
Domaaki
*
Domari
*
Romani
:
Carpathian Romani,
Balkan Romani
,
Vlax Romani;
**
Northern Romani:
Sinte Romani,
Finnish Kalo,
Baltic Romani.
Central Zone (Madhya ''or'' Hindi)
Within India,
Hindi languages are spoken primarily in the
Hindi belt
regions and
Gangetic plains, including
Delhi
and the surrounding areas, where they are often transitional with neighbouring lects. Many of these languages, including
Braj and
Awadhi, have rich literary and poetic traditions.
Urdu, a Persianized derivative of
Khariboli, is the official language of
Pakistan
and also has strong
historical connections to
India
, where it has also been granted official status.
Hindi
, a standardized and Sanskritized register of
Khariboli, is the official language of the
Government of India
.
Together with Urdu, it is the third most-spoken language in the world.
* Western Hindi:
Hindustani
(including
Standard Hindi and
Standard Urdu),
Khariboli,
Braj,
Haryanvi,
Bundeli
,
Kannauji,
Parya
;
* Eastern Hindi:
Bagheli,
Chhattisgarhi
,
Surgujia
;
**
Awadhi:
Fiji Hindi,
Caribbean Hindustani
Eastern Zone
The Eastern Indo-Aryan languages, also known as Magadhan languages, are spoken throughout the eastern subcontinent, including
Odisha
and
Bihar
, alongside other regions surrounding the northwestern Himalayan corridor.
Bengali is the seventh most-spoken language in the world, and has a strong literary tradition; the
national anthem
s of
India
and
Bangladesh
are written in Bengali.
Assamese
and
Odia are the official languages of
Assam
and
Odisha
, respectively. The Eastern Indo-Aryan languages descend from Magadhan
Apabhraṃśa and ultimately from
Magadhi Prakrit.
*
Bihari:
**
Bhojpuri,
Caribbean Hindustani,
Fiji Hindi;
**
Magahi,
Khortha;
**
Maithili,
Angika,
Bajjika, Dehati;
**
Sadanic:
Nagpuri (Sadri),
Kurmali (Panchpargania);
**
Tharu,
Kochila Tharu
,
Buksa,
Majhi,
Musasa;
**
Kumhali
, Kuswaric:
Danwar,
Bote-Darai;
*
Halbic:
Halbi,
Kamar,
Bhunjia,
Nahari;
*
Odia:
Baleswari, Kataki,
Ganjami,
Sundargadi,
Sambalpuri,
Desia;
**
Bodo Parja,
Bhatri
,
Reli,
Kupia;
*
Bengali–Assamese:
Bishnupriya Manipuri,
Hajong,
Chittagonian,
Chakma
,
Noakhailla
,
Tanchangya,
Rohingya,
Sylheti;
** Bengali-Gauda:
Bengali,
Bangali
,
Rarhi,
Varendri
, Sundarbani,
Manbhumi,
Dhakaiya Kutti,
Dobhashi;
** Kamarupic:
Assamese
,
Kamrupi,
Goalpariya,
Rangpuri,
Surjapuri
,
Rajbanshi;
Southern Zone
Marathi-Konkani languages are ultimately descended from
Maharashtri Prakrit, whereas Insular Indo-Aryan languages are descended from
Elu Prakrit
and possess several characteristics that markedly distinguish them from most of their mainland Indo-Aryan counterparts.
*
Marathi-Konkani
** Marathic:
Marathi
,
Varhadi,
Andh,
Berar-Deccan Marathi
,
Phudagi
,
Katkari
,
Varli,
Kadodi;
** Konkanic:
Konkani,
Canarese Konkani
,
Maharashtrian Konkani.
Insular Indic
Insular Indic languages (of
Sri Lanka
and
Maldives) started developing independently and diverging from the continental Indo-Aryan languages from around 5th century BCE.
* Insular Indo-Aryan
**
Sinhala
**
Maldivian: Dhivehi, Mahl
Unclassified
The following languages are otherwise unclassified within Indo-Aryan:
* Chinali–Lahul Lohar: Chinali, Lahul Lohar.
* Badeshi
History

Proto-Indo-Aryan
Proto-Indo-Aryan (or sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the pre-Vedic Indo-Aryans. Proto-Indo-Aryan is meant to be the predecessor of Old Indo-Aryan (1500–300 BCE), which is directly attested as Vedic and Mitanni-Aryan. Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of conservative features lost in Vedic.
Mitanni-Aryan hypothesis
Some theonyms, proper names, and other terminology of the Late Bronze Age Mitanni civilization of Upper Mesopotamia exhibit an Indo-Aryan superstrate. While the few written records left by the Mitanni are either in Hurrian (which appears to have been the predominant language of their kingdom) or Akkadian (the main diplomatic language of the Late Bronze Age Near East), these apparently Indo-Aryan names suggest that an Indo-Aryan elite imposed itself over the Hurrians in the course of the Indo-Aryan expansion. If these traces are Indo-Aryan, they would be the earliest known direct evidence of Indo-Aryan, and would increase the precision in dating the split between the Indo-Aryan and Iranian languages (as the texts in which the apparent Indicisms occur can be dated with some accuracy).
In a treaty between the Hittites and the Mitanni, the deities Mitra, Varuna, Indra, and the Ashvins (Nasatya) are invoked.
Kikkuli's horse training text includes technical terms such as ''aika'' (cf. Sanskrit ''eka'', "one"), ''tera'' (''tri'', "three"), ''panza'' (''pancha'', "five"), ''satta'' (''sapta'', "seven"), ''na'' (''nava'', "nine"), ''vartana'' (''vartana'', "turn", a round in the horse race). The numeral ''aika'' "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian (which has ''aiva''). Another text has ''babru'' (''babhru'', "brown"), ''parita'' (''palita'', "grey"), and a term corresponding to ''pingala'' ("red"). Their chief festival was the celebration of the solstice (''vishuva''), which was common in most cultures in the ancient world. The Mitanni warriors were called ''marya'', the term for "warrior" in Sanskrit as well; note ''mišta-nnu'' (= ''miẓḍha'', ≈ Sanskrit ''mīḍha'') "payment (for catching a fugitive)" (M. Mayrhofer, ''Etymologisches Wörterbuch des Altindoarischen'', Heidelberg, 1986–2000; Vol. II:358).
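As an illustration only, the numeral correspondences quoted above can be laid out as a small lookup table. The sketch below is in Python; the names `KIKKULI_TERMS`, `sanskrit_cognate` and `is_indo_aryan_type_one` are hypothetical, the data are limited to the forms quoted in this section, and the "-ka versus -va" check is a deliberately crude stand-in for the comparative argument about ''aika'' and ''aiva''.

```python
# Hypothetical illustration: Mitanni (Kikkuli) terms and the Sanskrit
# forms they are compared with in this section. Not a full lexicon.
KIKKULI_TERMS = {
    "aika": ("eka", "one"),
    "tera": ("tri", "three"),
    "panza": ("pancha", "five"),
    "satta": ("sapta", "seven"),
    "na": ("nava", "nine"),
    "vartana": ("vartana", "turn"),
}

# Early Iranian form for "one" cited in the text for contrast.
EARLY_IRANIAN_ONE = "aiva"


def sanskrit_cognate(term: str) -> str:
    """Format a quoted Mitanni term with its Sanskrit comparison and gloss."""
    sanskrit, gloss = KIKKULI_TERMS[term]
    return f'{term} : {sanskrit} "{gloss}"'


def is_indo_aryan_type_one(form: str) -> bool:
    """Crude check: does a word for "one" end in -ka (aika, eka) rather than -va (aiva)?"""
    return form.endswith("ka")


if __name__ == "__main__":
    for term in KIKKULI_TERMS:
        print(sanskrit_cognate(term))
    print(is_indo_aryan_type_one("aika"), is_indo_aryan_type_one(EARLY_IRANIAN_ONE))
```

The suffix check only restates the contrast drawn in the text; it is not a method for classifying unknown forms.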
Sanskritic interpretations of Mitanni royal names render Artashumara (''artaššumara'') as ''Ṛtasmara'' "who thinks of Ṛta" (Mayrhofer II 780), Biridashva (''biridašṷa, biriiašṷa'') as ''Prītāśva'' "whose horse is dear" (Mayrhofer II 182), Priyamazda (''priiamazda'') as ''Priyamedha'' "whose wisdom is dear" (Mayrhofer II 189, II 378), Citrarata as ''Citraratha'' "whose chariot is shining" (Mayrhofer I 553), Indaruda/Endaruta as ''Indrota'' "helped by Indra" (Mayrhofer I 134), Shativaza (''šattiṷaza'') as ''Sātivāja'' "winning the race prize" (Mayrhofer II 540, 696), Šubandhu as ''Subandhu'' "having good relatives" (a name in Palestine, Mayrhofer II 209, 735), Tushratta (''tṷišeratta, tušratta'', etc.) as *tṷaiašaratha, Vedic ''Tveṣaratha'' "whose chariot is vehement" (Mayrhofer, Etym. Wb., I 686, I 736).
Indian subcontinent
Dates indicate only a rough time frame.
* Proto-Indo-Aryan (before 1500 BCE, reconstructed)
* Old Indo-Aryan (ca. 1500–300 BCE)
** early Old Indo-Aryan: includes Vedic Sanskrit (ca. 1500 to 500 BCE)
** late Old Indo-Aryan: Epic Sanskrit, Classical Sanskrit (ca. 200 CE to 1300 CE)
** Mitanni Indo-Aryan (ca. 1400 BCE)
* Middle Indo-Aryan or Prakrits (ca. 300 BCE to 1500 CE)
** early Buddhist texts (ca. 6th or 5th century BCE)
** early Middle Indo-Aryan: e.g. Ashokan Prakrits, Pali, Gandhari (ca. 300 BCE to 200 BCE)
** middle Middle Indo-Aryan: e.g. Dramatic Prakrits, Elu (ca. 200 BCE to 700 CE)
** late Middle Indo-Aryan: e.g. Abahattha (ca. 700 CE to 1500 CE)
* Early Modern Indo-Aryan (Late Medieval India): e.g. early Dakhini and emergence of the Dehlavi dialect
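The rough periodization above can be sketched as a simple lookup, purely as an illustration. The Python below is a minimal sketch under stated assumptions: the names `STAGES` and `stage_for_year` are hypothetical, only the three dated top-level stages are encoded, BCE years are written as negative integers, and the boundaries are treated as sharp even though the list itself says the dates are only approximate.

```python
from typing import Optional

# Top-level stages and their rough date ranges, taken from the list above.
# Years are integers: negative for BCE, positive for CE. A start of None
# means "open-ended into the past".
STAGES = [
    ("Proto-Indo-Aryan", None, -1500),   # before 1500 BCE (reconstructed)
    ("Old Indo-Aryan", -1500, -300),     # ca. 1500-300 BCE
    ("Middle Indo-Aryan", -300, 1500),   # ca. 300 BCE to 1500 CE
]


def stage_for_year(year: int) -> Optional[str]:
    """Return the stage whose half-open range [start, end) contains the year."""
    for name, start, end in STAGES:
        if (start is None or year >= start) and year < end:
            return name
    return None  # later than the ranges dated in the list


if __name__ == "__main__":
    for y in (-2000, -1000, -300, 900, 1600):
        print(y, stage_for_year(y))
```

Because sub-stages such as Classical Sanskrit overlap the top-level ranges, this lookup reflects only the coarse three-way division, not the attested lifespan of any individual language.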
Old Indo-Aryan
The earliest evidence of the group is from Vedic Sanskrit, which is used in the ancient preserved texts of the Indian subcontinent, the foundational canon of the Hindu synthesis known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age to the language of the Rigveda, but the only evidence of it is a few proper names and specialized loanwords.
While Old Indo-Aryan is the earliest stage of the Indo-Aryan branch, from which all known languages of the later stages Middle and New Indo-Aryan are derived, some documented Middle Indo-Aryan variants cannot fully be derived from the documented form of Old Indo-Aryan (on which Vedic and Classical Sanskrit are based), but betray features that must go back to other undocumented variants/dialects of Old Indo-Aryan.
From Vedic Sanskrit, "Sanskrit" (literally "put together", "perfected" or "elaborated") developed as the prestige language of culture, science and religion, as well as the court, theatre, etc. Sanskrit of the later Vedic texts is comparable to Classical Sanskrit, but is largely mutually unintelligible with Vedic Sanskrit.
Middle Indo-Aryan (Prakrits)
Outside the learned sphere of Sanskrit, vernacular dialects (Prakrits) continued to evolve. The oldest attested Prakrits are the Buddhist and Jain canonical languages Pali and Ardhamagadhi Prakrit, respectively. Inscriptions in Ashokan Prakrit were also part of this early Middle Indo-Aryan stage.
By medieval times, the Prakrits had diversified into various Middle Indo-Aryan languages. ''Apabhraṃśa'' is the conventional cover term for transitional dialects connecting late Middle Indo-Aryan with early Modern Indo-Aryan, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the ''Śravakacāra'' of Devasena (dated to the 930s) is now considered to be the first Hindi book.
The next major milestone occurred with the Muslim conquests in the Indian subcontinent in the 13th–16th centuries. Under the flourishing Turco-Mongol Mughal Empire, Persian became very influential as the language of prestige of the Islamic courts, owing to its adoption by the Mughal emperors.
The two largest languages that formed from Apabhraṃśa were Bengali and Hindustani; others include Assamese, Sindhi, Gujarati, Odia, Marathi, and Punjabi.
New Indo-Aryan
Medieval Hindustani
In the Central Zone Hindi-speaking areas, for a long time the prestige dialect was Braj Bhasha, but this was replaced in the 19th century by Dehlavi-based Hindustani. Hindustani was strongly influenced by Persian, with this and later Sanskrit influence leading to the emergence of Modern Standard Hindi and Modern Standard Urdu as registers of the Hindustani language.
This state of affairs continued until the division of the British Indian Empire in 1947, when Hindi became the official language in India and Urdu became official in Pakistan. Despite the different scripts, the fundamental grammar remains identical; the difference is more sociolinguistic than purely linguistic.
Today Hindustani is widely understood or spoken as a second or third language throughout South Asia and is one of the most widely known languages in the world in terms of number of speakers.
Outside the Indian subcontinent
Domari
Domari is an Indo-Aryan language spoken by older Dom people scattered across the Middle East. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan. [Matras, Y. (2012). ''A grammar of Domari''. Berlin: De Gruyter Mouton (Mouton Grammar Library).] Based on the systematicity of sound changes, linguists have concluded that the ethnonyms ''Domari'' and ''Romani'' derive from the Indo-Aryan word ''ḍom''.
Lomavren
Lomavren is a nearly extinct mixed language, spoken by the Lom people, that arose from language contact between a language related to Romani and Domari and the Armenian language.
Romani
The Romani language is usually included in the Western Indo-Aryan languages. Romani varieties, which are mainly spoken throughout Europe, are noted for their relatively conservative nature, maintaining the Middle Indo-Aryan present-tense person concord markers alongside consonantal endings for nominal case. Indeed, these features are no longer evident in most other modern Central Indo-Aryan languages. Moreover, Romani shares an innovative pattern of past-tense person marking, which corresponds to that of Dardic languages such as Kashmiri and Shina. This is believed to be a further indication that proto-Romani speakers were originally situated in central regions of the subcontinent, before migrating to northwestern regions. However, there are no known historical sources regarding the development of the Romani language specifically within India.
Research conducted by nineteenth-century scholars Pott (1845) and Miklosich (1882–1888) demonstrated that the Romani language is most aptly designated as a New Indo-Aryan language (NIA), as opposed to Middle Indo-Aryan (MIA), establishing that proto-Romani speakers could not have left India significantly earlier than AD 1000.
The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, coupled with its reduction to a two-way nominative-oblique case system. A secondary argument concerns the system of gender differentiation: Romani has only two genders (masculine and feminine), whereas Middle Indo-Aryan languages generally employed three genders (masculine, feminine and neuter), and some modern Indo-Aryan languages retain this aspect today.
It is suggested that loss of the neuter gender did not occur until the transition to NIA. During this process, most of the neuter nouns became masculine, while several became feminine. For example, the neuter ''aggi'' "fire" in Prakrit morphed into the feminine ''āg'' in Hindi, and ''jag'' in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have additionally been cited as indications that the forerunner of Romani remained on the Indian subcontinent until a later period, possibly as late as the tenth century.
Sindhic migrations
Kholosi, Jadgali, and Luwati represent offshoots of the Sindhic subfamily of Indo-Aryan that have established themselves in the Persian Gulf region, perhaps through sea-based migrations. These are of a later origin than the Rom and Dom migrations, which also represent a different part of Indo-Aryan.
Indentured labourer migrations
The British East India Company's use of indentured labourers led to the transplanting of Indo-Aryan languages around the world, producing locally influenced lects that diverged from the source language, such as Fiji Hindi and Caribbean Hindustani.
Phonology
Consonants
Stop positions
The normative system of New Indo-Aryan stops consists of five places of articulation: labial, dental, "retroflex", palatal, and velar, which is the same as that of Sanskrit. The "retroflex" position may involve retroflexion, or curling the tongue to make the contact with the underside of the tip, or merely retraction. The point of contact may be alveolar or postalveolar, and the distinctive quality may arise more from the shaping than from the position of the tongue. Palatal stops have affricated release and are traditionally included as involving a distinctive tongue position (blade in contact with hard palate). Widely transcribed as , claims to be a more accurate rendering.
Moving away from the normative system, some languages and dialects have alveolar affricates instead of palatal, though some among them retain in certain positions: before front vowels (esp. ), before , or when geminated. Alveolar as an ''additional'' point of articulation occurs in Marathi and Konkani, where dialect mixture and other factors upset the aforementioned complementation to produce minimal environments, in some West Pahari dialects through internal developments (, > ), and in Kashmiri. The addition of a retroflex affricate to this in some Dardic languages brings the number of stop positions to a maximum of seven (barring borrowed ), while a reduction to the inventory involves *ts > , which has happened in Assamese, Chittagonian, Sinhala (though there have been other sources of a secondary ), and Southern Mewari.
Further reductions in the number of stop articulations are found in Assamese and Romani, which have lost the characteristic dental/retroflex contrast, and in Chittagonian, which may lose its labial and velar articulations through spirantisation in many positions (> ).
/q x ɣ f/ are restricted to Perso-Arabic loanwords in most IA languages, but they occur natively in Khowar. According to Masica (1991), some dialects of Pashayi have a /θ/, which is unusual for IA languages. Domari, which is spoken in the Middle East and has had extensive contact with Middle Eastern languages, has /q ħ ʕ ʔ/ and emphatic consonants from loanwords.
Nasals
Sanskrit was noted as having five nasal-stop articulations corresponding to its oral stops, and among modern languages and dialects Dogri, Kacchi, Kalasha, Rudhari, Shina, Saurashtri, and Sindhi have been analysed as having this full complement of phonemic nasals , with the last two generally as the result of the loss of the stop from a homorganic nasal + stop cluster ( > and > ), though there are other sources as well.
In languages that lack phonemic nasals at some places of articulation, they can still occur allophonically from place assimilation in a nasal + stop cluster, e.g. Hindi > .
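Place assimilation in a nasal + stop cluster can be sketched as a toy rule, offered only as an illustration. In the Python below, the tables `PLACE_OF_STOP` and `NASAL_AT_PLACE` and the function `assimilate` are hypothetical names, the symbols are simplified IPA (dental diacritics omitted, "c" and "ɟ" standing in for the palatal affricates), and aspirated stops and language-specific details are ignored.

```python
# Assumed, simplified mapping from plain stops to the five places of
# articulation named in this article.
PLACE_OF_STOP = {
    "p": "labial", "b": "labial",
    "t": "dental", "d": "dental",
    "ʈ": "retroflex", "ɖ": "retroflex",
    "c": "palatal", "ɟ": "palatal",
    "k": "velar", "g": "velar",
}

# The nasal articulated at each of those places.
NASAL_AT_PLACE = {
    "labial": "m",
    "dental": "n",
    "retroflex": "ɳ",
    "palatal": "ɲ",
    "velar": "ŋ",
}


def assimilate(cluster: str) -> str:
    """Given a two-segment nasal + stop cluster, return it with the nasal
    replaced by the one homorganic to the following stop."""
    if len(cluster) != 2:
        raise ValueError("expected exactly one nasal followed by one stop")
    stop = cluster[1]
    return NASAL_AT_PLACE[PLACE_OF_STOP[stop]] + stop


if __name__ == "__main__":
    for c in ("nk", "nb", "mɖ", "nɟ"):
        print(c, "->", assimilate(c))
```

The rule deliberately overwrites whatever nasal is written first, which mirrors the allophonic (non-contrastive) status described in the text.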
Aspiration and breathy-voice
Most Indo-Aryan languages have contrastive aspiration (), and some retain historical breathy voice on voiced consonants (). Sometimes both phenomena are analysed as a single aspiration contrast. The places and manners of articulation which allow contrastive aspiration vary by language; e.g. Sindhi permits phonemic , but the phonemic status of this sound in Hindi is uncertain, and many "Dardic" languages lack aspirated retroflex sibilants despite having unaspirated equivalents.
In languages that have lost breathy-voice, the contrast has often been replaced with tone.
Regional developments
Some of these are mentioned in .
* Implosives: Languages in the Sindhic subfamily, as well as Saraiki, western Marwari dialects, and some dialects of Gujarati have developed implosive consonants from historical intervocalic geminates and word-initial stops. Sindhi has a full implosive series except for the dental implosive: . It has been claimed that Wadiyari Koli has the dental implosive too. Other languages have less complete implosive series, e.g. Kacchi has just .
* Prenasalized stops: Sinhala and Maldivian (Dhivehi) have a series of prenasalized stops covering all places except for palatal: .
* Palatalization: Kashmiri (natively) and some Romani dialects (from contact with Slavic languages) have contrastive palatalisation.
* Voiceless lateral: Gawarbati, some Pashai dialects, partly Bashkarik, and some Shina dialects have /ɬ/ from clusters of tr, kr, or sometimes pr; dr, gr, and br merged with /l/ in these languages.
* Lateral affricates: Bhadarwahi has an unusual series of lateral retroflex affricates derived from historical clusters.
Vowels
Vowel typologies are varied across Indo-Aryan due to diachronic mergers and (in some cases) splits, as well as different accounts by linguists for even the widely-spoken languages. Vowel systems per are listed below. Many languages also have phonemic nasal vowels.
Sylheti, although tonal, is still classified as an Indo-Aryan language. The vowels of Sylheti are listed below.
Charts
The following are consonant systems of major and representative New Indo-Aryan languages, mostly following , though here they are in IPA. Parentheses indicate those consonants found only in loanwords; square brackets indicate those with "very low functional load". The arrangement is roughly geographical.
Sociolinguistics
Register
In many Indo-Aryan languages, the literary register is often more archaic and utilises a different lexicon (Sanskrit or Perso-Arabic) than the spoken vernacular. One example is Bengali's high literary form, Sādhū bhāśā, as opposed to the more modern Calita bhāśā (Cholito-bhasha). This distinction approaches diglossia.
Language and dialect
In the context of South Asia, the choice between the appellations "language" and "dialect" is a difficult one, and any distinction made using these terms is obscured by their ambiguity. In one general colloquial sense, a language is a "developed" dialect: one that is standardised, has a written tradition and enjoys social prestige. As there are degrees of development, the boundary between a language and a dialect thus defined is not clear-cut, and there is a large middle ground where assignment is contestable.
There is a second meaning of these terms, in which the distinction is drawn on the basis of linguistic similarity. Though seemingly a "proper" linguistics sense of the terms, it is still problematic: methods that have been proposed for quantifying difference (for example, based on mutual intelligibility) have not been seriously applied in practice; and any relationship established in this framework is relative.
See also
* Indo-Aryans
* Iranic languages
* Indo-Aryan migration
* Proto-Vedic Continuity
* The family of Brahmic scripts
* Linguistic history of India
* Indo-Aryan loanwords in Tamil
* Languages of Bangladesh
* Languages of India
* Languages of Maldives
* Languages of Nepal
* Languages of Pakistan
* Languages of Sri Lanka
* Languages of South Asia
Notes
References
Further reading
* John Beames, ''A comparative grammar of the modern Aryan languages of India: to wit, Hindi, Panjabi, Sindhi, Gujarati, Marathi, Oriya, and Bangali''. Londinii: Trübner, 1872–1879. 3 vols.
* Morgenstierne, Georg. "Early Iranic Influence upon Indo-Aryan." Acta Iranica, I. série, Commemoration Cyrus. Vol. I. Hommage universel (1974): 271–279.
* Madhav Deshpande (1979). ''Sociolinguistic attitudes in India: An historical reconstruction''. Ann Arbor: Karoma Publishers.
* Chakrabarti, Byomkes (1994). ''A comparative study of Santali and Bengali''. Calcutta: K.P. Bagchi & Co.
* Erdosy, George. (1995). ''The Indo-Aryans of ancient South Asia: Language, material culture and ethnicity''. Berlin: Walter de Gruyter.
* Ernst Kausen, 2006. ''Die Klassifikation der indogermanischen Sprachen'' (Microsoft Word, 133 KB)
* Kobayashi, Masato; & George Cardona (2004). ''Historical phonology of old Indo-Aryan consonants''. Tokyo: Research Institute for Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies.
* Misra, Satya Swarup. (1980). ''Fresh light on Indo-European classification and chronology''. Varanasi: Ashutosh Prakashan Sansthan.
* Misra, Satya Swarup. (1991–1993). ''The Old-Indo-Aryan, a historical & comparative grammar'' (Vols. 1–2). Varanasi: Ashutosh Prakashan Sansthan.
* Sen, Sukumar. (1995). ''Syntactic studies of Indo-Aryan languages''. Tokyo: Institute for the Study of Languages and Foreign Cultures of Asia and Africa, Tokyo University of Foreign Studies.
* Vacek, Jaroslav. (1976). ''The sibilants in Old Indo-Aryan: A contribution to the history of a linguistic area''. Prague: Charles University.
External links
* The Indo-Aryan languages, 25 October 2009
* The Indo-Aryan languages, Colin P. Masica
* Survey of the syntax of the modern Indo-Aryan languages (Rajesh Bhatt), 7 February 2003.