HOME

TheInfoList



OR:

The Indo-Aryan languages (or sometimes Indic languages) are a branch of the
Indo-Iranian languages The Indo-Iranian languages (also Indo-Iranic languages or Aryan languages) constitute the largest and southeasternmost extant branch of the Indo-European language family (with over 400 languages), predominantly spoken in the geographical subr ...
in the
Indo-European language family The Indo-European languages are a language family native to the overwhelming majority of Europe, the Iranian plateau, and the northern Indian subcontinent. Some European languages of this family, English, French, Portuguese, Russian, ...
. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in
India India, officially the Republic of India ( Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the ...
,
Pakistan Pakistan ( ur, ), officially the Islamic Republic of Pakistan ( ur, , label=none), is a country in South Asia. It is the world's List of countries and dependencies by population, fifth-most populous country, with a population of almost 24 ...
,
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world, with a population exceeding 165 million pe ...
,
Nepal Nepal (; ne, नेपाल ), formerly the Federal Democratic Republic of Nepal ( ne, सङ्घीय लोकतान्त्रिक गणतन्त्र नेपाल ), is a landlocked country in South Asia. It is ma ...
,
Sri Lanka Sri Lanka (, ; si, ශ්‍රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
, and Maldives. Moreover, apart from the
Indian subcontinent The Indian subcontinent is a physiographical region in Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian Ocean from the Himalayas. Geopolitically, it includes the countries of Bangladesh, Bhutan, India ...
, large immigrant and expatriate Indo-Aryan–speaking communities live in Northwestern Europe,
Western Asia Western Asia, West Asia, or Southwest Asia, is the westernmost subregion of the larger geographical region of Asia, as defined by some academics, UN bodies and other institutions. It is almost entirely a part of the Middle East, and includes A ...
, North America, the Caribbean, Southeast Africa,
Polynesia Polynesia () "many" and νῆσος () "island"), to, Polinisia; mi, Porinihia; haw, Polenekia; fj, Polinisia; sm, Polenisia; rar, Porinetia; ty, Pōrīnetia; tvl, Polenisia; tkl, Polenihia (, ) is a subregion of Oceania, made up of ...
and Australia, along with several million speakers of
Romani language Romani (; also Romany, Romanes , Roma; rom, rromani ćhib, links=no) is an Indo-Aryan macrolanguage of the Romani communities. According to '' Ethnologue'', seven varieties of Romani are divergent enough to be considered languages of their ...
s primarily concentrated in Southeastern Europe. There are over 200 known Indo-Aryan languages. Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally prese ...
, through Middle Indo-Aryan languages (or
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s). The largest such languages in terms of first-speakers are Hindi–Urdu (),Standard Hindi first language: 260.3 million (2001), as second language: 120 million (1999). Urdu L1: 68.9 million (2001–2014), L2: 94 million (1999): ''Ethnologue'' 19. Bengali (242 million),
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
(about 120 million),
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
(112 million), Gujarati (60 million), Rajasthani (58 million), Bhojpuri (51 million), Odia (35 million), Maithili (about 34 million), Sindhi (25 million),
Nepali Nepali or Nepalese may refer to : Concerning Nepal * Anything of, from, or related to Nepal * Nepali people, citizens of Nepal * Nepali language, an Indo-Aryan language found in Nepal, the current official national language and a language spoken ...
(16 million),
Assamese Assamese may refer to: * Assamese people, a socio-ethnolinguistic identity of north-eastern India * People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam * Assamese language, one of the easternmost Indo-Aryan language ...
(15 million),
Chhattisgarhi Chhattisgarhi ( / ) is an Indo-Aryan language, spoken by approximately 16 million people from Chhattisgarh & other states. It is mostly spoken in the Indian states of Chhattisgarh, Odisha, Madhya Pradesh & Maharashtra. It is closely related ...
(18 million), Sinhala (17 million), and
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
(). A 2005 estimate placed the total number of native speakers of the Indo-Aryan languages at nearly 900 million people.


Classification


Theories

The Indo-Aryan family as a whole is thought to represent a
dialect continuum A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated varie ...
, where languages are often transitional towards neighboring varieties. Because of this, the division into languages vs. dialects is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a
tree model In historical linguistics, the tree model (also Stammbaum, genetic, or cladistic model) is a model of the evolution of languages analogous to the concept of a family tree, particularly a phylogenetic tree in the biological evolution of specie ...
is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the wave model.


Subgroups

The following table of proposals is expanded from . Note that the table only lists some modern Indo-Aryan languages.
Anton I. Kogan Anton may refer to: People *Anton (given name), including a list of people with the given name *Anton (surname) Places *Anton Municipality, Bulgaria **Anton, Sofia Province, a village *Antón District, Panama **Antón, a town and capital of th ...
, in 2016, conducted a lexicostatistical study of the New Indo-Aryan languages based on a 100-word Swadesh list, using techniques developed by the glottochronologist and comparative linguist Sergei Starostin. That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies showing low lexical similarity to Indo-Aryan (43.5%) and negligible difference with similarity to Iranian (39.3%). He also calculated Sinhala–Dhivehi to be the most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features.


Inner–Outer hypothesis

The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan (generally including Eastern and Southern Indo-Aryan, and sometimes Northwestern Indo-Aryan, Dardic and Pahari) representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with the newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by
Rudolf Hoernlé Augustus Frederic Rudolf Hoernlé CIE (1841 – 1918), also referred to as Rudolf Hoernle or A. F. Rudolf Hoernle, was a German Indologist and philologist. He is famous for his studies on the Bower Manuscript (1891), Weber Manuscript (1893) a ...
in 1880 and refinement by George Grierson it has undergone numerous revisions and a great deal of debate, with the most recent iteration by Franklin Southworth and Claus Peter Zoller based on robust linguistic evidence (particularly an Outer past tense in ''-l-''). Some of the theory's skeptics include Suniti Kumar Chatterji and
Colin P. Masica Colin Paul Masica (June 13, 1931 – February 23, 2022) was an American linguist who was professor emeritus in thDepartment of South Asian Languages and Civilizationsand the Department of Linguistics at the University of Chicago. Besides being a s ...
.


Groups

The below classification follows , and .


Dardic

The Dardic languages (also Dardu or Pisaca) are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by George Abraham Grierson in his Linguistic Survey of India but he did not consider it to be a subfamily of Indo-Aryan. The Dardic group as a genetic grouping (rather than areal) has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among ardic languagesmay result from subsequent convergence". The Dardic languages are thought to be transitional with Punjabi and Pahari (e.g. Zoller describes Kashmiri as "an interlink between Dardic and West Pahāṛī"), as well as non-Indo-Aryan Nuristani; and are renowned for their relatively conservative features in the context of Proto-Indo-Aryan. * Kashmiri:
Kashmiri Kashmiri may refer to: * People or things related to the Kashmir Valley or the broader region of Kashmir * Kashmiris, an ethnic group native to the Kashmir Valley * Kashmiri language, their language People with the name * Kashmiri Saikia Barua ...
, Kishtwari, Poguli; * Shina: Brokskad, Kundal Shahi, Shina,
Ushojo Ushoji (natively known as Ushojo) is an Indo-Aryan language spoken in Kohistan and Swat districts of the Khyber-Pakhtunkhwa province of Pakistan. Status Ushoji may be incredibly endangered due to the dominance of the Pashto language in the r ...
, Kalkoti, Palula, Savi; * Chitrali: Kalasha, Khowar; * Kohistani: Bateri, Chilisso, Gowro, Indus Kohistani, Kalami, Tirahi, Torwali,
Wotapuri-Katarqalai Wotapuri-Katarqalai is an Indo-Aryan language documented to have been spoken in Afghanistan Afghanistan, officially the Islamic Emirate of Afghanistan,; prs, امارت اسلامی افغانستان is a landlocked country located at ...
; * Pashayi * Kunar: Dameli,
Gawar-Bati Gawar-Bati or Narsati is an Indo-Aryan language spoken in the Chitral region of northern Pakistan, and across the border in Afghanistan. It is also known as Aranduyiwar in Chitral because it is spoken in Arandu, which is the last village in ...
,
Nangalami Nangalami, or Grangali, is an Indo-Aryan language spoken in Afghanistan. Zemiaki was formerly considered a Nangalami dialect, but has been reassessed and placed in the Nuristani language group being close to Waigali Waigali (') is a languag ...
,
Shumashti Shumashti – also known as Shumasht – is an Indo-Aryan languages, Indo-Aryan language spoken in eastern Afghanistan. It is spoken in parts of Kunar Province: on the western side of the Kunar Valley between Jalalabad and the Pech Valley. The nu ...
.


Northern Zone

The Northern Indo-Aryan languages, also known as the Pahari ('hill') languages, are spoken throughout the Himalayan regions of the subcontinent. * Eastern Pahari:
Nepali Nepali or Nepalese may refer to : Concerning Nepal * Anything of, from, or related to Nepal * Nepali people, citizens of Nepal * Nepali language, an Indo-Aryan language found in Nepal, the current official national language and a language spoken ...
,
Jumli Jumli or Jumli Khas is an Indo-Aryan language of Nepal closely related to Nepali. It is primarily spoken in the Karnali Province of Nepal. The language is occasionally referred to as a dialect of Nepali language however Government of Nepal consid ...
, Doteli; * Central Pahari: Garhwali, Kumaoni; * Western Pahari (Himachali): Dogri, Kangri, Bhadarwahi,
Churahi Churahi (Takri: ) is a Western Pahari language of Himachal Pradesh, India. It is spoken in the Chaurah and Saluni tehsils of Chamba district, and is considered vulnerable. Adages Script The native script of the language is Takri script. ...
, Bhateali, Bilaspuri, Chambeali, Gaddi, Pangwali, Mandeali, Mahasu Pahari, Jaunsari, Kullui, Pahari Kinnauri, Hinduri,
Sarazi Sarazi or Sirazi (also spelled Siraji) is an Indo-Aryan language of Jammu and Kashmir, India. It is native to the Saraz region, a hilly area taking up the northern half of Doda district and parts of neighbouring Ramban and Kishtwar districts. ...
, Sirmauri.


Northwestern Zone

Northwestern Indo-Aryan languages are spoken in the northwestern region of India and Eastern Pakistan.
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
is spoken predominantly in the Punjab region and is the official language of the northern Indian state of Punjab; in addition to being the most widely-spoken language in Pakistan. To the south, Sindhi and its variants are spoken; primarily in
Sindh Sindh (; ; ur, , ; historically romanized as Sind) is one of the four provinces of Pakistan. Located in the southeastern region of the country, Sindh is the third-largest province of Pakistan by land area and the second-largest province ...
. Northwestern languages are ultimately thought to be descended from Shauraseni Prakrit. *
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
** Eastern Punjabi:
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
, Doabi, Majhi, Malwai, Puadhi, Sansi; ** Western Punjabi (
Lahnda Lahnda () () also known as Lahndi or Western Punjabi, is a group of north-western Indo-Aryan language varieties spoken in parts of Pakistan and India. Its validity as a genetic grouping is not certain. Terms like ''Lahnda'' or ''Western Punja ...
): Saraiki, Hindko, Pahari-Pothwari, Inku†; * Sindhi: Sindhi, Jadgali, Kutchi, Luwati, Memoni,
Khetrani Khetrānī, or Khetranki, is an Indo-Aryan language of north-eastern Balochistan. It is spoken by the majority of the Khetrans, a Baloch tribe that occupies a hilly tract in the Sulaiman Mountains comprising the whole of Barkhan District as well ...
,
Kholosi Kholosi is an Indo-Aryan language spoken in two villages in southern Iran that was first described in 2008. At its current status, the language is considered endangered. In 2008, it was only spoken in the neighboring villages of Kholus and Got ...
.


Western Zone

Western Indo-Aryan languages, are spoken in the central and western areas within India, such as
Madhya Pradesh Madhya Pradesh (, ; meaning 'central province') is a state in central India. Its capital city, capital is Bhopal, and the largest city is Indore, with Jabalpur, Ujjain, Gwalior, Sagar, Madhya Pradesh, Sagar, and Rewa, India, Rewa being the othe ...
and
Rajasthan Rajasthan (; lit. 'Land of Kings') is a state in northern India. It covers or 10.4 per cent of India's total geographical area. It is the largest Indian state by area and the seventh largest by population. It is on India's northwestern s ...
, in addition to contiguous regions in Pakistan. Gujarati is the official language of
Gujarat Gujarat (, ) is a state along the western coast of India. Its coastline of about is the longest in the country, most of which lies on the Kathiawar peninsula. Gujarat is the fifth-largest Indian state by area, covering some ; and the nin ...
, and is spoken by over 50 million people. In Europe, various Romani languages are spoken by the
Romani people The Romani (also spelled Romany or Rromani , ), colloquially known as the Roma, are an Indo-Aryan ethnic group, traditionally nomadic itinerants. They live in Europe and Anatolia, and have diaspora populations located worldwide, with sig ...
, an itinerant community who historically migrated from India. The Western Indo-Aryan languages are thought to have diverged from their northwestern counterparts, although they have a common antecedent in Shauraseni Prakrit. * Rajasthani: Standard Rajasthani, Bagri, Marwari, Mewati, Dhundari, Harauti, Mewari,
Shekhawati Shekhawati is a semi-arid historical region located in the northeast part of Rajasthan, India. The region was ruled by Shekhawat Rajputs. Shekhawati is located in North Rajasthan, comprising the districts of Jhunjhunu, parts of Sikar that lies ...
, Dhatki, Malvi, Nimadi,
Gujari Gojri (, ), also known as Gujari, Gujri, Gojari, or Gojri, is a variety of Rajasthani spoken by the Gurjars and other tribes of India, Pakistan and Afghanistan. In India, the language is mainly spoken in Jammu and Kashmir, Himachal Prad ...
, Goaria,
Loarki Marwari (मारवाड़ी, IAST: Mārwāṛī; also rendered as ''Marwadi'' or ''Marvadi'') is a Rajasthani language spoken in the Indian state of Rajasthan. Marwari is also found in the neighbouring states of Gujarat and Haryana, ...
, Bhoyari, Kanjari, Od; * Gujarati: Gujarati,
Jandavra Jandavra (Jhandoria) is a minor Indic language of Sindh, Pakistan, and Jodhpur Jodhpur (; ) is the second-largest city in the Indian state of Rajasthan and officially the second metropolitan city of the state. It was formerly the seat o ...
, Saurashtra, Aer, Vaghri, Parkari Koli,
Kachi Koli Kachi Koli is an Indo-Aryan language spoken in Pakistan and India. Part of the Gujarati subfamily, Kachi Koli is closely related to Parkari Koli and Wadiyara Koli Wadiyara Koli is an Indo-Aryan language of the Gujarati group. It is spok ...
, Wadiyara Koli; * Bhil: Kalto, Vasavi,
Wagdi Wagdi is a Bhil language of India spoken mainly in Dungarpur and Banswara districts of Southern Rajasthan. Wagdi has been characterized as a dialect of Bhili Bhili (Bhili: ), , is a Western Indo-Aryan language spoken in west-central India ...
, Gamit, Vaagri Booli; ** Northern Bhil: Bauria, Bhilori, Magari; ** Central Bhil: Bhili proper, Bhilali, Chodri, Dhodia, Dhanki, Dubli; ** Bareli: Palya Bareli, Pauri Bareli, Rathwi Bareli,
Pardhi Pardhi is a Hindu tribe in India. The tribe is found mostly in Maharashtra and parts of Madhya Pradesh however small numbers can be found in Gujarat and Andhra Pradesh. The word Pardhi is derived from the Marathi (state language) word ‘''Paradh ...
; * Khandeshi * Lambadi *
Domaaki Dawoodi (), also known as Domaakí (), Dumaki or Domaá, is an endangered Indo-Aryan language spoken by a few hundred people living in the Gilgit-Baltistan territory in northern Pakistan. It is historically related to the Central Indo-Aryan langu ...
*
Domari Domari is an endangered Indo-Aryan language, spoken by Dom people scattered across the Middle East and North Africa. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan, in Turkey, Iran, Iraq, Palest ...
*
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
: Carpathian Romani,
Balkan Romani Balkan Romani, Balkaniko Romanes, or Balkan Gypsy is a specific non-Vlax dialect of the Romani language, spoken by groups within the Balkans, which include countries such as Albania, Bosnia-Herzegovina, Bulgaria, Greece, Kosovo, North Macedonia, ...
, Vlax Romani; ** Northern Romani: Sinte Romani, Finnish Kalo, Baltic Romani.


Central Zone (Madhya ''or'' Hindi)

Within India, Hindi languages are spoken primarily in the
Hindi belt The Hindi Belt, also known as the Hindi Heartland, is a linguistic region encompassing parts of northern, central, eastern and western India where various Central Indo-Aryan languages subsumed under the term 'Hindi' (for example, by th ...
regions and Gangetic plains, including
Delhi Delhi, officially the National Capital Territory (NCT) of Delhi, is a city and a union territory of India containing New Delhi, the capital of India. Straddling the Yamuna river, primarily its western or right bank, Delhi shares borders wi ...
and the surrounding areas; where they are often transitional with neighbouring lects. Many of these languages, including Braj and Awadhi, have rich literary and poetic traditions.
Urdu Urdu (;"Urdu"
''
, a Persianized derivative of Khariboli, is the official language of
Pakistan Pakistan ( ur, ), officially the Islamic Republic of Pakistan ( ur, , label=none), is a country in South Asia. It is the world's List of countries and dependencies by population, fifth-most populous country, with a population of almost 24 ...
and also has strong historical connections to
India India, officially the Republic of India ( Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the ...
, where it also has been designated with official status.
Hindi Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of North India, northern, Central India, centr ...
, a standardized and Sanskritized register of Khariboli, is the official language of the
Government of India The Government of India ( ISO: ; often abbreviated as GoI), known as the Union Government or Central Government but often simply as the Centre, is the national government of the Republic of India, a federal democracy located in South Asia, ...
. Together with Urdu, it is the third most-spoken language in the world. * Western Hindi:
Hindustani Hindustani may refer to: * something of, from, or related to Hindustan (another name of India) * Hindustani language, an Indo-Aryan language, whose two official norms are Hindi and Urdu * Fiji Hindi, a variety of Eastern Hindi spoken in Fiji, and ...
(including Standard Hindi and Standard Urdu), Khariboli, Braj, Haryanvi,
Bundeli Bundeli ( Devanagari: बुन्देली or बुंदेली; or Bundelkhandi) is an Indo-Aryan language spoken in the Bundelkhand region of central India. It belongs to the Central Indo-Ayran languages and is part of the Western ...
, Kannauji,
Parya Parya may refer to: * Parya language, spoken in Central Asia * several mountains in Peru: ** Parya (Ayacucho) ** Paria (Peru) ** Puka Parya ** Parya Chaka See also * '' Parrya'', a genus of plants * Paria (disambiguation) Paria may refer to th ...
; * Eastern Hindi: Bagheli,
Chhattisgarhi Chhattisgarhi ( / ) is an Indo-Aryan language, spoken by approximately 16 million people from Chhattisgarh & other states. It is mostly spoken in the Indian states of Chhattisgarh, Odisha, Madhya Pradesh & Maharashtra. It is closely related ...
,
Surgujia Surgujia is an Indo-Aryan language spoken in Chhattisgarh. It belongs to the Eastern Hindi subgroup. Speakers Surgujia is primarily spoken in Surguja, Jashpur, and Koriya districts of Chhattisgarh; and to a lesser extent in Raigarh and Korb ...
; ** Awadhi: Fiji Hindi, Caribbean Hindustani


Eastern Zone

The Eastern Indo-Aryan languages, also known as Magadhan languages, are spoken throughout the eastern subcontinent, including
Odisha Odisha (English: , ), formerly Orissa ( the official name until 2011), is an Indian state located in Eastern India. It is the 8th largest state by area, and the 11th largest by population. The state has the third largest population of Sc ...
and
Bihar Bihar (; ) is a state in eastern India. It is the 2nd largest state by population in 2019, 12th largest by area of , and 14th largest by GDP in 2021. Bihar borders Uttar Pradesh to its west, Nepal to the north, the northern part of West ...
, alongside other regions surrounding the northwestern Himalayan corridor. Bengali is the seventh most-spoken language in the world, and has a strong literary tradition; the
national anthem A national anthem is a patriotic musical composition symbolizing and evoking eulogies of the history and traditions of a country or nation. The majority of national anthems are marches or hymns in style. American, Central Asian, and Europe ...
s of
India India, officially the Republic of India ( Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the ...
and
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the List of countries and dependencies by population, eighth-most populous country in the world, with a population exceeding 165 million pe ...
are written in Bengali.
Assamese Assamese may refer to: * Assamese people, a socio-ethnolinguistic identity of north-eastern India * People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam * Assamese language, one of the easternmost Indo-Aryan language ...
and Odia are the official languages of
Assam Assam (; ) is a state in northeastern India, south of the eastern Himalayas along the Brahmaputra and Barak River valleys. Assam covers an area of . The state is bordered by Bhutan and Arunachal Pradesh to the north; Nagaland and Manipur ...
and
Odisha Odisha (English: , ), formerly Orissa ( the official name until 2011), is an Indian state located in Eastern India. It is the 8th largest state by area, and the 11th largest by population. The state has the third largest population of Sc ...
, respectively. The Eastern Indo-Aryan languages descend from Magadhan Apabhraṃśa and ultimately from Magadhi Prakrit. * Bihari: ** Bhojpuri, Caribbean Hindustani, Fiji Hindi; ** Magahi, Khortha; ** Maithili, Angika, Bajjika, Dehati; ** Sadanic: Nagpuri (Sadri), Kurmali (Panchpargania); ** Tharu,
Kochila Tharu Kochila Tharu, also called Septari or Saptariya Tharu, Madhya-Purbiya Tharu, and Mid-Eastern Tharu, is a diverse group of language varieties in the Tharu group of the Indo-Aryan languages. The several names of the varieties refer to the regions ...
, Buksa, Majhi, Musasa; **
Kumhali Kumhali, Kumali, or Kumbale, is an Indo-Aryan language spoken by some of the Kumal people of Nepal Nepal (; ne, नेपाल ), formerly the Federal Democratic Republic of Nepal ( ne, सङ्घीय लोकतान्त� ...
, Kuswaric: Danwar, Bote-Darai; * Halbic: Halbi, Kamar, Bhunjia, Nahari; * Odia: Baleswari, Kataki, Ganjami, Sundargadi, Sambalpuri, Desia; ** Bodo Parja,
Bhatri Bhatri is an Eastern Indo-Aryan language spoken by the Bhottada tribe in Chhattisgarh and Odisha, India. The language is spoken predominantly in eastern Bastar district and in Koraput and Nabarangpur Nabarangpur is a town, villa city and a ...
, Reli, Kupia; * Bengali–Assamese: Bishnupriya Manipuri, Hajong, Chittagonian,
Chakma Chakma may refer to: *Chakma people, a Tibeto-Burman people of Bangladesh and Northeast India *Chakma language, the Indo-Aryan language spoken by them **Chakma script ***Chakma (Unicode block) Chakma is a Unicode block containing characters for ...
,
Noakhailla Noakhailla (), also known by the exonym Noakhalian, is a dialect of Bengali, spoken by an estimated 7 million people, primarily in the Greater Noakhali region of Bangladesh as well as southern parts of Tripura in India. Outside of these regions, t ...
, Tanchangya, Rohingya, Sylheti,; ** Bengali-Gauda: Bengali,
Bangali Bengalis (singular Bengali bn, বাঙ্গালী/বাঙালি ), also rendered as Bangalee or the Bengali people, are an Indo-Aryan ethnolinguistic group originating from and culturally affiliated with the Bengal region of S ...
, Rarhi,
Varendri North Central Bengali or Varendrī () is a dialect of the Bengali language, spoken in the Varendra region (primarily consisting of the Rajshahi Division in Bangladesh and the Malda division in India). Varendri dialect was classified by many re ...
, Sundarbani, Manbhumi, Dhakaiya Kutti, Dobhashi; ** Kamarupic:
Assamese Assamese may refer to: * Assamese people, a socio-ethnolinguistic identity of north-eastern India * People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam * Assamese language, one of the easternmost Indo-Aryan language ...
, Kamrupi, Goalpariya, Rangpuri,
Surjapuri Surjapuri is an Eastern Indo-Aryan language spoken in Eastern India including North Bengal, West Bengal, and Banganchal of Eastern Bihar, as well as in Nepal. Among speakers in some regions, it is known as 'Deshi Bhasa'. It possesses similaritie ...
, Rajbanshi;


Southern Zone

Marathi-Konkani languages are ultimately descended from Maharashtri Prakrit, whereas Insular Indo-Aryan languages are descended from
Elu Prakrit Eḷu, also Hela or Helu, is a hypothesized language Middle Indo-Aryan language or Prakrit of the 3rd century BCE. It is ancestral to the Sinhalese and Dhivehi languages. R. C. Childers, in the ''Journal of the Royal Asiatic Society'', states ...
and possess several characteristics that markedly distinguish them from most of their mainland Indo-Aryan counterparts. * Marathi-Konkani ** Marathic:
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
, Varhadi, Andh,
Berar-Deccan Marathi Berar-Deccan Marathi, is a possible language of the Marathi–Konkani languages, Marathi–Konkani group, or perhaps just a regional Marathi dialects, dialect of Marathi. ''Glottolog'' reports that it is closely related to Varhadi-Nagpuri lang ...
,
Phudagi The Phudagi language, also known as Vadvali, is a language or dialect of the Marathi–Konkani group. This language is spoken by Panchkalshi and Chaukalshi communities residing in Palghar, Vasai Vasai (Konkani and Marathi pronunciation: ...
,
Katkari The Katkari also called Kathodi, are an Indian tribe from Maharashtra. They have been categorised as a Scheduled tribe. They are bilingual, speaking the Katkari language, a dialect of the Marathi-Konkani languages, with each other; they speak Ma ...
, Varli, Kadodi; ** Konkanic: Konkani,
Canarese Konkani Canarese Konkani are a set of dialects spoken by minority Konkani people of the Canara sub-region of Karnataka, and also in Kassergode of Kerala that was part of South Canara.The Constitution Act 1992 (71st Amendment) Kanarese script is the p ...
, Maharashtrian Konkani.


Insular Indic

Insular Indic languages (of
Sri Lanka Sri Lanka (, ; si, ශ්‍රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
and Maldives) started developing independently and diverging from the continental Indo-Aryan languages from around 5th century BCE. * Insular Indo-Aryan ** Sinhala ** Maldivian: Dhivehi, Mahl


Unclassified

The following languages are otherwise unclassified within Indo-Aryan: * Chinali–Lahul Lohar: Chinali, Lahul Lohar. * Badeshi


History


Proto-Indo-Aryan

Proto-Indo-Aryan (or sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the pre-Vedic Indo-Aryans. Proto-Indo-Aryan is meant to be the predecessor of
Old Indo-Aryan The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in India, ...
(1500–300 BCE), which is directly attested as Vedic and
Mitanni-Aryan Mitanni (; Hittite cuneiform ; ''Mittani'' '), c. 1550–1260 BC, earlier called Ḫabigalbat in old Babylonian texts, c. 1600 BC; Hanigalbat or Hani-Rabbat (''Hanikalbat'', ''Khanigalbat'', cuneiform ') in Assyrian records, or ''Naharin'' in ...
. Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of conservative features lost in Vedic.


Mitanni-Aryan hypothesis

Some theonyms, proper names, and other terminology of the Late
Bronze Age The Bronze Age is a historic period, lasting approximately from 3300 BC to 1200 BC, characterized by the use of bronze, the presence of writing in some areas, and other early features of urban civilization. The Bronze Age is the second pri ...
Mitanni civilization of Upper Mesopotamia exhibit an Indo-Aryan superstrate. While what few written records left by the Mittani are either in Hurrian (which appears to have been the predominant language of their kingdom) or
Akkadian Akkadian or Accadian may refer to: * Akkadians, inhabitants of the Akkadian Empire * Akkadian language, an extinct Eastern Semitic language * Akkadian literature, literature in this language * Akkadian cuneiform, early writing system * Akkadian myt ...
(the main diplomatic language of the Late Bronze Age Near East), these apparently Indo-Aryan names suggest that an Indo-Aryan elite imposed itself over the
Hurrians The Hurrians (; cuneiform: ; transliteration: ''Ḫu-ur-ri''; also called Hari, Khurrites, Hourri, Churri, Hurri or Hurriter) were a people of the Bronze Age Near East. They spoke a Hurrian language and lived in Anatolia, Syria and Northern ...
in the course of the Indo-Aryan expansion. If these traces are Indo-Aryan, they would be the earliest known direct evidence of Indo-Aryan, and would increase the precision in dating the split between the Indo-Aryan and Iranian languages (as the texts in which the apparent Indicisms occur can be dated with some accuracy). In a treaty between the
Hittites The Hittites () were an Anatolian people who played an important role in establishing first a kingdom in Kussara (before 1750 BC), then the Kanesh or Nesha kingdom (c. 1750–1650 BC), and next an empire centered on Hattusa in north-cent ...
and the Mitanni, the deities Mitra, Varuna, Indra, and the Ashvins ( Nasatya) are invoked. Kikkuli's horse training text includes technical terms such as ''aika'' (cf. Sanskrit ''eka'', "one"), ''tera'' (''tri'', "three"), ''panza'' (''pancha'', "five"), ''satta'' (''sapta'', seven), ''na'' (''nava'', "nine"), ''vartana'' (''vartana'', "turn", round in the horse race). The numeral ''aika'' "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian (which has ''aiva''). Another text has ''babru'' (''babhru'', "brown"), ''parita'' (''palita'', "grey"), and (''pingala'', "red"). Their chief festival was the celebration of the
solstice A solstice is an event that occurs when the Sun appears to reach its most northerly or southerly excursion relative to the celestial equator on the celestial sphere. Two solstices occur annually, around June 21 and December 21. In many count ...
(''vishuva'') which was common in most cultures in the ancient world. The Mitanni warriors were called ''marya'', the term for "warrior" in
Sanskrit Sanskrit (; attributively , ; nominalization, nominally , , ) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had Trans-cul ...
as well; note ''mišta-nnu'' (= ''miẓḍha'', ≈ Sanskrit ''mīḍha'') "payment (for catching a fugitive)" (M. Mayrhofer, ''Etymologisches Wörterbuch des Altindoarischen'', Heidelberg, 1986–2000; Vol. II:358). Sanskritic interpretations of Mitanni royal names render Artashumara (''artaššumara'') as ''Ṛtasmara'' "who thinks of Ṛta" (Mayrhofer II 780), Biridashva (''biridašṷa, biriiašṷ''a) as ''Prītāśva'' "whose horse is dear" (Mayrhofer II 182), Priyamazda (''priiamazda'') as ''Priyamedha'' "whose wisdom is dear" (Mayrhofer II 189, II378), Citrarata as ''Citraratha'' "whose chariot is shining" (Mayrhofer I 553), Indaruda/Endaruta as ''Indrota'' "helped by Indra" (Mayrhofer I 134), Shativaza (''šattiṷaza'') as ''Sātivāja'' "winning the race price" (Mayrhofer II 540, 696), Šubandhu as ''Subandhu'' "having good relatives" (a name in Palestine, Mayrhofer II 209, 735), Tushratta (''tṷišeratta, tušratta'', etc.) as *tṷaiašaratha, Vedic Tvastar "whose chariot is vehement" (Mayrhofer, Etym. Wb., I 686, I 736).


Indian subcontinent

Dates indicate only a rough time frame. * Proto-Indo-Aryan (before 1500 BCE, reconstructed) * Old Indo-Aryan (ca. 1500–300 BCE) ** early Old Indo-Aryan: includes
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally prese ...
(ca. 1500 to 500 BCE) ** late Old Indo-Aryan: Epic Sanskrit, Classical Sanskrit (ca. 200 CE to 1300 CE) **
Mitanni Indo-Aryan Mitanni (; Hittite cuneiform ; ''Mittani'' '), c. 1550–1260 BC, earlier called Ḫabigalbat in old Babylonian texts, c. 1600 BC; Hanigalbat or Hani-Rabbat (''Hanikalbat'', ''Khanigalbat'', cuneiform ') in Assyrian records, or ''Naharin'' in ...
(ca. 1400 BCE) *
Middle Indo-Aryan The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OIA; ...
or
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s (ca. 300 BCE to 1500 CE) ** early Buddhist texts (ca. 6th or 5th century BCE) ** early Middle Indo-Aryan: e.g. Ashokan Prakrits,
Pali Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist '' Pāli Canon'' or '' Tipiṭaka'' as well as the sacred language of '' Theravāda'' Bud ...
,
Gandhari Gandhari may refer to: * Gandhari (Mahabharata), a character in the Indian epic ''Mahabharata'' * Gandhari khilla, a hill fort near Bokkalagutta, Telangana, India * Gandhari language, north-western prakrit spoken in Gāndhāra **Kharosthi, or Gan ...
, (ca. 300 BCE to 200 BCE) ** middle Middle Indo-Aryan: e.g. Dramatic Prakrits, Elu (ca. 200 BCE to 700 CE) ** late Middle Indo-Aryan: e.g. Abahattha (ca. 700 CE to 1500 CE) * Early Modern Indo-Aryan (Late Medieval India): e.g. early Dakhini and emergence of the Dehlavi dialect


Old Indo-Aryan

The earliest evidence of the group is from
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally prese ...
, that is used in the ancient preserved texts of the
Indian subcontinent The Indian subcontinent is a physiographical region in Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian Ocean from the Himalayas. Geopolitically, it includes the countries of Bangladesh, Bhutan, India ...
, the foundational canon of the Hindu synthesis known as the
Veda upright=1.2, The Vedas are ancient Sanskrit texts of Hinduism. Above: A page from the '' Atharvaveda''. The Vedas (, , ) are a large body of religious texts originating in ancient India. Composed in Vedic Sanskrit, the texts constitute th ...
s. The Indo-Aryan superstrate in Mitanni is of similar age to the language of the
Rigveda The ''Rigveda'' or ''Rig Veda'' ( ', from ' "praise" and ' "knowledge") is an ancient Indian collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canonical Hindu texts ('' śruti'') known as the Vedas. Only one ...
, but the only evidence of it is a few proper names and specialized loanwords. While Old Indo-Aryan is the earliest stage of the Indo-Aryan branch, from which all known languages of the later stages Middle and New Indo-Aryan are derived, some documented Middle Indo-Aryan variants cannot fully be derived from the documented form of Old Indo-Aryan (on which Vedic and Classical Sanskrit are based), but betray features that must go back to other undocumented variants/dialects of Old Indo-Aryan. From Vedic Sanskrit, "
Sanskrit Sanskrit (; attributively , ; nominalization, nominally , , ) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had Trans-cul ...
" (literally "put together", "perfected" or "elaborated") developed as the prestige language of culture, science and religion, as well as the court, theatre, etc. Sanskrit of the later Vedic texts is comparable to Classical Sanskrit, but is largely mutually unintelligible with Vedic Sanskrit.


Middle Indo-Aryan (Prakrits)

Outside the learned sphere of Sanskrit, vernacular dialects (
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s) continued to evolve. The oldest attested Prakrits are the
Buddhist Buddhism ( , ), also known as Buddha Dharma and Dharmavinaya (), is an Indian religion or philosophical tradition based on teachings attributed to the Buddha. It originated in northern India as a -movement in the 5th century BCE, and ...
and Jain canonical languages
Pali Pali () is a Middle Indo-Aryan liturgical language native to the Indian subcontinent. It is widely studied because it is the language of the Buddhist '' Pāli Canon'' or '' Tipiṭaka'' as well as the sacred language of '' Theravāda'' Bud ...
and Ardhamagadhi Prakrit, respectively. Inscriptions in
Ashokan Prakrit Ashokan Pali (or Aśokan Dhammalipi) is the Middle Indo-Aryan dialect continuum used in the Edicts of Ashoka, attributed to Emperor Ashoka of the Mauryan Empire who reigned to . The Edicts are inscriptions on monumental pillars and rocks through ...
were also part of this early Middle Indo-Aryan stage. By medieval times, the Prakrits had diversified into various Middle Indo-Aryan languages. '' Apabhraṃśa'' is the conventional cover term for transitional dialects connecting late Middle Indo-Aryan with early Modern Indo-Aryan, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the ''Śravakacāra'' of Devasena (dated to the 930s) is now considered to be the first Hindi book. The next major milestone occurred with the Muslim conquests in the Indian subcontinent in the 13th–16th centuries. Under the flourishing
Turco-Mongol The Turco-Mongol or Turko-Mongol tradition was an ethnocultural synthesis that arose in Asia during the 14th century, among the ruling elites of the Golden Horde and the Chagatai Khanate. The ruling Mongolian nobility, Mongol elites of these Kh ...
Mughal Empire The Mughal Empire was an early-modern empire that controlled much of South Asia between the 16th and 19th centuries. Quote: "Although the first two Timurid emperors and many of their noblemen were recent migrants to the subcontinent, the ...
, Persian became very influential as the language of prestige of the Islamic courts due to adoption of the foreign language by the Mughal emperors. The two largest languages that formed from Apabhraṃśa were Bengali and
Hindustani Hindustani may refer to: * something of, from, or related to Hindustan (another name of India) * Hindustani language, an Indo-Aryan language, whose two official norms are Hindi and Urdu * Fiji Hindi, a variety of Eastern Hindi spoken in Fiji, and ...
; others include
Assamese Assamese may refer to: * Assamese people, a socio-ethnolinguistic identity of north-eastern India * People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam * Assamese language, one of the easternmost Indo-Aryan language ...
, Sindhi, Gujarati, Odia,
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
, and
Punjabi Punjabi, or Panjabi, most often refers to: * Something of, from, or related to Punjab, a region in India and Pakistan * Punjabi language * Punjabi people * Punjabi dialects and languages Punjabi may also refer to: * Punjabi (horse), a British Th ...
.


New Indo-Aryan


= Medieval Hindustani

= In the Central Zone Hindi-speaking areas, for a long time the prestige dialect was Braj Bhasha, but this was replaced in the 19th century by Dehlavi-based
Hindustani Hindustani may refer to: * something of, from, or related to Hindustan (another name of India) * Hindustani language, an Indo-Aryan language, whose two official norms are Hindi and Urdu * Fiji Hindi, a variety of Eastern Hindi spoken in Fiji, and ...
. Hindustani was strongly influenced by Persian, with these and later Sanskrit influence leading to the emergence of Modern Standard Hindi and Modern Standard
Urdu Urdu (;"Urdu"
''
as registers of the Hindustani language. This state of affairs continued until the division of the British Indian Empire in 1947, when Hindi became the official language in India and
Urdu Urdu (;"Urdu"
''
became official in Pakistan. Despite the different script the fundamental grammar remains identical, the difference is more
sociolinguistic Sociolinguistics is the descriptive study of the effect of any or all aspects of society, including cultural Norm (sociology), norms, expectations, and context (language use), context, on the way language is used, and society's effect on languag ...
than purely linguistic. Today it is widely understood/spoken as a second or third language throughout South Asia and one of the most widely known languages in the world in terms of number of speakers.


Outside the Indian subcontinent


Domari

Domari Domari is an endangered Indo-Aryan language, spoken by Dom people scattered across the Middle East and North Africa. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan, in Turkey, Iran, Iraq, Palest ...
is an Indo-Aryan language spoken by older
Dom people The Dom (also called Domi; ar, دومي / ALA-LC: ', / , Ḍom / or , or sometimes also called Doms) are descendants of the Dom with origins in the Indian subcontinent which through ancient migrations are found scattered across Middle Ea ...
scattered across the Middle East. The language is reported to be spoken as far north as
Azerbaijan Azerbaijan (, ; az, Azərbaycan ), officially the Republic of Azerbaijan, , also sometimes officially called the Azerbaijan Republic is a transcontinental country located at the boundary of Eastern Europe and Western Asia. It is a part of th ...
and as far south as central Sudan.*Matras, Y. (2012). ''A grammar of Domari''. Berlin: De Gruyter Mouton (Mouton Grammar Library). Based on the systematicity of sound changes, linguists have concluded that the ethnonyms ''Domari'' and ''
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
'' derive from the Indo-Aryan word ''ḍom''.


Lomavren

Lomavren is a nearly extinct
mixed language A mixed language is a language that arises among a bilingual group combining aspects of two or more languages but not clearly deriving primarily from any single language. It differs from a creole or pidgin language in that, whereas creoles/pidgi ...
, spoken by the
Lom people The Lom people or tr, Lomlar, also known in tr, Poşa as (Bosha or Posha) by non-Loms ( hy, Բոշա, ka, ბოშა, tr; russian: Боша) or Romani (russian: армянские цыгане; hy, հայ գնչուներ) or Caucasian Ro ...
, that arose from language contact between a language related to
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
and
Domari Domari is an endangered Indo-Aryan language, spoken by Dom people scattered across the Middle East and North Africa. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan, in Turkey, Iran, Iraq, Palest ...
and the
Armenian language Armenian ( classical: , reformed: , , ) is an Indo-European language and an independent branch of that family of languages. It is the official language of Armenia. Historically spoken in the Armenian Highlands, today Armenian is widely spoken th ...
.


Romani

The Romani language is usually included in the Western Indo-Aryan languages. Romani varieties, which are mainly spoken throughout Europe, are noted for their relatively conservative nature; maintaining the Middle Indo-Aryan present-tense person concord markers, alongside consonantal endings for nominal case. Indeed, these features are no longer evident in most other modern Central Indo-Aryan languages. Moreover, Romani shares an innovative pattern of past-tense person, which corresponds to Dardic languages, such as Kashmiri and Shina. This is believed to be further indication that proto-Romani speakers were originally situated in central regions of the subcontinent, before migrating to northwestern regions. However, there are no known historical sources regarding the development of the Romani language specifically within India. Research conducted by nineteenth-century scholars Pott (1845) and Miklosich (1882–1888) demonstrated that the Romani language is most aptly designated as a New Indo-Aryan language (NIA), as opposed to Middle Indo-Aryan (MIA); establishing that proto-Romani speakers could not have left India significantly earlier than AD 1000. The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, coupled with its reduction to a two-way nominative-oblique case system. A secondary argument concerns the system of gender differentiation, due to the fact that Romani has only two genders (masculine and feminine). Middle Indo-Aryan languages (named MIA) generally employed three genders (masculine, feminine and neuter), and some modern Indo-Aryan languages retain this aspect today. It is suggested that loss of the neuter gender did not occur until the transition to NIA. During this process, most of the neuter nouns became masculine, while several became feminine. For example, the neuter ''aggi'' "fire" in Prakrit morphed into the feminine ''āg'' in Hindi, and ''jag'' in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have additionally been cited as indications that the forerunner of Romani remained on the Indian subcontinent until a later period, possibly as late as the tenth century.


Sindhic migrations

Kholosi Kholosi is an Indo-Aryan language spoken in two villages in southern Iran that was first described in 2008. At its current status, the language is considered endangered. In 2008, it was only spoken in the neighboring villages of Kholus and Got ...
, Jadgali, and Luwati represent offshoots of the Sindhic subfamily of Indo-Aryan that have established themselves in the
Persian gulf The Persian Gulf ( fa, خلیج فارس, translit=xalij-e fârs, lit=Gulf of Fars, ), sometimes called the ( ar, اَلْخَلِيْجُ ٱلْعَرَبِيُّ, Al-Khalīj al-ˁArabī), is a mediterranean sea in Western Asia. The bo ...
region, perhaps through sea-based migrations. These are of a later origin than the Rom and Dom migrations which represent a different part of Indo-Aryan as well.


Indentured labourer migrations

The use by the British East India Company of indentured labourers led to the transplanting of Indo-Aryan languages around the world, leading to locally influenced lects that diverged from the source language, such as Fiji Hindi and Caribbean Hindustani.


Phonology


Consonants


Stop positions

The normative system of New Indo-Aryan stops consists of five places of articulation: labial, dental, " retroflex", palatal, and velar, which is the same as that of Sanskrit. The "retroflex" position may involve retroflexion, or curling the tongue to make the contact with the underside of the tip, or merely retraction. The point of contact may be alveolar or
postalveolar Postalveolar or post-alveolar consonants are consonants articulated with the tongue near or touching the ''back'' of the alveolar ridge. Articulation is farther back in the mouth than the alveolar consonants, which are at the ridge itself, but no ...
, and the distinctive quality may arise more from the shaping than from the position of the tongue. Palatals stops have affricated release and are traditionally included as involving a distinctive tongue position (blade in contact with hard palate). Widely transcribed as , claims to be a more accurate rendering. Moving away from the normative system, some languages and dialects have alveolar affricates instead of palatal, though some among them retain in certain positions: before front vowels (esp. ), before , or when geminated. Alveolar as an ''additional'' point of articulation occurs in
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
and Konkani where dialect mixture and others factors upset the aforementioned complementation to produce minimal environments, in some West Pahari dialects through internal developments (, > ), and in
Kashmiri Kashmiri may refer to: * People or things related to the Kashmir Valley or the broader region of Kashmir * Kashmiris, an ethnic group native to the Kashmir Valley * Kashmiri language, their language People with the name * Kashmiri Saikia Barua ...
. The addition of a retroflex affricate to this in some Dardic languages maxes out the number of stop positions at seven (barring borrowed ), while a reduction to the inventory involves *ts > , which has happened in
Assamese Assamese may refer to: * Assamese people, a socio-ethnolinguistic identity of north-eastern India * People of Assam, multi-ethnic, multi-linguistic and multi-religious people of Assam * Assamese language, one of the easternmost Indo-Aryan language ...
, Chittagonian, Sinhala (though there have been other sources of a secondary ), and Southern Mewari. Further reductions in the number of stop articulations are in Assamese and
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
, which have lost the characteristic dental/retroflex contrast, and in Chittagonian, which may lose its labial and velar articulations through spirantisation in many positions (> ). /q x ɣ f/ are restricted to Perso-Arabic loanwords in most IA languages but they occur natively in Khowar. According to Masica (1991) some dialects of Pashayi have a /θ/ which is unusual for IA languages. Domari which is spoken in the Middle East and had high contact with Middle Eastern languages has /q ħ ʕ ʔ/ and emphatic consonants from loanwords.


Nasals

Sanskrit was noted as having five nasal-stop articulations corresponding to its oral stops, and among modern languages and dialects Dogri, Kacchi, Kalasha, Rudhari, Shina, Saurashtri, and Sindhi have been analysed as having this full complement of phonemic nasals , with the last two generally as the result of the loss of the stop from a homorganic nasal + stop cluster ( > and > ), though there are other sources as well. In languages that lack phonemic nasals at some places of articulation, they can still occur allophonically from place assimilation in a nasal + stop culture, e.g. Hindi > .


Aspiration and breathy-voice

Most Indo-Aryan languages have contrastive aspiration (), and some retain historical
breathy voice Breathy voice (also called murmured voice, whispery voice, soughing and susurration) is a phonation in which the vocal folds vibrate, as they do in normal (modal) voicing, but are adjusted to let more air escape which produces a sighing-like ...
on voiced consonants (). Sometimes both phenomena are analysed as a single aspiration contrast. The places and manners of articulation which allow contrastive aspiration vary by language; e.g. Sindhi permits phonemic , but the phonemic status of this sound in Hindi is uncertain, and many "Dardic" languages lack aspirated retroflex sibilants despite having unaspirated equivalents. In languages that have lost breathy-voice, the contrast has often been replaced with tone.


Regional developments

Some of these are mentioned in . * Implosives: Languages in the Sindhic subfamily, as well as Saraiki, western Marwari dialects, and some dialects of Gujarati have developed implosive consonants from historical intervocalic geminates and word-initial stops. Sindhi has a full implosive series except for the dental implosive: . It has been claimed that Wadiyari Koli has the dental implosive too. Other languages have less complete implosive series, e.g. Kacchi has just . *
Prenasalized stops Prenasalized consonants are phonetic sequences of a nasal and an obstruent (or occasionally a non-nasal sonorant such as ) that behave phonologically like single consonants. The primary reason for considering them to be single consonants, rathe ...
: Sinhala and Maldivian (Dhivehi) have a series of prenasalized stops covering all places except for palatal: . *
Palatalization Palatalization may refer to: *Palatalization (phonetics), the phonetic feature of palatal secondary articulation *Palatalization (sound change) Palatalization is a historical-linguistic sound change that results in a palatalized articulation ...
: Kashmiri (natively) and some Romani dialects (from contact with Slavic languages) have contrastive palatalisation. * Voiceless lateral In Gawarbati, some Pashai dialects, partly Bashkarik and some Shina dialects have /ɬ/ from clusters of tr kr or sometimes pr; dr gr and br merged with /l/ in these languages. *
Lateral affricate A lateral is a consonant in which the airstream proceeds along one or both of the sides of the tongue, but it is blocked by the tongue from going through the middle of the mouth. An example of a lateral consonant is the English ''L'', as in ''Larr ...
s: Bhadarwahi has an unusual series of lateral retroflex affricates ( derived from historical clusters.


Vowels

Vowel typologies are varied across Indo-Aryan due to diachronic mergers and (in some cases) splits, as well as different accounts by linguists for even the widely-spoken languages. Vowel systems per are listed below. Many languages also have phonemic nasal vowels. Sylheti language being a tonal, still classified as the Indo-Aryan language. The vowels of Sylheti language listed below.


Charts

The following are consonant systems of major and representative New Indo-Aryan languages, mostly following , though here they are in IPA. Parentheses indicate those consonants found only in loanwords: square brackets indicate those with "very low functional load". The arrangement is roughly geographical.


Sociolinguistics


Register

In many Indo-Aryan languages, the literary register is often more archaic and utilises a different lexicon (Sanskrit or Perso-Arabic) than spoken vernacular. One example is Bengali's high literary form, Sādhū bhāśā as opposed to the more modern Calita bhāśā (Cholito-bhasha). This distinction approaches
diglossia In linguistics, diglossia () is a situation in which two dialects or languages are used (in fairly strict compartmentalization) by a single language community. In addition to the community's everyday or vernacular language variety (labeled " ...
.


Language and dialect

In the context of South Asia, the choice between the appellations "language" and "dialect" is a difficult one, and any distinction made using these terms is obscured by their ambiguity. In one general colloquial sense, a language is a "developed" dialect: one that is standardised, has a written tradition and enjoys social prestige. As there are degrees of development, the boundary between a language and a dialect thus defined is not clear-cut, and there is a large middle ground where assignment is contestable. There is a second meaning of these terms, in which the distinction is drawn on the basis of linguistic similarity. Though seemingly a "proper" linguistics sense of the terms, it is still problematic: methods that have been proposed for quantifying difference (for example, based on
mutual intelligibility In linguistics, mutual intelligibility is a relationship between languages or dialects in which speakers of different but related varieties can readily understand each other without prior familiarity or special effort. It is sometimes used as a ...
) have not been seriously applied in practice; and any relationship established in this framework is relative.


See also

* Indo-Aryans * Iranic languages *
Indo-Aryan migration The Indo-Aryan migrations were the migrations into the Indian subcontinent of Indo-Aryan peoples, an ethnolinguistic group that spoke Indo-Aryan languages, the predominant languages of today's North India, Pakistan, Nepal, Bangladesh, Sri Lank ...
* Proto-Vedic Continuity * The family of Brahmic scripts * Linguistic history of India *
Indo-Aryan loanwords in Tamil The Tamil language has absorbed many Indo-Aryan, Prakrit, Pali and Sanskrit loanwords ever since the early 1st millennium CE, when the Sangam period Chola kingdoms became influenced by spread of Jainism, Buddhism and early Hinduism. M ...
* Languages of Bangladesh *
Languages of India Languages spoken in India belong to several language families, the major ones being the Indo-European languages spoken by 78.05% of Indians and the Dravidian languages spoken by 19.64% of Indians, both families together are sometimes known ...
* Languages of Maldives * Languages of Nepal * Languages of Pakistan *
Languages of Sri Lanka Several languages are spoken in Sri Lanka within the Indo-Aryan, Austronesian, and Dravidian families. Sri Lanka accords official status to Sinhala and Tamil, and English as a recognised language. The languages spoken on the island nation are ...
* Languages of South Asia


Notes


References


Further reading

* John Beames, ''A comparative grammar of the modern Aryan languages of India: to wit, Hindi, Panjabi, Sindhi, Gujarati, Marathi, Oriya, and Bangali''. Londinii: Trübner, 1872–1879. 3 vols. *Morgenstierne, Georg. "Early Iranic Influence upon Indo-Aryan." Acta Iranica, I. série, Commemoration Cyrus. Vol. I. Hommage universel (1974): 271-279. * . * Madhav Deshpande (1979). ''Sociolinguistic attitudes in India: An historical reconstruction''. Ann Arbor: Karoma Publishers. , (pbk). * Chakrabarti, Byomkes (1994). ''A comparative study of Santali and Bengali''. Calcutta: K.P. Bagchi & Co. * Erdosy, George. (1995). ''The Indo-Aryans of ancient South Asia: Language, material culture and ethnicity''. Berlin:
Walter de Gruyter Walter de Gruyter GmbH, known as De Gruyter (), is a German scholarly publishing house specializing in academic literature. History The roots of the company go back to 1749 when Frederick the Great granted the Königliche Realschule in B ...
. .
Ernst Kausen, 2006. ''Die Klassifikation der indogermanischen Sprachen''
(
Microsoft Word Microsoft Word is a word processing software developed by Microsoft. It was first released on October 25, 1983, under the name ''Multi-Tool Word'' for Xenix systems. Subsequent versions were later written for several other platforms includi ...
, 133 KB) * Kobayashi, Masato.; & George Cardona (2004). ''Historical phonology of old Indo-Aryan consonants''. Tokyo: Research Institute for Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies. . * . * Misra, Satya Swarup. (1980). ''Fresh light on Indo-European classification and chronology''. Varanasi: Ashutosh Prakashan Sansthan. * Misra, Satya Swarup. (1991–1993). ''The Old-Indo-Aryan, a historical & comparative grammar'' (Vols. 1–2). Varanasi: Ashutosh Prakashan Sansthan. * Sen, Sukumar. (1995). ''Syntactic studies of Indo-Aryan languages''. Tokyo: Institute for the Study of Languages and Foreign Cultures of Asia and Africa, Tokyo University of Foreign Studies. * Vacek, Jaroslav. (1976). ''The sibilants in Old Indo-Aryan: A contribution to the history of a linguistic area''. Prague: Charles University.


External links


The Indo-Aryan languages
25 October 2009
The Indo-Aryan languages
Colin P.Masica
Survey of the syntax of the modern Indo-Aryan languages
(Rajesh Bhatt), 7 February 2003. {{DEFAULTSORT:Indo-Aryan Languages Indo-European languages