HOME

TheInfoList



OR:

The Indo-Aryan languages (or sometimes Indic languages) are a branch of the
Indo-Iranian languages The Indo-Iranian languages (also Indo-Iranic languages or Aryan languages) constitute the largest and southeasternmost extant branch of the Indo-European languages, Indo-European language family (with over 400 languages), predominantly spoken i ...
in the
Indo-European language family The Indo-European languages are a language family native to the overwhelming majority of Europe, the Iranian plateau, and the northern Indian subcontinent. Some European languages of this family, English, French, Portuguese, Russian, Dutch ...
. As of the early 21st century, they have more than 800 million speakers, primarily concentrated in
India India, officially the Republic of India (Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the so ...
,
Pakistan Pakistan ( ur, ), officially the Islamic Republic of Pakistan ( ur, , label=none), is a country in South Asia. It is the world's List of countries and dependencies by population, fifth-most populous country, with a population of almost 24 ...
,
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the eighth-most populous country in the world, with a population exceeding 165 million people in an area of . Bangladesh is among the mos ...
,
Nepal Nepal (; ne, नेपाल ), formerly the Federal Democratic Republic of Nepal ( ne, सङ्घीय लोकतान्त्रिक गणतन्त्र नेपाल ), is a landlocked country in South Asia. It is mai ...
,
Sri Lanka Sri Lanka (, ; si, ශ්‍රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
, and
Maldives Maldives (, ; dv, ދިވެހިރާއްޖެ, translit=Dhivehi Raajje, ), officially the Republic of Maldives ( dv, ދިވެހިރާއްޖޭގެ ޖުމްހޫރިއްޔާ, translit=Dhivehi Raajjeyge Jumhooriyyaa, label=none, ), is an archipelag ...
. Moreover, apart from the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
, large immigrant and expatriate Indo-Aryan–speaking communities live in
Northwestern Europe Northwestern Europe, or Northwest Europe, is a loosely defined subregion of Europe, overlapping Northern and Western Europe. The region can be defined both geographically and ethnographically. Geographic definitions Geographically, Northw ...
,
Western Asia Western Asia, West Asia, or Southwest Asia, is the westernmost subregion of the larger geographical region of Asia, as defined by some academics, UN bodies and other institutions. It is almost entirely a part of the Middle East, and includes Ana ...
,
North America North America is a continent in the Northern Hemisphere and almost entirely within the Western Hemisphere. It is bordered to the north by the Arctic Ocean, to the east by the Atlantic Ocean, to the southeast by South America and the Car ...
, the
Caribbean The Caribbean (, ) ( es, El Caribe; french: la Caraïbe; ht, Karayib; nl, De Caraïben) is a region of the Americas that consists of the Caribbean Sea, its islands (some surrounded by the Caribbean Sea and some bordering both the Caribbean Se ...
,
Southeast Africa Southeast Africa or Southeastern Africa is an African region that is intermediate between East Africa and Southern Africa. It comprises the countries Botswana, Eswatini, Kenya, Lesotho, Malawi, Mozambique, Namibia, Rwanda, South Africa, Tanzania ...
,
Polynesia Polynesia () "many" and νῆσος () "island"), to, Polinisia; mi, Porinihia; haw, Polenekia; fj, Polinisia; sm, Polenisia; rar, Porinetia; ty, Pōrīnetia; tvl, Polenisia; tkl, Polenihia (, ) is a subregion of Oceania, made up of ...
and
Australia Australia, officially the Commonwealth of Australia, is a Sovereign state, sovereign country comprising the mainland of the Australia (continent), Australian continent, the island of Tasmania, and numerous List of islands of Australia, sma ...
, along with several million speakers of
Romani language Romani (; also Romany, Romanes , Roma; rom, rromani ćhib, links=no) is an Indo-Aryan macrolanguage of the Romani communities. According to '' Ethnologue'', seven varieties of Romani are divergent enough to be considered languages of their ...
s primarily concentrated in
Southeastern Europe Southeast Europe or Southeastern Europe (SEE) is a geographical subregion of Europe, consisting primarily of the Balkans. Sovereign states and territories that are included in the region are Albania, Bosnia and Herzegovina, Bulgaria, Croatia (al ...
. There are over 200 known Indo-Aryan languages. Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally preser ...
, through
Middle Indo-Aryan languages The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OIA; ...
(or
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s). The largest such languages in terms of first-speakers are
Hindi–Urdu Hindustani (; Devanagari: , * * * * ; Perso-Arabic: , , ) is the ''lingua franca'' of Northern and Central India and Pakistan. Hindustani is a pluricentric language with two standard registers, known as Hindi and Urdu. Thus, the langu ...
(),Standard Hindi first language: 260.3 million (2001), as second language: 120 million (1999). Urdu L1: 68.9 million (2001–2014), L2: 94 million (1999): ''Ethnologue'' 19.
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
(242 million), Punjabi (about 120 million),
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
(112 million),
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
(60 million),
Rajasthani Rajasthani may refer to: * something of, from, or related to Rajasthan, a state of India * Rajasthani languages, a group of languages spoken there * Rajasthani people, the native inhabitants of the region * Rajasthani architecture * Rajasthani art ...
(58 million),
Bhojpuri Bhojpuri (;Bhojpuri entry, Oxford Dictionaries
, Oxford U ...
(51 million),
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
(35 million), Maithili (about 34 million), Sindhi (25 million), Nepali (16 million), Assamese (15 million),
Chhattisgarhi Chhattisgarhi ( / ) is an Indo-Aryan language, spoken by approximately 16 million people from Chhattisgarh & other states. It is mostly spoken in the Indian states of Chhattisgarh, Odisha, Madhya Pradesh & Maharashtra. It is closely related ...
(18 million), Sinhala (17 million), and
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
(). A 2005 estimate placed the total number of native speakers of the Indo-Aryan languages at nearly 900 million people.


Classification


Theories

The Indo-Aryan family as a whole is thought to represent a
dialect continuum A dialect continuum or dialect chain is a series of Variety (linguistics), language varieties spoken across some geographical area such that neighboring varieties are Mutual intelligibility, mutually intelligible, but the differences accumulat ...
, where languages are often transitional towards neighboring varieties. Because of this, the division into languages vs. dialects is in many cases somewhat arbitrary. The classification of the Indo-Aryan languages is controversial, with many transitional areas that are assigned to different branches depending on classification. There are concerns that a
tree model In historical linguistics, the tree model (also Stammbaum, genetic, or cladistic model) is a model of the evolution of languages analogous to the concept of a family tree, particularly a phylogenetic tree in the biological evolution of species. ...
is insufficient for explaining the development of New Indo-Aryan, with some scholars suggesting the
wave model In historical linguistics, the wave model or wave theory (German ''Wellentheorie'') is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, affecting ...
.


Subgroups

The following table of proposals is expanded from . Note that the table only lists some modern Indo-Aryan languages.
Anton I. Kogan Anton may refer to: People *Anton (given name), including a list of people with the given name *Anton (surname) Places *Anton Municipality, Bulgaria **Anton, Sofia Province, a village *Antón District, Panama **Antón, a town and capital of th ...
, in 2016, conducted a
lexicostatistical Lexicostatistics is a method of comparative linguistics that involves comparing the percentage of lexical cognates between languages to determine their relationship. Lexicostatistics is related to the comparative method but does not reconstruct a p ...
study of the New Indo-Aryan languages based on a 100-word
Swadesh list The Swadesh list ("Swadesh" is pronounced ) is a classic compilation of tentatively universal concepts for the purposes of lexicostatistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness ...
, using techniques developed by the glottochronologist and comparative linguist
Sergei Starostin Sergei Anatolyevich Starostin (russian: Серге́й Анато́льевич Ста́ростин; March 24, 1953 – September 30, 2005) was a Russian historical linguist and philologist, perhaps best known for his reconstructions of hypotheti ...
. That grouping system is notable for Kogan's exclusion of Dardic from Indo-Aryan on the basis of his previous studies showing low lexical similarity to Indo-Aryan (43.5%) and negligible difference with similarity to Iranian (39.3%). He also calculated Sinhala–Dhivehi to be the most divergent Indo-Aryan branch. Nevertheless, the modern consensus of Indo-Aryan linguists tends towards the inclusion of Dardic based on morphological and grammatical features.


Inner–Outer hypothesis

The Inner–Outer hypothesis argues for a core and periphery of Indo-Aryan languages, with Outer Indo-Aryan (generally including Eastern and Southern Indo-Aryan, and sometimes Northwestern Indo-Aryan, Dardic and Pahari) representing an older stratum of Old Indo-Aryan that has been mixed to varying degrees with the newer stratum that is Inner Indo-Aryan. It is a contentious proposal with a long history, with varying degrees of claimed phonological and morphological evidence. Since its proposal by
Rudolf Hoernlé Augustus Frederic Rudolf Hoernlé CIE (1841 – 1918), also referred to as Rudolf Hoernle or A. F. Rudolf Hoernle, was a German Indologist and philologist. He is famous for his studies on the Bower Manuscript (1891), Weber Manuscript (1893) and ...
in 1880 and refinement by
George Grierson George Allison Grierson (April 11, 1867–October 18, 1931) was a politician in Manitoba, Canada. He served in the Legislative Assembly of Manitoba from 1914 to 1922, and was a cabinet minister in the government of Tobias Norris. Grierso ...
it has undergone numerous revisions and a great deal of debate, with the most recent iteration by
Franklin Southworth Franklin C. Southworth (born 1929) is an American linguist and Professor Emeritus of South Asian linguistics at the University of Pennsylvania The University of Pennsylvania (also known as Penn or UPenn) is a private research university i ...
and
Claus Peter Zoller Claus Peter Zoller is a linguist and professor of South Asian Studies at the Department of Culture Studies and Oriental Languages of the University of Oslo. His research interests include Hindi literature and Hindi, linguistics, the languages of the ...
based on robust linguistic evidence (particularly an Outer past tense in ''-l-''). Some of the theory's skeptics include
Suniti Kumar Chatterji Bhashacharya Acharya Suniti Kumar Chatterjee (26 November 1890 – 29 May 1977) was an Indian linguist, educationist and litterateur. He was a recipient of the second-highest Indian civilian honour of Padma Vibhushan. Life Childhood Chatterji ...
and Colin P. Masica.


Groups

The below classification follows , and .


Dardic

The Dardic languages (also Dardu or Pisaca) are a group of Indo-Aryan languages largely spoken in the northwestern extremities of the Indian subcontinent. Dardic was first formulated by
George Abraham Grierson Sir George Abraham Grierson (7 January 1851 – 9 March 1941) was an Irish administrator and linguist in British India. He worked in the Indian Civil Service but an interest in philology and linguistics led him to pursue studies in the languag ...
in his
Linguistic Survey of India The Linguistic Survey of India (LSI) is a comprehensive survey of the languages of British India, describing 364 languages and dialects. The Survey was first proposed by George Abraham Grierson, a member of the Indian Civil Service and a linguist w ...
but he did not consider it to be a subfamily of Indo-Aryan. The Dardic group as a genetic grouping (rather than areal) has been scrutinised and questioned to a degree by recent scholarship: Southworth, for example, says "the viability of Dardic as a genuine subgroup of Indo-Aryan is doubtful" and "the similarities among ardic languagesmay result from subsequent convergence". The Dardic languages are thought to be transitional with Punjabi and Pahari (e.g. Zoller describes Kashmiri as "an interlink between Dardic and West Pahāṛī"), as well as non-Indo-Aryan Nuristani; and are renowned for their relatively conservative features in the context of
Proto-Indo-Aryan Proto-Indo-Aryan (sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Proto-Indo-Aryans. Being descended from Proto-Indo-Iranian (which in turn is descended fr ...
. * Kashmiri:
Kashmiri Kashmiri may refer to: * People or things related to the Kashmir Valley or the broader region of Kashmir * Kashmiris, an ethnic group native to the Kashmir Valley * Kashmiri language, their language People with the name * Kashmiri Saikia Baruah ...
,
Kishtwari Kishtwari or Kashtwari is a northern Indo-Aryan language closely related to the Kashmiri language, with strong influences from neighboring Western Pahari varieties, spoken in Kishtwar district in Jammu and Kashmir, India. Kishtwari has historic ...
,
Poguli Pogali or Pugali, more recently known, together with neighboring languages, as Panchali or Khah, is an Indo-Aryan language spoken in parts of the Jammu region of Jammu and Kashmir, India. Its area encompasses the Pogal and Paristan valleys, and c ...
; * Shina: Brokskad, Kundal Shahi,
Shina Shina may refer to: * Shina language, an Indo-Aryan language spoken in Gilgit-Baltistan, Pakistan * Shina people, a Dardic ethnic group in Gilgit Baltistan, Pakistan People named Shina * Shina Matsudo (born 1973), Japanese freestyle swimmer * ...
, Ushojo,
Kalkoti Kalkoti, also known as Goedijaa, is an Indo-Aryan language spoken in the Kalkot Tehsil, in the Upper Dir district in Pakistan Pakistan ( ur, ), officially the Islamic Republic of Pakistan ( ur, , label=none), is a country in South Asi ...
, Palula,
Savi Savi is a town in Benin that was the capital of the Kingdom of Whydah prior to its capture by the forces of Dahomey in 1727. An account of the city was given by Robert Norris in 1789: There were British, French, Dutch and Portuguese factorie ...
; * Chitrali:
Kalasha A kalasha, also spelled kalash or kalasa, also called ghat or ghot ( sa, कलश , Telugu: కలశము Kannada: ಕಳಶ literally "pitcher, pot"), is a metal (brass, copper, silver or gold) pot with a large base and small mouth, large eno ...
,
Khowar Khowar () or Chitrali, is an Indo-Aryan language primarily spoken in Chitral and surrounding areas in Pakistan. Khowar is the lingua franca of Chitral, and it is also spoken in the Gupis-Yasin and Ghizer districts of Gilgit-Baltistan, as ...
; * Kohistani:
Bateri Bateri (, बटेरी) is an Indo-Aryan language spoken in Kohistan District, Pakistan and Jammu and Kashmir, India. Status As of now, there is little research done on the language and is currently being studied and surveyed by organization ...
, Chilisso, Gowro,
Indus Kohistani Indus Kohistani (, Kōstaiñ) is an Indo-Aryan language spoken in the former Kohistan District of Pakistan. The language was referred to as Maiyã (Mayon) or Shuthun by early researchers, but subsequent observations have not verified that these ...
, Kalami,
Tirahi Tirahi ( ps, تيراهي) were the speakers of the Tirahi language, a nearly extinct if not already extinct Indo-Aryan language which may still be spoken by older adults, who are likewise fluent in Pashto, in a few villages in the southeast of Jal ...
, Torwali, Wotapuri-Katarqalai; * Pashayi * Kunar: Dameli,
Gawar-Bati Gawar-Bati or Narsati is an Indo-Aryan language spoken in the Chitral region of northern Pakistan, and across the border in Afghanistan. It is also known as Aranduyiwar in Chitral because it is spoken in Arandu, which is the last village in lo ...
, Nangalami,
Shumashti Shumashti – also known as Shumasht – is an Indo-Aryan languages, Indo-Aryan language spoken in eastern Afghanistan. It is spoken in parts of Kunar Province: on the western side of the Kunar Valley between Jalalabad and the Pech Valley. The nu ...
.


Northern Zone

The Northern Indo-Aryan languages, also known as the Pahari ('hill') languages, are spoken throughout the Himalayan regions of the subcontinent. * Eastern Pahari: Nepali, Jumli,
Doteli Doteli, or Dotyali () is an Indo-Aryan language spoken by about 800,000 people, most of whom live in Nepal. It is a dialect of Khas, which is an ancient form of the modern Nepali language, and is written in the Devanagari script. It has official ...
; * Central Pahari:
Garhwali Garhwali may refer to: * Garhwali people, an ethno-linguistic group who live in northern India * Garhwali language, the Indo-Aryan language spoken by Garhwali people * anything from or related to: **Garhwal division, a region in state of Uttarakhan ...
, Kumaoni; *
Western Pahari The Western Pahari languages are a group of Northern Indo-Aryan languages that are spoken in the state of Himachal Pradesh, Jammu region of Jammu and Kashmir and parts of Uttarakhand and Punjab Languages The following lists the languages cla ...
(Himachali):
Dogri Dogri (Name Dogra Akkhar: ; Devanagari: डोगरी; Nastaliq: ; ) is an Indo-Aryan language primarily spoken in the Jammu region of Jammu and Kashmir, India, with smaller groups of speakers in adjoining regions of western Himachal Prade ...
,
Kangri Kangri can mean: *of, from, or related to the Kangra Valley or the Kangra district of northern India *Kangri language, the Indo-Aryan language of the valley *Kanger A kanger (; also known as kangri or kangid or kangir) is an earthen pot woven ar ...
,
Bhadarwahi Bhadarwahi is an Indo-Aryan language of the Western Pahari group spoken in the Bhaderwah region of Jammu and Kashmir, India. The name Bhadarwahi can be understood either in a narrow sense as referring to the dialect, locally known as Bhiḍl ...
,
Churahi Churahi (Takri: ) is a Western Pahari language of Himachal Pradesh, India. It is spoken in the Chaurah and Saluni tehsils of Chamba district, and is considered vulnerable. Adages Script The native script of the language is Takri script. ...
,
Bhateali Bhateali, or Bhattiyali, is a Western Pahari language of northern India. The 2011 Indian Census counted 23,970 speakers, of which 15,107 were found in Chamba district of Himachal Pradesh. Bhateali has sometimes been counted as dialect of either ...
,
Bilaspuri Bilaspuri (Takri: ), or Kahluri (Takri:) is a language spoken in northern India, predominantly in the Bilaspur district of Himachal Pradesh. It is associated with the people of the former princely state of Bilaspur in the Panjab Hills. Bilaspu ...
,
Chambeali Chambeali (Takri: ) is a language spoken in the Chamba district of Himachal Pradesh. Classification The Chambeali language is a part of the North-Western branch of the Indo-Aryan languages. It is further classified as a member of the Western-P ...
, Gaddi,
Pangwali Pangwali (Takri: ) is a Western Pahari language of Himachal Pradesh, India. It is spoken in the Pangi Tehsil of Chamba district, and is threatened to go extinct. Pangwali is natively written in the Takri script, but Devanagari is used as well. ...
,
Mandeali Mandeali (Takri: ) is a language spoken in northern India, predominantly in the Mandi district of Himachal Pradesh by the people of the Mandi Valley and particularly in the major city of Mandi. Other spellings for the name are Mandiyali and Mand ...
,
Mahasu Pahari Mahasu Pahari (Takri: ) is a Western Pahari (Himachali, Takri: ) language spoken in Himachal Pradesh. It is also known as Mahasui or Mahasuvi. The speaking population is about 1,000,000 (2001). It is more commonly spoken in the Himachal Pradesh, S ...
,
Jaunsari Jaunsari may refer to: * Jaunsari people, an ethnic group of northern India * Jaunsari language Jaunsari () is a Western Pahari language of northern India spoken by the Jaunsari people in the Chakrata and Kalsi blocks of Dehradun district in ...
,
Kullui Kului (, also known as Kulvi, Takri: ) is a Western Pahari language spoken in the Indian state of Himachal Pradesh. Phonology Consonants For the stops and affricates there is a four-way distinction in phonation between tenuis , voiced , ...
, Pahari Kinnauri, Hinduri,
Sarazi Sarazi or Sirazi (also spelled Siraji) is an Indo-Aryan language of Jammu and Kashmir, India. It is native to the Saraz region, a hilly area taking up the northern half of Doda district and parts of neighbouring Ramban and Kishtwar districts. ...
,
Sirmauri Sirmauri is a Western Pahari language spoken in the Sirmaur district in the northern Indian state of Himachal Pradesh Himachal Pradesh (; ; "Snow-laden Mountain Province") is a state in the northern part of India. Situated in the Western ...
.


Northwestern Zone

Northwestern Indo-Aryan languages are spoken in the northwestern region of India and Eastern Pakistan. Punjabi is spoken predominantly in the
Punjab region Punjab (; Punjabi Language, Punjabi: پنجاب ; ਪੰਜਾਬ ; ; also Romanization, romanised as ''Panjāb'' or ''Panj-Āb'') is a geopolitical, cultural, and historical region in South Asia, specifically in the northern part of the I ...
and is the official language of the northern Indian state of Punjab; in addition to being the most widely-spoken language in Pakistan. To the south, Sindhi and its variants are spoken; primarily in
Sindh Sindh (; ; ur, , ; historically romanized as Sind) is one of the four provinces of Pakistan. Located in the southeastern region of the country, Sindh is the third-largest province of Pakistan by land area and the second-largest province ...
. Northwestern languages are ultimately thought to be descended from
Shauraseni Prakrit Shauraseni Prakrit (, ) was a Middle Indo-Aryan language and a Dramatic Prakrit. Shauraseni was the chief language used in drama in northern medieval India. Most of the material in this language originates from the 3rd to 10th centuries, though ...
. * Punjabi ** Eastern Punjabi: Punjabi,
Doabi Doabi is a dialect of the Punjabi language. The dialect is named for the region in which it was historically spoken, Doaba (also known as Bist Doab); the word doab means "the land between two rivers" and this dialect was historically spoken in ...
, Majhi, Malwai, Puadhi, Sansi; ** Western Punjabi (
Lahnda Lahnda () () also known as Lahndi or Western Punjabi, is a group of north-western Indo-Aryan language varieties spoken in parts of Pakistan and India. Its validity as a genetic grouping is not certain. Terms like ''Lahnda'' or ''Western Punja ...
): Saraiki,
Hindko Hindko (, romanized: , ) is a cover term for a diverse group of Lahnda dialects spoken by several million people of various ethnic backgrounds in several areas in northwestern Pakistan, primarily in the provinces of Khyber Pakhtunkhwa and Pun ...
,
Pahari-Pothwari The Indo-Aryan language spoken on the Pothohar Plateau in the far north of Punjab, Pakistan, Pakistani Punjab, as well as in most of Pakistan's Azad Kashmir and in western areas of India's Jammu and Kashmir (union territory), Jammu and Kashmir, i ...
, Inku†; * Sindhi: Sindhi,
Jadgali Jaḍgālī is an Indo-Aryan language spoken by the Jadgal, an ethno-linguistic group of Pakistan and Iran. It is one of only two Indo-Aryan languages found on the Iranian plateau. It is a dialect of Sindhi most closely related to Lasi. The ...
, Kutchi,
Luwati Luwati (Al-Lawatia, ar, اللواتية, translit=al-lawātiyya; also known as Khoja, Khojki, Lawatiyya, Lawatiya, or Hyderabadi) is an Indo-Aryan language spoken by 5,000 to 10,000 people known as the Lawatiya (also called the Khojas or Hydera ...
,
Memoni Memoni (ميموني, મેમોની) is an Indo-Aryan language spoken by Kathiawari Memons from the Kathiawar region of Gujarat, India. Memon people are a subgroup or an ethnic group that originated in north-western India. After the India ...
, Khetrani, Kholosi.


Western Zone

Western Indo-Aryan languages, are spoken in the central and western areas within India, such as
Madhya Pradesh Madhya Pradesh (, ; meaning 'central province') is a state in central India. Its capital is Bhopal, and the largest city is Indore, with Jabalpur, Ujjain, Gwalior, Sagar, and Rewa being the other major cities. Madhya Pradesh is the seco ...
and
Rajasthan Rajasthan (; lit. 'Land of Kings') is a state in northern India. It covers or 10.4 per cent of India's total geographical area. It is the largest Indian state by area and the seventh largest by population. It is on India's northwestern si ...
, in addition to contiguous regions in Pakistan. Gujarati is the official language of
Gujarat Gujarat (, ) is a state along the western coast of India. Its coastline of about is the longest in the country, most of which lies on the Kathiawar peninsula. Gujarat is the fifth-largest Indian state by area, covering some ; and the ninth ...
, and is spoken by over 50 million people. In Europe, various
Romani languages Romani (; also Romany, Romanes , Roma; rom, rromani ćhib, links=no) is an Indo-Aryan macrolanguage of the Romani communities. According to ''Ethnologue'', seven varieties of Romani are divergent enough to be considered languages of their o ...
are spoken by the
Romani people The Romani (also spelled Romany or Rromani , ), colloquially known as the Roma, are an Indo-Aryan ethnic group, traditionally nomadic itinerants. They live in Europe and Anatolia, and have diaspora populations located worldwide, with sig ...
, an itinerant community who historically migrated from India. The Western Indo-Aryan languages are thought to have diverged from their northwestern counterparts, although they have a common antecedent in
Shauraseni Prakrit Shauraseni Prakrit (, ) was a Middle Indo-Aryan language and a Dramatic Prakrit. Shauraseni was the chief language used in drama in northern medieval India. Most of the material in this language originates from the 3rd to 10th centuries, though ...
. *
Rajasthani Rajasthani may refer to: * something of, from, or related to Rajasthan, a state of India * Rajasthani languages, a group of languages spoken there * Rajasthani people, the native inhabitants of the region * Rajasthani architecture * Rajasthani art ...
: Standard Rajasthani, Bagri, Marwari,
Mewati Mewati (Devanagri:मेवाती; Perso-Arabic:میواتی) is an Indo-Aryan language spoken by about three million speakers in the Mewat Region (Alwar and Bharatpur, districts of Rajasthan, Nuh district of Haryana). While other people ...
,
Dhundari Dhundhari (also known as Jaipuri) is a dialect of Rajasthani spoken in the Dhundhar region of northeastern Rajasthan state, India. Dhundari-speaking people are found in four districts – Jaipur, Sawai Madhopur, Dausa, Tonk and some parts of ...
,
Harauti Harauti or Hadauti (Hadoti) is a Rajasthani language spoken by approximately four million people in the Hadoti region of southeastern Rajasthan, India. Its speakers are concentrated in the districts of Kota, Baran, Bundi and Jhalawar in Rajast ...
,
Mewari Mewari is an Indo-Aryan language of the Rajasthani group. It is spoken by about five million speakers in Rajsamand, Bhilwara, Udaipur, Chittorgarh and Pratapgarh districts of Rajasthan state and Mandsaur, Neemuch districts of Madhya Pradesh ...
,
Shekhawati Shekhawati is a semi-arid historical region located in the northeast part of Rajasthan, India. The region was ruled by Shekhawat Rajputs. Shekhawati is located in North Rajasthan, comprising the districts of Jhunjhunu district, Jhunjhunu, part ...
,
Dhatki Dhatki (धाटकी; ڍاٽڪي), also known as Dhatti (धाटी; ڍاٽي) or Thari (थारी; ٿَري), is one of the Rajasthani languages of the Indo-Aryan branch of the Indo-European language family. Dhatki is closely related ...
,
Malvi The Malvi or Malavi, also known as Manthani or Mahadeopuri, is breed of zebu cattle from the Malwa plateau in western Madhya Pradesh, in central India. It is a good draught breed; the milk yield of the cows is low. The breed has been studie ...
,
Nimadi Nimadi is a Western Indo-Aryan language spoken in the Nimar region of west-central India within the state of Madhya Pradesh. This region lies adjacent to Maharashtra and south of Malwa. The districts where Nimadi is spoken are: Barwani, Khandwa ...
,
Gujari Gojri (, ), also known as Gujari, Gujri, Gojari, or Gojri, is a variety of Rajasthani spoken by the Gurjars and other tribes of India, Pakistan and Afghanistan. In India, the language is mainly spoken in Jammu and Kashmir, Himachal Pradesh, ...
,
Goaria Goaria is a Marwari Rajasthani language spoken by some 25,000 people in Sindh Province, Pakistan. The people are predominantly Hindu, and use the Hindi language Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (De ...
, Loarki,
Bhoyari Bhoyari, also known as Bhoyari Pawari, is an Indo-Aryan dialect of Central India. It is spoken by the Bhoyar social group in Betul, Chhindwara, and Wardha districts. See also * Rajasthani Language Rajasthani (Devanagari: ) refers to ...
, Kanjari, Od; *
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
:
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
, Jandavra, Saurashtra, Aer, Vaghri, Parkari Koli,
Kachi Koli Kachi Koli is an Indo-Aryan language spoken in Pakistan and India India, officially the Republic of India (Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most ...
,
Wadiyara Koli Wadiyara Koli is an Indo-Aryan language of the Gujarati group. It is spoken by the Wadiyara people, who originate from Wadiyar in Gujarat; many of whom are thought to have migrated to Sindh in the early twentieth century, following the onset ...
; * Bhil languages, Bhil: Kalto language, Kalto, Vasavi language, Vasavi, Wagdi, Gamit language, Gamit, Vaagri Booli language, Vaagri Booli; ** Northern Bhil: Bauria language, Bauria, Bhilori language, Bhilori, Magari language, Magari; ** Central Bhil: Bhili language, Bhili proper, Bhilali language, Bhilali, Chodri language, Chodri, Dhodia language, Dhodia, Dhanki language, Dhanki, Dubli language, Dubli; ** Bareli: Palya Bareli language, Palya Bareli, Pauri Bareli language, Pauri Bareli, Rathwi Bareli language, Rathwi Bareli, Pardhi language, Pardhi; * Khandeshi language, Khandeshi * Lambadi * Domaaki language, Domaaki * Domari language, Domari *
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
: Carpathian Romani, Balkan Romani, Vlax Romani language, Vlax Romani; ** Northern Romani dialects, Northern Romani: Sinte Romani, Finnish Kalo language, Finnish Kalo, Baltic Romani.


Central Zone (Madhya ''or'' Hindi)

Within India, Hindi languages are spoken primarily in the Hindi belt regions and Gangetic plains, including Delhi and the surrounding areas; where they are often transitional with neighbouring lects. Many of these languages, including Braj and Awadhi, have rich literary and poetic traditions. Urdu, a Persianized derivative of Khariboli, is the official language of
Pakistan Pakistan ( ur, ), officially the Islamic Republic of Pakistan ( ur, , label=none), is a country in South Asia. It is the world's List of countries and dependencies by population, fifth-most populous country, with a population of almost 24 ...
and also has strong Dakhini, historical connections to
India India, officially the Republic of India (Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the so ...
, where it also has been designated with official status. Hindi, a standardized and Sanskritized register of Khariboli, is the official language of the Government of India. Hindustani language, Together with Urdu, it is the third most-spoken language in the world. * Western Hindi: Hindustani language, Hindustani (including Hindi, Standard Hindi and Urdu, Standard Urdu), Khariboli, Braj Bhasha, Braj, Haryanvi language, Haryanvi, Bundeli language, Bundeli, Kannauji language, Kannauji, Parya language, Parya; * Eastern Hindi: Bagheli language, Bagheli,
Chhattisgarhi Chhattisgarhi ( / ) is an Indo-Aryan language, spoken by approximately 16 million people from Chhattisgarh & other states. It is mostly spoken in the Indian states of Chhattisgarh, Odisha, Madhya Pradesh & Maharashtra. It is closely related ...
, Surgujia language, Surgujia; ** Awadhi language, Awadhi: Fiji Hindi, Caribbean Hindustani


Eastern Zone

The Eastern Indo-Aryan languages, also known as Magadhan languages, are spoken throughout the eastern subcontinent, including Odisha and Bihar, alongside other regions surrounding the northwestern Himalayan corridor.
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
is the seventh most-spoken language in the world, and has a strong literary tradition; the national anthems of
India India, officially the Republic of India (Hindi: ), is a country in South Asia. It is the seventh-largest country by area, the second-most populous country, and the most populous democracy in the world. Bounded by the Indian Ocean on the so ...
and
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the eighth-most populous country in the world, with a population exceeding 165 million people in an area of . Bangladesh is among the mos ...
are written in Bengali. Assamese and
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
are the official languages of Assam and Odisha, respectively. The Eastern Indo-Aryan languages descend from Magadhan Apabhraṃśa and ultimately from Magadhi Prakrit. * Bihari languages, Bihari: **
Bhojpuri Bhojpuri (;Bhojpuri entry, Oxford Dictionaries
, Oxford U ...
, Caribbean Hindustani, Fiji Hindi; ** Magahi language, Magahi, Khortha language, Khortha; ** Maithili, Angika, Bajjika, Dehati; ** Sadanic languages, Sadanic: Nagpuri language, Nagpuri (Sadri), Kurmali language, Kurmali (Panchpargania); ** Tharu languages, Tharu, Kochila Tharu, Buksa language, Buksa, Majhi language, Majhi, Musasa language, Musasa; ** Kumhali language, Kumhali, Kuswaric: Danwar language, Danwar, Bote-Darai language, Bote-Darai; * Halbic languages, Halbic: Halbi language, Halbi, Kamar language, Kamar, Bhunjia language (Halbic), Bhunjia, Nahari language, Nahari; *
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
: Baleswari Odia, Baleswari, Kataki, Ganjami Odia, Ganjami, Sundargadi Odia, Sundargadi, Sambalpuri language, Sambalpuri, Desia language, Desia; ** Bodo Parja language, Bodo Parja, Bhatri language, Bhatri, Reli language, Reli, Kupia language, Kupia; * Bengali–Assamese languages, Bengali–Assamese: Bishnupriya Manipuri language, Bishnupriya Manipuri, Hajong language, Hajong, Chittagonian language, Chittagonian, Chakma language, Chakma, Noakhailla, Tanchangya language, Tanchangya, Rohingya language, Rohingya, Sylheti language, Sylheti,; ** Bengali-Gauda:
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
, Bangali (ethnic dialect), Bangali, Rarhi dialect, Rarhi, Varendri dialect, Varendri, Sundarbani, Manbhumi dialect, Manbhumi, Dhakaiya Kutti, Dobhashi; ** Kamarupic: Assamese, Kamrupi dialects, Kamrupi, Goalpariya dialects, Goalpariya, Rangpuri language, Rangpuri, Surjapuri language, Surjapuri, Rajbanshi language (Nepal), Rajbanshi;


Southern Zone

Marathi-Konkani languages are ultimately descended from Maharashtri Prakrit, whereas Insular Indo-Aryan languages are descended from Elu, Elu Prakrit and possess several characteristics that markedly distinguish them from most of their mainland Indo-Aryan counterparts. * Marathi-Konkani languages, Marathi-Konkani ** Marathic:
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
, Varhadi dialect, Varhadi, Andh language, Andh, Berar-Deccan Marathi, Phudagi language, Phudagi, Katkari language, Katkari, Varli language, Varli, Kadodi language, Kadodi; ** Konkanic: Konkani language, Konkani, Canarese Konkani, Maharashtrian Konkani.


Insular Indic

Insular Indic languages (of
Sri Lanka Sri Lanka (, ; si, ශ්‍රී ලංකා, Śrī Laṅkā, translit-std=ISO (); ta, இலங்கை, Ilaṅkai, translit-std=ISO ()), formerly known as Ceylon and officially the Democratic Socialist Republic of Sri Lanka, is an ...
and
Maldives Maldives (, ; dv, ދިވެހިރާއްޖެ, translit=Dhivehi Raajje, ), officially the Republic of Maldives ( dv, ދިވެހިރާއްޖޭގެ ޖުމްހޫރިއްޔާ, translit=Dhivehi Raajjeyge Jumhooriyyaa, label=none, ), is an archipelag ...
) started developing independently and diverging from the continental Indo-Aryan languages from around 5th century BCE. * Insular Indo-Aryan ** Sinhala ** Maldivian language, Maldivian: Dhivehi, Mahl


Unclassified

The following languages are otherwise unclassified within Indo-Aryan: * Chinali-Lahuli languages, Chinali–Lahul Lohar: Chinali language, Chinali, Lahul Lohar language, Lahul Lohar. * Badeshi language, Badeshi


History


Proto-Indo-Aryan

Proto-Indo-Aryan (or sometimes Proto-Indic) is the Linguistic reconstruction, reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Indo-Aryan peoples#History, pre-Vedic Indo-Aryans. Proto-Indo-Aryan is meant to be the predecessor of #Old Indo-Aryan, Old Indo-Aryan (1500–300 BCE), which is directly attested as Vedic Sanskrit, Vedic and Indo-Aryan superstrate in Mitanni, Mitanni-Aryan. Despite the great archaicity of Vedic, however, the other Indo-Aryan languages preserve a small number of Proto-Indo-Aryan language#Differences from Vedic, conservative features lost in Vedic.


Mitanni-Aryan hypothesis

Some theonyms, proper names, and other terminology of the Late Bronze Age Mitanni civilization of Upper Mesopotamia exhibit an Indo-Aryan superstrate. While what few written records left by the Mittani are either in Hurrian language, Hurrian (which appears to have been the predominant language of their kingdom) or Akkadian language, Akkadian (the main diplomatic language of the Late Bronze Age Near East), these apparently Indo-Aryan names suggest that an Indo-Aryan elite imposed itself over the Hurrians in the course of the Indo-Aryan migration, Indo-Aryan expansion. If these traces are Indo-Aryan, they would be the earliest known direct evidence of Indo-Aryan, and would increase the precision in dating the split between the Indo-Aryan and Iranian languages (as the texts in which the apparent Indicisms occur can be dated with some accuracy). In a treaty between the Hittites and the Mitanni, the deities Mitra, Varuna, Indra, and the Ashvins (Nasatya) are invoked. Kikkuli's horse training text includes technical terms such as ''aika'' (cf. Sanskrit ''eka'', "one"), ''tera'' (''tri'', "three"), ''panza'' (''pancha'', "five"), ''satta'' (''sapta'', seven), ''na'' (''nava'', "nine"), ''vartana'' (''vartana'', "turn", round in the horse race). The numeral ''aika'' "one" is of particular importance because it places the superstrate in the vicinity of Indo-Aryan proper as opposed to Indo-Iranian in general or early Iranian (which has ''aiva''). Another text has ''babru'' (''babhru'', "brown"), ''parita'' (''palita'', "grey"), and (''pingala'', "red"). Their chief festival was the celebration of the solstice (''vishuva'') which was common in most cultures in the ancient world. The Mitanni warriors were called ''marya'', the term for "warrior" in Sanskrit as well; note ''mišta-nnu'' (= ''miẓḍha'', ≈ Sanskrit ''mīḍha'') "payment (for catching a fugitive)" (M. Mayrhofer, ''Etymologisches Wörterbuch des Altindoarischen'', Heidelberg, 1986–2000; Vol. II:358). Sanskritic interpretations of Mitanni royal names render Artashumara (''artaššumara'') as ''Ṛtasmara'' "who thinks of Ṛta" (Mayrhofer II 780), Biridashva (''biridašṷa, biriiašṷ''a) as ''Prītāśva'' "whose horse is dear" (Mayrhofer II 182), Priyamazda (''priiamazda'') as ''Priyamedha'' "whose wisdom is dear" (Mayrhofer II 189, II378), Citrarata as ''Citraratha'' "whose chariot is shining" (Mayrhofer I 553), Indaruda/Endaruta as ''Indrota'' "helped by Indra" (Mayrhofer I 134), Shativaza (''šattiṷaza'') as ''Sātivāja'' "winning the race price" (Mayrhofer II 540, 696), Šubandhu as ''Subandhu'' "having good relatives" (a name in Palestine (region), Palestine, Mayrhofer II 209, 735), Tushratta (''tṷišeratta, tušratta'', etc.) as *tṷaiašaratha, Vedic Tvastar "whose chariot is vehement" (Mayrhofer, Etym. Wb., I 686, I 736).


Indian subcontinent

Dates indicate only a rough time frame. *
Proto-Indo-Aryan Proto-Indo-Aryan (sometimes Proto-Indic) is the reconstructed proto-language of the Indo-Aryan languages. It is intended to reconstruct the language of the Proto-Indo-Aryans. Being descended from Proto-Indo-Iranian (which in turn is descended fr ...
(before 1500 BCE, reconstructed) * Old Indo-Aryan (ca. 1500–300 BCE) ** early Old Indo-Aryan: includes
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally preser ...
(ca. 1500 to 500 BCE) ** late Old Indo-Aryan: Epic Sanskrit, Classical Sanskrit (ca. 200 CE to 1300 CE) ** Indo-Aryan superstrate in Mitanni, Mitanni Indo-Aryan (ca. 1400 BCE) * Middle Indo-Aryan languages, Middle Indo-Aryan or
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s (ca. 300 BCE to 1500 CE) ** early Buddhist texts (ca. 6th or 5th century BCE) ** early Middle Indo-Aryan: e.g. Ashokan Prakrits, Pali, Gandhari language, Gandhari, (ca. 300 BCE to 200 BCE) ** middle Middle Indo-Aryan: e.g. Dramatic Prakrits, Elu (ca. 200 BCE to 700 CE) ** late Middle Indo-Aryan: e.g. Abahattha (ca. 700 CE to 1500 CE) * Early Modern Indo-Aryan (Late Medieval India): e.g. early Dakhini and emergence of the Dehlavi dialect


Old Indo-Aryan

The earliest evidence of the group is from
Vedic Sanskrit Vedic Sanskrit was an ancient language of the Indo-Aryan subgroup of the Indo-European language family. It is attested in the Vedas and related literature compiled over the period of the mid- 2nd to mid-1st millennium BCE. It was orally preser ...
, that is used in the ancient preserved texts of the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
, the foundational canon of the Hindu synthesis known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age to the language of the Rigveda, but the only evidence of it is a few proper names and specialized loanwords. While Old Indo-Aryan is the earliest stage of the Indo-Aryan branch, from which all known languages of the later stages Middle and New Indo-Aryan are derived, some documented Middle Indo-Aryan variants cannot fully be derived from the documented form of Old Indo-Aryan (on which Vedic and Classical Sanskrit are based), but betray features that must go back to other undocumented variants/dialects of Old Indo-Aryan. From Vedic Sanskrit, "Sanskrit" (literally "put together", "perfected" or "elaborated") developed as the prestige language of culture, science and religion, as well as the court, theatre, etc. Sanskrit of the later Vedic texts is comparable to Classical Sanskrit, but is largely mutually unintelligible with Vedic Sanskrit.


Middle Indo-Aryan (Prakrits)

Outside the learned sphere of Sanskrit, vernacular dialects (
Prakrit The Prakrits (; sa, prākṛta; psu, 𑀧𑀸𑀉𑀤, ; pka, ) are a group of vernacular Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 3rd century BCE to the 8th century CE. The term Prakrit is usu ...
s) continued to evolve. The oldest attested Prakrits are the Buddhism, Buddhist and Jainism, Jain canonical languages Pali and Ardhamagadhi Prakrit, respectively. Inscriptions in Ashokan Prakrit were also part of this early Middle Indo-Aryan stage. By medieval times, the Prakrits had diversified into various
Middle Indo-Aryan languages The Middle Indo-Aryan languages (or Middle Indic languages, sometimes conflated with the Prakrits, which are a stage of Middle Indic) are a historical group of languages of the Indo-Aryan family. They are the descendants of Old Indo-Aryan (OIA; ...
. ''Apabhraṃśa'' is the conventional cover term for transitional dialects connecting late Middle Indo-Aryan with early Modern Indo-Aryan, spanning roughly the 6th to 13th centuries. Some of these dialects showed considerable literary production; the ''Śravakacāra'' of Devasena (dated to the 930s) is now considered to be the first Hindi book. The next major milestone occurred with the Muslim conquests in the Indian subcontinent in the 13th–16th centuries. Under the flourishing Turco-Mongol tradition, Turco-Mongol Mughal Empire, Persian language in the Indian subcontinent, Persian became very influential as the language of prestige of the Islamic courts due to adoption of the foreign language by the Mughal emperors. The two largest languages that formed from Apabhraṃśa were
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
and Hindustani language, Hindustani; others include Assamese, Sindhi,
Gujarati Gujarati may refer to: * something of, from, or related to Gujarat, a state of India * Gujarati people, the major ethnic group of Gujarat * Gujarati language, the Indo-Aryan language spoken by them * Gujarati languages, the Western Indo-Aryan sub- ...
,
Odia Odia, also spelled Oriya or Odiya, may refer to: * Odia people in Odisha, India * Odia language, an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family * Odia alphabet, a writing system used for the Odia languag ...
,
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
, and Punjabi.


New Indo-Aryan


= Medieval Hindustani

= In the Central Zone (Hindi), Central Zone Hindi-speaking areas, for a long time the prestige dialect was Braj Bhasha, but this was replaced in the 19th century by Dehlavi dialect, Dehlavi-based Hindustani language, Hindustani. Hindustani was strongly influenced by Persian language, Persian, with these and later Sanskrit influence leading to the emergence of Modern Standard Hindi and Modern Standard Urdu as register (sociolinguistics), registers of the Hindustani language. This state of affairs continued until the division of the British Indian Empire in 1947, when Hindi became the official language in India and Urdu became official in Pakistan. Despite the different script the fundamental grammar remains identical, the difference is more sociolinguistics, sociolinguistic than purely linguistic. Today it is widely understood/spoken as a second or third language throughout South Asia and one of the most widely known languages in the world in terms of number of speakers.


Outside the Indian subcontinent


Domari

Domari language, Domari is an Indo-Aryan language spoken by older Dom people scattered across the Middle East. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan.*Matras, Y. (2012). ''A grammar of Domari''. Berlin: De Gruyter Mouton (Mouton Grammar Library). Based on the systematicity of sound changes, linguists have concluded that the ethnonyms ''Domari'' and ''Romani people, Romani'' derive from the Indo-Aryan word ''ḍom''.


Lomavren

Lomavren is a nearly extinct mixed language, spoken by the Lom people, that arose from language contact between a language related to
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
and Domari language, Domari and the Armenian language.


Romani

The Romani language is usually included in the Western Indo-Aryan languages. Romani varieties, which are mainly spoken throughout Europe, are noted for their relatively conservative nature; maintaining the Middle Indo-Aryan present-tense person concord markers, alongside consonantal endings for nominal case. Indeed, these features are no longer evident in most other modern Central Indo-Aryan languages. Moreover, Romani shares an innovative pattern of past-tense person, which corresponds to Dardic languages, such as Kashmiri and Shina. This is believed to be further indication that proto-Romani speakers were originally situated in central regions of the subcontinent, before migrating to northwestern regions. However, there are no known historical sources regarding the development of the Romani language specifically within India. Research conducted by nineteenth-century scholars Pott (1845) and Miklosich (1882–1888) demonstrated that the Romani language is most aptly designated as a New Indo-Aryan language (NIA), as opposed to Middle Indo-Aryan (MIA); establishing that proto-Romani speakers could not have left India significantly earlier than AD 1000. The principal argument favouring a migration during or after the transition period to NIA is the loss of the old system of nominal case, coupled with its reduction to a two-way nominative-oblique case system. A secondary argument concerns the system of gender differentiation, due to the fact that Romani has only two genders (masculine and feminine). Middle Indo-Aryan languages (named MIA) generally employed three genders (masculine, feminine and neuter), and some modern Indo-Aryan languages retain this aspect today. It is suggested that loss of the neuter gender did not occur until the transition to NIA. During this process, most of the neuter nouns became masculine, while several became feminine. For example, the neuter ''aggi'' "fire" in Prakrit morphed into the feminine ''āg'' in Hindi, and ''jag'' in Romani. The parallels in grammatical gender evolution between Romani and other NIA languages have additionally been cited as indications that the forerunner of Romani remained on the Indian subcontinent until a later period, possibly as late as the tenth century.


Sindhic migrations

Kholosi, Jadgali, and Luwati represent offshoots of the Sindhic subfamily of Indo-Aryan that have established themselves in the Persian gulf region, perhaps through sea-based migrations. These are of a later origin than the Rom and Dom migrations which represent a different part of Indo-Aryan as well.


Indentured labourer migrations

The use by the British East India Company of indentured labourers led to the transplanting of Indo-Aryan languages around the world, leading to locally influenced lects that diverged from the source language, such as Fiji Hindi and Caribbean Hindustani.


Phonology


Consonants


Stop positions

The normative system of New Indo-Aryan stops consists of five places of articulation: Labial consonant, labial, Dental consonant, dental, "Retroflex consonant, retroflex", palatal consonant, palatal, and velar consonant, velar, which is the same as that of Sanskrit. The "retroflex" position may involve retroflexion, or curling the tongue to make the contact with the underside of the tip, or merely retraction. The point of contact may be alveolar consonant, alveolar or postalveolar, and the distinctive quality may arise more from the shaping than from the position of the tongue. Palatals stops have affricate consonant, affricated release and are traditionally included as involving a distinctive tongue position (blade in contact with hard palate). Widely transcribed as , claims to be a more accurate rendering. Moving away from the normative system, some languages and dialects have alveolar affricates instead of palatal, though some among them retain in certain positions: before front vowels (esp. ), before , or when geminated. Alveolar as an ''additional'' point of articulation occurs in
Marathi Marathi may refer to: *Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India *Marathi language, the Indo-Aryan language spoken by the Marathi people *Palaiosouda, also known as Marathi, a small island in Greece See also * * ...
and Konkani people, Konkani where dialect mixture and others factors upset the aforementioned complementation to produce minimal environments, in some West Pahari dialects through internal developments (, > ), and in
Kashmiri Kashmiri may refer to: * People or things related to the Kashmir Valley or the broader region of Kashmir * Kashmiris, an ethnic group native to the Kashmir Valley * Kashmiri language, their language People with the name * Kashmiri Saikia Baruah ...
. The addition of a Voiceless retroflex affricate, retroflex affricate to this in some Dardic languages maxes out the number of stop positions at seven (barring borrowed ), while a reduction to the inventory involves *ts > , which has happened in Assamese, Chittagonian language, Chittagonian, Sinhala (though there have been other sources of a secondary ), and Southern Mewari. Further reductions in the number of stop articulations are in Assamese and
Romani Romani may refer to: Ethnicities * Romani people, an ethnic group of Northern Indian origin, living dispersed in Europe, the Americas and Asia ** Romani genocide, under Nazi rule * Romani language, any of several Indo-Aryan languages of the Roma ...
, which have lost the characteristic dental/retroflex contrast, and in Chittagonian, which may lose its labial and velar articulations through spirantisation in many positions (> ). /q x ɣ f/ are restricted to Perso-Arabic loanwords in most IA languages but they occur natively in Khowar. According to Masica (1991) some dialects of Pashayi have a /θ/ which is unusual for IA languages. Domari which is spoken in the Middle East and had high contact with Middle Eastern languages has /q ħ ʕ ʔ/ and emphatic consonants from loanwords.


Nasals

Sanskrit was noted as having five nasal stop, nasal-stop articulations corresponding to its oral stops, and among modern languages and dialects Dogri, Kacchi, Kalasha, Rudhari, Shina, Saurashtri, and Sindhi have been analysed as having this full complement of phonemic nasals , with the last two generally as the result of the loss of the stop from a homorganic nasal + stop cluster ( > and > ), though there are other sources as well. In languages that lack phonemic nasals at some places of articulation, they can still occur allophonically from place assimilation in a nasal + stop culture, e.g. Hindi > .


Aspiration and breathy-voice

Most Indo-Aryan languages have contrastive Aspirated consonant, aspiration (), and some retain historical breathy voice on voiced consonants (). Sometimes both phenomena are analysed as a single aspiration contrast. The places and manners of articulation which allow contrastive aspiration vary by language; e.g. Sindhi permits phonemic , but the phonemic status of this sound in Hindi is uncertain, and many "Dardic" languages lack aspirated retroflex sibilants despite having unaspirated equivalents. In languages that have lost breathy-voice, the contrast has often been replaced with tone.


Regional developments

Some of these are mentioned in . * Implosive consonant, Implosives: Languages in the Sindhi languages, Sindhic subfamily, as well as Saraiki, western Marwari dialects, and some dialects of Gujarati have developed implosive consonants from historical intervocalic geminates and word-initial stops. Sindhi has a full implosive series except for the dental implosive: . It has been claimed that Wadiyari Koli has the dental implosive too. Other languages have less complete implosive series, e.g. Kacchi has just . * Prenasalized stops: Sinhala and Maldivian (Dhivehi) have a series of prenasalized stops covering all places except for palatal: . * Palatalization (phonetics), Palatalization: Kashmiri (natively) and some Romani dialects (from contact with Slavic languages) have contrastive palatalisation. * ɬ, Voiceless lateral In Gawarbati, some Pashai dialects, partly Bashkarik and some Shina dialects have /ɬ/ from clusters of tr kr or sometimes pr; dr gr and br merged with /l/ in these languages. * Lateral affricates: Bhadarwahi has an unusual series of lateral retroflex affricates ( derived from historical clusters.


Vowels

Vowel typologies are varied across Indo-Aryan due to diachronic mergers and (in some cases) splits, as well as different accounts by linguists for even the widely-spoken languages. Vowel systems per are listed below. Many languages also have phonemic nasal vowels. Sylheti language being a Tone (linguistics), tonal, still classified as the Indo-Aryan language. The vowels of Sylheti language listed below.


Charts

The following are consonant systems of major and representative New Indo-Aryan languages, mostly following , though here they are in International Phonetic Alphabet, IPA. Parentheses indicate those consonants found only in loanwords: square brackets indicate those with "very low functional load". The arrangement is roughly geographical.


Sociolinguistics


Register

In many Indo-Aryan languages, the literary register is often more archaic and utilises a different lexicon (Sanskrit or Perso-Arabic) than spoken vernacular. One example is Bengali's high literary form, Sadhu bhasha, Sādhū bhāśā as opposed to the more modern Calita bhasa, Calita bhāśā (Cholito-bhasha). This distinction approaches diglossia.


Language and dialect

In the context of South Asia, the choice between the appellations Language or dialect, "language" and "dialect" is a difficult one, and any distinction made using these terms is obscured by their ambiguity. In one general colloquial sense, a language is a "developed" dialect: one that is standardised, has a written tradition and enjoys Prestige (sociolinguistics), social prestige. As there are degrees of development, the boundary between a language and a dialect thus defined is not clear-cut, and there is a large middle ground where assignment is contestable. There is a second meaning of these terms, in which the distinction is drawn on the basis of linguistic similarity. Though seemingly a "proper" linguistics sense of the terms, it is still problematic: methods that have been proposed for quantifying difference (for example, based on mutual intelligibility) have not been seriously applied in practice; and any relationship established in this framework is relative.


See also

* Indo-Aryans * Iranic languages * Indo-Aryan migration * Proto-Vedic Continuity * The family of Brahmic family, Brahmic scripts * Linguistic history of India * Indo-Aryan loanwords in Tamil * Languages of Bangladesh * Languages of India * Maldives#Languages, Languages of Maldives * Languages of Nepal * Languages of Pakistan * Languages of Sri Lanka * Languages of South Asia


Notes


References


Further reading

* John Beames, ''A comparative grammar of the modern Aryan languages of India: to wit, Hindi, Panjabi, Sindhi, Gujarati, Marathi, Oriya, and Bangali''. Londinii: Trübner, 1872–1879. 3 vols. *Morgenstierne, Georg. "Early Iranic Influence upon Indo-Aryan." Acta Iranica, I. série, Commemoration Cyrus. Vol. I. Hommage universel (1974): 271-279. * . * Madhav Deshpande (1979). ''Sociolinguistic attitudes in India: An historical reconstruction''. Ann Arbor: Karoma Publishers. , (pbk). * Byomkes Chakrabarti, Chakrabarti, Byomkes (1994). ''A comparative study of Santali and Bengali''. Calcutta: K.P. Bagchi & Co. * Erdosy, George. (1995). ''The Indo-Aryans of ancient South Asia: Language, material culture and ethnicity''. Berlin: Walter de Gruyter. .
Ernst Kausen, 2006. ''Die Klassifikation der indogermanischen Sprachen''
(Microsoft Word, 133 KB) * Kobayashi, Masato.; & George Cardona (2004). ''Historical phonology of old Indo-Aryan consonants''. Tokyo: Research Institute for Languages and Cultures of Asia and Africa, Tokyo University of Foreign Studies. . * . * Misra, Satya Swarup. (1980). ''Fresh light on Indo-European classification and chronology''. Varanasi: Ashutosh Prakashan Sansthan. * Misra, Satya Swarup. (1991–1993). ''The Old-Indo-Aryan, a historical & comparative grammar'' (Vols. 1–2). Varanasi: Ashutosh Prakashan Sansthan. * Sen, Sukumar. (1995). ''Syntactic studies of Indo-Aryan languages''. Tokyo: Institute for the Study of Languages and Foreign Cultures of Asia and Africa, Tokyo University of Foreign Studies. * Vacek, Jaroslav. (1976). ''The sibilants in Old Indo-Aryan: A contribution to the history of a linguistic area''. Prague: Charles University.


External links


The Indo-Aryan languages
25 October 2009
The Indo-Aryan languages
Colin P.Masica
Survey of the syntax of the modern Indo-Aryan languages
(Rajesh Bhatt), 7 February 2003. {{DEFAULTSORT:Indo-Aryan Languages Indo-European languages Indo-Aryan languages,