HOME

TheInfoList



OR:

The Austroasiatic languages , , are a large
language family A language family is a group of languages related through descent from a common ''ancestral language'' or ''parental language'', called the proto-language of that family. The term "family" reflects the tree model of language origination in hist ...
in
Mainland Southeast Asia Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
and
South Asia South Asia is the southern subregion of Asia, which is defined in both geographical and ethno-cultural terms. The region consists of the countries of Afghanistan, Bangladesh, Bhutan, India, Maldives, Nepal, Pakistan, and Sri Lanka.;;;;;;;; ...
. These languages are scattered throughout parts of Thailand, Laos, India, Myanmar, Malaysia, Bangladesh, Nepal, and southern China and are the majority languages of Vietnam and Cambodia. There are around 117 million speakers of Austroasiatic languages. Of these languages, only
Vietnamese Vietnamese may refer to: * Something of, from, or related to Vietnam, a country in Southeast Asia ** A citizen of Vietnam. See Demographics of Vietnam. * Vietnamese people, or Kinh people, a Southeast Asian ethnic group native to Vietnam ** Overse ...
, Khmer, and
Mon Mon, MON or Mon. may refer to: Places * Mon State, a subdivision of Myanmar * Mon, India, a town in Nagaland * Mon district, Nagaland * Mon, Raebareli, a village in Uttar Pradesh, India * Mon, Switzerland, a village in the Canton of Grisons * An ...
have a long-established recorded history. Only two have official status as modern
national language A national language is a language (or language variant, e.g. dialect) that has some connection—de facto or de jure—with a nation. There is little consistency in the use of this term. One or more languages spoken as first languages in the te ...
s: Vietnamese in
Vietnam Vietnam or Viet Nam ( vi, Việt Nam, ), officially the Socialist Republic of Vietnam,., group="n" is a country in Southeast Asia, at the eastern edge of mainland Southeast Asia, with an area of and population of 96 million, making i ...
and Khmer in Cambodia. The Mon language is a recognized indigenous language in Myanmar and Thailand. In Myanmar, the
Wa language Wa (Va) is an Austroasiatic language spoken by the Wa people of Myanmar and China. There are three distinct varieties, sometimes considered separate languages; their names in ''Ethnologue'' are Parauk, the majority and standard form; Vo (Zhenka ...
is the de facto official language of
Wa State Wa State, my, ဝပြည်နယ် is an autonomous self-governing polity in Myanmar (Burma). It is ''de facto'' independent from the rest of the country and has its own political system, administrative divisions and army.29 December 2 ...
. Santali is one of the 22
scheduled languages of India There is no national language in India. However, article 343(1) of the Indian constitution specifically mentions that, "The official language of the Union shall be Hindi in Devanagari script. The form of numerals to be used for the official pur ...
. The rest of the languages are spoken by minority groups and have no official status. ''
Ethnologue ''Ethnologue: Languages of the World'' (stylized as ''Ethnoloɠue'') is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensiv ...
'' identifies 168 Austroasiatic languages. These form thirteen established families (plus perhaps
Shompen The Shompen or Shom Pen are the indigenous people of the interior of Great Nicobar Island, part of the Indian union territory of Andaman and Nicobar Islands. The Shompen are a designated Scheduled Castes and Scheduled Tribes, Scheduled Tribe. ...
, which is poorly attested, as a fourteenth), which have traditionally been grouped into two, as Mon–Khmer, and Munda. However, one recent classification posits three groups (Munda, Mon-Khmer, and Khasi–Khmuic), while another has abandoned Mon–Khmer as a taxon altogether, making it synonymous with the larger family. Austroasiatic languages have a disjunct distribution across Southeast Asia and parts of India, Bangladesh, Nepal and East Asia, separated by regions where other languages are spoken. They appear to be the extant original languages of Mainland Southeast Asia (excluding the
Andaman Islands The Andaman Islands () are an archipelago in the northeastern Indian Ocean about southwest off the coasts of Myanmar's Ayeyarwady Region. Together with the Nicobar Islands to their south, the Andamans serve as a maritime boundary between th ...
), with the neighboring, and sometimes surrounding, Kra–Dai, Hmong-Mien, Austronesian, and
Sino-Tibetan languages Sino-Tibetan, also cited as Trans-Himalayan in a few sources, is a family of more than 400 languages, second only to Indo-European in number of native speakers. The vast majority of these are the 1.3 billion native speakers of Chinese languages. ...
being the result of later migrations.


Etymology

The name ''Austroasiatic'' comes from a combination of the
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
words for "South" and "Asia", hence "South Asia".


Typology

Regarding word structure, Austroasiatic languages are well known for having an iambic "sesquisyllabic" pattern, with basic nouns and verbs consisting of an initial, unstressed, reduced
minor syllable Primarily in Austroasiatic languages (also known as Mon–Khmer), in a typical word a minor syllable is a reduced (minor) syllable followed by a full tonic or stressed syllable. The minor syllable may be of the form or , with a reduced vowel, as i ...
followed by a stressed, full syllable. This reduction of presyllables has led to a variety among modern languages of phonological shapes of the same original Proto-Austroasiatic prefixes, such as the causative prefix, ranging from CVC syllables to consonant clusters to single consonants. As for word formation, most Austroasiatic languages have a variety of derivational prefixes, many have
infix An infix is an affix inserted inside a word stem (an existing word or the core of a family of words). It contrasts with ''adfix,'' a rare term for an affix attached to the outside of a stem, such as a prefix or suffix. When marking text for int ...
es, but suffixes are almost completely non-existent in most branches except Munda, and a few specialized exceptions in other Austroasiatic branches. The Austroasiatic languages are further characterized as having unusually large vowel inventories and employing some sort of
register Register or registration may refer to: Arts entertainment, and media Music * Register (music), the relative "height" or range of a note, melody, part, instrument, etc. * ''Register'', a 2017 album by Travis Miller * Registration (organ), the ...
contrast, either between modal (normal) voice and breathy (lax) voice or between modal voice and
creaky voice In linguistics, creaky voice (sometimes called laryngealisation, pulse phonation, vocal fry, or glottal fry) refers to a low, scratchy sound that occupies the vocal range below the common vocal register. It is a special kind of phonation in which ...
. Languages in the Pearic branch and some in the Vietic branch can have a three- or even four-way voicing contrast. However, some Austroasiatic languages have lost the register contrast by evolving more diphthongs or in a few cases, such as Vietnamese,
tonogenesis Tone is the use of pitch in language to distinguish lexical or grammatical meaning – that is, to distinguish or to inflect words. All verbal languages use pitch to express emotional and other paralinguistic information and to convey emph ...
. Vietnamese has been so heavily influenced by Chinese that its original Austroasiatic phonological quality is obscured and now resembles that of South Chinese languages, whereas Khmer, which had more influence from Sanskrit, has retained a more typically Austroasiatic structure.


Proto-language

Much work has been done on the reconstruction of Proto-Mon–Khmer in
Harry L. Shorto Harry Leonard Shorto (19 September 1919 – 30 July 1995) was a British philologist and linguist who specialized on the Mon language and Mon-Khmer studies. He authored both a modern Mon dictionary and a dictionary of Mon epigraphy. He worked f ...
's ''Mon–Khmer Comparative Dictionary''. Little work has been done on the
Munda languages The Munda languages are a group of closely related languages spoken by about nine million people in India and Bangladesh. Historically, they have been called the Kolarian languages. They constitute a branch of the Austroasiatic language famil ...
, which are not well documented. With their demotion from a primary branch, Proto-Mon–Khmer becomes synonymous with Proto-Austroasiatic. Paul Sidwell (2005) reconstructs the consonant inventory of Proto-Mon–Khmer as follows: This is identical to earlier reconstructions except for . is better preserved in the
Katuic languages The fifteen Katuic languages form a branch of the Austroasiatic languages spoken by about 1.3 million people in Southeast Asia. People who speak Katuic languages are called the Katuic peoples. Paul Sidwell is the leading specialist on the Katuic ...
, which Sidwell has specialized in.


Internal classification

Linguists traditionally recognize two primary divisions of Austroasiatic: the
Mon–Khmer languages The Austroasiatic languages , , are a large language family in Mainland Southeast Asia and South Asia. These languages are scattered throughout parts of Thailand, Laos, India, Myanmar, Malaysia, Bangladesh, Nepal, and southern China and are t ...
of
Southeast Asia Southeast Asia, also spelled South East Asia and South-East Asia, and also known as Southeastern Asia, South-eastern Asia or SEA, is the geographical United Nations geoscheme for Asia#South-eastern Asia, south-eastern region of Asia, consistin ...
,
Northeast India , native_name_lang = mni , settlement_type = , image_skyline = , image_alt = , image_caption = , motto = , image_map = Northeast india.png , ...
and the
Nicobar Islands The Nicobar Islands are an archipelagic island chain in the eastern Indian Ocean. They are located in Southeast Asia, northwest of Aceh on Sumatra, and separated from Thailand to the east by the Andaman Sea. Located southeast of the Indian s ...
, and the
Munda languages The Munda languages are a group of closely related languages spoken by about nine million people in India and Bangladesh. Historically, they have been called the Kolarian languages. They constitute a branch of the Austroasiatic language famil ...
of
East East or Orient is one of the four cardinal directions or points of the compass. It is the opposite direction from west and is the direction from which the Sun rises on the Earth. Etymology As in other languages, the word is formed from the fa ...
and
Central India Central India is a loosely defined geographical region of India. There is no clear official definition and various ones may be used. One common definition consists of the states of Chhattisgarh and Madhya Pradesh, which are included in alm ...
and parts of
Bangladesh Bangladesh (}, ), officially the People's Republic of Bangladesh, is a country in South Asia. It is the eighth-most populous country in the world, with a population exceeding 165 million people in an area of . Bangladesh is among the mos ...
, parts of
Nepal Nepal (; ne, नेपाल ), formerly the Federal Democratic Republic of Nepal ( ne, सङ्घीय लोकतान्त्रिक गणतन्त्र नेपाल ), is a landlocked country in South Asia. It is mai ...
. However, no evidence for this classification has ever been published. Each of the families that is written in boldface type below is accepted as a valid clade. By contrast, the relationships ''between'' these families within Austroasiatic are debated. In addition to the traditional classification, two recent proposals are given, neither of which accepts traditional "Mon–Khmer" as a valid unit. However, little of the data used for competing classifications has ever been published, and therefore cannot be evaluated by peer review. In addition, there are suggestions that additional branches of Austroasiatic might be preserved in substrata of Acehnese in Sumatra (Diffloth), the
Chamic languages The Chamic languages, also known as Aceh–Chamic and Achinese–Chamic, are a group of ten languages spoken in Aceh (Sumatra, Indonesia) and in parts of Cambodia, Thailand, Vietnam and Hainan, China. The Chamic languages are a subgroup of Mala ...
of Vietnam, and the
Land Dayak languages The Land Dayak languages are a group of dozen or so languages spoken by the Bidayuh Land Dayaks of Borneo. Languages ''Glottolog'' ''Glottolog'' classifies the Land Dayak languages as follows. *Benyadu-Bekati: Bekati (Bekatiq), Sara, Lara (Ra ...
of Borneo (Adelaar 1995).


Diffloth (1974)

Diffloth's widely cited original classification, now abandoned by Diffloth himself, is used in ''
Encyclopædia Britannica The (Latin for "British Encyclopædia") is a general knowledge English-language encyclopaedia. It is published by Encyclopædia Britannica, Inc.; the company has existed since the 18th century, although it has changed ownership various time ...
'' and—except for the breakup of Southern Mon–Khmer—in ''Ethnologue.'' * Munda ** North Munda *** Korku *** Kherwarian ** South Munda *** Kharia–Juang *** Koraput Munda * Mon–Khmer ** Eastern Mon–Khmer *** Khmer (Cambodian) *** Pearic *** Bahnaric *** Katuic ***
Vietic The Vietic languages are a branch of the Austroasiatic language family, spoken by the Vietic peoples in Laos and Vietnam. The branch was once referred to by the terms ''Việt–Mường'', ''Annamese–Muong'', and ''Vietnamuong''; the term '' ...
(Vietnamese, Muong) ** Northern Mon–Khmer ***
Khasi Khasi may refer to: * Khasi people, an ethnic group of Meghalaya, India * Khasi language, a major Austroasiatic language spoken in Meghalaya, India * Khāṣi language, an Indo-Aryan language of Jammu and Kashmir, India See also * Khasi Hills * Gh ...
(
Meghalaya Meghalaya (, or , meaning "abode of clouds"; from Sanskrit , "cloud" + , "abode") is a states and union territories of India, state in northeastern India. Meghalaya was formed on 21 January 1972 by carving out two districts from the state of As ...
, India) ***
Palaungic The nearly thirty Palaungic or Palaung–Wa languages form a branch of the Austroasiatic languages. Phonological developments Most of the Palaungic languages lost the contrastive voicing of the ancestral Austroasiatic consonants, with the disti ...
***
Khmuic The Khmuic languages are a branch of the Austroasiatic languages spoken mostly in northern Laos, as well as in neighboring northern Vietnam and southern Yunnan, China. Khmu language, Khmu is the only widely spoken language in the group. Homelan ...
** Southern Mon–Khmer ***
Mon Mon, MON or Mon. may refer to: Places * Mon State, a subdivision of Myanmar * Mon, India, a town in Nagaland * Mon district, Nagaland * Mon, Raebareli, a village in Uttar Pradesh, India * Mon, Switzerland, a village in the Canton of Grisons * An ...
***
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
(
Malaya Malaya refers to a number of historical and current political entities related to what is currently Peninsular Malaysia in Southeast Asia: Political entities * British Malaya (1826–1957), a loose collection of the British colony of the Straits ...
) *** Nicobarese (
Nicobar Islands The Nicobar Islands are an archipelagic island chain in the eastern Indian Ocean. They are located in Southeast Asia, northwest of Aceh on Sumatra, and separated from Thailand to the east by the Andaman Sea. Located southeast of the Indian s ...
)


Peiros (2004)

Peiros is a
lexicostatistic Lexicostatistics is a method of comparative linguistics that involves comparing the percentage of lexical cognates between languages to determine their relationship. Lexicostatistics is related to the comparative method but does not reconstruct a ...
classification, based on percentages of shared vocabulary. This means that languages can appear to be more distantly related than they actually are due to
language contact Language contact occurs when speakers of two or more languages or varieties interact and influence each other. The study of language contact is called contact linguistics. When speakers of different languages interact closely, it is typical for th ...
. Indeed, when Sidwell (2009) replicated Peiros's study with languages known well enough to account for loans, he did not find the internal (branching) structure below. * Nicobarese * Munda–Khmer ** Munda ** Mon–Khmer ***
Khasi Khasi may refer to: * Khasi people, an ethnic group of Meghalaya, India * Khasi language, a major Austroasiatic language spoken in Meghalaya, India * Khāṣi language, an Indo-Aryan language of Jammu and Kashmir, India See also * Khasi Hills * Gh ...
*** Nuclear Mon–Khmer **** Mangic ( Mang + Palyu) (perhaps in Northern MK) ****
Vietic The Vietic languages are a branch of the Austroasiatic language family, spoken by the Vietic peoples in Laos and Vietnam. The branch was once referred to by the terms ''Việt–Mường'', ''Annamese–Muong'', and ''Vietnamuong''; the term '' ...
(perhaps in Northern MK) **** Northern Mon–Khmer *****
Palaungic The nearly thirty Palaungic or Palaung–Wa languages form a branch of the Austroasiatic languages. Phonological developments Most of the Palaungic languages lost the contrastive voicing of the ancestral Austroasiatic consonants, with the disti ...
*****
Khmuic The Khmuic languages are a branch of the Austroasiatic languages spoken mostly in northern Laos, as well as in neighboring northern Vietnam and southern Yunnan, China. Khmu language, Khmu is the only widely spoken language in the group. Homelan ...
**** Central Mon–Khmer ***** Khmer dialects ***** Pearic ***** Asli-Bahnaric ******
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
****** Mon–Bahnaric ******* Monic ******* Katu–Bahnaric ******** Katuic ******** Bahnaric


Diffloth (2005)

Diffloth compares reconstructions of various clades, and attempts to classify them based on shared innovations, though like other classifications the evidence has not been published. As a schematic, we have: Or in more detail, *
Munda languages The Munda languages are a group of closely related languages spoken by about nine million people in India and Bangladesh. Historically, they have been called the Kolarian languages. They constitute a branch of the Austroasiatic language famil ...
(India) :* Koraput: 7 languages :*Core Munda languages ::* Kharian–Juang: 2 languages ::*North Munda languages ::: '' Korku'' ::: Kherwarian: 12 languages * Khasi–Khmuic languages (Northern Mon–Khmer) :* Khasian: 3 languages of north eastern India and adjacent region of Bangladesh :*Palaungo-Khmuic languages ::*
Khmuic The Khmuic languages are a branch of the Austroasiatic languages spoken mostly in northern Laos, as well as in neighboring northern Vietnam and southern Yunnan, China. Khmu language, Khmu is the only widely spoken language in the group. Homelan ...
: 13 languages of Laos and Thailand ::*Palaungo-Pakanic languages ::: Pakanic or Palyu: 4 or 5 languages of southern China and Vietnam :::
Palaungic The nearly thirty Palaungic or Palaung–Wa languages form a branch of the Austroasiatic languages. Phonological developments Most of the Palaungic languages lost the contrastive voicing of the ancestral Austroasiatic consonants, with the disti ...
: 21 languages of Burma, southern China, and Thailand * Nuclear Mon–Khmer languages :* Khmero-Vietic languages (Eastern Mon–Khmer) ::* Vieto-Katuic languages ?Sidwell (2005) casts doubt on Diffloth's Vieto-Katuic hypothesis, saying that the evidence is ambiguous, and that it is not clear where Katuic belongs in the family. :::
Vietic The Vietic languages are a branch of the Austroasiatic language family, spoken by the Vietic peoples in Laos and Vietnam. The branch was once referred to by the terms ''Việt–Mường'', ''Annamese–Muong'', and ''Vietnamuong''; the term '' ...
: 10 languages of Vietnam and Laos, including Muong and
Vietnamese Vietnamese may refer to: * Something of, from, or related to Vietnam, a country in Southeast Asia ** A citizen of Vietnam. See Demographics of Vietnam. * Vietnamese people, or Kinh people, a Southeast Asian ethnic group native to Vietnam ** Overse ...
, which has the most speakers of any Austroasiatic language. ::: Katuic: 19 languages of Laos, Vietnam, and Thailand. ::* Khmero-Bahnaric languages :::* Bahnaric: 40 languages of Vietnam, Laos, and Cambodia. :::*Khmeric languages :::: The Khmer dialects of Cambodia, Thailand, and Vietnam. :::: Pearic: 6 languages of Cambodia. :* Nico-Monic languages (Southern Mon–Khmer) ::* Nicobarese: 6 languages of the
Nicobar Islands The Nicobar Islands are an archipelagic island chain in the eastern Indian Ocean. They are located in Southeast Asia, northwest of Aceh on Sumatra, and separated from Thailand to the east by the Andaman Sea. Located southeast of the Indian s ...
, a territory of India. ::* Asli-Monic languages :::
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
: 19 languages of peninsular Malaysia and Thailand. ::: Monic: 2 languages, the
Mon language The Mon language (, mnw, ဘာသာမန်, links=no, (Mon-Thai ဘာသာမည်) ; my, မွန်ဘာသာ; th, ภาษามอญ; formerly known as Peguan and Talaing) is an Austroasiatic language spoken by the Mon peopl ...
of Burma and the Nyahkur language of Thailand.


Sidwell (2009–2015)

Paul Sidwell Paul James Sidwell is an Australian linguist based in Canberra, Australia who has held research and lecturing positions at the Australian National University. Sidwell, who is also an expert and consultant in forensic linguistics, is most notable ...
(2009), in a
lexicostatistical Lexicostatistics is a method of comparative linguistics that involves comparing the percentage of lexical cognates between languages to determine their relationship. Lexicostatistics is related to the comparative method but does not reconstruct a p ...
comparison of 36 languages which are well known enough to exclude loanwords, finds little evidence for internal branching, though he did find an area of increased contact between the Bahnaric and Katuic languages, such that languages of all branches apart from the geographically distant Munda and Nicobarese show greater similarity to Bahnaric and Katuic the closer they are to those branches, without any noticeable innovations common to Bahnaric and Katuic. He therefore takes the conservative view that the thirteen branches of Austroasiatic should be treated as equidistant on current evidence. Sidwell & Blench (2011) discuss this proposal in more detail, and note that there is good evidence for a Khasi–Palaungic node, which could also possibly be closely related to Khmuic.Sidwell, Paul, and Roger Blench. 2011.
The Austroasiatic Urheimat: the Southeastern Riverine Hypothesis
" Enfield, NJ (ed.) ''Dynamics of Human Diversity'', 317–345. Canberra: Pacific Linguistics.
If this would the case, Sidwell & Blench suggest that Khasic may have been an early offshoot of Palaungic that had spread westward. Sidwell & Blench (2011) suggest
Shompen The Shompen or Shom Pen are the indigenous people of the interior of Great Nicobar Island, part of the Indian union territory of Andaman and Nicobar Islands. The Shompen are a designated Scheduled Castes and Scheduled Tribes, Scheduled Tribe. ...
as an additional branch, and believe that a Vieto-Katuic connection is worth investigating. In general, however, the family is thought to have diversified too quickly for a deeply nested structure to have developed, since Proto-Austroasiatic speakers are believed by Sidwell to have radiated out from the central
Mekong The Mekong or Mekong River is a trans-boundary river in East Asia and Southeast Asia. It is the world's List of rivers by length, twelfth longest river and List of longest rivers of Asia, the third longest in Asia. Its estimated length is , ...
river valley relatively quickly. Subsequently, Sidwell (2015a: 179) proposed that Nicobarese subgroups with
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
, just as how Khasian and Palaungic subgroup with each other. A subsequent computational phylogenetic analysis (Sidwell 2015b) suggests that Austroasiatic branches may have a loosely nested structure rather than a completely rake-like structure, with an east–west division (consisting of Munda, Khasic, Palaungic, and Khmuic forming a western group as opposed to all of the other branches) occurring possibly as early as 7,000 years before present. However, he still considers the subbranching dubious. Integrating computational phylogenetic linguistics with recent archaeological findings, Paul Sidwell (2015c)Sidwell, Paul. 2015c. ''Phylogeny, innovations, and correlations in the prehistory of Austroasiatic''. Paper presented at the workshop ''Integrating inferences about our past: new findings and current issues in the peopling of the Pacific and South East Asia'', 22–23 June 2015, Max Planck Institute for the Science of Human History, Jena, Germany. further expanded his Mekong riverine hypothesis by proposing that Austroasiatic had ultimately expanded into
Indochina Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
from the
Lingnan Lingnan (; Vietnamese: Lĩnh Nam) is a geographic area referring to the lands in the south of the Nanling Mountains. The region covers the modern Chinese subdivisions of Guangdong, Guangxi, Hainan, Hong Kong, and Macau, as well as modern northe ...
area of
southern China South China () is a geographical and cultural region that covers the southernmost part of China. Its precise meaning varies with context. A notable feature of South China in comparison to the rest of China is that most of its citizens are not n ...
, with the subsequent Mekong riverine dispersal taking place after the initial arrival of Neolithic farmers from southern China. Sidwell (2015c) tentatively suggests that Austroasiatic may have begun to split up 5,000 years B.P. during the
Neolithic transition The Neolithic Revolution, or the (First) Agricultural Revolution, was the wide-scale transition of many human cultures during the Neolithic period from a lifestyle of hunting and gathering to one of agriculture and settlement, making an inc ...
era of
mainland Southeast Asia Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
, with all the major branches of Austroasiatic formed by 4,000 B.P. Austroasiatic would have had two possible dispersal routes from the western periphery of the
Pearl River The Pearl River, also known by its Chinese name Zhujiang or Zhu Jiang in Mandarin pinyin or Chu Kiang and formerly often known as the , is an extensive river system in southern China. The name "Pearl River" is also often used as a catch-all ...
watershed of
Lingnan Lingnan (; Vietnamese: Lĩnh Nam) is a geographic area referring to the lands in the south of the Nanling Mountains. The region covers the modern Chinese subdivisions of Guangdong, Guangxi, Hainan, Hong Kong, and Macau, as well as modern northe ...
, which would have been either a coastal route down the coast of Vietnam, or downstream through the
Mekong River The Mekong or Mekong River is a trans-boundary river in East Asia and Southeast Asia. It is the world's twelfth longest river and the third longest in Asia. Its estimated length is , and it drains an area of , discharging of water annuall ...
via
Yunnan Yunnan , () is a landlocked Provinces of China, province in Southwest China, the southwest of the People's Republic of China. The province spans approximately and has a population of 48.3 million (as of 2018). The capital of the province is ...
. Both the reconstructed lexicon of Proto-Austroasiatic and the archaeological record clearly show that early Austroasiatic speakers around 4,000 B.P. cultivated rice and
millet Millets () are a highly varied group of small-seeded grasses, widely grown around the world as cereal crops or grains for fodder and human food. Most species generally referred to as millets belong to the tribe Paniceae, but some millets al ...
, kept livestock such as dogs, pigs, and chickens, and thrived mostly in estuarine rather than coastal environments. At 4,500 B.P., this "Neolithic package" suddenly arrived in Indochina from the Lingnan area without cereal grains and displaced the earlier pre-Neolithic hunter-gatherer cultures, with grain husks found in northern Indochina by 4,100 B.P. and in southern Indochina by 3,800 B.P. However, Sidwell (2015c) found that iron is not reconstructable in Proto-Austroasiatic, since each Austroasiatic branch has different terms for iron that had been borrowed relatively lately from Tai, Chinese, Tibetan, Malay, and other languages. During the
Iron Age The Iron Age is the final epoch of the three-age division of the prehistory and protohistory of humanity. It was preceded by the Stone Age (Paleolithic, Mesolithic, Neolithic) and the Bronze Age (Chalcolithic). The concept has been mostly appl ...
about 2,500 B.P., relatively young Austroasiatic branches in Indochina such as
Vietic The Vietic languages are a branch of the Austroasiatic language family, spoken by the Vietic peoples in Laos and Vietnam. The branch was once referred to by the terms ''Việt–Mường'', ''Annamese–Muong'', and ''Vietnamuong''; the term '' ...
, Katuic, Pearic, and Khmer were formed, while the more internally diverse Bahnaric branch (dating to about 3,000 B.P.) underwent more extensive internal diversification. By the Iron Age, all of the Austroasiatic branches were more or less in their present-day locations, with most of the diversification within Austroasiatic taking place during the Iron Age. Paul Sidwell (2018) considers the Austroasiatic language family to have rapidly diversified around 4,000 years B.P. during the arrival of rice agriculture in Indochina, but notes that the origin of Proto-Austroasiatic itself is older than that date. The lexicon of Proto-Austroasiatic can be divided into an early and late stratum. The early stratum consists of basic lexicon including body parts, animal names, natural features, and pronouns, while the names of cultural items (agriculture terms and words for cultural artifacts, which are reconstructible in Proto-Austroasiatic) form part of the later stratum.
Roger Blench Roger Marsh Blench (born August 1, 1953) is a British linguist, ethnomusicologist and development anthropologist. He has an M.A. and a Ph.D. from the University of Cambridge and is based in Cambridge, England. He researches, publishes, and works ...
(2017)Blench, Roger. 2017.
Waterworld: lexical evidence for aquatic subsistence strategies in Austroasiatic
'. Presented at ICAAL 7, Kiel, Germany.
suggests that vocabulary related to aquatic subsistence strategies (such as boats, waterways, river fauna, and fish capture techniques) can be reconstructed for Proto-Austroasiatic. Blench (2017) finds widespread Austroasiatic roots for 'river, valley', 'boat', 'fish', 'catfish sp.', 'eel', 'prawn', 'shrimp' (Central Austroasiatic), 'crab', 'tortoise', 'turtle', 'otter', 'crocodile', 'heron, fishing bird', and 'fish trap'. Archaeological evidence for the presence of agriculture in northern
Indochina Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
(northern Vietnam, Laos, and other nearby areas) dates back to only about 4,000 years ago (2,000 BC), with agriculture ultimately being introduced from further up to the north in the Yangtze valley where it has been dated to 6,000 B.P. Sidwell (2022)Sidwell, Paul. 2021
''Austroasiatic Dispersal: the AA "Water-World" Extended''SEALS 2021

Video)
/ref> proposes that the locus of Proto-Austroasiatic was in the
Red River Delta The Red River Delta or Hong River Delta ( vi, Châu thổ sông Hồng) is the flat low-lying plain formed by the Red River and its distributaries merging with the Thái Bình River in northern Vietnam. ''Hồng'' (紅) is a Sino-Vietnamese wor ...
area about 4,000-4,500 years before present, instead of the Middle Mekong as he had previously proposed. Austroasiatic dispersed coastal maritime routes and also upstream through river valleys. Khmuic, Palaungic, and Khasic resulted from a westward dispersal that ultimately came from the Red Valley valley. Based on their current distributions, about half of all Austroasiatic branches (including Nicobaric and Munda) can be traced to coastal maritime dispersals. Hence, this points to a relatively late riverine dispersal of Austroasiatic as compared to
Sino-Tibetan Sino-Tibetan, also cited as Trans-Himalayan in a few sources, is a family of more than 400 languages, second only to Indo-European in number of native speakers. The vast majority of these are the 1.3 billion native speakers of Chinese languages. ...
, whose speakers had a distinct non-riverine culture. In addition to living an aquatic-based lifestyle, early Austroasiatic speakers would have also had access to livestock, crops, and newer types of watercraft. As early Austroasiatic speakers dispersed rapidly via waterways, they would have encountered speakers of older language families who were already settled in the area, such as Sino-Tibetan.


Sidwell (2018)

Sidwell (2018) (quoted in Sidwell 2021) gives a more nested classification of Austroasiatic branches as suggested by his computational phylogenetic analysis of Austroasiatic languages using a 200-word list. Many of the tentative groupings are likely linkages. Pakanic and
Shompen The Shompen or Shom Pen are the indigenous people of the interior of Great Nicobar Island, part of the Indian union territory of Andaman and Nicobar Islands. The Shompen are a designated Scheduled Castes and Scheduled Tribes, Scheduled Tribe. ...
were not included.


Possible extinct branches

Roger Blench Roger Marsh Blench (born August 1, 1953) is a British linguist, ethnomusicologist and development anthropologist. He has an M.A. and a Ph.D. from the University of Cambridge and is based in Cambridge, England. He researches, publishes, and works ...
(2009) also proposes that there might have been other primary branches of Austroasiatic that are now extinct, based on substrate evidence in modern-day languages. * Pre-
Chamic The Chamic languages, also known as Aceh–Chamic and Achinese–Chamic, are a group of ten languages spoken in Aceh (Sumatra, Indonesia) and in parts of Cambodia, Thailand, Vietnam and Hainan, China. The Chamic languages are a subgroup of Malay ...
languages (the languages of coastal Vietnam before the Chamic migrations). Chamic has various Austroasiatic loanwords that cannot be clearly traced to existing Austroasiatic branches (Sidwell 2006, 2007).Sidwell, Paul. 2006.
Dating the Separation of Acehnese and Chamic By Etymological Analysis of the Aceh-Chamic Lexicon
." In The ''Mon-Khmer Studies Journal'', 36: 187–206.
Sidwell, Paul. 2007.
The Mon-Khmer Substrate in Chamic: Chamic, Bahnaric and Katuic Contact
" In SEALS XII Papers from the 12th Annual Meeting of the Southeast Asian Linguistics Society 2002, edited by Ratree Wayland et al.. Canberra, Australia, 113-128. Pacific Linguistics, Research School of Pacific and Asian Studies, The Australian National University.
Larish (1999)Larish, Michael David. 1999. ''The Position of Moken and Moklen Within the Austronesian Language Family''. Doctoral dissertation, University of Hawai'i at Mānoa. also notes that
Moklenic languages The Moklenic or Moken–Moklen languages consist of a pair of two closely related but distinct languages, namely Moken and Moklen. Larish (1999) establishes the two languages as forming two distinct subgroups of a larger Moken–Moklen branch. La ...
contain many Austroasiatic loanwords, some of which are similar to the ones found in Chamic. * Acehnese substratum (Sidwell 2006). Acehnese has many basic words that are of Austroasiatic origin, suggesting that either Austronesian speakers have absorbed earlier Austroasiatic residents in northern Sumatra, or that words might have been borrowed from Austroasiatic languages in southern Vietnam – or perhaps a combination of both. Sidwell (2006) argues that Acehnese and Chamic had often borrowed Austroasiatic words independently of each other, while some Austroasiatic words can be traced back to Proto-Aceh-Chamic. Sidwell (2006) accepts that Acehnese and Chamic are related, but that they had separated from each other before Chamic had borrowed most of its Austroasiatic lexicon. *
Bornean Borean (also Boreal or Boralean)http://ehl.santafe.edu/EhlforWeb.pdf is a hypothetical linguistic macrofamily that encompasses almost all language families worldwide except those native to the Americas, Africa, Oceania, and the Andaman Islands. ...
substrate languages (Blench 2010). Blench cites Austroasiatic-origin words in modern-day Bornean branches such as
Land Dayak The Land Dayak languages are a group of dozen or so languages spoken by the Bidayuh Land Dayaks of Borneo. Languages ''Glottolog'' ''Glottolog'' classifies the Land Dayak languages as follows. *Benyadu-Bekati: Bekati (Bekatiq), Sara, Lara (Rar ...
(
Bidayuh Bidayuh is the collective name for several indigenous groups found in southern Sarawak, Malaysia and northern West Kalimantan, Indonesia, on the island of Borneo, which are broadly similar in language and culture (see also #Language issues, is ...
, Dayak Bakatiq, etc.), Dusunic (
Central Dusun Central Dusun, also known as Bunduliwan (Dusun: ), is one of the more widespread languages spoken by the Dusun (including Kadazan) peoples of Sabah, Malaysia. Kadazandusun language standardisation What is termed as ''Central Dusun'' (or sim ...
,
Visayan Visayans (Visayan: ''mga Bisaya''; ) or Visayan people are a Philippine ethnolinguistic group or metaethnicity native to the Visayas, the southernmost islands of Luzon and a significant portion of Mindanao. When taken as a single ethnic group, ...
, etc.), Kayan, and
Kenyah The Kenyah people are an indigenous, Austronesian-speaking people of Borneo, living in the remote Baram Lio Matoh, Long Selaan, Long Moh, Long Anap, Long Mekaba, Long Jeeh, Long Belaong, Long San, Long Silat, Long Tungan, Data Kakus, Data Su ...
, noting especially resemblances with
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
. As further evidence for his proposal, Blench also cites ethnographic evidence such as musical instruments in Borneo shared in common with Austroasiatic-speaking groups in mainland Southeast Asia. Adelaar (1995) has also noticed phonological and lexical similarities between
Land Dayak The Land Dayak languages are a group of dozen or so languages spoken by the Bidayuh Land Dayaks of Borneo. Languages ''Glottolog'' ''Glottolog'' classifies the Land Dayak languages as follows. *Benyadu-Bekati: Bekati (Bekatiq), Sara, Lara (Rar ...
and
Aslian The Aslian languages () are the southernmost branch of Austroasiatic languages spoken on the Malay Peninsula. They are the languages of many of the '' Orang Asli'', the aboriginal inhabitants of the peninsula. The total number of native speakers ...
. * Lepcha substratum ("''Rongic''"). Many words of Austroasiatic origin have been noticed in Lepcha, suggesting a
Sino-Tibetan Sino-Tibetan, also cited as Trans-Himalayan in a few sources, is a family of more than 400 languages, second only to Indo-European in number of native speakers. The vast majority of these are the 1.3 billion native speakers of Chinese languages. ...
superstrate laid over an Austroasiatic substrate. Blench (2013) calls this branch "''Rongic''" based on the Lepcha autonym ''Róng''. Other languages with proposed Austroasiatic substrata are: *
Jiamao Jiamao (, ''Jiamao''; also ''Tái'' or ''Sāi'') is a possible language isolate spoken in southern Hainan, China. Jiamao speakers' autonym is ''1''.See Proto-Tai_language#Tones for an explanation of the tone codes. Classification Jiamao was ...
, based on evidence from the register system of Jiamao, a Hlai language (Thurgood 1992). Jiamao is known for its highly aberrant vocabulary in relation to other
Hlai languages The Hlai languages () are a primary branch of the Kra–Dai language family spoken in the mountains of central and south-central Hainan in China by the Hlai people, not to be confused with the colloquial name for the Leizhou branch of Min Chines ...
. * Kerinci: van Reijn (1974) notes that Kerinci, a
Malayic The Malayic languages are a branch of the Malayo-Polynesian subgroup of the Austronesian language family. The most prominent member is Malay, which is the national language of Brunei, Singapore and Malaysia; it further serves as basis for Ind ...
language of central
Sumatra Sumatra is one of the Sunda Islands of western Indonesia. It is the largest island that is fully within Indonesian territory, as well as the sixth-largest island in the world at 473,481 km2 (182,812 mi.2), not including adjacent i ...
, shares many phonological similarities with Austroasiatic languages, such as
sesquisyllabic Primarily in Austroasiatic languages (also known as Mon–Khmer), in a typical word a minor syllable is a reduced (minor) syllable followed by a full tonic or stressed syllable. The minor syllable may be of the form or , with a reduced vowel, as ...
word structure and vowel inventory. John Peterson (2017) suggests that "pre- Munda" ("proto-" in regular terminology) languages may have once dominated the eastern
Indo-Gangetic Plain The Indo-Gangetic Plain, also known as the North Indian River Plain, is a fertile plain encompassing northern regions of the Indian subcontinent, including most of northern and eastern India, around half of Pakistan, virtually all of Bangla ...
, and were then absorbed by Indo-Aryan languages at an early date as Indo-Aryan spread east. Peterson notes that eastern
Indo-Aryan languages The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European languages, Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily ...
display many morphosyntactic features similar to those of Munda languages, while western Indo-Aryan languages do not.


Writing systems

Other than Latin-based alphabets, many Austroasiatic languages are written with the Khmer,
Thai Thai or THAI may refer to: * Of or from Thailand, a country in Southeast Asia ** Thai people, the dominant ethnic group of Thailand ** Thai language, a Tai-Kadai language spoken mainly in and around Thailand *** Thai script *** Thai (Unicode block ...
, Lao, and Burmese alphabets. Vietnamese divergently had an indigenous script based on Chinese logographic writing. This has since been supplanted by the Latin alphabet in the 20th century. The following are examples of past-used alphabets or current alphabets of Austroasiatic languages. *
Chữ Nôm Chữ Nôm (, ; ) is a logographic writing system formerly used to write the Vietnamese language. It uses Chinese characters (''Chữ Hán'') to represent Sino-Vietnamese vocabulary and some native Vietnamese words, with other words represented ...
*
Khmer alphabet Khmer script ( km, អក្សរខ្មែរ, )Huffman, Franklin. 1970. ''Cambodian System of Writing and Beginning Reader''. Yale University Press. . is an abugida (alphasyllabary) script used to write the Khmer language, the official la ...
*
Khom script Khom script may refer to either of the following writing systems derived from the Khmer script: *Khom Thai script, a script based on ancient Khmer and historically used in Thailand *Khom script (Ong Kommadam) The Khom script is a writing system u ...
(used for a short period in the early 20th century for indigenous languages in Laos) *
Old Mon script Old or OLD may refer to: Places * Old, Baranya, Hungary * Old, Northamptonshire, England *Old Street station, a railway and tube station in London (station code OLD) *OLD, IATA code for Old Town Municipal Airport and Seaplane Base, Old Town, ...
*
Mon script Mon, MON or Mon. may refer to: Places * Mon State, a subdivision of Myanmar * Mon, India, a town in Nagaland * Mon district, Nagaland * Mon, Raebareli, a village in Uttar Pradesh, India * Mon, Switzerland, a village in the Canton of Grisons * A ...
*
Pahawh Hmong Pahawh Hmong ( RPA: Phaj hauj Hmoob , Pahawh: ; known also as ''Ntawv Pahawh, Ntawv Keeb, Ntawv Caub Fab, Ntawv Soob Lwj'') is an indigenous semi-syllabic script, invented in 1959 by Shong Lue Yang, to write two Hmong languages, Hmong Daw ''( ...
was once used to write
Khmu The Khmu (; Khmu: ; lo, ຂະມຸ ; th, ขมุ ; vi, Khơ Mú; ; my, ခမူ) are an ethnic group of Southeast Asia. The majority (88%) live in northern Laos where they constitute the largest minority ethnic group, comprising elev ...
, under the name "Pahawh Khmu" * Tai Le ( Palaung, Blang) *
Tai Tham Tai Tham script ('' Tham'' meaning "scripture") is the name given to an abugida writing system used mainly for a group of Southwestern Tai languages i.e., Northern Thai, Tai Lü, Khün and Lao; as well as the liturgical languages of Buddhism ...
( Blang) *
Ol Chiki alphabet The Ol Chiki () script, also known as Ol Chemetʼ (Santali: ''ol'' 'writing', ''chemet'' 'learning'), Ol Ciki, Ol, and sometimes as the Santali alphabet invented by Pandit Raghunath Murmu in the year 1925, is the official writing system for San ...
( Santali alphabet) *
Mundari Bani Mundari Bani (Mundari: 𞓗𞓕𞓨𞓚 ''Bani'' 'alphabet', also known as Mundari Bani Hisir ''Hisir'' 'writing', Nag Mundari, or the Mundari alphabet) is the writing system created for the Mundari language, spoken in eastern India. Mundari is ...
( Mundari alphabet) *
Warang Citi Warang Citi (also written Varang Kshiti or Barang Kshiti; , IPA: /wɐrɐŋ ʧɪt̪ɪ/) is a writing system invented by Lako Bodra for the Ho language spoken in East India. It is used in primary and adult education and in various publications. I ...
( Ho alphabet) * Ol Onal (
Bhumij Bhumij may refer to: *Bhumij people, tribal ethnic group of India * Bhumij language, the language of Bhumij people *Bhumija Bhumija is a variety of north Indian temple architecture marked by how the rotating square-circle principle is applied to ...
alphabet) * Sorang Sompeng alphabet ( Sora alphabet)


External relations


Austric languages

Austroasiatic is an integral part of the controversial Austric hypothesis, which also includes the Austronesian languages, and in some proposals also the
Kra–Dai languages The Kra–Dai languages (also known as Tai–Kadai and Daic) are a language family in Mainland Southeast Asia, Southern China and Northeast India. All languages in the family are tonal languages, including Thai and Lao, the national languages o ...
and the
Hmong–Mien languages The Hmong–Mien languages (also known as Miao–Yao and rarely as Yangtzean) are a highly tonal language family of southern China and northern Southeast Asia. They are spoken in mountainous areas of southern China, including Guizhou, Hunan, Yunn ...
.


Hmong-Mien

Several lexical resemblances are found between the Hmong-Mien and Austroasiatic language families (Ratliff 2010), some of which had earlier been proposed by
Haudricourt Haudricourt () is a commune in the Seine-Maritime department in the Normandy region in northern France. Geography A farming village situated in the Pays de Bray, some southeast of Dieppe at the junction of the D9 and D436 roads. The commune b ...
(1951). This could imply a relation or early language contact along the
Yangtze The Yangtze or Yangzi ( or ; ) is the longest river in Asia, the third-longest in the world, and the longest in the world to flow entirely within one country. It rises at Jari Hill in the Tanggula Mountains (Tibetan Plateau) and flows ...
. According to Cai (et al. 2011), Hmong–Mien is at least partially related to Austroasiatic but was heavily influenced by
Sino-Tibetan Sino-Tibetan, also cited as Trans-Himalayan in a few sources, is a family of more than 400 languages, second only to Indo-European in number of native speakers. The vast majority of these are the 1.3 billion native speakers of Chinese languages. ...
, especially
Tibeto-Burman languages The Tibeto-Burman languages are the non-Sinitic members of the Sino-Tibetan language family, over 400 of which are spoken throughout the Southeast Asian Massif ("Zomia") as well as parts of East Asia and South Asia. Around 60 million people speak ...
.


Indo-Aryan languages

It is suggested that the Austroasiatic languages have some influence on
Indo-Aryan languages The Indo-Aryan languages (or sometimes Indic languages) are a branch of the Indo-Iranian languages in the Indo-European languages, Indo-European language family. As of the early 21st century, they have more than 800 million speakers, primarily ...
including
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
and middle Indo-Aryan languages. Indian linguist
Suniti Kumar Chatterji Bhashacharya Acharya Suniti Kumar Chatterjee (26 November 1890 – 29 May 1977) was an Indian linguist, educationist and litterateur. He was a recipient of the second-highest Indian civilian honour of Padma Vibhushan. Life Childhood Chatterji ...
pointed that a specific number of substantives in languages such as
Hindi Hindi (Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been de ...
, Punjabi and
Bengali Bengali or Bengalee, or Bengalese may refer to: *something of, from, or related to Bengal, a large region in South Asia * Bengalis, an ethnic and linguistic group of the region * Bengali language, the language they speak ** Bengali alphabet, the w ...
were borrowed from
Munda languages The Munda languages are a group of closely related languages spoken by about nine million people in India and Bangladesh. Historically, they have been called the Kolarian languages. They constitute a branch of the Austroasiatic language famil ...
. Additionally, French linguist
Jean Przyluski Jean Przyluski (17 August 1885 – 28 October 1944) was a French linguist and scholar of religion and Buddhism of Polish descent. His interests ranged widely through the structure of the Vietnamese language, the development of Buddhist myths ...
suggested a similarity between the tales from the Austroasiatic realm and the Indian mythological stories of Matsyagandha (from ''
Mahabharata The ''Mahābhārata'' ( ; sa, महाभारतम्, ', ) is one of the two major Sanskrit epics of ancient India in Hinduism, the other being the ''Rāmāyaṇa''. It narrates the struggle between two groups of cousins in the Kuruk ...
'') and the
Nāga The Nagas (IAST: ''nāga''; Devanāgarī: नाग) are a divine, or semi-divine, race of half-human, half-serpent beings that reside in the netherworld (Patala), and can occasionally take human or part-human form, or are so depicted in art. ...
s.


Austroasiatic migrations and archaeogenetics

Mitsuru Sakitani suggests that Haplogroup O1b1, which is common in Austroasiatic people and some other ethnic groups in
southern China South China () is a geographical and cultural region that covers the southernmost part of China. Its precise meaning varies with context. A notable feature of South China in comparison to the rest of China is that most of its citizens are not n ...
, and haplogroup O1b2, which is common in today's
Japanese Japanese may refer to: * Something from or related to Japan, an island country in East Asia * Japanese language, spoken mainly in Japan * Japanese people, the ethnic group that identifies with Japan through ancestry or culture ** Japanese diaspor ...
,
Koreans Koreans ( South Korean: , , North Korean: , ; see names of Korea) are an East Asian ethnic group native to the Korean Peninsula. Koreans mainly live in the two Korean nation states: North Korea and South Korea (collectively and simply refe ...
and some
Manchu The Manchus (; ) are a Tungusic East Asian ethnic group native to Manchuria in Northeast Asia. They are an officially recognized ethnic minority in China and the people from whom Manchuria derives its name. The Later Jin (1616–1636) and ...
, are the carriers of early rice-agriculturalists from
Indochina Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
. Another study suggests that the haplogroup O1b1 is the major Austroasiatic paternal lineage and O1b2 the "para-Austroasiatic" lineage of the
Manchu The Manchus (; ) are a Tungusic East Asian ethnic group native to Manchuria in Northeast Asia. They are an officially recognized ethnic minority in China and the people from whom Manchuria derives its name. The Later Jin (1616–1636) and ...
,
Korean Korean may refer to: People and culture * Koreans, ethnic group originating in the Korean Peninsula * Korean cuisine * Korean culture * Korean language **Korean alphabet, known as Hangul or Chosŏn'gŭl **Korean dialects and the Jeju language ** ...
and
Yayoi people The were an ancient ethnicity that migrated to the Japanese archipelago from Korea and China during the Yayoi period (300 BCE–300 CE). Although highly controversial, a single study that utilized radiometric dating techniques inconclusively ...
. A 2021 study by Tagore et al. found that the proto-Austroasiatic speakers split from an Basal East Asian source population, native to
Mainland Southeast Asia Mainland Southeast Asia, also known as the Indochinese Peninsula or Indochina, is the continental portion of Southeast Asia. It lies east of the Indian subcontinent and south of Mainland China and is bordered by the Indian Ocean to the west an ...
and
Northeast India , native_name_lang = mni , settlement_type = , image_skyline = , image_alt = , image_caption = , motto = , image_map = Northeast india.png , ...
, which also gave rise to other East Asian-related populations, including Northeast Asians and
Indigenous peoples of the Americas The Indigenous peoples of the Americas are the inhabitants of the Americas before the arrival of the European settlers in the 15th century, and the ethnic groups who now identify themselves with those peoples. Many Indigenous peoples of the A ...
. The proto-Austroasiatic speakers can be linked to the
Hoabinhian Hoabinhian is a lithic techno-complex of archaeological sites associated with assemblages in Southeast Asia from late Pleistocene to Holocene, dated to c.10,000–2000 BCE. It is attributed to hunter-gatherer societies of the region and their ...
material culture. From Mainland Southeast Asia, the Austroasiatic speakers expanded into the Indian-subcontinent and
Maritime Southeast Asia Maritime Southeast Asia comprises the countries of Brunei, Indonesia, Malaysia, the Philippines, Singapore, and East Timor. Maritime Southeast Asia is sometimes also referred to as Island Southeast Asia, Insular Southeast Asia or Oceanic Sout ...
. There is evidence that later back migration from more northerly East Asian groups (such as Kra-Dai speakers) merged with indigenous Southeast Asians, contributing to the fragmentation observed among modern day Austroasiatic-speakers. In the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
, Austroasiatic speakers, specifically Mundari, intermixed with the local population. Furthermore they concluded that their results do not support a genetic relationship between Ancient Southeast Asian hunter-gatherers (Hoabinhians) with Papuan-related groups, as previously suggested by McColl et al. 2018, but that these Ancient Southeast Asians are characterized by Basal East Asian ancestry. The authors finally concluded that genetics do not necessarily correspond with linguistic identity, pointing to the fragmentation of modern Austroasiatic speakers. Larena et al. 2021 could reproduce the genetic evidence for the origin of Basal East Asians in Mainland Southeast Asia, which are estimated to have formed about 50kya years ago, and expanded through multiple migration waves southwards and northwards. Early Austroasiatic speakers are estimated to have originated from an lineage, which split from Ancestral East Asians between 25,000 and 15,000 years ago, and were among the first wave to replace distinct Australasian-related groups in
Insular Southeast Asia Maritime Southeast Asia comprises the countries of Brunei, Indonesia, Malaysia, the Philippines, Singapore, and East Timor. Maritime Southeast Asia is sometimes also referred to as Island Southeast Asia, Insular Southeast Asia or Oceanic Sout ...
. East Asian-related ancestry became dominant in Insular Southeast Asia already between 15,000 years to 12,000 years ago, and may be associated with Austroasiatic groups, which however got again replaced by later Austronesian groups some 10,000 to 7,000 years ago. Early Austroasiatic people were found to be best represented by the
Mlabri people The Mlabri (Thai: มลาบรี) or Mrabri are an ethnic group of Thailand and Laos, and have been called "the most interesting and least understood people in Southeast Asia". Only about 400 or fewer Mlabris remain in the world today, with som ...
in modern day
Thailand Thailand ( ), historically known as Siam () and officially the Kingdom of Thailand, is a country in Southeast Asia, located at the centre of the Indochinese Peninsula, spanning , with a population of almost 70 million. The country is bo ...
. Proposals for Austroasiatic substratum among later Austronesian languages in Western Indonesia, noteworthy among the
Dayak language The Dayak (; older spelling: Dajak) or Dyak or Dayuh are one of the native groups of Borneo. It is a loose term for over 200 riverine and hill-dwelling ethnic groups, located principally in the central and southern interior of Borneo, each w ...
s, is strengthened by genetic data, suggesting Austroasiatic speakers were assimilated by Austronesian speakers. A study in November 2021 (Guo et al.) found that modern East-Eurasians can be modeled from four ancestry components, which descended from a common ancestor in Mainland Southeast Asia, one being the "Ancestral Austroasiatic" component (AAA), which is more prevalent among modern Southeast Asians, and making up the exclusive ancestry among Austroasiatic-speaking
Lua Lua or LUA may refer to: Science and technology * Lua (programming language) * Latvia University of Agriculture * Last universal ancestor, in evolution Ethnicity and language * Lua people, of Laos * Lawa people, of Thailand sometimes referred t ...
and
Mlabri people The Mlabri (Thai: มลาบรี) or Mrabri are an ethnic group of Thailand and Laos, and have been called "the most interesting and least understood people in Southeast Asia". Only about 400 or fewer Mlabris remain in the world today, with som ...
. The early Austroasiatic speakers are suggested to have been
hunter-gatherers A traditional hunter-gatherer or forager is a human living an ancestrally derived lifestyle in which most or all food is obtained by foraging, that is, by gathering food from local sources, especially edible wild plants but also insects, fungi, ...
but became rice-agriculturalists quite early, spreading from Mainland Southeast Asia northwards to the
Yangtze river The Yangtze or Yangzi ( or ; ) is the longest list of rivers of Asia, river in Asia, the list of rivers by length, third-longest in the world, and the longest in the world to flow entirely within one country. It rises at Jari Hill in th ...
, westwards into the
Indian subcontinent The Indian subcontinent is a list of the physiographic regions of the world, physiographical region in United Nations geoscheme for Asia#Southern Asia, Southern Asia. It is situated on the Indian Plate, projecting southwards into the Indian O ...
, and southwards into Insular Southeast Asia. Evidence for these migrations are Austroasiatic loanwords related to rice-agriculture found among non-Austroasiatic languages, and the presence of Austroasiatic genetic ancestry. According to a recent genetic study,
Sundanese Sundanese may refer to: * Sundanese people * Sundanese language * Sundanese script Standard Sundanese script (''Aksara Sunda Baku'', ) is a writing system which is used by the Sundanese people. It is built based on Old Sundanese script (' ...
, Javanese, and Balinese, has almost an equal ratio of genetic marker shared between Austronesian and
Austroasiatic The Austroasiatic languages , , are a large language family A language family is a group of languages related through descent from a common ''ancestral language'' or ''parental language'', called the proto-language of that family. The te ...
heritages.


Migration into India

According to Chaubey et al., "Austro-Asiatic speakers in India today are derived from dispersal from
Southeast Asia Southeast Asia, also spelled South East Asia and South-East Asia, and also known as Southeastern Asia, South-eastern Asia or SEA, is the geographical United Nations geoscheme for Asia#South-eastern Asia, south-eastern region of Asia, consistin ...
, followed by extensive sex-specific admixture with local Indian populations." According to Riccio et al., the
Munda people The Munda people are an Austroasiatic speaking ethnic group of India. They predominantly speak the Mundari language as their native language, which belongs to the Munda subgroup of Austroasiatic languages. The Munda are found mainly concentr ...
are likely descended from Austroasiatic migrants from Southeast Asia. According to Zhang et al., Austroasiatic migrations from Southeast Asia into India took place after the last Glacial maximum, circa 10,000 years ago. Arunkumar et al., suggest Austroasiatic migrations from Southeast Asia occurred into Northeast India 5.2 ± 0.6 kya and into East India 4.3 ± 0.2 kya.


Notes


References


Sources

* Adams, K. L. (1989). ''Systems of numeral classification in the Mon–Khmer, Nicobarese and Aslian subfamilies of Austroasiatic''. Canberra, A.C.T., Australia: Dept. of Linguistics, Research School of Pacific Studies, Australian National University. * * Alves, Mark J. (2015). Morphological functions among Mon-Khmer languages: beyond the basics. In N. J. Enfield & Bernard Comrie (eds.), ''Languages of Mainland Southeast Asia: the state of the art''. Berlin: de Gruyter Mouton, 531–557. * Bradley, David (2012).
Languages and Language Families in China
, in Rint Sybesma (ed.), ''Encyclopedia of Chinese Language and Linguistics''. * Chakrabarti, Byomkes. (1994). ''A Comparative Study of Santali and Bengali''. * * Diffloth, Gérard. (2005). "The contribution of linguistic palaeontology and Austro-Asiatic". in Laurent Sagart, Roger Blench and Alicia Sanchez-Mazas, eds. ''The Peopling of East Asia: Putting Together Archaeology, Linguistics and Genetics.'' 77–80. London: Routledge Curzon. * Filbeck, D. (1978). ''T'in: a historical study''. Pacific linguistics, no. 49. Canberra: Dept. of Linguistics, Research School of Pacific Studies, Australian National University. * Hemeling, K. (1907). ''Die Nanking Kuanhua''. (German language) * Jenny, Mathias and
Paul Sidwell Paul James Sidwell is an Australian linguist based in Canberra, Australia who has held research and lecturing positions at the Australian National University. Sidwell, who is also an expert and consultant in forensic linguistics, is most notable ...
, eds (2015).
The Handbook of Austroasiatic Languages
'. Leiden: Brill. * Peck, B. M., Comp. (1988). ''An Enumerative Bibliography of South Asian Language Dictionaries''. * Peiros, Ilia. 1998. ''Comparative Linguistics in Southeast Asia.'' Pacific Linguistics Series C, No. 142. Canberra: Australian National University. * Shorto, Harry L. edited by Sidwell, Paul, Cooper, Doug and Bauer, Christian (2006).
A Mon–Khmer comparative dictionary
'. Canberra: Australian National University. Pacific Linguistics. * Shorto, H. L. ''Bibliographies of Mon–Khmer and Tai Linguistics''. London oriental bibliographies, v. 2. London: Oxford University Press, 1963. * * * * van Driem, George. (2007). Austroasiatic phylogeny and the Austroasiatic homeland in light of recent population genetic studies. ''Mon-Khmer Studies, 37'', 1-14. * Zide, Norman H., and Milton E. Barker. (1966) ''Studies in Comparative Austroasiatic Linguistics'', The Hague: Mouton (Indo-Iranian monographs, v. 5.). *


Further reading

* * Mann, Noel, Wendy Smith and Eva Ujlakyova. 2009.
Linguistic clusters of Mainland Southeast Asia: an overview of the language families.
' Chiang Mai: Payap University. * * * Sidwell, Paul. 2016
Bibliography of Austroasiatic linguistics and related resources
* E. K. Brown (ed.) Encyclopedia of Languages and Linguistics. Oxford: Elsevier Press. * Gregory D. S. Anderson and Norman H. Zide. 2002. Issues in Proto-Munda and Proto-Austroasiatic Nominal Derivation: The Bimoraic Constraint. In Marlys A. Macken (ed.) Papers from the 10th Annual Meeting of the Southeast Asian Linguistics Society. Tempe, AZ: Arizona State University, South East Asian Studies Program, Monograph Series Press. pp. 55–74.


External links

* Swadesh lists for Austro-Asiatic languages (from Wiktionary's Swadesh-list appendix)
Austro-Asiatic
at the Linguist List MultiTree Project (not functional as of 2014): Genealogical trees attributed to Sebeok 1942, Pinnow 1959, Diffloth 2005, and Matisoff 2006


Mon–Khmer Languages Project
at SEAlang
Munda Languages Project
at SEAlang
RWAAI
(Repository and Workspace for Austroasiatic Intangible Heritage) * http://hdl.handle.net/10050/00-0000-0000-0003-66A4-2@view RWAAI Digital Archive
Michel Ferlus's recordings of Mon-Khmer (Austroasiatic) languages
(CNRS) {{Authority control Agglutinative languages Language families Sino-Austronesian languages