The Indo-European languages are a
language family
A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
native to the northern
Indian subcontinent
The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, most of
Europe
Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
, and the
Iranian plateau
The Iranian plateau or Persian plateau is a geological feature spanning parts of the Caucasus, Central Asia, South Asia, and West Asia. It makes up part of the Eurasian plate, and is wedged between the Arabian plate and the Indian plate. ...
with additional native branches found in regions such as
Sri Lanka
Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
, the
Maldives
The Maldives, officially the Republic of Maldives, and historically known as the Maldive Islands, is an Archipelagic state, archipelagic country in South Asia located in the Indian Ocean. The Maldives is southwest of Sri Lanka and India, abou ...
, parts of Central Asia (e.g.,
Tajikistan
Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
and
Afghanistan
Afghanistan, officially the Islamic Emirate of Afghanistan, is a landlocked country located at the crossroads of Central Asia and South Asia. It is bordered by Pakistan to the Durand Line, east and south, Iran to the Afghanistan–Iran borde ...
),
Armenia
Armenia, officially the Republic of Armenia, is a landlocked country in the Armenian Highlands of West Asia. It is a part of the Caucasus region and is bordered by Turkey to the west, Georgia (country), Georgia to the north and Azerbaijan to ...
, and areas of southern India. Historically, Indo-European languages were also spoken in
Anatolia
Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
. Some European languages of this family—
English,
French,
Portuguese,
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
,
Spanish
Spanish might refer to:
* Items from or related to Spain:
**Spaniards are a nation and ethnic group indigenous to Spain
**Spanish language, spoken in Spain and many countries in the Americas
**Spanish cuisine
**Spanish history
**Spanish culture
...
, and
Dutch
Dutch or Nederlands commonly refers to:
* Something of, from, or related to the Netherlands
** Dutch people as an ethnic group ()
** Dutch nationality law, history and regulations of Dutch citizenship ()
** Dutch language ()
* In specific terms, i ...
—have expanded through
colonialism
Colonialism is the control of another territory, natural resources and people by a foreign group. Colonizers control the political and tribal power of the colonised territory. While frequently an Imperialism, imperialist project, colonialism c ...
in the modern period and are now spoken across several continents. The Indo-European family is divided into several branches or sub-families, including
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
,
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
,
Balto-Slavic
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
,
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
,
Germanic,
Hellenic,
Indo-Iranian, and
Italic, all of which contain present-day living languages, as well as many more
extinct
Extinction is the termination of an organism by the death of its Endling, last member. A taxon may become Functional extinction, functionally extinct before the death of its last member if it loses the capacity to Reproduction, reproduce and ...
branches.
Today, the individual Indo-European languages with the most native speakers are English, Spanish, Portuguese, Russian,
Hindustani,
Bengali
Bengali or Bengalee, or Bengalese may refer to:
*something of, from, or related to Bengal, a large region in South Asia
* Bengalis, an ethnic and linguistic group of the region
* Bengali language, the language they speak
** Bengali alphabet, the w ...
, French, and
German
German(s) may refer to:
* Germany, the country of the Germans and German things
**Germania (Roman era)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizenship in Germany, see also Ge ...
; many others spoken by smaller groups are in danger of extinction. Over 3.4 billion people (42% of the global population) speak an Indo-European language as a
first language
A first language (L1), native language, native tongue, or mother tongue is the first language a person has been exposed to from birth or within the critical period hypothesis, critical period. In some countries, the term ''native language'' ...
—by far the most of any language family. There are about 446 living Indo-European languages, according to an estimate by ''
Ethnologue
''Ethnologue: Languages of the World'' is an annual reference publication in print and online that provides statistics and other information on the living languages of the world. It is the world's most comprehensive catalogue of languages. It w ...
'', of which 313 belong to the Indo-Iranian branch.
All Indo-European languages are descended from a single prehistoric language,
linguistically reconstructed
Linguistic reconstruction is the practice of establishing the features of an unattested ancestor language of one or more given languages. There are two kinds of reconstruction:
* Internal reconstruction uses irregularities in a single language t ...
as
Proto-Indo-European
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
, spoken sometime during the
Neolithic
The Neolithic or New Stone Age (from Ancient Greek, Greek 'new' and 'stone') is an archaeological period, the final division of the Stone Age in Mesopotamia, Asia, Europe and Africa (c. 10,000 BCE to c. 2,000 BCE). It saw the Neolithic Revo ...
or early
Bronze Age
The Bronze Age () was a historical period characterised principally by the use of bronze tools and the development of complex urban societies, as well as the adoption of writing in some areas. The Bronze Age is the middle principal period of ...
(). The geographical location where it was spoken, the
Proto-Indo-European homeland
The Proto-Indo-European homeland was the prehistoric homeland of the Proto-Indo-European language (PIE), meaning it was the region where the proto-language was spoken before it split into the dialects from which the earliest Indo-European langu ...
, has been the object of many competing hypotheses; the academic consensus supports the
Kurgan hypothesis
The Kurgan hypothesis (also known as the Kurgan theory, Kurgan model, or steppe theory) is the most widely accepted proposal to identify the Proto-Indo-European homeland from which the Indo-European languages spread out throughout Europe and part ...
, which posits the homeland to be the
Pontic–Caspian steppe
The Pontic–Caspian Steppe is a steppe extending across Eastern Europe to Central Asia, formed by the Caspian and Pontic steppes. It stretches from the northern shores of the Black Sea (the ''Pontus Euxinus'' of antiquity) to the northern a ...
in what is now
Ukraine
Ukraine is a country in Eastern Europe. It is the List of European countries by area, second-largest country in Europe after Russia, which Russia–Ukraine border, borders it to the east and northeast. Ukraine also borders Belarus to the nor ...
and
Southern Russia
Southern Russia or the South of Russia ( rus, Юг России, p=juk rɐˈsʲiɪ) is a Colloquialism, colloquial term for the southernmost geographic portion of European Russia. The term is generally used to refer to the region of Russia's So ...
, associated with the
Yamnaya culture
The Yamnaya ( ) or Yamna culture ( ), also known as the Pit Grave culture or Ochre Grave culture, is a late Copper Age to early Bronze Age archaeological culture of the region between the Southern Bug, Dniester, and Ural rivers (the Pontic–C ...
and other related archaeological cultures during the 4th and early 3rd millennia BC. By the time the first written records appeared, Indo-European had already evolved into numerous languages spoken across much of Europe,
South Asia
South Asia is the southern Subregion#Asia, subregion of Asia that is defined in both geographical and Ethnicity, ethnic-Culture, cultural terms. South Asia, with a population of 2.04 billion, contains a quarter (25%) of the world's populatio ...
, and part of
Western Asia
West Asia (also called Western Asia or Southwest Asia) is the westernmost region of Asia. As defined by most academics, UN bodies and other institutions, the subregion consists of Anatolia, the Arabian Peninsula, Iran, Mesopotamia, the Armenian ...
. Written evidence of Indo-European appeared during the Bronze Age in the form of
Mycenaean Greek
Mycenaean Greek is the earliest attested form of the Greek language. It was spoken on the Greek mainland and Crete in Mycenaean Greece (16th to 12th centuries BC). The language is preserved in inscriptions in Linear B, a script first atteste ...
and the
Anatolian languages
The Anatolian languages are an extinct branch of Indo-European languages that were spoken in Anatolia. The best known Anatolian language is Hittite, which is considered the earliest-attested Indo-European language.
Undiscovered until the late ...
of
Hittite and
Luwian
Luwian (), sometimes known as Luvian or Luish, is an ancient language, or group of languages, within the Anatolian branch of the Indo-European language family. The ethnonym Luwian comes from ''Luwiya'' (also spelled ''Luwia'' or ''Luvia'') – ...
. The oldest records are isolated Hittite words and names—interspersed in texts that are otherwise in the unrelated
Akkadian language
Akkadian ( ; )John Huehnergard & Christopher Woods, "Akkadian and Eblaite", ''The Cambridge Encyclopedia of the World's Ancient Languages''. Ed. Roger D. Woodard (2004, Cambridge) Pages 218–280 was an East Semitic language that is attested ...
, a
Semitic language
The Semitic languages are a branch of the Afroasiatic language family. They include Arabic,
Amharic, Tigrinya, Aramaic, Hebrew, Maltese, Modern South Arabian languages and numerous other ancient and modern languages. They are spoken by mo ...
—found in texts of the
Assyria
Assyria (Neo-Assyrian cuneiform: , ''māt Aššur'') was a major ancient Mesopotamian civilization that existed as a city-state from the 21st century BC to the 14th century BC and eventually expanded into an empire from the 14th century BC t ...
n colony of
Kültepe
Kültepe ( Turkish: ), also known under its ancient name Kaneš (Kanesh, sometimes also Kaniš/Kanish) or Neša (Nesha), is an archaeological site in Kayseri Province, Turkey. It was already a major settlement at the beginning of the 3rd mille ...
in eastern
Anatolia
Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
dating to the 20th century BC. Although no older written records of the original
Proto-Indo-European population remain, some aspects of
their culture and
their religion can be reconstructed from later evidence in the daughter cultures. The Indo-European family is significant to the field of
historical linguistics
Historical linguistics, also known as diachronic linguistics, is the scientific study of how languages change over time. It seeks to understand the nature and causes of linguistic change and to trace the evolution of languages. Historical li ...
as it possesses the second-longest
recorded history
Recorded history or written history describes the historical events that have been recorded in a written form or other documented communication which are subsequently evaluated by historians using the historical method. For broader world h ...
of any known family after
Egyptian
''Egyptian'' describes something of, from, or related to Egypt.
Egyptian or Egyptians may refer to:
Nations and ethnic groups
* Egyptians, a national group in North Africa
** Egyptian culture, a complex and stable culture with thousands of year ...
and the
Semitic languages
The Semitic languages are a branch of the Afroasiatic languages, Afroasiatic language family. They include Arabic,
Amharic, Tigrinya language, Tigrinya, Aramaic, Hebrew language, Hebrew, Maltese language, Maltese, Modern South Arabian language ...
, which belong to the
Afroasiatic language family
The Afroasiatic languages (also known as Afro-Asiatic, Afrasian, Hamito-Semitic, or Semito-Hamitic) are a language family (or "phylum") of about 400 languages spoken predominantly in West Asia, North Africa, the Horn of Africa, and parts of the ...
. The analysis of the family relationships between the Indo-European languages, and the reconstruction of their common source, was central to the development of the methodology of historical linguistics as an academic discipline in the 19th century.
The Indo-European language family is not considered by the current academic consensus in the field of linguistics to have any
genetic relationships with other language families, although several
disputed hypotheses propose such relations.
History of Indo-European linguistics
During the 16th century, European visitors to the
Indian subcontinent
The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
began to notice similarities among
Indo-Aryan,
Iranian
Iranian () may refer to:
* Something of, from, or related to Iran
** Iranian diaspora, Iranians living outside Iran
** Iranian architecture, architecture of Iran and parts of the rest of West Asia
** Iranian cuisine, cooking traditions and practic ...
, and
European languages. In 1583, English
Jesuit
The Society of Jesus (; abbreviation: S.J. or SJ), also known as the Jesuit Order or the Jesuits ( ; ), is a religious order (Catholic), religious order of clerics regular of pontifical right for men in the Catholic Church headquartered in Rom ...
missionary and
Konkani __NOTOC__
Konkani may refer to:
Language
* Konkani language is an Indo-Aryan language spoken in the Konkan region of India.
* Konkani alphabets, different scripts used to write the language
**Konkani in the Roman script, one of the scripts used to ...
scholar
Thomas Stephens wrote a letter from
Goa
Goa (; ; ) is a state on the southwestern coast of India within the Konkan region, geographically separated from the Deccan highlands by the Western Ghats. It is bound by the Indian states of Maharashtra to the north, and Karnataka to the ...
to his brother (not published until the 20th century) in which he noted similarities between Indian languages and
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
and
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
.
Another account was made by
Filippo Sassetti
Filippo Sassetti (1540–1588) was a traveller and merchant from a long-established Florentine mercantile family, who was born in Florence in 1540. Though his father had sold family interests and he had to commence as a clerk in a merchant busine ...
, a merchant born in
Florence
Florence ( ; ) is the capital city of the Italy, Italian region of Tuscany. It is also the most populated city in Tuscany, with 362,353 inhabitants, and 989,460 in Metropolitan City of Florence, its metropolitan province as of 2025.
Florence ...
in 1540, who travelled to the Indian subcontinent. Writing in 1585, he noted some word similarities between
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
and Italian (these included ''devaḥ''/''dio'' 'God', ''sarpaḥ''/''serpe'' 'serpent', ''sapta''/''sette'' 'seven', ''aṣṭa''/''otto'' 'eight', and ''nava''/''nove'' 'nine'). However, neither Stephens' nor Sassetti's observations led to further scholarly inquiry.
In 1647,
Dutch
Dutch or Nederlands commonly refers to:
* Something of, from, or related to the Netherlands
** Dutch people as an ethnic group ()
** Dutch nationality law, history and regulations of Dutch citizenship ()
** Dutch language ()
* In specific terms, i ...
linguist and scholar
Marcus Zuerius van Boxhorn
Marcus Zuerius van Boxhorn (August 28, 1612 – October 3, 1653) was a Dutch people, Dutch scholar (his Latinized name was Marcus Zuerius Boxhornius). Born in Bergen op Zoom, he was professor at the University of Leiden. He discovered the similar ...
noted the similarity among certain Asian and European languages and theorized that they were derived from a primitive common language that he called Scythian. He included in his hypothesis
Dutch
Dutch or Nederlands commonly refers to:
* Something of, from, or related to the Netherlands
** Dutch people as an ethnic group ()
** Dutch nationality law, history and regulations of Dutch citizenship ()
** Dutch language ()
* In specific terms, i ...
,
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
,
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
,
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
,
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
, and
German
German(s) may refer to:
* Germany, the country of the Germans and German things
**Germania (Roman era)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizenship in Germany, see also Ge ...
, later adding
Slavic
Slavic, Slav or Slavonic may refer to:
Peoples
* Slavic peoples, an ethno-linguistic group living in Europe and Asia
** East Slavic peoples, eastern group of Slavic peoples
** South Slavic peoples, southern group of Slavic peoples
** West Slav ...
,
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
, and
Baltic languages
The Baltic languages are a branch of the Indo-European languages, Indo-European language family spoken natively or as a second language by a population of about 6.5–7.0 million people . However, Van Boxhorn's suggestions did not become widely known and did not stimulate further research.
Ottoman Turkish traveller
Evliya Çelebi
Dervish Mehmed Zillî (25 March 1611 – 1682), known as Evliya Çelebi (), was an Ottoman Empire, Ottoman explorer who travelled through his home country during its cultural zenith as well as neighboring lands. He travelled for over 40 years, rec ...
visited Vienna in 1665–1666 as part of a diplomatic mission and noted a few similarities between words in German and in Persian.
Gaston Coeurdoux and others made observations of the same type. Coeurdoux made a thorough comparison of Sanskrit, Latin, and Greek
conjugations in the late 1760s to suggest a relationship among them. Meanwhile,
Mikhail Lomonosov
Mikhail Vasilyevich Lomonosov (; , ; – ) was a Russian polymath, scientist and writer, who made important contributions to literature, education, and science. Among his discoveries were the atmosphere of Venus and the law of conservation of ...
compared different language groups, including Slavic, Baltic ("
Kurlandic"), Iranian ("
Medic
A medic is a person trained to provide medical care, encompassing a wide range of individuals involved in the diagnosis, treatment, and management of health conditions. The term can refer to fully qualified medical practitioners, such as physic ...
"),
Finnish,
Chinese, "Hottentot" (
Khoekhoe
Khoikhoi ( /ˈkɔɪkɔɪ/ ''KOY-koy'') (or Khoekhoe in Namibian orthography) are the traditionally nomadic pastoralist indigenous population of South Africa. They are often grouped with the hunter-gatherer San (literally "foragers") peop ...
), and others, noting that related languages (including Latin, Greek, German, and Russian) must have separated in antiquity from common ancestors.
[M. V. Lomonosov (drafts for ''Russian Grammar'', published 1755). In: Complete Edition, Moscow, 1952, vol. 7, pp. 652–659]
:
Представимъ долготу времени, которою сіи языки раздѣлились. ... Польской и россійской языкъ коль давно раздѣлились! Подумай же, когда курляндской! Подумай же, когда латинской, греч., нѣм., росс. О глубокая древность! magine the depth of time when these languages separated! ... Polish and Russian separated so long ago! Now think how long ago [this happened toKurlandic! Think when [this happened to] Latin, Greek, German, and Russian! Oh, great antiquity!]
The hypothesis reappeared in 1786 when Sir William Jones first lectured on the striking similarities among three of the oldest languages known in his time:
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
,
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
, and
Sanskrit
Sanskrit (; stem form ; nominal singular , ,) is a classical language belonging to the Indo-Aryan languages, Indo-Aryan branch of the Indo-European languages. It arose in northwest South Asia after its predecessor languages had Trans-cultural ...
, to which he tentatively added
Gothic,
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
, and
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
, though his classification contained some inaccuracies and omissions. In one of the most famous quotations in linguistics, Jones made the following prescient statement in a lecture to the
Asiatic Society of Bengal
The Asiatic Society is an organisation founded during the Company rule in India to enhance and further the cause of " Oriental research" (in this case, research into India and the surrounding regions). It was founded by the philologist Will ...
in 1786, conjecturing the existence of an earlier ancestor language, which he called "a common source" but did not name:
Thomas Young first used the term ''Indo-European'' in 1813, deriving it from the geographical extremes of the language family: from
Western Europe
Western Europe is the western region of Europe. The region's extent varies depending on context.
The concept of "the West" appeared in Europe in juxtaposition to "the East" and originally applied to the Western half of the ancient Mediterranean ...
to
North India
North India is a geographical region, loosely defined as a cultural region comprising the northern part of India (or historically, the Indian subcontinent) wherein Indo-Aryans (speaking Indo-Aryan languages) form the prominent majority populati ...
. A synonym is Indo-Germanic (''Idg.'' or ''IdG.''), specifying the family's southeasternmost and northwesternmost branches. This first appeared in French (''indo-germanique'') in 1810 in the work of
Conrad Malte-Brun
Conrad Malte-Brun (; born Malthe Conrad Bruun; 12 August 177514 December 1826), sometimes referred to simply as Malte-Brun, was a Dano- French geographer and journalist. His second son, Victor Adolphe Malte-Brun, was also a geographer. Today he ...
; in most languages this term is now dated or less common than ''Indo-European'', although in German ''indogermanisch'' remains the standard scientific term. A
number of other synonymous terms have also been used.
Franz Bopp
Franz Bopp (; 14 September 1791 – 23 October 1867) was a German linguistics, linguist known for extensive and pioneering comparative linguistics, comparative work on Indo-European languages.
Early life
Bopp was born in Mainz, but the pol ...
wrote in 1816 ''On the conjugational system of the Sanskrit language compared with that of Greek, Latin, Persian and Germanic'' and between 1833 and 1852 he wrote ''Comparative Grammar''. This marks the beginning of
Indo-European studies
Indo-European studies () is a field of linguistics and an interdisciplinary field of study dealing with Indo-European languages, both current and extinct. The goal of those engaged in these studies is to amass information about the hypothetical p ...
as an academic discipline. The classical phase of Indo-European
comparative linguistics
Comparative linguistics is a branch of historical linguistics that is concerned with comparing languages to establish their historical relatedness.
Genetic relatedness implies a common origin or proto-language and comparative linguistics aim ...
leads from this work to
August Schleicher
August Schleicher (; 19 February 1821 – 6 December 1868) was a German linguist. Schleicher studied the Proto-Indo-European language and devised theories concerning historical linguistics. His great work was ''A Compendium of the Comparative Gr ...
's 1861 ''Compendium'' and up to
Karl Brugmann
Friedrich Karl Brugmann (; 16 March 1849 – 29 June 1919) was a German linguist. He is noted for his work in Indo-European linguistics.
Biography
Friedrich Karl Brugman was born in Wiesbaden to a middle-class family in 1849.
He was educated a ...
's ''
Grundriss'', published in the 1880s. Brugmann's
neogrammarian
The Neogrammarians (, , ) were a German school of linguists, originally at the University of Leipzig, in the late 19th century who proposed the Neogrammarian hypothesis of the regularity of sound change.
Overview
According to the Neogrammarian ...
reevaluation of the field and
Ferdinand de Saussure
Ferdinand Mongin de Saussure (; ; 26 November 185722 February 1913) was a Swiss linguist, semiotician and philosopher. His ideas laid a foundation for many significant developments in both linguistics and semiotics in the 20th century. He is wi ...
's development of the
laryngeal theory
The laryngeal theory is a theory in historical linguistics positing that the Proto-Indo-European (PIE) language included a number of laryngeal consonants that are not linguistic reconstruction, reconstructable by direct application of the com ...
may be considered the beginning of "modern" Indo-European studies. The generation of Indo-Europeanists active in the last third of the 20th century (such as
Calvert Watkins
Calvert Watkins ( /ˈwɒtkɪnz/; March 13, 1933 – March 20, 2013) was an American linguist and philologist, known for his book '' How to Kill a Dragon''. He was a professor of linguistics and the classics at Harvard University and after retirem ...
,
Jochem Schindler
Jochem "Joki" Schindler (8 November 1944 – 24 December 1994) was an Austrian Indo-Europeanist. In spite of his comparatively thin bibliography, he made important contributions, in particular to the theory of Proto-Indo-European nominal inflecti ...
, and
Helmut Rix
Helmut Rix (4 July 1926, in Amberg – 3 December 2004, in Colmar) was a German linguist and professor of the Sprachwissenschaftliches Seminar of Albert-Ludwigs-Universität, Freiburg, Germany.
He is best known for his research into Indo-Euro ...
) developed a better understanding of morphology and of
ablaut
In linguistics, the Indo-European ablaut ( , from German ) is a system of apophony (regular vowel variations) in the Proto-Indo-European language (PIE).
An example of ablaut in English is the strong verb ''sing, sang, sung'' and its relate ...
in the wake of
Kuryłowicz's 1956 ''Apophony in Indo-European'', who in 1927 pointed out the existence of the
Hittite consonant ḫ. Kuryłowicz's discovery supported Ferdinand de Saussure's 1879 proposal of the existence of , elements de Saussure reconstructed to account for vowel length alternations in Indo-European languages. This led to the so-called
laryngeal theory
The laryngeal theory is a theory in historical linguistics positing that the Proto-Indo-European (PIE) language included a number of laryngeal consonants that are not linguistic reconstruction, reconstructable by direct application of the com ...
, a major step forward in Indo-European linguistics and a confirmation of de Saussure's theory.
Classification
The various subgroups of the Indo-European language family include ten major branches, listed below in alphabetical order:
*
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
, attested from the 13th century;
Proto-Albanian
Proto-Albanian is the ancestral reconstructed language of Albanian, before the Gheg– Tosk dialectal diversification (before ). Albanoid and other Paleo-Balkan languages had their formative core in the Balkans after the Indo-European migrat ...
evolved from an ancient
Paleo-Balkan language
The Paleo-Balkan languages are a geographical grouping of various Indo-European languages that were spoken in the Balkans and surrounding areas in ancient times. In antiquity, Dacian, Greek, Illyrian, Messapic, Paeonian, Phrygian and Thracian were ...
, traditionally thought to be
Illyrian, or otherwise a totally unattested Balkan
Indo-European language
The Indo-European languages are a language family native to the northern Indian subcontinent, most of Europe, and the Iranian plateau with additional native branches found in regions such as Sri Lanka, the Maldives, parts of Central Asia ( ...
that was closely related to Illyrian and
Messapic
Messapic (; also known as Messapian; or as Iapygian) is an extinct Indo-European Paleo-Balkanic language of the southeastern Italian Peninsula, once spoken in Salento by the Iapygian peoples of the region: the Calabri and Salentini (known colle ...
.
*
Anatolian, extinct by
Late Antiquity
Late antiquity marks the period that comes after the end of classical antiquity and stretches into the onset of the Early Middle Ages. Late antiquity as a period was popularized by Peter Brown (historian), Peter Brown in 1971, and this periodiza ...
, spoken in
Anatolia
Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
, attested in isolated terms in
Luwian
Luwian (), sometimes known as Luvian or Luish, is an ancient language, or group of languages, within the Anatolian branch of the Indo-European language family. The ethnonym Luwian comes from ''Luwiya'' (also spelled ''Luwia'' or ''Luvia'') – ...
/
Hittite mentioned in Semitic
Old Assyrian texts from the 20th and 19th centuries BC,
Hittite texts
The corpus of texts written in the Hittite language consists of more than 30,000 tablets or fragments that have been excavated from the royal archives of the capital of the Hittite Kingdom, Hattusa, close to the modern Turkish town of Boğazkale ...
from about 1650 BC.
*
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
, attested from the early 5th century AD. It evolved from the
Proto-Armenian language
Proto-Armenian is the earlier, unattested stage of the Armenian language which has been reconstructed by linguists. As Armenian is the only known language of its branch of the Indo-European languages, the comparative method cannot be used to re ...
which, according to the
Armenian hypothesis
The Armenian hypothesis, also known as the Near Eastern model, is a theory of the Proto-Indo-European homeland, initially proposed by linguists Tamaz V. Gamkrelidze and Vyacheslav Ivanov in the early 1980s, which suggests that the Proto-Indo- ...
, developed ''in situ'' from the
Proto-Indo-European language
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Eu ...
of the 3rd millennium BC.
*
Balto-Slavic
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
, believed by most Indo-Europeanists to form a phylogenetic unit, while a minority ascribes similarities to prolonged language-contact.
**
Slavic
Slavic, Slav or Slavonic may refer to:
Peoples
* Slavic peoples, an ethno-linguistic group living in Europe and Asia
** East Slavic peoples, eastern group of Slavic peoples
** South Slavic peoples, southern group of Slavic peoples
** West Slav ...
(from
Proto-Slavic
Proto-Slavic (abbreviated PSl., PS.; also called Common Slavic or Common Slavonic) is the unattested, reconstructed proto-language of all Slavic languages. It represents Slavic speech approximately from the 2nd millennium BC through the 6th ...
), attested from the 9th century AD (
possibly earlier), earliest texts in
Old Church Slavonic
Old Church Slavonic or Old Slavonic ( ) is the first Slavic languages, Slavic literary language and the oldest extant written Slavonic language attested in literary sources. It belongs to the South Slavic languages, South Slavic subgroup of the ...
. Slavic languages include
Bulgarian,
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
,
Polish
Polish may refer to:
* Anything from or related to Poland, a country in Europe
* Polish language
* Polish people, people from Poland or of Polish descent
* Polish chicken
* Polish brothers (Mark Polish and Michael Polish, born 1970), American twin ...
,
Czech
Czech may refer to:
* Anything from or related to the Czech Republic, a country in Europe
** Czech language
** Czechs, the people of the area
** Czech culture
** Czech cuisine
* One of three mythical brothers, Lech, Czech, and Rus
*Czech (surnam ...
,
Slovak,
Silesian,
Kashubian,
Macedonian
Macedonian most often refers to someone or something from or related to Macedonia.
Macedonian(s) may refer to:
People Modern
* Macedonians (ethnic group), a nation and a South Slavic ethnic group primarily associated with North Macedonia
* Mac ...
,
Serbo-Croatian
Serbo-Croatian ( / ), also known as Bosnian-Croatian-Montenegrin-Serbian (BCMS), is a South Slavic language and the primary language of Serbia, Croatia, Bosnia and Herzegovina, and Montenegro. It is a pluricentric language with four mutually i ...
(
Bosnian,
Croatian,
Montenegrin,
Serbian
Serbian may refer to:
* Pertaining to Serbia in Southeast Europe; in particular
**Serbs, a South Slavic ethnic group native to the Balkans
** Serbian language
** Serbian culture
**Demographics of Serbia, includes other ethnic groups within the co ...
),
Sorbian,
Slovenian,
Ukrainian,
Belarusian, and
Rusyn.
**
Baltic
Baltic may refer to:
Peoples and languages
*Baltic languages, a subfamily of Indo-European languages, including Lithuanian, Latvian and extinct Old Prussian
*Balts (or Baltic peoples), ethnic groups speaking the Baltic languages and/or originatin ...
, attested from the 14th century; although attested relatively recently, they retain many archaic features attributed to
Proto-Indo-European
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
(PIE). Living examples are
Lithuanian and
Latvian.
*
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
(from
Proto-Celtic
Proto-Celtic, or Common Celtic, is the hypothetical ancestral proto-language of all known Celtic languages, and a descendant of Proto-Indo-European. It is not attested in writing but has been partly Linguistic reconstruction, reconstructed throu ...
), attested since the 6th century BC;
Lepontic inscriptions date as early as the 6th century BC;
Celtiberian from the 2nd century BC; Primitive Irish
Ogham inscription
Roughly 400 inscriptions in the ogham alphabet are known from stone monuments scattered around the Irish Sea, the bulk of them dating to the fifth and sixth centuries. The language of these inscriptions is predominantly Primitive Irish, but a ...
s from the 4th or 5th century AD, earliest inscriptions in
Old Welsh
Old Welsh () is the stage of the Welsh language from about 800 AD until the early 12th century when it developed into Middle Welsh.Koch, p. 1757. The preceding period, from the time Welsh became distinct from Common Brittonic around 550, ha ...
from the 7th century AD. Modern Celtic languages include
Welsh,
Cornish,
Breton
Breton most often refers to:
*anything associated with Brittany, and generally
**Breton people
**Breton language, a Southwestern Brittonic Celtic language of the Indo-European language family, spoken in Brittany
** Breton (horse), a breed
**Gale ...
,
Scottish Gaelic
Scottish Gaelic (, ; Endonym and exonym, endonym: ), also known as Scots Gaelic or simply Gaelic, is a Celtic language native to the Gaels of Scotland. As a member of the Goidelic language, Goidelic branch of Celtic, Scottish Gaelic, alongs ...
,
Irish and
Manx.
*
Germanic (from
Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from ...
), earliest attestations in
runic
Runes are the letters in a set of related alphabets, known as runic rows, runic alphabets or futharks (also, see '' futhark'' vs ''runic alphabet''), native to the Germanic peoples. Runes were primarily used to represent a sound value (a ...
inscriptions from around the 2nd century AD, earliest coherent texts in
Gothic, 4th century AD.
Old English
Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
manuscript tradition from about the 8th century AD. Includes
English,
Frisian,
German
German(s) may refer to:
* Germany, the country of the Germans and German things
**Germania (Roman era)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizenship in Germany, see also Ge ...
,
Dutch
Dutch or Nederlands commonly refers to:
* Something of, from, or related to the Netherlands
** Dutch people as an ethnic group ()
** Dutch nationality law, history and regulations of Dutch citizenship ()
** Dutch language ()
* In specific terms, i ...
,
Scots,
Danish,
Swedish,
Norwegian,
Afrikaans
Afrikaans is a West Germanic languages, West Germanic language spoken in South Africa, Namibia and to a lesser extent Botswana, Zambia, Zimbabwe and also Argentina where there is a group in Sarmiento, Chubut, Sarmiento that speaks the Pat ...
,
Yiddish
Yiddish, historically Judeo-German, is a West Germanic language historically spoken by Ashkenazi Jews. It originated in 9th-century Central Europe, and provided the nascent Ashkenazi community with a vernacular based on High German fused with ...
,
Low German
Low German is a West Germanic languages, West Germanic language variety, language spoken mainly in Northern Germany and the northeastern Netherlands. The dialect of Plautdietsch is also spoken in the Russian Mennonite diaspora worldwide. "Low" ...
,
Icelandic,
Elfdalian, and
Faroese.
*
Hellenic (from
Proto-Greek
The Proto-Greek language (also known as Proto-Hellenic) is the Indo-European language which was the last common ancestor of all varieties of Greek, including Mycenaean Greek, the subsequent ancient Greek dialects (i.e., Attic, Ionic, Ae ...
, see also
History of Greek
Greek language, Greek is an Indo-European language, the sole surviving descendant of the Hellenic languages, Hellenic sub-family. Although it split off from other Indo-European languages around the 3rd millennium BCE (or possibly before), it is f ...
); fragmentary records in
Mycenaean Greek from between 1450 and 1350 BC have been found.
Homer
Homer (; , ; possibly born ) was an Ancient Greece, Ancient Greek poet who is credited as the author of the ''Iliad'' and the ''Odyssey'', two epic poems that are foundational works of ancient Greek literature. Despite doubts about his autho ...
ic texts date to the 8th century BC.
*
Indo-Iranian, attested , descended from
Proto-Indo-Iranian
Proto-Indo-Iranian, also called Proto-Indo-Iranic or Proto-Aryan, is the reconstructed proto-language of the Indo-Iranian branch of Indo-European. Its speakers, the hypothetical Proto-Indo-Iranians, are assumed to have lived in the late 3rd ...
(dated to the late 3rd millennium BC).
**
Indo-Aryan, attested from around 1400 BC in
Hittite texts from
Anatolia
Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
, showing
traces of Indo-Aryan words. Epigraphically from the 3rd century BC in the form of
Prakrit
Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
(
Edicts of Ashoka
The Edicts of Ashoka are a collection of more than thirty inscriptions on the Pillars of Ashoka, as well as boulders and cave walls, attributed to Emperor Ashoka of the Maurya Empire who ruled most of the Indian subcontinent from 268 BCE to 2 ...
). The
Rigveda
The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
is assumed to preserve intact records
via oral tradition dating from c. the mid-2nd millennium BC in the form of
Vedic Sanskrit
Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
. Includes a wide range of modern languages from
North India
North India is a geographical region, loosely defined as a cultural region comprising the northern part of India (or historically, the Indian subcontinent) wherein Indo-Aryans (speaking Indo-Aryan languages) form the prominent majority populati ...
, Eastern Pakistan and Bangladesh, including
Hindustani (
Hindi
Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
,
Urdu
Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
),
Bengali
Bengali or Bengalee, or Bengalese may refer to:
*something of, from, or related to Bengal, a large region in South Asia
* Bengalis, an ethnic and linguistic group of the region
* Bengali language, the language they speak
** Bengali alphabet, the w ...
,
Odia,
Assamese,
Punjabi,
Kashmiri,
Gujarati,
Marathi
Marathi may refer to:
*Marathi people, an Indo-Aryan ethnolinguistic group of Maharashtra, India
**Marathi people (Uttar Pradesh), the Marathi people in the Indian state of Uttar Pradesh
*Marathi language, the Indo-Aryan language spoken by the Mar ...
,
Sindhi and
Nepali, as well as
Sinhala of
Sri Lanka
Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
and
Dhivehi of the
Maldives
The Maldives, officially the Republic of Maldives, and historically known as the Maldive Islands, is an Archipelagic state, archipelagic country in South Asia located in the Indian Ocean. The Maldives is southwest of Sri Lanka and India, abou ...
and
Minicoy
Minicoy, locally known as Maliku (), is an island in Lakshadweep, India. Along with Viringili, it is on Maliku atoll, the southernmost atoll of Lakshadweep archipelago. Administratively, it is a census town in the Indian States and territories ...
.
**
Iranian
Iranian () may refer to:
* Something of, from, or related to Iran
** Iranian diaspora, Iranians living outside Iran
** Iranian architecture, architecture of Iran and parts of the rest of West Asia
** Iranian cuisine, cooking traditions and practic ...
or Iranic, attested from roughly 1000 BC in the form of
Avestan
Avestan ( ) is the liturgical language of Zoroastrianism. It belongs to the Iranian languages, Iranian branch of the Indo-European languages, Indo-European language family and was First language, originally spoken during the Avestan period, Old ...
. Epigraphically from 520 BC in the form of
Old Persian
Old Persian is one of two directly attested Old Iranian languages (the other being Avestan) and is the ancestor of Middle Persian (the language of the Sasanian Empire). Like other Old Iranian languages, it was known to its native speakers as (I ...
(
Behistun inscription
The Behistun Inscription (also Bisotun, Bisitun or Bisutun; , Old Persian: Bagastana, meaning "the place of god") is a multilingual Achaemenid royal inscriptions, Achaemenid royal inscription and large rock relief on a cliff at Mount Behistun i ...
). Includes
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
,
Pashto
Pashto ( , ; , ) is an eastern Iranian language in the Indo-European language family, natively spoken in northwestern Pakistan and southern and eastern Afghanistan. It has official status in Afghanistan and the Pakistani province of Khyb ...
,
Kurdish
Kurdish may refer to:
*Kurds or Kurdish people
*Kurdish language
** Northern Kurdish (Kurmanji)
**Central Kurdish (Sorani)
**Southern Kurdish
** Laki Kurdish
*Kurdish alphabets
*Kurdistan, the land of the Kurdish people which includes:
**Southern ...
,
Balochi,
Luri,
Tajik, and
Ossetian.
**
Nuristani, attested since the 20th century, are among the newest Indo-European languages to be studied. Includes
Katë,
Prasun,
Ashkun,
Nuristani Kalasha
Nuristani Kalasha ('), also known as ''Waigali'', is a Nuristani language spoken by about 10,000 people in the Nuristan Province of Afghanistan. The native name is ''Kalaṣa-alâ'' 'Kalasha-language'. "Waigali" refers to the dialect of the V� ...
,
Tregami
Tregami is a Nuristani language spoken in the villages of Gambir, Kaṭâr, and Devoz in the Tregâm Valley off the lower Pech River in the Watapur District of Kunar Province in Afghanistan. The area is in the Hindu Kush along the border with P ...
, and
Zemiaki
Zemiaki is a Nuristani language spoken by some 400–500 people in the Kunar Province of Afghanistan.
It is named after the settlement in which it is spoken, from the Pashto "language of Zemyaki", the native equivalent being ''J̌amlām-am baš ...
.
*
Italic (from
Proto-Italic
The Proto-Italic language is the ancestor of the Italic languages, most notably Latin and its descendants, the Romance languages. It is not directly attested in writing, but has been reconstructed to some degree through the comparative method. ...
), attested from the 7th century BC. Includes the ancient
Osco-Umbrian languages
The Osco-Umbrian, Sabellic or Sabellian languages are an extinct group of Italic languages, the Indo-European languages that were spoken in central and southern Italy by the Osco-Umbrians before being replaced by Latin, as the power of ancient Ro ...
,
Faliscan, as well as
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
and its descendants, the
Romance languages
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
, such as
Italian
Italian(s) may refer to:
* Anything of, from, or related to the people of Italy over the centuries
** Italians, a Romance ethnic group related to or simply a citizen of the Italian Republic or Italian Kingdom
** Italian language, a Romance languag ...
and
French.
*
Tocharian, with proposed links to the
Afanasevo culture
The Afanasievo culture, or Afanasevo culture (Afanasevan culture) ( ''Afanas'yevskaya kul'tura''), is an early archaeological culture of south Siberia, occupying the Minusinsk Basin and the Altai Mountains during the eneolithic era, 3300 to 2 ...
of Southern Siberia. Extant in two dialects (Turfanian and Kuchean, or Tocharian A and B), attested during roughly the 6th–9th centuries AD. Marginalized by the Old Turkic
Uyghur Khaganate
The Uyghur Khaganate (also Uyghur Empire or Uighur Khaganate, self defined as Toquz-Oghuz country; , Tang-era names, with modern Hanyu Pinyin: or ) was a Turkic empire that existed for about a century between the mid 8th and 9th centuries. It ...
and probably extinct by the 10th century.
In addition to the classical ten branches listed above, several extinct and little-known languages and language-groups have existed or are proposed to have existed:
*
Ancient Belgian: hypothetical language associated with the proposed
Nordwestblock
The Nordwestblock (German language, German, "Northwest Block") is a hypothetical Northwestern European cultural region that some scholars propose as a prehistoric culture in the present-day Netherlands, Belgium, far-northern France, and Northern ...
cultural area. Speculated to be connected to Italic or Venetic, and to have certain phonological features in common with Lusitanian.
*
Cimmerian
The Cimmerians were an Ancient Iranian peoples, ancient Eastern Iranian languages, Eastern Iranian peoples, Iranic Eurasian nomads, equestrian nomadic people originating in the Pontic–Caspian steppe, part of whom subsequently migrated into W ...
: possibly Iranic, Thracian, or Celtic
*
Dacian: possibly very close to Thracian
*
Elymian
The Elymians () were an ancient tribal people who inhabited the western part of Sicily during the Bronze Age and Classical antiquity.
Origins
According to Thucydides, the Elymians were refugees coming from the destroyed Troy. Instead for Hellan ...
: Poorly-attested language spoken by the
Elymians
The Elymians () were an ancient tribe, tribal people who inhabited the western part of Sicily during the Bronze Age and Classical antiquity.
Origins
According to Thucydides, the Elymians were refugees coming from the destroyed Troy. Instead for ...
, one of the three indigenous (i.e. pre-Greek and pre-Punic) tribes of Sicily. Indo-European affiliation widely accepted, possibly related to Italic or Anatolian.
*
Illyrian: possibly related to Albanian, Messapian, or both
*
Liburnian
The Liburnians or Liburni () were an ancient tribe inhabiting the district called Liburnia, a coastal region of the northeastern Adriatic between the rivers ''Arsia'' ( Raša) and ''Titius'' ( Krka) in what is now Croatia. According to Strabo ...
: evidence too scant and uncertain to determine anything with certainty
*
Ligurian: possibly close to or part of Celtic.
*
Lusitanian: possibly related to (or part of) Celtic, Ligurian, or Italic
*
Ancient Macedonian: proposed relationship to Greek.
*
Messapic
Messapic (; also known as Messapian; or as Iapygian) is an extinct Indo-European Paleo-Balkanic language of the southeastern Italian Peninsula, once spoken in Salento by the Iapygian peoples of the region: the Calabri and Salentini (known colle ...
: not conclusively deciphered, often considered to be related to Albanian as the available fragmentary linguistic evidence shows common characteristic innovations and a number of significant lexical correspondences between the two languages
*
Paionian: extinct language once spoken north of Macedon
*
Phrygian: language of the ancient
Phrygians
The Phrygians (Greek: Φρύγες, ''Phruges'' or ''Phryges'') were an ancient Indo-European speaking people who inhabited central-western Anatolia (modern-day Turkey) in antiquity.
Ancient Greek authors used "Phrygian" as an umbrella term t ...
. Very likely, but not certainly, a sister group to Hellenic.
*
Sicel
The Sicels ( ; or ''Siculī'') were an Indo-European tribe who inhabited eastern Sicily, their namesake, during the Iron Age. They spoke the Siculian language. After the defeat of the Sicels at the Battle of Nomae in 450 BC and the death of ...
: an ancient language spoken by the Sicels (Greek Sikeloi, Latin Siculi), one of the three indigenous (i.e. pre-Greek and pre-Punic) tribes of Sicily. Proposed relationship to Latin or Proto-Illyrian (Pre-Indo-European) at an earlier stage.
*
Sorothaptic
The Paleo-Hispanic or Paleo-Iberian languages are the languages of the Pre-Roman peoples of the Iberian Peninsula, excluding languages of foreign colonies, such as Greek in Emporion and Phoenician in Qart Hadast. After the Roman conquest of His ...
: proposed, pre-Celtic, Iberian language
*
Thracian
The Thracians (; ; ) were an Indo-European speaking people who inhabited large parts of Southeast Europe in ancient history.. "The Thracians were an Indo-European people who occupied the area that today is shared between north-eastern Greece, ...
: possibly including Dacian
*
Venetic
Venetic ( ) is an extinct Indo-European language, most commonly classified into the Italic subgroup, that was spoken by the Veneti people in ancient times in northeast Italy (Veneto and Friuli) and part of modern Slovenia, between the Po ...
: shares several similarities with Latin and the Italic languages, but also has some affinities with other IE languages, especially Germanic and Celtic.
Membership of languages in the Indo-European language family is determined by
genealogical
Genealogy () is the study of families, family history, and the tracing of their lineages. Genealogists use oral interviews, historical records, genetic analysis, and other records to obtain information about a family and to demonstrate kin ...
relationships, meaning that all members are presumed descendants of a common ancestor,
Proto-Indo-European
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Euro ...
. Membership in the various branches, groups, and subgroups of Indo-European is also genealogical, but here the defining factors are ''shared innovations'' among various languages, suggesting a common ancestor that split off from other Indo-European groups. For example, what makes the Germanic languages a branch of Indo-European is that much of their structure and phonology can be stated in rules that apply to all of them. Many of their common features are presumed innovations that took place in
Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from ...
, the source of all the Germanic languages.
In the 21st century, several attempts have been made to model the phylogeny of Indo-European languages using Bayesian methodologies similar to those applied to problems in biological phylogeny.
Although there are differences in absolute timing between the various analyses, there is much commonality between them, including the result that the first known language groups to diverge were the Anatolian and Tocharian language families, in that order.
Tree versus wave model
The "
tree model
In historical linguistics, the tree model (also Stammbaum, genetic, or cladistic model) is a model of the evolution of languages analogous to the concept of a family tree, particularly a phylogenetic tree in the biological evolution of species. ...
" is considered an appropriate representation of the genealogical history of a language family if communities do not remain in contact after their languages have started to diverge. In this case, subgroups defined by shared innovations form a nested pattern. The tree model is not appropriate in cases where languages remain in contact as they diversify; in such cases subgroups may overlap, and the "
wave model
In historical linguistics, the wave model or wave theory () is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, being adopted by a gradually expa ...
" is a more accurate representation. Most approaches to Indo-European subgrouping to date have assumed that the tree model is by-and-large valid for Indo-European; however, there is also a long tradition of wave-model approaches.
In addition to genealogical changes, many of the early changes in Indo-European languages can be attributed to
language contact
Language contact occurs when speakers of two or more languages or varieties interact with and influence each other. The study of language contact is called contact linguistics. Language contact can occur at language borders, between adstratum ...
. It has been asserted, for example, that many of the more striking features shared by Italic languages (Latin, Oscan, Umbrian, etc.) might well be
areal features
In geolinguistics, areal features are elements shared by languages or dialects in a geographic area, particularly when such features are not descended from a common ancestor or proto-language. An areal feature is contrasted with genetic relat ...
. More certainly, very similar-looking alterations in the systems of
long vowel
In linguistics, vowel length is the perceived or actual duration of a vowel sound when pronounced. Vowels perceived as shorter are often called short vowels and those perceived as longer called long vowels.
On one hand, many languages do not d ...
s in the West Germanic languages greatly postdate any possible notion of a
proto-language
In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unatte ...
innovation (and cannot readily be regarded as "areal", either, because English and continental West Germanic were not a linguistic area). In a similar vein, there are many similar innovations in Germanic and Balto-Slavic that are far more likely areal features than traceable to a common proto-language, such as the uniform development of a
high vowel
A close vowel, also known as a high vowel (in U.S. terminology), is any in a class of vowel sounds used in many spoken languages. The defining characteristic of a close vowel is that the tongue is positioned approximately as close as possible to ...
(*''u'' in the case of Germanic, *''i/u'' in the case of Baltic and Slavic) before the PIE syllabic resonants *''ṛ, *ḷ, *ṃ, *ṇ'', unique to these two groups among IE languages, which is in agreement with the wave model. The
Balkan sprachbund even features areal convergence among members of very different branches.
An extension to the ''
Ringe-
Warnow
The Warnow () is a river in the state of Mecklenburg-Vorpommern in Germany. It flows into the Baltic Sea near the town of Rostock
Rostock (; Polabian language, Polabian: ''Roztoc''), officially the Hanseatic and University City of Rostock ( ...
model of language evolution'' suggests that early IE had featured limited contact between distinct lineages, with only the Germanic subfamily exhibiting a less treelike behaviour as it acquired some characteristics from neighbours early in its evolution. The internal diversification of especially West Germanic is cited to have been radically non-treelike.
Proposed subgroupings
Specialists have postulated the existence of higher-order subgroups such as
Italo-Celtic
In historical linguistics, Italo-Celtic is a hypothetical grouping of the Italic and Celtic branches of the Indo-European language family on the basis of features shared by these two branches and no others. There is controversy about the causes o ...
,
Graeco-Armenian
Graeco-Armenian (or Helleno-Armenian) is the hypothetical common ancestor of Greek (or Hellenic) and Armenian branches that postdates the Proto-Indo-European language. Its status is somewhat similar to that of the Italo-Celtic grouping: each ...
,
Graeco-Aryan
Graeco-Aryan, or Graeco-Armeno-Aryan, is a hypothetical clade within the Indo-European family that would be the ancestor of Hellenic, Armenian, and the Indo-Iranian languages, which spans Southern Europe, Armenian highlands and Southern Asi ...
or Graeco-Armeno-Aryan, and Balto-Slavo-Germanic. However, unlike the ten traditional branches, these are all controversial to a greater or lesser degree.
The Italo-Celtic subgroup was at one point uncontroversial, considered by
Antoine Meillet
Paul Jules Antoine Meillet (; 11 November 1866 – 21 September 1936) was one of the most important French linguists of the early 20th century. He began his studies at the Sorbonne University, where he was influenced by Michel Bréal, the Swiss l ...
to be even better established than Balto-Slavic. The main lines of evidence included the genitive suffix ''-ī''; the superlative suffix ''-m̥mo''; the change of /p/ to /kʷ/ before another /kʷ/ in the same word (as in ''penkʷe'' > ''*kʷenkʷe'' > Latin , Old Irish ); and the subjunctive morpheme ''-ā-''. This evidence was prominently challenged by
Calvert Watkins
Calvert Watkins ( /ˈwɒtkɪnz/; March 13, 1933 – March 20, 2013) was an American linguist and philologist, known for his book '' How to Kill a Dragon''. He was a professor of linguistics and the classics at Harvard University and after retirem ...
, while Michael Weiss has argued for the subgroup.
Evidence for a relationship between Greek and Armenian includes the regular change of the
second laryngeal to ''a'' at the beginnings of words, as well as terms for "woman" and "sheep". Greek and Indo-Iranian share innovations mainly in verbal morphology and patterns of nominal derivation. Relations have also been proposed between Phrygian and Greek, and between Thracian and Armenian. Some fundamental shared features, like the
aorist
Aorist ( ; abbreviated ) verb forms usually express perfective aspect and refer to past events, similar to a preterite. Ancient Greek grammar had the aorist form, and the grammars of other Indo-European languages and languages influenced by the ...
(a verb form denoting action without reference to duration or completion) having the perfect active particle -s fixed to the stem, link this group closer to Anatolian languages and Tocharian. Shared features with Balto-Slavic languages, on the other hand (especially present and preterit formations), might be due to later contacts.
The
Indo-Hittite
In Indo-European linguistics, the term Indo-Hittite (also Indo-Anatolian) refers to Edgar Howard Sturtevant's 1926 hypothesis that the Anatolian languages split off a Pre-Proto-Indo-European language considerably earlier than the separation o ...
hypothesis proposes that the Indo-European language family consists of two main branches: one represented by the Anatolian languages and another branch encompassing all other Indo-European languages. Features that separate Anatolian from all other branches of Indo-European (such as the gender or the verb system) have been interpreted alternately as archaic debris or as innovations due to prolonged isolation. Points proffered in favour of the Indo-Hittite hypothesis are the (non-universal) Indo-European agricultural terminology in Anatolia and the preservation of laryngeals. However, in general this hypothesis is considered to attribute too much weight to the Anatolian evidence. According to another view, the Anatolian subgroup left the Indo-European parent language comparatively late, approximately at the same time as Indo-Iranian and later than the Greek or Armenian divisions. A third view, especially prevalent in the so-called French school of Indo-European studies, holds that extant similarities in non-
satem
Languages of the Indo-European family are classified as either centum languages or satem languages according to how the dorsal consonants (sounds of "K", "G" and "Y" type) of the reconstructed Proto-Indo-European language (PIE) developed. An ...
languages in general—including Anatolian—might be due to their peripheral location in the Indo-European language-area and to early separation, rather than indicating a special ancestral relationship. Hans J. Holm, based on lexical calculations, arrives at a picture roughly replicating the general scholarly opinion and refuting the Indo-Hittite hypothesis.
Satem and centum languages

The division of the Indo-European languages into satem and centum groups was put forward by Peter von Bradke in 1890, although
Karl Brugmann
Friedrich Karl Brugmann (; 16 March 1849 – 29 June 1919) was a German linguist. He is noted for his work in Indo-European linguistics.
Biography
Friedrich Karl Brugman was born in Wiesbaden to a middle-class family in 1849.
He was educated a ...
did propose a similar type of division in 1886. In the satem languages, which include the Balto-Slavic and Indo-Iranian branches, as well as (in most respects) Albanian and Armenian, the reconstructed
Proto-Indo-European palatovelars remained distinct and were fricativized, while the labiovelars merged with the 'plain velars'. In the centum languages, the palatovelars merged with the plain velars, while the labiovelars remained distinct. The results of these alternative developments are exemplified by the words for "hundred" in Avestan () and Latin ()—the initial palatovelar developed into a fricative in the former, but became an ordinary velar in the latter.
Rather than being a genealogical separation, the centum–satem division is commonly seen as resulting from innovative changes that spread across PIE dialect-branches over a particular geographical area; the centum–satem
isogloss
An isogloss, also called a heterogloss, is the geographic boundary of a certain linguistics, linguistic feature, such as the pronunciation of a vowel, the meaning of a word, or the use of some morphological or syntactic feature. Isoglosses are a ...
intersects a number of other isoglosses that mark distinctions between features in the early IE branches. It may be that the centum branches in fact reflect the original state of affairs in PIE, and only the satem branches shared a set of innovations, which affected all but the peripheral areas of the PIE dialect continuum. Kortlandt proposes that the ancestors of Balts and Slavs took part in satemization before being drawn later into the western Indo-European sphere.
Proposed external relations
From the very beginning of Indo-European studies, there have been attempts to link the Indo-European languages genealogically to other languages and language families. However, these theories remain highly controversial, and most specialists in Indo-European linguistics are sceptical or agnostic about such proposals.
Proposals linking the Indo-European languages with a single language family include:
*
Indo-Uralic
Indo-Uralic is a controversial linguistic hypothesis proposing a genealogical family consisting of Indo-European and Uralic.
The suggestion of a genetic relationship between Indo-European and Uralic is often credited to the Danish linguist Vi ...
, joining Indo-European with
Uralic
The Uralic languages ( ), sometimes called the Uralian languages ( ), are spoken predominantly in Europe and North Asia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian. Other languages with speakers abo ...
*
Pontic, postulated by
John Colarusso
John Colarusso is a linguist specializing in Caucasian languages. Since 1976, he has taught at McMaster University in Hamilton, Ontario.
Colarusso has published more than sixty-five articles on linguistics, myths, politics, and the Caucasus; he ...
, which joins Indo-European with
Northwest Caucasian
Other proposed families include:
*
Nostratic
Nostratic is a hypothetical language macrofamily including many of the language families of northern Eurasia first proposed in 1903. Though a historically important proposal, it is now generally considered a fringe theory. Its exact compositi ...
, comprising all or some of the Eurasiatic languages and the
Kartvelian,
Dravidian (or wider,
Elamo-Dravidian
The Elamo-Dravidian language family is a hypothesised language family that links the Elamite language of ancient Elam (present-day southwestern Iran, and southeastern Iraq) to the Dravidian languages of South Asia. The latest version (2015) of t ...
) and
Afroasiatic language families
*
Eurasiatic
Eurasiatic is a hypothetical and controversial language macrofamily proposal that would include many language families historically spoken in northern, western, and southern Eurasia.
The idea of a Eurasiatic superfamily dates back more than 100 ...
, a theory championed by
Joseph Greenberg
Joseph Harold Greenberg (May 28, 1915 – May 7, 2001) was an American linguist, known mainly for his work concerning linguistic typology and the genetic classification of languages.
Life Early life and education
Joseph Greenberg was born on M ...
, comprising the
Uralic
The Uralic languages ( ), sometimes called the Uralian languages ( ), are spoken predominantly in Europe and North Asia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian. Other languages with speakers abo ...
,
Altaic
The Altaic () languages are a group of languages comprising the Turkic, Mongolic and Tungusic language families, with some linguists including the Koreanic and Japonic families. These languages share agglutinative morphology, head-final ...
and various '
Paleosiberian
The Paleo-Siberian languages are several language isolates and small language families spoken in parts of Siberia. They are not known to have any genetic relationship to each other; their only common link is that they are held to have antedated ...
' families (
Ainu,
Yukaghir
The Yukaghirs, or Yukagirs ( (), ), are a Siberian ethnic group in the Russian Far East, living in the basin of the Kolyma River.
Geographic distribution
The Tundra Yukaghirs live in the Lower Kolyma region in the Sakha Republic; the Taiga Y ...
,
Nivkh
Nivkh or Amuric or Gilyak may refer to:
* Nivkh people (''Nivkhs'') or Gilyak people (''Gilyaks'')
* Nivkh languages or Gilyak languages
* Gilyak class gunboat, ''Gilyak'' class gunboat, such as the Russian gunboat Korietz#Second gunboat, second R ...
,
Chukotko-Kamchatkan,
Eskimo–Aleut
The Eskaleut ( ), Eskimo–Aleut or Inuit–Yupik–Unangan languages are a language family native to the northern portions of the North American continent, and a small part of northeastern Asia. Languages in the family are indigenous to parts of ...
) and possibly others
Nostratic and Eurasiatic, in turn, have been included in even wider groupings, such as
Borean
Borean (also Boreal or Boralean) is a hypothetical (i.e. proposed) linguistic macrofamily that encompasses almost all language families worldwide except those native to the Americas, Africa, Oceania, and the Andaman Islands. It is considered a fri ...
, a language family separately proposed by
Harold C. Fleming
Harold Crane Fleming (December 23, 1926 – April 29, 2015) was an American anthropologist and historical linguist specializing in the cultures and languages of the Horn of Africa. As an adherent of the Four Field School of American anth ...
and
Sergei Starostin
Sergei Anatolyevich Starostin (; March 24, 1953 – September 30, 2005) was a Russian historical linguistics, historical linguist and philology, philologist, perhaps best known for his reconstructions of hypothetical proto-languages, including hi ...
that encompasses almost all of the world's natural languages with the exception of those native to
sub-Saharan Africa
Sub-Saharan Africa is the area and regions of the continent of Africa that lie south of the Sahara. These include Central Africa, East Africa, Southern Africa, and West Africa. Geopolitically, in addition to the list of sovereign states and ...
,
New Guinea
New Guinea (; Hiri Motu: ''Niu Gini''; , fossilized , also known as Papua or historically ) is the List of islands by area, world's second-largest island, with an area of . Located in Melanesia in the southwestern Pacific Ocean, the island is ...
,
Australia
Australia, officially the Commonwealth of Australia, is a country comprising mainland Australia, the mainland of the Australia (continent), Australian continent, the island of Tasmania and list of islands of Australia, numerous smaller isl ...
, and the
Andaman Islands
The Andaman Islands () are an archipelago, made up of 200 islands, in the northeastern Indian Ocean about southwest off the coasts of Myanmar's Ayeyarwady Region. Together with the Nicobar Islands to their south, the Andamans serve as a mari ...
.
Evolution
Proto-Indo-European

The proposed Proto-Indo-European language (PIE) is the
reconstructed common ancestor of the Indo-European languages, spoken by the
Proto-Indo-Europeans
The Proto-Indo-Europeans are a hypothetical prehistoric ethnolinguistic group of Eurasia who spoke Proto-Indo-European (PIE), the reconstructed common ancestor of the Indo-European language family.
Knowledge of them comes chiefly from t ...
. From the 1960s, knowledge of Anatolian became certain enough to establish its relationship to PIE. Using the method of
internal reconstruction
Internal reconstruction is a method of reconstructing an earlier state in a language's history using only language-internal evidence of the language in question.
The comparative method compares variations between languages, such as in sets of co ...
, an earlier stage, called Pre-Proto-Indo-European, has been proposed.
PIE is an
inflected language
Fusional languages or inflected languages are a type of synthetic language, distinguished from agglutinative languages by their tendency to use single inflectional morphemes to denote multiple grammatical, syntactic, or semantic features.
For e ...
, in which the grammatical relationships between words were signalled through inflectional morphemes (usually endings). The
roots
A root is the part of a plant, generally underground, that anchors the plant body, and absorbs and stores water and nutrients.
Root or roots may also refer to:
Art, entertainment, and media
* ''The Root'' (magazine), an online magazine focusin ...
of PIE are basic
morpheme
A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...
s carrying a
lexical
Lexical may refer to:
Linguistics
* Lexical corpus or lexis, a complete set of all words in a language
* Lexical item, a basic unit of lexicographical classification
* Lexicon, the vocabulary of a person, language, or branch of knowledge
* Lexical ...
meaning. By addition of
suffix
In linguistics, a suffix is an affix which is placed after the stem of a word. Common examples are case endings, which indicate the grammatical case of nouns and adjectives, and verb endings, which form the conjugation of verbs. Suffixes can ca ...
es, they form
stems, and by addition of
endings
End, END, Ending, or ENDS may refer to:
End Mathematics
*End (category theory)
*End (topology)
*End (graph theory)
*End (graph_theory)#Cayley_graphs, End (group theory) (a subcase of the previous)
*End (endomorphism) Sports and games
*End (gridir ...
, these form grammatically inflected words (
nouns
In grammar, a noun is a word that represents a concrete or abstract thing, like living creatures, places, actions, qualities, states of existence, and ideas. A noun may serve as an object or subject within a phrase, clause, or sentence.Example n ...
or
verbs
A verb is a word that generally conveys an action (''bring'', ''read'', ''walk'', ''run'', ''learn''), an occurrence (''happen'', ''become''), or a state of being (''be'', ''exist'', ''stand''). In the usual description of English, the basic fo ...
). The reconstructed
Indo-European verb
Proto-Indo-European verbs reflect a complex system of morphology, more complicated than the substantive, with verbs categorized according to their grammatical aspect, aspect, using multiple grammatical moods and grammatical voice, voices, and be ...
system is complex and, like the noun, exhibits a system of
ablaut
In linguistics, the Indo-European ablaut ( , from German ) is a system of apophony (regular vowel variations) in the Proto-Indo-European language (PIE).
An example of ablaut in English is the strong verb ''sing, sang, sung'' and its relate ...
.
Diversification
The diversification of the parent language into the attested branches of daughter languages is historically unattested. The timeline of the evolution of the various daughter languages, on the other hand, is mostly undisputed, quite regardless of the question of
Indo-European origins
The Proto-Indo-European homeland was the prehistoric linguistic homeland, homeland of the Proto-Indo-European language (PIE), meaning it was the region where the proto-language was spoken before it split into the dialects from which the earliest ...
.
Using a mathematical analysis borrowed from evolutionary biology,
Donald Ringe
Donald Ringe () is an American linguist and Indo-Europeanist.
He has been described as a historical linguist and as a mathematical linguist. He is multi-lingual. His work is on language family trees and the Proto-Indo-European language, a ...
and
Tandy Warnow
Tandy Warnow is an American computer scientist and Grainger Distinguished Chair in Engineering at the University of Illinois at Urbana–Champaign.Tandy Warnow's , retrieved 2020-09-10. She is known for her work on the reconstruction of evolution ...
propose the following evolutionary tree of Indo-European branches:
* Pre-
Anatolian (before 3500 BC)
* Pre-
Tocharian
* Pre-Italic and Pre-Celtic (before 2500 BC)
* Pre-Armenian and Pre-Greek (after 2500 BC)
* Proto-
Indo-Iranian (2000 BC)
* Pre-Germanic and Pre-Balto-Slavic; Proto-Germanic
David Anthony proposes the following sequence:
* Pre-
Anatolian (4200 BC)
* Pre-
Tocharian (3700 BC)
*
Pre-Germanic (3300 BC)
* Pre-Italic and Pre-Celtic (3000 BC)
* Pre-Armenian (2800 BC)
* Pre-Balto-Slavic (2800 BC)
* Pre-Greek (2500 BC)
* Proto-
Indo-Iranian (2200 BC); split into Iranian and Old Indic 1800 BC
From 1500 BC the following sequence may be given:
* 1500–1000 BC: The
Nordic Bronze Age
The Nordic Bronze Age (also Northern Bronze Age, or Scandinavian Bronze Age) is a period of Scandinavian prehistory from .
The Nordic Bronze Age culture emerged about 1750 BC as a continuation of the Late Neolithic Dagger period, which is root ...
of
Scandinavia
Scandinavia is a subregion#Europe, subregion of northern Europe, with strong historical, cultural, and linguistic ties between its constituent peoples. ''Scandinavia'' most commonly refers to Denmark, Norway, and Sweden. It can sometimes also ...
develops
pre-Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the reconstructed proto-language of the Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from pre-Proto-Germanic into three Germanic branc ...
, and the (pre-) Proto-Celtic
Urnfield
The Urnfield culture () was a late Bronze Age culture of Central Europe, often divided into several local cultures within a broader Urnfield tradition. The name comes from the custom of cremating the dead and placing their ashes in urns, which ...
and
Hallstatt
Hallstatt () is a small town in the district of Gmunden District, Gmunden, in the Austrian state of Upper Austria. Situated between the southwestern shore of Hallstätter See and the steep slopes of the Dachstein massif, the town lies in the Sa ...
cultures emerge in Central Europe, introducing the
Iron Age
The Iron Age () is the final epoch of the three historical Metal Ages, after the Chalcolithic and Bronze Age. It has also been considered as the final age of the three-age division starting with prehistory (before recorded history) and progre ...
. Migration of the Proto-
Italic speakers into the Italian peninsula (
Bagnolo stele
The Bagnolo steles are two stone boulders found in Ceresolo-Bagnolo, Malegno commune, Brescia province, Lombardia, Northern Italy, at the base of Monte Mignone, at an altitude of ca. 700 m. Bagnolo 1 was discovered in 1963, bearing depictions of ...
).
Migration of Aryans to India followed by the redaction of the
Rigveda
The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
; rise of the
Vedic civilization
The Vedic period, or the Vedic age (), is the period in the late Bronze Age and early Iron Age of the history of India when the Vedic literature, including the Vedas (–900 BCE), was composed in the northern Indian subcontinent, between the e ...
and
beginning of Iron Age in the
Punjab
Punjab (; ; also romanised as Panjāb or Panj-Āb) is a geopolitical, cultural, and historical region in South Asia. It is located in the northwestern part of the Indian subcontinent, comprising areas of modern-day eastern Pakistan and no ...
. The
Mycenaean civilization
Mycenaean Greece (or the Mycenaean civilization) was the last phase of the Bronze Age in ancient Greece, spanning the period from approximately 1750 to 1050 BC.. It represents the first advanced and distinctively Greek civilization in mainla ...
gives way to the
Greek Dark Ages
The Greek Dark Ages ( 1180–800 BC) were earlier regarded as two continuous periods of Greek history: the Postpalatial Bronze Age (c. 1180–1050 BC) and the Prehistoric Iron Age or Early Iron Age (c. 1050–800 BC). The last included all the ...
. Hittite goes extinct.
Iranian speakers start migrating southwards to
Greater Iran
Greater Iran or Greater Persia ( ), also called the Iranosphere or the Persosphere, is an expression that denotes a wide socio-cultural region comprising parts of West Asia, the South Caucasus, Central Asia, South Asia, and East Asia (specifica ...
.
Balto-Slavic
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
splits into ancestors of modern
Baltic
Baltic may refer to:
Peoples and languages
*Baltic languages, a subfamily of Indo-European languages, including Lithuanian, Latvian and extinct Old Prussian
*Balts (or Baltic peoples), ethnic groups speaking the Baltic languages and/or originatin ...
and
Slavic
Slavic, Slav or Slavonic may refer to:
Peoples
* Slavic peoples, an ethno-linguistic group living in Europe and Asia
** East Slavic peoples, eastern group of Slavic peoples
** South Slavic peoples, southern group of Slavic peoples
** West Slav ...
.
* 1000–500 BC: The
Celtic languages
The Celtic languages ( ) are a branch of the Indo-European language family, descended from the hypothetical Proto-Celtic language. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yve ...
spread over Central and Western Europe, including
Britain
Britain most often refers to:
* Great Britain, a large island comprising the countries of England, Scotland and Wales
* The United Kingdom of Great Britain and Northern Ireland, a sovereign state in Europe comprising Great Britain and the north-eas ...
.
Baltic languages
The Baltic languages are a branch of the Indo-European languages, Indo-European language family spoken natively or as a second language by a population of about 6.5–7.0 million people are spoken in a huge area from present-day Poland to
Moscow
Moscow is the Capital city, capital and List of cities and towns in Russia by population, largest city of Russia, standing on the Moskva (river), Moskva River in Central Russia. It has a population estimated at over 13 million residents with ...
.
Pre-Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the reconstructed proto-language of the Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from pre-Proto-Germanic into three Germanic branc ...
gives rise to
Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from ...
in southern Scandinavia.
Homer
Homer (; , ; possibly born ) was an Ancient Greece, Ancient Greek poet who is credited as the author of the ''Iliad'' and the ''Odyssey'', two epic poems that are foundational works of ancient Greek literature. Despite doubts about his autho ...
and the beginning of
Classical Antiquity
Classical antiquity, also known as the classical era, classical period, classical age, or simply antiquity, is the period of cultural History of Europe, European history between the 8th century BC and the 5th century AD comprising the inter ...
. The Vedic civilization gives way to the
Mahajanapadas
The Mahājanapadas were sixteen Realm, kingdoms and aristocracy, aristocratic republics that existed in ancient India from the sixth to fourth centuries BCE, during the History of India#Second urbanisation (c. 600 – 200 BCE), second urbanis ...
as the Indo-Aryan tongue reaches eastwards, giving rise to the
Greater Magadha
Greater Magadha is a theory in the studies of the ancient history of India, introduced by Johannes Bronkhorst. It refers to the non-Vedic political and cultural sphere that developed in the lower Gangetic plains (modern day Bihar, Eastern ...
cultural sphere, where
Mahavira
Mahavira (Devanagari: महावीर, ), also known as Vardhamana (Devanagari: वर्धमान, ), was the 24th ''Tirthankara'' (Supreme Preacher and Ford Maker) of Jainism. Although the dates and most historical details of his lif ...
preaches
Jainism
Jainism ( ), also known as Jain Dharma, is an Indian religions, Indian religion whose three main pillars are nonviolence (), asceticism (), and a rejection of all simplistic and one-sided views of truth and reality (). Jainism traces its s ...
and
Siddhartha Gautama
Siddhartha Gautama, most commonly referred to as the Buddha (),*
*
*
was a śramaṇa, wandering ascetic and religious teacher who lived in South Asia during the 6th or 5th century BCE and founded Buddhism. According to Buddhist lege ...
preaches
Buddhism
Buddhism, also known as Buddhadharma and Dharmavinaya, is an Indian religion and List of philosophies, philosophical tradition based on Pre-sectarian Buddhism, teachings attributed to the Buddha, a wandering teacher who lived in the 6th or ...
.
Zoroaster
Zarathushtra Spitama, more commonly known as Zoroaster or Zarathustra, was an Iranian peoples, Iranian religious reformer who challenged the tenets of the contemporary Ancient Iranian religion, becoming the spiritual founder of Zoroastrianism ...
composes the
Gathas
The Gathas () are 17 hymns in the Avestan language from the Zoroastrian oral tradition of the Avesta. The oldest surviving text fragment dates from 1323 CE, but they are believed by scholars to have been composed before 1000 BCE and passed dow ...
, rise of the
Achaemenid Empire
The Achaemenid Empire or Achaemenian Empire, also known as the Persian Empire or First Persian Empire (; , , ), was an Iranian peoples, Iranian empire founded by Cyrus the Great of the Achaemenid dynasty in 550 BC. Based in modern-day Iran, i ...
, replacing the
Elamites and
Babylonia
Babylonia (; , ) was an Ancient history, ancient Akkadian language, Akkadian-speaking state and cultural area based in the city of Babylon in central-southern Mesopotamia (present-day Iraq and parts of Kuwait, Syria and Iran). It emerged as a ...
. Separation of Proto-Italic into
Osco-Umbrian
The Osco-Umbrian, Sabellic or Sabellian languages are an extinct group of Italic languages, the Indo-European languages that were spoken in central and southern Italy by the Osco-Umbrians before being replaced by Latin, as the power of ancient Rom ...
,
Latin-Faliscan, and possibly
Venetic
Venetic ( ) is an extinct Indo-European language, most commonly classified into the Italic subgroup, that was spoken by the Veneti people in ancient times in northeast Italy (Veneto and Friuli) and part of modern Slovenia, between the Po ...
and
Siculian
Siculian (or Sicel) is an extinct Indo-European language spoken in central and eastern Sicily by the Sicels. It is attested in fewer than thirty inscriptions in eastern Sicily from the late 6th century to 4th century BCE, and in around twenty-five ...
. A variety of
Paleo-Balkan languages
The Paleo-Balkan languages are a geographical grouping of various Indo-European languages that were spoken in the Balkans and surrounding areas in ancient times. In antiquity, Dacian, Greek, Illyrian, Messapic, Paeonian, Phrygian and Thracian wer ...
besides Greek are spoken in Southern Europe, including
Thracian
The Thracians (; ; ) were an Indo-European speaking people who inhabited large parts of Southeast Europe in ancient history.. "The Thracians were an Indo-European people who occupied the area that today is shared between north-eastern Greece, ...
,
Dacian and
Illyrian, and in
Anatolia
Anatolia (), also known as Asia Minor, is a peninsula in West Asia that makes up the majority of the land area of Turkey. It is the westernmost protrusion of Asia and is geographically bounded by the Mediterranean Sea to the south, the Aegean ...
(
Phrygian). Development of
Prakrits
Prakrit ( ) is a group of vernacular classical Middle Indo-Aryan languages that were used in the Indian subcontinent from around the 5th century BCE to the 12th century CE. The term Prakrit is usually applied to the middle period of Middle Ind ...
across the northern Indian subcontinent, as well as migration of Indo-Aryan speakers to
Sri Lanka
Sri Lanka, officially the Democratic Socialist Republic of Sri Lanka, also known historically as Ceylon, is an island country in South Asia. It lies in the Indian Ocean, southwest of the Bay of Bengal, separated from the Indian subcontinent, ...
and the
Maldives
The Maldives, officially the Republic of Maldives, and historically known as the Maldive Islands, is an Archipelagic state, archipelagic country in South Asia located in the Indian Ocean. The Maldives is southwest of Sri Lanka and India, abou ...
.
* 500–1 BC:
Classical Antiquity
Classical antiquity, also known as the classical era, classical period, classical age, or simply antiquity, is the period of cultural History of Europe, European history between the 8th century BC and the 5th century AD comprising the inter ...
: spread of
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
and
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
throughout the Mediterranean and, during the
Hellenistic period
In classical antiquity, the Hellenistic period covers the time in Greek history after Classical Greece, between the death of Alexander the Great in 323 BC and the death of Cleopatra VII in 30 BC, which was followed by the ascendancy of the R ...
(
Indo-Greeks
The Indo-Greek Kingdom, also known as the Yavana Kingdom, was a Hellenistic-era Greek kingdom covering various parts of modern-day Afghanistan, Pakistan and northwestern India.
The term "Indo-Greek Kingdom" loosely describes a number of var ...
), to Central Asia and the
Hindukush
The Hindu Kush is an mountain range in Central and South Asia to the west of the Himalayas. It stretches from central and eastern Afghanistan into northwestern Pakistan and far southeastern Tajikistan. The range forms the western section of ...
. The Magadhan power and influence rises in ancient India, especially with the conquests of the
Nandan and
Mauryan empire
The Maurya Empire was a geographically extensive Iron Age historical power in South Asia with its power base in Magadha. Founded by Chandragupta Maurya around c. 320 BCE, it existed in loose-knit fashion until 185 BCE. The primary sourc ...
s. Germanic speakers start migrating southwards to occupy formerly Celtic territories.
Scythian cultures
The Scythian cultures was an archaeological horizon that flourished across the Eurasian Steppe during the Iron Age, from approximately the 9th century BC to the 2nd century AD. It included the Scythian, Sauromatian and Sarma ...
extend from Eastern Europe (
Pontic Scythians) to Northwest China (
Ordos culture
The Ordos culture () was a material culture occupying a region centered on the Ordos Loop (corresponding to the region of Suiyuan, including Baotou to the north, all located in modern Inner Mongolia, China) during the Bronze Age, Bronze and ea ...
).
* 1 BC – AD 500:
Late Antiquity
Late antiquity marks the period that comes after the end of classical antiquity and stretches into the onset of the Early Middle Ages. Late antiquity as a period was popularized by Peter Brown (historian), Peter Brown in 1971, and this periodiza ...
,
Gupta period
The Gupta Empire was an Indian empire during the classical period of the Indian subcontinent which existed from the mid 3rd century to mid 6th century CE. At its zenith, the dynasty ruled over an empire that spanned much of the northern Indian ...
; attestation of
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
.
Proto-Slavic
Proto-Slavic (abbreviated PSl., PS.; also called Common Slavic or Common Slavonic) is the unattested, reconstructed proto-language of all Slavic languages. It represents Slavic speech approximately from the 2nd millennium BC through the 6th ...
. The
Roman Empire
The Roman Empire ruled the Mediterranean and much of Europe, Western Asia and North Africa. The Roman people, Romans conquered most of this during the Roman Republic, Republic, and it was ruled by emperors following Octavian's assumption of ...
and then the
Germanic migrations
The Migration Period ( 300 to 600 AD), also known as the Barbarian Invasions, was a period in European history marked by large-scale migrations that saw the fall of the Western Roman Empire and subsequent settlement of its former territories ...
marginalize the Celtic languages to the British Isles.
Sogdian, an
eastern Iranian language
Eastern or Easterns may refer to:
Transportation
Airlines
*China Eastern Airlines, a current Chinese airline based in Shanghai
* Eastern Air, former name of Zambia Skyways
*Eastern Air Lines, a defunct American airline that operated from 192 ...
, becomes the ''
lingua franca
A lingua franca (; ; for plurals see ), also known as a bridge language, common language, trade language, auxiliary language, link language or language of wider communication (LWC), is a Natural language, language systematically used to make co ...
'' of the
Silk Road
The Silk Road was a network of Asian trade routes active from the second century BCE until the mid-15th century. Spanning over , it played a central role in facilitating economic, cultural, political, and religious interactions between the ...
in Central Asia leading to China, due to the proliferation of
Sogdia
Sogdia () or Sogdiana was an ancient Iranian peoples, Iranian civilization between the Amu Darya and the Syr Darya, and in present-day Uzbekistan, Turkmenistan, Tajikistan, Kazakhstan, and Kyrgyzstan. Sogdiana was also a province of the Achaemen ...
n merchants there. Greek settlements and
Byzantine
The Byzantine Empire, also known as the Eastern Roman Empire, was the continuation of the Roman Empire centred on Constantinople during late antiquity and the Middle Ages. Having survived the events that caused the fall of the Western Roman E ...
rule make the last Anatolian languages
extinct
Extinction is the termination of an organism by the death of its Endling, last member. A taxon may become Functional extinction, functionally extinct before the death of its last member if it loses the capacity to Reproduction, reproduce and ...
.
Turkic languages
The Turkic languages are a language family of more than 35 documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia (Siberia), and West Asia. The Turkic langua ...
start replacing
Scythian languages
The Scythian languages are a group of Eastern Iranic languages of the classical and late antique period (the Middle Iranic period), spoken in a vast region of Eurasia by the populations belonging to the Scythian cultures and their desce ...
.
* 500–1000:
Early Middle Ages
The Early Middle Ages (or early medieval period), sometimes controversially referred to as the Dark Ages (historiography), Dark Ages, is typically regarded by historians as lasting from the late 5th to the 10th century. They marked the start o ...
. The
Viking Age
The Viking Age (about ) was the period during the Middle Ages when Norsemen known as Vikings undertook large-scale raiding, colonising, conquest, and trading throughout Europe and reached North America. The Viking Age applies not only to their ...
forms an Old Norse
koine
Koine Greek (, ), also variously known as Hellenistic Greek, common Attic, the Alexandrian dialect, Biblical Greek, Septuagint Greek or New Testament Greek, was the common supra-regional form of Greek spoken and written during the Hellenistic ...
spanning Scandinavia, the British Isles and Iceland. Phrygian becomes extinct. The
Islamic conquests
The early Muslim conquests or early Islamic conquests (), also known as the Arab conquests, were initiated in the 7th century by Muhammad, the founder of Islam. He established the first Islamic state in Medina, Arabia that expanded rapidly un ...
and the
Turkic expansion
Turkic peoples are a collection of diverse ethnic groups of West Asia, West, Central Asia, Central, East Asia, East, and North Asia as well as parts of Europe, who speak Turkic languages.. "Turkic peoples, any of various peoples whose members ...
result in the
Arabization
Arabization or Arabicization () is a sociology, sociological process of cultural change in which a non-Arab society becomes Arabs, Arab, meaning it either directly adopts or becomes strongly influenced by the Arabic, Arabic language, Arab cultu ...
and
Turkification
Turkification, Turkization, or Turkicization () describes a shift whereby populations or places receive or adopt Turkic attributes such as culture, language, history, or ethnicity. However, often this term is more narrowly applied to mean specif ...
of significant areas where Indo-European languages were spoken, but
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
still develops under Islamic rule and extends into
Afghanistan
Afghanistan, officially the Islamic Emirate of Afghanistan, is a landlocked country located at the crossroads of Central Asia and South Asia. It is bordered by Pakistan to the Durand Line, east and south, Iran to the Afghanistan–Iran borde ...
and
Tajikistan
Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
. Due to further
Turkic migrations,
Tocharian becomes fully extinct while Scythian languages are overwhelmingly replaced. Slavic languages spread over wide areas in central, eastern and southeastern Europe, largely replacing Romance in the Balkans (with the exception of Romanian) and whatever was left of the
Paleo-Balkan languages
The Paleo-Balkan languages are a geographical grouping of various Indo-European languages that were spoken in the Balkans and surrounding areas in ancient times. In antiquity, Dacian, Greek, Illyrian, Messapic, Paeonian, Phrygian and Thracian wer ...
with the exception of Albanian. Pannonian Basin is taken by the
Magyars
Hungarians, also known as Magyars, are an ethnic group native to Hungary (), who share a common culture, language and history. They also have a notable presence in former parts of the Kingdom of Hungary. The Hungarian language belongs to the ...
from the western
Slavs
The Slavs or Slavic people are groups of people who speak Slavic languages. Slavs are geographically distributed throughout the northern parts of Eurasia; they predominantly inhabit Central Europe, Eastern Europe, Southeastern Europe, and ...
.
* 1000–1500:
Late Middle Ages
The late Middle Ages or late medieval period was the Periodization, period of History of Europe, European history lasting from 1300 to 1500 AD. The late Middle Ages followed the High Middle Ages and preceded the onset of the early modern period ( ...
: Attestation of
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
and
Baltic
Baltic may refer to:
Peoples and languages
*Baltic languages, a subfamily of Indo-European languages, including Lithuanian, Latvian and extinct Old Prussian
*Balts (or Baltic peoples), ethnic groups speaking the Baltic languages and/or originatin ...
. Modern dialects of Indo-European languages start emerging.
* 1500–2000:
early modern period
The early modern period is a Periodization, historical period that is defined either as part of or as immediately preceding the modern period, with divisions based primarily on the history of Europe and the broader concept of modernity. There i ...
to present: Colonialism results in the spread of Indo-European languages to every habitable continent, most notably
Romance
Romance may refer to:
Common meanings
* Romance (love), emotional attraction towards another person and the courtship behaviors undertaken to express the feelings
** Romantic orientation, the classification of the sex or gender with which a pers ...
(North, Central and South America, North and Sub-Saharan Africa, West Asia),
West Germanic
The West Germanic languages constitute the largest of the three branches of the Germanic languages, Germanic family of languages (the others being the North Germanic languages, North Germanic and the extinct East Germanic languages, East Germ ...
(
English in North America, Sub-Saharan Africa, East Asia and Australia; to a lesser extent Dutch and German), and
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
to Central Asia and North Asia.
Key languages for reconstruction
In reconstructing the history of the Indo-European languages and the form of the
Proto-Indo-European language
Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. No direct record of Proto-Indo-European exists; its proposed features have been derived by linguistic reconstruction from documented Indo-Eu ...
, some languages have been of particular importance. These generally include the ancient Indo-European languages that are both well-attested and documented at an early date, although some languages from later periods are important if they are particularly
linguistically conservative (most notably,
Lithuanian). Early poetry is of special significance because of the rigid
poetic meter
In poetry, metre ( Commonwealth spelling) or meter (American spelling; see spelling differences) is the basic rhythmic structure of a verse or lines in verse. Many traditional verse forms prescribe a specific verse metre, or a certain set of ...
normally employed, which makes it possible to reconstruct a number of features (e.g.
vowel length
In linguistics, vowel length is the perceived or actual length (phonetics), duration of a vowel sound when pronounced. Vowels perceived as shorter are often called short vowels and those perceived as longer called long vowels.
On one hand, many ...
) that were either unwritten or corrupted in the process of transmission down to the earliest extant written
manuscript
A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has ...
s.
Most noticeably:
*
Vedic Sanskrit
Vedic Sanskrit, also simply referred as the Vedic language, is the most ancient known precursor to Sanskrit, a language in the Indo-Aryan languages, Indo-Aryan subgroup of the Indo-European languages, Indo-European language family. It is atteste ...
(). This language is unique in that its source documents were all composed orally, and were passed down through
oral tradition
Oral tradition, or oral lore, is a form of human communication in which knowledge, art, ideas and culture are received, preserved, and transmitted orally from one generation to another.Jan Vansina, Vansina, Jan: ''Oral Tradition as History'' (19 ...
(
shakha
A shakha () is a Hindu theological school that specializes in learning certain Vedic texts, or else the traditional texts followed by such a school.V. S. Apte. A Practical Sanskrit Dictionary, p. 913, left column.Monier-Williams, A Sanskrit-Engli ...
schools) for c. 2,000 years before ever being written down. The oldest documents are all in poetic form; oldest and most important of all is the
Rigveda
The ''Rigveda'' or ''Rig Veda'' (, , from wikt:ऋच्, ऋच्, "praise" and wikt:वेद, वेद, "knowledge") is an ancient Indian Miscellany, collection of Vedic Sanskrit hymns (''sūktas''). It is one of the four sacred canoni ...
()).
*
Ancient Greek
Ancient Greek (, ; ) includes the forms of the Greek language used in ancient Greece and the classical antiquity, ancient world from around 1500 BC to 300 BC. It is often roughly divided into the following periods: Mycenaean Greek (), Greek ...
().
Mycenaean Greek
Mycenaean Greek is the earliest attested form of the Greek language. It was spoken on the Greek mainland and Crete in Mycenaean Greece (16th to 12th centuries BC). The language is preserved in inscriptions in Linear B, a script first atteste ...
() is the oldest recorded form, but its value is lessened by the limited material, restricted subject matter, and highly ambiguous writing system. More important is Ancient Greek, documented extensively beginning with the two
Homeric poems
Homer (; , ; possibly born ) was an Ancient Greek poet who is credited as the author of the ''Iliad'' and the ''Odyssey'', two epic poems that are foundational works of ancient Greek literature. Despite doubts about his authorship, Homer is ...
(the ''
Iliad
The ''Iliad'' (; , ; ) is one of two major Ancient Greek epic poems attributed to Homer. It is one of the oldest extant works of literature still widely read by modern audiences. As with the ''Odyssey'', the poem is divided into 24 books and ...
'' and the ''
Odyssey
The ''Odyssey'' (; ) is one of two major epics of ancient Greek literature attributed to Homer. It is one of the oldest surviving works of literature and remains popular with modern audiences. Like the ''Iliad'', the ''Odyssey'' is divi ...
'', ).
*
Hittite (). This is the earliest-recorded of all Indo-European languages, and highly divergent from the others due to the early separation of the
Anatolian languages
The Anatolian languages are an extinct branch of Indo-European languages that were spoken in Anatolia. The best known Anatolian language is Hittite, which is considered the earliest-attested Indo-European language.
Undiscovered until the late ...
from the remainder. It possesses some highly archaic features found only fragmentarily, if at all, in other languages. At the same time, however, it appears to have undergone many early phonological and grammatical changes which, combined with the ambiguities of its writing system, hinder its usefulness somewhat.
Other primary sources:
*
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, attested in a huge amount of poetic and prose material in the
Classical period () and limited
Old Latin
Old Latin, also known as Early, Archaic or Priscan Latin (Classical ), was the Latin language in the period roughly before 75 BC, i.e. before the age of Classical Latin. A member of the Italic languages, it descends from a common Proto-Italic ...
material from as early as .
*
Gothic (the most archaic well-documented
Germanic language
The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoken Germanic language, ...
, ), along with the combined witness of the other old Germanic languages: most importantly,
Old English
Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
(),
Old High German
Old High German (OHG; ) is the earliest stage of the German language, conventionally identified as the period from around 500/750 to 1050. Rather than representing a single supra-regional form of German, Old High German encompasses the numerous ...
() and
Old Norse
Old Norse, also referred to as Old Nordic or Old Scandinavian, was a stage of development of North Germanic languages, North Germanic dialects before their final divergence into separate Nordic languages. Old Norse was spoken by inhabitants ...
(, with limited earlier sources dating to ).
*
Old Avestan
Avestan ( ) is the liturgical language of Zoroastrianism. It belongs to the Iranian branch of the Indo-European language family and was originally spoken during the Old Iranian period ( – 400 BCE) by the Iranians living in the eastern po ...
() and
Younger Avestan
Avestan ( ) is the liturgical language of Zoroastrianism. It belongs to the Iranian branch of the Indo-European language family and was originally spoken during the Old Iranian period ( – 400 BCE) by the Iranians living in the eastern por ...
()). Documentation is sparse, but nonetheless quite important due to its highly archaic nature.
* Modern
Lithuanian, with limited records in
Old Lithuanian
Lithuanian (, ) is an East Baltic language belonging to the Baltic branch of the Indo-European language family. It is the language of Lithuanians and the official language of Lithuania as well as one of the official languages of the European ...
().
*
Old Church Slavonic
Old Church Slavonic or Old Slavonic ( ) is the first Slavic languages, Slavic literary language and the oldest extant written Slavonic language attested in literary sources. It belongs to the South Slavic languages, South Slavic subgroup of the ...
().
Other secondary sources, due to poor attestation:
*
Luwian
Luwian (), sometimes known as Luvian or Luish, is an ancient language, or group of languages, within the Anatolian branch of the Indo-European language family. The ethnonym Luwian comes from ''Luwiya'' (also spelled ''Luwia'' or ''Luvia'') – ...
,
Lycian,
Lydian and other
Anatolian languages
The Anatolian languages are an extinct branch of Indo-European languages that were spoken in Anatolia. The best known Anatolian language is Hittite, which is considered the earliest-attested Indo-European language.
Undiscovered until the late ...
().
*
Oscan
Oscan is an extinct Indo-European language of southern Italy. The language is in the Osco-Umbrian or Sabellic branch of the Italic languages. Oscan is therefore a close relative of Umbrian and South Picene.
Oscan was spoken by a number of t ...
,
Umbrian
Umbrian is an extinct Italic language formerly spoken by the Umbri in the ancient Italian region of Umbria. Within the Italic languages it is closely related to the Oscan group and is therefore associated with it in the group of Osco-Umbr ...
and other
Old Italic languages ()).
*
Old Persian
Old Persian is one of two directly attested Old Iranian languages (the other being Avestan) and is the ancestor of Middle Persian (the language of the Sasanian Empire). Like other Old Iranian languages, it was known to its native speakers as (I ...
().
*
Old Prussian
Old Prussian is an extinct West Baltic language belonging to the Baltic branch of the Indo-European languages, which was once spoken by the Old Prussians, the Baltic peoples of the Prussian region. The language is called Old Prussian to av ...
(); even more archaic than Lithuanian.
Other secondary sources, due to extensive phonological changes and relatively limited attestation:
*
Old Irish
Old Irish, also called Old Gaelic (, Ogham, Ogham script: ᚌᚑᚔᚇᚓᚂᚉ; ; ; or ), is the oldest form of the Goidelic languages, Goidelic/Gaelic language for which there are extensive written texts. It was used from 600 to 900. The ...
().
*
Tocharian (), underwent large phonetic shifts and mergers in the proto-language, and has an almost entirely reworked declension system.
*
Classical Armenian
Classical Armenian (, , ; meaning "literary anguage; also Old Armenian or Liturgical Armenian) is the oldest attested form of the Armenian language. It was first written down at the beginning of the 5th century, and most Armenian literature fro ...
().
*
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
(present).
Sound changes
As speakers of Proto-Indo-European (PIE) dispersed, the language's sound system diverged as well, changing according to various
sound law
In historical linguistics, a sound change is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic cha ...
s evidenced in the
daughter language
In historical linguistics, a daughter language, also known as descendant language, is a language descended from another language, its mother language, through a process of genetic descent. If more than one language has developed from the same pro ...
s.
PIE is normally reconstructed with a complex system of 15
stop consonant
In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases.
The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
s, including an unusual three-way
phonation
The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the defi ...
(
voicing) distinction between
voiceless
In linguistics, voicelessness is the property of sounds being pronounced without the larynx vibrating. Phonologically, it is a type of phonation, which contrasts with other states of the larynx, but some object that the word phonation implies v ...
,
voiced
Voice or voicing is a term used in phonetics and phonology to characterize speech sounds (usually consonants). Speech sounds can be described as either voiceless (otherwise known as ''unvoiced'') or voiced.
The term, however, is used to refe ...
and "
voiced aspirated
Breathy voice (also called murmured voice, whispery voice, soughing and susurration) is a phonation in which the vocal folds vibrate, as they do in normal (modal) voicing, but are adjusted to let more air escape which produces a sighing-like s ...
" (i.e.
breathy voiced
Breathy voice (also called murmured voice, whispery voice, soughing and susurration) is a phonation in which the vocal folds vibrate, as they do in normal (modal) voicing, but are adjusted to let more air escape which produces a sighing-like s ...
) stops, and a three-way distinction among
velar consonant
Velar consonants are consonants articulated with the back part of the tongue (the dorsum) against the soft palate, the back part of the roof of the mouth (also known as the "velum").
Since the velar region of the roof of the mouth is relativel ...
s (''k''-type sounds) between "palatal" ''ḱ ǵ ǵh'', "plain velar" ''k g gh'' and
labiovelar ''kʷ gʷ gʷh''. (The correctness of the terms ''palatal'' and ''plain velar'' is disputed; see
Proto-Indo-European phonology
The phonology of the Proto-Indo-European language (PIE) has been reconstructed by linguists, based on the similarities and differences among current and extinct Indo-European languages. Because PIE was not written, linguists must rely on the evi ...
.) All daughter languages have reduced the number of distinctions among these sounds, often in divergent ways.
As an example, in
English, one of the
Germanic language
The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoken Germanic language, ...
s, the following are some of the major changes that happened:
None of the daughter-language families (except possibly
Anatolian, particularly
Luvian) reflect the plain velar stops differently from the other two series, and there is even a certain amount of dispute whether this series existed at all in PIE. The major distinction between
''centum'' and ''satem'' languages corresponds to the outcome of the PIE plain velars:
* The "central" ''satem'' languages (
Indo-Iranian,
Balto-Slavic
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
,
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
, and
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
) reflect both "plain velar" and labiovelar stops as plain velars, often with secondary
palatalization
Palatalization may refer to:
*Palatalization (phonetics), the phonetic feature of palatal secondary articulation
*Palatalization (sound change)
Palatalization ( ) is a historical-linguistic sound change that results in a palatalized articulati ...
before a
front vowel
A front vowel is a class of vowel sounds used in some spoken languages, its defining characteristic being that the highest point of the tongue is positioned approximately as far forward as possible in the mouth without creating a constriction th ...
(''e i ē ī''). The "palatal" stops are palatalized and often appear as
sibilant
Sibilants (from 'hissing') are fricative and affricate consonants of higher amplitude and pitch, made by directing a stream of air with the tongue towards the teeth. Examples of sibilants are the consonants at the beginning of the English w ...
s (usually but not always distinct from the secondarily palatalized stops).
* The "peripheral" ''centum'' languages (
Germanic,
Italic,
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
,
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
,
Anatolian and
Tocharian) reflect both "palatal" and "plain velar" stops as plain velars, while the labiovelars continue unchanged, often with later reduction into plain
labial or
velar consonant
Velar consonants are consonants articulated with the back part of the tongue (the dorsum) against the soft palate, the back part of the roof of the mouth (also known as the "velum").
Since the velar region of the roof of the mouth is relativel ...
s.
The three-way PIE distinction between voiceless, voiced and voiced aspirated stops is considered extremely unusual from the perspective of
linguistic typology
Linguistic typology (or language typology) is a field of linguistics that studies and classifies languages according to their structural features to allow their comparison. Its aim is to describe and explain the structural diversity and the co ...
—particularly in the existence of voiced aspirated stops without a corresponding series of voiceless aspirated stops. None of the various daughter-language families continue it unchanged, with numerous "solutions" to the apparently unstable PIE situation:
* The
Indo-Aryan language
The Indo-Aryan languages, or sometimes Indic languages, are a branch of the Indo-Iranian languages in the Indo-European language family. As of 2024, there are more than 1.5 billion speakers, primarily concentrated east of the Indus river in Ba ...
s preserve the three series unchanged but have evolved a fourth series of voiceless aspirated consonants.
* The
Iranian language
The Iranian languages, also called the Iranic languages, are a branch of the Indo-Iranian languages in the Indo-European language family that are spoken natively by the Iranian peoples, predominantly in the Iranian Plateau.
The Iranian language ...
s probably passed through the same stage, subsequently changing the aspirated stops into fricatives.
*
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
converted the voiced aspirates into voiceless aspirates.
*
Italic probably passed through the same stage, but reflects the voiced aspirates as voiceless fricatives, especially ''f'' (or sometimes plain voiced stops in
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
).
*
Celtic
Celtic, Celtics or Keltic may refer to:
Language and ethnicity
*pertaining to Celts, a collection of Indo-European peoples in Europe and Anatolia
**Celts (modern)
*Celtic languages
**Proto-Celtic language
*Celtic music
*Celtic nations
Sports Foot ...
,
Balto-Slavic
The Balto-Slavic languages form a branch of the Indo-European languages, Indo-European family of languages, traditionally comprising the Baltic languages, Baltic and Slavic languages. Baltic and Slavic languages share several linguistic traits ...
,
Anatolian, and
Albanian
Albanian may refer to:
*Pertaining to Albania in Southeast Europe; in particular:
**Albanians, an ethnic group native to the Balkans
**Albanian language
**Albanian culture
**Demographics of Albania, includes other ethnic groups within the country ...
merge the voiced aspirated into plain voiced stops.
*
Germanic and
Armenian
Armenian may refer to:
* Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia
* Armenians, the national people of Armenia, or people of Armenian descent
** Armenian diaspora, Armenian communities around the ...
change all three series in a
chain shift
In historical linguistics, a chain shift is a set of sound changes in which the change in pronunciation of one speech sound (typically, a phoneme) is linked to, and presumably causes, a change in pronunciation of other sounds. The sounds invo ...
(e.g. with ''bh b p'' becoming ''b p f'' (known as ''
Grimm's law
Grimm's law, also known as the First Germanic Consonant Shift or First Germanic Sound Shift, is a set of sound laws describing the Proto-Indo-European (PIE) stop consonants as they developed in Proto-Germanic in the first millennium BC, first d ...
'' in Germanic)).
Among the other notable changes affecting consonants are:
* The
Ruki sound law
The ruki sound law, also known as the ruki rule or iurk rule, is a historical sound change that took place in the satem branches of the Indo-European language family, namely in Balto-Slavic, Armenian, and Indo-Iranian. According to this sound ...
(''s'' becomes before ''r, u, k, i'') in the ''
satem
Languages of the Indo-European family are classified as either centum languages or satem languages according to how the dorsal consonants (sounds of "K", "G" and "Y" type) of the reconstructed Proto-Indo-European language (PIE) developed. An ...
'' languages.
* Loss of prevocalic ''p'' in
Proto-Celtic
Proto-Celtic, or Common Celtic, is the hypothetical ancestral proto-language of all known Celtic languages, and a descendant of Proto-Indo-European. It is not attested in writing but has been partly Linguistic reconstruction, reconstructed throu ...
.
* Development of prevocalic ''s'' to ''h'' in
Proto-Greek
The Proto-Greek language (also known as Proto-Hellenic) is the Indo-European language which was the last common ancestor of all varieties of Greek, including Mycenaean Greek, the subsequent ancient Greek dialects (i.e., Attic, Ionic, Ae ...
, with later loss of ''h'' between vowels.
*
Verner's law in
Proto-Germanic
Proto-Germanic (abbreviated PGmc; also called Common Germanic) is the linguistic reconstruction, reconstructed proto-language of the Germanic languages, Germanic branch of the Indo-European languages.
Proto-Germanic eventually developed from ...
.
*
Grassmann's law
Grassmann's law, named after its discoverer Hermann Grassmann, is a dissimilatory phonological process in Ancient Greek and Sanskrit which states that if an Aspiration (phonetics), aspirated consonant is followed by another aspirated consonant ...
(dissimilation of aspirates) independently in Proto-Greek and Proto-Indo-Iranian.
The following table shows the basic outcomes of PIE consonants in some of the most important daughter languages for the purposes of reconstruction. For a fuller table, see
Indo-European sound laws
As the Proto-Indo-European language (PIE) broke up, its sound system diverged as well, as evidenced in various sound laws associated with the daughter Indo-European languages.
Especially notable is the palatalization that produced the satem ...
.
:Notes:
* C- At the beginning of a word.
* -C- Between vowels.
* -C At the end of a word.
* `-C- Following an unstressed vowel (
Verner's law).
* -C-
(rl) Between vowels, or between a vowel and (on either side).
* C
T Before a (PIE) stop ().
* C
T− After a (PIE) obstruent (, etc.; ).
* C
(T) Before or after an obstruent (, etc.; ).
* C
H Before an original laryngeal.
* C
E Before a (PIE) front vowel ().
* C
E' Before secondary (post-PIE) front-vowels.
* C
e Before .
* C
(u) Before or after a (PIE) (
boukólos rule
The ''boukólos'' rule is a phonological rule of the Proto-Indo-European language (PIE). It states that a labiovelar stop (, , ) dissimilates to an ordinary velar stop (, , ) next to the vowel or its corresponding glide .
The rule is named a ...
).
* C
(O) Before or after a (PIE) (
boukólos rule
The ''boukólos'' rule is a phonological rule of the Proto-Indo-European language (PIE). It states that a labiovelar stop (, , ) dissimilates to an ordinary velar stop (, , ) next to the vowel or its corresponding glide .
The rule is named a ...
).
* C
n− After .
* C
R Before a
sonorant
In phonetics and phonology, a sonorant or resonant is a speech sound that is produced with continuous, non-turbulent airflow in the vocal tract; these are the manners of articulation that are most often voiced in the world's languages. Vowels a ...
().
* C
(R) Before or after a
sonorant
In phonetics and phonology, a sonorant or resonant is a speech sound that is produced with continuous, non-turbulent airflow in the vocal tract; these are the manners of articulation that are most often voiced in the world's languages. Vowels a ...
().
* C
(r),l,u− Before or after .
* C
ruki− After (
Ruki sound law
The ruki sound law, also known as the ruki rule or iurk rule, is a historical sound change that took place in the satem branches of the Indo-European language family, namely in Balto-Slavic, Armenian, and Indo-Iranian. According to this sound ...
).
* C
..Ch Before an aspirated consonant in the next syllable (
Grassmann's law
Grassmann's law, named after its discoverer Hermann Grassmann, is a dissimilatory phonological process in Ancient Greek and Sanskrit which states that if an Aspiration (phonetics), aspirated consonant is followed by another aspirated consonant ...
, also known as
dissimilation of aspirates).
* C
E..Ch Before a (PIE) front vowel () as well as before an aspirated consonant in the next syllable (
Grassmann's law
Grassmann's law, named after its discoverer Hermann Grassmann, is a dissimilatory phonological process in Ancient Greek and Sanskrit which states that if an Aspiration (phonetics), aspirated consonant is followed by another aspirated consonant ...
, also known as
dissimilation of aspirates).
* C
(u)..Ch Before or after a (PIE) as well as before an aspirated consonant in the next syllable (
Grassmann's law
Grassmann's law, named after its discoverer Hermann Grassmann, is a dissimilatory phonological process in Ancient Greek and Sanskrit which states that if an Aspiration (phonetics), aspirated consonant is followed by another aspirated consonant ...
, also known as
dissimilation of aspirates).
Comparison of conjugations
The following table presents a comparison of conjugations of the
thematic present indicative of the verbal root * of the English verb ''
to bear'' and its reflexes in various early attested IE languages and their modern descendants or relatives, showing that all languages had in the early stage an inflectional verb system.
While similarities are still visible between the modern descendants and relatives of these ancient languages, the differences have increased over time. Some IE languages have moved from
synthetic
Synthetic may refer to:
Science
* Synthetic biology
* Synthetic chemical or compound, produced by the process of chemical synthesis
* Synthetic elements, chemical elements that are not naturally found on Earth and therefore have to be created in ...
verb systems to largely
periphrastic
In linguistics and literature, periphrasis () is the use of a larger number of words, with an implicit comparison to the possibility of using fewer. The comparison may be within a language or between languages. For example, "more happy" is periph ...
systems. In addition, the
pronoun
In linguistics and grammar, a pronoun (Interlinear gloss, glossed ) is a word or a group of words that one may substitute for a noun or noun phrase.
Pronouns have traditionally been regarded as one of the part of speech, parts of speech, but so ...
s of periphrastic forms are in parentheses when they appear. Some of these verbs have undergone a change in meaning as well.
* In
Modern Irish
Irish (Standard Irish: ), also known as Irish Gaelic or simply Gaelic ( ), is a Celtic language of the Indo-European language family. It is a member of the Goidelic languages of the Insular Celtic sub branch of the family and is indigenous ...
''beir'' usually only carries the meaning ''to bear'' in the sense of bearing a child; its common meanings are ''to catch, grab''. Apart from the first person, the forms given in the table above are dialectical or obsolete. The second and third person forms are typically instead conjugated
periphrastically by adding a pronoun after the verb: ''beireann tú, beireann sé/sí, beireann sibh, beireann siad''.
* The
Hindustani (
Hindi
Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
and
Urdu
Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
) verb ''bʰarnā'', the continuation of the Sanskrit verb, can have a variety of meanings, but the most common is "to fill". The forms given in the table, although etymologically derived from the
present indicative, now have the meaning of
future subjunctive
The subjunctive (also known as the conjunctive in some languages) is a grammatical mood, a feature of an utterance that indicates the speaker's attitude toward it. Subjunctive forms of verbs are typically used to express various states of unrealit ...
.
The loss of the
present indicative in Hindustani is roughly compensated by the periphrastic
habitual indicative construction, using the
habitual participle (etymologically from the Sanskrit present participle ''bʰarant-'') and an auxiliary: ''ma͠i bʰartā hū̃, tū bʰartā hai, vah bʰartā hai, ham bʰarte ha͠i, tum bʰarte ho, ve bʰarte ha͠i'' (masculine forms).
* German is not directly descended from Gothic, but the Gothic forms are a close approximation of what the early West Germanic forms of would have looked like. The descendant of Proto-Germanic ''*beraną'' (English ''bear'') survives in German only in the compound ''gebären'', meaning "bear (a child)".
* The Latin verb ''ferre'' is irregular, and not a good representative of a normal thematic verb. In most Romance languages such as Portuguese, other verbs now mean "to carry" (e.g. Pt. ''portar'' < Lat. ''portare'') and ''ferre'' was borrowed and nativized only in compounds such as "to suffer" (from Latin ''sub-'' and ''ferre'') and "to confer" (from Latin "con-" and "ferre").
* In Modern
Greek
Greek may refer to:
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group
*Greek language, a branch of the Indo-European language family
**Proto-Greek language, the assumed last common ancestor of all kno ...
, ''phero'' φέρω (modern transliteration ''fero'') "to bear" is still used but only in specific contexts and is most common in such compounds as αναφέρω, διαφέρω, εισφέρω, εκφέρω, καταφέρω, προφέρω, προαναφέρω, προσφέρω etc. The form that is (very) common today is ''pherno'' φέρνω (modern transliteration ''ferno'') meaning "to bring". Additionally, the perfective form of ''pherno'' (used for the subjunctive voice and also for the future tense) is also ''phero''.
* The dual forms are archaic in standard Lithuanian, and are only presently used in some dialects (e.g.
Samogitian).
* Among modern Slavic languages, only Slovene continues to have a dual number in the standard variety.
Comparison of cognates
Present distribution

Today, Indo-European languages are spoken by billions of
native speakers
A first language (L1), native language, native tongue, or mother tongue is the first language a person has been exposed to from birth or within the critical period. In some countries, the term ''native language'' or ''mother tongue'' refers ...
across all inhabited continents, the largest number by far for any recognized language family. Of the
20 languages with the largest numbers of speakers according to ''Ethnologue'', 10 are Indo-European:
English,
Hindustani,
Spanish
Spanish might refer to:
* Items from or related to Spain:
**Spaniards are a nation and ethnic group indigenous to Spain
**Spanish language, spoken in Spain and many countries in the Americas
**Spanish cuisine
**Spanish history
**Spanish culture
...
,
Bengali
Bengali or Bengalee, or Bengalese may refer to:
*something of, from, or related to Bengal, a large region in South Asia
* Bengalis, an ethnic and linguistic group of the region
* Bengali language, the language they speak
** Bengali alphabet, the w ...
,
French,
Russian
Russian(s) may refer to:
*Russians (), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries
*A citizen of Russia
*Russian language, the most widely spoken of the Slavic languages
*''The Russians'', a b ...
,
Portuguese,
German
German(s) may refer to:
* Germany, the country of the Germans and German things
**Germania (Roman era)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizenship in Germany, see also Ge ...
,
Persian
Persian may refer to:
* People and things from Iran, historically called ''Persia'' in the English language
** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples
** Persian language, an Iranian language of the ...
and
Punjabi, each with 100 million speakers or more. Additionally, hundreds of millions of persons worldwide study Indo-European languages as secondary or tertiary languages, including in cultures which have completely different language families and historical backgrounds—there are around 600 million learners of English alone.
The success of the language family, including the large number of speakers and the vast portions of the Earth that they inhabit, is due to several factors. The ancient
Indo-European migrations
The Indo-European migrations are hypothesized migrations of Proto-Indo-Europeans, peoples who spoke Proto-Indo-European language, Proto-Indo-European (PIE) and the derived Indo-European languages, which took place from around 4000 to 1000 BCE, ...
and widespread dissemination of
Indo-European culture
Proto-Indo-European society is the reconstructed culture of Proto-Indo-Europeans, the ancient speakers of the Proto-Indo-European language, ancestor of all modern Indo-European languages. Historical linguistics combined with archaeological and ...
throughout
Eurasia
Eurasia ( , ) is a continental area on Earth, comprising all of Europe and Asia. According to some geographers, Physical geography, physiographically, Eurasia is a single supercontinent. The concept of Europe and Asia as distinct continents d ...
, including that of the
Proto-Indo-Europeans
The Proto-Indo-Europeans are a hypothetical prehistoric ethnolinguistic group of Eurasia who spoke Proto-Indo-European (PIE), the reconstructed common ancestor of the Indo-European language family.
Knowledge of them comes chiefly from t ...
themselves, and that of their daughter cultures including the
Indo-Aryans
Indo-Aryan peoples are a diverse collection of peoples predominantly found in South Asia, who (traditionally) speak Indo-Aryan languages. Historically, Aryans were the Indo-Iranian speaking pastoralists who migrated from Central Asia int ...
,
Iranian peoples
Iranian peoples, or Iranic peoples, are the collective ethnolinguistic groups who are identified chiefly by their native usage of any of the Iranian languages, which are a branch of the Indo-Iranian languages within the Indo-European langu ...
,
Celts
The Celts ( , see Names of the Celts#Pronunciation, pronunciation for different usages) or Celtic peoples ( ) were a collection of Indo-European languages, Indo-European peoples. "The Celts, an ancient Indo-European people, reached the apoge ...
,
Greeks
Greeks or Hellenes (; , ) are an ethnic group and nation native to Greece, Greek Cypriots, Cyprus, Greeks in Albania, southern Albania, Greeks in Turkey#History, Anatolia, parts of Greeks in Italy, Italy and Egyptian Greeks, Egypt, and to a l ...
,
Romans
Roman or Romans most often refers to:
*Rome, the capital city of Italy
*Ancient Rome, Roman civilization from 8th century BC to 5th century AD
*Roman people, the people of Roman civilization
*Epistle to the Romans, shortened to Romans, a letter w ...
,
Germanic peoples
The Germanic peoples were tribal groups who lived in Northern Europe in Classical antiquity and the Early Middle Ages. In modern scholarship, they typically include not only the Roman-era ''Germani'' who lived in both ''Germania'' and parts of ...
, and
Slavs
The Slavs or Slavic people are groups of people who speak Slavic languages. Slavs are geographically distributed throughout the northern parts of Eurasia; they predominantly inhabit Central Europe, Eastern Europe, Southeastern Europe, and ...
, led to these peoples' branches of the language family already taking a dominant foothold in virtually all of
Eurasia
Eurasia ( , ) is a continental area on Earth, comprising all of Europe and Asia. According to some geographers, Physical geography, physiographically, Eurasia is a single supercontinent. The concept of Europe and Asia as distinct continents d ...
except for swathes of the
Near East
The Near East () is a transcontinental region around the Eastern Mediterranean encompassing the historical Fertile Crescent, the Levant, Anatolia, Egypt, Mesopotamia, and coastal areas of the Arabian Peninsula. The term was invented in the 20th ...
,
North
North is one of the four compass points or cardinal directions. It is the opposite of south and is perpendicular to east and west. ''North'' is a noun, adjective, or adverb indicating Direction (geometry), direction or geography.
Etymology
T ...
and
East Asia
East Asia is a geocultural region of Asia. It includes China, Japan, Mongolia, North Korea, South Korea, and Taiwan, plus two special administrative regions of China, Hong Kong and Macau. The economies of Economy of China, China, Economy of Ja ...
, replacing many (but not all) of the previously-spoken
pre-Indo-European languages
The pre-Indo-European languages are any of several ancient languages, not necessarily related to one another, that existed in Prehistoric Europe, Asia Minor, Ancient Iran and United Nations geoscheme for Asia#Southern_Asia, Southern Asia before ...
of this extensive area. However
Semitic languages
The Semitic languages are a branch of the Afroasiatic languages, Afroasiatic language family. They include Arabic,
Amharic, Tigrinya language, Tigrinya, Aramaic, Hebrew language, Hebrew, Maltese language, Maltese, Modern South Arabian language ...
remain dominant in much of the
Middle East
The Middle East (term originally coined in English language) is a geopolitical region encompassing the Arabian Peninsula, the Levant, Turkey, Egypt, Iran, and Iraq.
The term came into widespread usage by the United Kingdom and western Eur ...
and
North Africa
North Africa (sometimes Northern Africa) is a region encompassing the northern portion of the African continent. There is no singularly accepted scope for the region. However, it is sometimes defined as stretching from the Atlantic shores of t ...
, and
Caucasian languages
The Caucasian languages comprise a large and extremely varied array of languages spoken by more than ten million people in and around the Caucasus Mountains, which lie between the Black Sea and the Caspian Sea.
Linguistic comparison allows t ...
in much of the
Caucasus
The Caucasus () or Caucasia (), is a region spanning Eastern Europe and Western Asia. It is situated between the Black Sea and the Caspian Sea, comprising parts of Southern Russia, Georgia, Armenia, and Azerbaijan. The Caucasus Mountains, i ...
region. Similarly in
Europe
Europe is a continent located entirely in the Northern Hemisphere and mostly in the Eastern Hemisphere. It is bordered by the Arctic Ocean to the north, the Atlantic Ocean to the west, the Mediterranean Sea to the south, and Asia to the east ...
and the
Urals
The Ural Mountains ( ),; , ; , or simply the Urals, are a mountain range in Eurasia that runs north–south mostly through Russia, from the coast of the Arctic Ocean to the river Ural (river), Ural and northwestern Kazakhstan. the
Uralic languages
The Uralic languages ( ), sometimes called the Uralian languages ( ), are spoken predominantly in Europe and North Asia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian. Other languages with speakers ab ...
(such as Hungarian, Finnish, Estonian etc.) remain, as does
Basque
Basque may refer to:
* Basques, an ethnic group of Spain and France
* Basque language, their language
Places
* Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France
* Basque Country (autonomous co ...
, a pre-Indo-European isolate.
Despite being unaware of their common linguistic origin, diverse groups of Indo-European speakers continued to culturally dominate and often replace the indigenous languages of the western two-thirds of Eurasia. By the beginning of the
Common Era
Common Era (CE) and Before the Common Era (BCE) are year notations for the Gregorian calendar (and its predecessor, the Julian calendar), the world's most widely used calendar era. Common Era and Before the Common Era are alternatives to the ...
, Indo-European peoples controlled almost the entirety of this area: the Celts western and central Europe, the Romans southern Europe, the Germanic peoples northern Europe, the Slavs eastern Europe, the Iranian peoples most of western and central Asia and parts of eastern Europe, and the Indo-Aryan peoples in the
Indian subcontinent
The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
, with the
Tocharians
The Tocharians or Tokharians ( ; ) were speakers of the Tocharian languages, a group of Indo-European languages known from around 7,600 documents from the 6th and 7th centuries, found on the northern edge of the Tarim Basin (modern-day Xinj ...
inhabiting the Indo-European frontier in western China. By the medieval period, only the
Semitic,
Dravidian,
Caucasian
Caucasian may refer to:
Common meanings
*Anything from the Caucasus region or related to it
** Ethnic groups in the Caucasus
** ''Caucasian Exarchate'' (1917–1920), an ecclesiastical exarchate of the Russian Orthodox Church in the Caucasus re ...
, and
Uralic languages
The Uralic languages ( ), sometimes called the Uralian languages ( ), are spoken predominantly in Europe and North Asia. The Uralic languages with the most native speakers are Hungarian, Finnish, and Estonian. Other languages with speakers ab ...
, and the language isolate
Basque
Basque may refer to:
* Basques, an ethnic group of Spain and France
* Basque language, their language
Places
* Basque Country (greater region), the homeland of the Basque people with parts in both Spain and France
* Basque Country (autonomous co ...
remained of the (relatively)
indigenous languages of Europe and the western half of Asia.
Despite medieval invasions by
Eurasian nomads
Eurasian nomads form groups of nomad, nomadic peoples who have lived in various areas of the Eurasian Steppe. History largely knows them via frontier historical sources from Europe and Asia.
The steppe nomads had no permanent abode, but travelle ...
, a group to which the Proto-Indo-Europeans had once belonged, Indo-European expansion reached another peak in the
early modern period
The early modern period is a Periodization, historical period that is defined either as part of or as immediately preceding the modern period, with divisions based primarily on the history of Europe and the broader concept of modernity. There i ...
with the dramatic increase in the population of the
Indian subcontinent
The Indian subcontinent is a physiographic region of Asia below the Himalayas which projects into the Indian Ocean between the Bay of Bengal to the east and the Arabian Sea to the west. It is now divided between Bangladesh, India, and Pakista ...
and European expansionism throughout the globe during the
Age of Discovery
The Age of Discovery (), also known as the Age of Exploration, was part of the early modern period and overlapped with the Age of Sail. It was a period from approximately the 15th to the 17th century, during which Seamanship, seafarers fro ...
, as well as the continued replacement and assimilation of surrounding non-Indo-European languages and peoples due to increased state centralization and
nationalism
Nationalism is an idea or movement that holds that the nation should be congruent with the state. As a movement, it presupposes the existence and tends to promote the interests of a particular nation, Smith, Anthony. ''Nationalism: Theory, I ...
. These trends compounded throughout the modern period due to the general global
population growth
Population growth is the increase in the number of people in a population or dispersed group. The World population, global population has grown from 1 billion in 1800 to 8.2 billion in 2025. Actual global human population growth amounts to aroun ...
and the results of
European colonization
The phenomenon of colonization is one that stretches around the globe and across time. Ancient and medieval colonialism was practiced by various civilizations such as the Phoenicians, Babylonians, Persians, Greeks, Romans, Han Chinese, and A ...
of the
Western Hemisphere
The Western Hemisphere is the half of the planet Earth that lies west of the Prime Meridian (which crosses Greenwich, London, United Kingdom) and east of the 180th meridian.- The other half is called the Eastern Hemisphere. Geopolitically, ...
and
Oceania
Oceania ( , ) is a region, geographical region including Australasia, Melanesia, Micronesia, and Polynesia. Outside of the English-speaking world, Oceania is generally considered a continent, while Mainland Australia is regarded as its co ...
, leading to an explosion in the number of Indo-European speakers as well as the territories inhabited by them.
Due to colonization and the modern dominance of Indo-European languages in the fields of politics, global science, technology, education, finance, and sports, even many modern countries whose populations largely speak non-Indo-European languages have Indo-European languages as official languages, and the majority of the global population speaks at least one Indo-European language. The overwhelming majority of
languages used on the Internet
Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed forms, and may also be conveyed through writing. Human language is ch ...
are Indo-European, with
English continuing to lead the group; English in general has in many respects
become the ''lingua franca'' of global communication.
See also
Indo-European Swadesh lists
*
Grammatical conjugation
In linguistics, conjugation ( ) is the creation of derived forms of a verb from its principal parts by inflection (alteration of form according to rules of grammar). For instance, the verb ''break'' can be conjugated to form the words ''break'' ...
* ''
The Horse, the Wheel, and Language'' (book)
*
Indo-European copula
A feature common to all Indo-European languages is the presence of a verb corresponding to the English verb ''to be''.
General features
This verb has two basic meanings:
*In a less marked context it is a simple copula (''I’m tired''; ''Th ...
*
Indo-European sound laws
As the Proto-Indo-European language (PIE) broke up, its sound system diverged as well, as evidenced in various sound laws associated with the daughter Indo-European languages.
Especially notable is the palatalization that produced the satem ...
*
Indo-European studies
Indo-European studies () is a field of linguistics and an interdisciplinary field of study dealing with Indo-European languages, both current and extinct. The goal of those engaged in these studies is to amass information about the hypothetical p ...
*
Indo-Semitic languages
*
Indo-Uralic languages
Indo-Uralic is a controversial linguistic hypothesis proposing a genealogical family consisting of Indo-European and Uralic.
The suggestion of a genetic relationship between Indo-European and Uralic is often credited to the Danish linguist V ...
*
Eurasiatic languages
Eurasiatic is a hypothetical and controversial language macrofamily proposal that would include many language families historically spoken in northern, western, and southern Eurasia.
The idea of a Eurasiatic superfamily dates back more than 100 ...
*
Language family
A language family is a group of languages related through descent from a common ancestor, called the proto-language of that family. The term ''family'' is a metaphor borrowed from biology, with the tree model used in historical linguistics ...
*
Languages of Asia
Asia is home to hundreds of languages comprising several families and some unrelated isolates. The most spoken language families on the continent include Austroasiatic languages, Austroasiatic, Austronesian languages, Austronesian, Japonic langua ...
*
Languages of Europe
There are over 250 languages indigenous to Europe, and most belong to the Indo-European language family. Out of a demographics of Europe, total European population of 744 million as of 2018, some 94% are native speakers of an Indo-European lang ...
*
Languages of India
Languages of India belong to several list of language families, language families, the major ones being the Indo-Aryan languages spoken by 78.05% of Indian people, Indians and the Dravidian languages spoken by 19.64% of Indians; both fami ...
*
Linguistics
Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
*
List of Indo-European languages
This is a list of languages in the Indo-European languages, Indo-European language family. It contains a large number of individual languages, together spoken by roughly half the world's population.
Numbers of languages and language groups
T ...
*
Proto-Indo-European root
The roots of the reconstructed Proto-Indo-European language (PIE) are basic parts of words to carry a lexical meaning, so-called morphemes. PIE roots usually have verbal meaning like "to eat" or "to run". Roots never occurred alone in the langu ...
*
Proto-Indo-European religion
Proto-Indo-European mythology is the body of myths and deities associated with the Proto-Indo-Europeans, speakers of the hypothesized Proto-Indo-European language. Although the mythological motifs are not directly attested – since Proto-In ...
Notes
References
Citations
Sources
*
*
*
*
* Paperback: .
*
*
*
*
*
*
*
*
*
*
*
*
*
Part II via Internet Archive
*
** Reprinted in
*
*
Further reading
*
*
*
*
*
*
*
*
*
*
*
*
*
* Asadpour, Hiwa, and Thomas Jügel, eds. Word Order Variation: Semitic, Turkic and Indo-European Languages in Contact. Vol. 31. Walter de Gruyter GmbH & Co KG, 2022.
External links
Encyclopedia of Indo-European Culture (1997)
Databases
*
*
*
* .
*
*
an online collection of introductory videos to Ancient Indo-European languages produced by the University of Göttingen
Lexica
*
*
*
*
{{Authority control
Language families