Genetic model
   HOME

TheInfoList



OR:

In
historical linguistics Historical linguistics, also termed diachronic linguistics, is the scientific study of language change over time. Principal concerns of historical linguistics include: # to describe and account for observed changes in particular languages # ...
, the tree model (also Stammbaum, genetic, or cladistic model) is a model of the evolution of languages analogous to the concept of a
family tree A family tree, also called a genealogy or a pedigree chart, is a chart representing family relationships in a conventional tree structure. More detailed family trees, used in medicine and social work, are known as genograms. Representations of ...
, particularly a phylogenetic tree in the
biological evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation t ...
of
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
. As with species, each language is assumed to have evolved from a single parent or "mother" language, with languages that share a common ancestor belonging to the same
language family A language family is a group of languages related through descent from a common ''ancestral language'' or ''parental language'', called the proto-language of that family. The term "family" reflects the tree model of language origination in h ...
. Popularized by the German linguist
August Schleicher August Schleicher (; 19 February 1821 – 6 December 1868) was a German linguist. His great work was ''A Compendium of the Comparative Grammar of the Indo-European Languages'' in which he attempted to reconstruct the Proto-Indo-European languag ...
in 1853, François (2014). the tree model has always been a common method of describing genetic relationships between languages since the first attempts to do so. It is central to the field of comparative linguistics, which involves using evidence from known languages and observed rules of language feature evolution to identify and describe the hypothetical
proto-language In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unattes ...
s ancestral to each language family, such as
Proto-Indo-European Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. Its proposed features have been derived by linguistic reconstruction from documented Indo-European languages. No direct record of Proto-Indo- ...
and the
Indo-European languages The Indo-European languages are a language family native to the overwhelming majority of Europe, the Iranian plateau, and the northern Indian subcontinent. Some European languages of this family, English, French, Portuguese, Russian, Dutc ...
. However, this is largely a theoretical, qualitative pursuit, and linguists have always emphasized the inherent limitations of the tree model due to the large role played by
horizontal transmission Horizontal transmission is the transmission of organisms between biotic and/or abiotic members of an ecosystem that are not in a parent-progeny relationship. This concept has been generalized to include transmissions of infectious agents, symbiont ...
in language evolution, ranging from
loanword A loanword (also loan word or loan-word) is a word at least partly assimilated from one language (the donor language) into another language. This is in contrast to cognates, which are words in two or more languages that are similar because t ...
s to
creole language A creole language, or simply creole, is a stable natural language that develops from the simplifying and mixing of different languages into a new one within a fairly brief period of time: often, a pidgin evolved into a full-fledged language. ...
s that have multiple mother languages. The
wave model In historical linguistics, the wave model or wave theory (German ''Wellentheorie'') is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, affecting ...
was developed in 1872 by Schleicher's student Johannes Schmidt as an alternative to the tree model that incorporates horizontal transmission. The tree model also has the same limitations as biological taxonomy with respect to the species problem of quantizing a continuous phenomenon that includes exceptions like
ring species In biology, a ring species is a connected series of neighbouring populations, each of which interbreeds with closely sited related populations, but for which there exist at least two "end" populations in the series, which are too distantly relate ...
in biology and
dialect continua A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated variet ...
in language. The concept of a linkage was developed in response and refers to a group of languages that evolved from a dialect continuum rather than from linguistically isolated child languages of a single language.


History


Old Testament and St. Augustine

Augustine of Hippo supposed that each of the descendants of Noah founded a nation and that each nation was given its own language:
Assyrian Assyrian may refer to: * Assyrian people, the indigenous ethnic group of Mesopotamia. * Assyria, a major Mesopotamian kingdom and empire. ** Early Assyrian Period ** Old Assyrian Period ** Middle Assyrian Empire ** Neo-Assyrian Empire * Assyrian ...
for
Assur Aššur (; Sumerian: AN.ŠAR2KI, Assyrian cuneiform: ''Aš-šurKI'', "City of God Aššur"; syr, ܐܫܘܪ ''Āšūr''; Old Persian ''Aθur'', fa, آشور: ''Āšūr''; he, אַשּׁוּר, ', ar, اشور), also known as Ashur and Qal ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
for Heber, and so on. In all he identified 72 nations, tribal founders and languages. The confusion and dispersion occurred in the time of
Peleg Peleg ( he, פֶּלֶג, Péleḡ, in pausa he, פָּלֶג, Pā́leḡ, "division"; grc-x-biblical, Φάλεκ, Phálek) is mentioned in the Hebrew Bible as one of the two sons of Eber, an ancestor of the Ishmaelites and the Israelites, ac ...
, son of Heber, son of
Shem Shem (; he, שֵׁם ''Šēm''; ar, سَام, Sām) ''Sḗm''; Ge'ez: ሴም, ''Sēm'' was one of the sons of Noah in the book of Genesis and in the book of Chronicles, and the Quran. The children of Shem were Elam, Ashur, Arphaxad, Lu ...
, son of Noah. Augustine made a
hypothesis A hypothesis (plural hypotheses) is a proposed explanation for a phenomenon. For a hypothesis to be a scientific hypothesis, the scientific method requires that one can test it. Scientists generally base scientific hypotheses on previous obse ...
not unlike those of later historical linguists, that the family of Heber "preserved that language not unreasonably believed to have been the common language of the race ... thenceforth named Hebrew." Most of the 72 languages, however, date to many generations after Heber. St. Augustine solves this first problem by supposing that Heber, who lived 430 years, was still alive when God assigned the 72.


Ursprache, the language of paradise

St. Augustine's hypothesis stood without major question for over a thousand years. Then, in a series of tracts, published in 1684, expressing skepticism concerning various beliefs, especially Biblical, Sir Thomas Browne wrote:
"Though the earth were widely peopled before the flood ... yet whether, after a large dispersion, and the space of sixteen hundred years, men maintained so uniform a language in all parts, ... may very well be doubted."
By then, discovery of the
New World The term ''New World'' is often used to mean the majority of Earth's Western Hemisphere, specifically the Americas."America." ''The Oxford Companion to the English Language'' (). McArthur, Tom, ed., 1992. New York: Oxford University Press, p. ...
and exploration of the
Far East The ''Far East'' was a European term to refer to the geographical regions that includes East and Southeast Asia as well as the Russian Far East to a lesser extent. South Asia is sometimes also included for economic and cultural reasons. The ter ...
had brought knowledge of numbers of new languages far beyond the 72 calculated by St. Augustine. Citing the Native American languages, Browne suggests the "confusion of tongues at first fell only upon those present in Sinaar at the work of Babel ...." For those "about the foot of the hills, whereabout the ark rested ... their primitive language might in time branch out into several parts of Europe and This is an inkling of a tree. In Browne's view, simplification from a larger aboriginal language than Hebrew could account for the differences in language. He suggests ancient Chinese, from which the others descended by "confusion, admixtion and corruption". Later he invokes "commixture and alteration." Browne reports a number of reconstructive activities by the scholars of the times:
"The learned Casaubon conceiveth that a dialogue might be composed in Saxon, only of such words as are derivable from the Greek ...
Verstegan Richard Rowlands, born Richard Verstegan (c. 1550 – 1640), was an Anglo-Dutch antiquary, publisher, humorist and translator. Verstegan was born in East London the son of a cooper; his grandfather, Theodore Roland Verstegen, was a refugee f ...
made no doubt that he could contrive a letter that might be understood by the English, Dutch, and East Frislander ... And if, as the learned Buxhornius contendeth, the Scythian language as the mother tongue runs throughout the nations of Europe, and even as far as Persia, the community on many words, between so many nations, hath more reasonable traduction and were rather derivable from the common tongue diffused through them all, than from any particular nation, which hath also borrowed and holdeth but at second hand."
The confusion at the Tower of Babel was thus removed as an obstacle by setting it aside. Attempts to find similarities in all languages were resulting in the gradual uncovering of an ancient master language from which all the other languages derive. Browne undoubtedly did his writing and thinking well before 1684. In that same revolutionary century in Britain James Howell published of ''
Epistolae Ho-Elianae ''Epistolae Ho-Elianae'' (or ''Familiar Letters'') is a literary work by the 17th-century Anglo-Welsh historian and writer, James Howell. It was mainly written when Howell was in the Fleet Prison, during the 1640s; but its content reflects earlie ...
'', quasi-fictional letters to various important persons in the realm containing valid historical information. In Letter LVIII the metaphor of a tree of languages appears fully developed short of being a professional linguist's view:
"I will now hoist sail for the Netherlands, whose language is the same dialect with the English, and was so from the beginning, being both of them derived from the high Dutch owell is wrong here The Danish also is but a branch of the same tree ... Now the High Dutch or Teutonick Tongue, is one of the prime and most spacious Maternal Languages of Europe ... it was the language of the
Goths The Goths ( got, 𐌲𐌿𐍄𐌸𐌹𐌿𐌳𐌰, translit=''Gutþiuda''; la, Gothi, grc-gre, Γότθοι, Gótthoi) were a Germanic people who played a major role in the fall of the Western Roman Empire and the emergence of medieval Europe ...
and
Vandals The Vandals were a Germanic people who first inhabited what is now southern Poland. They established Vandal kingdoms on the Iberian Peninsula, Mediterranean islands, and North Africa in the fifth century. The Vandals migrated to the area betw ...
, and continueth yet of the greatest part of
Poland Poland, officially the Republic of Poland, is a country in Central Europe. It is divided into 16 administrative provinces called voivodeships, covering an area of . Poland has a population of over 38 million and is the fifth-most populou ...
and
Hungary Hungary ( hu, Magyarország ) is a landlocked country in Central Europe. Spanning of the Pannonian Basin, Carpathian Basin, it is bordered by Slovakia to the north, Ukraine to the northeast, Romania to the east and southeast, Serbia to the ...
, who have a Dialect of hers for their vulgar tongue ... Some of her writers would make this world believe that she was the language spoken in paradise."
The search for "the language of paradise" was on among all the linguists of Europe. Those who wrote in Latin called it the ''lingua prima'', the ''lingua primaeva'' or the ''lingua primigenia.'' In English it was the Adamic language; in German, the ''Ursprache'' or the ''hebräische Ursprache'' if one believed it was Hebrew. This mysterious language had the aura of purity and incorruption about it, and those qualities were the standards used to select candidates. This concept of ''Ursprache'' came into use well before the
neo-grammarian The Neogrammarians (German: ''Junggrammatiker'', 'young grammarians') were a German school of linguists, originally at the University of Leipzig, in the late 19th century who proposed the Neogrammarian hypothesis of the regularity of sound change. ...
s adopted it for their proto-languages. The gap between the widely divergent families of languages remained unclosed.


Indo-European model

On February 2, 1786, Sir William Jones delivered his ''Third Anniversary Discourse'' to the
Asiatic Society The Asiatic Society is a government of India organisation founded during the Company rule in India to enhance and further the cause of "Oriental research", in this case, research into India and the surrounding regions. It was founded by the p ...
as its president on the topic of the
Hindus Hindus (; ) are people who religiously adhere to Hinduism. Jeffery D. Long (2007), A Vision for Hinduism, IB Tauris, , pages 35–37 Historically, the term has also been used as a geographical, cultural, and later religious identifier for ...
. In it he applied the logic of the tree model to three languages, Greek, Latin and Sanskrit, but for the first time in history on purely linguistic grounds, noting "a stronger affinity, both in the roots of the verbs and in the forms of grammar, than could possibly have been produced by accident; ...." He went on to postulate that they sprang from "some common source, which, perhaps, no longer exists." To them he added Gothic, Celtic and Persian as "to the same family." Jones did not name his "common source" nor develop the idea further, but it was taken up by the linguists of the times. In the ''(London) Quarterly Review'' of late 1813-1814, Thomas Young published a review of
Johann Christoph Adelung Johann Christoph Adelung (8 August 173210 September 1806) was a German grammarian and philologist. Biography He was born at Spantekow, in Western Pomerania, and educated at schools in Anklam and Berge Monastery, Magdeburg, and the University o ...
's ''Mithridates, oder allgemeine Sprachenkunde'' ("Mithridates, or a General History of Languages"), Volume I of which had come out in 1806, and Volumes II and III, , continued by Johann Severin Vater. Adelung's work described some 500 "languages and dialects" and hypothesized a universal descent from the language of paradise, located in Kashmir central to the total range of the 500. Young begins by pointing out Adelung's indebtedness to
Conrad Gesner Conrad Gessner (; la, Conradus Gesnerus 26 March 1516 – 13 December 1565) was a Swiss physician, naturalist, bibliographer, and philologist. Born into a poor family in Zürich, Switzerland, his father and teachers quickly realised his tale ...
's ''Mithridates, de Differentiis Linguarum'' of 1555 and other subsequent catalogues of languages and Young undertakes to present Adelung's classification. The
monosyllabic In linguistics, a monosyllable is a word or utterance of only one syllable. It is most commonly studied in the fields of phonology and morphology and it has no semantic content. The word has originated from the Greek language. "Yes", "no", "jump", ...
type is most ancient and primitive, spoken in Asia, to the east of Eden, in the direction of Adam's exit from Eden. Then follows Jones' group, still without a name, but attributed to Jones: "Another ancient and extensive class of languages united by a greater number of resemblances than can well be altogether accidental." For this class he offers a name, "Indoeuropean," the first known linguistic use of the word, but not its first known use. The
British East India Company The East India Company (EIC) was an English, and later British, joint-stock company founded in 1600 and dissolved in 1874. It was formed to trade in the Indian Ocean region, initially with the East Indies (the Indian subcontinent and South ...
was using "Indo-European commerce" to mean the trade of commodities between India and Europe. All the evidence Young cites for the ancestral group are the most similar words: mother, father, etc. Adelung's additional classes were the Tataric, the African and the American, which depend on geography and a presumed descent from Eden. Young does not share Adelung's enthusiasm for the language of paradise, and brands it as mainly speculative. Young's designation, successful in English, was only one of several candidates proposed between 1810 and 1867: indo-germanique (
Conrad Malte-Brun Conrad Malte-Brun (12 August 177514 December 1826), born Malthe Conrad Bruun, and sometimes referred to simply as Malte-Brun, was a Dano-French geographer and journalist. His second son, Victor Adolphe Malte-Brun, was also a geographer. Today he ...
, 1810), japetisk (
Rasmus Christian Rask Rasmus Kristian Rask (; born Rasmus Christian Nielsen Rasch; 22 November 1787 – 14 November 1832) was a Danish linguist and philologist. He wrote several grammars and worked on comparative phonology and morphology. Rask traveled extensively to ...
, 1815), Indo-Germanisch (
Julius Klaproth Heinrich Julius Klaproth (11 October 1783 – 28 August 1835) was a German linguist, historian, ethnographer, author, orientalist and explorer. As a scholar, he is credited along with Jean-Pierre Abel-Rémusat, with being instrumental in turni ...
, 1823), indisch-teutsch (F. Schmitthenner, 1826), sanskritisch (
Wilhelm von Humboldt Friedrich Wilhelm Christian Karl Ferdinand von Humboldt (, also , ; ; 22 June 1767 – 8 April 1835) was a Prussian philosopher, linguist, government functionary, diplomat, and founder of the Humboldt University of Berlin, which was named afte ...
, 1827), indokeltisch (A. F. Pott, 1840), arioeuropeo (
Graziadio Isaia Ascoli Graziadio Isaia Ascoli (; 16 July 1829 – 21 January 1907) was an Italian linguist. Life and work Ascoli was born in an Italian-speaking Jewish family in the multiethnic town of Gorizia, then part of the Austrian Empire (now in Italy). Alre ...
, 1854), Aryan (
Max Müller Friedrich Max Müller (; 6 December 1823 – 28 October 1900) was a German-born philologist and Orientalist, who lived and studied in Britain for most of his life. He was one of the founders of the western academic disciplines of Indian ...
, 1861) and aryaque (H. Chavée, 1867). These men were all polyglots and prodigies in languages. (Klaproth, for example, the author of the successful German-language candidate, Indo-Germanisch, who criticised Jones for his uncritical method, knew Chinese, Japanese, Tibetan and a number of other languages with their scripts.) The concept of a Biblical Ursprache appealed to their imagination. As hope of finding it gradually died they fell back on the growing concept of common Indo-European spoken by nomadic tribes on the plains of Eurasia, and although they made a good case that this language can be deduced by the methods of comparative linguistics, in fact that is not how they obtained it. It was the one case in which their efforts to find the Ursprache succeeded.


Neogrammarian model

The model is due in its most strict formulation to the
Neogrammarian The Neogrammarians (German: ''Junggrammatiker'', 'young grammarians') were a German school of linguists, originally at the University of Leipzig, in the late 19th century who proposed the Neogrammarian hypothesis of the regularity of sound change ...
s. The model relies on earlier conceptions of William Jones,
Franz Bopp Franz Bopp (; 14 September 1791 – 23 October 1867) was a German linguist known for extensive and pioneering comparative work on Indo-European languages. Early life Bopp was born in Mainz, but the political disarray in the Republic of Mai ...
and
August Schleicher August Schleicher (; 19 February 1821 – 6 December 1868) was a German linguist. His great work was ''A Compendium of the Comparative Grammar of the Indo-European Languages'' in which he attempted to reconstruct the Proto-Indo-European languag ...
by adding the exceptionlessness of the
sound laws A sound change, in historical linguistics, is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic chang ...
and the regularity of the process. The linguist perhaps most responsible for establishing the link to Darwinism was August Schleicher. That he was comparing his ''Stammbaum'', or family tree of languages, to Darwin's presentation of evolution shortly after that presentation, is proved by the open letter he wrote in 1863 to Ernst Haeckel, published posthumously, however. In 1869, Haeckel had suggested he read ''Origin of Species''. After reading it Schleicher wrote ''Die Darwinische Theorie und die Sprachwissenschaft'', "Darwinism tested by the Science of Language." In a scenario reminiscent of that between Darwin and Wallace over the discovery of evolution (both discovered it independently), Schleicher endorsed Darwin's presentation, but criticised it for not inserting any species. He then presented a ''Stammbaum'' of languages, which, however, was not the first he had published. The evolution of languages was not the source of Darwin's theory of evolution. He had based that on variation of species, such as he had observed in finches in the Galapagos Islands, who had appeared to be modifications of a common ancestor. Selection of domestic species to produce a new variety also played a role in his conclusions. The first edition of ''Origin of Species'' in 1859 discusses the language tree as though ''de novo'' under the topic of classification. Darwin criticises the synchronic method devised by
Linnaeus Carl Linnaeus (; 23 May 1707 – 10 January 1778), also known after his ennoblement in 1761 as Carl von Linné Blunt (2004), p. 171. (), was a Swedish botanist, zoologist, taxonomist, and physician who formalised binomial nomenclature, the ...
, suggesting that it be replaced by a "natural arrangement" based on evolution. He says:
"It may be worth while to illustrate this view of classification, by taking the case of languages. If we possessed a perfect pedigree of mankind, a genealogical arrangement of the races of man would afford the best classification of the various languages now spoken throughout the world; and if all extinct languages, and all intermediate and slowly changing dialects, had to be included, such an arrangement would, I think, be the only possible one. Yet it might be that some very ancient language had altered little, and had given rise to few new languages, whilst others (owing to the spreading and subsequent isolation and states of civilisation of the several races, descended from a common race) had altered much, and had given rise to many new languages and dialects. The various degrees of difference in the languages from the same stock, would have to be expressed by groups subordinate to groups; but the proper or even only possible arrangement would still be genealogical; and this would be strictly natural, as it would connect together all languages, extinct and modern, by the closest affinities, and would give the filiation and origin of each tongue."
Schleicher had never heard of Darwin before Haeckel brought him to Schleicher's attention. He had published his own work on the ''Stammbaum'' in an article of 1853, six years before the first edition of ''Origin of Species'' in 1859. The concept of descent of languages was by no means new.
Thomas Jefferson Thomas Jefferson (April 13, 1743 – July 4, 1826) was an American statesman, diplomat, lawyer, architect, philosopher, and Founding Father who served as the third president of the United States from 1801 to 1809. He was previously the natio ...
, a devout linguist himself, had proposed that the continual necessity for
neologism A neologism Greek νέο- ''néo''(="new") and λόγος /''lógos'' meaning "speech, utterance"] is a relatively recent or isolated term, word, or phrase that may be in the process of entering common use, but that has not been fully accepted int ...
s implies that languages must "progress" or "advance." These ideas foreshadow evolution of either biological species or languages, but after the contact of Schleicher with Darwin's ideas, and perhaps Darwin's contact with the historical linguists, Evolution and language change were inextricably linked, and would become the basis for classification. Now, as then, the main problems would be to prove specific lines of descent, and to identify the branch points.


Phylogenetic tree

The old metaphor was given an entirely new meaning under the old name by
Joseph Harold Greenberg Joseph Harold Greenberg (May 28, 1915 – May 7, 2001) was an American linguist, known mainly for his work concerning linguistic typology and the genetic classification of languages. Life Early life and education Joseph Greenberg was born on ...
in a series of essays beginning about 1950. Since the adoption of the family tree metaphor by the linguists, the concept of
evolution Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
had been proposed by
Charles Darwin Charles Robert Darwin ( ; 12 February 1809 – 19 April 1882) was an English naturalist, geologist, and biologist, widely known for his contributions to evolutionary biology. His proposition that all species of life have descended ...
and was generally accepted in biology.
Taxonomy Taxonomy is the practice and science of categorization or classification. A taxonomy (or taxonomical classification) is a scheme of classification, especially a hierarchical classification, in which things are organized into groups or types. ...
, the classification of living things, had already been invented by
Carl Linnaeus Carl Linnaeus (; 23 May 1707 – 10 January 1778), also known after his Nobility#Ennoblement, ennoblement in 1761 as Carl von Linné#Blunt, Blunt (2004), p. 171. (), was a Swedish botanist, zoologist, taxonomist, and physician who formalise ...
. It used a
binomial nomenclature In taxonomy, binomial nomenclature ("two-term naming system"), also called nomenclature ("two-name naming system") or binary nomenclature, is a formal system of naming species of living things by giving each a name composed of two parts, bot ...
to assign a
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
name and a
genus Genus ( plural genera ) is a taxonomic rank used in the biological classification of living and fossil organisms as well as viruses. In the hierarchy of biological classification, genus comes above species and below family. In binomial nom ...
name to every known living organism. These were arranged in a biological hierarchy under several phyla, or most general groups, branching ultimately to the various species. The basis for this
biological classification In biology, taxonomy () is the scientific study of naming, defining ( circumscribing) and classifying groups of biological organisms based on shared characteristics. Organisms are grouped into taxa (singular: taxon) and these groups are give ...
was the observed shared physical features of the species. Darwin, however, reviving another ancient metaphor, the
tree of life The tree of life is a fundamental archetype in many of the world's mythological, religious, and philosophical traditions. It is closely related to the concept of the sacred tree.Giovino, Mariana (2007). ''The Assyrian Sacred Tree: A Hist ...
, hypothesized that the groups of the Linnaean classification (today's
taxa In biology, a taxon (back-formation from ''taxonomy''; plural taxa) is a group of one or more populations of an organism or organisms seen by taxonomists to form a unit. Although neither is required, a taxon is usually known by a particular nam ...
), descended in a tree structure over time from simplest to most complex. The Linnaean hierarchical tree was synchronic; Darwin envisioned a diachronic process of common descent. Where Linnaeus had conceived
rank Rank is the relative position, value, worth, complexity, power, importance, authority, level, etc. of a person or object within a ranking, such as: Level or position in a hierarchical organization * Academic rank * Diplomatic rank * Hierarchy * ...
s, which were consistent with the
great chain of being The great chain of being is a hierarchical structure of all matter and life, thought by medieval Christianity to have been decreed by God. The chain begins with God and descends through angels, humans, animals and plants to minerals. The great ...
adopted by the rationalists, Darwin conceived lineages. Over the decades after Darwin it became clear that the ranks of Linnaeus' hierarchy did not correspond exactly to the lineages. It became the prime goal of taxonomy to discover the lineages and alter the classification to reflect them, which it did under the overall guidance of the
Nomenclature Codes Nomenclature codes or codes of nomenclature are the various rulebooks that govern biological taxonomic nomenclature, each in their own broad field of organisms. To an end-user who only deals with names of species, with some awareness that specie ...
, rule books kept by international organizations to authorize and publish proposals to reclassify species and other taxa. The new approach was called
phylogeny A phylogenetic tree (also phylogeny or evolutionary tree Felsenstein J. (2004). ''Inferring Phylogenies'' Sinauer Associates: Sunderland, MA.) is a branching diagram or a tree showing the evolutionary relationships among various biological spe ...
, the "generation of phyla," which devised a new tree metaphor, the phylogenetic tree. One unit in the tree and all its offspring units were a clade and the discovery of clades was
cladistics Cladistics (; ) is an approach to biological classification in which organisms are categorized in groups (" clades") based on hypotheses of most recent common ancestry. The evidence for hypothesized relationships is typically shared derived cha ...
. Greenberg began writing during a time when phylogenetic systematics lacked the tools available to it later: the computer (computational systematics) and DNA sequencing (
molecular systematics Molecular phylogenetics () is the branch of phylogeny that analyzes genetic, hereditary molecular differences, predominantly in DNA sequences, to gain information on an organism's evolutionary relationships. From these analyses, it is possible to ...
). To discover a cladistic relationship researchers relied on as large a number of morphological similarities among species as could be defined and tabulated. Statistically the greater the number of similarities the more likely species were to be in the same clade. This approach appealed to Greenberg, who was interested in discovering
linguistic universal A linguistic universal is a pattern that occurs systematically across natural languages, potentially true for all of them. For example, ''All languages have nouns and verbs'', or ''If a language is spoken, it has consonants and vowels.'' Research ...
s. Altering the tree model to make the family tree a phylogenetic tree he said:
"Any language consists of thousands of forms with both sound and meaning ... any sound whatever can express any meaning whatever. Therefore, if two languages agree in a considerable number of such items ... we necessarily draw a conclusion of common historical origin. Such genetic classifications are not arbitrary ... the analogy here to biological classification is extremely close ... just as in biology we classify species in the same genus or high unit because the resemblances are such as to suggest a hypothesis of common descent, so with genetic hypotheses in language."
In this analogy, a language family is like a clade, the languages are like
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
, the
proto-language In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unattes ...
is like an ancestor
taxon In biology, a taxon ( back-formation from '' taxonomy''; plural taxa) is a group of one or more populations of an organism or organisms seen by taxonomists to form a unit. Although neither is required, a taxon is usually known by a particular n ...
, the language tree is like a phylogenetic tree and languages and dialects are like species and varieties. Greenberg formulated large tables of characteristics of hitherto neglected languages of Africa, the Americas, Indonesia and northern Eurasia and typed them according to their similarities. He called this approach " typological classification", arrived at by
descriptive linguistics In the study of language, description or descriptive linguistics is the work of objectively analyzing and describing how language is actually used (or how it was used in the past) by a speech community. François & Ponsonnet (2013). All acad ...
rather than by comparative linguistics.


Dates and glottochronology

The
comparative method In linguistics, the comparative method is a technique for studying the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor and then extrapolating backwards t ...
has been used by historical linguists to piece together tree models utilizing discrete lexical, morphological, and phonological data. Chronology can be found but there is no absolute date estimates utilizing this system.
Glottochronology Glottochronology (from Attic Greek γλῶττα ''tongue, language'' and χρόνος ''time'') is the part of lexicostatistics which involves comparative linguistics and deals with the chronological relationship between languages.Sheila Embleton ...
enables absolute dates to be estimated. Shared cognates (cognates meaning to have common historical origin) are calculate divergence times. However the method was found to be later discredited due to the data being unreliable. Due to this historical linguists have trouble with exact age estimation when pinpointing the age of the Indo-European language family. It could range from 4000 BP to 40,000 BP, or anywhere in-between those dates according to Dixon sourced from the rise and fall of language, (Cambridge University Press). As seen in the article here. Possible solutions for Glottochronology are forthcoming due to
computational phylogenetic Computational phylogenetics is the application of computational algorithms, methods, and programs to phylogenetic
methods. Techniques such as using models of evolution improves accuracy of tree branch length and topology. There for, using computational phylogenetic methods computational methods enable researchers to analyze linguistic data from evolutionary biology. This further assists in testing theories against each other, such as the Kurgan theory and the Anatolian theory, both claiming origins of Info-European languages.


Computational phylogenetics in historical linguistics

The comparative method compares features of various languages to assess how similar one language is to another. The results of such an assessment are data-oriented; that is, the results depend on the number of features and the number of languages compared. Until the arrival of the computer on the historical linguistics landscape, the numbers in both cases were necessarily small. The effect was of trying to depict a photograph using a small number of large pixels, or picture units. The limitations of the Tree Model were all too painfully apparent, resulting in complaints from the major historical linguists. In the late 20th century, linguists began using software intended for biological classification to classify languages. Programs and methods became increasingly sophisticated. In the early 21st century, the Computational Phylogenetics in Historical Linguistics (CPHL) project, a consortium of historical linguists, received funding from the
National Science Foundation The National Science Foundation (NSF) is an independent agency of the United States government that supports fundamental research and education in all the non-medical fields of science and engineering. Its medical counterpart is the National ...
to study phylogenies. The Indo-European family is a major topic of study. As of January, 2012, they had collected and coded a "screened" database of "22 phonological characters, 13 morphological characters, and 259 lexical characters," and an unscreened database of more. Wordlists of 24 Indo-European languages are included. Larger numbers of features and languages increase the precision, provided they meet certain criteria. Using specialized computer software, they test various phylogenetic hypotheses for their ability to account for the characters by genetic descent.


Limitations of the model

One endemic limitation of the tree model is the very founding presumption on which it is based: it requires a classification based on languages or, more generally, on
language varieties In sociolinguistics, a variety, also called an isolect or lect, is a specific form of a language or language cluster. This may include languages, dialects, registers, styles, or other forms of language, as well as a standard variety.Meecham, Ma ...
. Since a variety represents an abstraction from the totality of linguistic features, there is the possibility for information loss during the translation of data (from a map of isoglosses) into a tree. For example, there is the issue of
dialect continua A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated variet ...
. They provide varieties that are not unequivocally one language or another but contain features characteristic of more than one. The issue of how they are to be classified is similar to the issue presented by
ring species In biology, a ring species is a connected series of neighbouring populations, each of which interbreeds with closely sited related populations, but for which there exist at least two "end" populations in the series, which are too distantly relate ...
to the concept of
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
classification in biology. The limitations of the tree model, in particular its inability to handle the non-discrete distribution of shared innovations in
dialect continua A dialect continuum or dialect chain is a series of language varieties spoken across some geographical area such that neighboring varieties are mutually intelligible, but the differences accumulate over distance so that widely separated variet ...
, have been addressed through the development of non-cladistic (non-tree-based) methodologies. They include the
Wave model In historical linguistics, the wave model or wave theory (German ''Wellentheorie'') is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, affecting ...
; and more recently, the concept of linkage.See ; Heggarty ''et al.'' (2010); François (2014). An additional limitation of the tree model involves mixed and hybrid languages, as well as language mixing in general since the tree model allows only for divergences. For example, according to Zuckermann (2009:63), "Israeli", his term for
Modern Hebrew Modern Hebrew ( he, עברית חדשה, ''ʿivrít ḥadašá ', , '' lit.'' "Modern Hebrew" or "New Hebrew"), also known as Israeli Hebrew or Israeli, and generally referred to by speakers simply as Hebrew ( ), is the standard form of the H ...
, which he regards as a Semito-European hybrid, "demonstrates that the reality of linguistic genesis is far more complex than a simple family tree system allows. 'Revived' languages are unlikely to have a single parent."


Perfect phylogenies

The purpose of phylogenetic software is to generate
cladogram A cladogram (from Greek ''clados'' "branch" and ''gramma'' "character") is a diagram used in cladistics to show relations among organisms. A cladogram is not, however, an evolutionary tree because it does not show how ancestors are related to ...
s, a special kind of tree in which the links only bifurcate; that is, at any node in the same direction only two branches are offered. The input data is a set of characters that can be assigned states in different languages, such as present (1) or absent (0). A language therefore can be described by a unique coordinate set consisting of the state values for all of the characters considered. These coordinates can be like each other or less so. Languages that share the most states are most like each other. The software massages all the states of all the characters of all the languages by one of several mathematical methods to accomplish a pairwise comparison of each language with all the rest. It then constructs a cladogram based on degrees of similarity; for example, hypothetical languages, a and b, which are closest only to each other, are assumed to have a common ancestor, a-b. The next closest language, c, is assumed to have a common ancestor with a-b, and so on. The result is a projected series of historical paths leading from the overall common ancestor (the root) to the languages (the leaves). Each path is unique. There are no links between paths. Every leaf and node have one and only one ancestor. All the states are accounted for by descent from other states. A cladogram that conforms to these requirements is a perfect phylogeny. At first there seemed to be little consistency of results in trials varying the factors presumed to be relevant. A new cladogram resulted from any change, which suggested that the method was not capturing the underlying evolution of languages but only reflecting the extemporaneous judgements of the researchers. In order to find the factors that did bear on phylogeny the researchers needed to have some measure of the accuracy of their results; i.e., the results needed to be calibrated against known phylogenies. They ran the experiment using different assumptions looking for the ones that would produce the closest matches to the most secure Indo-European phylogenies. Those assumptions could be used on problem areas of the Indo-European phylogeny with greater confidence. To obtain a reasonably valid phylogeny, the researchers found they needed to enter as input all three types of characters: phonological, lexical and morphological, which were all required to present a picture that was sufficiently detailed for calculation of phylogeny. Only qualitative characters produced meaningful results. Repeated states were too ambiguous to be correctly interpreted by the software; therefore characters that were subject to
back formation In etymology, back-formation is the process or result of creating a new word via inflection, typically by removing or substituting actual or supposed affixes from a lexical item, in a way that expands the number of lexemes associated with the ...
and parallel development, which reverted a character to a prior state or adopted a state that evolved in another character, respectively, were screened from the input dataset.


Perfect phylogenetic networks

Despite their care to code the best qualitative characters in sufficient numbers, the researchers could obtain no perfect phylogenies for some groups, such as Germanic and Albanian within Indo-European. They reasoned that a significant number of characters, which could not be explained by genetic descent from the group's calculated ancestor, were borrowed. Presumably, if the
wave model In historical linguistics, the wave model or wave theory (German ''Wellentheorie'') is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, affecting ...
, which explained borrowing, were a complete explanation of the group's characters, no phylogeny at all could be found for it. If both models were partially effective, then a tree would exist, but it would need to be supplemented by non-genetic explanations. The researchers therefore modified the software and method to include the possibility of borrowing. The researchers introduced into the experiment the concept of the interface, or allowed boundary over which character states would flow. A one-way interface, or edge, existed between a parent and a child. If only one-way edges were sufficient to explain the presence of all the states in a language, then there was no need to look beyond the perfect phylogeny. If not, then one or more contact edges, or bidirectional interfaces, could be added to the phylogeny. A language therefore might have more than one source of states: the parent or a contact language. A tree so modified was no longer a tree as such: there could be more than one path from root to leaf. The researchers called this arrangement a network. The states of a character still evolved along a unique path from root to leaf, but its origin could be either the root under consideration or a contact language. If all the states of the experiment could be accounted for by the network, it was termed a perfect phylogenetic network.


Compatibility and feasibility

The generation of networks required two phases. In the first phase, the researchers devised a number of phylogenies, called candidate trees, to be tested for compatibility. A character is compatible when its origin is explained by the phylogeny generated. In a perfect phylogeny, all the characters are compatible and the compatibility of the tree is 100%. By the principle of parsimony, or Occam's razor, no networks are warranted. Candidate trees were obtained by first running the phylogeny-generation software using the Indo-European dataset (the strings of character states) as input, then modifying the resultant tree into other hypotheses to be tested. None of the original candidate trees were perfect phylogenies, although some of the subtrees within them were. The next phase was to generate networks from the trees of highest compatibility scores by adding interfaces one at a time, selecting the interface of highest compatibility, until sufficiency was obtained; that is, the compatibility of the network was highest. As it turned out, the number of compatible networks generated might vary from none to over a dozen. However, not all the possible interfaces were historically feasible. Interfaces between some languages were geographically and chronologically not very likely. Inspecting the results, the researchers excluded the non-feasible interfaces until a list of only feasible networks remained, which could be arranged in order of compatibility score.


Most feasible network for Indo-European

The researchers began with five candidate trees for Indo-European, lettered A-E, one generated from the phylogenetic software, two modifications of it and two suggested by Craig Melchert, a historical linguist and Indo-Europeanist. The trees differed mainly in the placement of the most ambiguous group, the Germanic languages, and Albanian, which did not have enough distinctive characters to place it exactly. Tree A contained 14 incompatible characters; B, 19; C, 17; D, 21; E,18. Trees A and C had the best compatibility scores. The incompatibilities were all lexical, and A's were a subset of C's. Subsequent generation of networks found that all incompatibilities could be resolved with a minimum of three contact edges except for Tree E. As it did not have a high compatibility, it was excluded. Tree A had 16 possible networks, which a feasibility inspection reduced to three. Tree C had one network, but as it required an interface to Baltic and not Slavic, it was not feasible. Tree A, the most compatible and feasible tree, hypothesizes seven groups separating from Proto-Indo-European between about 4000 BC and 2250 BC, as follows. * The first to separate was Anatolian, about 4000 BC. * Tocharian followed at about 3500 BC. * Shortly thereafter, about 3250, Proto-Italo-Celtic (western Indo-European) separated, becoming Proto-Italic and Proto-Celtic at about 2500 BC. * At about 3000, Proto-Albano-Germanic separated, becoming Albanian and Proto-Germanic at about 2000. * At about 3000 Proto-Greco-Armenian (southern Indo-European) divided, becoming Proto-Greek and Proto-Armenian at about 1800. * Balto-Slavic appeared about 2500, dividing into Proto-Baltic and Proto-Slavic at about 1000. * Finally, Proto-Indo-European became Proto-Indo-Iranian (eastern Indo-European) at about 2250. Trees B and E offer the alternative of Proto-Germano-Balto-Slavic (northern Indo-European), making Albanian an independent branch. The only date for which authors vouch is the last, based on the continuity of the
Yamna culture The Yamnaya culture or the Yamna culture (russian: Ямная культура, ua, Ямна культура lit. 'culture of pits'), also known as the Pit Grave culture or Ochre Grave culture, was a late Copper Age to early Bronze Age archa ...
, the Andronovo Culture and known Indo-Aryan speaking cultures. All others are described as "dead reckoning.". Given the phylogeny of best compatibility, A, three contact edges are required to complete the compatibility. This is group of edges with the fewest borrowing events: * First, an edge between Proto-Italic and Proto-Germanic, which must have begun after 2000, according to the dating scheme given. * A second contact edge was between Proto-Italic and Proto-Greco-Armenian, which must have begun after 2500. * The third contact edge is between Proto-Germanic and Proto-Baltic, which must have begun after 1000. Tree A with the edges described above is described by the authors as "our best PPN.". In all PPNs, it is clear that although the initial daughter languages became distinct in relative isolation, the later evolution of the groups can be explained only by evolution in proximity to other languages with which an exchange takes place by the wave model.


See also

*
Comparative method In linguistics, the comparative method is a technique for studying the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor and then extrapolating backwards t ...
*
Evolutionary linguistics Evolutionary linguistics or Darwinian linguistics is a sociobiological approach to the study of language. Evolutionary linguists consider linguistics as a subfield of sociobiology and evolutionary psychology. The approach is also closely linked ...
* Genetic relationship (linguistics) *
Indo-European studies Indo-European studies is a field of linguistics and an interdisciplinary field of study dealing with Indo-European languages, both current and extinct. The goal of those engaged in these studies is to amass information about the hypothetical pro ...
*
Language family A language family is a group of languages related through descent from a common ''ancestral language'' or ''parental language'', called the proto-language of that family. The term "family" reflects the tree model of language origination in h ...
* Linkage (linguistics) *
Wave model (linguistics) In historical linguistics, the wave model or wave theory (German ''Wellentheorie'') is a model of language change in which a new language feature (innovation) or a new combination of language features spreads from its region of origin, affecting ...
*
Father Tongue hypothesis The Father Tongue hypothesis proposes that humans tend to speak their father's language. It is based on the discovery, in 1997, of a closer correlation between language and Y-chromosomal variation than between language and mitochondrial DNA varia ...


Notes


Bibliography

* * . * . * * *


External links

* * {{cite web , last1=Santorini , first1=Beatrice , first2=Anthony , last2=Kroch , date=2007 , title=Node Relations , work=The syntax of natural language: An online introduction using the Trees program , url=http://www.ling.upenn.edu/~beatrice/syntax-textbook/box-nodes.html , publisher=University of Pennsylvania Historical linguistics