HOME

TheInfoList



OR:

The Swadesh list ("Swadesh" is pronounced ) is a classic compilation of tentatively universal concepts for the purposes of lexicostatistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness of those languages. The Swadesh list is named after linguist
Morris Swadesh Morris Swadesh (; January 22, 1909 – July 20, 1967) was an American linguist who specialized in comparative and historical linguistics. Swadesh was born in Massachusetts to Bessarabian Jewish immigrant parents. He completed bachelor's and ma ...
. It is used in lexicostatistics (the quantitative assessment of the genealogical relatedness of languages) and
glottochronology Glottochronology (from Attic Greek γλῶττα ''tongue, language'' and χρόνος ''time'') is the part of lexicostatistics which involves comparative linguistics and deals with the chronological relationship between languages.Sheila Embleton ...
(the dating of language divergence). Because there are several different lists, some authors also refer to "Swadesh lists".


Versions and authors

Morris Swadesh Morris Swadesh (; January 22, 1909 – July 20, 1967) was an American linguist who specialized in comparative and historical linguistics. Swadesh was born in Massachusetts to Bessarabian Jewish immigrant parents. He completed bachelor's and ma ...
himself created several versions of his list. He started with a list of 215 meanings (falsely introduced as a list of 225 meanings in the paper due to a spelling error), which he reduced to 165 words for the
Salish-Spokane-Kalispel language The Salish or Séliš language , also known as Kalispel–Pend d'oreille, Kalispel–Spokane–Flathead, or Montana Salish to distinguish it from other Salishan languages, is a Salishan language spoken (as of 2005) by about 64 elders of the Fla ...
. In 1952, he published a list of 215 meanings,Swadesh 1952: 456–
PDF
/ref> of which he suggested the removal of 16 for being unclear or not
universal Universal is the adjective for universe. Universal may also refer to: Companies * NBCUniversal, a media and entertainment company ** Universal Animation Studios, an American Animation studio, and a subsidiary of NBCUniversal ** Universal TV, a t ...
, with one added to arrive at 200 words. In 1955, he wrote, "The only solution appears to be a drastic weeding out of the list, in the realization that quality is at least as important as quantity. Even the new list has defects, but they are relatively mild and few in number." After minor corrections, the final 100-word list was published posthumously in 1971Swadesh 1971: 283 and 1972. Other versions of lexicostatistical test lists were published e.g. by
Robert Lees Robert Lees (July 10, 1912 – June 13, 2004) was an American television and film screenwriter. Lees was best known for writing comedy, including several Abbott and Costello films. Life and career Born in San Francisco, California, Lees g ...
(1953), John A. Rea (1958:145f),
Dell Hymes Dell Hathaway Hymes (June 7, 1927 in Portland, Oregon – November 13, 2009 in Charlottesville, Virginia) was a linguist, sociolinguist, anthropologist, and folklorist who established disciplinary foundations for the comparative, ethnographic st ...
(1960:6), E. Cross (1964 with 241 concepts), W. J. Samarin (1967:220f), D. Wilson (1969 with 57 meanings), Lionel Bender (1969), R. L. Oswald (1971), Winfred P. Lehmann (1984:35f), D. Ringe (1992, passim, different versions),
Sergei Starostin Sergei Anatolyevich Starostin (russian: Серге́й Анато́льевич Ста́ростин; March 24, 1953 – September 30, 2005) was a Russian historical linguist and philologist, perhaps best known for his reconstructions of hypothet ...
(1984, passim, different versions), William S-Y. Wang (1994), M. Lohr (2000, 128 meanings in 18 languages). B. Kessler (2002), and many others. The
Concepticon Concepticon is an open-source online lexical database of linguistic concept lists (word lists). It links concept labels (i.e., word list glosses) in concept lists (i.e., word lists) to concept sets (i.e., standardized word meanings). It is par ...
, a project hosted at the Cross-Linguistic Linked Data (CLLD) project, collects various concept lists (including classical Swadesh lists) across different linguistic areas and times, currently listing 240 different concept lists. Frequently used and widely available on the internet, is the version by Isidore Dyen (1992, 200 meanings of 95 language variants). Since 2010, a team around Michael Dunn has tried to update and enhance that list.


Principle

In origin, the words in the Swadesh lists were chosen for their universal, culturally independent availability in as many languages as possible, regardless of their "stability". Nevertheless, the stability of the resulting list of "universal" vocabulary under language change and the potential use of this fact for purposes of
glottochronology Glottochronology (from Attic Greek γλῶττα ''tongue, language'' and χρόνος ''time'') is the part of lexicostatistics which involves comparative linguistics and deals with the chronological relationship between languages.Sheila Embleton ...
have been analyzed by numerous authors, including Marisa Lohr 1999, 2000. The Swadesh list was put together by Morris Swadesh on the basis of his intuition. Similar more recent lists, such as the Dolgopolsky list (1964) or the Leipzig–Jakarta list (2009), are based on systematic data from many different languages, but they are not yet as widely known nor as widely used as the Swadesh list.


Usage in lexicostatistics and glottochronology

Lexicostatistical test lists are used in lexicostatistics to define subgroupings of languages, and in
glottochronology Glottochronology (from Attic Greek γλῶττα ''tongue, language'' and χρόνος ''time'') is the part of lexicostatistics which involves comparative linguistics and deals with the chronological relationship between languages.Sheila Embleton ...
to "provide dates for branching points in the tree". The task of defining (and counting the number) of cognate words in the list is far from trivial, and often is subject to dispute, because cognates do not necessarily look similar, and recognition of cognates presupposes knowledge of the
sound law A sound change, in historical linguistics, is a change in the pronunciation of a language. A sound change can involve the replacement of one speech sound (or, more generally, one phonetic feature value) by a different one (called phonetic cha ...
s of the respective languages.


Swadesh 100 original final list

Swadesh's final list, published in 1971, contains 100 terms. Explanations of the terms can be found in Swadesh 1952 or, where noted by a dagger (), in Swadesh 1955. Note that only this original sequence clarifies the correct meaning which is lost in an alphabetical order, e.g., in the case "27. bark" (originally without the specification here added). "Claw" was only added in 1955, but again replaced by many well-known specialists with (finger)nail, because expressions for "claw" are not available in many old, extinct, or lesser known languages. The 110-item
Global Lexicostatistical Database The Evolution of Human Languages (EHL) project is a historical-comparative linguistics research project hosted by the Santa Fe Institute. It aims to provide a detailed genealogical classification of the world's languages. The project was founded ...
list uses the original 100-item Swadesh list, in addition to 10 other words from the Swadesh–Yakhontov list.


Swadesh 207 list

The most used list nowadays is the Swadesh 207-word list, adapted from Swadesh 1952. In Wiktionary (" Swadesh lists by language"), Panlex and in Palisto's "Swadesh Word List of Indo-European languages", hundreds of Swadesh lists in this form can be found.


Shorter lists

The Swadesh–Yakhontov list is a 35-word subset of the Swadesh list posited as especially stable by Russian linguist Sergei Yakhontov around the 1960s, although the list was only officially published in 1991. It has been used in lexicostatistics by linguists such as
Sergei Starostin Sergei Anatolyevich Starostin (russian: Серге́й Анато́льевич Ста́ростин; March 24, 1953 – September 30, 2005) was a Russian historical linguist and philologist, perhaps best known for his reconstructions of hypothet ...
. With their Swadesh numbers, they are:Starostin 1991 Holman ''et al.'' (2008) found that in identifying the relationships between
Chinese dialects Chinese, also known as Sinitic, is a branch of the Sino-Tibetan language family consisting of hundreds of local varieties, many of which are not mutually intelligible. Variation is particularly strong in the more mountainous southeast of main ...
the Swadesh–Yakhontov list was less accurate than the original Swadesh-100 list. Further they found that a different (40-word) list (also known as the
ASJP list The Automated Similarity Judgment Program (ASJP) is a collaborative project applying computational approaches to comparative linguistics using a database of word lists. The database is open access and consists of 40-item basic-vocabulary lists f ...
) was just as accurate as the Swadesh-100 list. However, they calculated the relative stability of the words by comparing retentions between languages in established language families. They found no statistically significant difference in the correlations in the families of the Old versus the New World. The ranked Swadesh-100 list, with Swadesh numbers and relative stability, is as follows (Holman ''et al.,'' Appendix. Asterisked words appear on the 40-word list):


Sign languages

In studying the sign languages of Vietnam and
Thailand Thailand ( ), historically known as Siam () and officially the Kingdom of Thailand, is a country in Southeast Asia, located at the centre of the Indochinese Peninsula, spanning , with a population of almost 70 million. The country is b ...
, linguist James Woodward noted that the traditional Swadesh list applied to spoken languages was unsuited for
sign languages Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign l ...
. The Swadesh list results in overestimation of the relationships between sign languages, due to indexical signs such as pronouns and parts of the body. The modified list is as follows, in largely alphabetical order:


See also

* Other lists ** A General Service List of English Words — roughly 2,000 of the most common English words ** Dolgopolsky list — the 15 words that change least as languages evolve ** Leipzig–Jakarta list — 100 words resistant to borrowing, used to estimate chronological separation of languages, intended to improve on the Swadesh list ** Th
Appendix of Swadesh lists
in Wiktionary * Projects and databases **
Automated Similarity Judgment Program The Automated Similarity Judgment Program (ASJP) is a collaborative project applying computational approaches to comparative linguistics using a database of word lists. The database is open access and consists of 40-item basic-vocabulary lists ...
— a project applying computational approaches to comparative linguistics using a database of word lists ** Evolution of Human Languages — a project to provide a genealogical classification of the world's languages **
Intercontinental Dictionary Series The Intercontinental Dictionary Series (commonly abbreviated as IDS) is a large database of topical vocabulary lists in various world languages. The general editor of the database is Bernard Comrie of the Max Planck Institute for Evolutionary A ...
— a database of vocabulary lists in over 200 languages, especially indigenous South American and Northeast Caucasian * Linguistic concepts and fields **
Cognate In historical linguistics, cognates or lexical cognates are sets of words in different languages that have been inherited in direct descent from an etymological ancestor in a common parent language. Because language change can have radical ef ...
— a word derived from the same word as another **
Historical linguistics Historical linguistics, also termed diachronic linguistics, is the scientific study of language change over time. Principal concerns of historical linguistics include: # to describe and account for observed changes in particular languages # ...
— the study of language change over time **
Indo-European studies Indo-European studies is a field of linguistics and an interdisciplinary field of study dealing with Indo-European languages, both current and extinct. The goal of those engaged in these studies is to amass information about the hypothetical p ...
— the study of Indo-European languages and their hypothetical common ancestor, Proto-Indo-European **
Proto-language In the tree model of historical linguistics, a proto-language is a postulated ancestral language from which a number of attested languages are believed to have descended by evolution, forming a language family. Proto-languages are usually unattes ...
— a postulated ancestral language from which a family of languages is presumed to have evolved * Methods of language reconstruction **
Comparative method In linguistics, the comparative method is a technique for studying the development of languages by performing a feature-by-feature comparison of two or more languages with common descent from a shared ancestor and then extrapolating backwards t ...
— feature-by-feature comparison of related languages to reconstruct their development and common ancestor ** Mass lexical comparison — a controversial method, seen as a rival to the comparative method, to determine the relatedness of languages **
Internal reconstruction Internal reconstruction is a method of reconstructing an earlier state in a language's history using only language-internal evidence of the language in question. The comparative method compares variations between languages, such as in sets of c ...
— reconstruction of an earlier state of a language without comparing it to other languages * Other ** Basic English — a simplified form of English for communication and learning


Notes


References

* Campbell, Lyle. (1998). ''Historical Linguistics: An Introduction''. Edinburgh: Edinburgh University Press. . *Embleton, Sheila (1995). Review of ''An Indo-European Classification: A Lexicostatistical Experiment'' by Isidore Dyen, J.B. Kruskal and P.Black. TAPS Monograph 82–5, Philadelphia. in ''Diachronica'' Vol. 12, no. 2, 263–68. * Gudschinsky, Sarah. (1956). "The ABCs of Lexicostatistics (Glottochronology)." ''Word'', Vol. 12, 175–210. * Hoijer, Harry. (1956). "Lexicostatistics: A Critique." ''Language'', Vol. 32, 49–60. * Holm, Hans J. (2007). "The New Arboretum of Indo-European 'Trees': Can New Algorithms Reveal the Phylogeny and Even Prehistory of Indo-European?" ''Journal of Quantitative Linguistics'', Vol. 14, 167–214. *Holman, Eric W., Søren Wichmann, Cecil H. Brown, Viveka Velupillai, André Müller, Dik Bakker (2008). "Explorations in Automated Language Classification". ''Folia Linguistica'', Vol. 42, no. 2, 331–354 *Sankoff, David (1970). "On the Rate of Replacement of Word-Meaning Relationships." ''Language'', Vol. 46, 564–569. *Starostin, Sergei (1991). ''Altajskaja Problema i Proisxozhdenie Japonskogo Jazyka'' he Altaic Problem and the Origin of the Japanese Language Moscow: Nauka * Swadesh, Morris. (1950). "Salish Internal Relationships." ''International Journal of American Linguistics'', Vol. 16, 157–167. * Swadesh, Morris. (1952). "Lexicostatistic Dating of Prehistoric Ethnic Contacts." ''Proceedings of the American Philosophical Society'', Vol. 96, 452–463. * Swadesh, Morris. (1955). "Towards Greater Accuracy in Lexicostatistic Dating." ''
International Journal of American Linguistics The ''International Journal of American Linguistics'' (''IJAL'') is an academic journal devoted to the study of the indigenous languages of the Americas. ''IJAL'' focuses on the investigation of linguistic data and the presentation of grammatical ...
'', Vol. 21, 121–137. * Swadesh, Morris. (1971). ''The Origin and Diversification of Language''. Ed. ''post mortem'' by Joel Sherzer. Chicago: Aldine. . Contains final 100-word list on p. 283. * Swadesh, Morris, et al. (1972). "What is Glottochronology?" in Morris Swadesh and Joel Sherzer, ed., ''The Origin and Diversification of Language,'' pp. 271–284. London: Routledge & Kegan Paul. . *Wittmann, Henri (1973). "The Lexicostatistical Classification of the French-Based Creole Languages." ''Lexicostatistics in Genetic Linguistics: Proceedings of the Yale Conference, April 3–4, 1971'', dir. Isidore Dyen, 89–99. La Haye: Mouto


External links


Lexico-semantic universals: A critical overview

Rosetta project



Illustrated linguistic and etymology blog by Stephan Steinbach
{{Long-range comparative linguistics Historical linguistics Comparative linguistics Quantitative linguistics Word lists Linguistics lists 1950 documents