Linguistic Data Consortium
   HOME
*





Linguistic Data Consortium
The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for linguistics research and development purposes. The University of Pennsylvania is the LDC's host institution. The LDC was founded in 1992 with a grant from the US Defense Advanced Research Projects Agency (DARPA), and is partly supported by grant IRI-9528587 from the Information and Intelligent Systems division of the National Science Foundation. The director of LDC is Mark Liberman and the executive director is Christopher Cieri. See also *Corpus linguistics *Cross-Linguistic Linked Data (CLLD) – project coordinating over a dozen linguistics databases; hosted by the Max Planck Institute (Germany) *European Language Resources Association (ELRA) – a Luxembourg- and France-based institute with a mission similar to LDC's * Language Grid – a platform for language resour ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Consortium
A consortium (plural: consortia) is an association of two or more individuals, companies, organizations or governments (or any combination of these entities) with the objective of participating in a common activity or pooling their resources for achieving a common goal. is a Latin word meaning "partnership", "association" or "society", and derives from ("shared in property"), itself from ("together") and ("fate"). Examples Educational The Big Ten Academic Alliance in the Midwest and Mid-Atlantic U.S., Claremont Colleges consortium in Southern California, Five College Consortium in Massachusetts, and Consórcio Nacional Honda are among the oldest and most successful higher education consortia in the World. The Big Ten Academic Alliance, formerly known as the Committee on Institutional Cooperation, includes the members of the Big Ten athletic conference. The participants in Five Colleges, Inc. are: Amherst College, Hampshire College, Mount Holyoke College, Smith College, a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


European Language Resources Association
The European Language Resources Association (ELRA) is a not-for-profit organisation established under the law of the Grand Duchy of Luxembourg. Its seat is in Luxembourg and its headquarters is in Paris, France. Activities Since its founding in 1995, the European Language Resources Association has been a conduit for the distribution of speech, written, and terminology language resources (LRs) for human language technology (HLT), a key component of information society technologies (IST) In order to do so, a number of technical and logistic, commercial (prices, fees, royalties), legal (licensing, intellectual property rights, management), and information dissemination issues had to be addressed. ELRA broadening its objectives and responsibilities towards the HLT community over the years, and is now also involved in the production, or commissioning of the production, of language resources through a number of initiatives, and actively committed to the evaluation of language-engineeri ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Consortia In The United States
A consortium (plural: consortia) is an association of two or more individuals, companies, organizations or governments (or any combination of these entities) with the objective of participating in a common activity or pooling their resources for achieving a common goal. is a Latin word meaning "partnership", "association" or "society", and derives from ("shared in property"), itself from ("together") and ("fate"). Examples Educational The Big Ten Academic Alliance in the Midwest and Mid-Atlantic U.S., Claremont Colleges consortium in Southern California, Five College Consortium in Massachusetts, and Consórcio Nacional Honda are among the oldest and most successful higher education consortia in the World. The Big Ten Academic Alliance, formerly known as the Committee on Institutional Cooperation, includes the members of the Big Ten athletic conference. The participants in Five Colleges, Inc. are: Amherst College, Hampshire College, Mount Holyoke College, Smith College, an ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Lexicography
Lexicography is the study of lexicons, and is divided into two separate academic disciplines. It is the art of compiling dictionaries. * Practical lexicography is the art or craft of compiling, writing and editing dictionaries. * Theoretical lexicography is the scholarly study of semantic, orthographic, syntagmatic and paradigmatic features of lexemes of the lexicon (vocabulary) of a language, developing theories of dictionary components and structures linking the data in dictionaries, the needs for information by users in specific types of situations, and how users may best access the data incorporated in printed and electronic dictionaries. This is sometimes referred to as 'metalexicography'. There is some disagreement on the definition of lexicology, as distinct from lexicography. Some use "lexicology" as a synonym for theoretical lexicography; others use it to mean a branch of linguistics pertaining to the inventory of words in a particular language. A person devoted ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Speech Technology
Speech technology relates to the technologies designed to duplicate and respond to the human voice. They have many uses. These include aid to the voice-disabled, the hearing-disabled, and the blind, along with communication with computers without a keyboard. They enhance game software and aid in marketing goods or services by telephone. The subject includes several subfields: * Speech synthesis * Speech recognition * Speaker recognition * Speaker verification * Speech encoding * Multimodal interaction See also * Communication aids * Language technology * Speech interface guideline * Speech processing Speech processing is the study of speech signals and the processing methods of signals. The signals are usually processed in a digital representation, so speech processing can be regarded as a special case of digital signal processing, applied t ... * ''Speech Technology'' (magazine) External links {{tech-stub Speech processing da:Taleteknologi fi:Puheteknologia th:ก ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Natural Language Processing
Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to process and analyze large amounts of natural language data. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them. The technology can then accurately extract information and insights contained in the documents as well as categorize and organize the documents themselves. Challenges in natural language processing frequently involve speech recognition, natural-language understanding, and natural-language generation. History Natural language processing has its roots in the 1950s. Already in 1950, Alan Turing published an article titled "Computing Machinery and Intelligence" which proposed what is now called the Turing test as a criterion of intelligence, t ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Machine Translation
Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation or interactive translation), is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another. On a basic level, MT performs mechanical substitution of words in one language for words in another, but that alone rarely produces a good translation because recognition of whole phrases and their closest counterparts in the target language is needed. Not all words in one language have equivalent words in another language, and many words have more than one meaning. Solving this problem with corpus statistical and neural techniques is a rapidly growing field that is leading to better translations, handling differences in linguistic typology, translation of idioms, and the isolation of anomalies. Current machine translation software often allows for customizat ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Language Grid
{{Short description, Linguistics website The Language Grid is a multilingual service platform on the Internet mainly for supporting Intercultural collaboration. It enables easy registration and sharing of language resources such as online dictionaries, bilingual corpora, and machine translations. Concept The Language Grid is developed to increase the accessibility and usability of language resources. It takes the service-oriented approach by wrapping existing language resources as atomic Web services and enables users to compose new services by combining atomic Web services. Architecture The architecture of the Language Grid is to increase the usability of language resources, and to decrease the risk to providers in opening their resources. By wrapping resources as services, providers can control their intellectual property rights. It is essential to define stakeholders, their roles and the social protocol among them. Every stakeholder related to service grids, a ''service gr ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Cross-Linguistic Linked Data
The Cross-Linguistic Linked Data (CLLD) project coordinates over a dozen linguistics databases covering the languages of the world. It is hosted by the Department of Linguistic and Cultural Evolution at the Max Planck Institute for Evolutionary Anthropology in Leipzig, Germany (previously at the Max Planck Institute for the Science of Human History in Jena). Databases and projects *''Glottolog'' *World Atlas of Language Structures (WALS) *World Loanword Database (WOLD) * Atlas of Pidgin and Creole Language Structures (APICS) *Automated Similarity Judgment Program (ASJP) *Intercontinental Dictionary Series (IDS) *Electronic World Atlas of Varieties of English (eWAVE) *A world-wide survey of affix borrowing (AfBo) *South American Indigenous Language Structures Online (SAILS) *PHOIBLE *Tsammalex *Comparative Siouan Dictionary (CSD) * Concepticon *Dogon languages *Database of Cross-Linguistic Colexifications *Glottobank (includes Lexibank, Grambank, Phonobank, Parabank, Numeralbank) *' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Database
In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage. The design of databases spans formal techniques and practical considerations, including data modeling, efficient data representation and storage, query languages, security and privacy of sensitive data, and distributed computing issues, including supporting concurrent access and fault tolerance. A database management system (DBMS) is the software that interacts with end users, applications, and the database itself to capture and analyze the data. The DBMS software additionally encompasses the core facilities provided to administer the database. The sum total of the database, the DBMS and the associated applications can be referred to as a database system. Often the term "database" is also used loosely to refer to any of the DBMS, the database system or an appli ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Corpus Linguistics
Corpus linguistics is the study of language, study of a language as that language is expressed in its text corpus (plural ''corpora''), its body of "real world" text. Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field—the natural context ("realia") of that language—with minimal experimental interference. The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules which govern that language. Those results can be used to explore the relationships between that subject language and other languages which have undergone a similar analysis. The first such corpora were manually derived from source texts, but now that work is automated. Corpora have not only been used for linguistics research, they have also been used to compile dictionaries (starting with ''The American Heritage Dictionary of the English Language'' in 1969) and grammar guides, such as ''A Compreh ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Mark Liberman
Mark Yoffe Liberman is an American linguist. He has a dual appointment at the University of Pennsylvania, as Trustee Professor of Phonetics in the Department of Linguistics, and as a professor in the Department of Computer and Information Sciences. He is the founder and director of the Linguistic Data Consortium. Liberman is the Faculty Director of Ware College House at the University of Pennsylvania. Early life Liberman is the son of psychologists Alvin Liberman and Isabelle Liberman. Mark Liberman attended Harvard College but did not graduate. After two years' service in the US Army in Vietnam, he enrolled in graduate school in linguistics at MIT, from which he received a Master of Science (1972) and a PhD (1975). Career From 1975 to 1990, he was a Member of Technical Staff at Bell Laboratories. Research Liberman's main research interests lie in phonetics, prosody, and other aspects of speech communication. His early research established the linguistic subfield of ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]