Language Documentation Tools And Methods
   HOME
*





Language Documentation Tools And Methods
The field of language documentation in the modern context involves a complex and ever-evolving set of tools and methods, and the study and development of their use - and, especially, identification and promotion of best practices - can be considered a sub-field of language documentation proper. Among these are ethical and recording principles, workflows and methods, hardware tools, and software tools. Principles and workflows Researchers in language documentation often conduct linguistic fieldwork to gather the data on which their work is based, recording audiovisual files that document language use in traditional contexts. Because the environments in which linguistic fieldwork often takes place may be logistically challenging, not every type of recording tool is necessary or ideal, and compromises must often be struck between quality, cost and usability. It is also important to envision one's complete workflow and intended outcomes; for example, if video files are made, some amoun ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Language Documentation
Language documentation (also: documentary linguistics) is a subfield of linguistics which aims to describe the grammar and use of human languages. It aims to provide a comprehensive record of the linguistic practices characteristic of a given speech community. Language documentation seeks to create as thorough a record as possible of the speech community for both posterity and language revitalization. This record can be public or private depending on the needs of the community and the purpose of the documentation. In practice, language documentation can range from solo linguistic anthropological fieldwork to the creation of vast online archives that contain dozens of different languages, such as FirstVoices or OLAC. Language documentation provides a firmer foundation for linguistic analysis in that it creates a corpus of materials in the language. The materials in question can range from vocabulary lists and grammar rules to children's books and translated works. These materials can ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Dallas
Dallas () is the List of municipalities in Texas, third largest city in Texas and the largest city in the Dallas–Fort Worth metroplex, the List of metropolitan statistical areas, fourth-largest metropolitan area in the United States at 7.5 million people. It is the largest city in and County seat, seat of Dallas County, Texas, Dallas County with portions extending into Collin County, Texas, Collin, Denton County, Texas, Denton, Kaufman County, Texas, Kaufman and Rockwall County, Texas, Rockwall counties. With a 2020 United States census, 2020 census population of 1,304,379, it is the List of United States cities by population, ninth most-populous city in the U.S. and the List of cities in Texas by population, third-largest in Texas after Houston and San Antonio. Located in the North Texas region, the city of Dallas is the main core of the largest metropolitan area in the Southern United States and the largest inland metropolitan area in the U.S. that lacks any navigable link ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Dreaming Of Words
''Dreaming of Words'' is a 2021 Indian documentary film directed and produced by Nandan. ''Dreaming of Words'' has received numerous accolades including National Film Award for Best Educational/Motivational/Instructional Film (2020) awarded to Nandan as both director and producer at the 68th National Film Awards. The documentary traces the life and work of Njattyela Sreedharan, a fourth standard drop-out, who compiles a dictionary connecting four major Dravidian languages. Synopsis Njattyela Sreedharan, a fourth standard drop-out, compiles a dictionary connecting four major Dravidian languages. Travelling across four states and doing extensive research, he spent twenty five years making the multilingual dictionary. This unique dictionary offers a comparative study of Malayalam, Kannada, Tamil and Telugu. ''Dreaming of Words'' traces Sreedharan's life, work, love for languages and the struggles to get the dictionary published. The film also explores the linguistic and cultura ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Sign Language
Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign languages are full-fledged natural languages with their own grammar and lexicon. Sign languages are not universal and are usually not mutually intelligible, although there are also similarities among different sign languages. Linguists consider both spoken and signed communication to be types of natural language, meaning that both emerged through an abstract, protracted aging process and evolved over time without meticulous planning. Sign language should not be confused with body language, a type of nonverbal communication. Wherever communities of deaf people exist, sign languages have developed as useful means of communication and form the core of local Deaf cultures. Although signing is used primarily by the deaf and hard of hearing, ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Spoken Language
A spoken language is a language produced by articulate sounds or (depending on one's definition) manual gestures, as opposed to a written language. An oral language or vocal language is a language produced with the vocal tract in contrast with a sign language, which is produced with the body and hands. Definition The term "spoken language" is sometimes used to mean only oral languages, especially by linguists, excluding sign languages and making the terms 'spoken', 'oral', 'vocal language' synonymous. Others refer to sign language as "spoken", especially in contrast to written transcriptions of signs. Context In spoken language, much of a speaker's meaning is determined by the context. That contrasts with written language in which more of the meaning is provided directly by the text. In spoken language, the truth of a proposition is determined by common-sense reference to experience, but in written language, a greater emphasis is placed on logical and coherent argument. Similarly, ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

FLOSS
Free and open-source software (FOSS) is a term used to refer to groups of software consisting of both free software and open-source software where anyone is freely licensed to use, copy, study, and change the software in any way, and the source code is openly shared so that people are encouraged to voluntarily improve the design of the software. This is in contrast to proprietary software, where the software is under restrictive copyright licensing and the source code is usually hidden from the users. FOSS maintains the software user's civil liberty rights (see the Four Essential Freedoms, below). Other benefits of using FOSS can include decreased software costs, increased security and stability (especially in regard to malware), protecting privacy, education, and giving users more control over their own hardware. Free and open-source operating systems such as Linux and descendants of BSD are widely utilized today, powering millions of servers, desktops, smartphones (e.g., A ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Lingua Libre
Lingua Libre is an online collaborative project and tool by the Wikimedia France association, which aims to build a collaborative, multilingual, audiovisual corpus under free license. Description Lingua Libre enables to record words, phrases or sentences of any language, oral (audio recording) or signed (video recording). Words are presented to the speaker in the form of a list, created on the spot or in advance, or reusing an existing Wikimedia category. The speaker simply reads the word displayed on the screen, and the software moves on to the next word when it detects a silence after the read word. This principle, borrowed from the open source software Shtooka recorder with the help of its creator, Nicolas Vion, makes it possible to record several hundreds of words per hour. The recordings are then uploaded automatically from the web client to the Wikimedia Commons media library. In spring 2021, Lingua Libre was offline due to a fire in Strasbourg, but no audio recording ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


HTK (software)
HTK (Hidden Markov Model Toolkit) is a proprietary software toolkit for handling HMMs. It is mainly intended for speech recognition, but has been used in many other pattern recognition applications that employ HMMs, including speech synthesis, character recognition and DNA sequencing. Originally developed at the Machine Intelligence Laboratory (formerly known as the Speech Vision and Robotics Group) of the Cambridge University Engineering Department (CUED), HTK is now being widely used among researchers who are working on HMMs. See also * List of speech recognition software ReferencesHTK Page in Cambridge University External linksusing the TIMIT TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects. Each transcribed element has been delineated in time. TIMIT was designed to further acoustic-phonetic knowledge and au ... speech corpussource code Speech recognition software {{science-software-stub ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

ESpeakNG
eSpeakNG is a free and open-source, cross-platform, compact, software speech synthesizer. It uses a formant synthesis method, providing many languages in a relatively small file size. Much of the programming for eSpeakNG's language support is implemented using rule files with feedback from native speakers. Because of its small size and many languages, it is included in NVDA open source screen reader for Windows, as well as Android, Ubuntu and other Linux distributions. Its predecessor eSpeak was recommended by Microsoft in 2016 and was used by Google Translate for 27 languages in 2010; 17 of these were subsequently replaced by proprietary voices. The quality of the language voices varies greatly. In eSpeakNG's predecessor eSpeak, the initial versions of some languages were based on information found on Wikipedia. Some languages have had more work or feedback from native speakers than others. Most of the people who have helped to improve the various languages are blind users of ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Nijmegen
Nijmegen (;; Spanish and it, Nimega. Nijmeegs: ''Nimwèège'' ) is the largest city in the Dutch province of Gelderland and tenth largest of the Netherlands as a whole, located on the Waal river close to the German border. It is about 60 km south east of Utrecht and 50 km north east of Eindhoven. Nijmegen is the oldest city in the Netherlands, the second to be recognized as such in Roman times, and in 2005 celebrated 2,000 years of existence. Nijmegen became a free imperial city in 1230 and in 1402 a Hanseatic city. Since 1923 it has been a university city with the opening of a Catholic institution now known as the Radboud University Nijmegen. The city is well known for the International Four Days Marches Nijmegen event. Its population in 2022 was 179,000; the municipality is part of the Arnhem–Nijmegen metropolitan area, with 736,107 inhabitants in 2011. Population centres The municipality is formed by the city of Nijmegen, incorporating the former villages of Ha ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Max Planck Institute For Psycholinguistics
The Max Planck Institute for Psycholinguistics (German: ''Max-Planck-Institut für Psycholinguistik''; Dutch: ''Max Planck Instituut voor Psycholinguïstiek'') is a research institute situated on the campus of Radboud University Nijmegen located in Nijmegen, Gelderland, the Netherlands. The institute was founded in 1980 by Pim Levelt, and is particular for being entirely dedicated to psycholinguistics, and is also one of the few institutes of the Max Planck Society to be located outside Germany. The Nijmegen-based institute currently occupies 5th position in the Ranking Web of World Research Centers among all Max Planck institutes (7th by size, 4th by visibility). It currently employs about 235 people. Research The institute specializes in language comprehension, language production, language acquisition, language and genetics, and the relation between language and cognition. Its mission is to undertake basic research into the psychological, social and biological foundations of la ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]