Lingua Libre
Lingua Libre is an online collaborative project and tool run by the Wikimedia France association, which aims to build a multilingual, audiovisual corpus under a free license. Lingua Libre lets users record words, phrases or sentences in any language, whether spoken (audio recording) or signed (video recording). Words are presented to the speaker as a list, which can be created on the spot, prepared in advance, or drawn from an existing Wikimedia category. The speaker simply reads the word displayed on the screen, and the software moves on to the next word when it detects a silence after the word has been read. This principle, borrowed from the open source software Shtooka recorder with the help of its creator, Nicolas Vion, makes it possible to record several hundred words per hour. The recordings are then uploaded automatically from the web client to the Wikimedia Commons media library. In spring 2021, Lingua Libre was offline due to a fire in Strasbourg, but no audio recording ...
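The silence-triggered advance described above can be illustrated with a small sketch. This is not Lingua Libre's or Shtooka's actual algorithm: the function name `split_on_silence`, the per-frame loudness list, and the `threshold` and `min_silence` parameters are all assumptions chosen for illustration. It simply closes a recording once enough consecutive quiet frames follow speech.

```python
from typing import List, Tuple


def split_on_silence(
    levels: List[float],
    threshold: float = 0.1,   # assumed loudness cutoff between speech and silence
    min_silence: int = 3,     # assumed number of quiet frames that ends a word
) -> List[Tuple[int, int]]:
    """Return (start, end) frame indices (end exclusive) of each spoken segment."""
    segments: List[Tuple[int, int]] = []
    start = None  # frame where the current word began; None while waiting for speech
    quiet = 0     # consecutive quiet frames seen since the last loud frame

    for i, level in enumerate(levels):
        if level >= threshold:
            if start is None:
                start = i          # speech begins
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence:
                # Enough silence: close the word at its last loud frame
                segments.append((start, i - quiet + 1))
                start = None       # a recorder would now show the next word
                quiet = 0

    if start is not None:
        # Input ended mid-word: trim any trailing quiet frames
        segments.append((start, len(levels) - quiet))
    return segments


if __name__ == "__main__":
    frames = [0.0, 0.5, 0.6, 0.0, 0.0, 0.0, 0.7, 0.8, 0.0]
    print(split_on_silence(frames))
```

In a real recorder the loudness values would come from the microphone stream, and both parameters would need tuning for the speaker and the room.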


Multilingual
Multilingualism is the use of more than one language, either by an individual speaker or by a group of speakers. It is believed that multilingual speakers outnumber monolingual speakers in the world's population. More than half of all Europeans claim to speak at least one language other than their mother tongue; but many read and write in one language. Multilingualism is advantageous for people wanting to participate in trade, globalization and cultural openness. Owing to the ease of access to information facilitated by the Internet, individuals' exposure to multiple languages has become increasingly possible. People who speak several languages are also called polyglots. Multilingual speakers have acquired and maintained at least one language during childhood, the so-called first language (L1). The first language (sometimes also referred to as the mother tongue) is usually acquired without formal education, by mechanisms about which scholars disagree. Children acquirin ...


Natural Language Processing
Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to process and analyze large amounts of natural language data. The goal is a computer capable of "understanding" the contents of documents, including the contextual nuances of the language within them. The technology can then accurately extract information and insights contained in the documents as well as categorize and organize the documents themselves. Challenges in natural language processing frequently involve speech recognition, natural-language understanding, and natural-language generation. Natural language processing has its roots in the 1950s. In 1950, Alan Turing published an article titled "Computing Machinery and Intelligence", which proposed what is now called the Turing test as a criterion of intelligence, t ...




Common Voice
Common Voice is a crowdsourcing project started by Mozilla to create a free database for speech recognition software. The project is supported by volunteers who record sample sentences with a microphone and review recordings made by other users. The transcribed sentences are collected in a voice database available under the CC0 public domain dedication. This license ensures that developers can use the database for voice-to-text applications without restrictions or costs. Common Voice aims to provide diverse voice samples. According to Mozilla's Katharina Borchert, many existing projects took datasets from public radio or otherwise had datasets that underrepresented both women and people with pronounced accents. At the beginning of 2022, Bengali.AI partnered with Common Voice to launch the "Bangla Speech Recognition" project, which aims to make machines understand the Bangla language. 2,000 hours of voice recordings were collected, with a target of more than 10,000 hours. Voice database The ...


Forvo
Forvo.com is a website that provides access to, and playback of, pronunciation sound clips in many different languages in order to facilitate language learning. Forvo.com was first envisioned in 2007 by co-founder Israel Rondón, and came to fruition in 2008. Forvo.com is owned by Forvo Media SL, based in San Sebastián, Spain. Forvo's "About" page describes the site as the largest pronunciation guide website on the Internet. Time listed it among the 50 best websites of 2013. All sound clips on Forvo.com are created by its users, who can also vote on each clip, positively or negatively, so that the highest quality sound clips have priority in the site's listings. The pronunciations are also reviewed and edited by a volunteer team of editors. Forvo has an API to share its pronunciations with other websites. The API service is paid (24 USD/year for individual use). Recommendations for adding words Forvo just encourages adding ...


Lingua Libre Atikamekw At Wikimania 2017 Montreal
Lingua (Latin, 'tongue') may refer to:
* Lingua (journal), a peer-reviewed academic journal of general linguistics
* Lingua (sculpture), by Jim Sanborn
* Lingua (play), a 17th-century play attributed to Thomas Tomkis
* Project Lingua, an online translation community
* Lingua (Indonesian vocal group), formed in 1996 and consisting of Frans Mohede, Amara, and Arie Widiawan; the group has released four albums and four singles, namely three studio albums and one compilation mini album, as well ...
See also:
* Language (disambiguation)
* Linga (disambiguation)
* Lingga (disambiguation)
* Tongue (disambiguation)
* Lingua franca, a common language
* Lingua.ly, an educational technology business


Sign Language
Sign languages (also known as signed languages) are languages that use the visual-manual modality to convey meaning, instead of spoken words. Sign languages are expressed through manual articulation in combination with non-manual markers. Sign languages are full-fledged natural languages with their own grammar and lexicon. Sign languages are not universal and are usually not mutually intelligible, although there are also similarities among different sign languages. Linguists consider both spoken and signed communication to be types of natural language, meaning that both emerged through an abstract, protracted aging process and evolved over time without meticulous planning. Sign language should not be confused with body language, a type of nonverbal communication. Wherever communities of deaf people exist, sign languages have developed as useful means of communication and form the core of local Deaf cultures. Although signing is used primarily by the deaf and hard of hearing, ...


OAuth
OAuth (short for "Open Authorization") is an open standard for access delegation, commonly used as a way for internet users to grant websites or applications access to their information on other websites but without giving them the passwords. This mechanism is used by companies such as Amazon, Google, Facebook, Microsoft, and Twitter to permit the users to share information about their accounts with third-party applications or websites. Generally, OAuth provides clients a "secure delegated access" to server resources on behalf of a resource owner. It specifies a process for resource owners to authorize third-party access to their server resources without providing credentials. Designed specifically to work with Hypertext Transfer Protocol (HTTP), OAuth essentially allows access tokens to be issued to third-party clients by an authorization server, with the approval of the resource owner. The third party then uses the access token to access the protected resources hosted by the r ...
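The delegation idea above, where a third party receives an access token instead of the owner's password, can be sketched as a toy in-memory model. This is not the OAuth protocol itself (no HTTP, redirects, scopes, or expiry); the class names `AuthorizationServer` and `ResourceServer` and their methods are hypothetical and exist only for this illustration.

```python
import secrets
from typing import Dict, Optional


class AuthorizationServer:
    """Hypothetical authorization server: issues tokens once the owner approves."""

    def __init__(self) -> None:
        self._tokens: Dict[str, str] = {}  # access token -> resource owner

    def issue_token(self, owner: str, owner_approves: bool) -> Optional[str]:
        # No approval from the resource owner means no token is issued
        if not owner_approves:
            return None
        token = secrets.token_urlsafe(16)  # random, unguessable token string
        self._tokens[token] = owner
        return token

    def owner_for(self, token: str) -> Optional[str]:
        # Returns the owner a token was issued for, or None if unknown
        return self._tokens.get(token)


class ResourceServer:
    """Hypothetical resource server: releases data only for a valid token."""

    def __init__(self, auth: AuthorizationServer, resources: Dict[str, str]) -> None:
        self._auth = auth
        self._resources = resources  # resource owner -> protected data

    def fetch(self, token: str) -> str:
        owner = self._auth.owner_for(token)
        if owner is None:
            raise PermissionError("invalid or missing access token")
        return self._resources[owner]


if __name__ == "__main__":
    auth = AuthorizationServer()
    api = ResourceServer(auth, {"alice": "alice's profile"})
    token = auth.issue_token("alice", owner_approves=True)
    # The third-party client holds only the token, never Alice's password
    print(api.fetch(token))
```

The point of the design is that the resource server checks the token alone, so the owner's credentials never reach the third party.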


Wikibase
Wikibase is a set of MediaWiki extensions for working with versioned semi-structured data in a central repository based upon JSON instead of the unstructured data of MediaWiki wikitext. Its primary components are the Wikibase Repository, an extension for storing and managing data, and the Wikibase Client, which allows for the retrieval and embedding of structured data from a Wikibase repository. Wikibase was developed for and is used by Wikidata. The data model for Wikibase consists of "entities", which include individual "items", labels or identifiers to describe them (potentially in multiple languages), and semantic statements that attribute "properties" to the item. The values of these properties may be either other items within the database or textual information. Wikibase has a JavaScript-based user interface, and provides exports of all or subsets of data in many formats. Projects using it include Wikidata, Europeana's Eagle Project, Lingua Libre, and the OpenStreetMap wiki. ...
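The item, label, and statement structure described above can be sketched with a few Python dataclasses. This is a simplified illustration, not the actual Wikibase JSON schema or API: the class names `Item`, `Statement`, and `ItemRef`, the fallback behaviour of `label`, and the identifiers `Q1`, `Q2`, `P1`, `P2` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass(frozen=True)
class ItemRef:
    """Reference to another item in the repository (hypothetical helper)."""
    item_id: str


@dataclass
class Statement:
    """Attributes a property to an item; the value is an item or plain text."""
    property_id: str
    value: Union[ItemRef, str]


@dataclass
class Item:
    item_id: str
    labels: Dict[str, str] = field(default_factory=dict)   # language code -> label
    statements: List[Statement] = field(default_factory=list)

    def label(self, lang: str, fallback: str = "en") -> str:
        # Prefer the requested language, then the fallback, then the bare identifier
        return self.labels.get(lang) or self.labels.get(fallback, self.item_id)

    def values_of(self, property_id: str) -> List[Union[ItemRef, str]]:
        # Collect every value attached to this item under the given property
        return [s.value for s in self.statements if s.property_id == property_id]


if __name__ == "__main__":
    item = Item("Q1", labels={"en": "Occitan", "fr": "occitan"})
    item.statements.append(Statement("P1", ItemRef("Q2")))  # value is another item
    item.statements.append(Statement("P2", "oc"))           # value is plain text
    print(item.label("fr"), item.values_of("P1"), item.values_of("P2"))
```

Keeping item references distinct from plain strings mirrors the distinction the text draws between properties that point to other items and properties that hold textual information.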


MediaWiki
MediaWiki is free and open-source wiki software. It is used on Wikipedia and almost all other Wikimedia websites, including Wiktionary, Wikimedia Commons and Wikidata; these sites define a large part of the requirement set for MediaWiki. It was developed for use on Wikipedia in 2002, and given the name "MediaWiki" in 2003. MediaWiki was originally developed by Magnus Manske and improved by Lee Daniel Crocker (Magnus Manske's announcement of "PHP Wikipedia", wikipedia-l, August 24, 2001). Its development has since been coordinated by the Wikimedia Foundation. MediaWiki is written in the PHP programming language and stores all text content in a database. The software is optimized to efficiently handle large projects, which can have terabytes of content and hundreds of thousands of views per second. Because Wikipedia is one of the world's largest websites, achieving scalability through multiple layers of caching and database replication has been a major concern for de ...


Occitan Language
Occitan (Occitan: occitan), also known as lenga d'òc (French: langue d'oc) by its native speakers, and sometimes also referred to as Provençal, is a Romance language spoken in Southern France, Monaco, Italy's Occitan Valleys, as well as Spain's Val d'Aran; collectively, these regions are sometimes referred to as Occitània. It is also spoken in Calabria (Southern Italy) in a linguistic enclave in the Cosenza area (mostly Guardia Piemontese). Some include Catalan in Occitan, as the linguistic distance between this language and some Occitan dialects (such as the Gascon language) is similar to the distance between different Occitan dialects. Catalan was considered a dialect of Occitan until the end of the 19th century and still today remains its closest relative. Occitan is an official language of Catalonia, where a subdialect of Gascon known as Aranese is spoken in the Val d'Aran. Since Sept ...