Common Voice
Common Voice is a crowdsourcing project started by Mozilla to create a free database for speech recognition software. The project is supported by volunteers who record sample sentences with a microphone and review the recordings of other users. The transcribed sentences will be collected in a voice database released under CC0, a public domain dedication, so that developers can use the database for voice-to-text applications without restrictions or costs.
Aims: Common Voice aims to provide diverse voice samples. According to Mozilla's Katharina Borchert, many existing projects took datasets from public radio or otherwise had datasets that underrepresented both women and people with pronounced accents.
History: At the beginning of 2022, Bengali.AI partnered with Common Voice to launch the "Bangla Speech Recognition" project, which aims to make machines understand the Bangla language. 2,000 hours of voice recordings were collected, with the aim of exceeding 10,000 hours.
Voice database: The ...


Mozilla Foundation
The Mozilla Foundation (stylized as moz://a) is an American non-profit organization that exists to support and collectively lead the open source Mozilla project. Founded in July 2003, the organization sets the policies that govern development, operates key infrastructure, and controls Mozilla trademarks and copyrights. It owns a taxable subsidiary, the Mozilla Corporation, which employs many Mozilla developers and coordinates releases of the Mozilla Firefox web browser and Mozilla Thunderbird email client. The Mozilla Foundation was founded by the Netscape-affiliated Mozilla Organization. The organization is currently based in the Silicon Valley city of Mountain View, California, United States. The Mozil ...


VentureBeat
VentureBeat is an American technology website headquartered in San Francisco, California. It publishes news, analysis, long-form features, interviews, and videos.
History: The VentureBeat company was founded in 2006 by Matt Marshall, an ex-correspondent for The Mercury News. In March 2009, VentureBeat signed a partnership agreement with IDG to produce DEMO Conference, a conference for startups to announce their launches and raise funding from venture capitalists and angel investors. In 2012, the partnership with IDG ended. In 2014 and 2015, the company raised outside investor funding from Silicon Valley venture capital firms including CrossLink Capital, Walden Venture Capital, Rally Ventures, Formation 8, and Lightbank.
Editorial: The VentureBeat website comprises a series of distinct news "Beats": Big data, Business (general news), Cloud, Deals, Dev, Enterprise, Entrepreneur, Media, Mobile, Marketing, Security, Small Biz, and Social. In addition, the ...


Breton Language
Breton is a Southwestern Brittonic language of the Celtic language family spoken in Brittany, part of modern-day France. It is the only Celtic language still widely in use on the European mainland, albeit as a member of the insular branch instead of the continental grouping. Breton was brought from Great Britain to Armorica (the ancient name for the coastal region that includes the Brittany peninsula) by migrating Britons during the Early Middle Ages, making it an Insular Celtic language. Breton is most closely related to Cornish, another Southwestern Brittonic language. Welsh and the extinct Cumbric, both Western Brittonic languages, are more distantly related. Having declined from more than one million speakers around 1950 to about 200,000 in the first decade of the 21st century, Breton is classified as "severely endangered" by the UNESCO Atlas of the World's Languages in Danger. However, the number of children attending bilingual classes rose 33 ...


Belarusian Language
Belarusian (беларуская мова, biełaruskaja mova) is an East Slavic language. It is the native language of many Belarusians and one of the two official state languages in Belarus. Additionally, it is spoken in some parts of Russia, Lithuania, Latvia, Poland, and Ukraine by Belarusian minorities in those countries. Before Belarus gained independence in 1991, the language was known in English only as Byelorussian or Belorussian, the compound term retaining the English-language name for the Russian language in its second part, or alternatively as White Russian. Following independence, it became known as Belarusan and, since 1995, as Belarusian in English. As one of the East Slavic languages, Belarusian shares many grammatical and lexical features with other members of the group. To some extent, Russian, Rusyn, Ukrainian, and Belarusian retain a degree of mutual intelligibility. Its predecessor stage is known in Western academia as R ...


Basaa
Basaa (also spelled Bassa, Basa, Bissa), or Mbene, is a Bantu language spoken in Cameroon by the Basaa people. It is spoken by about 300,000 people in the Centre and Littoral regions. Maho (2009) lists North and South Kogo as dialects.
Background and Origin: Basaa is spoken by 230,000 speakers. They live in Nyong-et-Kelle (Central Region) and Sanaga Maritime (with the exception of the Edéa commune, which has a Bakoko majority) and most of Nkam commune (Littoral Region). In the western and northern parts of this department, the peripheral Basaa dialects are spoken: Yabasi in the commune of Yabassi, Diɓuum in the commune of Nkondjok (Diboum Canton), north of Ndemli and Dimbamban. Similarly, Basaa Baduala is spoken in Wouri Department (Littoral Region), traditional Basaa territory that is being transformed by the growth of Douala. Basaa is also found in Océan Department (commune of Bipindi, Southern Region). Hijuk is spoken only in the quarter of Niki in Batanga com ...


Bashkir Language
Bashkir (Bashkir: Bashqortsa, Bashqort tele) is a Turkic language belonging to the Kipchak branch. It is co-official with Russian in Bashkortostan. It is spoken by approximately 1.4 million native speakers in Russia, as well as in Ukraine, Belarus, Kazakhstan, Uzbekistan, Estonia and other neighboring post-Soviet states, and among the Bashkir diaspora. It has three dialect groups: Southern, Eastern and Northwestern.
Speakers: Speakers of Bashkir mostly live in Bashkortostan, a republic within the Russian Federation. Many speakers also live in Tatarstan, in Chelyabinsk, Orenburg, Tyumen, Sverdlovsk and Kurgan Oblasts, and in other regions of Russia. Minor Bashkir groups also live in Kazakhstan and other countries.
Classification: Bashkir, together with Tatar, belongs to the Kipchak-Bulgar (Russian: кыпчакско-булгарская) subgroup of the Kipchak languages. They share the same vocalism and the vowel shifts (see below) that make both languages ...


Asturian Language
Asturian is a West Iberian Romance language spoken in the Principality of Asturias, Spain [Art. 1 of Ley 1/1998, de 23 de marzo, de uso y promoción del bable/asturiano (Law 1/1998, of March 23, on the Use and Promotion of the Asturian Language)]. Asturian is part of a wider linguistic group, the Asturleonese languages. The number of speakers is estimated at 100,000 (native) and 450,000 (second language). The dialects of the Astur-Leonese language family are traditionally classified in three groups: Western, Central, and Eastern. For historical and demographic reasons, the standard is based on Central Asturian. Asturian has a distinct grammar, dictionary, and orthography. It is regulated by the Academy of the Asturian Language. Although it is not an official language of Spain, it is protected under the Statute of Autonomy of Asturias and is an elective language in schools. For much of its history, th ...


Assamese Language
Assamese, also Asamiya, is an Indo-Aryan language spoken mainly in the north-east Indian state of Assam, where it is an official language, and it serves as a lingua franca of the wider region. The easternmost Indo-Iranian language, it has over 23 million speakers. Nefamese, an Assamese-based pidgin, is used in Arunachal Pradesh, and Nagamese, an Assamese-based creole language, is widely used in Nagaland. The Kamtapuri language of the Rangpur division of Bangladesh and the Cooch Behar and Jalpaiguri districts of India is linguistically closer to Assamese, though its speakers identify with the Bengali culture and the literary language. In the past, it was the court language of the Ahom kingdom from the 17th century. Along with other Eastern Indo-Aryan languages, Assamese evolved at least before the 7th century CE from the middle Indo-Aryan Magadhi Prakrit. Its sister languages include Angika, Bengali, Bishnupriya Manipuri, Chakma, Chittagonian, Hajong, Rajbangsi ...


Armenian Language
Armenian is an Indo-European language and an independent branch of that family of languages. It is the official language of Armenia. Historically spoken in the Armenian Highlands, today Armenian is widely spoken throughout the Armenian diaspora. Armenian is written in its own writing system, the Armenian alphabet, introduced in 405 AD by the priest Mesrop Mashtots. The total number of Armenian speakers worldwide is estimated between 5 and 7 million.
History (Classification and origins): Armenian is an independent branch of the Indo-European languages. It is of interest to linguists for its distinctive phonological changes within that family. Armenian exhibits more satemization than centumization, although it is not classified as belonging to either of these subgroups. Some linguists tentatively conclude that Armenian, Greek (and Phrygian) and Indo-Iranian were dialectally close to each other; Handbook of Formal Languages (1997), p. 6 wit ...


Arabic Language
Arabic is a Semitic language spoken primarily across the Arab world (Semitic Languages: An International Handbook, edited by Stefan Weninger in collaboration with Geoffrey Khan, Michael P. Streck, and Janet C. E. Watson; Walter de Gruyter GmbH & Co. KG, Berlin/Boston, 2011). Having emerged in the 1st century, it is named after the Arab people; the term "Arab" was initially used to describe those living in the Arabian Peninsula, as perceived by geographers from ancient Greece. Since the 7th century, Arabic has been characterized by diglossia, with an opposition between a standard prestige language, namely Literary Arabic (Modern Standard Arabic, MSA, or Classical Arabic), and diverse vernacular varieties, which serve as mother tongues. Colloquial dialects vary significantly from MSA, impeding mutual intelligibility. MSA is only acquired through formal education and is not spoken natively. It is the language of literature, official documents, and formal written m ...


Abkhaz Language
Abkhaz, sometimes spelled Abxaz and also known as Abkhazian, is a Northwest Caucasian language most closely related to Abaza. It is spoken mostly by the Abkhaz people. It is one of the official languages of Abkhazia, where around 100,000 people speak it. Furthermore, it is spoken by thousands of members of the Abkhazian diaspora in Turkey, Georgia's autonomous republic of Adjara, Syria, Jordan, and several Western countries. 27 October is the day of the Abkhazian language in Georgia.
Classification: Abkhaz is a Northwest Caucasian language and is thus related to Adyghe. Abkhaz is especially close to Abaza, and they are sometimes considered dialects of the same language, Abazgi (B. G. Hewitt, Abkhaz, 1979, page 1), of which the literary dialects of Abkhaz and Abaza are simply two ends of a dialect continuum. Grammatically, the two are very similar; however, the differences in phonology are substantial. It also contains elements characteristic of Kabar ...