OCR In Indian Languages
Indic OCR refers to the process of converting images of text written in Indic scripts into e-text using optical character recognition (OCR) techniques. More broadly, it can also refer to OCR systems for the Brahmic scripts used by languages of South Asia and Southeast Asia, not just the scripts of the Indian subcontinent, all of which are abugida-based writing systems. OCR for Latin characters is still not 100% accurate, but a relatively high degree of conversion accuracy has been achieved. Comparable accuracy has not yet been achieved for Indic scripts. This is due in part to the writing systems of Indic languages as well as a lack of standard representation, encoding, and support among operating systems and keyboards. The Centre for Development of Advanced Computing (C-DAC) and Technology Development for Indian Languages, the premier R&D organisation of the Ministry of Electronics and Information Technology (also known as MeitY) of India have carried ...
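As a rough illustration of why encoding matters for abugida scripts, the sketch below lists the Unicode code points behind a single Devanagari syllable. The sample string and the helper name `describe_code_points` are assumptions chosen for illustration; they do not come from any particular OCR system.

```python
import unicodedata


def describe_code_points(text):
    """Return (code point, Unicode name, general category) for each character."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN"), unicodedata.category(ch))
        for ch in text
    ]


# Illustrative sample (an assumption, not from the source): the Devanagari
# syllable "ki", stored as the consonant KA followed by the dependent vowel sign I.
syllable = "\u0915\u093F"

for code_point, name, category in describe_code_points(syllable):
    print(code_point, name, category)
```

One visual unit on the page corresponds here to two code points, and the vowel sign is drawn to the left of the consonant even though it is stored after it. An OCR system for such a script therefore has to map a glyph cluster to an ordered sequence of code points rather than to a single character.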

Writing System
A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form of information storage and transfer. Writing systems require shared understanding between writers and readers of the meaning behind the sets of characters that make up a script. Writing is usually recorded onto a durable medium, such as paper or electronic storage, although non-durable methods may also be used, such as writing on a computer display, on a blackboard, in sand, or by skywriting. Reading a text can be accomplished purely in the mind as an internal process, or expressed orally. Writing systems can be placed into broad categories such as alphabets, syllabaries, or logographies, although any particular system may have attributes of more than one category. In the alphabetic category, a standard set of letters represent speech ...

Devanagari
Devanagari, also called Nagari (Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, page 83), is a left-to-right abugida (a type of segmental writing system), based on the ancient ''Brāhmī'' script, used in the northern Indian subcontinent. It was developed and in regular use by the 7th century CE. The Devanagari script, composed of 47 primary characters, including 14 vowels and 33 consonants, is the fourth most widely adopted writing system in the world, being used for over 120 languages (Devanagari (Nagari), Script Features and Description, SIL International (2013), United States). The orthography of this script reflects the pr ...

Malayalam Language
Malayalam is a Dravidian language spoken by the Malayali people in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry (Mahé district). It is one of the 22 scheduled languages of India and was designated a "Classical Language of India" in 2013. Malayalam has official language status in Kerala and Puducherry (Mahé), is the primary spoken language of Lakshadweep, and is spoken by 34 million people in India. Malayalam is also spoken by linguistic minorities in the neighbouring states, with a significant number of speakers in the Kodagu and Dakshina Kannada districts of Karnataka and the Kanyakumari district of Tamil Nadu. It is also spoken by the Malayali diaspora worldwide, especially in the Persian Gulf countries, due to the large populations of Malayali expatriates there. There are significant populations in cities across India, including Mumbai, Bengaluru, Delhi, Kolkata, and Pune. The origin of Malayalam remains a matter of ...

Gujarati Language
Gujarati (ગુજરાતી, Gujarātī) is an Indo-Aryan language native to the Indian state of Gujarat and spoken predominantly by the Gujarati people. Gujarati is descended from Old Gujarati. In India, it is one of the 22 scheduled languages of the Union. It is also the official language of the state of Gujarat, as well as an official language in the union territory of Dadra and Nagar Haveli and Daman and Diu. As of 2011, Gujarati is the 6th most widely spoken language in India by number of native speakers, with 55.5 million speakers, or about 4.5% of the total Indian population. It is the 26th most widely spoken language in the world by number of native speakers as of 2007 (Mikael Parkvall, "Världens 100 största språk 2007" (The World's 100 Largest Languages in 2007), in ''Nationalencyklopedin''; asterisks mark the 2010 estimates for the top dozen languages). Outside of Gujarat, Gujarati is ...

Vowel
A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (length). They are usually voiced and are closely involved in prosodic variation such as tone, intonation and stress. The word ''vowel'' comes from the Latin word ''vocalis'', meaning "vocal" (i.e. relating to the voice). In English, the word ''vowel'' is commonly used to refer both to vowel sounds and to the written symbols that represent them (a, e, i, o, u, and sometimes y).
Definition
There are two complementary definitions of vowel, one phonetic and the other phonological.
* In the phonetic definition, a vowel is a sound, such as the English "ah" or "oh", produced with an open vocal tract; it is median (the air escapes along the middle of the tongue), oral (at least some of the airflow must escape through the mouth), frictionless and continuant ...

Consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are [p] and [b], pronounced with the lips; [t] and [d], pronounced with the front of the tongue; [k] and [g], pronounced with the back of the tongue; [h], pronounced in the throat; [f], [v], and [s], pronounced by forcing air through a narrow channel (fricatives); and [m] and [n], which have air flowing through the nose (nasals). Contrasting with consonants are vowels. Since the number of speech sounds in the world's languages is much greater than the number of letters in any one alphabet, linguists have devised systems such as the International Phonetic Alphabet (IPA) to assign a unique and unambiguous symbol to each attested consonant. The English alphabet has fewer consonant letters than the English language has consonant sounds, so digraphs like "ch", "sh", "th", and "ng" are used to extend the alphabet, though some letters and digraphs represent more than one consonant. For example, th ...

Meitei Language
Meitei, also known as Manipuri, is a Tibeto-Burman language of north-eastern India. It is spoken by around 1.8 million people, predominantly in the state of Manipur, but also by smaller communities in the rest of the country and in parts of neighbouring Myanmar and Bangladesh. It is native to the Meitei people, and within Manipur it serves as an official language and a lingua franca. It was used as a court language in the historic Manipur Kingdom and is presently included among the 22 scheduled languages of India. Meitei is a tonal language whose exact classification within Sino-Tibetan remains unclear. It has lexical resemblances to Kuki and Tangkhul. Meitei is the most widely spoken Indian Sino-Tibetan language and the most spoken la ...

Assamese Language
Assamese, also Asamiya, is an Indo-Aryan language spoken mainly in the north-east Indian state of Assam, where it is an official language, and it serves as a ''lingua franca'' of the wider region. The easternmost Indo-Iranian language, it has over 23 million speakers. Nefamese, an Assamese-based pidgin, is used in Arunachal Pradesh, and Nagamese, an Assamese-based creole language, is widely used in Nagaland. The Kamtapuri language of the Rangpur division of Bangladesh and the Cooch Behar and Jalpaiguri districts of India is linguistically closer to Assamese, though its speakers identify with Bengali culture and the Bengali literary language. It was the court language of the Ahom kingdom from the 17th century. Along with other Eastern Indo-Aryan languages, Assamese evolved at least before the 7th century CE from the middle Indo-Aryan Magadhi Prakrit. Its sister languages include Angika, Bengali, Bishnupriya Manipuri, Chakma, Chittagonian, Hajong, Rajbangsi ...

Eastern Nagari
Eastern may refer to: Transportation *China Eastern Airlines, a current Chinese airline based in Shanghai *Eastern Air, former name of Zambia Skyways *Eastern Air Lines, a defunct American airline that operated from 1926 to 1991 *Eastern Air Lines (2015), an American airline that began operations in 2015 *Eastern Airlines, LLC, previously Dynamic International Airways, a U.S. airline founded in 2010 *Eastern Airways, an English/British regional airline *Eastern Provincial Airways, a defunct Canadian airline that operated from 1949 to 1986 *Eastern Railway (other), various railroads *Eastern Avenue (other), various roads *Eastern Parkway (other), various parkways *Eastern Freeway, Melbourne, Australia *Eastern Freeway Mumbai, Mumbai, India *, a cargo liner in service 1946-65 Education *Eastern University (other) *Eastern College (other) Other uses * Eastern Broadcasting Limited, former name of Maritime Broadcasting System, Canada * ...

Sanskrit
Sanskrit is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late Bronze Age. Sanskrit is the sacred language of Hinduism, the language of classical Hindu philosophy, and of historical texts of Buddhism and Jainism. It was a link language in ancient and medieval South Asia, and upon transmission of Hindu and Buddhist culture to Southeast Asia, East Asia and Central Asia in the early medieval era, it became a language of religion and high culture, and of the political elites in some of these regions. As a result, Sanskrit had a lasting impact on the languages of South Asia, Southeast Asia and East Asia, especially in their formal and learned vocabularies. Sanskrit generally connotes several Old Indo-Aryan language varieties. The most archaic of these is the Vedic Sanskrit found in the Rig Veda, a colle ...

Rajasthani Language
Rajasthani refers to a group of Indo-Aryan languages and dialects spoken primarily in the state of Rajasthan and adjacent areas of Haryana, Gujarat, and Madhya Pradesh in India. There are also speakers in the Pakistani provinces of Punjab and Sindh. Rajasthani varieties are closely related to and partially intelligible with their sister languages Gujarati and Sindhi. Rajasthani is spoken by 65.04% of the population of Rajasthan. Mutual comprehensibility between Rajasthani and Gujarati ranges from 60 to 85%, depending on the geographical extent of the dialects compared. The term ''Rajasthani'' is also used to refer to a literary language mostly based on Marwari, which is being promoted as a standard language for the state of Rajasthan.
History
Rajasthani has a literary tradition going back approximately 1500 years. The Vasantgadh Inscription from modern-day Sirohi that has been dated to the 7th century AD uses the term Rajasthaniaditya in reference to the official or maybe for a poe ...

Marathi Language
Marathi (''Marāṭhī'') is an Indo-Aryan language predominantly spoken by Marathi people in the Indian state of Maharashtra. It is the official language of Maharashtra and an additional official language in the state of Goa. It is one of the 22 scheduled languages of India, with 83 million speakers as of 2011. Marathi ranks 11th in the list of languages with the most native speakers in the world. Marathi has the third largest number of native speakers in India, after Hindi and Bengali. The language has some of the oldest literature of all modern Indian languages. The major dialects of Marathi are Standard Marathi and the Varhadi dialect. Marathi distinguishes inclusive and exclusive forms of 'we' and possesses a three-way gender system that features the neuter in addition to the masculine ...