Double Metaphone
   HOME
*





Double Metaphone
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar. As with Soundex, similar-sounding words should share the same keys. Metaphone is available as a built-in operator in a number of systems. Philips later produced a new version of the algorithm, which he named Double Metaphone. Contrary to the original algorithm whose application is limited to English only, this version takes into account spelling peculiarities of a number of other languages. In 2009 Philips released a third version, called Metaphone 3, which achieves an accuracy of approximately 99% for English words, non-English words familiar to Americans, and first names and family names commonly found ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Phonetic Algorithm
A phonetic algorithm is an algorithm for indexing of words by their pronunciation. Most phonetic algorithms were developed for English and are not useful for indexing words in other languages. Because English spelling varies significantly depending on multiple factors, such as the word's origin and usage over time and borrowings from other languages, phonetic algorithms necessarily take into account numerous rules and exceptions. Algorithms Among the best-known phonetic algorithms are: * Soundex, which was developed to encode surnames for use in censuses. Soundex codes are four-character strings composed of a single letter followed by three numbers. * Daitch–Mokotoff Soundex, which is a refinement of Soundex designed to better match surnames of Slavic and Germanic origin. Daitch–Mokotoff Soundex codes are strings composed of six numeric digits. * Cologne phonetics: This is similar to Soundex, but more suitable for German words. * Metaphone and Double Metaphone which are suitab ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Celtic Languages
The Celtic languages ( usually , but sometimes ) are a group of related languages descended from Proto-Celtic. They form a branch of the Indo-European language family. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yves Pezron, who made the explicit link between the Celts described by classical writers and the Welsh and Breton languages. During the 1st millennium BC, Celtic languages were spoken across much of Europe and central Anatolia. Today, they are restricted to the northwestern fringe of Europe and a few diaspora communities. There are six living languages: the four continuously living languages Breton, Irish, Scottish Gaelic and Welsh, and the two revived languages Cornish and Manx. All are minority languages in their respective countries, though there are continuing efforts at revitalisation. Welsh is an official language in Wales and Irish is an official language of Ireland and of the European Union. Welsh ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


New York State Identification And Intelligence System
The New York State Identification and Intelligence System Phonetic Code, commonly known as NYSIIS, is a phonetic algorithm devised in 1970 as part of the New York State New York, officially the State of New York, is a state in the Northeastern United States. It is often called New York State to distinguish it from its largest city, New York City. With a total area of , New York is the 27th-largest U.S. stat ... Identification and Intelligence System (now a part of the New York State Division of Criminal Justice Services). It features an accuracy increase of 2.7% over the traditional Soundex algorithm. Procedure The algorithm, as described in ''Name Search Techniques'', is: #If the first letters of the name are #:'MAC' then change these letters to 'MCC' #:'KN' then change these letters to 'NN' #:'K' then change this letter to 'C' #:'PH' then change these letters to 'FF' #:'PF' then change these letters to 'FF' #:'SCH' then change these letters to 'SSS' #If the last le ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Caverphone
The Caverphone within linguistics and computing, is a phonetic matching algorithm invented to identify English names with their sounds, originally built to process a custom dataset compound between 1893 and 1938 in southern Dunedin, New Zealand. Started from a similar concept as metaphone, it has been developed to accommodate and process general English since then. Etymology The Caverphone was created by David Hood in the Caversham Project at the University of Otago in New Zealand in 2002, revised in 2004. It was created to assist in data matching between late 19th century and early 20th century electoral rolls, where the name only needed to be in a "commonly recognisable form". The algorithm was intended to apply to those names that could not easily be matched between electoral rolls, after the exact matches were removed from the pool of potential matches. The algorithm is optimised for accents present in the study area (southern part of the city of Dunedin, New Zealand). Proced ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Várzea Paulista
Várzea Paulista is a municipality in the state of São Paulo in Brazil Brazil ( pt, Brasil; ), officially the Federative Republic of Brazil (Portuguese: ), is the largest country in both South America and Latin America. At and with over 217 million people, Brazil is the world's fifth-largest country by area .... The population is 123,071 (2020 est.) in an area of 35.1 km². The elevation is 745 m. It is part of the agglomeration of Jundiaí. References Municipalities in São Paulo (state) {{SaoPauloState-geo-stub ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Brazilian Portuguese
Brazilian Portuguese (' ), also Portuguese of Brazil (', ) or South American Portuguese (') is the set of varieties of the Portuguese language native to Brazil and the most influential form of Portuguese worldwide. It is spoken by almost all of the 214 million inhabitants of Brazil and spoken widely across the Brazilian diaspora, today consisting of about two million Brazilians who have emigrated to other countries. With a population of over 214 million, Brazil is by far the world's largest Portuguese-speaking nation and the only one in the Americas. Brazilian Portuguese differs, particularly in phonology and prosody, from varieties spoken in Portugal and Portuguese-speaking African countries. In these latter countries, the language tends to have a closer connection to contemporary European Portuguese, partly because Portuguese colonial rule ended much more recently there than in Brazil, partly due to the heavy indigenous and African influence on Brazilian Portuguese. Despite t ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Indo-European
The Indo-European languages are a language family native to the overwhelming majority of Europe, the Iranian plateau, and the northern Indian subcontinent. Some European languages of this family, English, French, Portuguese, Russian, Dutch, and Spanish, have expanded through colonialism in the modern period and are now spoken across several continents. The Indo-European family is divided into several branches or sub-families, of which there are eight groups with languages still alive today: Albanian, Armenian, Balto-Slavic, Celtic, Germanic, Hellenic, Indo-Iranian, and Italic; and another nine subdivisions that are now extinct. Today, the individual Indo-European languages with the most native speakers are English, Hindi–Urdu, Spanish, Bengali, French, Russian, Portuguese, German, and Punjabi, each with over 100 million native speakers; many others are small and in danger of extinction. In total, 46% of the world's population (3.2 billion people) speaks an Indo-Eur ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

List Of Dialects Of English
Dialects are linguistic varieties that may differ in pronunciation, vocabulary, spelling and grammar. For the classification of varieties of English only in terms of pronunciation, see regional accents of English. Overview Dialects can be defined as "sub-forms of languages which are, in general, mutually comprehensible." English speakers from different countries and regions use a variety of different accents (systems of pronunciation) as well as various localized words and grammatical constructions; many different dialects can be identified based on these factors. Dialects can be classified at broader or narrower levels: within a broad national or regional dialect, various more localised sub-dialects can be identified, and so on. The combination of differences in pronunciation and use of local words may make some English dialects almost unintelligible to speakers from other regions without any prior exposure. The major native dialects of English are often divided by linguists i ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Chinese Language
Chinese (, especially when referring to written Chinese) is a group of languages spoken natively by the ethnic Han Chinese majority and many minority ethnic groups in Greater China. About 1.3 billion people (or approximately 16% of the world's population) speak a variety of Chinese as their first language. Chinese languages form the Sinitic branch of the Sino-Tibetan languages family. The spoken varieties of Chinese are usually considered by native speakers to be variants of a single language. However, their lack of mutual intelligibility means they are sometimes considered separate languages in a family. Investigation of the historical relationships among the varieties of Chinese is ongoing. Currently, most classifications posit 7 to 13 main regional groups based on phonetic developments from Middle Chinese, of which the most spoken by far is Mandarin (with about 800 million speakers, or 66%), followed by Min (75 million, e.g. Southern Min), Wu (74 million, e.g. Shangh ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Spanish Language
Spanish ( or , Castilian) is a Romance languages, Romance language of the Indo-European language family that evolved from colloquial Latin spoken on the Iberian peninsula. Today, it is a world language, global language with more than 500 million native speakers, mainly in the Americas and Spain. Spanish is the official language of List of countries where Spanish is an official language, 20 countries. It is the world's list of languages by number of native speakers, second-most spoken native language after Mandarin Chinese; the world's list of languages by total number of speakers, fourth-most spoken language overall after English language, English, Mandarin Chinese, and Hindustani language, Hindustani (Hindi-Urdu); and the world's most widely spoken Romance languages, Romance language. The largest population of native speakers is in Mexico. Spanish is part of the Iberian Romance languages, Ibero-Romance group of languages, which evolved from several dialects of Vulgar Latin in I ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Italian Language
Italian (''italiano'' or ) is a Romance language of the Indo-European language family that evolved from the Vulgar Latin of the Roman Empire. Together with Sardinian, Italian is the least divergent language from Latin. Spoken by about 85 million people (2022), Italian is an official language in Italy, Switzerland (Ticino and the Grisons), San Marino, and Vatican City. It has an official minority status in western Istria (Croatia and Slovenia). Italian is also spoken by large immigrant and expatriate communities in the Americas and Australia.Ethnologue report for language code:ita (Italy)
– Gordon, Raymond G., Jr. (ed.), 2005. Ethnologue: Languages of the World, Fifteenth edition. Dallas, Tex.: SIL International. Online version
Itali ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]