The Latin script, also known as the Roman script, is a
writing system
A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
based on the letters of the
classical Latin alphabet, derived from a form of the
Greek alphabet
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to systematically write vowels as wel ...
which was in use in the ancient Greek city of
Cumae
Cumae ( or or ; ) was the first ancient Greek colony of Magna Graecia on the mainland of Italy and was founded by settlers from Euboea in the 8th century BCE. It became a rich Roman city, the remains of which lie near the modern village of ...
in
Magna Graecia
Magna Graecia refers to the Greek-speaking areas of southern Italy, encompassing the modern Regions of Italy, Italian regions of Calabria, Apulia, Basilicata, Campania, and Sicily. These regions were Greek colonisation, extensively settled by G ...
. The Greek alphabet was altered by the
Etruscans, and subsequently their alphabet was altered by the
Ancient Romans
The Roman people was the ethnicity and the body of Roman citizenship, Roman citizens
(; ) during the Roman Kingdom, the Roman Republic, and the Roman Empire. This concept underwent considerable changes throughout the long history of the Roman ...
. Several
Latin-script alphabet
A Latin-script alphabet (Latin alphabet or Roman alphabet) is an alphabet that uses Letter (alphabet), letters of the Latin script. The 21-letter archaic Latin alphabet and the 23-letter classical Latin alphabet belong to the oldest of this gr ...
s exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
The Latin script is the basis of the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
(IPA), and the 26 most widespread letters are the letters contained in the
ISO basic Latin alphabet
The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in various national and international standards and u ...
, which are the same letters as the
English alphabet
Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
.
Latin script is the basis for the largest number of alphabets of any writing system and is the
most widely adopted writing system in the world. Latin script is used as the standard method of writing the languages of Western and Central Europe, most of sub-Saharan Africa, the Americas, and Oceania, as well as many languages in other parts of the world.
Name
The script is either called Latin script or Roman script, in reference to its origin in
ancient Rome
In modern historiography, ancient Rome is the Roman people, Roman civilisation from the founding of Rome, founding of the Italian city of Rome in the 8th century BC to the Fall of the Western Roman Empire, collapse of the Western Roman Em ...
(though some of the capital letters are Greek in origin). In the context of
transliteration
Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus '' trans-'' + '' liter-'') in predictable ways, such as Greek → and → the digraph , Cyrillic → , Armenian → or L ...
, the term "
romanization
In linguistics, romanization is the conversion of text from a different writing system to the Latin script, Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and tra ...
" (
British English
British English is the set of Variety (linguistics), varieties of the English language native to the United Kingdom, especially Great Britain. More narrowly, it can refer specifically to the English language in England, or, more broadly, to ...
: "romanisation") is often found.
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
uses the term "Latin" as does the
International Organization for Standardization
The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries.
M ...
(ISO).
The numeral system is called the Roman numeral system, and the collection of the elements is known as the
Roman numerals
Roman numerals are a numeral system that originated in ancient Rome and remained the usual way of writing numbers throughout Europe well into the Late Middle Ages. Numbers are written with combinations of letters from the Latin alphabet, eac ...
. The numbers 1, 2, 3 ... are Latin/Roman script numbers for the
Hindu–Arabic numeral system
The Hindu–Arabic numeral system (also known as the Indo-Arabic numeral system, Hindu numeral system, and Arabic numeral system) is a positional notation, positional Decimal, base-ten numeral system for representing integers; its extension t ...
.
ISO basic Latin alphabet
The use of the letters I and V for both consonants and vowels proved inconvenient as the Latin alphabet was adapted to Germanic and Romance languages.
W originated as a doubled
V (VV) used to represent the found in
Old English
Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
as early as the 7th century. It came into common use in the later 11th century, replacing the letter
wynn
Wynn or wyn (; also spelled wen, win, ƿynn, ƿyn, ƿen, and ƿin) is a letter of the Old English Latin alphabet, Old English alphabet, where it is used to represent the sound .
History The letter "W"
While the earliest Old English texts ...
, which had been used for the same sound. In the Romance languages, the minuscule form of V was a rounded ''u''; from this was derived a rounded capital U for the vowel in the 16th century, while a new, pointed minuscule ''v'' was derived from V for the consonant. In the case of I, a word-final
swash form, ''j'', came to be used for the consonant, with the unswashed form restricted to vowel use. Such conventions were erratic for centuries. J was introduced into English for the consonant in the 17th century (it had been rare as a vowel), but it was not universally considered a distinct letter in the alphabetic order until the 19th century.
By the 1960s, it became apparent to the computer and
telecommunication
Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
s industries in the
First World that a non-proprietary method of encoding characters was needed. The
International Organization for Standardization
The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries.
M ...
(ISO) encapsulated the Latin alphabet in their (
ISO/IEC 646
ISO/IEC 646 ''Information technology — ISO 7-bit coded character set for information interchange'', is an International Organization for Standardization, ISO/International Electrotechnical Commission, IEC standard in the ...
) standard. To achieve widespread acceptance, this encapsulation was based on popular usage. As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
, which included in the
character set
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical values that make up a c ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet
Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
. Later standards issued by the ISO, for example
ISO/IEC 10646
ISO/IEC JTC 1, entitled "Information technology", is a joint technical committee (JTC) of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). Its purpose is to develop, maintain and ...
(
Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
Spread

The Latin alphabet spread, along with
Latin
Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, from the
Italian Peninsula to the lands surrounding the
Mediterranean Sea
The Mediterranean Sea ( ) is a sea connected to the Atlantic Ocean, surrounded by the Mediterranean basin and almost completely enclosed by land: on the east by the Levant in West Asia, on the north by Anatolia in West Asia and Southern Eur ...
with the expansion of the
Roman Empire
The Roman Empire ruled the Mediterranean and much of Europe, Western Asia and North Africa. The Roman people, Romans conquered most of this during the Roman Republic, Republic, and it was ruled by emperors following Octavian's assumption of ...
. The eastern half of the Empire, including Greece, Turkey, the
Levant
The Levant ( ) is the subregion that borders the Eastern Mediterranean, Eastern Mediterranean sea to the west, and forms the core of West Asia and the political term, Middle East, ''Middle East''. In its narrowest sense, which is in use toda ...
, and Egypt, continued to use
Greek as a
lingua franca
A lingua franca (; ; for plurals see ), also known as a bridge language, common language, trade language, auxiliary language, link language or language of wider communication (LWC), is a Natural language, language systematically used to make co ...
, but Latin was widely spoken in the western half, and as the western
Romance languages
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
evolved out of Latin, they continued to use and adapt the Latin alphabet.
Middle Ages
With the spread of
Western Christianity
Western Christianity is one of two subdivisions of Christianity (Eastern Christianity being the other). Western Christianity is composed of the Latin Church and Protestantism, Western Protestantism, together with their offshoots such as the O ...
during the
Middle Ages
In the history of Europe, the Middle Ages or medieval period lasted approximately from the 5th to the late 15th centuries, similarly to the post-classical period of global history. It began with the fall of the Western Roman Empire and ...
, the Latin alphabet was gradually adopted by the peoples of
Northern Europe
The northern region of Europe has several definitions. A restrictive definition may describe northern Europe as being roughly north of the southern coast of the Baltic Sea, which is about 54th parallel north, 54°N, or may be based on other ge ...
who spoke
Celtic languages
The Celtic languages ( ) are a branch of the Indo-European language family, descended from the hypothetical Proto-Celtic language. The term "Celtic" was first used to describe this language group by Edward Lhuyd in 1707, following Paul-Yve ...
(displacing the
Ogham alphabet) or
Germanic languages
The Germanic languages are a branch of the Indo-European languages, Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoke ...
(displacing earlier
Runic alphabet
Runes are the Letter (alphabet), letters in a set of related alphabets, known as runic rows, runic alphabets or futharks (also, see ''#Futharks, futhark'' vs ''#Runic alphabets, runic alphabet''), native to the Germanic peoples. Runes were ...
s) or
Baltic languages
The Baltic languages are a branch of the Indo-European languages, Indo-European language family spoken natively or as a second language by a population of about 6.5–7.0 million people , as well as by the speakers of several
Uralic languages, most notably
Hungarian,
Finnish and
Estonian.
The Latin script also came into use for writing the
West Slavic languages
The West Slavic languages are a subdivision of the Slavic language group. They include Polish, Czech, Slovak, Kashubian, Silesian, Upper Sorbian and Lower Sorbian. The languages have traditionally been spoken across a mostly continuous re ...
and several
South Slavic languages
The South Slavic languages are one of three branches of the Slavic languages. There are approximately 30 million speakers, mainly in the Balkans. These are separated geographically from speakers of the other two Slavic branches (West Slavic la ...
, as the people who spoke them adopted
Roman Catholicism
The Catholic Church (), also known as the Roman Catholic Church, is the List of Christian denominations by number of members, largest Christian church, with 1.27 to 1.41 billion baptized Catholics Catholic Church by country, worldwid ...
. The speakers of
East Slavic languages
The East Slavic languages constitute one of three regional subgroups of the Slavic languages, distinct from the West Slavic languages, West and South Slavic languages. East Slavic languages are currently spoken natively throughout Eastern Europe, ...
generally adopted
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
along with
Orthodox Christianity. The
Serbian language
Serbian (, ) is the standard language, standardized Variety (linguistics)#Standard varieties, variety of the Serbo-Croatian language mainly used by Serbs. It is the official and national language of Serbia, one of the three official languages of ...
uses both scripts, with Cyrillic predominating in official communication and Latin elsewhere, as determined by the Law on Official Use of the Language and Alphabet.
Since the 16th century
As late as 1500, the Latin script was limited primarily to the languages spoken in
Western,
Northern, and
Central Europe
Central Europe is a geographical region of Europe between Eastern Europe, Eastern, Southern Europe, Southern, Western Europe, Western and Northern Europe, Northern Europe. Central Europe is known for its cultural diversity; however, countries in ...
. The Orthodox Christian Slavs of
Eastern and
Southeastern Europe
Southeast Europe or Southeastern Europe is a geographical sub-region of Europe, consisting primarily of the region of the Balkans, as well as adjacent regions and Archipelago, archipelagos. There are overlapping and conflicting definitions of t ...
mostly used
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
, and the Greek alphabet was in use by Greek speakers around the eastern Mediterranean. The
Arabic script
The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
was widespread within Islam, both among
Arab
Arabs (, , ; , , ) are an ethnic group mainly inhabiting the Arab world in West Asia and North Africa. A significant Arab diaspora is present in various parts of the world.
Arabs have been in the Fertile Crescent for thousands of years ...
s and non-Arab nations like the
Iranians,
Indonesians,
Malays, and
Turkic peoples
Turkic peoples are a collection of diverse ethnic groups of West Asia, West, Central Asia, Central, East Asia, East, and North Asia as well as parts of Europe, who speak Turkic languages.. "Turkic peoples, any of various peoples whose members ...
. Most of the rest of Asia used a variety of
Brahmic alphabets or the
Chinese script.
Through
European colonization the Latin script has spread to the
Americas
The Americas, sometimes collectively called America, are a landmass comprising the totality of North America and South America.''Webster's New World College Dictionary'', 2010 by Wiley Publishing, Inc., Cleveland, Ohio. When viewed as a sing ...
,
Oceania
Oceania ( , ) is a region, geographical region including Australasia, Melanesia, Micronesia, and Polynesia. Outside of the English-speaking world, Oceania is generally considered a continent, while Mainland Australia is regarded as its co ...
, parts of Asia, Africa, and the Pacific, in forms based on the
Spanish,
Portuguese,
English,
French,
German and
Dutch alphabets.
It is used for many
Austronesian languages
The Austronesian languages ( ) are a language family widely spoken throughout Maritime Southeast Asia, parts of Mainland Southeast Asia, Madagascar, the islands of the Pacific Ocean and Taiwan (by Taiwanese indigenous peoples). They are spoken ...
, including the
languages of the Philippines
There are some 130 to 195 languages spoken in the Philippines, depending on the method of classification. Almost all are Malayo-Polynesian languages native to the archipelago. A number of Spanish-influenced creole language, creole varieties ge ...
and the
Malaysian and
Indonesian language
Indonesian (; ) is the official language, official and national language of Indonesia. It is a standard language, standardized variety (linguistics), variety of Malay language, Malay, an Austronesian languages, Austronesian language that has ...
s, replacing earlier Arabic and indigenous Brahmic alphabets. Latin letters served as the basis for the forms of the
Cherokee syllabary developed by
Sequoyah
Sequoyah ( ; , , or , , ; 1770 – August 1843), also known as George Gist or George Guess, was a Native American polymath and Constructed script, neographer of the Cherokee Nation.
In 1821, Sequoyah completed his Cherokee syllabary, enabl ...
; however, the sound values are completely different.
Under Portuguese missionary influence, a Latin alphabet was devised for the
Vietnamese language
Vietnamese () is an Austroasiatic languages, Austroasiatic language Speech, spoken primarily in Vietnam where it is the official language. It belongs to the Vietic languages, Vietic subgroup of the Austroasiatic language family. Vietnamese is s ...
, which had previously used
Chinese characters
Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
. The Latin-based alphabet replaced the Chinese characters in administration in the 19th century with French rule.
Since the 19th century
In the late 19th century, the
Romanians
Romanians (, ; dated Endonym and exonym, exonym ''Vlachs'') are a Romance languages, Romance-speaking ethnic group and nation native to Central Europe, Central, Eastern Europe, Eastern, and Southeastern Europe. Sharing a Culture of Romania, ...
returned to the Latin alphabet, dropping the
Romanian Cyrillic alphabet
The Romanian Cyrillic alphabet is the Cyrillic alphabet that was used to write the Romanian language and Church Slavonic until the 1830s, when it began to be gradually replaced by a Latin-based Romanian alphabet.Cyrillic remained in occasion ...
.
Romanian is one of the
Romance language
The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
s.
Since 20th century
In 1928, as part of
Mustafa Kemal Atatürk
Mustafa Kemal Atatürk ( 1881 – 10 November 1938) was a Turkish field marshal and revolutionary statesman who was the founding father of the Republic of Turkey, serving as its first President of Turkey, president from 1923 until Death an ...
's reforms, the new
Republic of Turkey
Turkey, officially the Republic of Türkiye, is a country mainly located in Anatolia in West Asia, with a relatively small part called East Thrace in Southeast Europe. It borders the Black Sea to the north; Georgia (country), Georgia, Armen ...
adopted a Latin alphabet for the
Turkish language
Turkish ( , , also known as 'Turkish of Turkey') is the most widely spoken of the Turkic languages, a member of Oghuz languages, Oghuz branch with around 90 million speakers. It is the national language of Turkey and one of two official languag ...
, replacing a modified Arabic alphabet. Most of the
Turkic-speaking peoples of the former
USSR
The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
, including
Tatars
Tatars ( )[Tatar]
in the Collins English Dictionary are a group of Turkic peoples across Eas ...
,
Bashkirs
The Bashkirs ( , ) or Bashkorts (, ; , ) are a Turkic peoples, Turkic ethnic group indigenous to Russia. They are concentrated in Bashkortostan, a Republics of Russia, republic of the Russian Federation and in the broader historical region of B ...
,
Azeri,
Kazakh,
Kyrgyz and others, had their writing systems replaced by the Latin-based
Uniform Turkic alphabet in the 1930s; but, in the 1940s, all were replaced by Cyrillic.
After the
collapse of the Soviet Union
The Soviet Union was formally dissolved as a sovereign state and subject of international law on 26 December 1991 by Declaration No. 142-N of the Soviet of the Republics of the Supreme Soviet of the Soviet Union. Declaration No. 142-Н of ...
in 1991, three of the newly independent Turkic-speaking republics,
Azerbaijan
Azerbaijan, officially the Republic of Azerbaijan, is a Boundaries between the continents, transcontinental and landlocked country at the boundary of West Asia and Eastern Europe. It is a part of the South Caucasus region and is bounded by ...
,
Uzbekistan
, image_flag = Flag of Uzbekistan.svg
, image_coat = Emblem of Uzbekistan.svg
, symbol_type = Emblem of Uzbekistan, Emblem
, national_anthem = "State Anthem of Uzbekistan, State Anthem of the Republ ...
,
Turkmenistan
Turkmenistan is a landlocked country in Central Asia bordered by Kazakhstan to the northwest, Uzbekistan to the north, east and northeast, Afghanistan to the southeast, Iran to the south and southwest and the Caspian Sea to the west. Ash ...
, as well as Romanian-speaking
Moldova
Moldova, officially the Republic of Moldova, is a Landlocked country, landlocked country in Eastern Europe, with an area of and population of 2.42 million. Moldova is bordered by Romania to the west and Ukraine to the north, east, and south. ...
, officially adopted Latin alphabets for their languages.
Kyrgyzstan
Kyrgyzstan, officially the Kyrgyz Republic, is a landlocked country in Central Asia lying in the Tian Shan and Pamir Mountains, Pamir mountain ranges. Bishkek is the Capital city, capital and List of cities in Kyrgyzstan, largest city. Kyrgyz ...
,
Iranian
Iranian () may refer to:
* Something of, from, or related to Iran
** Iranian diaspora, Iranians living outside Iran
** Iranian architecture, architecture of Iran and parts of the rest of West Asia
** Iranian cuisine, cooking traditions and practic ...
-speaking
Tajikistan
Tajikistan, officially the Republic of Tajikistan, is a landlocked country in Central Asia. Dushanbe is the capital city, capital and most populous city. Tajikistan borders Afghanistan to the Afghanistan–Tajikistan border, south, Uzbekistan to ...
, and the breakaway region of
Transnistria
Transnistria, officially known as the Pridnestrovian Moldavian Republic and locally as Pridnestrovie, is a Landlocked country, landlocked Transnistria conflict#International recognition of Transnistria, breakaway state internationally recogn ...
kept the Cyrillic alphabet, chiefly due to their close ties with Russia.
In the 1930s and 1940s, the majority of
Kurds
Kurds (), or the Kurdish people, are an Iranian peoples, Iranic ethnic group from West Asia. They are indigenous to Kurdistan, which is a geographic region spanning southeastern Turkey, northwestern Iran, northern Iraq, and northeastern Syri ...
replaced the Arabic script with two Latin alphabets. Although only the official
Kurdish government uses an Arabic alphabet for public documents, the Latin Kurdish alphabet remains widely used throughout the region by the majority of
Kurdish-speakers.
In 1957, the
People's Republic of China
China, officially the People's Republic of China (PRC), is a country in East Asia. With population of China, a population exceeding 1.4 billion, it is the list of countries by population (United Nations), second-most populous country after ...
introduced a script reform to the
Zhuang language, changing its orthography from
Sawndip, a writing system based on Chinese, to a Latin script alphabet that used a mixture of Latin, Cyrillic, and IPA letters to represent both the phonemes and tones of the Zhuang language, without the use of diacritics. In 1982 this was further standardised to use only Latin script letters.
With the collapse of the
Derg and subsequent end of decades of
Amharic assimilation in 1991, various ethnic groups in
Ethiopia
Ethiopia, officially the Federal Democratic Republic of Ethiopia, is a landlocked country located in the Horn of Africa region of East Africa. It shares borders with Eritrea to the north, Djibouti to the northeast, Somalia to the east, Ken ...
dropped the
Geʽez script
Geʽez ( ; , ) is a script used as an abugida (alphasyllabary) for several Afroasiatic languages, Afro-Asiatic and Nilo-Saharan languages, Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an abjad (consonantal alphabet) and was ...
, which was deemed unsuitable for languages outside of the
Semitic branch. In the following years the
Kafa,
Oromo,
Sidama,
Somali,
and
Wolaitta languages switched to Latin while there is continued debate on whether to follow suit for the
Hadiyya and
Kambaata languages.
21st century
On 15 September 1999 the authorities of
Tatarstan
Tatarstan, officially the Republic of Tatarstan, sometimes also called Tataria, is a Republics of Russia, republic of Russia located in Eastern Europe. It is a part of the Volga Federal District; and its capital city, capital and largest city i ...
, Russia, passed a law to make the Latin script a co-official writing system alongside Cyrillic for the
Tatar language
Tatar ( ; or ) is a Turkic languages, Turkic language spoken by the Volga Tatars mainly located in modern Tatarstan (European Russia), as well as Siberia. It should not be confused with Crimean Tatar language, Crimean Tatar or Siberian Tatar ...
by 2011. A year later, however, the Russian government overruled the law and banned Latinization on its territory.
In 2015, the
government of Kazakhstan
The Government of the Republic of Kazakhstan (, ''Qazaqstan Respublikasynyñ Ükımetı'') is the collegial body that exercises Executive branch, executive power in the Kazakhstan, Republic of Kazakhstan; The government heads the system of execu ...
announced that a
Kazakh Latin alphabet would replace the
Kazakh Cyrillic alphabet as the official writing system for the
Kazakh language
Kazakh is a Turkic language of the Kipchak branch spoken in Central Asia by Kazakhs. It is closely related to Nogai, Kyrgyz and Karakalpak. It is the official language of Kazakhstan, and has official status in the Altai Republic of Russia ...
by 2025. There are also talks about switching from the Cyrillic script to Latin in Ukraine,
Kyrgyzstan
Kyrgyzstan, officially the Kyrgyz Republic, is a landlocked country in Central Asia lying in the Tian Shan and Pamir Mountains, Pamir mountain ranges. Bishkek is the Capital city, capital and List of cities in Kyrgyzstan, largest city. Kyrgyz ...
, and
Mongolia
Mongolia is a landlocked country in East Asia, bordered by Russia to the north and China to the south and southeast. It covers an area of , with a population of 3.5 million, making it the world's List of countries and dependencies by po ...
. Mongolia, however, has since opted to revive the
Mongolian script
The traditional Mongolian script, also known as the Hudum Mongol bichig, was the first Mongolian alphabet, writing system created specifically for the Mongolian language, and was the most widespread until the introduction of Cyrillic script, Cy ...
instead of switching to Latin.
In October 2019,
Inuit Tapiriit Kanatami (ITK), the national organization for
Inuit
Inuit (singular: Inuk) are a group of culturally and historically similar Indigenous peoples traditionally inhabiting the Arctic and Subarctic regions of North America and Russia, including Greenland, Labrador, Quebec, Nunavut, the Northwe ...
in Canada announced that they will introduce a unified writing system for the
Inuit languages
The Inuit languages are a closely related group of Indigenous languages of the Americas, indigenous American languages traditionally spoken across the North American Arctic and the adjacent subarctic regions as far south as Labrador. The Inuit ...
in the country. The writing system is based on the Latin alphabet and is modeled after the one used in the
Greenlandic language
Greenlandic, also known by its Endonym and exonym, endonym Kalaallisut (, ), is an Inuit languages, Inuit language belonging to the Eskaleut languages#Internal classification, Eskimoan branch of the Eskaleut languages, Eskaleut language family. ...
.
On 12 February 2021 the government of Uzbekistan announced it will finalize the transition from Cyrillic to Latin for the
Uzbek language
Uzbek is a Karluk Turkic language spoken by Uzbeks. It is the official and national language of Uzbekistan and formally succeeded Chagatai, an earlier Karluk language endonymically called or , as the literary language of Uzbekistan in the 19 ...
by 2023. Plans to switch to Latin originally began in 1993 but subsequently stalled and Cyrillic remained in widespread use.
At present the
Crimean Tatar language
Crimean Tatar (), also called Crimean (), is a Turkic languages, Turkic language spoken in Crimea and the Crimean Tatar diasporas of Uzbekistan, Turkey and Bulgaria, as well as small communities in the United States and Canada. It should not ...
uses both Cyrillic and Latin. The use of Latin was originally approved by Crimean Tatar representatives after the Soviet Union's collapse but was never implemented by the regional government. After Russia's
annexation
Annexation, in international law, is the forcible acquisition and assertion of legal title over one state's territory by another state, usually following military occupation of the territory. In current international law, it is generally held t ...
of Crimea in 2014 the Latin script was dropped entirely. Nevertheless, Crimean Tatars outside of Crimea continue to use Latin and on 22 October 2021 the government of Ukraine approved a proposal endorsed by the
Mejlis of the Crimean Tatar People to switch the Crimean Tatar language to Latin by 2025.
In July 2020, 2.6 billion people (36% of the world population) use the Latin alphabet.
International standards
By the 1960s, it became apparent to the computer and
telecommunication
Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
s industries in the
First World that a non-proprietary method of encoding characters was needed. The
International Organization for Standardization
The International Organization for Standardization (ISO ; ; ) is an independent, non-governmental, international standard development organization composed of representatives from the national standards organizations of member countries.
M ...
(ISO) encapsulated the Latin alphabet in their (
ISO/IEC 646
ISO/IEC 646 ''Information technology — ISO 7-bit coded character set for information interchange'', is an International Organization for Standardization, ISO/International Electrotechnical Commission, IEC standard in the ...
) standard. To achieve widespread acceptance, this encapsulation was based on popular usage.
As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
, which included in the
character set
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical values that make up a c ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet
Modern English is written with a Latin-script alphabet consisting of 26 Letter (alphabet), letters, with each having both uppercase and lowercase forms. The word ''alphabet'' is a Compound (linguistics), compound of ''alpha'' and ''beta'', t ...
. Later standards issued by the ISO, for example
ISO/IEC 10646
ISO/IEC JTC 1, entitled "Information technology", is a joint technical committee (JTC) of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). Its purpose is to develop, maintain and ...
(
Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
National standards
The DIN standard
DIN 91379
The DIN, DIN standard DIN 91379: "Characters and defined character sequences in Unicode for the electronic processing of names and data exchange in Europe, with CD-ROM" defines a normative subset of Unicode Latin characters, sequences of base char ...
specifies a subset of Unicode letters, special characters, and sequences of letters and diacritic signs to allow the correct representation of names and to simplify data exchange in Europe. This specification supports all official languages of
European Union
The European Union (EU) is a supranational union, supranational political union, political and economic union of Member state of the European Union, member states that are Geography of the European Union, located primarily in Europe. The u ...
and
European Free Trade Association
The European Free Trade Association (EFTA) is a regional trade organization and free trade area consisting of four List of sovereign states and dependent territories in Europe, European states: Iceland, Liechtenstein, Norway and Switzerland. ...
countries (thus also the Greek and
Cyrillic
The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
scripts), plus the
German minority languages. To allow the transliteration of names in other writing systems to the Latin script according to the relevant ISO standards, all necessary combinations of base letters and diacritic signs are provided.
Efforts are being made to further develop it into a European
CEN standard.
As used by various languages
In the course of its use, the Latin alphabet was adapted for use in new languages, sometimes representing
phoneme
A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s not found in languages that were already written with the Roman characters. To represent these new sounds, extensions were therefore created, be it by adding
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s to existing
letters, by joining multiple letters together to make
ligatures, by creating completely new forms, or by assigning a special function to pairs or triplets of letters. These new forms are given a place in the alphabet by defining an
alphabetical order
Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is ...
or collation sequence, which can vary with the particular language.
Letters
Some examples of new letters to the standard Latin alphabet are the
Runic letters
wynn
Wynn or wyn (; also spelled wen, win, ƿynn, ƿyn, ƿen, and ƿin) is a letter of the Old English Latin alphabet, Old English alphabet, where it is used to represent the sound .
History The letter "W"
While the earliest Old English texts ...
and
thorn , and the letter
eth , which were added to the alphabet of
Old English
Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
. Another Irish letter, the
insular ''g'', developed into
yogh , used in
Middle English
Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman Conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English pe ...
. Wynn was later replaced with the new letter , eth and thorn with , and yogh with . Although the four are no longer part of the English or Irish alphabets, eth and thorn are still used in the modern
Icelandic alphabet, while eth is also used by the
Faroese alphabet.
Some West, Central and
Southern Africa
Southern Africa is the southernmost region of Africa. No definition is agreed upon, but some groupings include the United Nations geoscheme for Africa, United Nations geoscheme, the intergovernmental Southern African Development Community, and ...
n languages use a few additional letters that have sound values similar to those of their equivalents in the IPA. For example,
Adangme uses the letters and , and
Ga uses , and .
Hausa uses and for
implosives, and for an
ejective.
Africanists have standardized these into the
African reference alphabet
The African Reference Alphabet is a largely defunct continent-wide guideline for the creation of Latin alphabets for African languages. Two variants of the initial proposal (one in English and a second in French) were made at a 1978 UNESCO-organi ...
.
Dotted and
dotless I — and — are two forms of the letter I used by the
Turkish,
Azerbaijani, and
Kazakh alphabets. The Azerbaijani language also has , which represents the
near-open front unrounded vowel
The near-open front unrounded vowel, or near-low front unrounded vowel, is a type of vowel sound. The symbol in the International Phonetic Alphabet that represents this sound is , a lowercase of the ligature. Both the symbol and the sound ar ...
.
Multigraphs
A
digraph is a pair of letters used to write one sound or a combination of sounds that does not correspond to the written letters in sequence. Examples are , , , , , in English, and , , and in Dutch. In Dutch the is capitalized as or the
ligature , but never as , and it often takes the appearance of a ligature very similar to the letter in
handwriting.
A
trigraph is made up of three letters, like the
German , the
Breton or the
Milanese . In the
orthographies of some languages, digraphs and trigraphs are regarded as independent letters of the alphabet in their own right. The capitalization of digraphs and trigraphs is language-dependent, as only the first letter may be capitalized, or all component letters simultaneously (even for words written in title case, where letters after the digraph or trigraph are left in lowercase).
Ligatures
A
ligature is a fusion of two or more ordinary letters into a new
glyph
A glyph ( ) is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A ...
or character. Examples are (from , called ''ash''), (from , sometimes called ''oethel'' or ''eðel''), the
abbreviation
An abbreviation () is a shortened form of a word or phrase, by any method including shortening (linguistics), shortening, contraction (grammar), contraction, initialism (which includes acronym), or crasis. An abbreviation may be a shortened for ...
(from , called ''ampersand''), and (from or , the
archaic medial form of , followed by an or , called ''sharp S'' or ''eszett'').
Diacritics

A diacritic, in some cases also called an accent, is a small symbol that can appear above or below a letter, or in some other position, such as the
umlaut sign used in the German characters , , or the Romanian characters
ă,
â,
î,
ș,
ț. Its main function is to change the phonetic value of the letter to which it is added, but it may also modify the pronunciation of a whole syllable or word, indicate the start of a new syllable, or distinguish between
homographs such as the
Dutch words ''
een'' () meaning "a" or "an", and ''
één'', () meaning "one". As with the pronunciation of letters, the effect of diacritics is language-dependent.
English is the only major modern
European language that requires no diacritics for its native vocabulary. Historically, in formal writing, a
diaeresis was sometimes used to indicate the start of a new syllable within a sequence of letters that could otherwise be misinterpreted as being a single vowel (e.g., "coöperative", "reëlect"), but modern writing styles either omit such marks or use a hyphen to indicate a syllable break (e.g. "co-operative", "re-elect").
Collation
Some modified letters, such as the symbols , , and , may be regarded as new individual letters in themselves, and assigned a specific place in the alphabet for
collation purposes, separate from that of the letter on which they are based, as is done in
Swedish. In other cases, such as with , , in German, this is not done; letter-diacritic combinations being identified with their base letter. The same applies to digraphs and trigraphs. Different diacritics may be treated differently in collation within a single language. For example, in Spanish, the character is considered a letter, and sorted between and in dictionaries, but the accented vowels , , , , , are not separated from the unaccented vowels , , , , .
Capitalization
The languages that use the Latin script today generally use
capital letters
Letter case is the distinction between the letters that are in larger uppercase or capitals (more formally ''#Majuscule, majuscule'') and smaller lowercase (more formally ''#Minuscule, minuscule'') in the written representation of certain langua ...
to begin paragraphs and sentences and
proper nouns. The rules for
capitalization have changed over time, and different languages have varied in their rules for capitalization.
Old English
Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
, for example, was rarely written with even proper nouns capitalized; whereas
Modern English
Modern English, sometimes called New English (NE) or present-day English (PDE) as opposed to Middle and Old English, is the form of the English language that has been spoken since the Great Vowel Shift in England
England is a Count ...
of the 18th century had frequently all nouns capitalized, in the same way that Modern
German is written today, e.g. .
Romanization
Words from languages natively written with other
scripts, such as
Arabic
Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
or
Chinese, are usually
transliterated
Transliteration is a type of conversion of a text from one writing system, script to another that involves swapping Letter (alphabet), letters (thus ''wikt:trans-#Prefix, trans-'' + ''wikt:littera#Latin, liter-'') in predictable ways, such as ...
or
transcribed when embedded in Latin-script text or in
multilingual international communication, a process termed ''romanization''.
Whilst the romanization of such languages is used mostly at unofficial levels, it has been especially prominent in computer messaging where only the limited seven-bit
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
code is available on older systems. However, with the introduction of
Unicode
Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
, romanization is now becoming less necessary. Keyboards used to enter such text may still restrict users to romanized text, as only ASCII or Latin-alphabet characters may be available.
See also
*
Western Latin character sets (computing)
Several 8-bit character encoding, character sets (encodings) were designed for binary representation of common Western European languages (Italian language, Italian, Spanish language, Spanish, Portuguese language, Portuguese, French language, Fren ...
*
European Latin Unicode subset (DIN 91379)
*
H with left hook
*
Latin letters used in mathematics
*
Latin omega
Notes
References
Citations
Sources
*
Further reading
* Boyle, Leonard E. 1976. "Optimist and recensionist: 'Common errors' or 'common variations. In ''Latin script and letters A.D. 400–900: Festschrift presented to Ludwig Bieler on the occasion of his 70th birthday.'' Edited by John J. O'Meara and Bernd Naumann, 264–74. Leiden, The Netherlands: Brill.
* Morison, Stanley. 1972. ''Politics and script: Aspects of authority and freedom in the development of Graeco-Latin script from the sixth century B.C. to the twentieth century A.D.'' Oxford: Clarendon.
External links
Unicode collation chart��Latin letters sorted by shape
Diacritics ProjectAll you need to design a font with correct accents
{{Authority control
History of the Roman Republic