HOME

TheInfoList



OR:

The Latin script, also known as Roman script, is an
alphabetic An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern Italy ( Magna Grecia). It was adopted by the Etruscans and subsequently by the
Romans Roman or Romans most often refers to: * Rome, the capital city of Italy * Ancient Rome, Roman civilization from 8th century BC to 5th century AD * Roman people, the people of ancient Rome *''Epistle to the Romans'', shortened to ''Romans'', a let ...
. Several
Latin-script alphabet A Latin-script alphabet (Latin alphabet or Roman alphabet) is an alphabet that uses letters of the Latin script. The 21-letter archaic Latin alphabet and the 23-letter classical Latin alphabet belong to the oldest of this group. The 26-lette ...
s exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet. The Latin script is the basis of the International Phonetic Alphabet, and the 26 most widespread letters are the letters contained in the
ISO basic Latin alphabet The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in various national and international standards and ...
. Latin script is the basis for the largest number of alphabets of any writing system and is the most widely adopted writing system in the world. Latin script is used as the standard method of writing for most Western and Central, and some Eastern, European languages as well as many languages in other parts of the world.


Name

The script is either called Latin script or Roman script, in reference to its origin in ancient Rome (though some of the capital letters are Greek in origin). In the context of transliteration, the term " romanization" (
British English British English (BrE, en-GB, or BE) is, according to Oxford Dictionaries, "English as used in Great Britain, as distinct from that used elsewhere". More narrowly, it can refer specifically to the English language in England, or, more broadly ...
: "romanisation") is often found. Unicode uses the term "Latin" as does the International Organization for Standardization (ISO). The numeral system is called the Roman numeral system, and the collection of the elements is known as the Roman numerals. The numbers 1, 2, 3 ... are Latin/Roman script numbers for the Hindu–Arabic numeral system.


History


Old Italic alphabet


Archaic Latin alphabet

The letter was the western form of the Greek gamma, but it was used for the sounds and alike, possibly under the influence of Etruscan, which might have lacked any voiced
plosive In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
s. Later, probably during the 3rd century BC, the letter – unneeded to write Latin properly – was replaced with the new letter , a modified with a small horizontal stroke, which took its place in the alphabet. From then on, represented the voiced plosive , while was generally reserved for the voiceless plosive . The letter was used only rarely, in a small number of words such as ''
Kalendae The calends or kalends ( la, kalendae) is the first day of every month in the Roman calendar. The English word "calendar" is derived from this word. Use The Romans called the first day of every month the ''calends'', signifying the start of a ne ...
'', often interchangeably with .


Classical Latin alphabet

After the
Roman conquest of Greece Greece in the Roman era describes the Roman conquest of Greece, as well as the period of Greek history when Greece was dominated first by the Roman Republic and then by the Roman Empire. The Roman era of Greek history began with the Corinthian ...
in the 1st century BC, Latin adopted the Greek words and (or readopted, in the latter case) to write Greek loanwords, placing them at the end of the alphabet. An attempt by the emperor
Claudius Tiberius Claudius Caesar Augustus Germanicus (; 1 August 10 BC – 13 October AD 54) was the fourth Roman emperor, ruling from AD 41 to 54. A member of the Julio-Claudian dynasty, Claudius was born to Drusus and Antonia Minor ...
to introduce three additional letters did not last. Thus it was during the classical Latin period that the Latin alphabet contained 23 letters:''Italic text''


Medieval and later developments

It was not until the Middle Ages that the letter (originally a
ligature Ligature may refer to: * Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure ** Ligature (orthodontic), used in dentistry * Ligature (music), an element of musical notation used especially in the me ...
of two s) was added to the Latin alphabet, to represent sounds from the Germanic languages which did not exist in medieval Latin, and only after the Renaissance did the convention of treating and as vowels, and and as consonants, become established. Prior to that, the former had been merely allographs of the latter. With the fragmentation of political power, the style of writing changed and varied greatly throughout the Middle Ages, even after the invention of the printing press. Early deviations from the classical forms were the uncial script, a development of the Old Roman cursive, and various so-called minuscule scripts that developed from New Roman cursive, of which the insular script developed by Irish literati & derivations of this, such as
Carolingian minuscule Carolingian minuscule or Caroline minuscule is a script which developed as a calligraphic standard in the medieval European period so that the Latin alphabet of Jerome's Vulgate Bible could be easily recognized by the literate class from one reg ...
were the most influential, introducing the lower case forms of the letters, as well as other writing conventions that have since become standard. The languages that use the Latin script generally use capital letters to begin paragraphs and sentences and proper nouns. The rules for
capitalization Capitalization (American English) or capitalisation (British English) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in lower case, in writing systems with a case distinction. The term a ...
have changed over time, and different languages have varied in their rules for capitalization. Old English, for example, was rarely written with even proper nouns capitalized, whereas Modern English writers and printers of the 17th and 18th century frequently capitalized most and sometimes all nouns – e.g. in the preamble and all of the United States Constitution – a practice still systematically used in Modern
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
.


ISO basic Latin alphabet

The use of the letters I and V for both consonants and vowels proved inconvenient as the Latin alphabet was adapted to Germanic and Romance languages. W originated as a doubled V (VV) used to represent the found in Old English as early as the 7th century. It came into common use in the later 11th century, replacing the letter wynn , which had been used for the same sound. In the Romance languages, the minuscule form of V was a rounded ''u''; from this was derived a rounded capital U for the vowel in the 16th century, while a new, pointed minuscule ''v'' was derived from V for the consonant. In the case of I, a word-final
swash Swash, or forewash in geography, is a turbulent layer of water that washes up on the beach after an incoming wave has broken. The swash action can move beach materials up and down the beach, which results in the cross-shore sediment exchange. T ...
form, ''j'', came to be used for the consonant, with the un-swashed form restricted to vowel use. Such conventions were erratic for centuries. J was introduced into English for the consonant in the 17th century (it had been rare as a vowel), but it was not universally considered a distinct letter in the alphabetic order until the 19th century. By the 1960s, it became apparent to the computer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin alphabet in their ( ISO/IEC 646) standard. To achieve widespread acceptance, this encapsulation was based on popular usage. As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
, which included in the
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet The alphabet for Modern English is a Latin-script alphabet consisting of 26 letters, each having an upper- and lower-case form. The word ''alphabet'' is a compound of the first two letters of the Greek alphabet, '' alpha'' and '' beta''. ...
. Later standards issued by the ISO, for example ISO/IEC 10646 (
Unicode Latin Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with c ...
), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.


Spread

The Latin alphabet spread, along with Latin, from the Italian Peninsula to the lands surrounding the Mediterranean Sea with the expansion of the
Roman Empire The Roman Empire ( la, Imperium Romanum ; grc-gre, Βασιλεία τῶν Ῥωμαίων, Basileía tôn Rhōmaíōn) was the post-Roman Republic, Republican period of ancient Rome. As a polity, it included large territorial holdings aro ...
. The eastern half of the Empire, including Greece, Turkey, the Levant, and Egypt, continued to use Greek as a lingua franca, but Latin was widely spoken in the western half, and as the western Romance languages evolved out of Latin, they continued to use and adapt the Latin alphabet.


Middle Ages

With the spread of Western Christianity during the Middle Ages, the Latin alphabet was gradually adopted by the peoples of Northern Europe who spoke
Celtic languages The Celtic languages ( usually , but sometimes ) are a group of related languages descended from Proto-Celtic. They form a branch of the Indo-European language family. The term "Celtic" was first used to describe this language group by Edward ...
(displacing the
Ogham Ogham (Modern Irish: ; mga, ogum, ogom, later mga, ogam, label=none ) is an Early Medieval alphabet used primarily to write the early Irish language (in the "orthodox" inscriptions, 4th to 6th centuries AD), and later the Old Irish langua ...
alphabet) or Germanic languages (displacing earlier
Runic alphabet Runes are the letter (alphabet), letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, a ...
s) or
Baltic languages The Baltic languages are a branch of the Indo-European language family spoken natively by a population of about 4.5 million people mainly in areas extending east and southeast of the Baltic Sea in Northern Europe. Together with the Slavic lan ...
, as well as by the speakers of several Uralic languages, most notably Hungarian, Finnish and
Estonian Estonian may refer to: * Something of, from, or related to Estonia, a country in the Baltic region in northern Europe * Estonians, people from Estonia, or of Estonian descent * Estonian language * Estonian cuisine * Estonian culture See also * ...
. The Latin script also came into use for writing the
West Slavic languages The West Slavic languages are a subdivision of the Slavic language group. They include Polish, Czech, Slovak, Kashubian, Upper Sorbian and Lower Sorbian. The languages have traditionally been spoken across a mostly continuous region encompas ...
and several South Slavic languages, as the people who spoke them adopted Roman Catholicism. The speakers of East Slavic languages generally adopted
Cyrillic , bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця , fam1 = Egyptian hieroglyphs , fam2 = Proto-Sinaitic , fam3 = Phoenician , fam4 = Gr ...
along with Orthodox Christianity. The Serbian language uses both scripts, with Cyrillic predominating in official communication and Latin elsewhere, as determined by the Law on Official Use of the Language and Alphabet.


Since the 16th century

As late as 1500, the Latin script was limited primarily to the languages spoken in
Western Western may refer to: Places *Western, Nebraska, a village in the US *Western, New York, a town in the US *Western Creek, Tasmania, a locality in Australia *Western Junction, Tasmania, a locality in Australia *Western world, countries that id ...
,
Northern Northern may refer to the following: Geography * North, a point in direction * Northern Europe, the northern part or region of Europe * Northern Highland, a region of Wisconsin, United States * Northern Province, Sri Lanka * Northern Range, a r ...
, and
Central Europe Central Europe is an area of Europe between Western Europe and Eastern Europe, based on a common historical, social and cultural identity. The Thirty Years' War (1618–1648) between Catholicism and Protestantism significantly shaped the ar ...
. The Orthodox Christian Slavs of
Eastern Eastern may refer to: Transportation *China Eastern Airlines, a current Chinese airline based in Shanghai * Eastern Air, former name of Zambia Skyways *Eastern Air Lines, a defunct American airline that operated from 1926 to 1991 * Eastern Air ...
and
Southeastern Europe Southeast Europe or Southeastern Europe (SEE) is a geographical subregion of Europe, consisting primarily of the Balkans. Sovereign states and territories that are included in the region are Albania, Bosnia and Herzegovina, Bulgaria, Croatia (a ...
mostly used
Cyrillic , bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця , fam1 = Egyptian hieroglyphs , fam2 = Proto-Sinaitic , fam3 = Phoenician , fam4 = Gr ...
, and the Greek alphabet was in use by Greek-speakers around the eastern Mediterranean. The
Arabic script The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used writing system in the world by number of countries using it or a script directly derived from it, and the ...
was widespread within Islam, both among
Arab The Arabs (singular: Arab; singular ar, عَرَبِيٌّ, DIN 31635: , , plural ar, عَرَب, DIN 31635: , Arabic pronunciation: ), also known as the Arab people, are an ethnic group mainly inhabiting the Arab world in Western Asia, No ...
s and non-Arab nations like the Iranians, Indonesians, Malays, and Turkic peoples. Most of the rest of Asia used a variety of Brahmic alphabets or the Chinese script. Through European colonization the Latin script has spread to the
Americas The Americas, which are sometimes collectively called America, are a landmass comprising the totality of North and South America. The Americas make up most of the land in Earth's Western Hemisphere and comprise the New World. Along with t ...
, Oceania, parts of Asia, Africa, and the Pacific, in forms based on the
Spanish Spanish might refer to: * Items from or related to Spain: **Spaniards are a nation and ethnic group indigenous to Spain **Spanish language, spoken in Spain and many Latin American countries **Spanish cuisine Other places * Spanish, Ontario, Can ...
,
Portuguese Portuguese may refer to: * anything of, from, or related to the country and nation of Portugal ** Portuguese cuisine, traditional foods ** Portuguese language, a Romance language *** Portuguese dialects, variants of the Portuguese language ** Portu ...
, English,
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
,
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
and Dutch alphabets. It is used for many Austronesian languages, including the languages of the Philippines and the Malaysian and Indonesian languages, replacing earlier Arabic and indigenous Brahmic alphabets. Latin letters served as the basis for the forms of the
Cherokee syllabary The Cherokee syllabary is a syllabary invented by Sequoyah in the late 1810s and early 1820s to write the Cherokee language. His creation of the syllabary is particularly noteworthy as he was illiterate until the creation of his syllabary. He f ...
developed by Sequoyah; however, the sound values are completely different. Under Portuguese missionary influence, a Latin alphabet was devised for the Vietnamese language, which had previously used
Chinese characters Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji ...
. The Latin-based alphabet replaced the Chinese characters in administration in the 19th century with French rule.


Since 19th century

In the late 19th century, the Romanians switched to the Latin alphabet, which they had used until the
Council of Florence The Council of Florence is the seventeenth ecumenical council recognized by the Catholic Church, held between 1431 and 1449. It was convoked as the Council of Basel by Pope Martin V shortly before his death in February 1431 and took place in ...
in 1439, primarily because
Romanian Romanian may refer to: *anything of, from, or related to the country and nation of Romania ** Romanians, an ethnic group **Romanian language, a Romance language ***Romanian dialects, variants of the Romanian language **Romanian cuisine, traditiona ...
is a Romance language. The Romanians were predominantly
Orthodox Christian Orthodoxy (from Greek: ) is adherence to correct or accepted creeds, especially in religion. Orthodoxy within Christianity refers to acceptance of the doctrines defined by various creeds and ecumenical councils in Antiquity, but different Church ...
s, and their Church, increasingly influenced by Russia after the fall of Byzantine Greek Constantinople in 1453 and capture of the Greek Orthodox Patriarch, had begun promoting the Slavic
Cyrillic , bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця , fam1 = Egyptian hieroglyphs , fam2 = Proto-Sinaitic , fam3 = Phoenician , fam4 = Gr ...
.


Since 20th century

In 1928, as part of Mustafa Kemal Atatürk's reforms, the new
Republic of Turkey Turkey ( tr, Türkiye ), officially the Republic of Türkiye ( tr, Türkiye Cumhuriyeti, links=no ), is a transcontinental country located mainly on the Anatolian Peninsula in Western Asia, with a small portion on the Balkan Peninsula in ...
adopted a Latin alphabet for the Turkish language, replacing a modified Arabic alphabet. Most of the Turkic-speaking peoples of the former USSR, including Tatars, Bashkirs,
Azeri Azerbaijanis (; az, Azərbaycanlılar, ), Azeris ( az, Azərilər, ), or Azerbaijani Turks ( az, Azərbaycan Türkləri, ) are a Turkic people living mainly in northwestern Iran and the Republic of Azerbaijan. They are the second-most nume ...
, Kazakh,
Kyrgyz Kyrgyz, Kirghiz or Kyrgyzstani may refer to: * Someone or something related to Kyrgyzstan *Kyrgyz people *Kyrgyz national games *Kyrgyz language *Kyrgyz culture *Kyrgyz cuisine *Yenisei Kirghiz *The Fuyü Gïrgïs language in Northeastern China ...
and others, had their writing systems replaced by the Latin-based Uniform Turkic alphabet in the 1930s; but, in the 1940s, all were replaced by Cyrillic. After the
collapse of the Soviet Union The dissolution of the Soviet Union, also negatively connoted as rus, Разва́л Сове́тского Сою́за, r=Razvál Sovétskogo Soyúza, ''Ruining of the Soviet Union''. was the process of internal disintegration within the Sov ...
in 1991, three of the newly independent Turkic-speaking republics,
Azerbaijan Azerbaijan (, ; az, Azərbaycan ), officially the Republic of Azerbaijan, , also sometimes officially called the Azerbaijan Republic is a transcontinental country located at the boundary of Eastern Europe and Western Asia. It is a part of t ...
, Uzbekistan, Turkmenistan, as well as Romanian-speaking
Moldova Moldova ( , ; ), officially the Republic of Moldova ( ro, Republica Moldova), is a Landlocked country, landlocked country in Eastern Europe. It is bordered by Romania to the west and Ukraine to the north, east, and south. The List of states ...
, officially adopted Latin alphabets for their languages. Kyrgyzstan,
Iranian Iranian may refer to: * Iran, a sovereign state * Iranian peoples, the speakers of the Iranian languages. The term Iranic peoples is also used for this term to distinguish the pan ethnic term from Iranian, used for the people of Iran * Iranian lan ...
-speaking
Tajikistan Tajikistan (, ; tg, Тоҷикистон, Tojikiston; russian: Таджикистан, Tadzhikistan), officially the Republic of Tajikistan ( tg, Ҷумҳурии Тоҷикистон, Jumhurii Tojikiston), is a landlocked country in Centr ...
, and the breakaway region of Transnistria kept the Cyrillic alphabet, chiefly due to their close ties with Russia. In the 1930s and 1940s, the majority of Kurds replaced the Arabic script with two Latin alphabets. Although only the official Kurdish government uses an Arabic alphabet for public documents, the Latin Kurdish alphabet remains widely used throughout the region by the majority of Kurdish-speakers. In 1957, the People's Republic of China introduced a script reform to the
Zhuang language The Zhuang languages (; autonym: , pre-1982: , Sawndip: 話僮, from ''vah'', 'language' and ''Cuengh'', 'Zhuang'; ) are any of more than a dozen Tai languages spoken by the Zhuang people of Southern China in the province of Guangxi and adja ...
, changing its orthography from Sawndip, a writing system based on Chinese, to a Latin script alphabet that used a mixture of Latin, Cyrillic, and IPA letters to represent both the phonemes and tones of the Zhuang language, without the use of diacritics. In 1982 this was further standardised to use only Latin script letters. With the collapse of the Derg and subsequent end of decades of
Amharic Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
assimilation in 1991, various ethnic groups in Ethiopia dropped the
Geʽez script Geʽez ( gez, ግዕዝ, Gəʿəz, ) is a script used as an abugida (alphasyllabary) for several Afro-Asiatic and Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an ''abjad'' (consonant-only alphabet) and was first used to wr ...
, which was deemed unsuitable for languages outside of the Semitic branch. In the following years the Kafa, Oromo,
Sidama The Sidama ( am, ሲዳማ) are an ethnic group traditionally inhabiting the Sidama Region, formerly part of the Southern Nations, Nationalities, and Peoples' Region of Ethiopia. On 23 November 2019, the Sidama Zone became the 10th regional sta ...
, Somali, and Wolaitta languages switched to Latin while there is continued debate on whether to follow suit for the Hadiyya and Kambaata languages.


21st century

On 15 September 1999 the authorities of
Tatarstan The Republic of Tatarstan (russian: Республика Татарстан, Respublika Tatarstan, p=rʲɪsˈpublʲɪkə tətɐrˈstan; tt-Cyrl, Татарстан Республикасы), or simply Tatarstan (russian: Татарстан, tt ...
, Russia, passed a law to make the Latin script a co-official writing system alongside Cyrillic for the
Tatar language Tatar ( or ) is a Turkic language spoken by Tatars mainly located in modern Tatarstan (European Russia), as well as Siberia. It should not be confused with Crimean Tatar or Siberian Tatar, which are closely related but belong to different ...
by 2011. A year later, however, the Russian government overruled the law and banned Latinization on its territory. In 2015, the
government of Kazakhstan The Government of the Republic of Kazakhstan ( kk, Қазақстан Республикасының Үкіметі, tr, ''Qazaqstan Respublikasynyñ Ükımetı'') oversees a presidential republic. The President of Kazakhstan, currently Kassym ...
announced that a Kazakh Latin alphabet would replace the Kazakh Cyrillic alphabet as the official writing system for the
Kazakh language The Kazakh or simply Qazaq (Latin: or , Cyrillic: or , Arabic Script: or , , ) is a Turkic language of the Kipchak branch spoken in Central Asia by Kazakhs. It is closely related to Nogai, Kyrgyz and Karakalpak. It is the official languag ...
by 2025. There are also talks about switching from the Cyrillic script to Latin in Ukraine, Kyrgyzstan, and Mongolia. Mongolia, however, has since opted to revive the Mongolian script instead of switching to Latin. In October 2019, the organization National Representational Organization for Inuit in Canada (ITK) announced that they will introduce a unified writing system for the
Inuit languages The Inuit languages are a closely related group of indigenous American languages traditionally spoken across the North American Arctic and adjacent subarctic, reaching farthest south in Labrador. The related Yupik languages (spoken in wester ...
in the country. The writing system is based on the Latin alphabet and is modeled after the one used in the Greenlandic language. On 12 February 2021 the government of Uzbekistan announced it will finalize the transition from Cyrillic to Latin for the
Uzbek language Uzbek (''Oʻzbekcha, Oʻzbek tili or Ўзбекча, Ўзбек тили''), formerly known as ''Turki'' or ''Western Turki'', is a Turkic language spoken by Uzbeks. It is the official, and national language of Uzbekistan. Uzbek is spoken as e ...
by 2023. Plans to switch to Latin originally began in 1993 but subsequently stalled and Cyrillic remained in widespread use. At present the
Crimean Tatar language Crimean Tatar () also called Crimean (), is a Kipchak Turkic language spoken in Crimea and the Crimean Tatar diasporas of Uzbekistan, Turkey, Romania, and Bulgaria, as well as small communities in the United States and Canada. It should ...
uses both Cyrillic and Latin. The use of Latin was originally approved by Crimean Tatar representatives after the Soviet Union's collapse but was never implemented by the regional government. After Russia's
annexation Annexation (Latin ''ad'', to, and ''nexus'', joining), in international law, is the forcible acquisition of one state's territory by another state, usually following military occupation of the territory. It is generally held to be an illegal act ...
of Crimea in 2014 the Latin script was dropped entirely. Nevertheless Crimean Tatars outside of Crimea continue to use Latin and on 22 October 2021 the government of Ukraine approved a proposal endorsed by the
Mejlis of the Crimean Tatar People The Mejlis of the Crimean Tatar People ( crh, Къырымтатар Миллий Меджлиси - ''Qırımtatar Milliy Meclisi'') is the single highest executive-representative body of the Crimean Tatars in period between sessions of the ...
to switch the Crimean Tatar language to Latin by 2025. In July 2020, 2.6 billion people (36% of the world population) use the Latin alphabet.


International standards

By the 1960s, it became apparent to the computer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin alphabet in their ( ISO/IEC 646) standard. To achieve widespread acceptance, this encapsulation was based on popular usage. As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
, which included in the
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet The alphabet for Modern English is a Latin-script alphabet consisting of 26 letters, each having an upper- and lower-case form. The word ''alphabet'' is a compound of the first two letters of the Greek alphabet, '' alpha'' and '' beta''. ...
. Later standards issued by the ISO, for example ISO/IEC 10646 (
Unicode Latin Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with c ...
), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.


National standards

The DIN standard
DIN 91379 The DIN standard DIN 91379: "Characters and defined character sequences in Unicode for the electronic processing of names and data exchange in Europe, with CD-ROM" defines a normative subset of Unicode Latin characters, sequences of base characte ...
specifies a subset of Unicode letters, special characters, and sequences of letters and diacritic signs to allow the correct representation of names and to simplify data exchange in Europe. This specification supports all official languages of European Union countries (thus also Greek and
Cyrillic , bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця , fam1 = Egyptian hieroglyphs , fam2 = Proto-Sinaitic , fam3 = Phoenician , fam4 = Gr ...
for Bulgarian) as well as the official languages of Iceland, Liechtenstein, Norway, and Switzerland, and also the German minority languages. To allow the transliteration of names in other writing systems to the Latin script according to the relevant ISO standards all necessary combinations of base letters and diacritic signs are provided. Efforts are being made to further develop it into a European CEN standard.


As used by various languages

In the course of its use, the Latin alphabet was adapted for use in new languages, sometimes representing
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west ...
s not found in languages that were already written with the Roman characters. To represent these new sounds, extensions were therefore created, be it by adding diacritics to existing
letters Letter, letters, or literature may refer to: Characters typeface * Letter (alphabet), a character representing one or more of the sounds used in speech; any of the symbols of an alphabet. * Letterform, the graphic form of a letter of the alpha ...
, by joining multiple letters together to make
ligatures Ligature may refer to: * Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure ** Ligature (orthodontic), used in dentistry * Ligature (music), an element of musical notation used especially in the me ...
, by creating completely new forms, or by assigning a special function to pairs or triplets of letters. These new forms are given a place in the alphabet by defining an
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is ...
or collation sequence, which can vary with the particular language.


Letters

Some examples of new letters to the standard Latin alphabet are the
Runic Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
letters wynn and thorn , and the letter eth , which were added to the alphabet of Old English. Another Irish letter, the insular ''g'', developed into
yogh The letter yogh (ȝogh) ( ; Scots: ; Middle English: ) was used in Middle English and Older Scots, representing ''y'' () and various velar phonemes. It was derived from the Insular form of the letter ''g''. In Middle English writing, tailed z ...
, used in Middle English. Wynn was later replaced with the new letter , eth and thorn with , and yogh with . Although the four are no longer part of the English or Irish alphabets, eth and thorn are still used in the modern Icelandic alphabet, while eth is also used by the
Faroese alphabet Faroese orthography is the method employed to write the Faroese language, using a 29-letter Latin alphabet. Alphabet The Faroese alphabet consists of 29 letters derived from the Latin script: * Eth (Faroese ') never appears at the beginning ...
. Some West, Central and Southern African languages use a few additional letters that have sound values similar to those of their equivalents in the IPA. For example,
Adangme The Dangbe language, also ''Dangbe'' or ''Adaŋgbi'', is a Kwa language spoken in south-eastern Ghana by the Dangbe People ''(Dangbeli)''. The Dangbeli are part of the larger Ga-Dangbe ethnic group. Klogbi is a variant, spoken by the Kloli ( ...
uses the letters and , and Ga uses , and .
Hausa Hausa may refer to: * Hausa people, an ethnic group of West Africa * Hausa language, spoken in West Africa * Hausa Kingdoms, a historical collection of Hausa city-states * Hausa (horse) or Dongola horse, an African breed of riding horse See also ...
uses and for
implosives Implosive consonants are a group of stop consonants (and possibly also some affricates) with a mixed glottalic ingressive and pulmonic egressive airstream mechanism.''Phonetics for communication disorders.'' Martin J. Ball and Nicole Müller. Ro ...
, and for an ejective.
Africanists African studies is the study of Africa, especially the continent's cultures and societies (as opposed to its geology, geography, zoology, etc.). The field includes the study of Africa's history (pre-colonial, colonial, post-colonial), demograph ...
have standardized these into the
African reference alphabet An African reference alphabet was first proposed in 1978 by a UNESCO-organized conference held in Niamey, Niger, and the proposed alphabet was revised in 1982. The conference recommended the use of single letters for a sound (that is, a phonem ...
. Dotted and dotless I — and — are two forms of the letter I used by the Turkish,
Azerbaijani Azerbaijani may refer to: * Something of, or related to Azerbaijan * Azerbaijanis * Azerbaijani language See also * Azerbaijan (disambiguation) * Azeri (disambiguation) * Azerbaijani cuisine * Culture of Azerbaijan The culture of Azerbaijan ...
, and Kazakh alphabets. The Azerbaijani language also has , which represents the near-open front unrounded vowel.


Multigraphs

A digraph is a pair of letters used to write one sound or a combination of sounds that does not correspond to the written letters in sequence. Examples are , , , , , in English, and , , and in Dutch. In Dutch the is capitalized as or the
ligature Ligature may refer to: * Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure ** Ligature (orthodontic), used in dentistry * Ligature (music), an element of musical notation used especially in the me ...
, but never as , and it often takes the appearance of a ligature very similar to the letter in handwriting. A trigraph is made up of three letters, like the
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
, the Breton or the Milanese . In the orthographies of some languages, digraphs and trigraphs are regarded as independent letters of the alphabet in their own right. The capitalization of digraphs and trigraphs is language-dependent, as only the first letter may be capitalized, or all component letters simultaneously (even for words written in title case, where letters after the digraph or trigraph are left in lowercase).


Ligatures

A
ligature Ligature may refer to: * Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure ** Ligature (orthodontic), used in dentistry * Ligature (music), an element of musical notation used especially in the me ...
is a fusion of two or more ordinary letters into a new
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
or character. Examples are (from , called "ash"), (from , sometimes called "oethel"), the
abbreviation An abbreviation (from Latin ''brevis'', meaning ''short'') is a shortened form of a word or phrase, by any method. It may consist of a group of letters or words taken from the full version of the word or phrase; for example, the word ''abbrevia ...
(from la, et, , and, called "ampersand"), and (from or , the archaic medial form of , followed by an or , called "sharp S" or "eszett").


Diacritics

A diacritic, in some cases also called an accent, is a small symbol that can appear above or below a letter, or in some other position, such as the umlaut sign used in the German characters , , or the Romanian characters ă, â, î, ș, ț. Its main function is to change the phonetic value of the letter to which it is added, but it may also modify the pronunciation of a whole syllable or word, indicate the start of a new syllable, or distinguish between homographs such as the Dutch words ''
een Een ːnis a village in the Netherlands. It is part of the Noordenveld municipality in Drenthe. History Een is an ''esdorp'' which developed in the middle ages on the higher grounds. The communal pasture is triangular. The village developed dur ...
'' () meaning "a" or "an", and '' één'', () meaning "one". As with the pronunciation of letters, the effect of diacritics is language-dependent. English is the only major modern European language that requires no diacritics for its native vocabulary. Historically, in formal writing, a diaeresis was sometimes used to indicate the start of a new syllable within a sequence of letters that could otherwise be misinterpreted as being a single vowel (e.g., “coöperative”, “reëlect”), but modern writing styles either omit such marks or use a hyphen to indicate a syllable break (e.g. “cooperative”, “re-elect”).


Collation

Some modified letters, such as the symbols , , and , may be regarded as new individual letters in themselves, and assigned a specific place in the alphabet for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fil ...
purposes, separate from that of the letter on which they are based, as is done in Swedish. In other cases, such as with , , in German, this is not done; letter-diacritic combinations being identified with their base letter. The same applies to digraphs and trigraphs. Different diacritics may be treated differently in collation within a single language. For example, in Spanish, the character is considered a letter, and sorted between and in dictionaries, but the accented vowels , , , , , are not separated from the unaccented vowels , , , , .


Capitalization

The languages that use the Latin script today generally use capital letters to begin paragraphs and sentences and proper nouns. The rules for
capitalization Capitalization (American English) or capitalisation (British English) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in lower case, in writing systems with a case distinction. The term a ...
have changed over time, and different languages have varied in their rules for capitalization. Old English, for example, was rarely written with even proper nouns capitalized; whereas Modern English of the 18th century had frequently all nouns capitalized, in the same way that Modern
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
is written today, e.g. german: Alle Schwestern der alten Stadt hatten die Vögel gesehen, , All of the sisters of the old city had seen the birds.


Romanization

Words from languages natively written with other
scripts Script may refer to: Writing systems * Script, a distinctive writing system, based on a repertoire of specific elements or symbols, or that repertoire * Script (styles of handwriting) ** Script typeface, a typeface with characteristics of handw ...
, such as
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walte ...
or
Chinese Chinese can refer to: * Something related to China * Chinese people, people of Chinese nationality, citizenship, and/or ethnicity **''Zhonghua minzu'', the supra-ethnic concept of the Chinese nation ** List of ethnic groups in China, people of v ...
, are usually transliterated or transcribed when embedded in Latin-script text or in multilingual international communication, a process termed ''Romanization''. Whilst the Romanization of such languages is used mostly at unofficial levels, it has been especially prominent in computer messaging where only the limited seven-bit
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
code is available on older systems. However, with the introduction of Unicode, Romanization is now becoming less necessary. Note that keyboards used to enter such text may still restrict users to Romanized text, as only ASCII or Latin-alphabet characters may be available.


See also

* List of languages by writing system#Latin script *
Western Latin character sets (computing) Several binary representations of 8-bit character sets for common Western European languages are compared in this article. These encodings were designed for representation of Italian, Spanish, Portuguese, French, German, Dutch, English, Dani ...
* European Latin Unicode subset (DIN 91379) *
Latin letters used in mathematics Many letters of the Latin alphabet, both capital and small, are used in mathematics, science, and engineering to denote by convention specific or abstracted constants, variables of a certain type, units, multipliers, or physical entities. Cer ...
* Latin omega


Notes


References


Citations


Sources

*


Further reading

* Boyle, Leonard E. 1976. "Optimist and recensionist: 'Common errors' or 'common variations.'" In ''Latin script and letters A.D. 400–900: Festschrift presented to Ludwig Bieler on the occasion of his 70th birthday.'' Edited by John J. O'Meara and Bernd Naumann, 264–74. Leiden, The Netherlands: Brill. * Morison, Stanley. 1972. ''Politics and script: Aspects of authority and freedom in the development of Graeco-Latin script from the sixth century B.C. to the twentieth century A.D.'' Oxford: Clarendon.


External links


Unicode collation chart
��Latin letters sorted by shape
Diacritics ProjectAll you need to design a font with correct accents
{{Authority control History of the Roman Empire Officially used writing systems of India