The Latin script, also known as Roman script, is an
alphabetic
An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
writing system
A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form ...
based on the letters of the
classical Latin alphabet
The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and the o ...
, derived from a form of the
Greek alphabet
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BCE. It is derived from the earlier Phoenician alphabet, and was the earliest known alphabetic script to have distinct letters for vowels as we ...
which was in use in the ancient
Greek
Greek may refer to:
Greece
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group.
*Greek language, a branch of the Indo-European language family.
**Proto-Greek language, the assumed last common ancestor ...
city of
Cumae
Cumae ( grc, Κύμη, (Kumē) or or ; it, Cuma) was the first ancient Greek colony on the mainland of Italy, founded by settlers from Euboea in the 8th century BC and soon becoming one of the strongest colonies. It later became a rich Ro ...
, in southern
Italy
Italy ( it, Italia ), officially the Italian Republic, ) or the Republic of Italy, is a country in Southern Europe. It is located in the middle of the Mediterranean Sea, and its territory largely coincides with the homonymous geographical re ...
(
Magna Grecia
Magna Graecia (, ; , , grc, Μεγάλη Ἑλλάς, ', it, Magna Grecia) was the name given by the Romans to the coastal areas of Southern Italy in the present-day Italian regions of Calabria, Apulia, Basilicata, Campania and Sicily; these re ...
). It was adopted by the
Etruscans
The Etruscan civilization () was developed by a people of Etruria in ancient Italy with a common language and culture who formed a federation of city-states. After conquering adjacent lands, its territory covered, at its greatest extent, rou ...
and subsequently by the
Romans
Roman or Romans most often refers to:
*Rome, the capital city of Italy
* Ancient Rome, Roman civilization from 8th century BC to 5th century AD
*Roman people, the people of ancient Rome
*''Epistle to the Romans'', shortened to ''Romans'', a lette ...
. Several
Latin-script alphabet
A Latin-script alphabet (Latin alphabet or Roman alphabet) is an alphabet that uses letters of the Latin script. The 21-letter archaic Latin alphabet and the 23-letter classical Latin alphabet belong to the oldest of this group. The 26-letter ...
s exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
The Latin script is the basis of the
International Phonetic Alphabet
The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic transcription, phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standa ...
, and the 26 most widespread letters are the letters contained in the
ISO basic Latin alphabet
The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in various national and international standards and u ...
.
Latin script is the basis for the largest number of alphabets of any
writing system
A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form ...
and is the
most widely adopted writing system in the world. Latin script is used as the standard method of writing for most Western and Central, and some Eastern, European languages as well as many languages in other parts of the world.
Name
The script is either called Latin script or Roman script, in reference to its origin in
ancient Rome
In modern historiography, ancient Rome refers to Roman civilisation from the founding of the city of Rome in the 8th century BC to the collapse of the Western Roman Empire in the 5th century AD. It encompasses the Roman Kingdom (753–509 B ...
(though some of the capital letters are Greek in origin). In the context of
transliteration
Transliteration is a type of conversion of a text from one writing system, script to another that involves swapping Letter (alphabet), letters (thus ''wikt:trans-#Prefix, trans-'' + ''wikt:littera#Latin, liter-'') in predictable ways, such as ...
, the term "
romanization
Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and ...
" (
British English
British English (BrE, en-GB, or BE) is, according to Lexico, Oxford Dictionaries, "English language, English as used in Great Britain, as distinct from that used elsewhere". More narrowly, it can refer specifically to the English language in ...
: "romanisation") is often found.
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
uses the term "Latin" as does the
International Organization for Standardization
The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in Ar ...
(ISO).
The numeral system is called the Roman numeral system, and the collection of the elements is known as the
Roman numerals
Roman numerals are a numeral system that originated in ancient Rome and remained the usual way of writing numbers throughout Europe well into the Late Middle Ages. Numbers are written with combinations of letters from the Latin alphabet, eac ...
. The numbers 1, 2, 3 ... are Latin/Roman script numbers for the
Hindu–Arabic numeral system
The Hindu–Arabic numeral system or Indo-Arabic numeral system Audun HolmeGeometry: Our Cultural Heritage 2000 (also called the Hindu numeral system or Arabic numeral system) is a positional decimal numeral system, and is the most common syste ...
.
History
Old Italic alphabet
Archaic Latin alphabet
The letter was the western form of the Greek
gamma
Gamma (uppercase , lowercase ; ''gámma'') is the third letter of the Greek alphabet. In the system of Greek numerals it has a value of 3. In Ancient Greek, the letter gamma represented a voiced velar stop . In Modern Greek, this letter re ...
, but it was used for the sounds and alike, possibly under the influence of
Etruscan __NOTOC__
Etruscan may refer to:
Ancient civilization
*The Etruscan language, an extinct language in ancient Italy
*Something derived from or related to the Etruscan civilization
**Etruscan architecture
**Etruscan art
**Etruscan cities
** Etrusca ...
, which might have lacked any voiced
plosive
In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases.
The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lips ...
s. Later, probably during the 3rd century BC, the letter – unneeded to write Latin properly – was replaced with the new letter , a modified with a small horizontal stroke, which took its place in the alphabet. From then on, represented the
voiced
Voice or voicing is a term used in phonetics and phonology to characterize speech sounds (usually consonants). Speech sounds can be described as either voiceless (otherwise known as ''unvoiced'') or voiced.
The term, however, is used to refer ...
plosive , while was generally reserved for the voiceless plosive . The letter was used only rarely, in a small number of words such as ''
Kalendae
The calends or kalends ( la, kalendae) is the first day of every month in the Roman calendar. The English word " calendar" is derived from this word.
Use
The Romans called the first day of every month the ''calends'', signifying the start of a n ...
'', often interchangeably with .
Classical Latin alphabet
After the
Roman conquest of Greece
Greece in the Roman era describes the Roman conquest of Greece, as well as the period of Greek history when Greece was dominated first by the Roman Republic and then by the Roman Empire.
The Roman era of Greek history began with the Corinthian ...
in the 1st century BC, Latin adopted the Greek words
and (or readopted, in the latter case) to write
Greek
Greek may refer to:
Greece
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group.
*Greek language, a branch of the Indo-European language family.
**Proto-Greek language, the assumed last common ancestor ...
loanwords, placing them at the end of the alphabet. An attempt by the emperor
Claudius
Tiberius Claudius Caesar Augustus Germanicus (; 1 August 10 BC – 13 October AD 54) was the fourth Roman emperor, ruling from AD 41 to 54. A member of the Julio-Claudian dynasty, Claudius was born to Nero Claudius Drusus, Drusu ...
to introduce three
additional letters did not last. Thus it was during the
classical Latin
Classical Latin is the form of Literary Latin recognized as a literary standard by writers of the late Roman Republic and early Roman Empire. It was used from 75 BC to the 3rd century AD, when it developed into Late Latin. In some later periods ...
period that the Latin alphabet contained 23 letters:''Italic text''
Medieval and later developments
It was not until the
Middle Ages
In the history of Europe, the Middle Ages or medieval period lasted approximately from the late 5th to the late 15th centuries, similar to the post-classical period of global history. It began with the fall of the Western Roman Empire a ...
that the letter (originally a
ligature
Ligature may refer to:
* Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure
** Ligature (orthodontic), used in dentistry
* Ligature (music), an element of musical notation used especially in the me ...
of two s) was added to the Latin alphabet, to represent sounds from the
Germanic languages
The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania and Southern Africa. The most widely spoken Germanic language, Engli ...
which did not exist in medieval Latin, and only after the
Renaissance
The Renaissance ( , ) , from , with the same meanings. is a period in European history marking the transition from the Middle Ages to modernity and covering the 15th and 16th centuries, characterized by an effort to revive and surpass ideas ...
did the convention of treating and as
vowel
A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s, and and as
consonant
In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wit ...
s, become established. Prior to that, the former had been merely
allographs of the latter.
With the fragmentation of political power, the
style of writing changed and varied greatly throughout the Middle Ages, even after the invention of the
printing press
A printing press is a mechanical device for applying pressure to an inked surface resting upon a printing, print medium (such as paper or cloth), thereby transferring the ink. It marked a dramatic improvement on earlier printing methods in wh ...
. Early deviations from the classical forms were the
uncial script
Uncial is a majuscule Glaister, Geoffrey Ashall. (1996) ''Encyclopedia of the Book''. 2nd edn. New Castle, DE, and London: Oak Knoll Press & The British Library, p. 494. script (written entirely in capital letters) commonly used from the 4th t ...
, a development of the
Old Roman cursive
Roman cursive (or Latin cursive) is a form of handwriting (or a script) used in ancient Rome and to some extent into the Middle Ages. It is customarily divided into old (or ancient) cursive and new cursive.
Old Roman cursive
Old Roman cursiv ...
, and various so-called minuscule scripts that developed from
New Roman cursive
Roman cursive (or Latin cursive) is a form of handwriting (or a script) used in ancient Rome and to some extent into the Middle Ages. It is customarily divided into old (or ancient) cursive and new cursive.
Old Roman cursive
Old Roman curs ...
, of which the
insular script
Insular script was a medieval script system originating from Ireland that spread to Anglo-Saxon England and continental Europe under the influence of Irish Christianity. Irish missionaries took the script to continental Europe, where they found ...
developed by Irish literati & derivations of this, such as
Carolingian minuscule
Carolingian minuscule or Caroline minuscule is a script which developed as a calligraphic standard in the medieval European period so that the Latin alphabet of Jerome's Vulgate Bible could be easily recognized by the literate class from one reg ...
were the most influential, introducing the
lower case
Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
forms of the letters, as well as other writing conventions that have since become standard.
The languages that use the Latin script generally use
capital letters
Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
to begin paragraphs and sentences and
proper nouns
A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', ''Sarah (given name), Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a Class (philo ...
. The rules for
capitalization
Capitalization (American English) or capitalisation (British English) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in lower case, in writing systems with a case distinction. The term a ...
have changed over time, and different languages have varied in their rules for capitalization.
Old English
Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
, for example, was rarely written with even proper nouns capitalized, whereas
Modern English writers and printers of the 17th and 18th century frequently capitalized most and sometimes all nouns – e.g. in the preamble and all of the United States Constitution – a practice still systematically used in Modern
German
German(s) may refer to:
* Germany (of or related to)
**Germania (historical use)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizens of Germany, see also German nationality law
**Ger ...
.
ISO basic Latin alphabet
The use of the letters I and V for both consonants and vowels proved inconvenient as the Latin alphabet was adapted to Germanic and Romance languages.
W originated as a doubled
V (VV) used to represent the found in
Old English
Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
as early as the 7th century. It came into common use in the later 11th century, replacing the letter
wynn
Wynn or wyn (; also spelled wen, ƿynn, and ƿen) is a letter of the Old English alphabet, where it is used to represent the sound .
History The letter "W"
While the earliest Old English texts represent this phoneme with the digraph , ...
, which had been used for the same sound. In the Romance languages, the minuscule form of V was a rounded ''u''; from this was derived a rounded capital U for the vowel in the 16th century, while a new, pointed minuscule ''v'' was derived from V for the consonant. In the case of I, a word-final
swash
Swash, or forewash in geography, is a turbulent layer of water that washes up on the beach after an incoming wave has broken. The swash action can move beach materials up and down the beach, which results in the cross-shore sediment exchange. T ...
form, ''j'', came to be used for the consonant, with the un-swashed form restricted to vowel use. Such conventions were erratic for centuries. J was introduced into English for the consonant in the 17th century (it had been rare as a vowel), but it was not universally considered a distinct letter in the alphabetic order until the 19th century.
By the 1960s, it became apparent to the computer and
telecommunication
Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that fe ...
s industries in the
First World
The concept of First World originated during the Cold War and comprised countries that were under the influence of the United States and the rest of NATO and opposed the Soviet Union and/or communism during the Cold War. Since the collapse of ...
that a non-proprietary method of encoding characters was needed. The
International Organization for Standardization
The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in Ar ...
(ISO) encapsulated the Latin alphabet in their (
ISO/IEC 646
ISO/IEC 646 is a set of ISO/IEC standards, described as ''Information technology — ISO 7-bit coded character set for information interchange'' and developed in cooperation with ASCII at least since 1964. Since its first edition in 1 ...
) standard. To achieve widespread acceptance, this encapsulation was based on popular usage. As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII
ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
, which included in the
character set
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet
The alphabet for Modern English is a Latin-script alphabet consisting of 26 letters, each having an upper- and lower-case form. The word ''alphabet'' is a compound of the first two letters of the Greek alphabet, '' alpha'' and '' beta''. ...
. Later standards issued by the ISO, for example
ISO/IEC 10646
ISO/IEC JTC 1, entitled "Information technology", is a joint technical committee (JTC) of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). Its purpose is to develop, maintain and pr ...
(
Unicode Latin
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with co ...
), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
Spread
The Latin alphabet spread, along with
Latin
Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, from the
Italian Peninsula to the lands surrounding the
Mediterranean Sea
The Mediterranean Sea is a sea connected to the Atlantic Ocean, surrounded by the Mediterranean Basin and almost completely enclosed by land: on the north by Western and Southern Europe and Anatolia, on the south by North Africa, and on the ea ...
with the expansion of the
Roman Empire
The Roman Empire ( la, Imperium Romanum ; grc-gre, Βασιλεία τῶν Ῥωμαίων, Basileía tôn Rhōmaíōn) was the post-Republican period of ancient Rome. As a polity, it included large territorial holdings around the Mediterr ...
. The eastern half of the Empire, including Greece, Turkey, the
Levant
The Levant () is an approximate historical geographical term referring to a large area in the Eastern Mediterranean region of Western Asia. In its narrowest sense, which is in use today in archaeology and other cultural contexts, it is eq ...
, and Egypt, continued to use
Greek
Greek may refer to:
Greece
Anything of, from, or related to Greece, a country in Southern Europe:
*Greeks, an ethnic group.
*Greek language, a branch of the Indo-European language family.
**Proto-Greek language, the assumed last common ancestor ...
as a
lingua franca
A lingua franca (; ; for plurals see ), also known as a bridge language, common language, trade language, auxiliary language, vehicular language, or link language, is a language systematically used to make communication possible between groups ...
, but Latin was widely spoken in the western half, and as the western
Romance languages
The Romance languages, sometimes referred to as Latin languages or Neo-Latin languages, are the various modern languages that evolved from Vulgar Latin. They are the only extant subgroup of the Italic languages in the Indo-European language fam ...
evolved out of Latin, they continued to use and adapt the Latin alphabet.
Middle Ages
With the spread of
Western Christianity
Western Christianity is one of two sub-divisions of Christianity ( Eastern Christianity being the other). Western Christianity is composed of the Latin Church and Western Protestantism, together with their offshoots such as the Old Catholic ...
during the
Middle Ages
In the history of Europe, the Middle Ages or medieval period lasted approximately from the late 5th to the late 15th centuries, similar to the post-classical period of global history. It began with the fall of the Western Roman Empire a ...
, the Latin alphabet was gradually adopted by the peoples of
Northern Europe
The northern region of Europe has several definitions. A restrictive definition may describe Northern Europe as being roughly north of the southern coast of the Baltic Sea, which is about 54th parallel north, 54°N, or may be based on other g ...
who spoke
Celtic languages
The Celtic languages ( usually , but sometimes ) are a group of related languages descended from Proto-Celtic. They form a branch of the Indo-European language family. The term "Celtic" was first used to describe this language group by Edward ...
(displacing the
Ogham
Ogham (Modern Irish: ; mga, ogum, ogom, later mga, ogam, label=none ) is an Early Medieval alphabet used primarily to write the early Irish language (in the "orthodox" inscriptions, 4th to 6th centuries AD), and later the Old Irish langua ...
alphabet) or
Germanic languages
The Germanic languages are a branch of the Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania and Southern Africa. The most widely spoken Germanic language, Engli ...
(displacing earlier
Runic alphabet
Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
s) or
Baltic languages
The Baltic languages are a branch of the Indo-European language family spoken natively by a population of about 4.5 million people mainly in areas extending east and southeast of the Baltic Sea in Northern Europe. Together with the Slavic lang ...
, as well as by the speakers of several
Uralic languages
The Uralic languages (; sometimes called Uralian languages ) form a language family of 38 languages spoken by approximately 25million people, predominantly in Northern Eurasia. The Uralic languages with the most native speakers are Hungarian (w ...
, most notably
Hungarian,
Finnish
Finnish may refer to:
* Something or someone from, or related to Finland
* Culture of Finland
* Finnish people or Finns, the primary ethnic group in Finland
* Finnish language, the national language of the Finnish people
* Finnish cuisine
See also ...
and
Estonian.
The Latin script also came into use for writing the
West Slavic languages
The West Slavic languages are a subdivision of the Slavic language group. They include Polish, Czech, Slovak, Kashubian, Upper Sorbian and Lower Sorbian. The languages have traditionally been spoken across a mostly continuous region encompass ...
and several
South Slavic languages
The South Slavic languages are one of three branches of the Slavic languages. There are approximately 30 million speakers, mainly in the Balkans. These are separated geographically from speakers of the other two Slavic branches (West and East) ...
, as the people who spoke them adopted
Roman Catholicism
The Catholic Church, also known as the Roman Catholic Church, is the List of Christian denominations by number of members, largest Christian church, with 1.3 billion baptized Catholics Catholic Church by country, worldwide . It is am ...
. The speakers of
East Slavic languages
The East Slavic languages constitute one of three regional subgroups of the Slavic languages, distinct from the West and South Slavic languages. East Slavic languages are currently spoken natively throughout Eastern Europe, and eastwards to Sibe ...
generally adopted
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
along with
Orthodox Christianity
Orthodoxy (from Ancient Greek, Greek: ) is adherence to correct or accepted creeds, especially in religion.
Orthodoxy within Christianity refers to acceptance of the doctrines defined by various creeds and ecumenical councils in Late antiquity, A ...
. The
Serbian language
Serbian (, ) is the standardized variety of the Serbo-Croatian language mainly used by Serbs. It is the official and national language of Serbia, one of the three official languages of Bosnia and Herzegovina and co-official in Montenegro and Kos ...
uses both scripts, with Cyrillic predominating in official communication and Latin elsewhere, as determined by the Law on Official Use of the Language and Alphabet.
Since the 16th century
As late as 1500, the Latin script was limited primarily to the languages spoken in
Western
Western may refer to:
Places
*Western, Nebraska, a village in the US
*Western, New York, a town in the US
*Western Creek, Tasmania, a locality in Australia
*Western Junction, Tasmania, a locality in Australia
*Western world, countries that id ...
,
Northern, and
Central Europe
Central Europe is an area of Europe between Western Europe and Eastern Europe, based on a common historical, social and cultural identity. The Thirty Years' War (1618–1648) between Catholicism and Protestantism significantly shaped the area' ...
. The Orthodox Christian Slavs of
Eastern
Eastern may refer to:
Transportation
*China Eastern Airlines, a current Chinese airline based in Shanghai
*Eastern Air, former name of Zambia Skyways
*Eastern Air Lines, a defunct American airline that operated from 1926 to 1991
*Eastern Air Li ...
and
Southeastern Europe
Southeast Europe or Southeastern Europe (SEE) is a geographical subregion of Europe, consisting primarily of the Balkans. Sovereign states and territories that are included in the region are Albania, Bosnia and Herzegovina, Bulgaria, Croatia (al ...
mostly used
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
, and the Greek alphabet was in use by Greek-speakers around the eastern Mediterranean. The
Arabic script
The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used writing system in the world by number of countries using it or a script directly derived from it, and the ...
was widespread within Islam, both among
Arab
The Arabs (singular: Arab; singular ar, عَرَبِيٌّ, DIN 31635: , , plural ar, عَرَب, DIN 31635: , Arabic pronunciation: ), also known as the Arab people, are an ethnic group mainly inhabiting the Arab world in Western Asia, ...
s and non-Arab nations like the
Iranians,
Indonesians
Indonesians (Indonesian: ''orang Indonesia'') are citizens or people originally from Indonesia, regardless of their ethnic or religious background. There are more than 1,300 ethnicities in Indonesia, making it a multicultural archipelagic coun ...
,
Malays, and
Turkic peoples
The Turkic peoples are a collection of diverse ethnic groups of West, Central, East, and North Asia as well as parts of Europe, who speak Turkic languages.. "Turkic peoples, any of various peoples whose members speak languages belonging t ...
. Most of the rest of Asia used a variety of
Brahmic alphabets or the
Chinese script
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji' ...
.
Through
European colonization
The historical phenomenon of colonization is one that stretches around the globe and across time. Ancient and medieval colonialism was practiced by the Phoenicians, the Greeks, the Turks, and the Arabs.
Colonialism in the modern sense began ...
the Latin script has spread to the
Americas
The Americas, which are sometimes collectively called America, are a landmass comprising the totality of North and South America. The Americas make up most of the land in Earth's Western Hemisphere and comprise the New World.
Along with th ...
,
Oceania
Oceania (, , ) is a region, geographical region that includes Australasia, Melanesia, Micronesia, and Polynesia. Spanning the Eastern Hemisphere, Eastern and Western Hemisphere, Western hemispheres, Oceania is estimated to have a land area of ...
, parts of Asia, Africa, and the Pacific, in forms based on the
Spanish
Spanish might refer to:
* Items from or related to Spain:
**Spaniards are a nation and ethnic group indigenous to Spain
**Spanish language, spoken in Spain and many Latin American countries
**Spanish cuisine
Other places
* Spanish, Ontario, Can ...
,
Portuguese
Portuguese may refer to:
* anything of, from, or related to the country and nation of Portugal
** Portuguese cuisine, traditional foods
** Portuguese language, a Romance language
*** Portuguese dialects, variants of the Portuguese language
** Portu ...
,
English
English usually refers to:
* English language
* English people
English may also refer to:
Peoples, culture, and language
* ''English'', an adjective for something of, from, or related to England
** English national ide ...
,
French,
German
German(s) may refer to:
* Germany (of or related to)
**Germania (historical use)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizens of Germany, see also German nationality law
**Ger ...
and
Dutch
Dutch commonly refers to:
* Something of, from, or related to the Netherlands
* Dutch people ()
* Dutch language ()
Dutch may also refer to:
Places
* Dutch, West Virginia, a community in the United States
* Pennsylvania Dutch Country
People E ...
alphabets.
It is used for many
Austronesian languages, including the
languages of the Philippines
There are some 120 to 187 languages spoken in the Philippines, depending on the method of classification. Almost all are Malayo-Polynesian languages native to the archipelago. A number of Spanish-influenced creole varieties generally called C ...
and the
Malaysian
Malaysian may refer to:
* Something from or related to Malaysia, a country in Southeast Asia
* Malaysian Malay, a dialect of Malay language spoken mainly in Malaysia
* Malaysian people, people who are identified with the country of Malaysia regard ...
and
Indonesian language
Indonesian ( ) is the official language, official and national language of Indonesia. It is a standard language, standardized variety (linguistics), variety of Malay language, Malay, an Austronesian languages, Austronesian language that has be ...
s, replacing earlier Arabic and indigenous Brahmic alphabets. Latin letters served as the basis for the forms of the
Cherokee syllabary
The Cherokee syllabary is a syllabary invented by Sequoyah in the late 1810s and early 1820s to write the Cherokee language. His creation of the syllabary is particularly noteworthy as he was illiterate until the creation of his syllabary. He ...
developed by
Sequoyah
Sequoyah (Cherokee language, Cherokee: ᏍᏏᏉᏯ, ''Ssiquoya'', or ᏎᏉᏯ, ''Se-quo-ya''; 1770 – August 1843), also known as George Gist or George Guess, was a Native Americans in the United States, Native American polymath of the Ch ...
; however, the sound values are completely different.
Under Portuguese missionary influence, a Latin alphabet was devised for the
Vietnamese language
Vietnamese ( vi, tiếng Việt, links=no) is an Austroasiatic languages, Austroasiatic language originating from Vietnam where it is the national language, national and official language. Vietnamese is spoken natively by over 70 million people, ...
, which had previously used
Chinese characters
Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji' ...
. The Latin-based alphabet replaced the Chinese characters in administration in the 19th century with French rule.
Since 19th century
In the late 19th century, the
Romanians
The Romanians ( ro, români, ; dated exonym ''Vlachs'') are a Romance languages, Romance-speaking ethnic group. Sharing a common Culture of Romania, Romanian culture and Cultural heritage, ancestry, and speaking the Romanian language, they l ...
switched to the Latin alphabet, which they had used until the
Council of Florence
The Council of Florence is the seventeenth ecumenical council recognized by the Catholic Church, held between 1431 and 1449. It was convoked as the Council of Basel by Pope Martin V shortly before his death in February 1431 and took place in ...
in 1439, primarily because
Romanian
Romanian may refer to:
*anything of, from, or related to the country and nation of Romania
**Romanians, an ethnic group
**Romanian language, a Romance language
*** Romanian dialects, variants of the Romanian language
** Romanian cuisine, tradition ...
is a
Romance language
The Romance languages, sometimes referred to as Latin languages or Neo-Latin languages, are the various modern languages that evolved from Vulgar Latin. They are the only extant subgroup of the Italic languages in the Indo-European languages, I ...
. The Romanians were predominantly
Orthodox Christian
Orthodoxy (from Greek: ) is adherence to correct or accepted creeds, especially in religion.
Orthodoxy within Christianity refers to acceptance of the doctrines defined by various creeds and ecumenical councils in Antiquity, but different Churche ...
s, and their Church, increasingly influenced by Russia after the
fall of Byzantine Greek Constantinople in 1453 and capture of the Greek
Orthodox Patriarch, had begun promoting the Slavic
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
.
Since 20th century
In 1928, as part of
Mustafa Kemal Atatürk
Mustafa Kemal Atatürk, or Mustafa Kemal Pasha until 1921, and Ghazi Mustafa Kemal from 1921 Surname Law (Turkey), until 1934 ( 1881 – 10 November 1938) was a Turkish Mareşal (Turkey), field marshal, Turkish National Movement, re ...
's reforms, the new
Republic of Turkey
Turkey ( tr, Türkiye ), officially the Republic of Türkiye ( tr, Türkiye Cumhuriyeti, links=no ), is a list of transcontinental countries, transcontinental country located mainly on the Anatolia, Anatolian Peninsula in Western Asia, with ...
adopted a Latin alphabet for the
Turkish language
Turkish ( , ), also referred to as Turkish of Turkey (''Türkiye Türkçesi''), is the most widely spoken of the Turkic languages, with around 80 to 90 million speakers. It is the national language of Turkey and Northern Cyprus. Significant sma ...
, replacing a modified Arabic alphabet. Most of the
Turkic-speaking peoples of the former
USSR
The Soviet Union,. officially the Union of Soviet Socialist Republics. (USSR),. was a transcontinental country that spanned much of Eurasia from 1922 to 1991. A flagship communist state, it was nominally a federal union of fifteen nationa ...
, including
Tatars
The Tatars ()[Tatar]
in the Collins English Dictionary is an umbrella term for different ,
Bashkirs
, native_name_lang = bak
, flag = File:Bashkirs of Baymak rayon.jpg
, flag_caption = Bashkirs of Baymak in traditional dress
, image =
, caption =
, population = approx. 2 million
, popplace ...
,
Azeri
Azerbaijanis (; az, Azərbaycanlılar, ), Azeris ( az, Azərilər, ), or Azerbaijani Turks ( az, Azərbaycan Türkləri, ) are a Turkic peoples, Turkic people living mainly in Azerbaijan (Iran), northwestern Iran and the Azerbaijan, Republi ...
,
Kazakh,
Kyrgyz and others, had their writing systems replaced by the Latin-based
Uniform Turkic alphabet
A uniform is a variety of clothing worn by members of an organization while participating in that organization's activity. Modern uniforms are most often worn by armed forces and paramilitary organizations such as police, emergency services, ...
in the 1930s; but, in the 1940s, all were replaced by Cyrillic.
After the
collapse of the Soviet Union
The dissolution of the Soviet Union, also negatively connoted as rus, Разва́л Сове́тского Сою́за, r=Razvál Sovétskogo Soyúza, ''Ruining of the Soviet Union''. was the process of internal disintegration within the Sov ...
in 1991, three of the newly independent Turkic-speaking republics,
Azerbaijan
Azerbaijan (, ; az, Azərbaycan ), officially the Republic of Azerbaijan, , also sometimes officially called the Azerbaijan Republic is a transcontinental country located at the boundary of Eastern Europe and Western Asia. It is a part of th ...
,
Uzbekistan
Uzbekistan (, ; uz, Ozbekiston, italic=yes / , ; russian: Узбекистан), officially the Republic of Uzbekistan ( uz, Ozbekiston Respublikasi, italic=yes / ; russian: Республика Узбекистан), is a doubly landlocked cou ...
,
Turkmenistan
Turkmenistan ( or ; tk, Türkmenistan / Түркменистан, ) is a country located in Central Asia, bordered by Kazakhstan to the northwest, Uzbekistan to the north, east and northeast, Afghanistan to the southeast, Iran to the sout ...
, as well as Romanian-speaking
Moldova
Moldova ( , ; ), officially the Republic of Moldova ( ro, Republica Moldova), is a Landlocked country, landlocked country in Eastern Europe. It is bordered by Romania to the west and Ukraine to the north, east, and south. The List of states ...
, officially adopted Latin alphabets for their languages.
Kyrgyzstan
Kyrgyzstan,, pronounced or the Kyrgyz Republic, is a landlocked country in Central Asia. Kyrgyzstan is bordered by Kazakhstan to the north, Uzbekistan to the west, Tajikistan to the south, and the People's Republic of China to the east. ...
,
Iranian
Iranian may refer to:
* Iran, a sovereign state
* Iranian peoples, the speakers of the Iranian languages. The term Iranic peoples is also used for this term to distinguish the pan ethnic term from Iranian, used for the people of Iran
* Iranian lan ...
-speaking
Tajikistan
Tajikistan (, ; tg, Тоҷикистон, Tojikiston; russian: Таджикистан, Tadzhikistan), officially the Republic of Tajikistan ( tg, Ҷумҳурии Тоҷикистон, Jumhurii Tojikiston), is a landlocked country in Centr ...
, and the breakaway region of
Transnistria
Transnistria, officially the Pridnestrovian Moldavian Republic (PMR), is an unrecognised breakaway state that is internationally recognised as a part of Moldova. Transnistria controls most of the narrow strip of land between the Dniester riv ...
kept the Cyrillic alphabet, chiefly due to their close ties with Russia.
In the 1930s and 1940s, the majority of
Kurds ug:كۇردلار
Kurds ( ku, کورد ,Kurd, italic=yes, rtl=yes) or Kurdish people are an Iranian ethnic group native to the mountainous region of Kurdistan in Western Asia, which spans southeastern Turkey, northwestern Iran, northern Ir ...
replaced the Arabic script with two Latin alphabets. Although only the official
Kurdish government uses an Arabic alphabet for public documents, the Latin Kurdish alphabet remains widely used throughout the region by the majority of
Kurdish
Kurdish may refer to:
*Kurds or Kurdish people
*Kurdish languages
*Kurdish alphabets
*Kurdistan, the land of the Kurdish people which includes:
**Southern Kurdistan
**Eastern Kurdistan
**Northern Kurdistan
**Western Kurdistan
See also
* Kurd (dis ...
-speakers.
In 1957, the
People's Republic of China
China, officially the People's Republic of China (PRC), is a country in East Asia. It is the world's most populous country, with a population exceeding 1.4 billion, slightly ahead of India. China spans the equivalent of five time zones and ...
introduced a script reform to the
Zhuang language
The Zhuang languages (; autonym: , pre-1982: , Sawndip: 話僮, from ''vah'', 'language' and ''Cuengh'', 'Zhuang'; ) are any of more than a dozen Tai languages spoken by the Zhuang people of Southern China in the province of Guangxi and adjac ...
, changing its orthography from
Sawndip
Zhuang characters or ''Sawndip'' (Sawndip: ; ) are logograms derived from Chinese characters and used by the Zhuang people of Guangxi and Yunnan provinces in China to write the Zhuang languages for more than one thousand years. The script is used ...
, a writing system based on Chinese, to a Latin script alphabet that used a mixture of Latin, Cyrillic, and IPA letters to represent both the phonemes and tones of the Zhuang language, without the use of diacritics. In 1982 this was further standardised to use only Latin script letters.
With the collapse of the
Derg
The Derg (also spelled Dergue; , ), officially the Provisional Military Administrative Council (PMAC), was the military junta that ruled Ethiopia, then including present-day Eritrea, from 1974 to 1987, when the military leadership formally " c ...
and subsequent end of decades of
Amharic
Amharic ( or ; (Amharic: ), ', ) is an Ethiopian Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amharas, and also serves as a lingua franca for all oth ...
assimilation in 1991, various ethnic groups in
Ethiopia
Ethiopia, , om, Itiyoophiyaa, so, Itoobiya, ti, ኢትዮጵያ, Ítiyop'iya, aa, Itiyoppiya officially the Federal Democratic Republic of Ethiopia, is a landlocked country in the Horn of Africa. It shares borders with Eritrea to the ...
dropped the
Geʽez script
Geʽez ( gez, ግዕዝ, Gəʿəz, ) is a script used as an abugida (alphasyllabary) for several Afroasiatic languages, Afro-Asiatic and Nilo-Saharan languages, Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an ''abjad'' (co ...
, which was deemed unsuitable for languages outside of the
Semitic branch. In the following years the
Kafa,
Oromo,
Sidama
The Sidama ( am, ሲዳማ) are an ethnic group traditionally inhabiting the Sidama Region, formerly part of the Southern Nations, Nationalities, and Peoples' Region of Ethiopia. On 23 November 2019, the Sidama Zone became the 10th regional st ...
,
Somali,
and
Wolaitta languages switched to Latin while there is continued debate on whether to follow suit for the
Hadiyya and
Kambaata languages.
21st century
On 15 September 1999 the authorities of
Tatarstan
The Republic of Tatarstan (russian: Республика Татарстан, Respublika Tatarstan, p=rʲɪsˈpublʲɪkə tətɐrˈstan; tt-Cyrl, Татарстан Республикасы), or simply Tatarstan (russian: Татарстан, tt ...
, Russia, passed a law to make the Latin script a co-official writing system alongside Cyrillic for the
Tatar language
Tatar ( or ) is a Turkic languages, Turkic language spoken by Volga Tatars, Tatars mainly located in modern Tatarstan (European Russia), as well as Siberia. It should not be confused with Crimean Tatar language, Crimean Tatar or Siberian Tat ...
by 2011. A year later, however, the Russian government overruled the law and banned Latinization on its territory.
In 2015, the
government of Kazakhstan
The Government of the Republic of Kazakhstan ( kk, Қазақстан Республикасының Үкіметі, tr, ''Qazaqstan Respublikasynyñ Ükımetı'') oversees a presidential republic. The President of Kazakhstan, currently Kassym- ...
announced that a
Kazakh Latin alphabet would replace the
Kazakh Cyrillic alphabet as the official writing system for the
Kazakh language
The Kazakh or simply Qazaq (Latin: or , Cyrillic: or , Arabic Script: or , , ) is a Turkic language of the Kipchak branch spoken in Central Asia by Kazakhs. It is closely related to Nogai, Kyrgyz and Karakalpak. It is the official lan ...
by 2025. There are also talks about switching from the Cyrillic script to Latin in Ukraine,
Kyrgyzstan
Kyrgyzstan,, pronounced or the Kyrgyz Republic, is a landlocked country in Central Asia. Kyrgyzstan is bordered by Kazakhstan to the north, Uzbekistan to the west, Tajikistan to the south, and the People's Republic of China to the east. ...
, and
Mongolia
Mongolia; Mongolian script: , , ; lit. "Mongol Nation" or "State of Mongolia" () is a landlocked country in East Asia, bordered by Russia to the north and China to the south. It covers an area of , with a population of just 3.3 million, ...
. Mongolia, however, has since opted to revive the
Mongolian script
The classical or traditional Mongolian script, also known as the , was the first writing system created specifically for the Mongolian language, and was the most widespread until the introduction of Cyrillic in 1946. It is traditionally writte ...
instead of switching to Latin.
In October 2019, the organization
National Representational Organization for Inuit in Canada (ITK) announced that they will introduce a unified writing system for the
Inuit languages
The Inuit languages are a closely related group of indigenous American languages traditionally spoken across the North American Arctic and adjacent subarctic, reaching farthest south in Labrador. The related Yupik languages (spoken in western ...
in the country. The writing system is based on the Latin alphabet and is modeled after the one used in the
Greenlandic language
Greenlandic ( kl, kalaallisut, link=no ; da, grønlandsk ) is an Eskimo–Aleut language with about 56,000 speakers, mostly Greenlandic Inuit in Greenland. It is closely related to the Inuit languages in Canada such as Inuktitut. It is the mos ...
.
On 12 February 2021 the government of Uzbekistan announced it will finalize the transition from Cyrillic to Latin for the
Uzbek language
Uzbek (''Oʻzbekcha, Oʻzbek tili or Ўзбекча, Ўзбек тили''), formerly known as ''Turki'' or ''Western Turki'', is a Turkic language spoken by Uzbeks. It is the official, and national language of Uzbekistan. Uzbek is spoken as ei ...
by 2023. Plans to switch to Latin originally began in 1993 but subsequently stalled and Cyrillic remained in widespread use.
At present the
Crimean Tatar language
Crimean Tatar () also called Crimean (), is a Kipchak Turkic language spoken in Crimea and the Crimean Tatar diasporas of Uzbekistan, Turkey, Romania, and Bulgaria, as well as small communities in the United States and Canada. It should n ...
uses both Cyrillic and Latin. The use of Latin was originally approved by Crimean Tatar representatives after the Soviet Union's collapse but was never implemented by the regional government. After Russia's
annexation
Annexation (Latin ''ad'', to, and ''nexus'', joining), in international law, is the forcible acquisition of one state's territory by another state, usually following military occupation of the territory. It is generally held to be an illegal act ...
of Crimea in 2014 the Latin script was dropped entirely. Nevertheless Crimean Tatars outside of Crimea continue to use Latin and on 22 October 2021 the government of Ukraine approved a proposal endorsed by the
Mejlis of the Crimean Tatar People
The Mejlis of the Crimean Tatar People ( crh, Къырымтатар Миллий Меджлиси - ''Qırımtatar Milliy Meclisi'') is the single highest executive-representative body of the Crimean Tatars in period between sessions of the Q ...
to switch the Crimean Tatar language to Latin by 2025.
In July 2020, 2.6 billion people (36% of the world population) use the Latin alphabet.
International standards
By the 1960s, it became apparent to the computer and
telecommunication
Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that fe ...
s industries in the
First World
The concept of First World originated during the Cold War and comprised countries that were under the influence of the United States and the rest of NATO and opposed the Soviet Union and/or communism during the Cold War. Since the collapse of ...
that a non-proprietary method of encoding characters was needed. The
International Organization for Standardization
The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in Ar ...
(ISO) encapsulated the Latin alphabet in their (
ISO/IEC 646
ISO/IEC 646 is a set of ISO/IEC standards, described as ''Information technology — ISO 7-bit coded character set for information interchange'' and developed in cooperation with ASCII at least since 1964. Since its first edition in 1 ...
) standard. To achieve widespread acceptance, this encapsulation was based on popular usage.
As the United States held a preeminent position in both industries during the 1960s, the standard was based on the already published ''American Standard Code for Information Interchange'', better known as
ASCII
ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
, which included in the
character set
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
the 26 × 2 (uppercase and lowercase) letters of the
English alphabet
The alphabet for Modern English is a Latin-script alphabet consisting of 26 letters, each having an upper- and lower-case form. The word ''alphabet'' is a compound of the first two letters of the Greek alphabet, '' alpha'' and '' beta''. ...
. Later standards issued by the ISO, for example
ISO/IEC 10646
ISO/IEC JTC 1, entitled "Information technology", is a joint technical committee (JTC) of the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). Its purpose is to develop, maintain and pr ...
(
Unicode Latin
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with co ...
), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin alphabet with extensions to handle other letters in other languages.
National standards
The DIN standard
DIN 91379
The DIN standard DIN 91379: "Characters and defined character sequences in Unicode for the electronic processing of names and data exchange in Europe, with CD-ROM" defines a normative subset of Unicode Latin characters, sequences of base characte ...
specifies a subset of Unicode letters, special characters, and sequences of letters and diacritic signs to allow the correct representation of names and to simplify data exchange in Europe. This specification supports all official languages of
European Union
The European Union (EU) is a supranational political and economic union of member states that are located primarily in Europe. The union has a total area of and an estimated total population of about 447million. The EU has often been des ...
countries (thus also Greek and
Cyrillic
, bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця
, fam1 = Egyptian hieroglyphs
, fam2 = Proto-Sinaitic
, fam3 = Phoenician
, fam4 = G ...
for
Bulgarian
Bulgarian may refer to:
* Something of, from, or related to the country of Bulgaria
* Bulgarians, a South Slavic ethnic group
* Bulgarian language, a Slavic language
* Bulgarian alphabet
* A citizen of Bulgaria, see Demographics of Bulgaria
* Bul ...
) as well as the official languages of Iceland, Liechtenstein, Norway, and Switzerland, and also the German minority languages. To allow the transliteration of names in other writing systems to the Latin script according to the relevant ISO standards all necessary combinations of base letters and diacritic signs are provided.
Efforts are being made to further develop it into a European
CEN standard.
[
]
As used by various languages
In the course of its use, the Latin alphabet was adapted for use in new languages, sometimes representing
phoneme
In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language.
For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
s not found in languages that were already written with the Roman characters. To represent these new sounds, extensions were therefore created, be it by adding
diacritic
A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s to existing
letters
Letter, letters, or literature may refer to:
Characters typeface
* Letter (alphabet), a character representing one or more of the sounds used in speech; any of the symbols of an alphabet.
* Letterform, the graphic form of a letter of the alphabe ...
, by joining multiple letters together to make
ligatures, by creating completely new forms, or by assigning a special function to pairs or triplets of letters. These new forms are given a place in the alphabet by defining an
alphabetical order
Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
or collation sequence, which can vary with the particular language.
Letters
Some examples of new letters to the standard Latin alphabet are the
Runic
Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
letters
wynn
Wynn or wyn (; also spelled wen, ƿynn, and ƿen) is a letter of the Old English alphabet, where it is used to represent the sound .
History The letter "W"
While the earliest Old English texts represent this phoneme with the digraph , ...
and
thorn
Thorn(s) or The Thorn(s) may refer to:
Botany
* Thorns, spines, and prickles, sharp structures on plants
* ''Crataegus monogyna'', or common hawthorn, a plant species
Comics and literature
* Rose and Thorn, the two personalities of two DC Com ...
, and the letter
eth
(colloquially)
, former_name = eidgenössische polytechnische Schule
, image = ETHZ.JPG
, image_size =
, established =
, type = Public
, budget = CHF 1.896 billion (2021)
, rector = Günther Dissertori
, president = Joël Mesot
, a ...
, which were added to the alphabet of
Old English
Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
. Another Irish letter, the
insular ''g'', developed into
yogh
The letter yogh (ȝogh) ( ; Scots Language, Scots: ; Middle English: ) was used in Middle English and Older Scots, representing ''y'' () and various velar consonant , velar phonemes. It was derived from the Insular G, Insular form of the letter ...
, used in
Middle English
Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English p ...
. Wynn was later replaced with the new letter , eth and thorn with , and yogh with . Although the four are no longer part of the English or Irish alphabets, eth and thorn are still used in the modern
Icelandic alphabet
Icelandic refers to anything of, from, or related to Iceland and may refer to:
*Icelandic people
Icelanders ( is, Íslendingar) are a North Germanic ethnic group and nation who are native to the island country of Iceland and speak Icelandic.
...
, while eth is also used by the
Faroese alphabet
Faroese orthography is the method employed to write the Faroese language, using a 29-letter Latin alphabet.
Alphabet
The Faroese alphabet consists of 29 letters derived from the Latin script:
* Eth (Faroese ') never appears at the beginnin ...
.
Some West, Central and
Southern Africa
Southern Africa is the southernmost subregion of the African continent, south of the Congo and Tanzania. The physical location is the large part of Africa to the south of the extensive Congo River basin. Southern Africa is home to a number of ...
n languages use a few additional letters that have sound values similar to those of their equivalents in the IPA. For example,
Adangme uses the letters and , and
Ga uses , and .
Hausa
Hausa may refer to:
* Hausa people, an ethnic group of West Africa
* Hausa language, spoken in West Africa
* Hausa Kingdoms, a historical collection of Hausa city-states
* Hausa (horse) or Dongola horse, an African breed of riding horse
See also
* ...
uses and for
implosives
Implosive consonants are a group of stop consonants (and possibly also some affricates) with a mixed glottalic ingressive and pulmonic egressive airstream mechanism.''Phonetics for communication disorders.'' Martin J. Ball and Nicole Müller. Rou ...
, and for an
ejective
In phonetics, ejective consonants are usually voiceless consonants that are pronounced with a glottalic egressive airstream. In the phonology of a particular language, ejectives may contrast with aspirated, voiced and tenuis consonants. Some l ...
.
Africanists
African studies is the study of Africa, especially the continent's cultures and societies (as opposed to its geology, geography, zoology, etc.). The field includes the study of Africa's history (pre-colonial, colonial, post-colonial), demograph ...
have standardized these into the
African reference alphabet
An African reference alphabet was first proposed in 1978 by a UNESCO-organized conference held in Niamey, Niger, and the proposed alphabet was revised in 1982. The conference recommended the use of single letters for a sound (that is, a phoneme) ...
.
Dotted and
dotless I
I, or ı, called dotless I, is a letter used in the Latin-script alphabets of Azerbaijani, Crimean Tatar, Gagauz, Kazakh, Tatar, Kyrgyz, and Turkish. It commonly represents the close back unrounded vowel , except in Kazakh where it represen ...
— and — are two forms of the letter I used by the
Turkish,
Azerbaijani, and
Kazakh alphabets. The Azerbaijani language also has , which represents the
near-open front unrounded vowel
The near-open front unrounded vowel, or near-low front unrounded vowel, is a type of vowel sound, used in some spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is , a lowercase of the ligature. Bot ...
.
Multigraphs
A
digraph is a pair of letters used to write one sound or a combination of sounds that does not correspond to the written letters in sequence. Examples are , , , , , in English, and , , and in Dutch. In Dutch the is capitalized as or the
ligature
Ligature may refer to:
* Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure
** Ligature (orthodontic), used in dentistry
* Ligature (music), an element of musical notation used especially in the me ...
, but never as , and it often takes the appearance of a ligature very similar to the letter in
handwriting
Handwriting is the writing done with a writing instrument, such as a pen or pencil, in the hand. Handwriting includes both printing and cursive styles and is separate from formal calligraphy or typeface
A typeface (or font family) is ...
.
A
trigraph is made up of three letters, like the
German
German(s) may refer to:
* Germany (of or related to)
**Germania (historical use)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizens of Germany, see also German nationality law
**Ger ...
, the
Breton
Breton most often refers to:
*anything associated with Brittany, and generally
** Breton people
** Breton language, a Southwestern Brittonic Celtic language of the Indo-European language family, spoken in Brittany
** Breton (horse), a breed
**Ga ...
or the
Milanese
Milanese (endonym in traditional orthography , ') is the central variety of the Western dialect of the Lombard language spoken in Milan, the rest of its metropolitan city, and the northernmost part of the province of Pavia. Milanese, due to ...
. In the
orthographies
An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation.
Most transnational languages in the modern period have a writing system, and mos ...
of some languages, digraphs and trigraphs are regarded as independent letters of the alphabet in their own right. The capitalization of digraphs and trigraphs is language-dependent, as only the first letter may be capitalized, or all component letters simultaneously (even for words written in title case, where letters after the digraph or trigraph are left in lowercase).
Ligatures
A
ligature
Ligature may refer to:
* Ligature (medicine), a piece of suture used to shut off a blood vessel or other anatomical structure
** Ligature (orthodontic), used in dentistry
* Ligature (music), an element of musical notation used especially in the me ...
is a fusion of two or more ordinary letters into a new
glyph
A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
or character. Examples are (from , called "ash"), (from , sometimes called "oethel"), the
abbreviation
An abbreviation (from Latin ''brevis'', meaning ''short'') is a shortened form of a word or phrase, by any method. It may consist of a group of letters or words taken from the full version of the word or phrase; for example, the word ''abbrevia ...
(from la, et, , and, called "ampersand"), and (from or , the
archaic medial form of , followed by an or , called "sharp S" or "eszett").
Diacritics
A diacritic, in some cases also called an accent, is a small symbol that can appear above or below a letter, or in some other position, such as the
umlaut sign used in the German characters , , or the Romanian characters
ă,
â,
î,
ș,
ț. Its main function is to change the phonetic value of the letter to which it is added, but it may also modify the pronunciation of a whole syllable or word, indicate the start of a new syllable, or distinguish between
homographs
A homograph (from the el, ὁμός, ''homós'', "same" and γράφω, ''gráphō'', "write") is a word that shares the same written form as another word but has a different meaning. However, some dictionaries insist that the words must also ...
such as the
Dutch
Dutch commonly refers to:
* Something of, from, or related to the Netherlands
* Dutch people ()
* Dutch language ()
Dutch may also refer to:
Places
* Dutch, West Virginia, a community in the United States
* Pennsylvania Dutch Country
People E ...
words ''
een'' () meaning "a" or "an", and ''
één'', () meaning "one". As with the pronunciation of letters, the effect of diacritics is language-dependent.
English is the only major modern
European language that requires no diacritics for its native vocabulary. Historically, in formal writing, a
diaeresis was sometimes used to indicate the start of a new syllable within a sequence of letters that could otherwise be misinterpreted as being a single vowel (e.g., “coöperative”, “reëlect”), but modern writing styles either omit such marks or use a hyphen to indicate a syllable break (e.g. “cooperative”, “re-elect”).
Collation
Some modified letters, such as the symbols , , and , may be regarded as new individual letters in themselves, and assigned a specific place in the alphabet for
collation
Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fili ...
purposes, separate from that of the letter on which they are based, as is done in
Swedish
Swedish or ' may refer to:
Anything from or related to Sweden, a country in Northern Europe. Or, specifically:
* Swedish language, a North Germanic language spoken primarily in Sweden and Finland
** Swedish alphabet, the official alphabet used by ...
. In other cases, such as with , , in German, this is not done; letter-diacritic combinations being identified with their base letter. The same applies to digraphs and trigraphs. Different diacritics may be treated differently in collation within a single language. For example, in Spanish, the character is considered a letter, and sorted between and in dictionaries, but the accented vowels , , , , , are not separated from the unaccented vowels , , , , .
Capitalization
The languages that use the Latin script today generally use
capital letters
Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
to begin paragraphs and sentences and
proper nouns
A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', ''Sarah (given name), Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a Class (philo ...
. The rules for
capitalization
Capitalization (American English) or capitalisation (British English) is writing a word with its first letter as a capital letter (uppercase letter) and the remaining letters in lower case, in writing systems with a case distinction. The term a ...
have changed over time, and different languages have varied in their rules for capitalization.
Old English
Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
, for example, was rarely written with even proper nouns capitalized; whereas
Modern English of the 18th century had frequently all nouns capitalized, in the same way that Modern
German
German(s) may refer to:
* Germany (of or related to)
**Germania (historical use)
* Germans, citizens of Germany, people of German ancestry, or native speakers of the German language
** For citizens of Germany, see also German nationality law
**Ger ...
is written today, e.g. german: Alle Schwestern der alten Stadt hatten die Vögel gesehen, , All of the sisters of the old city had seen the birds.
Romanization
Words from languages natively written with other
scripts
Script may refer to:
Writing systems
* Script, a distinctive writing system, based on a repertoire of specific elements or symbols, or that repertoire
* Script (styles of handwriting)
** Script typeface, a typeface with characteristics of handw ...
, such as
Arabic
Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
or
Chinese
Chinese can refer to:
* Something related to China
* Chinese people, people of Chinese nationality, citizenship, and/or ethnicity
**''Zhonghua minzu'', the supra-ethnic concept of the Chinese nation
** List of ethnic groups in China, people of ...
, are usually
transliterated
Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus '' trans-'' + '' liter-'') in predictable ways, such as Greek → , Cyrillic → , Greek → the digraph , Armenian → or ...
or
transcribed when embedded in Latin-script text or in
multilingual
Multilingualism is the use of more than one language, either by an individual speaker or by a group of speakers. It is believed that multilingual speakers outnumber monolingual speakers in the world's population. More than half of all E ...
international communication, a process termed ''Romanization''.
Whilst the Romanization of such languages is used mostly at unofficial levels, it has been especially prominent in computer messaging where only the limited seven-bit
ASCII
ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
code is available on older systems. However, with the introduction of
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
, Romanization is now becoming less necessary. Note that keyboards used to enter such text may still restrict users to Romanized text, as only ASCII or Latin-alphabet characters may be available.
See also
*
List of languages by writing system#Latin script
*
Western Latin character sets (computing)
Several binary representations of 8-bit character sets for common Western European languages are compared in this article. These encodings were designed for representation of Italian, Spanish, Portuguese, French, German, Dutch, English, Danish, S ...
*
European Latin Unicode subset (DIN 91379)
*
Latin letters used in mathematics
Many letters of the Latin alphabet, both capital and small, are used in mathematics, science, and engineering to denote by convention specific or abstracted constants, variables of a certain type, units, multipliers, or physical entities. Certain ...
*
Latin omega
Latin omega, or simply omega, is an additional letter of the Latin alphabet, based on the lowercase of the Greek letter omega . It was included as a Latin letter in the Mann and Dalby 1982 revision of the African reference alphabet and has been us ...
Notes
References
Citations
Sources
*
Further reading
* Boyle, Leonard E. 1976. "Optimist and recensionist: 'Common errors' or 'common variations.'" In ''Latin script and letters A.D. 400–900: Festschrift presented to Ludwig Bieler on the occasion of his 70th birthday.'' Edited by John J. O'Meara and Bernd Naumann, 264–74. Leiden, The Netherlands: Brill.
* Morison, Stanley. 1972. ''Politics and script: Aspects of authority and freedom in the development of Graeco-Latin script from the sixth century B.C. to the twentieth century A.D.'' Oxford: Clarendon.
External links
Unicode collation chartLatin letters sorted by shape
Diacritics ProjectAll you need to design a font with correct accents
{{Authority control
History of the Roman Empire
Officially used writing systems of India