HOME

TheInfoList



OR:

An alphabet is a standard set of letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to
phoneme A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s as the smallest sound segments that can distinguish one word from another in a given language. Not all
writing system A writing system comprises a set of symbols, called a ''script'', as well as the rules by which the script represents a particular language. The earliest writing appeared during the late 4th millennium BC. Throughout history, each independen ...
s represent language in this way: a
syllabary In the Linguistics, linguistic study of Written language, written languages, a syllabary is a set of grapheme, written symbols that represent the syllables or (more frequently) mora (linguistics), morae which make up words. A symbol in a syllaba ...
assigns symbols to spoken
syllable A syllable is a basic unit of organization within a sequence of speech sounds, such as within a word, typically defined by linguists as a ''nucleus'' (most often a vowel) with optional sounds before or after that nucleus (''margins'', which are ...
s, while logographies assign symbols to
word A word is a basic element of language that carries semantics, meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consensus among linguist ...
s,
morpheme A morpheme is any of the smallest meaningful constituents within a linguistic expression and particularly within a word. Many words are themselves standalone morphemes, while other words contain multiple morphemes; in linguistic terminology, this ...
s, or other semantic units. The first letters were invented in
Ancient Egypt Ancient Egypt () was a cradle of civilization concentrated along the lower reaches of the Nile River in Northeast Africa. It emerged from prehistoric Egypt around 3150BC (according to conventional Egyptian chronology), when Upper and Lower E ...
to serve as an aid in writing
Egyptian hieroglyph Ancient Egyptian hieroglyphs ( ) were the formal writing system used in Ancient Egypt for writing the Egyptian language. Hieroglyphs combined ideographic, logographic, syllabic and alphabetic elements, with more than 1,000 distinct characters. ...
s; these are referred to as Egyptian uniliteral signs by
lexicographer Lexicography is the study of lexicons and the art of compiling dictionaries. It is divided into two separate academic disciplines: * Practical lexicography is the art or craft of compiling, writing and editing dictionary, dictionaries. * The ...
s. This system was used until the 5th century AD, and fundamentally differed by adding pronunciation hints to existing hieroglyphs that had previously carried no pronunciation information. Later on, these
phonemic A phoneme () is any set of similar speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible phonetic unit—that helps distinguish one word from another. All languages con ...
symbols also became used to transcribe foreign words. The first fully phonemic script was the
Proto-Sinaitic script The Proto-Sinaitic script is a Middle Bronze Age writing system known from a small corpus of about Serabit el-Khadim proto-Sinaitic inscriptions, 30-40 inscriptions and fragments from Serabit el-Khadim in the Sinai Peninsula, as well as Wadi el ...
, also descending from Egyptian hieroglyphs, which was later modified to create the
Phoenician alphabet The Phoenician alphabet is an abjad (consonantal alphabet) used across the Mediterranean civilization of Phoenicia for most of the 1st millennium BC. It was one of the first alphabets, attested in Canaanite and Aramaic inscriptions fo ...
. The Phoenician system is considered the first true alphabet and is the ultimate ancestor of many modern scripts, including
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
,
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
,
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
,
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
,
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, and possibly Brahmic. Peter T. Daniels distinguishes true alphabets—which use letters to represent both consonants and vowels—from both
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
s and
abjad An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
s, which only need letters for consonants. Abjads generally lack vowel indicators altogether, while abugidas represent them with
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s added to letters. In this narrower sense, the
Greek alphabet The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to systematically write vowels as wel ...
was the first true alphabet; it was originally derived from the
Phoenician alphabet The Phoenician alphabet is an abjad (consonantal alphabet) used across the Mediterranean civilization of Phoenicia for most of the 1st millennium BC. It was one of the first alphabets, attested in Canaanite and Aramaic inscriptions fo ...
, which was an abjad. Alphabets usually have a standard ordering for their letters. This makes alphabets a useful tool in
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fi ...
, as words can be listed in a well-defined order—commonly known as
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is ...
. This also means that letters may be used as a method of "numbering" ordered items. Some systems demonstrate
acrophony Acrophony (; + 'sound') is the naming of letters of an alphabetic writing system so that a letter's name begins with the letter itself. For example, Greek letter names are acrophonic: the names of the letters α, β, γ, δ, are spelled with t ...
, a phenomenon where letters have been given names distinct from their pronunciations. Systems with acrophony include Greek,
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
,
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
, and Syriac; systems without include the
Latin alphabet The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
.


Etymology

The English word ''alphabet'' came into
Middle English Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman Conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English pe ...
from the
Late Latin Late Latin is the scholarly name for the form of Literary Latin of late antiquity.Roberts (1996), p. 537. English dictionary definitions of Late Latin date this period from the 3rd to 6th centuries CE, and continuing into the 7th century in ...
word , which in turn originated in the Greek ; it was made from the first two letters of the Greek alphabet, ''
alpha Alpha (uppercase , lowercase ) is the first letter of the Greek alphabet. In the system of Greek numerals, it has a value of one. Alpha is derived from the Phoenician letter ''aleph'' , whose name comes from the West Semitic word for ' ...
'' (α) and ''
beta Beta (, ; uppercase , lowercase , or cursive ; or ) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Ancient Greek, beta represented the voiced bilabial plosive . In Modern Greek, it represe ...
'' (β). The names for the Greek letters, in turn, came from the first two letters of the Phoenician alphabet: ''
aleph Aleph (or alef or alif, transliterated ʾ) is the first Letter (alphabet), letter of the Semitic abjads, including Phoenician alphabet, Phoenician ''ʾālep'' 𐤀, Hebrew alphabet, Hebrew ''ʾālef'' , Aramaic alphabet, Aramaic ''ʾālap'' � ...
'', the word for ''ox'', and '' bet'', the word for ''house''.


History


Alphabets related to Phoenician


Ancient Near Eastern alphabets

The Ancient Egyptian writing system had a set of some 24 hieroglyphs that are called uniliterals, which are glyphs that provide one sound. These glyphs were used as pronunciation guides for
logogram In a written language, a logogram (from Ancient Greek 'word', and 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chine ...
s, to write grammatical inflections, and, later, to transcribe loan words and foreign names. The script was used a fair amount in the 4th century AD. However, after pagan temples were closed down, it was forgotten in the 5th century until the discovery of the
Rosetta Stone The Rosetta Stone is a stele of granodiorite inscribed with three versions of a Rosetta Stone decree, decree issued in 196 BC during the Ptolemaic dynasty of ancient Egypt, Egypt, on behalf of King Ptolemy V Epiphanes. The top and middle texts ...
. There was also
cuneiform Cuneiform is a Logogram, logo-Syllabary, syllabic writing system that was used to write several languages of the Ancient Near East. The script was in active use from the early Bronze Age until the beginning of the Common Era. Cuneiform script ...
, primarily used to write several ancient languages, including Sumerian. The last known use of cuneiform was in 75 AD, after which the script fell out of use. In the
Middle Bronze Age The Bronze Age () was a historical period characterised principally by the use of bronze tools and the development of complex urban societies, as well as the adoption of writing in some areas. The Bronze Age is the middle principal period of ...
, an apparently alphabetic system known as the
Proto-Sinaitic script The Proto-Sinaitic script is a Middle Bronze Age writing system known from a small corpus of about Serabit el-Khadim proto-Sinaitic inscriptions, 30-40 inscriptions and fragments from Serabit el-Khadim in the Sinai Peninsula, as well as Wadi el ...
appeared in Egyptian turquoise mines in the
Sinai Peninsula The Sinai Peninsula, or simply Sinai ( ; ; ; ), is a peninsula in Egypt, and the only part of the country located in Asia. It is between the Mediterranean Sea to the north and the Red Sea to the south, and is a land bridge between Asia and Afri ...
, apparently left by Canaanite workers. Orly Goldwasser has connected the illiterate turquoise miner graffiti theory to the origin of the alphabet. In 1999, American Egyptologists
John John is a common English name and surname: * John (given name) * John (surname) John may also refer to: New Testament Works * Gospel of John, a title often shortened to John * First Epistle of John, often shortened to 1 John * Second E ...
and Deborah Darnell discovered an earlier version of this first alphabet at the
Wadi el-Hol Wadi el-Hol is a valley on the Farshut Road crossing the Qena Bend between Thebes, Egypt, Thebes and Hu, Egypt, Hiw, on the west bank of the river Nile in Egypt. Wadi el-Hol Inscriptions The Wadi el-Hol inscriptions are two rock inscriptions which ...
valley. The script dated to and shows evidence of having been adapted from specific forms of Egyptian hieroglyphs that could be dated to , strongly suggesting that the first alphabet had developed about that time. The script was based on letter appearances and names, believed to be based on Egyptian hieroglyphs. This script had no characters representing vowels. Originally, it probably was a syllabary—a script where syllables are represented with characters—with symbols that were not needed being removed. The best-attested Bronze Age alphabet is
Ugaritic Ugaritic () is an extinct Northwest Semitic languages, Northwest Semitic language known through the Ugaritic texts discovered by French archaeology, archaeologists in 1928 at Ugarit, including several major literary texts, notably the Baal cycl ...
, invented in
Ugarit Ugarit (; , ''ủgrt'' /ʾUgarītu/) was an ancient port city in northern Syria about 10 kilometers north of modern Latakia. At its height it ruled an area roughly equivalent to the modern Latakia Governorate. It was discovered by accident in 19 ...
before the 15th century BC. This was an alphabetic
cuneiform Cuneiform is a Logogram, logo-Syllabary, syllabic writing system that was used to write several languages of the Ancient Near East. The script was in active use from the early Bronze Age until the beginning of the Common Era. Cuneiform script ...
script with 30 signs, including three that indicate the following vowel. This script was not used after the destruction of Ugarit in 1178 BC. The Proto-Sinaitic script eventually developed into the Phoenician alphabet, conventionally called
Proto-Canaanite Proto-Canaanite is the name given to: # The Proto-Sinaitic script when found in Canaan, dating to about the 17th century BC and later. # A hypothetical ancestor of the Phoenician script before some cut-off date, typically 1050 BC, with an undefin ...
, before . The oldest text in Phoenician script is an inscription on the sarcophagus of King Ahiram . This script is the parent script of all western alphabets. By the 10th century BC, two other forms distinguish themselves, Canaanite and
Aramaic Aramaic (; ) is a Northwest Semitic language that originated in the ancient region of Syria and quickly spread to Mesopotamia, the southern Levant, Sinai, southeastern Anatolia, and Eastern Arabia, where it has been continually written a ...
. The Aramaic gave rise to the
Hebrew alphabet The Hebrew alphabet (, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is a unicase, unicameral abjad script used in the writing of the Hebrew language and other Jewish languages, most notably ...
. The
South Arabian alphabet The Ancient South Arabian script (Old South Arabian: ; modern ) branched from the Proto-Sinaitic script in about the late 2nd millennium BCE, and remained in use through the late sixth century CE. It is an abjad, a writing system where only con ...
, a sister script to the Phoenician alphabet, is the script from which the
Geʽez script Geʽez ( ; , ) is a script used as an abugida (alphasyllabary) for several Afroasiatic languages, Afro-Asiatic and Nilo-Saharan languages, Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an abjad (consonantal alphabet) and was ...
was descended. Abugidas are writing systems with characters comprising consonant–vowel sequences. Alphabets without obligatory vowels are called ''
abjad An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
s'', with examples being Arabic, Hebrew, and Syriac. The omission of vowels was not always a satisfactory solution due to the need of preserving sacred texts. "Weak" consonants are used to indicate vowels. These letters have a dual function since they can also be used as pure consonants. The Proto-Sinaitic script and the Ugaritic script were the first scripts with a limited number of signs instead of using many different signs for words, in contrast to cuneiform, Egyptian hieroglyphs, and
Linear B Linear B is a syllabary, syllabic script that was used for writing in Mycenaean Greek, the earliest Attested language, attested form of the Greek language. The script predates the Greek alphabet by several centuries, the earliest known examp ...
. The Phoenician script was probably the first phonemic script, and it contained only about two dozen distinct letters, making it a script simple enough for traders to learn. Another advantage of the Phoenician alphabet was that it could write different languages since it recorded words phonemically. The Phoenician script was spread across the Mediterranean by the Phoenicians. The Greek alphabet was the first in which vowels had independent letterforms separate from those of consonants. The Greeks chose letters representing sounds that did not exist in Greek to represent vowels. The
Linear B Linear B is a syllabary, syllabic script that was used for writing in Mycenaean Greek, the earliest Attested language, attested form of the Greek language. The script predates the Greek alphabet by several centuries, the earliest known examp ...
syllabary, used by Mycenaean Greeks from the 16th century BC, had 87 symbols, including five vowels. In its early years, there were many variants of the Greek alphabet, causing many different alphabets to evolve from it.


European alphabets

The Greek alphabet, in Euboean form, was carried over by Greek colonists to the Italian peninsula giving rise to many different alphabets used to write the
Italic languages The Italic languages form a branch of the Indo-European languages, Indo-European language family, whose earliest known members were spoken on the Italian Peninsula in the first millennium BC. The most important of the ancient Italic languages ...
, like the
Etruscan alphabet The Etruscan alphabet was used by the Etruscans, an ancient civilization of central and northern Italy, to write Etruscan language, their language, from about 700 BC to sometime around 100 AD. The Etruscan alphabet derives from the Euboean alpha ...
. One of these became the Latin alphabet, which spread across Europe as the Romans expanded their republic. After the fall of the
Western Roman Empire In modern historiography, the Western Roman Empire was the western provinces of the Roman Empire, collectively, during any period in which they were administered separately from the eastern provinces by a separate, independent imperial court. ...
, the alphabet survived in intellectual and religious works. It came to be used for the
Romance languages The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
that descended from Latin and most of the other languages of western and central Europe. Today, it is the most widely used script in the world. The Etruscan alphabet remained nearly unchanged for several hundred years. Only evolving once the
Etruscan language Etruscan ( ) was the language of the Etruscan civilization in the ancient region of Etruria, in Etruria Padana and Etruria Campana in what is now Italy. Etruscan influenced Latin but was eventually superseded by it. Around 13,000 Etruscan epigraph ...
changed itself. The letters used for non-existent phonemes were dropped. Afterwards, however, the alphabet went through many different changes. The final classical form of Etruscan contained 20 letters. Four of them are vowels——six fewer letters than the earlier forms. The script in its classical form was used until the 1st century AD. The Etruscan language itself was not used during the
Roman Empire The Roman Empire ruled the Mediterranean and much of Europe, Western Asia and North Africa. The Roman people, Romans conquered most of this during the Roman Republic, Republic, and it was ruled by emperors following Octavian's assumption of ...
, but the script was used for religious texts. Some adaptations of the Latin alphabet have ligatures, a combination of two letters make one, such as æ in Danish and Icelandic and in Algonquian; borrowings from other alphabets, such as the thorn in
Old English Old English ( or , or ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the Early Middle Ages. It developed from the languages brought to Great Britain by Anglo-S ...
and Icelandic, which came from the Futhark runes; and modified existing letters, such as the eth of Old English and Icelandic, which is a modified ''d''. Other alphabets only use a subset of the Latin alphabet, such as Hawaiian and Italian, which uses the letters ''j, k, x, y,'' and ''w'' only in foreign words. Another notable script is
Elder Futhark The Elder Futhark (or Fuþark, ), also known as the Older Futhark, Old Futhark, or Germanic Futhark, is the oldest form of the runic alphabets. It was a writing system used by Germanic peoples for Northwest Germanic dialects in the Migration Per ...
, believed to have evolved out of one of the
Old Italic alphabet The Old Italic scripts are a family of ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which was the i ...
s. Elder Futhark gave rise to other alphabets known collectively as the
Runic alphabet Runes are the Letter (alphabet), letters in a set of related alphabets, known as runic rows, runic alphabets or futharks (also, see ''#Futharks, futhark'' vs ''#Runic alphabets, runic alphabet''), native to the Germanic peoples. Runes were ...
s. The Runic alphabets were used for Germanic languages from 100 AD to the late Middle Ages, being engraved on stone and jewelry, although inscriptions found on bone and wood occasionally appear. These alphabets have since been replaced with the Latin alphabet. The exception was for decorative use, where the runes remained in use until the 20th century. The
Old Hungarian script The Old Hungarian script or Hungarian runes (, 'székely-magyar runiform', or ) is an alphabetic writing system used for writing the Hungarian language. Modern Hungarian is written using the Latin-based Hungarian alphabet. The term "old" refers ...
was the writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the 19th century, it once again became more and more popular. The
Glagolitic alphabet The Glagolitic script ( , , ''glagolitsa'') is the oldest known Slavic alphabet. It is generally agreed that it was created in the 9th century for the purpose of translating liturgical texts into Old Church Slavonic by Saints Cyril and Methodi ...
was the initial script of the liturgical language
Old Church Slavonic Old Church Slavonic or Old Slavonic ( ) is the first Slavic languages, Slavic literary language and the oldest extant written Slavonic language attested in literary sources. It belongs to the South Slavic languages, South Slavic subgroup of the ...
and became, together with the Greek uncial script, the basis of the
Cyrillic script The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, Mongolic, Uralic languages, Uralic, C ...
. Cyrillic is one of the most widely used modern alphabetic scripts and is notable for its use in Slavic languages and also for other languages within the former
Soviet Union The Union of Soviet Socialist Republics. (USSR), commonly known as the Soviet Union, was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 until Dissolution of the Soviet ...
.
Cyrillic alphabets Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the theologians Saints Cyril and Methodius, Cyril and Methodi ...
include Serbian, Macedonian, Bulgarian, Russian, Belarusian, and Ukrainian. The Glagolitic alphabet is believed to have been created by
Saints Cyril and Methodius Cyril (; born Constantine, 826–869) and Methodius (; born Michael, 815–885) were brothers, Byzantine Christian theologians and missionaries. For their work evangelizing the Slavs, they are known as the "Apostles to the Slavs". They are ...
, while the Cyrillic alphabet was created by a circle of their disciples in the Preslav Literary School including Naum of Preslav, Constantine of Preslav, Chernorizets Hrabar among others. They feature many letters that appear to have been borrowed from or influenced by Greek and Hebrew.


Asian alphabets

Many phonetic scripts exist in Asia. The
Arabic alphabet The Arabic alphabet, or the Arabic abjad, is the Arabic script as specifically codified for writing the Arabic language. It is a unicase, unicameral script written from right-to-left in a cursive style, and includes 28 letters, of which most ...
,
Hebrew alphabet The Hebrew alphabet (, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is a unicase, unicameral abjad script used in the writing of the Hebrew language and other Jewish languages, most notably ...
,
Syriac alphabet The Syriac alphabet ( ) is a writing system primarily used to write the Syriac language since the 1st century. It is one of the Semitic languages, Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares sim ...
, and other
abjad An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
s of the Middle East are developments of the
Aramaic alphabet The ancient Aramaic alphabet was used to write the Aramaic languages spoken by ancient Aramean pre-Christian peoples throughout the Fertile Crescent. It was also adopted by other peoples as their own alphabet when empires and their subjects und ...
. Most alphabetic scripts of India and Eastern Asia descend from the
Brahmi script Brahmi ( ; ; ISO 15919, ISO: ''Brāhmī'') is a writing system from ancient India. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as ...
, believed to be a descendant of Aramaic. European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad, as with
Urdu Urdu (; , , ) is an Indo-Aryan languages, Indo-Aryan language spoken chiefly in South Asia. It is the Languages of Pakistan, national language and ''lingua franca'' of Pakistan. In India, it is an Eighth Schedule to the Constitution of Indi ...
and Persian, and sometimes as a complete alphabet, as with Kurdish and Uyghur.


Other alphabets


Hangul

In
Korea Korea is a peninsular region in East Asia consisting of the Korean Peninsula, Jeju Island, and smaller islands. Since the end of World War II in 1945, it has been politically Division of Korea, divided at or near the 38th parallel north, 3 ...
,
Sejong the Great Sejong (; 15 May 1397 – 8 April 1450), commonly known as Sejong the Great (), was the fourth monarch of the Joseon, Joseon dynasty of Korea. He is regarded as the greatest ruler in Korean history, and is remembered as the inventor of Hangu ...
created the
Hangul The Korean alphabet is the modern writing system for the Korean language. In North Korea, the alphabet is known as (), and in South Korea, it is known as (). The letters for the five basic consonants reflect the shape of the speech organs ...
alphabet in 1443. Hangul is a unique alphabet: it is a featural alphabet, where the design of many of the letters comes from a sound's place of articulation, like P looking like the widened mouth and L looking like the tongue pulled in. The creation of Hangul was planned by the government of the day, and it places individual letters in syllable clusters with equal dimensions, in the same way as
Chinese characters Chinese characters are logographs used Written Chinese, to write the Chinese languages and others from regions historically influenced by Chinese culture. Of the four independently invented writing systems accepted by scholars, they represe ...
. This change allows for mixed-script writing, where one syllable always takes up one type space no matter how many letters get stacked into building that one sound-block.


Bopomofo

Bopomofo Bopomofo, also called Zhuyin Fuhao ( ; ), or simply Zhuyin, is a Chinese transliteration, transliteration system for Standard Chinese and other Sinitic languages. It is the principal method of teaching Chinese Mandarin pronunciation in Taiwa ...
, also referred to as ''zhuyin'', is a semi-syllabary used primarily in
Taiwan Taiwan, officially the Republic of China (ROC), is a country in East Asia. The main geography of Taiwan, island of Taiwan, also known as ''Formosa'', lies between the East China Sea, East and South China Seas in the northwestern Pacific Ocea ...
to transcribe the sounds of
Standard Chinese Standard Chinese ( zh, s=现代标准汉语, t=現代標準漢語, p=Xiàndài biāozhǔn hànyǔ, l=modern standard Han speech) is a modern standard form of Mandarin Chinese that was first codified during the republican era (1912–1949). ...
. Following the proclamation of the People's Republic of China in 1949 and its adoption of
Hanyu Pinyin Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means ' Han language'—that is, the Chinese language—while ''pinyin'' literally ...
in 1956, the use of bopomofo on the mainland is limited. Bopomofo developed from a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet, the phonemes of syllable initials are represented by individual symbols, but like a syllabary, the phonemes of the syllable finals are not; each possible final (excluding the medial glide) has its own character, an example being ''luan'' written as ㄌㄨㄢ (''l-u-an''). The last symbol ㄢ takes place as the entire final ''-an''. While bopomofo is not a mainstream writing system, it is still often used in ways similar to a
romanization In linguistics, romanization is the conversion of text from a different writing system to the Latin script, Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and tra ...
system, for aiding pronunciation and as an input method for Chinese characters on computers and cellphones.


Types

The term "alphabet" is used by linguists and paleographers in both a wide and a narrow sense. In a broader sense, an alphabet is a ''segmental'' script at the
phoneme A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
level—that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script,
abjad An abjad ( or abgad) is a writing system in which only consonants are represented, leaving the vowel sounds to be inferred by the reader. This contrasts with alphabets, which provide graphemes for both consonants and vowels. The term was introd ...
s, and
abugida An abugida (; from Geʽez: , )sometimes also called alphasyllabary, neosyllabary, or pseudo-alphabetis a segmental Writing systems#Segmental writing system, writing system in which consonant–vowel sequences are written as units; each unit ...
s. These three differ in how they treat vowels. Abjads have letters for consonants and leave most vowels unexpressed. Abugidas are also consonant-based but indicate vowels with
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s, a systematic graphic modification of the consonants. The earliest known alphabet using this sense is the Wadi el-Hol script, believed to be an abjad. Its successor, Phoenician, is the ancestor of modern alphabets, including
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
,
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
,
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
(via the
Old Italic alphabet The Old Italic scripts are a family of ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which was the i ...
),
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
(via the Greek alphabet), and
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
(via
Aramaic Aramaic (; ) is a Northwest Semitic language that originated in the ancient region of Syria and quickly spread to Mesopotamia, the southern Levant, Sinai, southeastern Anatolia, and Eastern Arabia, where it has been continually written a ...
). Examples of present-day abjads are the
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
and
Hebrew script The Hebrew alphabet (, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is a unicase, unicameral abjad script used in the writing of the Hebrew language and other Jewish languages, most notably ...
s; true alphabets include
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, Cyrillic, and Korean hangul; and abugidas, used to write Tigrinya,
Amharic Amharic is an Ethio-Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amhara people, and also serves as a lingua franca for all other metropolitan populati ...
,
Hindi Modern Standard Hindi (, ), commonly referred to as Hindi, is the Standard language, standardised variety of the Hindustani language written in the Devanagari script. It is an official language of India, official language of the Government ...
, and Thai. The
Canadian Aboriginal syllabics Canadian syllabic writing, or simply syllabics, is a family of writing systems used in a number of indigenous Canadian languages of the Algonquian languages, Algonquian, Eskimo–Aleut languages, Inuit, and (formerly) Athabaskan languages, A ...
are also an abugida, rather than a syllabary, as their name would imply, because each glyph stands for a consonant and is modified by rotation to represent the following vowel. In a true syllabary, each consonant-vowel combination gets represented by a separate glyph. All three types may be augmented with syllabic glyphs.
Ugaritic Ugaritic () is an extinct Northwest Semitic languages, Northwest Semitic language known through the Ugaritic texts discovered by French archaeology, archaeologists in 1928 at Ugarit, including several major literary texts, notably the Baal cycl ...
, for example, is essentially an abjad but has syllabic letters for These are the only times that vowels are indicated. Coptic has a letter for .
Devanagari Devanagari ( ; in script: , , ) is an Indic script used in the Indian subcontinent. It is a left-to-right abugida (a type of segmental Writing systems#Segmental systems: alphabets, writing system), based on the ancient ''Brāhmī script, Brā ...
is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a
zero consonant In orthography, a zero consonant, silent initial, or null-onset letter is a consonant letter that does not correspond to a consonant sound, but is required when a word or syllable starts with a vowel (i.e. has a null onset). Some abjads, abugid ...
as the graphic base for such vowels. The boundaries between the three types of segmental scripts are not always clear-cut. For example,
Sorani Central Kurdish, also known as Sorani Kurdish, is a Kurdish dialect or a language spoken in Iraq, mainly in Iraqi Kurdistan, as well as the provinces of Kurdistan, Kermanshah, and West Azerbaijan in western Iran. Central Kurdish is one of the ...
Kurdish is written in the
Arabic script The Arabic script is the writing system used for Arabic (Arabic alphabet) and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world (after the Latin script), the second-most widel ...
, which, when used for other languages, is an abjad. In Kurdish, writing the vowels is mandatory, and whole letters are used, so the script is a true alphabet. Other languages may use a Semitic abjad with forced vowel diacritics, effectively making them abugidas. On the other hand, the
ʼPhags-pa script The Phagspa ( ), Phags-pa or ḥPags-pa script is an alphabet designed by the Tibetan monk and State Preceptor (later Imperial Preceptor) Drogön Chögyal Phagpa (1235–1280) for Kublai Khan (), the founder of the Yuan dynasty (1271–1 ...
of the
Mongol Empire The Mongol Empire was the List of largest empires, largest contiguous empire in human history, history. Originating in present-day Mongolia in East Asia, the Mongol Empire at its height stretched from the Sea of Japan to parts of Eastern Euro ...
was based closely on the Tibetan abugida, but vowel marks are written after the preceding consonant rather than as diacritic marks. Although short ''a'' is not written, as in the Indic abugidas, The source of the term "abugida", namely the
Geʽez script Geʽez ( ; , ) is a script used as an abugida (alphasyllabary) for several Afroasiatic languages, Afro-Asiatic and Nilo-Saharan languages, Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an abjad (consonantal alphabet) and was ...
now used for
Amharic Amharic is an Ethio-Semitic language, which is a subgrouping within the Semitic branch of the Afroasiatic languages. It is spoken as a first language by the Amhara people, and also serves as a lingua franca for all other metropolitan populati ...
and Tigrinya, has assimilated into their consonant modifications. It is no longer systematic and must be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became
logographic In a written language, a logogram (from Ancient Greek 'word', and 'that which is drawn or written'), also logograph or lexigraph, is a written character that represents a semantic component of a language, such as a word or morpheme. Chinese c ...
. Thus the primary categorisation of alphabets reflects how they treat vowels. For tonal languages, further classification can be based on their treatment of tone. Though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load, as in Somali and many other languages of Africa and the Americas. Most commonly, tones are indicated by diacritics, which is how vowels are treated in abugidas, which is the case for Vietnamese (a true alphabet) and Thai (an abugida). In Thai, the tone is determined primarily by a consonant, with diacritics for disambiguation. In the
Pollard script The Pollard script, also known as Pollard Miao () or Miao, is an abugida loosely based on the Latin alphabet and invented by Methodist missionary Sam Pollard. Pollard invented the script for use with A-Hmao, one of several Miao languages spoke ...
, an abugida, vowels are indicated by diacritics. The placing of the diacritic relative to the consonant is modified to indicate the tone. More rarely, a script may have separate letters for tones, as is the case for Hmong and Zhuang. For many, regardless of whether letters or diacritics get used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas. In Zhuyin, not only is one of the tones unmarked; but there is a diacritic to indicate a lack of tone, like the
virama Virama ( ्, ) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either # halanta, hasanta or explicit vir ...
of Indic.


Alphabetical order

Alphabets often come to be associated with a standard ordering of their letters; this is for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fi ...
—namely, for listing words and other items in ''
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is ...
''.


Latin alphabets

The ordering of the
Latin alphabet The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
( A B C D E F G H I J K L M N O P Q R S T U V W X Y Z), which derives from the Northwest Semitic "Abgad" order, is already well established. Although, languages using this alphabet have different conventions for their treatment of modified letters (such as the French ''é'', ''à'', and ''ô'') and certain combinations of letters ( multigraphs). In French, these are not considered to be additional letters for collation. However, in Icelandic, the accented letters such as ''á'', ''í'', and ''ö'' are considered distinct letters representing different vowel sounds from sounds represented by their unaccented counterparts. In Spanish, ''ñ'' is considered a separate letter, but accented vowels such as ''á'' and ''é'' are not. The ''ll'' and ''ch'' were also formerly considered single letters and sorted separately after ''l'' and ''c'', but in 1994, the tenth congress of the
Association of Spanish Language Academies The Association of Academies of the Spanish Language (; ASALE) is an entity whose end is to work for the unity, integrity, and growth of the Spanish language. It was created in Mexico in 1951 and represents the union of all the separate academies ...
changed the collating order so that ''ll'' came to be sorted between ''lk'' and ''lm'' in the dictionary and ''ch'' came to be sorted between ''cg'' and ''ci''; those digraphs were still formally designated as letters, but in 2010 the changed it, so they are no longer considered letters at all. In German, words starting with ''sch-'' (which spells the German phoneme ) are inserted between words with initial ''sca-'' and ''sci-'' (all incidentally loanwords) instead of appearing after the initial ''sz'', as though it were a single letter, which contrasts several languages such as Albanian, in which ''dh-'', ''ë-'', ''gj-'', ''ll-'', ''rr-'', ''th-'', ''xh-,'' and ''zh-,'' which all represent phonemes and considered separate single letters, would follow the letters respectively, as well as Hungarian and Welsh. Further, German words with an umlaut get collated ignoring the umlaut as—contrary to Turkish, which adopted the
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived from Ancient Greek ('write'), and the suffix ''-eme'' by analogy with ''phoneme'' and other emic units. The study of graphemes ...
s ö and ü, and where a word like ''tüfek'' would come after ''tuz'', in the dictionary. An exception is the German telephone directory, where umlauts are sorted like ''ä''=''ae'' since names such as ''Jäger'' also appear with the spelling ''Jaeger'' and are not distinguished in the spoken language. The Danish and Norwegian alphabets end with , whereas the Swedish conventionally put at the end. However, phonetically corresponds with , as does and .


Early alphabets

It is unknown whether the earliest alphabets had a defined sequence. Some alphabets today, such as the Hanuno'o script, are learned one letter at a time, in no particular order, and are not used for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fi ...
where a definite order is required. However, a dozen
Ugaritic Ugaritic () is an extinct Northwest Semitic languages, Northwest Semitic language known through the Ugaritic texts discovered by French archaeology, archaeologists in 1928 at Ugarit, including several major literary texts, notably the Baal cycl ...
tablets from the 14th century BC preserve the alphabet in two sequences. One, the ''ABCDE'' order later used in Phoenician, has continued with minor changes in
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
,
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
,
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
, Gothic,
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
, and
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
; the other, ''HMĦLQ,'' was used in southern Arabia and is preserved today in
Geʽez Geez ( or ; , and sometimes referred to in scholarly literature as Classical Ethiopic) is an ancient South Semitic language. The language originates from what is now Ethiopia and Eritrea. Today, Geez is used as the main liturgical langu ...
. Both orders have therefore been stable for at least 3000 years.
Runic Runes are the letters in a set of related alphabets, known as runic rows, runic alphabets or futharks (also, see '' futhark'' vs ''runic alphabet''), native to the Germanic peoples. Runes were primarily used to represent a sound value (a ...
used an unrelated Futhark sequence, which got simplified later on.
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
usually uses its sequence, although Arabic retains the traditional
abjadi order The Abjad numerals, also called Hisab al-Jummal (, ), are a decimal alphabetic numeral system/ alphanumeric code, in which the 28 letters of the Arabic alphabet are assigned numerical values. They have been used in the Arabic-speaking world si ...
, which is used for numbers. The
Brahmic family The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout South Asia, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India and are used b ...
of alphabets used in India uses a unique order based on
phonology Phonology (formerly also phonemics or phonematics: "phonemics ''n.'' 'obsolescent''1. Any procedure for identifying the phonemes of a language from a corpus of data. 2. (formerly also phonematics) A former synonym for phonology, often pre ...
: The letters are arranged according to how and where the sounds get produced in the mouth. This organization is present in Southeast Asia, Tibet, Korean
hangul The Korean alphabet is the modern writing system for the Korean language. In North Korea, the alphabet is known as (), and in South Korea, it is known as (). The letters for the five basic consonants reflect the shape of the speech organs ...
, and even Japanese
kana are syllabary, syllabaries used to write Japanese phonology, Japanese phonological units, Mora (linguistics), morae. In current usage, ''kana'' most commonly refers to ''hiragana'' and ''katakana''. It can also refer to their ancestor , wh ...
, which is not an alphabet.


Acrophony

In Phoenician, each letter got associated with a word that begins with that sound. This is called
acrophony Acrophony (; + 'sound') is the naming of letters of an alphabetic writing system so that a letter's name begins with the letter itself. For example, Greek letter names are acrophonic: the names of the letters α, β, γ, δ, are spelled with t ...
and is continuously used to varying degrees in
Samaritan Samaritans (; ; ; ), are an ethnoreligious group originating from the Hebrews and Israelites of the ancient Near East. They are indigenous to Samaria, a historical region of History of ancient Israel and Judah, ancient Israel and Judah that ...
,
Aramaic Aramaic (; ) is a Northwest Semitic language that originated in the ancient region of Syria and quickly spread to Mesopotamia, the southern Levant, Sinai, southeastern Anatolia, and Eastern Arabia, where it has been continually written a ...
, Syriac,
Hebrew Hebrew (; ''ʿÎbrit'') is a Northwest Semitic languages, Northwest Semitic language within the Afroasiatic languages, Afroasiatic language family. A regional dialect of the Canaanite languages, it was natively spoken by the Israelites and ...
,
Greek Greek may refer to: Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group *Greek language, a branch of the Indo-European language family **Proto-Greek language, the assumed last common ancestor of all kno ...
, and
Arabic Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns lang ...
. Acrophony was abandoned in
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
. It referred to the letters by adding a vowel—usually , sometimes or —before or after the consonant. Two exceptions were Y and Z, which were borrowed from the Greek alphabet rather than Etruscan. They were known as ''Y Graeca'' "Greek Y" and ''zeta'' (from Greek)—this discrepancy was inherited by many European languages, as in the term ''zed'' for Z in all forms of English, other than American English. Over time names sometimes shifted or were added, as in ''double U'' for W, or "double V" in French, the English name for Y, and the American ''zee'' for Z. Comparing them in English and French gives a clear reflection of the
Great Vowel Shift The Great Vowel Shift was a series of English phonology, pronunciation changes in the vowels of the English language that took place primarily between the 1400s and 1600s (the transition period from Middle English to Early Modern English), begi ...
: A, B, C, and D are pronounced in today's English, but in contemporary French they are . The French names (from which the English names got derived) preserve the qualities of the English vowels before the Great Vowel Shift. By contrast, the names of F, L, M, N, and S () remain the same in both languages because "short" vowels were largely unaffected by the Shift. In Cyrillic, originally, acrophony was present using Slavic words. The first three words going, azŭ, buky, vědě, with the Cyrillic collation order being, А, Б, В. However, this was later abandoned in favor of a system similar to Latin.


Orthography and pronunciation

When an alphabet is adopted or developed to represent a given language, an
orthography An orthography is a set of convention (norm), conventions for writing a language, including norms of spelling, punctuation, Word#Word boundaries, word boundaries, capitalization, hyphenation, and Emphasis (typography), emphasis. Most national ...
generally comes into being, providing rules for
spelling Spelling is a set of conventions for written language regarding how graphemes should correspond to the sounds of spoken language. Spelling is one of the elements of orthography, and highly standardized spelling is a prescriptive element. Spelli ...
words, following the principle on which alphabets get based. These rules will map letters of the alphabet to the
phoneme A phoneme () is any set of similar Phone (phonetics), speech sounds that are perceptually regarded by the speakers of a language as a single basic sound—a smallest possible Phonetics, phonetic unit—that helps distinguish one word fr ...
s of the spoken language. In a perfectly
phonemic orthography A phonemic orthography is an orthography (system for writing a language) in which the graphemes (written symbols) correspond consistently to the language's phonemes (the smallest units of speech that can differentiate words), or more generally ...
, there would be a consistent one-to-one correspondence between the letters and the phonemes so that a writer could predict the spelling of a word given its pronunciation, and a speaker would always know the pronunciation of a word given its spelling, and vice versa. However, this ideal is usually never achieved in practice. Languages can come close to it, such as Spanish and Finnish. Others, such as English, deviate from it to a much larger degree. The pronunciation of a language often evolves independently of its writing system. Writing systems have been borrowed for languages the orthography was not initially made to use. The degree to which letters of an alphabet correspond to phonemes of a language varies. Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways: * A language may represent a given phoneme by combinations of letters rather than just a single letter. Two-letter combinations are called digraphs, and three-letter groups are called trigraphs. German uses the
tetragraph A tetragraph, , is a sequence of four letters used to represent a single sound (phoneme), or a combination of sounds, that do not necessarily correspond to the individual values of the letters. In German, for example, the tetragraph ''tsch'' repre ...
s (four letters) "tsch" for the phoneme and (in a few borrowed words) "dsch" for . Kabardian also uses a tetragraph for one of its phonemes, namely "кхъу." Two letters representing one sound occur in several instances in Hungarian as well (where, for instance, ''cs'' stands for ʃ ''sz'' for ''zs'' for ''dzs'' for ʒ. * A language may represent the same phoneme with two or more different letters or combinations of letters. An example is
modern Greek Modern Greek (, or , ), generally referred to by speakers simply as Greek (, ), refers collectively to the dialects of the Greek language spoken in the modern era, including the official standardized form of the language sometimes referred to ...
which may write the phoneme in six different ways: , , , , , and . * A language may spell some words with unpronounced letters that exist for historical or other reasons. For example, the spelling of the Thai word for 'beer' retains a letter for the final consonant /r/ present in the English word it borrows, but silences it. * Pronunciation of individual words may change according to the presence of surrounding words in a sentence, for example, in
sandhi Sandhi ( ; , ) is any of a wide variety of sound changes that occur at morpheme or word boundaries. Examples include fusion of sounds across word boundaries and the alteration of one sound depending on nearby sounds or the grammatical function o ...
. * Different dialects of a language may use different phonemes for the same word. * A language may use different sets of symbols or rules for distinct vocabulary items, typically for foreign words, such as in the Japanese
katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived fr ...
syllabary is used for foreign words, and there are rules in English for using loanwords from other languages. National languages sometimes elect to address the problem of dialects by associating the alphabet with the national standard. Some national languages like Finnish,
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian diaspora, Armenian communities around the ...
, Turkish, Russian,
Serbo-Croatian Serbo-Croatian ( / ), also known as Bosnian-Croatian-Montenegrin-Serbian (BCMS), is a South Slavic language and the primary language of Serbia, Croatia, Bosnia and Herzegovina, and Montenegro. It is a pluricentric language with four mutually i ...
( Serbian, Croatian, and Bosnian), and Bulgarian have a very regular spelling system with nearly one-to-one correspondence between letters and phonemes. Similarly, the Italian verb corresponding to 'spell (out),' ''compitare'', is unknown to many Italians because spelling is usually trivial, as Italian spelling is highly phonemic. In standard Spanish, one can tell the pronunciation of a word from its spelling, but not vice versa, as phonemes sometimes can be represented in more than one way, but a given letter is consistently pronounced. French using
silent letter In an alphabetic writing system, a silent letter is a letter that, in a particular word, does not correspond to any sound in the word's pronunciation. In linguistics, a silent letter is often symbolised with a null sign , which resembles the ...
s,
nasal vowel A nasal vowel is a vowel that is produced with a lowering of the soft palate (or velum) so that the air flow escapes through the nose and the mouth simultaneously, as in the French vowel /ɑ̃/ () or Amoy []. By contrast, oral vowels are p ...
s, and elision, may seem to lack much correspondence between the spelling and pronunciation. However, its rules on pronunciation, though complex, are consistent and predictable with a fair degree of accuracy. At the other extreme are languages such as English, where pronunciations mostly have to be memorized as they do not correspond to the spelling consistently. For English, this is because the
Great Vowel Shift The Great Vowel Shift was a series of English phonology, pronunciation changes in the vowels of the English language that took place primarily between the 1400s and 1600s (the transition period from Middle English to Early Modern English), begi ...
occurred after the orthography got established and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels. However, even English has general, albeit complex, rules that predict pronunciation from spelling. Rules like this are usually successful. However, rules to predict spelling from pronunciation have a higher failure rate. Sometimes, countries have the written language undergo a
spelling reform A spelling reform is a deliberate, often authoritatively sanctioned or mandated change to spelling rules. Proposals for such reform are fairly common, and over the years, many languages have undergone such reforms. Recent high-profile examples a ...
to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system. For example, Turkey switched from the Arabic alphabet to a Latin-based
Turkish alphabet The Turkish alphabet () is a Latin-script alphabet used for writing the Turkish language, consisting of 29 letters, seven of which ( Ç, Ğ, I, İ, Ö, Ş and Ü) have been modified from their Latin originals for the phonetic requirements o ...
, and Kazakh changed from an Arabic script to a Cyrillic script due to the Soviet Union's influence. In 2021, it made a transition to the Latin alphabet, similar to Turkish. The Cyrillic script used to be official in Uzbekistan and Turkmenistan before they switched to the Latin alphabet. Uzbekistan is reforming the alphabet to use diacritics on the letters that are marked by apostrophes and the letters that are digraphs. The standard system of symbols used by
linguist Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing the structure of sentences), semantics (meaning), Morphology (linguistics), morphology (structure of words), phonetics (speech sounds ...
s to represent sounds in any language, independently of orthography, is called the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
.


See also

*
Abecedarium An abecedarium (also known as an abecedary or ABCs or simply an ABC) is an inscription consisting of the letters of an alphabet, almost always listed in order. Typically, abecedaria (or abecedaries) are practice exercises. Non-Latin alphabet ...
* Alphabet book * Alphabet effect *
Fingerspelling Fingerspelling (or dactylology) is the representation of the letter (alphabet), letters of a writing system, and sometimes numeral systems, using only the hands. These manual alphabets (also known as finger alphabets or hand alphabets) have often ...
* Pangram * Letter symbolism


References


Bibliography

* * * * * * * * * * * * * * *


External links


"Language, Writing and Alphabet: An Interview with Christophe Rico"
''Damqātum 3'' (2007) *
Michael Everson Michael Everson (born January 1963) is an American and Irish linguistics, linguist, Character encoding, script encoder, typesetting, typesetter, type designer and Publishing, publisher. He runs a publishing company called Evertype, through which ...
'
Alphabets of Europe

How the Alphabet Was Born from Hieroglyphs
��Biblical Archaeology Review
An Early Hellenic Alphabet



The Alphabet
BBC Radio 4 discussion with Eleanor Robson, Alan Millard and Rosalind Thomas (''In Our Time'', 18 December 2003) {{Authority control Orthography