HOME

TheInfoList



OR:

An alphabet is a standardized set of basic written
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemics' ...
s (called
letters Letter, letters, or literature may refer to: Characters typeface * Letter (alphabet), a character representing one or more of the sounds used in speech; any of the symbols of an alphabet. * Letterform, the graphic form of a letter of the alphabe ...
) that represent the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
s of certain
spoken language A spoken language is a language produced by articulate sounds or (depending on one's definition) manual gestures, as opposed to a written language. An oral language or vocal language is a language produced with the vocal tract in contrast with a si ...
s. Not all
writing system A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable form ...
s represent language in this way; in a
syllabary In the linguistic study of written languages, a syllabary is a set of written symbols that represent the syllables or (more frequently) moras which make up words. A symbol in a syllabary, called a syllabogram, typically represents an (optiona ...
, each character represents a
syllable A syllable is a unit of organization for a sequence of speech sounds typically made up of a syllable nucleus (most often a vowel) with optional initial and final margins (typically, consonants). Syllables are often considered the phonological "bu ...
, and logographic systems use
characters Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
to represent words,
morpheme A morpheme is the smallest meaningful Constituent (linguistics), constituent of a linguistic expression. The field of linguistics, linguistic study dedicated to morphemes is called morphology (linguistics), morphology. In English, morphemes are ...
s, or other semantic units. The first fully phonemic script, the
Proto-Sinaitic script Proto-Sinaitic (also referred to as Sinaitic, Proto-Canaanite when found in Canaan, the North Semitic alphabet, or Early Alphabetic) is considered the earliest trace of alphabetic writing and the common ancestor of both the Ancient South Arabian ...
, later known as the
Phoenician alphabet The Phoenician alphabet is an alphabet (more specifically, an abjad) known in modern times from the Canaanite and Aramaic inscriptions found across the Mediterranean region. The name comes from the Phoenician civilization. The Phoenician alpha ...
, is considered to be the first alphabet and is the ancestor of most modern alphabets, including
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
,
Cyrillic , bg, кирилица , mk, кирилица , russian: кириллица , sr, ћирилица, uk, кирилиця , fam1 = Egyptian hieroglyphs , fam2 = Proto-Sinaitic , fam3 = Phoenician , fam4 = G ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, and possibly
Brahmic The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
. It was created by Semitic-speaking workers and slaves in the
Sinai Peninsula The Sinai Peninsula, or simply Sinai (now usually ) (, , cop, Ⲥⲓⲛⲁ), is a peninsula in Egypt, and the only part of the country located in Asia. It is between the Mediterranean Sea to the north and the Red Sea to the south, and is a l ...
(as the Proto-Sinaitic script), by selecting a small number of
hieroglyphs A hieroglyph (Greek for "sacred carvings") was a character of the ancient Egyptian writing system. Logographic scripts that are pictographic in form in a way reminiscent of ancient Egyptian are also sometimes called "hieroglyphs". In Neoplatonis ...
commonly seen in their Egyptian surroundings to describe the sounds, as opposed to the semantic values of the
Canaanite languages The Canaanite languages, or Canaanite dialects, are one of the three subgroups of the Northwest Semitic languages, the others being Aramaic and Ugaritic, all originating in the Levant and Mesopotamia. They are attested in Canaanite inscription ...
. However,
Peter T. Daniels Peter T. Daniels (born December 11, 1951) is a scholar of writing systems, specializing in typology. He was co-editor (with William Bright) of the book ''The World's Writing Systems'' (1996). He was a lecturer at University of Wisconsin–Milwauk ...
distinguishes an
abugida An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
, a set of graphemes that represent consonantal base letters that
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s modify to represent vowels (as in
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental Writing systems#Segmental syste ...
and other South Asian scripts), an
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
, in which letters predominantly or exclusively represent
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wit ...
s (as in the original Phoenician,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
or
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
), and an alphabet, a set of graphemes that represent both consonants and
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s. In this narrow sense of the word, the first true alphabet was the Greek alphabet, which was based on the earlier Phoenician alphabet. Of the dozens of alphabets in use today, the most popular is the Latin alphabet, which was derived from the Greek alphabet, and which is now used by many languages worldwide, often with the addition of extra letters or diacritical marks. While most alphabets have letters composed of lines, there are also exceptions such as the alphabets used in
Braille Braille (Pronounced: ) is a tactile writing system used by people who are visually impaired, including people who are Blindness, blind, Deafblindness, deafblind or who have low vision. It can be read either on Paper embossing, embossed paper ...
. Alphabets are usually associated with a standard ordering of letters. This makes them useful for purposes of
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fili ...
, specifically by allowing words to be sorted in
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
. It also means that their letters can be used as an alternative method of "numbering" ordered items, in such contexts as numbered lists and number placements.


Etymology

The English word ''alphabet'' came into
Middle English Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English p ...
from the
Late Latin Late Latin ( la, Latinitas serior) is the scholarly name for the form of Literary Latin of late antiquity.Roberts (1996), p. 537. English dictionary definitions of Late Latin date this period from the , and continuing into the 7th century in t ...
word ''alphabetum'', which in turn originated in the Greek ἀλφάβητος (''alphabētos''), it was made from the first two letters, ''
alpha Alpha (uppercase , lowercase ; grc, ἄλφα, ''álpha'', or ell, άλφα, álfa) is the first letter of the Greek alphabet. In the system of Greek numerals, it has a value of one. Alpha is derived from the Phoenician letter aleph , whic ...
'' (α) and ''
beta Beta (, ; uppercase , lowercase , or cursive ; grc, βῆτα, bē̂ta or ell, βήτα, víta) is the second letter of the Greek alphabet. In the system of Greek numerals, it has a value of 2. In Modern Greek, it represents the voiced labiod ...
'' (β). The names for the Greek letters came from the first two letters of the Phoenician alphabet; ''
aleph Aleph (or alef or alif, transliterated ʾ) is the first letter of the Semitic abjads, including Phoenician , Hebrew , Aramaic , Syriac , Arabic ʾ and North Arabian 𐪑. It also appears as South Arabian 𐩱 and Ge'ez . These letter ...
'', which also meant ''ox'', and ''
bet Black Entertainment Television (acronym BET) is an American basic cable channel targeting African-American audiences. It is owned by the CBS Entertainment Group unit of Paramount Global via BET Networks and has offices in New York City, Los A ...
'', which also meant ''house''.


History


Ancient Northeast African and Middle Eastern scripts

The history of the alphabet started in ancient Egypt. Egyptian writing had a set of some 24 hieroglyphs that are called uniliterals, representing syllables that begin with a single consonant of their language, plus a vowel (or no vowel) to be supplied by the native speaker. These glyphs were used as pronunciation guides for
logogram In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced ''hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, as ...
s, to write grammatical inflections, and, later, to transcribe loan words and foreign names. In the
Middle Bronze Age The Bronze Age is a historic period, lasting approximately from 3300 BC to 1200 BC, characterized by the use of bronze, the presence of writing in some areas, and other early features of urban civilization. The Bronze Age is the second pri ...
, an apparently "alphabetic" system known as the Proto-Sinaitic script appeared in Egyptian turquoise mines in the
Sinai peninsula The Sinai Peninsula, or simply Sinai (now usually ) (, , cop, Ⲥⲓⲛⲁ), is a peninsula in Egypt, and the only part of the country located in Asia. It is between the Mediterranean Sea to the north and the Red Sea to the south, and is a l ...
dated to circa the 15th century BCE, apparently left by Canaanite workers. In 1999, John and Deborah Darnell discovered an earlier version of this first alphabet at
Wadi el-Hol Wadi el-Hol is a valley on the Farshut Road, north-west of Luxor on the Qena Bend, situated on the west bank of the river Nile in Egypt. Rock inscriptions in the valley appear to show the oldest examples of phonetic alphabetic writing discovered ...
. Dating to circa 1800 BCE and showing evidence of having been adapted from specific forms of Egyptian hieroglyphs that could be dated to circa 2000 BCE, strongly suggesting that the first alphabet had developed about that time. Based on letter appearances and names, believed to be based on Egyptian hieroglyphs. This script had no characters representing vowels. Originally, it probably was a syllabary, with symbols that were not needed being removed. An alphabetic
cuneiform Cuneiform is a logo-syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge-sha ...
script with 30 signs, including three that indicate the following vowel invented in
Ugarit ) , image =Ugarit Corbel.jpg , image_size=300 , alt = , caption = Entrance to the Royal Palace of Ugarit , map_type = Near East#Syria , map_alt = , map_size = 300 , relief=yes , location = Latakia Governorate, Syria , region = F ...
before the 15th century BCE. This script was not used after the destruction of Ugarit. The Proto-Sinaitic script eventually developed into the Phoenician alphabet, conventionally called "Proto-Canaanite" before c. 1050 BCE. The oldest text in Phoenician script is an inscription on the sarcophagus of King
Ahiram The Ahiram sarcophagus (also spelled Ahirom, in Phoenician) was the sarcophagus of a Phoenician Kings of Byblos, King of Byblos (c. 850 BC), discovered in 1923 by the French excavator Pierre Montet in tomb V of the royal necropolis of Byblos. Th ...
. This script is the parent script of all western alphabets. By the tenth century, two other forms distinguish themselves, Canaanite and
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
. The Aramaic gave rise to the
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
script. The
South Arabian alphabet The Ancient South Arabian script (Old South Arabian 𐩣𐩯𐩬𐩵 ''ms3nd''; modern ar, الْمُسْنَد ''musnad'') branched from the Proto-Sinaitic script in about the late 2nd millennium BCE. It was used for writing the Old Sou ...
, a sister script to the Phoenician alphabet, is the script from which the Ge'ez alphabet (an
abugida An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
) descended. Vowel-less alphabets are called
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s, currently exemplified in others such as, Arabic, Hebrew, and
Syriac Syriac may refer to: *Syriac language, an ancient dialect of Middle Aramaic *Sureth, one of the modern dialects of Syriac spoken in the Nineveh Plains region * Syriac alphabet ** Syriac (Unicode block) ** Syriac Supplement * Neo-Aramaic languages a ...
. The omission of vowels was not always a satisfactory solution, and some "weak" consonants sometimes being used to indicate the vowel quality of a syllable). These letters have a dual function since they can also be used as pure consonants. The Proto-Sinaitic script and the Ugaritic script were the first scripts with a limited number of signs, in contrast to the other widely used writing systems at the time,
Cuneiform Cuneiform is a logo-syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge-sha ...
,
Egyptian hieroglyphs Egyptian hieroglyphs (, ) were the formal writing system used in Ancient Egypt, used for writing the Egyptian language. Hieroglyphs combined logographic, syllabic and alphabetic elements, with some 1,000 distinct characters.There were about 1,00 ...
, and
Linear B Linear B was a syllabic script used for writing in Mycenaean Greek, the earliest attested form of Greek. The script predates the Greek alphabet by several centuries. The oldest Mycenaean writing dates to about 1400 BC. It is descended from ...
. The Phoenician script was probably the first phonemic script, and it contained only about two dozen distinct letters, making it a script simple enough for traders to learn. Another advantage of Phoenician was that it could write different languages since it recorded words phonemically. The script was spread across the Mediterranean by the Phoenicians. In Greece, they added vowels to the alphabet, giving rise to the ancestor of all alphabets in the West. It was the first alphabet in which vowels have independent letter forms separate from those of consonants. The Greeks chose letters representing sounds that did not exist in Greek to represent vowels. Vowels are significant in the Greek language. The syllabical Linear B script that was used by the
Mycenaean Greeks Mycenaean Greece (or the Mycenaean civilization) was the last phase of the Bronze Age in Ancient Greece, spanning the period from approximately 1750 to 1050 BC.. It represents the first advanced and distinctively Greek civilization in mainland ...
from the 16th century BCE had 87 symbols, including five vowels. In its early years, there were many variants of the Greek alphabet, a situation that caused many different alphabets to evolve from it.


European alphabets

The Greek alphabet, in Euboean form, was carried over by Greek colonists to the Italian peninsula, giving rise to many different alphabets used to write the
Italic languages The Italic languages form a branch of the Indo-European language family, whose earliest known members were spoken on the Italian Peninsula in the first millennium BC. The most important of the ancient languages was Latin, the official languag ...
. One of these became the Latin alphabet, which spread across Europe as the Romans expanded their empire. Even after the fall of the Roman state, the alphabet survived in intellectual and religious works. It eventually became used for the descendant languages of Latin (the
Romance languages The Romance languages, sometimes referred to as Latin languages or Neo-Latin languages, are the various modern languages that evolved from Vulgar Latin. They are the only extant subgroup of the Italic languages in the Indo-European language fam ...
) and most of the other languages of western and central Europe. Some adaptations of the Latin alphabet have ligatures, such as æ in
Danish Danish may refer to: * Something of, from, or related to the country of Denmark People * A national or citizen of Denmark, also called a "Dane," see Demographics of Denmark * Culture of Denmark * Danish people or Danes, people with a Danish a ...
and Icelandic and Ȣ in Algonquian; borrowings from other alphabets, such as the
thorn Thorn(s) or The Thorn(s) may refer to: Botany * Thorns, spines, and prickles, sharp structures on plants * ''Crataegus monogyna'', or common hawthorn, a plant species Comics and literature * Rose and Thorn, the two personalities of two DC Com ...
þ in
Old English Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlement of Britain, Anglo ...
and Icelandic, which came from the
Futhark Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialis ...
runes; and modified existing letters, such as the
eth (colloquially) , former_name = eidgenössische polytechnische Schule , image = ETHZ.JPG , image_size = , established = , type = Public , budget = CHF 1.896 billion (2021) , rector = Günther Dissertori , president = Joël Mesot , a ...
ð of Old English and Icelandic, which is a modified ''d''. Other alphabets only use a subset of the Latin alphabet, such as Hawaiian, and
Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, an ethnic group or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance language *** Regional Ita ...
, which uses the letters ''j, k, x, y,'' and ''w'' only in foreign words. Another notable script is
Elder Futhark The Elder Futhark (or Fuþark), also known as the Older Futhark, Old Futhark, or Germanic Futhark, is the oldest form of the runic alphabets. It was a writing system used by Germanic peoples for Northwest Germanic dialects in the Migration Peri ...
, believed to have evolved out of one of the
Old Italic alphabet The Old Italic scripts are a family of similar ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which ...
s. Elder Futhark gave rise to other alphabets known collectively as the
Runic alphabet Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
s. The Runic alphabets were used for Germanic languages from 100 CE to the late Middle Ages being engraved on stone and jewelry, although inscriptions found on bone and wood occasionally appear. These alphabets have since gotten replaced with the Latin alphabet. The exception being for decorative use, where the runes remained in use until the 20th century. The
Old Hungarian script The Old Hungarian script or Hungarian runes ( hu, Székely-magyar rovás, 'székely-magyar runiform', or ) is an alphabetic writing system used for writing the Hungarian language. Modern Hungarian is written using the Latin-based Hungarian alph ...
was the writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the 19th century, it once again became more and more popular. The
Glagolitic alphabet The Glagolitic script (, , ''glagolitsa'') is the oldest known Slavic alphabet. It is generally agreed to have been created in the 9th century by Saint Cyril, a monk from Thessalonica. He and his brother Saint Methodius were sent by the Byzan ...
was the initial script of the liturgical language
Old Church Slavonic Old Church Slavonic or Old Slavonic () was the first Slavic languages, Slavic literary language. Historians credit the 9th-century Byzantine Empire, Byzantine missionaries Saints Cyril and Methodius with Standard language, standardizing the lan ...
and became, together with the Greek uncial script, the basis of the
Cyrillic script The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic languages, Slavic, Turkic languages, Turkic, Mongolic languages, ...
. Cyrillic is one of the most widely used modern alphabetic scripts, and is notable for its use in Slavic languages and also for other languages within the former
Soviet Union The Soviet Union,. officially the Union of Soviet Socialist Republics. (USSR),. was a transcontinental country that spanned much of Eurasia from 1922 to 1991. A flagship communist state, it was nominally a federal union of fifteen national ...
.
Cyrillic alphabets Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the Byzantine theologians Saints Cyril and Methodius, Cyril ...
include Serbian, Macedonian,
Bulgarian Bulgarian may refer to: * Something of, from, or related to the country of Bulgaria * Bulgarians, a South Slavic ethnic group * Bulgarian language, a Slavic language * Bulgarian alphabet * A citizen of Bulgaria, see Demographics of Bulgaria * Bul ...
,
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
, Belarusian, and
Ukrainian Ukrainian may refer to: * Something of, from, or related to Ukraine * Something relating to Ukrainians, an East Slavic people from Eastern Europe * Something relating to demographics of Ukraine in terms of demography and population of Ukraine * So ...
. The Glagolitic alphabet believed to have been created by
Saints Cyril and Methodius Cyril (born Constantine, 826–869) and Methodius (815–885) were two brothers and Byzantine Christian theologians and missionaries. For their work evangelizing the Slavs, they are known as the "Apostles to the Slavs". They are credited wit ...
, while the Cyrillic alphabet was invented by
Clement of Ohrid Saint Clement of Ohrid ( Bulgarian, Serbian and Macedonian: Свети Климент Охридски, ; el, Ἅγιος Κλήμης τῆς Ἀχρίδας; sk, svätý Kliment Ochridský; – 916) was one of the first medieval Bulgarian ...
, their disciple. They feature many letters that appear to have been borrowed from or influenced by Greek and Hebrew.


Asian alphabets

Beyond the logographic
Chinese writing Written Chinese () comprises Chinese characters used to represent the Chinese language. Chinese characters do not constitute an alphabet or a compact syllabary. Rather, the writing system is roughly logosyllabic; that is, a character generally r ...
, many phonetic scripts exist in Asia. The Arabic alphabet,
Hebrew alphabet The Hebrew alphabet ( he, wikt:אלפבית, אָלֶף־בֵּית עִבְרִי, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew languag ...
,
Syriac alphabet The Syriac alphabet ( ) is a writing system primarily used to write the Syriac language since the 1st century AD. It is one of the Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares similarities with ...
, and other
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s of the Middle East are developments of the
Aramaic alphabet The ancient Aramaic alphabet was adapted by Arameans from the Phoenician alphabet and became a distinct script by the 8th century BC. It was used to write the Aramaic languages spoken by ancient Aramean pre-Christian tribes throughout the Fertil ...
. Most alphabetic scripts of India and Eastern Asia descend from the
Brahmi script Brahmi (; ; ISO: ''Brāhmī'') is a writing system of ancient South Asia. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as 'lath' o ...
, believed to be a descendant of Aramaic.


Hangul

In
Korea Korea ( ko, 한국, or , ) is a peninsular region in East Asia. Since 1945, it has been divided at or near the 38th parallel, with North Korea (Democratic People's Republic of Korea) comprising its northern half and South Korea (Republic o ...
,
Sejong the Great Sejong of Joseon (15 May 1397 – 8 April 1450), personal name Yi Do (Korean: 이도; Hanja: 李祹), widely known as Sejong the Great (Korean: 세종대왕; Hanja: 世宗大王), was the fourth ruler of the Joseon dynasty of Korea. Initial ...
created the
Hangul The Korean alphabet, known as Hangul, . Hangul may also be written as following South Korea's standard Romanization. ( ) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The let ...
alphabet Hangul is a unique alphabet: it is a
featural alphabet In a featural writing system, the shapes of the symbols (such as letters) are not arbitrary but encode phonological features of the phonemes that they represent. The term featural was introduced by Geoffrey Sampson to describe the Korean alph ...
, where design of many of the letters comes from a sound's place of articulation (P to look like the widened mouth, L to look like the tongue pulled in); creation of Hangul was planned by the government of the day; and it places individual letters in syllable clusters with equal dimensions, in the same way as
Chinese characters Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji' ...
, to allow for mixed-script writing (one syllable always takes up one type-space no matter how many letters get stacked into building that one sound-block).


Zhuyin

Zhuyin Bopomofo (), or Mandarin Phonetic Symbols, also named Zhuyin (), is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe ...
(sometimes called ''Bopomofo'') is a semi-syllabary. It transcribes
Mandarin Mandarin or The Mandarin may refer to: Language * Mandarin Chinese, branch of Chinese originally spoken in northern parts of the country ** Standard Chinese or Modern Standard Mandarin, the official language of China ** Taiwanese Mandarin, Stand ...
phonetically in the Republic of China. After the later establishment of the People's Republic of China and its adoption of
Hanyu Pinyin Hanyu Pinyin (), often shortened to just pinyin, is the official romanization system for Standard Mandarin Chinese in China, and to some extent, in Singapore and Malaysia. It is often used to teach Mandarin, normally written in Chinese for ...
, the use of Zhuyin today is limited. However, it is still widely used in Taiwan, where the Republic of China governs. Zhuyin developed from a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet, the phonemes of syllable initials get represented by individual symbols, but like a syllabary, the phonemes of the syllable finals are not; each possible final (excluding the medial glide) has its own character, an example being, ''luan'' written as ㄌㄨㄢ (''l-u-an''). The last symbol ㄢ takes place as the entire final ''-an''. While Zhuyin is not a mainstream writing system, it is still often used in ways similar to a
romanization Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, and ...
system, for aiding pronunciation and as an input method for Chinese characters on computers and cellphones.


Romanization

European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad (as with
Urdu Urdu (;"Urdu"
''
Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
) and sometimes as a complete alphabet (as with
Kurdish Kurdish may refer to: *Kurds or Kurdish people *Kurdish languages *Kurdish alphabets *Kurdistan, the land of the Kurdish people which includes: **Southern Kurdistan **Eastern Kurdistan **Northern Kurdistan **Western Kurdistan See also * Kurd (dis ...
and Uyghur).


Types

The term "alphabet" gets used by
linguists Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguis ...
and
paleographer Palaeography ( UK) or paleography ( US; ultimately from grc-gre, , ''palaiós'', "old", and , ''gráphein'', "to write") is the study of historic writing systems and the deciphering and dating of historical manuscripts, including the analysi ...
s in both a wide and a narrow sense. In a larger sense, an alphabet is a ''segmental'' script at the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
level—that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script,
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s, and
abugida An abugida (, from Ge'ez language, Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental Writing systems#Segmental writing system, writing system in which consonant-vowel sequences are written as units; ...
s. These three differ in how they treat vowels. Abjads have letters for consonants and leave most vowels unexpressed. Abugidas are also consonant-based but indicate vowels with
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s, a systematic graphic modification of the consonants. In alphabets in the narrow sense, on the other hand, consonants and vowels are written as independent letters. The earliest known alphabet in the wider sense is the Middle Bronze Age alphabets, Wadi el-Hol script, believed to be an abjad. Its successor, Phoenician alphabet, Phoenician is the ancestor of modern alphabets, including
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
,
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
(via the
Old Italic alphabet The Old Italic scripts are a family of similar ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which ...
), Cyrillic (via the Greek alphabet), and
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
(via
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
). Examples of present-day abjads are the
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
and Hebrew scripts; true alphabets include Latin script, Latin, Cyrillic, and Korean hangul; and abugidas, used to write tigrinya language, Tigrinya, Amharic language, Amharic, Hindi, and Thai language, Thai. The Canadian Aboriginal syllabics are also an abugida rather than a syllabary as their name would imply, because each glyph stands for a consonant and is modified by rotation to represent the following vowel (in a true syllabary, each consonant-vowel combination gets represented by a separate glyph). All three types may be augmented with syllabic glyphs. Ugaritic script, Ugaritic, for example, is basically an abjad but has syllabic letters for (these are the only times that vowels are indicated). Coptic alphabet, Coptic has a letter for .
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental Writing systems#Segmental syste ...
is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a zero consonant as the graphic base for such vowels. The boundaries between the three types of segmental scripts are not always clear-cut. For example, Sorani Kurdish language, Kurdish is written in the Arabic script, which when used for other languages is an abjad. In Kurdish, writing the vowels is mandatory, and whole letters get used, so the script is a true alphabet. Other languages may use a Semitic abjad with forced vowel diacritics, effectively making them abugidas. On the other hand, the Phagspa script of the Mongol Empire was based closely on the Tibetan script, Tibetan abugida, but vowel marks are written after the preceding consonant rather than as diacritic marks. Although short ''a'' not getting written, as in the Indic abugidas, one could argue that the linear arrangement made this a true alphabet. Ironically, the original source of the term "abugida", namely the Ge'ez alphabet, Ge'ez abugida now used for Amharic language, Amharic and Tigrinya language, Tigrinya, has assimilated into their consonant modifications. It is now no longer systematic and must be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became logogram, logographic. Thus the primary categorisation of alphabets reflects how they treat vowels. For Tone (linguistics), tonal languages, further classification can be based on their treatment of tone though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load, as in Somali language, Somali and many other languages of Africa and the Americas. Such scripts are to tone what abjads are to vowels. Most commonly, tones get indicated with diacritics, which is how vowels get treated in abugidas, which is the case for Vietnamese alphabet, Vietnamese (a true alphabet) and Thai alphabet, Thai (an abugida). In Thai, the tone is determined primarily by a consonant, with diacritics for disambiguation. In the Pollard script, an abugida, vowels get indicated by diacritics. The placing of the diacritic relative to the consonant is modified to indicate the tone. More rarely, a script may have separate letters for tones, as is the case for Hmong alphabet, Hmong and Zhuang alphabet, Zhuang. For many, regardless of whether letters or diacritics get used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas. In
Zhuyin Bopomofo (), or Mandarin Phonetic Symbols, also named Zhuyin (), is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe ...
, not only is one of the tones unmarked; but there is a diacritic to indicate a lack of tone, like the virama of Indic.


Size

The number of letters in an alphabet can be small. The Book Pahlavi scripts, Pahlavi script, an abjad, had only twelve letters at one point and may have had even fewer. Today the Rotokas alphabet has only twelve letters (the Hawaiian alphabet claimed to be that small. However, it consists of 18 letters, including the ʻokina and five long vowels). While Rotokas has a small alphabet because it has few phonemes to represent (just eleven), Book Pahlavi was small because many letters got ''conflated''—or, the graphic distinctions had gotten lost over time. In later Pahlavi papyrus, papyri, up to half of the remaining graphic distinctions of these twelve letters were lost, and the script could no longer be read as a sequence of letters at all. Instead, each word had to be learned as a whole—or, they had become
logogram In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced ''hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, as ...
s as in Egyptian Demotic Egyptian, Demotic. Moreover, the spellings of some words were heterogram (linguistics), heterograms; that is, those spellings did not reflect the pronunciation of those words in Pahlavi but instead reflected their Aramaic language, Aramaic equivalents used as logograms (as English ''e. g.'' ''for example'', from Latin ''exempli gratia'').


Abugidas

The largest segmental script is probably
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental Writing systems#Segmental syste ...
. When written in Devanagari, Vedic Sanskrit has an alphabet of 53 letters, including the ''visarga'' mark for final aspiration and special letters for ''kš'' and ''jñ.'' However, one of the letters is theoretical and not used. The Hindi alphabet must represent both Sanskrit and modern vocabulary, and so has been expanded to 58 with the letters (letters with a dot added) to represent sounds from Persian and English.


Abjads

The largest known abjad is Sindhi language, Sindhi, with 52 letters. The largest alphabets in the narrow sense include Abkhaz language, Abkhaz and Kabardian language, Kabardian (for Cyrillic), with 62 and 60 letters, respectively, and Slovak language, Slovak (for the Latin script), with 46. However, these scripts either count digraph (orthography), di- and tri-graphs as separate letters, as Spanish did with ''ch'' and ''ll'' until recently, or uses
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s like Slovak ''č''.


Alphabets

The Georgian alphabet is an alphabetic writing system. The modern Georgian alphabet has 33 letters. The original Georgian alphabet had 38 letters, but five letters were removed in the 19th century by Ilia Chavchavadze. The Armenian alphabet is an alphabetical writing system used to write the Armenian language. It was created in year 405 A.D. originally contained 36 letters. Two more letters, օ (o) and ֆ (f), were added in the Middle Ages. During the 1920s orthography reform, a new letter և (capital ԵՎ) was added, which was a ligature before ե+ւ. The letter Ւ ւ was discarded and reintroduced as part of a new letter ՈՒ ու (which was a digraph before).


Syllabaries

Syllabaries typically contain 50 to 400 glyphs. Glyphs of logographic systems typically number from the many hundreds into the thousands. Thus a simple count of the number of distinct symbols is an important clue to the nature of an unknown script.


Alphabetical order

Alphabets often come to be associated with a standard ordering of their letters, which is for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fili ...
—namely, for the listing words and other items in ''
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
''. The basic ordering of the Latin alphabet (A B C D E F G H I J K L M N O P Q R S T U V W X Y Z), which derived from the Northwest Semitic "Abgad" order, is already well established. Although, languages using this alphabet have different conventions for their treatment of modified letters (such as the French language, French ''é'', ''à'', and ''ô'') and certain combinations of letters (Multigraph (orthography), multigraphs). In French, these do not get considered to be additional letters for the purposes of collation. However, in Icelandic, the accented letters such as ''á'', ''í'', and ''ö'' are considered distinct letters representing different vowel sounds from sounds represented by their unaccented counterparts. In Spanish, ''ñ'' is considered a separate letter, but accented vowels such as ''á'' and ''é'' are not. The ''ll'' and ''ch'' were also considered single letters, but in 1994 the tenth congress of the Association of Spanish Language Academies. The collating order got changed so that ''ll'' is between ''lk'' and ''lm'' in the dictionary and ''ch'' is between ''cg'' and ''ci'', and in 2010 the Real Academia Española changed it so they were no longer letters at all. In German, words starting with ''sch-'' (which spells the German phoneme ) are inserted between words with initial ''sca-'' and ''sci-'' (all incidentally loanwords) instead of appearing after initial ''sz'', as though it were a single letter—in contrast to several languages such as Albanian alphabet, Albanian, in which ''dh-'', ''ë-'', ''gj-'', ''ll-'', ''rr-'', ''th-'', ''xh-'' and ''zh-'' (all representing phonemes and considered separate single letters) would follow the letters ''d'', ''e'', ''g'', ''l'', ''n'', ''r'', ''t'', ''x'' and ''z'' respectively, as well as Hungarian and Welsh. Further, German words with an umlaut (diacritic), umlaut are collated ignoring the umlaut as per Deutsches Institut für Normung, DIN 5007-1—contrary to Turkish alphabet, Turkish that adopted the
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called ''graphemics' ...
s ö and ü, and where a word like ''tüfek'', would come after ''tuz'', in the dictionary. An exception is the German telephone directory where umlauts are sorted like ''ä''=''ae'' since names such as ''Jäger'' also appear with the spelling ''Jaeger'', and are not distinguished in the spoken language. The Danish orthography, Danish and Norwegian orthography, Norwegian alphabets end with ''æ''—''ø''—''å'', whereas the Swedish and Finnish orthography, Finnish ones conventionally put ''å''—''ä''—''ö'' at the end. It is unknown whether the earliest alphabets had a defined sequence. Some alphabets today, such as the Hanuno'o script, are learned one letter at a time, in no particular order, and are not used for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office fili ...
where a definite order is required. However, a dozen Ugaritic alphabet, Ugaritic tablets from the fourteenth century BCE preserve the alphabet in two sequences. One, the ''ABCDE'' order later used in Phoenician, has continued with minor changes in
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
, Armenian alphabet, Armenian, Gothic alphabet, Gothic, Cyrillic, and
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
; the other, ''HMĦLQ,'' was used in southern Arabia and is preserved today in Ge'ez alphabet, Ethiopic. Both orders have therefore been stable for at least 3000 years. Runic alphabet, Runic used an unrelated Elder Futhark, Futhark sequence, which was later Younger Futhark, simplified.
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
uses its own sequence, although Arabic retains the traditional abjadi order for numbering. The Brahmic family of alphabets used in India use a unique order based on phonology: The letters are arranged according to how and where they are produced in the mouth. This organization is used in Southeast Asia, Tibet, Korean hangul, and even Japanese kana, which is not an alphabet.


Names of letters

The Phoenician letter names, in which each letter was associated with a word that begins with that sound (acrophony), continue to be used to varying degrees in Samaritan alphabet, Samaritan,
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
,
Syriac Syriac may refer to: *Syriac language, an ancient dialect of Middle Aramaic *Sureth, one of the modern dialects of Syriac spoken in the Nineveh Plains region * Syriac alphabet ** Syriac (Unicode block) ** Syriac Supplement * Neo-Aramaic languages a ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
and
Arabic Arabic (, ' ; , ' or ) is a Semitic languages, Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C ...
. The names were abandoned in
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, which instead referred to the letters by adding a vowel (usually "e") before or after the consonant; the two exceptions were Y and Z, which were borrowed from the Greek alphabet rather than Etruscan, and were known as ''Y Graeca'' "Greek Y" and ''zeta'' (from Greek)—this discrepancy was inherited by many European languages, as in the term ''zed'' for Z in all forms of English other than American English. Over time names sometimes shifted or were added, as in ''double U'' for W ("double V" in French), the English name for Y, and American ''zee'' for Z. Comparing names in English and French gives a clear reflection of the Great Vowel Shift: A, B, C and D are pronounced in today's English, but in contemporary French they are . The French names (from which the English names are derived) preserve the qualities of the English vowels from before the Great Vowel Shift. By contrast, the names of F, L, M, N and S () remain the same in both languages, because "short" vowels were largely unaffected by the Shift. In Cyrillic originally the letters were given names based on Slavic words; this was later abandoned as well in favor of a system similar to that used in Latin. Letters of Armenian alphabet also have distinct letter names.


Orthography and pronunciation

When an alphabet is adopted or developed to represent a given language, an orthography generally comes into being, providing rules for the spelling of words in that language. In accordance with the principle on which alphabets are based, these rules will generally map letters of the alphabet to the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-west o ...
s (significant sounds) of the spoken language. In a perfectly phonemic orthography there would be a consistent one-to-one correspondence between the letters and the phonemes, so that a writer could predict the spelling of a word given its pronunciation, and a speaker would always know the pronunciation of a word given its spelling, and vice versa. However, this ideal is not usually achieved in practice; some languages (such as Spanish and Finnish language, Finnish) come close to it, while others (such as English) deviate from it to a much larger degree. The pronunciation of a language often evolves independently of its writing system, and writing systems have been borrowed for languages they were not designed for, so the degree to which letters of an alphabet correspond to phonemes of a language varies greatly from one language to another and even within a single language. Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways: *A language may represent a given phoneme by a combination of letters rather than just a single letter. Two-letter combinations are called digraph (orthography), digraphs and three-letter groups are called trigraph (orthography), trigraphs. German language, German uses the tetragraphs (four letters) "tsch" for the phoneme and (in a few borrowed words) "dsch" for . Kabardian language, Kabardian also uses a tetragraph for one of its phonemes, namely "кхъу". Two letters representing one sound occur in several instances in Hungarian as well (where, for instance, ''cs'' stands for [tʃ], ''sz'' for [s], ''zs'' for [ʒ], ''dzs'' for [dʒ]). *A language may represent the same phoneme with two or more different letters or combinations of letters. An example is modern Greek which may write the phoneme in six different ways: , , , , , and . *A language may spell some words with unpronounced letters that exist for historical or other reasons. For example, the spelling of the Thai word for "beer" [เบียร์] retains a letter for the final consonant "r" present in the English word it was borrowed from, but silences it. *Pronunciation of individual words may change according to the presence of surrounding words in a sentence (sandhi). *Different dialects of a language may use different phonemes for the same word. *A language may use different sets of symbols or different rules for distinct sets of vocabulary items, such as the Japanese hiragana and katakana syllabaries, or the various rules in English for spelling words from Latin and Greek, or the original Germanic languages, Germanic vocabulary. National languages sometimes elect to address the problem of dialects by associating the alphabet with the national standard. Some national languages like Finnish language, Finnish, Armenian language, Armenian, Turkish language, Turkish, Russian language, Russian, Serbo-Croatian language, Serbo-Croatian (Serbian language, Serbian, Croatian language, Croatian and Bosnian language, Bosnian) and Bulgarian language, Bulgarian have a very regular spelling system with a nearly one-to-one correspondence between letters and phonemes. Strictly speaking, these national languages lack a word corresponding to the verb "to spell" (meaning to split a word into its letters), the closest match being a verb meaning to split a word into its syllables. Similarly, the
Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, an ethnic group or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance language *** Regional Ita ...
verb corresponding to 'spell (out)', ''compitare'', is unknown to many Italians because spelling is usually trivial, as Italian spelling is highly phonemic. In standard Spanish, one can tell the pronunciation of a word from its spelling, but not vice versa, as certain phonemes can be represented in more than one way, but a given letter is consistently pronounced. French language, French, with its silent letters and its heavy use of nasal vowels and elision, may seem to lack much correspondence between spelling and pronunciation, but its rules on pronunciation, though complex, are actually consistent and predictable with a fair degree of accuracy. At the other extreme are languages such as English, where the pronunciations of many words must be memorized as they do not correspond to the spelling in a consistent way. For English, this is partly because the Great Vowel Shift occurred after the orthography was established, and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels. Even English has general, albeit complex, rules that predict pronunciation from spelling, and these rules are successful most of the time; rules to predict spelling from the pronunciation have a higher failure rate. Sometimes, countries have the written language undergo a spelling reform to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system itself, as when Turkey switched from the Arabic alphabet to a Latin-based Turkish alphabet, and as when Kazakh alphabets, Kazakh changes from an Arabic script to a Cyrillic script due to the Soviet Union's influence, and in 2021, having a transition to the Latin alphabet, just like Turkish. The Cyrillic script used to be official in Uzbekistan and Turkmenistan before they all switched to the Latin alphabets, including Uzbekistan that is having a reform of the alphabet to use diacritics on the letters that is marked by apostrophes and the letters that are digraphs. The standard system of symbols used by linguists to represent sounds in any language, independently of orthography, is called the International Phonetic Alphabet.


See also

*Abecedarium *Acrophony *Akshara *Alphabet book *Alphabet effect *Alphabet song *Alphabetical order *Butterfly Alphabet *Character encoding *Constructed script *Fingerspelling *NATO phonetic alphabet *Lipogram *List of writing systems *Pangram *Thoth *Transliteration *Unicode


References


Bibliography

* * Overview of modern and some ancient writing systems. * * * Chapter 3 traces and summarizes the invention of alphabetic writing. * * * * * * * * Chapter 4 traces the invention of writing


External links


The Origins of abc"Language, Writing and Alphabet: An Interview with Christophe Rico"
''Damqātum 3'' (2007) *Michael Everson'
Alphabets of Europe
animation by Prof. Robert Fradkin at the University of Maryland
How the Alphabet Was Born from Hieroglyphs
Biblical Archaeology Review
An Early Hellenic AlphabetThe Alphabet
BBC Radio 4 discussion with Eleanor Robson, Alan Millard and Rosalind Thomas (''In Our Time'', 18 December 2003) {{Authority control Alphabets, Orthography