HOME

TheInfoList



OR:

An alphabet is a standardized set of basic written
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
s (called
letters Letter, letters, or literature may refer to: Characters typeface * Letter (alphabet), a character representing one or more of the sounds used in speech; any of the symbols of an alphabet. * Letterform, the graphic form of a letter of the alphabe ...
) that represent the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
s of certain
spoken language A spoken language is a language produced by articulate sounds or (depending on one's definition) manual gestures, as opposed to a written language. An oral language or vocal language is a language produced with the vocal tract in contrast with a si ...
s. Not all
writing system A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable fo ...
s represent language in this way; in a syllabary, each character represents a syllable, and logographic systems use
characters Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
to represent words,
morpheme A morpheme is the smallest meaningful constituent of a linguistic expression. The field of linguistic study dedicated to morphemes is called morphology. In English, morphemes are often but not necessarily words. Morphemes that stand alone are ...
s, or other semantic units. The first fully phonemic script, the
Proto-Sinaitic script Proto-Sinaitic (also referred to as Sinaitic, Proto-Canaanite when found in Canaan, the North Semitic alphabet, or Early Alphabetic) is considered the earliest trace of alphabetic writing and the common ancestor of both the Ancient South Arabian ...
, later known as the
Phoenician alphabet The Phoenician alphabet is an alphabet (more specifically, an abjad) known in modern times from the Canaanite and Aramaic inscriptions found across the Mediterranean region. The name comes from the Phoenician civilization. The Phoenician a ...
, is considered to be the first alphabet and is the ancestor of most modern alphabets, including
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
, Cyrillic,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, and possibly
Brahmic The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
. It was created by Semitic-speaking workers and slaves in the
Sinai Peninsula The Sinai Peninsula, or simply Sinai (now usually ) (, , cop, Ⲥⲓⲛⲁ), is a peninsula in Egypt, and the only part of the country located in Asia. It is between the Mediterranean Sea to the north and the Red Sea to the south, and is ...
(as the Proto-Sinaitic script), by selecting a small number of
hieroglyphs A hieroglyph (Greek for "sacred carvings") was a character of the ancient Egyptian writing system. Logographic scripts that are pictographic in form in a way reminiscent of ancient Egyptian are also sometimes called "hieroglyphs". In Neoplatonis ...
commonly seen in their Egyptian surroundings to describe the sounds, as opposed to the semantic values of the Canaanite languages. However,
Peter T. Daniels Peter T. Daniels (born December 11, 1951) is a scholar of writing systems, specializing in typology. He was co-editor (with William Bright) of the book ''The World's Writing Systems'' (1996). He was a lecturer at University of Wisconsin–Milwauk ...
distinguishes an
abugida An abugida (, from Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental writing system in which consonant-vowel sequences are written as units; each unit is based on a consonant letter, and vowel n ...
, a set of graphemes that represent consonantal base letters that
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s modify to represent vowels (as in
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
and other South Asian scripts), an
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
, in which letters predominantly or exclusively represent
consonant In articulatory phonetics, a consonant is a speech sound that is articulated with complete or partial closure of the vocal tract. Examples are and pronounced with the lips; and pronounced with the front of the tongue; and pronounced wi ...
s (as in the original Phoenician,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
or
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
), and an alphabet, a set of graphemes that represent both consonants and
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (leng ...
s. In this narrow sense of the word, the first true alphabet was the Greek alphabet, which was based on the earlier Phoenician alphabet. Of the dozens of alphabets in use today, the most popular is the Latin alphabet, which was derived from the Greek alphabet, and which is now used by many languages worldwide, often with the addition of extra letters or diacritical marks. While most alphabets have letters composed of lines, there are also exceptions such as the alphabets used in
Braille Braille (Pronounced: ) is a tactile writing system used by people who are visually impaired, including people who are blind, deafblind or who have low vision. It can be read either on embossed paper or by using refreshable braille disp ...
. Alphabets are usually associated with a standard ordering of letters. This makes them useful for purposes of
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
, specifically by allowing words to be sorted in
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
. It also means that their letters can be used as an alternative method of "numbering" ordered items, in such contexts as numbered lists and number placements.


Etymology

The English word ''alphabet'' came into
Middle English Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English ...
from the
Late Latin Late Latin ( la, Latinitas serior) is the scholarly name for the form of Literary Latin of late antiquity.Roberts (1996), p. 537. English dictionary definitions of Late Latin date this period from the , and continuing into the 7th century in t ...
word ''alphabetum'', which in turn originated in the Greek ἀλφάβητος (''alphabētos''), it was made from the first two letters, '' alpha'' (α) and '' beta'' (β). The names for the Greek letters came from the first two letters of the Phoenician alphabet; '' aleph'', which also meant ''ox'', and ''
bet Black Entertainment Television (acronym BET) is an American basic cable channel targeting African-American audiences. It is owned by the CBS Entertainment Group unit of Paramount Global via BET Networks and has offices in New York City, Los ...
'', which also meant ''house''.


History


Ancient Northeast African and Middle Eastern scripts

The history of the alphabet started in ancient Egypt. Egyptian writing had a set of some 24 hieroglyphs that are called uniliterals, representing syllables that begin with a single consonant of their language, plus a vowel (or no vowel) to be supplied by the native speaker. These glyphs were used as pronunciation guides for
logogram In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced '' hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, ...
s, to write grammatical inflections, and, later, to transcribe loan words and foreign names. In the Middle Bronze Age, an apparently "alphabetic" system known as the Proto-Sinaitic script appeared in Egyptian turquoise mines in the
Sinai peninsula The Sinai Peninsula, or simply Sinai (now usually ) (, , cop, Ⲥⲓⲛⲁ), is a peninsula in Egypt, and the only part of the country located in Asia. It is between the Mediterranean Sea to the north and the Red Sea to the south, and is ...
dated to circa the 15th century BCE, apparently left by Canaanite workers. In 1999, John and Deborah Darnell discovered an earlier version of this first alphabet at
Wadi el-Hol Wadi el-Hol is a valley on the Farshut Road, north-west of Luxor on the Qena Bend, situated on the west bank of the river Nile in Egypt. Rock inscriptions in the valley appear to show the oldest examples of phonetic alphabetic writing discovered ...
. Dating to circa 1800 BCE and showing evidence of having been adapted from specific forms of Egyptian hieroglyphs that could be dated to circa 2000 BCE, strongly suggesting that the first alphabet had developed about that time. Based on letter appearances and names, believed to be based on Egyptian hieroglyphs. This script had no characters representing vowels. Originally, it probably was a syllabary, with symbols that were not needed being removed. An alphabetic
cuneiform Cuneiform is a logo-syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge-sh ...
script with 30 signs, including three that indicate the following vowel invented in
Ugarit ) , image =Ugarit Corbel.jpg , image_size=300 , alt = , caption = Entrance to the Royal Palace of Ugarit , map_type = Near East#Syria , map_alt = , map_size = 300 , relief=yes , location = Latakia Governorate, Syria , region = ...
before the 15th century BCE. This script was not used after the destruction of Ugarit. The Proto-Sinaitic script eventually developed into the Phoenician alphabet, conventionally called "Proto-Canaanite" before c. 1050 BCE. The oldest text in Phoenician script is an inscription on the sarcophagus of King
Ahiram The Ahiram sarcophagus (also spelled Ahirom, in Phoenician) was the sarcophagus of a Phoenician Kings of Byblos, King of Byblos (c. 850 BC), discovered in 1923 by the French excavator Pierre Montet in tomb V of the royal necropolis of Byblos. Th ...
. This script is the parent script of all western alphabets. By the tenth century, two other forms distinguish themselves, Canaanite and
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
. The Aramaic gave rise to the
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
script. The
South Arabian alphabet The Ancient South Arabian script (Old South Arabian 𐩣𐩯𐩬𐩵 ''ms3nd''; modern ar, الْمُسْنَد ''musnad'') branched from the Proto-Sinaitic script in about the late 2nd millennium BCE. It was used for writing the Old Sou ...
, a sister script to the Phoenician alphabet, is the script from which the Ge'ez alphabet (an
abugida An abugida (, from Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental writing system in which consonant-vowel sequences are written as units; each unit is based on a consonant letter, and vowel n ...
) descended. Vowel-less alphabets are called
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s, currently exemplified in others such as, Arabic, Hebrew, and
Syriac Syriac may refer to: *Syriac language, an ancient dialect of Middle Aramaic *Sureth, one of the modern dialects of Syriac spoken in the Nineveh Plains region * Syriac alphabet ** Syriac (Unicode block) ** Syriac Supplement * Neo-Aramaic languages a ...
. The omission of vowels was not always a satisfactory solution, and some "weak" consonants sometimes being used to indicate the vowel quality of a syllable). These letters have a dual function since they can also be used as pure consonants. The Proto-Sinaitic script and the Ugaritic script were the first scripts with a limited number of signs, in contrast to the other widely used writing systems at the time,
Cuneiform Cuneiform is a logo-syllabic script that was used to write several languages of the Ancient Middle East. The script was in active use from the early Bronze Age until the beginning of the Common Era. It is named for the characteristic wedge-sh ...
,
Egyptian hieroglyphs Egyptian hieroglyphs (, ) were the formal writing system used in Ancient Egypt, used for writing the Egyptian language. Hieroglyphs combined logographic, syllabic and alphabetic elements, with some 1,000 distinct characters.There were about 1, ...
, and Linear B. The Phoenician script was probably the first phonemic script, and it contained only about two dozen distinct letters, making it a script simple enough for traders to learn. Another advantage of Phoenician was that it could write different languages since it recorded words phonemically. The script was spread across the Mediterranean by the Phoenicians. In Greece, they added vowels to the alphabet, giving rise to the ancestor of all alphabets in the West. It was the first alphabet in which vowels have independent letter forms separate from those of consonants. The Greeks chose letters representing sounds that did not exist in Greek to represent vowels. Vowels are significant in the Greek language. The syllabical Linear B script that was used by the
Mycenaean Greeks Mycenaean Greece (or the Mycenaean civilization) was the last phase of the Bronze Age in Ancient Greece, spanning the period from approximately 1750 to 1050 BC.. It represents the first advanced and distinctively Greek civilization in mainland ...
from the 16th century BCE had 87 symbols, including five vowels. In its early years, there were many variants of the Greek alphabet, a situation that caused many different alphabets to evolve from it.


European alphabets

The Greek alphabet, in Euboean form, was carried over by Greek colonists to the Italian peninsula, giving rise to many different alphabets used to write the Italic languages. One of these became the Latin alphabet, which spread across Europe as the Romans expanded their empire. Even after the fall of the Roman state, the alphabet survived in intellectual and religious works. It eventually became used for the descendant languages of Latin (the
Romance languages The Romance languages, sometimes referred to as Latin languages or Neo-Latin languages, are the various modern languages that evolved from Vulgar Latin. They are the only extant subgroup of the Italic languages in the Indo-European language ...
) and most of the other languages of western and central Europe. Some adaptations of the Latin alphabet have ligatures, such as æ in
Danish Danish may refer to: * Something of, from, or related to the country of Denmark People * A national or citizen of Denmark, also called a "Dane," see Demographics of Denmark * Culture of Denmark * Danish people or Danes, people with a Danish a ...
and Icelandic and Ȣ in Algonquian; borrowings from other alphabets, such as the
thorn Thorn(s) or The Thorn(s) may refer to: Botany * Thorns, spines, and prickles, sharp structures on plants * ''Crataegus monogyna'', or common hawthorn, a plant species Comics and literature * Rose and Thorn, the two personalities of two DC Com ...
þ in
Old English Old English (, ), or Anglo-Saxon, is the earliest recorded form of the English language, spoken in England and southern and eastern Scotland in the early Middle Ages. It was brought to Great Britain by Anglo-Saxon settlers in the mid-5th c ...
and Icelandic, which came from the
Futhark Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialis ...
runes; and modified existing letters, such as the
eth (colloquially) , former_name = eidgenössische polytechnische Schule , image = ETHZ.JPG , image_size = , established = , type = Public , budget = CHF 1.896 billion (2021) , rector = Günther Dissertori , president = Joël Mesot , a ...
ð of Old English and Icelandic, which is a modified ''d''. Other alphabets only use a subset of the Latin alphabet, such as Hawaiian, and
Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, an ethnic group or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance language *** Regional Ita ...
, which uses the letters ''j, k, x, y,'' and ''w'' only in foreign words. Another notable script is Elder Futhark, believed to have evolved out of one of the Old Italic alphabets. Elder Futhark gave rise to other alphabets known collectively as the
Runic alphabet Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
s. The Runic alphabets were used for Germanic languages from 100 CE to the late Middle Ages being engraved on stone and jewelry, although inscriptions found on bone and wood occasionally appear. These alphabets have since gotten replaced with the Latin alphabet. The exception being for decorative use, where the runes remained in use until the 20th century. The
Old Hungarian script The Old Hungarian script or Hungarian runes ( hu, Székely-magyar rovás, 'székely-magyar runiform', or ) is an alphabetic writing system used for writing the Hungarian language. Modern Hungarian is written using the Latin-based Hungarian alph ...
was the writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the 19th century, it once again became more and more popular. The
Glagolitic alphabet The Glagolitic script (, , ''glagolitsa'') is the oldest known Slavic alphabet. It is generally agreed to have been created in the 9th century by Saint Cyril, a monk from Thessalonica. He and his brother Saint Methodius were sent by the Byzan ...
was the initial script of the liturgical language Old Church Slavonic and became, together with the Greek uncial script, the basis of the
Cyrillic script The Cyrillic script ( ), Slavonic script or the Slavic script, is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking co ...
. Cyrillic is one of the most widely used modern alphabetic scripts, and is notable for its use in Slavic languages and also for other languages within the former
Soviet Union The Soviet Union,. officially the Union of Soviet Socialist Republics. (USSR),. was a List of former transcontinental countries#Since 1700, transcontinental country that spanned much of Eurasia from 1922 to 1991. A flagship communist state, ...
.
Cyrillic alphabets Numerous Cyrillic alphabets are based on the Cyrillic script. The early Cyrillic alphabet was developed in the 9th century AD and replaced the earlier Glagolitic script developed by the Byzantine theologians Cyril and Methodius. It is the b ...
include Serbian, Macedonian,
Bulgarian Bulgarian may refer to: * Something of, from, or related to the country of Bulgaria * Bulgarians, a South Slavic ethnic group * Bulgarian language, a Slavic language * Bulgarian alphabet * A citizen of Bulgaria, see Demographics of Bulgaria * Bul ...
,
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
, Belarusian, and
Ukrainian Ukrainian may refer to: * Something of, from, or related to Ukraine * Something relating to Ukrainians, an East Slavic people from Eastern Europe * Something relating to demographics of Ukraine in terms of demography and population of Ukraine * So ...
. The Glagolitic alphabet believed to have been created by
Saints Cyril and Methodius Cyril (born Constantine, 826–869) and Methodius (815–885) were two brothers and Byzantine Christian theologians and missionaries. For their work evangelizing the Slavs, they are known as the "Apostles to the Slavs". They are credited wi ...
, while the Cyrillic alphabet was invented by
Clement of Ohrid Saint Clement of Ohrid ( Bulgarian, Serbian and Macedonian: Свети Климент Охридски, ; el, Ἅγιος Κλήμης τῆς Ἀχρίδας; sk, svätý Kliment Ochridský; – 916) was one of the first medieval Bulgarian ...
, their disciple. They feature many letters that appear to have been borrowed from or influenced by Greek and Hebrew.


Asian alphabets

Beyond the logographic Chinese writing, many phonetic scripts exist in Asia. The Arabic alphabet,
Hebrew alphabet The Hebrew alphabet ( he, אָלֶף־בֵּית עִבְרִי, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew language and other Jewi ...
, Syriac alphabet, and other
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s of the Middle East are developments of the Aramaic alphabet. Most alphabetic scripts of India and Eastern Asia descend from the
Brahmi script Brahmi (; ; ISO: ''Brāhmī'') is a writing system of ancient South Asia. "Until the late nineteenth century, the script of the Aśokan (non-Kharosthi) inscriptions and its immediate derivatives was referred to by various names such as 'lath' ...
, believed to be a descendant of Aramaic.


Hangul

In
Korea Korea ( ko, 한국, or , ) is a peninsular region in East Asia. Since 1945, it has been divided at or near the 38th parallel, with North Korea (Democratic People's Republic of Korea) comprising its northern half and South Korea (Republic o ...
, Sejong the Great created the
Hangul The Korean alphabet, known as Hangul, . Hangul may also be written as following South Korea's standard Romanization. ( ) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The le ...
alphabet Hangul is a unique alphabet: it is a
featural alphabet In a featural writing system, the shapes of the symbols (such as letters) are not arbitrary but encode phonological features of the phonemes that they represent. The term featural was introduced by Geoffrey Sampson to describe the Korean alph ...
, where design of many of the letters comes from a sound's place of articulation (P to look like the widened mouth, L to look like the tongue pulled in); creation of Hangul was planned by the government of the day; and it places individual letters in syllable clusters with equal dimensions, in the same way as
Chinese characters Chinese characters () are logograms developed for the writing of Chinese. In addition, they have been adapted to write other East Asian languages, and remain a key component of the Japanese writing system where they are known as ''kanji ...
, to allow for mixed-script writing (one syllable always takes up one type-space no matter how many letters get stacked into building that one sound-block).


Zhuyin

Zhuyin Bopomofo (), or Mandarin Phonetic Symbols, also named Zhuyin (), is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe ...
(sometimes called ''Bopomofo'') is a semi-syllabary. It transcribes
Mandarin Mandarin or The Mandarin may refer to: Language * Mandarin Chinese, branch of Chinese originally spoken in northern parts of the country ** Standard Chinese or Modern Standard Mandarin, the official language of China ** Taiwanese Mandarin, Stand ...
phonetically in the Republic of China. After the later establishment of the People's Republic of China and its adoption of Hanyu Pinyin, the use of Zhuyin today is limited. However, it is still widely used in Taiwan, where the Republic of China governs. Zhuyin developed from a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet, the phonemes of syllable initials get represented by individual symbols, but like a syllabary, the phonemes of the syllable finals are not; each possible final (excluding the medial glide) has its own character, an example being, ''luan'' written as ㄌㄨㄢ (''l-u-an''). The last symbol ㄢ takes place as the entire final ''-an''. While Zhuyin is not a mainstream writing system, it is still often used in ways similar to a
romanization Romanization or romanisation, in linguistics, is the conversion of text from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanization include transliteration, for representing written text, a ...
system, for aiding pronunciation and as an input method for Chinese characters on computers and cellphones.


Romanization

European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad (as with
Urdu Urdu (;"Urdu"
'' Persian Persian may refer to: * People and things from Iran, historically called ''Persia'' in the English language ** Persians, the majority ethnic group in Iran, not to be conflated with the Iranic peoples ** Persian language, an Iranian language of the ...
) and sometimes as a complete alphabet (as with
Kurdish Kurdish may refer to: *Kurds or Kurdish people *Kurdish languages *Kurdish alphabets *Kurdistan, the land of the Kurdish people which includes: **Southern Kurdistan **Eastern Kurdistan **Northern Kurdistan **Western Kurdistan See also * Kurd (dis ...
and Uyghur).


Types

The term "alphabet" gets used by linguists and
paleographer Palaeography ( UK) or paleography ( US; ultimately from grc-gre, , ''palaiós'', "old", and , ''gráphein'', "to write") is the study of historic writing systems and the deciphering and dating of historical manuscripts, including the analysi ...
s in both a wide and a narrow sense. In a larger sense, an alphabet is a ''segmental'' script at the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
level—that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script,
abjad An abjad (, ar, أبجد; also abgad) is a writing system in which only consonants are represented, leaving vowel sounds to be inferred by the reader. This contrasts with other alphabets, which provide graphemes for both consonants and vowels ...
s, and
abugida An abugida (, from Ge'ez: ), sometimes known as alphasyllabary, neosyllabary or pseudo-alphabet, is a segmental writing system in which consonant-vowel sequences are written as units; each unit is based on a consonant letter, and vowel n ...
s. These three differ in how they treat vowels. Abjads have letters for consonants and leave most vowels unexpressed. Abugidas are also consonant-based but indicate vowels with
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s, a systematic graphic modification of the consonants. In alphabets in the narrow sense, on the other hand, consonants and vowels are written as independent letters. The earliest known alphabet in the wider sense is the
Wadi el-Hol script Wadi ( ar, وَادِي, wādī), alternatively ''wād'' ( ar, وَاد), North African Arabic Oued, is the Arabic term traditionally referring to a valley. In some instances, it may refer to a wet (ephemeral) riverbed that contains water o ...
, believed to be an abjad. Its successor, Phoenician is the ancestor of modern alphabets, including
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
,
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
(via the Old Italic alphabet), Cyrillic (via the Greek alphabet), and
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
(via
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
). Examples of present-day abjads are the
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
and
Hebrew script The Hebrew alphabet ( he, אָלֶף־בֵּית עִבְרִי, ), known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is an abjad script used in the writing of the Hebrew language and other Jewish ...
s; true alphabets include
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, Cyrillic, and Korean
hangul The Korean alphabet, known as Hangul, . Hangul may also be written as following South Korea's standard Romanization. ( ) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The le ...
; and abugidas, used to write
Tigrinya (; also spelled Tigrigna) is an Ethio-Semitic language commonly spoken Eritrea and in northern Ethiopia's Tigray Region by the Tigrinya and Tigrayan peoples. It is also spoken by the global diaspora of these regions. History and literatur ...
, Amharic,
Hindi Hindi ( Devanāgarī: or , ), or more precisely Modern Standard Hindi (Devanagari: ), is an Indo-Aryan language spoken chiefly in the Hindi Belt region encompassing parts of northern, central, eastern, and western India. Hindi has been ...
, and Thai. The Canadian Aboriginal syllabics are also an abugida rather than a syllabary as their name would imply, because each glyph stands for a consonant and is modified by rotation to represent the following vowel (in a true syllabary, each consonant-vowel combination gets represented by a separate glyph). All three types may be augmented with syllabic glyphs.
Ugaritic Ugaritic () is an extinct Northwest Semitic language, classified by some as a dialect of the Amorite language and so the only known Amorite dialect preserved in writing. It is known through the Ugaritic texts discovered by French archaeologist ...
, for example, is basically an abjad but has syllabic letters for (these are the only times that vowels are indicated). Coptic has a letter for .
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a
zero consonant In orthography, a zero consonant, silent initial, or null-onset letter is a consonant letter that does not correspond to a consonant sound, but is required when a word or syllable starts with a vowel (i.e. has a null onset). Some abjads, abugidas ...
as the graphic base for such vowels. The boundaries between the three types of segmental scripts are not always clear-cut. For example,
Sorani Central Kurdish (), also called Sorani (), is a Kurdish dialect or a language that is spoken in Iraq, mainly in Iraqi Kurdistan, as well as the provinces of Kurdistan, Kermanshah, and West Azerbaijan in western Iran. Sorani is one of the two o ...
Kurdish Kurdish may refer to: *Kurds or Kurdish people *Kurdish languages *Kurdish alphabets *Kurdistan, the land of the Kurdish people which includes: **Southern Kurdistan **Eastern Kurdistan **Northern Kurdistan **Western Kurdistan See also * Kurd (dis ...
is written in the Arabic script, which when used for other languages is an abjad. In Kurdish, writing the vowels is mandatory, and whole letters get used, so the script is a true alphabet. Other languages may use a Semitic abjad with forced vowel diacritics, effectively making them abugidas. On the other hand, the Phagspa script of the Mongol Empire was based closely on the Tibetan abugida, but vowel marks are written after the preceding consonant rather than as diacritic marks. Although short ''a'' not getting written, as in the Indic abugidas, one could argue that the linear arrangement made this a true alphabet. Ironically, the original source of the term "abugida", namely the Ge'ez abugida now used for Amharic and
Tigrinya (; also spelled Tigrigna) is an Ethio-Semitic language commonly spoken Eritrea and in northern Ethiopia's Tigray Region by the Tigrinya and Tigrayan peoples. It is also spoken by the global diaspora of these regions. History and literatur ...
, has assimilated into their consonant modifications. It is now no longer systematic and must be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became
logographic In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced ''hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, as ...
. Thus the primary
categorisation Categorization is the ability and activity of recognizing shared features or similarities between the elements of the experience of the world (such as objects, events, or ideas), organizing and classifying experience by associating them to a ...
of alphabets reflects how they treat vowels. For
tonal languages Tone is the use of pitch in language to distinguish lexical or grammatical meaning – that is, to distinguish or to inflect words. All verbal languages use pitch to express emotional and other paralinguistic information and to convey emph ...
, further classification can be based on their treatment of tone though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load, as in Somali and many other languages of Africa and the Americas. Such scripts are to tone what abjads are to vowels. Most commonly, tones get indicated with diacritics, which is how vowels get treated in abugidas, which is the case for
Vietnamese Vietnamese may refer to: * Something of, from, or related to Vietnam, a country in Southeast Asia ** A citizen of Vietnam. See Demographics of Vietnam. * Vietnamese people, or Kinh people, a Southeast Asian ethnic group native to Vietnam ** Overse ...
(a true alphabet) and Thai (an abugida). In Thai, the tone is determined primarily by a consonant, with diacritics for disambiguation. In the
Pollard script The Pollard script, also known as Pollard Miao (Chinese: 柏格理苗文 Bó Gélǐ Miao-wen) or Miao, is an abugida loosely based on the Latin alphabet and invented by Methodist missionary Sam Pollard. Pollard invented the script for use with A ...
, an abugida, vowels get indicated by diacritics. The placing of the diacritic relative to the consonant is modified to indicate the tone. More rarely, a script may have separate letters for tones, as is the case for
Hmong Hmong may refer to: * Hmong people, an ethnic group living mainly in Southwest China, Vietnam, Laos, and Thailand * Hmong cuisine * Hmong customs and culture ** Hmong music ** Hmong textile art * Hmong language, a continuum of closely related to ...
and Zhuang. For many, regardless of whether letters or diacritics get used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas. In
Zhuyin Bopomofo (), or Mandarin Phonetic Symbols, also named Zhuyin (), is a Chinese transliteration system for Mandarin Chinese and other related languages and dialects. More commonly used in Taiwanese Mandarin, it may also be used to transcribe ...
, not only is one of the tones unmarked; but there is a diacritic to indicate a lack of tone, like the
virama Virama ( ्) is a Sanskrit phonological concept to suppress the inherent vowel that otherwise occurs with every consonant letter, commonly used as a generic term for a codepoint in Unicode, representing either # halanta, hasanta or explicit vir� ...
of Indic.


Size

The number of letters in an alphabet can be small. The Book Pahlavi script, an abjad, had only twelve letters at one point and may have had even fewer. Today the
Rotokas alphabet The modern Rotokas alphabet is a Latin alphabet consisting of only 12 letters of the ISO basic Latin alphabet without diacritics: It is the smallest alphabet in use today. The majority of the Rotokas people are literate in their language. In ...
has only twelve letters (the Hawaiian alphabet claimed to be that small. However, it consists of 18 letters, including the ʻokina and five long vowels). While Rotokas has a small alphabet because it has few phonemes to represent (just eleven), Book Pahlavi was small because many letters got ''conflated''—or, the graphic distinctions had gotten lost over time. In later Pahlavi papyri, up to half of the remaining graphic distinctions of these twelve letters were lost, and the script could no longer be read as a sequence of letters at all. Instead, each word had to be learned as a whole—or, they had become
logogram In a written language, a logogram, logograph, or lexigraph is a written character that represents a word or morpheme. Chinese characters (pronounced '' hanzi'' in Mandarin, ''kanji'' in Japanese, ''hanja'' in Korean) are generally logograms, ...
s as in Egyptian
Demotic Demotic may refer to: * Demotic Greek, the modern vernacular form of the Greek language * Demotic (Egyptian), an ancient Egyptian script and version of the language * Chữ Nôm, the demotic script for writing Vietnamese See also * * Demos (disa ...
. Moreover, the spellings of some words were heterograms; that is, those spellings did not reflect the pronunciation of those words in Pahlavi but instead reflected their
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
equivalents used as logograms (as English ''e. g.'' ''for example'', from Latin ''exempli gratia'').


Abugidas

The largest segmental script is probably
Devanagari Devanagari ( ; , , Sanskrit pronunciation: ), also called Nagari (),Kathleen Kuiper (2010), The Culture of India, New York: The Rosen Publishing Group, , page 83 is a left-to-right abugida (a type of segmental writing system), based on the ...
. When written in Devanagari, Vedic
Sanskrit Sanskrit (; attributively , ; nominally , , ) is a classical language belonging to the Indo-Aryan branch of the Indo-European languages. It arose in South Asia after its predecessor languages had diffused there from the northwest in the late ...
has an alphabet of 53 letters, including the ''visarga'' mark for final aspiration and special letters for ''kš'' and ''jñ.'' However, one of the letters is theoretical and not used. The Hindi alphabet must represent both Sanskrit and modern vocabulary, and so has been expanded to 58 with the letters (letters with a dot added) to represent sounds from Persian and English.


Abjads

The largest known abjad is Sindhi, with 52 letters. The largest alphabets in the narrow sense include Abkhaz and Kabardian (for Cyrillic), with 62 and 60 letters, respectively, and Slovak (for the
Latin script The Latin script, also known as Roman script, is an alphabetic writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae, in southern I ...
), with 46. However, these scripts either count di- and tri-graphs as separate letters, as Spanish did with ''ch'' and ''ll'' until recently, or uses
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacriti ...
s like Slovak ''č''.


Alphabets

The Georgian alphabet is an alphabetic writing system. The modern Georgian alphabet has 33 letters. The original Georgian alphabet had 38 letters, but five letters were removed in the 19th century by Ilia Chavchavadze. The
Armenian alphabet The Armenian alphabet ( hy, Հայոց գրեր, ' or , ') is an alphabetic writing system used to write Armenian. It was developed around 405 AD by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. The system originally had ...
is an alphabetical writing system used to write the Armenian language. It was created in year 405 A.D. originally contained 36 letters. Two more letters, օ (o) and ֆ (f), were added in the Middle Ages. During the 1920s orthography reform, a new letter և (capital ԵՎ) was added, which was a ligature before ե+ւ. The letter Ւ ւ was discarded and reintroduced as part of a new letter ՈՒ ու (which was a digraph before).


Syllabaries

Syllabaries typically contain 50 to 400 glyphs. Glyphs of logographic systems typically number from the many hundreds into the thousands. Thus a simple count of the number of distinct symbols is an important clue to the nature of an unknown script.


Alphabetical order

Alphabets often come to be associated with a standard ordering of their letters, which is for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
—namely, for the listing words and other items in ''
alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
''. The basic ordering of the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and th ...
( A B C D E F G H I J K L M N O P Q R S T U V W X Y Z), which derived from the Northwest Semitic "Abgad" order, is already well established. Although, languages using this alphabet have different conventions for their treatment of modified letters (such as the French ''é'', ''à'', and ''ô'') and certain combinations of letters ( multigraphs). In French, these do not get considered to be additional letters for the purposes of collation. However, in Icelandic, the accented letters such as ''á'', ''í'', and ''ö'' are considered distinct letters representing different vowel sounds from sounds represented by their unaccented counterparts. In Spanish, ''ñ'' is considered a separate letter, but accented vowels such as ''á'' and ''é'' are not. The ''ll'' and ''ch'' were also considered single letters, but in 1994 the tenth congress of the
Association of Spanish Language Academies The Association of Academies of the Spanish Language ( es, Asociación de Academias de la Lengua Española, ASALE) is an entity whose end is to work for the unity, integrity, and growth of the Spanish language. It was created in Mexico in 1951 an ...
. The collating order got changed so that ''ll'' is between ''lk'' and ''lm'' in the dictionary and ''ch'' is between ''cg'' and ''ci'', and in 2010 the Real Academia Española changed it so they were no longer letters at all. In German, words starting with ''sch-'' (which spells the German phoneme ) are inserted between words with initial ''sca-'' and ''sci-'' (all incidentally loanwords) instead of appearing after initial ''sz'', as though it were a single letter—in contrast to several languages such as Albanian, in which ''dh-'', ''ë-'', ''gj-'', ''ll-'', ''rr-'', ''th-'', ''xh-'' and ''zh-'' (all representing phonemes and considered separate single letters) would follow the letters ''d'', ''e'', ''g'', ''l'', ''n'', ''r'', ''t'', ''x'' and ''z'' respectively, as well as Hungarian and Welsh. Further, German words with an umlaut are collated ignoring the umlaut as per DIN 5007-1—contrary to Turkish that adopted the
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
s ö and ü, and where a word like ''tüfek'', would come after ''tuz'', in the dictionary. An exception is the German telephone directory where umlauts are sorted like ''ä''=''ae'' since names such as ''Jäger'' also appear with the spelling ''Jaeger'', and are not distinguished in the spoken language. The
Danish Danish may refer to: * Something of, from, or related to the country of Denmark People * A national or citizen of Denmark, also called a "Dane," see Demographics of Denmark * Culture of Denmark * Danish people or Danes, people with a Danish a ...
and
Norwegian Norwegian, Norwayan, or Norsk may refer to: *Something of, from, or related to Norway, a country in northwestern Europe * Norwegians, both a nation and an ethnic group native to Norway * Demographics of Norway *The Norwegian language, including ...
alphabets end with ''æ''—''ø''—''å'', whereas the Swedish and
Finnish Finnish may refer to: * Something or someone from, or related to Finland * Culture of Finland * Finnish people or Finns, the primary ethnic group in Finland * Finnish language, the national language of the Finnish people * Finnish cuisine See also ...
ones conventionally put ''å''—''ä''—''ö'' at the end. It is unknown whether the earliest alphabets had a defined sequence. Some alphabets today, such as the Hanuno'o script, are learned one letter at a time, in no particular order, and are not used for
collation Collation is the assembly of written information into a standard order. Many systems of collation are based on numerical order or alphabetical order, or extensions and combinations thereof. Collation is a fundamental element of most office filin ...
where a definite order is required. However, a dozen
Ugaritic Ugaritic () is an extinct Northwest Semitic language, classified by some as a dialect of the Amorite language and so the only known Amorite dialect preserved in writing. It is known through the Ugaritic texts discovered by French archaeologist ...
tablets from the fourteenth century BCE preserve the alphabet in two sequences. One, the ''ABCDE'' order later used in Phoenician, has continued with minor changes in
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
,
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian Diaspora, Armenian communities across the ...
,
Gothic Gothic or Gothics may refer to: People and languages *Goths or Gothic people, the ethnonym of a group of East Germanic tribes **Gothic language, an extinct East Germanic language spoken by the Goths **Crimean Gothic, the Gothic language spoken b ...
, Cyrillic, and
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
; the other, ''HMĦLQ,'' was used in southern Arabia and is preserved today in Ethiopic. Both orders have therefore been stable for at least 3000 years.
Runic Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialised ...
used an unrelated
Futhark Runes are the letters in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write various Germanic languages (with some exceptions) before they adopted the Latin alphabet, and for specialis ...
sequence, which was later simplified.
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
uses its own sequence, although Arabic retains the traditional
abjadi order The Abjad numerals, also called Hisab al-Jummal ( ar, حِسَاب ٱلْجُمَّل, ), are a decimal alphabetic numeral system/ alphanumeric code, in which the 28 letters of the Arabic alphabet are assigned numerical values. They have been u ...
for numbering. The
Brahmic family The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are used throughout the Indian subcontinent, Southeast Asia and parts of East Asia. They are descended from the Brahmi script of ancient India ...
of alphabets used in India use a unique order based on
phonology Phonology is the branch of linguistics that studies how languages or dialects systematically organize their sounds or, for sign languages, their constituent parts of signs. The term can also refer specifically to the sound or sign system of a ...
: The letters are arranged according to how and where they are produced in the mouth. This organization is used in Southeast Asia, Tibet, Korean
hangul The Korean alphabet, known as Hangul, . Hangul may also be written as following South Korea's standard Romanization. ( ) in South Korea and Chosŏn'gŭl in North Korea, is the modern official writing system for the Korean language. The le ...
, and even Japanese
kana The term may refer to a number of syllabaries used to write Japanese phonological units, morae. Such syllabaries include (1) the original kana, or , which were Chinese characters (kanji) used phonetically to transcribe Japanese, the most p ...
, which is not an alphabet.


Names of letters

The Phoenician letter names, in which each letter was associated with a word that begins with that sound ( acrophony), continue to be used to varying degrees in Samaritan,
Aramaic The Aramaic languages, short Aramaic ( syc, ܐܪܡܝܐ, Arāmāyā; oar, 𐤀𐤓𐤌𐤉𐤀; arc, 𐡀𐡓𐡌𐡉𐡀; tmr, אֲרָמִית), are a language family containing many varieties (languages and dialects) that originated in ...
,
Syriac Syriac may refer to: *Syriac language, an ancient dialect of Middle Aramaic *Sureth, one of the modern dialects of Syriac spoken in the Nineveh Plains region * Syriac alphabet ** Syriac (Unicode block) ** Syriac Supplement * Neo-Aramaic languages a ...
,
Hebrew Hebrew (; ; ) is a Northwest Semitic language of the Afroasiatic language family. Historically, it is one of the spoken languages of the Israelites and their longest-surviving descendants, the Jews and Samaritans. It was largely preserved ...
,
Greek Greek may refer to: Greece Anything of, from, or related to Greece, a country in Southern Europe: *Greeks, an ethnic group. *Greek language, a branch of the Indo-European language family. **Proto-Greek language, the assumed last common ancestor ...
and
Arabic Arabic (, ' ; , ' or ) is a Semitic language spoken primarily across the Arab world.Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter ...
. The names were abandoned in
Latin Latin (, or , ) is a classical language belonging to the Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through the power of the ...
, which instead referred to the letters by adding a vowel (usually "e") before or after the consonant; the two exceptions were Y and Z, which were borrowed from the Greek alphabet rather than Etruscan, and were known as ''Y Graeca'' "Greek Y" and ''zeta'' (from Greek)—this discrepancy was inherited by many European languages, as in the term ''zed'' for Z in all forms of English other than American English. Over time names sometimes shifted or were added, as in ''double U'' for W ("double V" in French), the English name for Y, and American ''zee'' for Z. Comparing names in English and French gives a clear reflection of the
Great Vowel Shift The Great Vowel Shift was a series of changes in the pronunciation of the English language that took place primarily between 1400 and 1700, beginning in southern England and today having influenced effectively all dialects of English. Through ...
: A, B, C and D are pronounced in today's English, but in contemporary French they are . The French names (from which the English names are derived) preserve the qualities of the English vowels from before the Great Vowel Shift. By contrast, the names of F, L, M, N and S () remain the same in both languages, because "short" vowels were largely unaffected by the Shift. In Cyrillic originally the letters were given names based on Slavic words; this was later abandoned as well in favor of a system similar to that used in Latin. Letters of
Armenian alphabet The Armenian alphabet ( hy, Հայոց գրեր, ' or , ') is an alphabetic writing system used to write Armenian. It was developed around 405 AD by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. The system originally had ...
also have distinct letter names.


Orthography and pronunciation

When an alphabet is adopted or developed to represent a given language, an
orthography An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and ...
generally comes into being, providing rules for the
spelling Spelling is a set of conventions that regulate the way of using graphemes (writing system) to represent a language in its written form. In other words, spelling is the rendering of speech sound (phoneme) into writing (grapheme). Spelling is one ...
of words in that language. In accordance with the principle on which alphabets are based, these rules will generally map letters of the alphabet to the
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
s (significant sounds) of the spoken language. In a perfectly
phonemic orthography A phonemic orthography is an orthography (system for writing a language) in which the graphemes (written symbols) correspond to the phonemes (significant spoken sounds) of the language. Natural languages rarely have perfectly phonemic orthographi ...
there would be a consistent one-to-one correspondence between the letters and the phonemes, so that a writer could predict the spelling of a word given its pronunciation, and a speaker would always know the pronunciation of a word given its spelling, and vice versa. However, this ideal is not usually achieved in practice; some languages (such as Spanish and
Finnish Finnish may refer to: * Something or someone from, or related to Finland * Culture of Finland * Finnish people or Finns, the primary ethnic group in Finland * Finnish language, the national language of the Finnish people * Finnish cuisine See also ...
) come close to it, while others (such as English) deviate from it to a much larger degree. The pronunciation of a language often evolves independently of its writing system, and writing systems have been borrowed for languages they were not designed for, so the degree to which letters of an alphabet correspond to phonemes of a language varies greatly from one language to another and even within a single language. Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways: *A language may represent a given phoneme by a combination of letters rather than just a single letter. Two-letter combinations are called digraphs and three-letter groups are called trigraphs.
German German(s) may refer to: * Germany (of or related to) ** Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
uses the
tetragraph A tetragraph (from the el, τετρα-, ''tetra-'', "four" and γράφω, ''gráphō'', "write") is a sequence of four letters used to represent a single sound (phoneme), or a combination of sounds, that do not necessarily correspond to the indi ...
s (four letters) "tsch" for the phoneme and (in a few borrowed words) "dsch" for . Kabardian also uses a tetragraph for one of its phonemes, namely "кхъу". Two letters representing one sound occur in several instances in Hungarian as well (where, for instance, ''cs'' stands for ʃ ''sz'' for ''zs'' for ''dzs'' for ʒ. *A language may represent the same phoneme with two or more different letters or combinations of letters. An example is
modern Greek Modern Greek (, , or , ''Kiní Neoellinikí Glóssa''), generally referred to by speakers simply as Greek (, ), refers collectively to the dialects of the Greek language spoken in the modern era, including the official standardized form of the ...
which may write the phoneme in six different ways: , , , , , and . *A language may spell some words with unpronounced letters that exist for historical or other reasons. For example, the spelling of the Thai word for "beer" ��บียร์retains a letter for the final consonant "r" present in the English word it was borrowed from, but silences it. *Pronunciation of individual words may change according to the presence of surrounding words in a sentence (
sandhi Sandhi ( sa, सन्धि ' , "joining") is a cover term for a wide variety of sound changes that occur at morpheme or word boundaries. Examples include fusion of sounds across word boundaries and the alteration of one sound depending on near ...
). *Different dialects of a language may use different phonemes for the same word. *A language may use different sets of symbols or different rules for distinct sets of vocabulary items, such as the Japanese
hiragana is a Japanese syllabary, part of the Japanese writing system, along with ''katakana'' as well as ''kanji''. It is a phonetic lettering system. The word ''hiragana'' literally means "flowing" or "simple" kana ("simple" originally as contrast ...
and
katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script (known as rōmaji). The word ''katakana'' means "fragmentary kana", as the katakana characters are derived f ...
syllabaries, or the various rules in English for spelling words from Latin and Greek, or the original Germanic vocabulary. National languages sometimes elect to address the problem of dialects by associating the alphabet with the national standard. Some national languages like
Finnish Finnish may refer to: * Something or someone from, or related to Finland * Culture of Finland * Finnish people or Finns, the primary ethnic group in Finland * Finnish language, the national language of the Finnish people * Finnish cuisine See also ...
,
Armenian Armenian may refer to: * Something of, from, or related to Armenia, a country in the South Caucasus region of Eurasia * Armenians, the national people of Armenia, or people of Armenian descent ** Armenian Diaspora, Armenian communities across the ...
, Turkish,
Russian Russian(s) refers to anything related to Russia, including: *Russians (, ''russkiye''), an ethnic group of the East Slavic peoples, primarily living in Russia and neighboring countries *Rossiyane (), Russian language term for all citizens and peo ...
,
Serbo-Croatian Serbo-Croatian () – also called Serbo-Croat (), Serbo-Croat-Bosnian (SCB), Bosnian-Croatian-Serbian (BCS), and Bosnian-Croatian-Montenegrin-Serbian (BCMS) – is a South Slavic language and the primary language of Serbia, Croatia, Bosnia an ...
( Serbian, Croatian and Bosnian) and
Bulgarian Bulgarian may refer to: * Something of, from, or related to the country of Bulgaria * Bulgarians, a South Slavic ethnic group * Bulgarian language, a Slavic language * Bulgarian alphabet * A citizen of Bulgaria, see Demographics of Bulgaria * Bul ...
have a very regular spelling system with a nearly one-to-one correspondence between letters and phonemes. Strictly speaking, these national languages lack a word corresponding to the verb "to spell" (meaning to split a word into its letters), the closest match being a verb meaning to split a word into its syllables. Similarly, the
Italian Italian(s) may refer to: * Anything of, from, or related to the people of Italy over the centuries ** Italians, an ethnic group or simply a citizen of the Italian Republic or Italian Kingdom ** Italian language, a Romance language *** Regional Ita ...
verb corresponding to 'spell (out)', ''compitare'', is unknown to many Italians because spelling is usually trivial, as Italian spelling is highly phonemic. In standard Spanish, one can tell the pronunciation of a word from its spelling, but not vice versa, as certain phonemes can be represented in more than one way, but a given letter is consistently pronounced. French, with its
silent letter In an alphabetic writing system, a silent letter is a letter that, in a particular word, does not correspond to any sound in the word's pronunciation. In linguistics, a silent letter is often symbolised with a null sign . Null is an unprono ...
s and its heavy use of nasal vowels and elision, may seem to lack much correspondence between spelling and pronunciation, but its rules on pronunciation, though complex, are actually consistent and predictable with a fair degree of accuracy. At the other extreme are languages such as English, where the pronunciations of many words must be memorized as they do not correspond to the spelling in a consistent way. For English, this is partly because the
Great Vowel Shift The Great Vowel Shift was a series of changes in the pronunciation of the English language that took place primarily between 1400 and 1700, beginning in southern England and today having influenced effectively all dialects of English. Through ...
occurred after the orthography was established, and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels. Even English has general, albeit complex, rules that predict pronunciation from spelling, and these rules are successful most of the time; rules to predict spelling from the pronunciation have a higher failure rate. Sometimes, countries have the written language undergo a
spelling reform A spelling reform is a deliberate, often authoritatively sanctioned or mandated change to spelling rules. Proposals for such reform are fairly common, and over the years, many languages have undergone such reforms. Recent high-profile examples a ...
to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system itself, as when Turkey switched from the Arabic alphabet to a Latin-based
Turkish alphabet The Turkish alphabet ( tr, ) is a Latin-script alphabet used for writing the Turkish language, consisting of 29 letters, seven of which ( Ç, Ğ, I, İ, Ö, Ş and Ü) have been modified from their Latin originals for the phonetic require ...
, and as when Kazakh changes from an Arabic script to a Cyrillic script due to the Soviet Union's influence, and in 2021, having a transition to the Latin alphabet, just like Turkish. The Cyrillic script used to be official in Uzbekistan and Turkmenistan before they all switched to the Latin alphabets, including Uzbekistan that is having a reform of the alphabet to use diacritics on the letters that is marked by apostrophes and the letters that are digraphs. The standard system of symbols used by
linguist Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Linguis ...
s to represent sounds in any language, independently of orthography, is called the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation ...
.


See also

* Abecedarium * Acrophony *
Akshara Aksara (also ''akshara'', Devanagari अक्षर, IAST ''akṣara'') is a Sanskrit term translating to "imperishable, indestructible, fixed, immutable" (i.e. from अ, '' a-'' "not" and क्षर्, ''kṣar-'' "melt away, perish"). It h ...
*
Alphabet book An alphabet book is a type of children's book giving basic instruction in an alphabet. Intended for young children, alphabet books commonly use pictures, simple language and alliteration to aid language learning. Alphabet books are published ...
* Alphabet effect *
Alphabet song The alphabet song is any of various songs used to teach children an alphabet. Alphabet songs typically recite the names of all letters of the alphabet of a given language in order. The ABC (Verse 1) "The ABC Song", otherwise referred to as ...
*
Alphabetical order Alphabetical order is a system whereby character strings are placed in order based on the position of the characters in the conventional ordering of an alphabet. It is one of the methods of collation. In mathematics, a lexicographical order is t ...
*
Butterfly Alphabet The ''Butterfly Alphabet'' is a photographic artwork by the Norwegian naturalist Kjell Bloch Sandved. Sandved worked at the Smithsonian's National Museum of Natural History in Washington, D.C., and came up with the idea with Barbara Bedette, a ...
*
Character encoding Character encoding is the process of assigning numbers to Graphics, graphical character (computing), characters, especially the written characters of Language, human language, allowing them to be Data storage, stored, Data communication, transmi ...
*
Constructed script A constructed script is a new writing system specifically created by an individual or group, rather than having evolved as part of a language or culture like a natural script. Some are designed for use with constructed languages, although several ...
*
Fingerspelling Fingerspelling (or dactylology) is the representation of the letters of a writing system, and sometimes numeral systems, using only the hands. These manual alphabets (also known as finger alphabets or hand alphabets) have often been used in deaf ...
*
NATO phonetic alphabet The (International) Radiotelephony Spelling Alphabet, commonly known as the NATO phonetic alphabet, is the most widely used set of clear code words for communicating the letters of the Roman alphabet, technically a ''radiotelephonic spellin ...
*
Lipogram A lipogram (from grc, λειπογράμματος, ''leipográmmatos'', "leaving out a letter") is a kind of constrained writing or word game consisting of writing paragraphs or longer works in which a particular letter or group of letters is a ...
*
List of writing systems This is a list of writing systems (or scripts), classified according to some common distinguishing features. The usual name of the script is given first; the name of the language(s) in which the script is written follows (in brackets), particula ...
*
Pangram A pangram or holoalphabetic sentence is a sentence using every letter of a given alphabet at least once. Pangrams have been used to display typefaces, test equipment, and develop skills in handwriting, calligraphy, and keyboarding. Origins The ...
*
Thoth Thoth (; from grc-koi, Θώθ ''Thṓth'', borrowed from cop, Ⲑⲱⲟⲩⲧ ''Thōout'', Egyptian: ', the reflex of " eis like the Ibis") is an ancient Egyptian deity. In art, he was often depicted as a man with the head of an ibis or ...
*
Transliteration Transliteration is a type of conversion of a text from one script to another that involves swapping letters (thus ''trans-'' + '' liter-'') in predictable ways, such as Greek → , Cyrillic → , Greek → the digraph , Armenian → or L ...
*
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, wh ...


References


Bibliography

* * Overview of modern and some ancient writing systems. * * * Chapter 3 traces and summarizes the invention of alphabetic writing. * * * * * * * * Chapter 4 traces the invention of writing


External links


The Origins of abc"Language, Writing and Alphabet: An Interview with Christophe Rico"
''Damqātum 3'' (2007) *
Michael Everson Michael Everson (born January 9, 1963) is an American and Irish linguist, script encoder, typesetter, type designer and publisher. He runs a publishing company called Evertype, through which he has published over a hundred books since 2006. H ...
'
Alphabets of Europe
animation by Prof. Robert Fradkin at the
University of Maryland The University of Maryland, College Park (University of Maryland, UMD, or simply Maryland) is a public land-grant research university in College Park, Maryland. Founded in 1856, UMD is the flagship institution of the University System of M ...

How the Alphabet Was Born from Hieroglyphs
��Biblical Archaeology Review
An Early Hellenic AlphabetThe Alphabet
BBC Radio 4 discussion with Eleanor Robson, Alan Millard and Rosalind Thomas (''In Our Time'', 18 December 2003) {{Authority control Orthography