An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation.

Most transnational languages in the modern period have a writing system, and most of these systems have undergone substantial standardization, thus exhibiting less dialect variation than the spoken language. These processes can fossilize pronunciation patterns that are no longer routinely observed in speech (e.g., "would" and "should"); they can also reflect deliberate efforts to introduce variability for the sake of national identity, as seen in Noah Webster's efforts to introduce easily noticeable differences between American and British spelling (e.g., "honor" and "honour").

Some nations (e.g. France and Spain) have established language academies in an attempt to regulate orthography officially. For most languages (including English), however, there are no such authorities, and a sense of 'correct' orthography evolves through encounters with print in schooling, workplace, and informal contexts. Some organizations, however, such as newspapers of record or academic journals, opt for greater orthographic homogeneity by enforcing a particular style guide.

Etymology and meaning

The English word ''orthography'' dates from the 15th century. It comes from the French ''orthographie'', from Latin ''orthographia'', which derives from Ancient Greek ὀρθός (''orthós'', 'correct') and γράφειν (''gráphein'', 'to write').

Orthography is largely concerned with matters of spelling, and in particular the relationship between phonemes and graphemes in a language. Other elements that may be considered part of orthography include hyphenation, capitalization, word breaks, emphasis, and punctuation. Orthography thus describes or defines the set of symbols used in writing a language and the conventions that broadly regulate their use.

Most natural languages developed as oral languages, and writing systems have usually been crafted or adapted as ways of representing the spoken language. The rules for doing this tend to become standardized for a given language, leading to the development of an orthography that is generally considered "correct". In linguistics, the term ''orthography'' is often used to refer to any method of writing a language, without judgment as to right and wrong, with a scientific understanding that orthographic standardization exists on a spectrum of strength of convention. The original sense of the word, though, implies a dichotomy of correct and incorrect, and the word is still most often used to refer specifically to a thoroughly standardized, prescriptively correct way of writing a language. A distinction may be made here between ''etic'' and ''emic'' viewpoints: the purely descriptive (etic) approach, which simply considers any system that is actually used, and the emic view, which takes account of language users' perceptions of correctness.

Units and notation

Orthographic units, such as letters of an alphabet, are technically called graphemes. These are a type of abstraction, analogous to the phonemes of spoken languages; different physical forms of written symbols are considered to represent the same grapheme if the differences between them are not significant for meaning. Thus, a grapheme can be regarded as an abstraction of a collection of glyphs that are all functionally equivalent.

For example, in written English (or other languages using the Latin alphabet), there are two different physical representations (glyphs) of the lowercase Latin letter 'a': a and ɑ. Since, however, the substitution of either of them for the other cannot change the meaning of a word, they are considered to be allographs of the same grapheme, which can be written ⟨a⟩. The italic and bold face forms are also allographic.

Graphemes or sequences of them are sometimes placed between angle brackets, as in ⟨b⟩ or ⟨back⟩. This distinguishes them from phonemic transcription, which is placed between slashes (/b/, /bæk/), and from phonetic transcription, which is placed between square brackets ([b], [bæk]).
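The relationship between glyphs and graphemes can be sketched in code. The example below is only a rough analogy: it assumes that Unicode compatibility normalization (NFKC) is an acceptable stand-in for "folding allographs together", which holds for the styled mathematical letters used here but is not a general model of allography. The helper names `to_grapheme` and `notate` are hypothetical and chosen for illustration.

```python
import unicodedata


def to_grapheme(glyph: str) -> str:
    """Fold a styled glyph to its plain base letter using NFKC normalization."""
    return unicodedata.normalize("NFKC", glyph)


def notate(text: str, level: str) -> str:
    """Wrap text in the delimiters conventionally used for the given level."""
    delimiters = {
        "grapheme": ("⟨", "⟩"),   # angle brackets for graphemes
        "phonemic": ("/", "/"),   # slashes for phonemic transcription
        "phonetic": ("[", "]"),   # square brackets for phonetic transcription
    }
    left, right = delimiters[level]
    return f"{left}{text}{right}"


bold_a = "\U0001D41A"    # MATHEMATICAL BOLD SMALL A
italic_a = "\U0001D44E"  # MATHEMATICAL ITALIC SMALL A

# Three visually different glyphs that all fold to the same grapheme.
for glyph in ("a", bold_a, italic_a):
    print(repr(glyph), "->", notate(to_grapheme(glyph), "grapheme"))
```

Each of the three glyphs maps to the same bracketed grapheme, mirroring the statement that bold and italic forms are allographs of one unit.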

Types

The writing systems on which orthographies are based can be divided into a number of types, depending on what type of unit each symbol serves to represent. The principal types are ''logographic'' (with symbols representing words or morphemes), ''syllabic'' (with symbols representing syllables), and ''alphabetic'' (with symbols roughly representing phonemes). Many writing systems combine features of more than one of these types, and a number of detailed classifications have been proposed. Japanese is an example of a writing system that can be written using a combination of logographic kanji characters and syllabic hiragana and katakana characters; as with many non-alphabetic languages, alphabetic romaji characters may also be used as needed.
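As an illustration of how one Japanese sentence mixes these types, the following sketch labels each character by inspecting its Unicode character name. This is a simplification under stated assumptions: the function name `symbol_type` is hypothetical, only the main CJK ideograph block and the basic kana and Latin letters are handled, and marks such as the prolonged sound mark are not treated specially.

```python
import unicodedata


def symbol_type(ch: str) -> str:
    """Classify a character by the kind of unit its script typically represents."""
    name = unicodedata.name(ch, "")
    if name.startswith("CJK UNIFIED IDEOGRAPH"):
        return "logographic (kanji)"
    if name.startswith("HIRAGANA"):
        return "syllabic (hiragana)"
    if name.startswith("KATAKANA"):
        return "syllabic (katakana)"
    if name.startswith("LATIN"):
        return "alphabetic (romaji)"
    return "other"


# A short sentence mixing kanji, hiragana, and katakana.
sample = "東京でパンを買う"
for ch in sample:
    print(ch, symbol_type(ch))
```

Classifying by character name keeps the sketch dependency-free; a production tool would more likely rely on the Unicode Script property.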

Correspondence with pronunciation

Orthographies that use alphabets and syllabaries are based on the principle that the written symbols (graphemes) correspond to units of sound of the spoken language: phonemes in the former case, and syllables in the latter. However, in virtually all cases, this correspondence is not exact. Different languages' orthographies offer different degrees of correspondence between spelling and pronunciation. English orthography, French orthography and Danish orthography, for example, are highly irregular, whereas the orthographies of languages such as Russian, German and Spanish represent pronunciation much more faithfully, although the correspondence between letters and phonemes is still not exact. Finnish, Turkish and Serbo-Croatian orthographies more consistently approximate the principle "one letter per sound."

An orthography in which the correspondences between spelling and pronunciation are highly complex or inconsistent is called a ''deep orthography'' (or less formally, the language is said to have ''irregular spelling''). An orthography with relatively simple and consistent correspondences is called ''shallow'' (and the language has ''regular spelling'').

One of the main reasons why spelling and pronunciation diverge is that sound changes taking place in the spoken language are not always reflected in the orthography, and hence spellings correspond to historical rather than present-day pronunciation. One consequence of this is that many spellings come to reflect a word's morphophonemic structure rather than its purely phonemic structure (for example, the English regular past tense morpheme is consistently spelled ''-ed'' in spite of its different pronunciations in various words).

The syllabary systems of Japanese (hiragana and katakana) are examples of almost perfectly shallow orthographies: the kana correspond with almost perfect consistency to the spoken syllables, although with a few exceptions where symbols reflect historical or morphophonemic features. Notable among these are the use of ぢ ''ji'' and づ ''zu'' (rather than じ ''ji'' and ず ''zu'', their pronunciation in standard Tokyo dialect) when the character is a voicing of an underlying ち or つ (see rendaku), and the use of は, を, and へ to represent the sounds わ, お, and え, as relics of historical kana usage.

The Korean ''hangul'' system was also originally an extremely shallow orthography, but as a representation of the modern language it frequently also reflects morphophonemic features.

For full discussion of degrees of correspondence between spelling and pronunciation in alphabetic orthographies, including reasons why such correspondence may break down, see Phonemic orthography.
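The ''-ed'' example can be made concrete with a small sketch. It assumes the usual textbook description of the regular English past tense: the ending is pronounced /ɪd/ after /t/ or /d/, /t/ after other voiceless consonants, and /d/ elsewhere. The phoneme inventory below is deliberately incomplete, the function name `past_tense_ending` is hypothetical, and the stem-final phonemes are supplied by hand because they cannot be read reliably off English spelling.

```python
# Voiceless consonants other than /t/ (a partial, illustrative inventory).
VOICELESS = {"p", "k", "f", "s", "ʃ", "tʃ", "θ"}


def past_tense_ending(final_phoneme: str) -> str:
    """Return the pronunciation of regular '-ed' given the stem's final phoneme."""
    if final_phoneme in {"t", "d"}:
        return "ɪd"  # an extra syllable separates the two alveolar stops
    if final_phoneme in VOICELESS:
        return "t"   # the ending takes on the stem's voicelessness
    return "d"       # voiced consonants and vowels


# Stem spelling -> final phoneme of the stem (entered manually).
examples = {"walk": "k", "play": "eɪ", "want": "t", "call": "l"}
for stem, final in examples.items():
    print(f"{stem}ed", "/" + past_tense_ending(final) + "/")
```

The spelling stays constant as ''-ed'' while the function returns three different pronunciations, which is the sense in which the spelling tracks the morpheme rather than the phonemes.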

Defective orthographies

An orthography based on the principle that symbols correspond to phonemes may, in some cases, lack characters to represent all the phonemes or all the phonemic distinctions in the language. This is called a defective orthography. An example in English is the lack of any indication of stress. Another is the digraph ''th'', which represents two different phonemes (as in ''then'' and ''thin'') and replaced the old letters ''ð'' and ''þ''. A more systematic example is that of abjads like the Arabic and Hebrew alphabets, in which the short vowels are normally left unwritten and must be inferred by the reader.

When an alphabet is borrowed from its original language for use with a new language (as has been done with the Latin alphabet for many languages, or Japanese katakana for non-Japanese words), it often proves defective in representing the new language's phonemes. Sometimes this problem is addressed by the use of such devices as digraphs (such as ''sh'' and ''ch'' in English, where pairs of letters represent single sounds), diacritics (like the caron on the letters ''š'' and ''č'', which represent those same sounds in Czech), or the addition of completely new symbols (as some languages have introduced the letter ''w'' to the Latin alphabet) or of symbols from another alphabet, such as the rune ''þ'' in Icelandic.

After the classical period, Greek developed a lowercase letter system that introduced diacritic marks to enable foreigners to learn pronunciation and, in some cases, grammatical features. However, as pronunciation of letters changed over time, the diacritic marks were reduced to representing the stressed syllable. In Modern Greek typesetting, this system has been simplified to only have a single accent to indicate which syllable is stressed.
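The ambiguity that arises when vowels are left unwritten can be imitated with English words. The sketch below is an analogy only, not a model of Arabic or Hebrew: it assumes that deleting the five Latin vowel letters is a fair stand-in for omitting short vowels, and the names `skeleton` and `group_by_skeleton` and the word list are hypothetical.

```python
from collections import defaultdict

VOWELS = set("aeiou")


def skeleton(word: str) -> str:
    """Return the word with its vowel letters removed (its consonant skeleton)."""
    return "".join(ch for ch in word.lower() if ch not in VOWELS)


def group_by_skeleton(words):
    """Group words that become indistinguishable once vowels are dropped."""
    groups = defaultdict(list)
    for word in words:
        groups[skeleton(word)].append(word)
    return dict(groups)


words = ["bat", "bet", "bit", "but", "band", "bend", "bond"]
for consonants, candidates in group_by_skeleton(words).items():
    print(consonants, "->", candidates)
```

Every word in a group shares one written form, so a reader has to rely on context to decide which word is meant, much as the text describes for unwritten short vowels.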
