HOME

TheInfoList



OR:

Czech orthography is a system of rules for proper formal writing (
orthography An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and ...
) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech
theologian Theology is the systematic study of the nature of the divine and, more broadly, of religious belief. It is taught as an academic discipline, typically in universities and seminaries. It occupies itself with the unique content of analyzing the ...
and church reformist Jan Hus, the namesake of the Hussite movement, in one of his seminal works, ''
De orthographia bohemica ''De orthographia bohemica'' ( en, On Bohemian Orthography) is a Latin work published between 1406 and 1412. It is attributed to Charles University rector and reformer Jan Hus. The book codified the Czech language's modern spelling and orthograph ...
'' ( en, On Bohemian orthography). The modern Czech orthographic system is
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
, having evolved from an earlier system which used many digraphs (although some digraphs have been kept - ''ch, dž''). The caron is added to standard Latin letters to express sounds which are foreign to
Latin Latin (, or , ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through ...
. The
acute accent The acute accent (), , is a diacritic used in many modern written languages with alphabets based on the Latin, Cyrillic, and Greek scripts. For the most commonly encountered uses of the accent in the Latin and Greek alphabets, precomposed ...
is used for long
vowel A vowel is a syllabic speech sound pronounced without any stricture in the vocal tract. Vowels are one of the two principal classes of speech sounds, the other being the consonant. Vowels vary in quality, in loudness and also in quantity (len ...
s. The Czech orthography is considered the model for many other Balto-Slavic languages using the
Latin alphabet The Latin alphabet or Roman alphabet is the collection of letters originally used by the ancient Romans to write the Latin language. Largely unaltered with the exception of extensions (such as diacritics), it used to write English and the ...
; Slovak orthography being its direct revised descendant, while the Serbo-Croatian Gaj's Latin alphabet and its Slovene descendant system are largely based on it. All of them make use of similar diacritics and also have a similar, usually interchangeable, relationship between the letters and the sounds they are meant to represent.


Alphabet

The Czech alphabet consists of 42 letters. The letters ''Q'', ''W'', and ''X'' are used exclusively in foreign words, and the former two are replaced with ''KV'' and ''V'' once the word becomes “naturalized” (assimilated into Czech); the digraphs dz and are also used mostly for foreign words and are not considered to be distinct letters in the Czech alphabet.


Orthographic principles

Czech orthography is primarily phonemic (rather than phonetic) because an individual
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
usually corresponds to an individual
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
(rather than a
sound In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by ...
). However, some graphemes and letter groups are remnants of historical phonemes which were used in the past but have since merged with other phonemes. Some changes in the
phonology Phonology is the branch of linguistics that studies how languages or dialects systematically organize their sounds or, for sign languages, their constituent parts of signs. The term can also refer specifically to the sound or sign system of a ...
have not been reflected in the orthography.


Voicing assimilation

All the obstruent consonants are subject to voicing (before voiced obstruents except ) or devoicing (before voiceless consonants and at the end of words); spelling in these cases is morphophonemic (i.e. the morpheme has the same spelling as before a vowel). An exception is the cluster , in which the is voiced to only in
Moravian dialects Moravian dialects ( cs, moravská nářečí, moravština) are the varieties of Czech spoken in Moravia, a historical region in the east of the Czech Republic. There are more forms of the Czech language used in Moravia than in the rest of the Cz ...
, while in Bohemia the is devoiced to instead (e.g. ''shodit'' , in Moravia ). Devoicing changes its articulation place: it becomes . After unvoiced consonants is devoiced: for instance, in 'three', which is pronounced . Written voiced or voiceless counterparts are kept according to the etymology of the word, e.g. ''odpadnout'' (to fall away) - ''od-'' is a
prefix A prefix is an affix which is placed before the stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy''. Particul ...
; written is devoiced here because of the following voiceless . For historical reasons, the consonant is written ''k'' in Czech words like ''kde'' ('where', < Proto-Slavic *kъdě) or ''kdo'' ('who', < Proto-Slavic *kъto). This is because the letter ''g'' was historically used for the consonant . The original Slavic phoneme changed into in the Old-Czech period. Thus, is not a separate phoneme (with a corresponding grapheme) in words of domestic origin; it occurs only in foreign words (e.g. ''graf'', ''gram'', etc.).


Final devoicing

Unlike in
English English usually refers to: * English language * English people English may also refer to: Peoples, culture, and language * ''English'', an adjective for something of, from, or related to England ** English national ...
but like
German German(s) may refer to: * Germany (of or related to) **Germania (historical use) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizens of Germany, see also German nationality law **Ge ...
and Russian, voiced consonants are pronounced voicelessly in the final position in words. In
declension In linguistics, declension (verb: ''to decline'') is the changing of the form of a word, generally to express its syntactic function in the sentence, by way of some inflection. Declensions may apply to nouns, pronouns, adjectives, adverbs, and ...
, they are voiced in cases where the words take on endings. Compare: : ''led'' – ''ledy'' (ice – ices) : ''let'' – ''lety'' (flight – flights)


"Soft" I and "hard" Y

The letters and are both pronounced , while and are both pronounced . was originally pronounced as in contemporary Polish. However, in the 14th century, this difference in standard pronunciation disappeared, though it has been preserved in some
Moravian dialects Moravian dialects ( cs, moravská nářečí, moravština) are the varieties of Czech spoken in Moravia, a historical region in the east of the Czech Republic. There are more forms of the Czech language used in Moravia than in the rest of the Cz ...
. In words of native origin "soft" and cannot follow "hard" consonants, while "hard" and cannot follow "soft" consonants; "neutral" consonants can be followed by either vowel: When or is written after in native words, these consonants are soft, as if they were written . That is, the sounds are written instead of , e.g. in ''čeština'' . The sounds are denoted, respectively, by . In words of foreign origin, are pronounced ; that is, as if they were written , e.g. in ''diktát'', dictation. Historically the letter was hard, but this changed in the 19th century. However, in some words it is still followed by the letter : ''tác'' (plate) – ''tácy'' (plates). Because neutral consonants can be followed by either or , in some cases they distinguish
homophone A homophone () is a word that is pronounced the same (to varying extent) as another word but differs in meaning. A ''homophone'' may also differ in spelling. The two words may be spelled the same, for example ''rose'' (flower) and ''rose'' (pa ...
s, e.g. ''být'' (to be) vs. ''bít'' (to beat), ''mýt'' (to wash) vs. ''mít'' (to have). At school pupils must memorize word
root In vascular plants, the roots are the organs of a plant that are modified to provide anchorage for the plant and take in water and nutrients into the plant body, which allows plants to grow taller and faster. They are most often below the su ...
s and
prefix A prefix is an affix which is placed before the stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy''. Particul ...
es where is written; is written in other cases. Writing or in endings is dependent on the declension patterns.


Letter Ě

The letter is a vestige of Old Czech palatalization. The originally palatalizing
phoneme In phonology and linguistics, a phoneme () is a unit of sound that can distinguish one word from another in a particular language. For example, in most dialects of English, with the notable exception of the West Midlands and the north-wes ...
/ě/ became extinct, changing to or , but it is preserved as a
grapheme In linguistics, a grapheme is the smallest functional unit of a writing system. The word ''grapheme'' is derived and the suffix ''-eme'' by analogy with ''phoneme'' and other names of emic units. The study of graphemes is called '' graphemi ...
which can never appear in the initial position. * are written instead of , analogously to * are usually written instead of ** In words like ''vjezd'' (entry, drive-in) ''objem'' (volume), are written because in such cases ''–je-'' is etymologically preceded by the prefixes '' v-'' or '' ob-'' * is usually written instead of , except for morphological reasons in some words (''jemný'', soft -> ''jemně'', softly) ** The first-person singular pronouns ''mě'' (for the genitive and accusative cases) and ''mně'' (for the dative and locative) are homophones —see Czech declension


Letter Ů

There are two ways in Czech to write long : and . cannot occur in an initial position, while occurs almost exclusively in the initial position or at the beginning of a word
root In vascular plants, the roots are the organs of a plant that are modified to provide anchorage for the plant and take in water and nutrients into the plant body, which allows plants to grow taller and faster. They are most often below the su ...
in a compound. Historically, long changed into the
diphthong A diphthong ( ; , ), also known as a gliding vowel, is a combination of two adjacent vowel sounds within the same syllable. Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech ...
(as also happened in the English Great Vowel Shift with words such as "house"), though not in word-initial position in the prestige form. In 1848 at the beginning of word-roots was changed into in words like to reflect this. Thus, the letter is written at the beginning of word-roots only: ''úhel'' (angle), ''trojúhelník'' (triangle), except in loanwords: ''skútr'' (scooter). Meanwhile, historical long changed into the diphthong . As was common with scribal abbreviations, the letter in the diphthong was sometimes written as a ring above the letter , producing , e.g. ''kóň > kuoň > kůň'' (horse), like the origin of the German umlaut. Later, the pronunciation changed into , but the grapheme has remained. It never occurs at the beginning of words: ''dům'' (house), ''domů'' (home, homeward). The letter now has the same pronunciation as the letter (long ), but alternates with a short when a word is inflected (e.g. nom. '' kůň'' → gen. ''koně'', nom. ''dům'' → gen. ''domu''), thus showing the historical evolution of the language.


Agreement between the subject and the predicate

The
predicate Predicate or predication may refer to: * Predicate (grammar), in linguistics * Predication (philosophy) * several closely related uses in mathematics and formal logic: **Predicate (mathematical logic) **Propositional function **Finitary relation, o ...
must be always in accordance with the subject in the sentence - in
number A number is a mathematical object used to count, measure, and label. The original examples are the natural numbers 1, 2, 3, 4, and so forth. Numbers can be represented in language with number words. More universally, individual number ...
and
person A person ( : people) is a being that has certain capacities or attributes such as reason, morality, consciousness or self-consciousness, and being a part of a culturally established form of social relations such as kinship, ownership of prope ...
(personal
pronoun In linguistics and grammar, a pronoun ( abbreviated ) is a word or a group of words that one may substitute for a noun or noun phrase. Pronouns have traditionally been regarded as one of the parts of speech, but some modern theorists would not ...
s), and with past and passive
participle In linguistics, a participle () (from Latin ' a "sharing, partaking") is a nonfinite verb form that has some of the characteristics and functions of both verbs and adjectives. More narrowly, ''participle'' has been defined as "a word derived from ...
s also in
gender Gender is the range of characteristics pertaining to femininity and masculinity and differentiating between them. Depending on the context, this may include sex-based social structures (i.e. gender roles) and gender identity. Most culture ...
. This grammatical principle affects the orthography (see also "Soft" I and "Hard" Y) – it is especially important for the correct choice and writing of plural endings of the
participle In linguistics, a participle () (from Latin ' a "sharing, partaking") is a nonfinite verb form that has some of the characteristics and functions of both verbs and adjectives. More narrowly, ''participle'' has been defined as "a word derived from ...
s. Examples: The mentioned example shows both past (byl, byla ...) and passive (koupen, koupena ...) participles. The accordance in gender takes effect in the past tense and the passive voice, not in the present and future tenses in active voice. If the complex subject is a combination of nouns of different genders, masculine animate gender is prior to others and the masculine inanimate and feminine genders are prior to the neuter gender. Examples: :''muži a ženy byli'' - men and women were :''kočky a koťata byly'' - cats and kittens were :''my jsme byli'' (my = we all/men) vs. ''my jsme byly'' (my = we women) - we were Priority of genders: :masculine animate > masculine inanimate & feminine > neuter


Punctuation

The use of the
full stop The full stop (Commonwealth English), period (North American English), or full point , is a punctuation mark. It is used for several purposes, most often to mark the end of a declarative sentence (as distinguished from a question or exclamatio ...
(.), the colon (:), the
semicolon The semicolon or semi-colon is a symbol commonly used as orthographic punctuation. In the English language, a semicolon is most commonly used to link (in a single sentence) two independent clauses that are closely related in thought. When a ...
(;), the
question mark The question mark (also known as interrogation point, query, or eroteme in journalism) is a punctuation mark that indicates an interrogative clause or phrase in many languages. History In the fifth century, Syriac Bible manuscripts used ...
(?) and the exclamation mark (!) is similar to their use in other European languages. The full stop is placed after a number if it stands for ordinal numerals (as in German), e.g. ''1. den'' (= ''první den'') – the 1st day. The
comma The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline ...
is used to separate individual parts in
complex-compound sentence In grammar, sentence and clause structure, commonly known as sentence composition, is the classification of sentences based on the number and kind of clauses in their syntactic structure. Such division is an element of traditional grammar. Typ ...
s, lists, isolated parts of sentences, etc. Its use in Czech is different from English. Subordinate (dependent) clauses must be always separated from their principal (independent) clauses, for instance. A comma is not placed before ''a'' (and), ''i'' (as well as), ''ani'' (nor) and ''nebo'' (or) when they connect parts of sentences or clauses in copulative conjunctions (on a same level). It must be placed in non-copulative conjunctions (consequence, emphasis, exclusion, etc.). A comma can, however, occur in front of the word ''a'' (and) if the former is part of comma-delimited parenthesis: ''Jakub, můj mladší bratr, a jeho učitel Filip byli příliš zabráni do rozhovoru. Probírali látku, která bude u zkoušky, a též, kdo na ní bude.'' A comma also separates subordinate conjunctions introduced by composite conjunctions ''a proto'' (and therefore) and ''a tak'' (and so). Examples: * ''otec a matka'' – father and mother, ''otec nebo matka'' – father or mother (coordinate relation – no commas) * ''Je to pravda, nebo ne?'' – Is it true, or not? (exclusion) * ''Pršelo, a proto nikdo nepřišel.'' – It was raining, and this is why nobody came. (consequence) * ''Já vím, kdo to je.'' – I know who it is. ''Myslím, že se mýlíš.'' – I think (that) you are wrong. (subordinate relation) * ''Jak se máš, Anno?'' – How are you, Anna? (addressing a person) * ''Karel IV., římský císař a český král, založil hrad Karlštejn.'' – Charles IV, Holy Roman Emperor and Bohemian king, founded the
Karlštejn Castle Karlštejn Castle ( cs, hrad Karlštejn; german: Burg Karlstein) is a large Gothic castle founded in 1348 by Charles IV, Holy Roman Emperor-elect and King of Bohemia. The castle served as a place for safekeeping the Imperial Regalia as well ...
. (comma-delimited parenthesis) Quotation marks. The first one preceding the quoted text is placed to the bottom line: * ''Petr řekl: „Přijdu zítra.“'' – Peter said: "I'll come tomorrow." Other types of quotation marks: ‚‘ »«
Apostrophe The apostrophe ( or ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes: * The marking of the omission of one ...
s are used rarely in Czech. They can denote a missing sound in non-standard speech, but it is optional, e.g. ''řek or ''řek'' (= ''řekl'', he said).


Capital letters

The first word of every sentence and all proper names are capitalized. Special cases are: * Respect expression – optional: ''Ty'' (you sg.), ''Tvůj'' (your sg.), ''Vy'' (you pl.), ''Váš'' (your pl.); ''Bůh'' (God), ''Mistr'' (Master), etc. * Headings – The first word is capitalized. * Cities, towns and villages – All words are capitalized, except for prepositions: '' Nové Město nad Metují'' (New-Town-upon-Metuje). * Geographical or local names – The first word is capitalized, common names as ''ulice'' (street), ''náměstí'' (square) or ''moře'' (sea) are not capitalized: ''ulice Svornosti'' (Concordance Street), ''Václavské náměstí'' (Wenceslas Square), ''Severní moře'' (North Sea). Since 1993, the initial preposition and the first following word are capitalized: ''lékárna U Černého orla'' (Black Eagle Pharmacy). * Official names of institutions – The first word is capitalized: ''Městský úřad v Kolíně'' (The Municipal Office in Kolín) vs. ''městský úřad'' (a municipal office). In some cases, a initial common name is not capitalized even if it is factually a part of the name: ''okres Semily'' (Semily District), ''náměstí Míru'' (Peace Square). * Names of nations and nationality nouns are capitalized: ''Anglie'' (England), ''Angličan'' (Englishman), ''Německo'' (Germany), ''Němec'' (German). Adjectives derived from geographical names and names of nations, such as ''anglický'' (English – adjective) and ''pražský'' (Prague – adjective, e.g. ''pražské metro'', Prague subway), are not. Names of languages are not capitalized: ''angličtina'' (English). * Possessive adjectives derived from proper names are capitalized: ''Pavlův dům'' (Paul's house). * Brands are capitalized as a trademark or company name, but usually not as product names: ''přijel trabant a několik škodovek'' but ''přijelo auto značky Trabant a několik aut značky Škoda'', ''zákaz vjezdu segwayů'' but ''zákaz vjezdu vozítek Segway'' * If a proper name contains other proper names, the inner proper names keep their orthography: ''Poslanecká sněmovna Parlamentu České republiky'', ''Kostelec nad Černými lesy'', ''Filozofická fakulta Jihočeské univerzity v Českých Budějovicích''


History

In the 9th century, Glagolitic script was used, during the 11th century it was replaced by Latin script. There are five periods in the development of the Czech Latin-based orthographic system: ;Primitive orthography : For writing sounds which are foreign to the
Latin Latin (, or , ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally a dialect spoken in the lower Tiber area (then known as Latium) around present-day Rome, but through ...
alphabet An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages. Not all writing systems represent language in this way; in a syllabary, each character represents a syllab ...
, letters with similar sounds were used. The oldest known written notes in Czech originate from the 11th century. The literature was written predominantly in Latin in this period. Unfortunately, it was very ambiguous at times, with ''c'', for example, being used for ''c'', ''č'', and ''k''. ;Digraphic orthography : Various digraphs were used for non-Latin sounds. The system was not consistent and it also did not distinguish long and short vowels. It had some features that
Polish orthography Polish orthography is the system of writing the Polish language. The language is written using the Polish alphabet, which derives from the Latin alphabet, but includes some additional letters with diacritics. The orthography is mostly phonetic, ...
has kept, such as ''cz'', ''rz'' instead of ''č'', ''ř'', but was still crippled by ambiguities, such as spelling both ''s'' and ''š'' as ''s''/''ss'', ''z'' and ''ž'' as ''z'', and sometimes even ''c'' and ''č'' both as ''cz'', only distinguishing by context. Long vowels such as ''á'' were sometimes (but not always) written double as ''aa''. Other features of the day included spelling ''j'' as ''g'' and ''v'' as ''w'', as the early modern Latin alphabet had not by then distinguished ''j'' from ''i'' or ''v'' from ''u''. ;Diacritic orthography : Introduced probably by Jan Hus. Using
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s for long vowels ("virgula", an
acute Acute may refer to: Science and technology * Acute angle ** Acute triangle ** Acute, a leaf shape in the glossary of leaf morphology * Acute (medicine), a disease that it is of short duration and of recent onset. ** Acute toxicity, the adverse ef ...
, "čárka" in Czech) and "soft" consonants ("punctus rotundus", a dot above a letter, which has survived in Polish ''ż'') was suggested for the first time in ''"
De orthographia Bohemica ''De orthographia bohemica'' ( en, On Bohemian Orthography) is a Latin work published between 1406 and 1412. It is attributed to Charles University rector and reformer Jan Hus. The book codified the Czech language's modern spelling and orthograph ...
"'' around 1406. Diacritics replaced digraphs almost completely. It was also suggested that the Prague dialect should become the standard for Czech. Jan Hus is considered to be the author of that work but there is some uncertainty about this. ;Brethren orthography : The
Bible of Kralice The Bible of Kralice, also called the Kralice Bible ( cs, Bible kralická), was the first complete translation of the Bible from the original languages into Czech. Translated by the Unity of the Brethren and printed in Kralice nad Oslavou, t ...
(1579–1593), the first complete Czech translation of the
Bible The Bible (from Koine Greek , , 'the books') is a collection of religious texts or scriptures that are held to be sacred in Christianity Christianity is an Abrahamic monotheistic religion based on the life and teachings of Jesus ...
from the original languages by the Czech Brethren, became the model for the literary form of the language. The punctus rotundus was replaced by the caron ("háček"). There were some differences from the current orthography, e.g. the digraph ſſ was used instead of ''š''; ''ay, ey, au'' instead of ''aj, ej, ou''; ''v'' instead of ''u'' (at the beginning of words); ''w'' instead of ''v''; ''g'' instead of ''j''; and ''j'' instead of ''í'' ( = ''její'', hers). ''Y'' was written always after ''c, s'' and ''z'' (e.g. ''cizí'', foreign, was written ''cyzý'') and the conjunction ''i'' (as well as, and) was written ''y''. ;Modern orthography : During the period of the Czech National Renaissance (end of the 18th century and the first half of the 19th century), Czech linguists ( Josef Dobrovský et al.) codified some reforms in the orthography. These principles have been effective up to the present day. The later reforms in the 20th century mostly referred to introducing loanwords into Czech and their adaptation to the Czech orthography.


Computer encoding

In
computing Computing is any goal-oriented activity requiring, benefiting from, or creating computing machinery. It includes the study and experimentation of algorithmic processes, and development of both hardware and software. Computing has scientific, ...
, several different coding standards have existed for this alphabet, among them: * ISO 8859-2 * Microsoft Windows code page 1250 * IBM PC code page 852 * Kamenický brothers or KEYBCS2 on early DOS PCs and on Fidonet. *
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...


See also

*
Czech language Czech (; Czech ), historically also Bohemian (; ''lingua Bohemica'' in Latin), is a West Slavic language of the Czech–Slovak group, written in Latin script. Spoken by over 10 million people, it serves as the official language of the Czech Re ...
* Czech phonology * Orthographia bohemica * Czech declension * Czech verb * Czech word order *
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standardized representation ...
* Phonemic orthography * Háček *
Kroužek A ring diacritic may appear above or below letters. It may be combined with some letters of the extended Latin alphabets in various contexts. Rings Distinct letter The character Å (å) is derived from an A with a ring. It is a distinct let ...
* Non-English usage of quotation marks


References


External links


Czech Language
an
list of known encodings
(in Czech)
Typo.cz
Information on Central European typography and typesetting {{Language orthographies Czech language Indo-European Latin-script orthographies