C0 Control Characters
   HOME

TheInfoList



OR:

C, or c, is the third letter of the
Latin alphabet The Latin alphabet, also known as the Roman alphabet, is the collection of letters originally used by the Ancient Rome, ancient Romans to write the Latin language. Largely unaltered except several letters splitting—i.e. from , and from � ...
, used in the modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is ''cee'' (pronounced ), plural ''cees''.


History

"C" comes from the same letter as "G". The Semites named it gimel. The sign is possibly adapted from an
Egyptian hieroglyph Ancient Egyptian hieroglyphs ( ) were the formal writing system used in Ancient Egypt for writing the Egyptian language. Hieroglyphs combined ideographic, logographic, syllabic and alphabetic elements, with more than 1,000 distinct characters. ...
for a staff sling, which may have been the meaning of the name ''gimel''. Another possibility is that it depicted a camel, the Semitic name for which was ''gamal''. Barry B. Powell, a specialist in the history of writing, states "It is hard to imagine how gimel = "camel" can be derived from the picture of a camel (it may show his hump, or his head and neck!)". In the
Etruscan language Etruscan ( ) was the language of the Etruscan civilization in the ancient region of Etruria, in Etruria Padana and Etruria Campana in what is now Italy. Etruscan influenced Latin but was eventually superseded by it. Around 13,000 Etruscan epigraph ...
,
plosive consonant In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
s had no contrastive voicing, so the Greek ' Γ' (Gamma) was adopted into the
Etruscan alphabet The Etruscan alphabet was used by the Etruscans, an ancient civilization of central and northern Italy, to write Etruscan language, their language, from about 700 BC to sometime around 100 AD. The Etruscan alphabet derives from the Euboean alpha ...
to represent . Already in the
Western Greek alphabet Many local variants of the Greek alphabet were employed in ancient Greece during the Archaic Greece, archaic and Classical Greece, early classical periods, until around 400 BC, when they were replaced by the classical 24-letter alphabet that ...
, Gamma first took a '' form in Early Etruscan, then '' in Classical Etruscan. In
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, it eventually took the '' form in
Classical Latin Classical Latin is the form of Literary Latin recognized as a Literary language, literary standard language, standard by writers of the late Roman Republic and early Roman Empire. It formed parallel to Vulgar Latin around 75 BC out of Old Latin ...
. In the earliest Latin inscriptions, the letters '' were used to represent the sounds and (which were not differentiated in writing). Of these, '' was used to represent or before a rounded vowel, '' before '', and '' elsewhere. During the 3rd century BC, a modified character was introduced for , and '' itself was retained for . The use of '' (and its variant '') replaced most usages of '' and ''. Hence, in the classical period and after, '' was treated as the equivalent of Greek gamma, and '' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'ΚΑΔΜΟΣ', 'ΚΥΡΟΣ', and 'ΦΩΚΙΣ' came into Latin as '', '' and '', respectively. Other alphabets have letters homoglyphic to 'c' but not analogous in use and derivation, like the
Cyrillic The Cyrillic script ( ) is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Ea ...
letter Es (С, с) which derives from the lunate
sigma Sigma ( ; uppercase Σ, lowercase σ, lowercase in word-final position ς; ) is the eighteenth letter of the Greek alphabet. In the system of Greek numerals, it has a value of 200. In general mathematics, uppercase Σ is used as an operator ...
.


Later use

When the Roman alphabet was introduced into Britain, represented only , and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh, Irish, and Gaelic, represents only . The Old English Latin-based writing system was learned from the Celts, apparently of Ireland; hence, in Old English also originally represented ; the Modern English words ''kin'', ''break'', ''broken'', ''thick'', and ''seek'' all come from Old English words written with : , and . However, during the course of the Old English period, before front vowels ( and ) was palatalized, having changed by the tenth century to , though was still used, as in . On the continent, meanwhile, a similar phonetic change before the same two vowels had also been going on in almost all modern Romance languages (for example, in Italian). In Vulgar Latin, became palatalized to in Italy and Dalmatia; in France and the
Iberian Peninsula The Iberian Peninsula ( ), also known as Iberia, is a peninsula in south-western Europe. Mostly separated from the rest of the European landmass by the Pyrenees, it includes the territories of peninsular Spain and Continental Portugal, comprisin ...
, it became . Yet for these new sounds, was still used before the letters and . The letter thus represented two distinct values. Subsequently, the Latin phoneme (spelled ) de-labialized to , meaning that the various Romance languages had before front vowels. In addition, Norman used the letter so that the sound could be represented by either or , the latter of which could represent either or depending on whether it preceded a front vowel letter or not. The convention of using both and was applied to the writing of English after the
Norman Conquest The Norman Conquest (or the Conquest) was the 11th-century invasion and occupation of England by an army made up of thousands of Normans, Norman, French people, French, Flemish people, Flemish, and Bretons, Breton troops, all led by the Du ...
, causing a considerable re-spelling of the Old English words. Thus, while Old English , and , remained unchanged, , and , were now (without any change of sound) spelled , and ; even () was subsequently changed to , and () was changed to or . The Old English was also at length displaced by the French so that the Old English () and () became
Middle English Middle English (abbreviated to ME) is a form of the English language that was spoken after the Norman Conquest of 1066, until the late 15th century. The English language underwent distinct variations and developments following the Old English pe ...
and , respectively. The sound , to which Old English palatalized had advanced, also occurred in French, chiefly from Latin before . In French, it was represented by the digraph , as in (from Latin ), and this spelling was introduced into English: the Hatton Gospels, written , have in Matt. i-iii, , and , for the , and of the Old English version whence they were copied. In these cases, the Old English gave way to , and ; on the other hand, in its new value of appeared largely in French words like , and and was also substituted for in a few Old English words, as , in early Middle English . By the end of the thirteenth century, both in France and England, this sound was de-affricated to ; and from that time, has represented before front vowels either for etymological reasons, as in ''lance'', ''cent'', or to avoid the ambiguity due to the "etymological" use of for , as in ''ace'', ''mice'', ''once'', ''pence'', ''defence''. Thus, to show etymology, English spelling has ''advise'', ''devise'' (instead of ''*advize'', ''*devize''), while ''advice'', ''device'', ''dice'', ''ice'', ''mice'', ''twice'', etc., do not reflect etymology; example has extended this to ''hence'', ''pence'', ''defence'', etc., where there is no etymological reason for using . Former generations also wrote ''sence'' for ''sense''. Hence, today, the
Romance languages The Romance languages, also known as the Latin or Neo-Latin languages, are the languages that are Language family, directly descended from Vulgar Latin. They are the only extant subgroup of the Italic languages, Italic branch of the Indo-E ...
and English have a common feature inherited from Vulgar Latin spelling conventions where takes on either a "hard" or "soft" value depending on the following letter.


Use in writing systems


English

In
English orthography English orthography comprises the set of rules used when writing the English language, allowing readers and writers to associate written graphemes with the sounds of spoken English, as well as other features of the language. English's orthograp ...
, generally represents the "soft" value of before the letters (including the Latin-derived digraphs and , or the corresponding ligatures and ), , and , and a "hard" value of before any other letters or at the end of a word. However, there are a number of exceptions in English: "
soccer Association football, more commonly known as football or soccer, is a team sport played between two teams of 11 Football player, players who almost exclusively use their feet to propel a Ball (association football), ball around a rectangular f ...
", " celt" and " sceptic" are words that have where would be expected. The "soft" may represent the sound in the digraph when this precedes a vowel, as in the words 'delicious' and 'appreciate', and also in the word "ocean" and its derivatives. The digraph most commonly represents , but can also represent (mainly in words of Greek origin) or (mainly in words of French origin). For some dialects of English, it may also represent in words like ''loch'', while other speakers pronounce the final sound as . The trigraph always represents . The digraph is often used to represent the sound after short vowels, like in "wicket". C is the twelfth most frequently used letter in the English language (after E, T, A, O, I, N, S, H, R, D, and L), with a frequency of about 2.8% in words.


Other languages

In the Romance languages French, Spanish, Italian, Romanian, and Portuguese, generally has a "hard" value of and a "soft" value whose pronunciation varies by language. In French, Portuguese, Catalan, and Spanish from Latin America and some places in Spain, the soft value is as it is in English. In the Spanish spoken in most of Spain, the soft is a
voiceless dental fricative The voiceless dental non-sibilant fricative is a type of consonantal sound used in some spoken languages. It is familiar to most English speakers as the 'th' in ''think''. Though rather rare as a phoneme among the world's languages, it is encount ...
. In Italian and Romanian, the soft is . Germanic languages usually use for Romance loans or digraphs, such as and , but the rules vary across languages. Of all the Germanic languages, only English uses the initial in native Germanic words like ''come''. Other than English, Dutch uses the most, for most Romance loans and the digraph . German uses in the digraphs and , and the trigraph , but by itself only in unassimilated loanwords and proper names. Danish keeps soft in Romance words but changes hard to . Swedish has the same rules for soft and hard as Danish, and also uses in the digraph and the very common word ''och'', "and". Norwegian,
Afrikaans Afrikaans is a West Germanic languages, West Germanic language spoken in South Africa, Namibia and to a lesser extent Botswana, Zambia, Zimbabwe and also Argentina where there is a group in Sarmiento, Chubut, Sarmiento that speaks the Pat ...
, and Icelandic are the most restrictive, replacing all cases of with or , and reserving for unassimilated loanwords and names. All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages,
Esperanto Esperanto (, ) is the world's most widely spoken Constructed language, constructed international auxiliary language. Created by L. L. Zamenhof in 1887 to be 'the International Language' (), it is intended to be a universal second language for ...
, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it), use to represent , the voiceless alveolar or voiceless dental sibilant affricate. In Hanyu Pinyin, the standard romanization of
Mandarin Chinese Mandarin ( ; zh, s=, t=, p=Guānhuà, l=Mandarin (bureaucrat), officials' speech) is the largest branch of the Sinitic languages. Mandarin varieties are spoken by 70 percent of all Chinese speakers over a large geographical area that stretch ...
, the letter represents an aspirated version of this sound, . Among non-European languages that have adopted the Latin alphabet, represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of . In Azeri, Crimean Tatar, Kurmanji Kurdish, and Turkish, stands for the voiced counterpart of this sound, the voiced postalveolar affricate . In Yabem and similar languages, such as Bukawa, stands for a
glottal stop The glottal stop or glottal plosive is a type of consonantal sound used in many Speech communication, spoken languages, produced by obstructing airflow in the vocal tract or, more precisely, the glottis. The symbol in the International Phonetic ...
. Xhosa and Zulu use this letter to represent the click . In some other African languages, such as
Berber languages The Berber languages, also known as the Amazigh languages or Tamazight, are a branch of the Afroasiatic language family. They comprise a group of closely related but mostly mutually unintelligible languages spoken by Berbers, Berber communities, ...
, is used for . In Fijian, stands for a voiced dental fricative , while in Somali it has the value of . The letter is also used as a transliteration of Cyrillic in the Latin forms of Serbian, Macedonian, and sometimes Ukrainian, along with the digraph .


Other systems

As a
phonetic Phonetics is a branch of linguistics that studies how humans produce and perceive sounds or, in the case of sign languages, the equivalent aspects of sign. Linguists who specialize in studying the physical properties of speech are phoneticians ...
symbol, lowercase is the
International Phonetic Alphabet The International Phonetic Alphabet (IPA) is an alphabetic system of phonetic notation based primarily on the Latin script. It was devised by the International Phonetic Association in the late 19th century as a standard written representation ...
(IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital is the X-SAMPA symbol for the voiceless palatal fricative.


Digraphs

There are several common digraphs with , the most common being , which in some languages (such as German) is far more common than alone. takes various values in other languages. As in English, , with the value , is often used after short vowels in other
Germanic languages The Germanic languages are a branch of the Indo-European languages, Indo-European language family spoken natively by a population of about 515 million people mainly in Europe, North America, Oceania, and Southern Africa. The most widely spoke ...
such as German and Swedish (other Germanic languages, such as Dutch and Norwegian, use instead). The digraph is found in Polish and in Hungarian, representing and respectively. The digraph represents in Old English, Italian, and a few languages related to Italian (where this only happens before front vowels, while otherwise it represents ). The trigraph represents in German.


Other uses

* In the
hexadecimal Hexadecimal (also known as base-16 or simply hex) is a Numeral system#Positional systems in detail, positional numeral system that represents numbers using a radix (base) of sixteen. Unlike the decimal system representing numbers using ten symbo ...
(base 16) numbering system, C is a number that corresponds to the number 12 in decimal (base 10) counting. * In the
Roman numeral Roman numerals are a numeral system that originated in ancient Rome and remained the usual way of writing numbers throughout Europe well into the Late Middle Ages. Numbers are written with combinations of letters from the Latin alphabet, ea ...
system, C represents 100. * Unit prefix c, meaning one hundredth.


Related characters


Ancestors, descendants and siblings

*𐤂 : Semitic letter Gimel, from which the following symbols originally derive: ** : Greek letter Gamma, from which C derives ***G g : Latin letter G, which is derived from Latin C ****Ȝ ȝ : Latin letter Ȝ, which is derived from Latin G * Phonetic alphabet symbols related to C: ** : Small c with curl **ʗ : Stretched c **𝼏 : Stretched c with curl – Used by Douglas Beach for a nasal click in his phonetic description of Khoekhoe. **𝼝 : Small letter c with retroflex hook – Para-IPA version of the IPA retroflex tʂ. **ꟲ : Modifier letter capital c – Used to mark tone for the Chatino orthography in Oaxaca, Mexico; used as a generic transcription for a falling tone; also used in para- IPA notation. * : Modifier letter small c * : Modifier letter small c with curl *ᴄ : Small capital c is used in the Uralic Phonetic Alphabet. *Ꞔ ꞔ : C with palatal hook, used for writing
Mandarin Chinese Mandarin ( ; zh, s=, t=, p=Guānhuà, l=Mandarin (bureaucrat), officials' speech) is the largest branch of the Sinitic languages. Mandarin varieties are spoken by 70 percent of all Chinese speakers over a large geographical area that stretch ...
using the early draft version of
pinyin Hanyu Pinyin, or simply pinyin, officially the Chinese Phonetic Alphabet, is the most common romanization system for Standard Chinese. ''Hanyu'' () literally means 'Han Chinese, Han language'—that is, the Chinese language—while ''pinyin' ...
romanization during the mid-1950s. Add to C with diacritics: *C with
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
s: Ć ć Ĉ ĉ Č č Ċ ċ Ḉ ḉ Ƈ ƈ C̈ c̈ Ȼ ȼ Ç ç Ꞓ ꞓ *Ↄ ↄ : Claudian letters


Derived ligatures, abbreviations, signs and symbols

*© : copyright symbol *°C : degree Celsius *¢ : cent *₡ : colón (currency) *₢ : Brazilian cruzeiro (currency) *₵ : Ghana cedi (currency) *₠ : European Currency Unit CE *\mathbb : blackboard bold C, denoting the complex numbers *ℭ : blackletter C *Ꜿ ꜿ :
Medieval In the history of Europe, the Middle Ages or medieval period lasted approximately from the 5th to the late 15th centuries, similarly to the post-classical period of World history (field), global history. It began with the fall of the West ...
abbreviation for
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
syllables ''con''- and ''com''-, and Portuguese -''us'' and -''os''.


Other representations


Computing

The Latin letters and have
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
encodings and . These are the same
code point A code point, codepoint or code position is a particular position in a Table (database), table, where the position has been assigned a meaning. The table may be one dimensional (a column), two dimensional (like cells in a spreadsheet), three dime ...
s as those used in
ASCII ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
and
ISO 8859 ISO/IEC 8859 is a joint International Organization for Standardization, ISO and International Electrotechnical Commission, IEC series of standards for 8-bit character encodings. The series of standards consists of numbered parts, such as ISO/IEC ...
. There are also
precomposed character A precomposed character (alternatively composite character or decomposable character) is a Unicode entity that can also be defined as a sequence of one or more other characters. A precomposed character may typically represent a letter with a diac ...
encodings for and with diacritics, for most of those listed above; the remainder are produced using combining diacritics. Variant forms of the letter have unique code points for specialist use: the alphanumeric symbols set in mathematics and science, voiceless palatal sounds in linguistics, and halfwidth and fullwidth forms for legacy CJK font compatibility. The Cyrillic homoglyph of the Latin has a separate encoding: .


Other


See also

* Hard and soft C * Speed of light, ''c''


References


External links

* * * {{Latin script, C} ISO basic Latin letters