HOME

TheInfoList



OR:

A cedilla ( ; from Spanish) or cedille (from French , ) is a hook or tail ( ¸ ) added under certain letters as a diacritical mark to modify their pronunciation. In
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid ...
,
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
, and Portuguese (called cedilha) it is used only under the ''c'' (forming ''ç''), and the entire letter is called, respectively, (i.e. "broken C"), , and (or , colloquially). It is used to mark vowel nasalization in many languages of sub-Saharan Africa, including Vute from
Cameroon Cameroon (; french: Cameroun, ff, Kamerun), officially the Republic of Cameroon (french: République du Cameroun, links=no), is a country in west- central Africa. It is bordered by Nigeria to the west and north; Chad to the northeast; th ...
.


Origin

The tail originated in Spain as the bottom half of a miniature
cursive Cursive (also known as script, among other names) is any style of penmanship in which characters are written joined in a flowing manner, generally for the purpose of making writing faster, in contrast to block letters. It varies in functional ...
z. The word ''cedilla'' is the diminutive of the Old Spanish name for this letter, (). Modern Spanish and isolationist Galician no longer use this diacritic (apart from , the nickname of the FC Barcelona football team), although it is used in Reintegrationist Galician, Portuguese,
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid ...
, Occitan, and
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
, which gives English the alternative spellings of ''cedille'', from
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
"", and the Portuguese form . An obsolete spelling of ''cedilla'' is ''cerilla''. The earliest use in English cited by the ''
Oxford English Dictionary The ''Oxford English Dictionary'' (''OED'') is the first and foundational historical dictionary of the English language, published by Oxford University Press (OUP). It traces the historical development of the English language, providing a com ...
'' is a 1599 Spanish-English dictionary and grammar. Chambers' ''Cyclopædia''Chambers, Ephraim (1738) ''Cyclopædia; or, an universal dictionary of arts and sciences'' (2nd ed.) is cited for the printer-trade variant '' ceceril'' in use in 1738. The main use in English is not universal and applies to loan words from
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
and Portuguese such as ''
façade A façade () (also written facade) is generally the front part or exterior of a building. It is a loan word from the French (), which means 'frontage' or ' face'. In architecture, the façade of a building is often the most important aspect ...
'', '' limaçon'' and ''
cachaça ''Cachaça'' () is a distilled spirit made from fermented sugarcane juice. Also known as ''pinga'', ''caninha'', and other names, it is the most popular spirit among distilled alcoholic beverages in Brazil.Cavalcante, Messias Soares. Todos os n ...
'' (often typed ''facade'', ''limacon'' and ''cachaca'' because of lack of ''ç'' keys on English language keyboards). With the advent of
modernism Modernism is both a philosophical and arts movement that arose from broad transformations in Western society during the late 19th and early 20th centuries. The movement reflected a desire for the creation of new forms of art, philosophy, ...
, the calligraphic nature of the cedilla was thought somewhat jarring on sans-serif typefaces, and so some designers instead substituted a comma design, which could be made bolder and more compatible with the style of the text. This reduces the visual distinction between the cedilla and the diacritical comma.


C

The most frequent character with cedilla is "ç" ("c" with cedilla, as in ''façade''). It was first used for the sound of the voiceless alveolar affricate in old Spanish and stems from the Visigothic form of the letter "z" (ꝣ), whose upper loop was lengthened and reinterpreted as a "c", whereas its lower loop became the diminished appendage, the cedilla. It represents the "soft" sound , the voiceless alveolar sibilant, where a "c" would normally represent the "hard" sound (before "a", "o", "u", or at the end of a word) in English and in certain Romance languages such as
Catalan Catalan may refer to: Catalonia From, or related to Catalonia: * Catalan language, a Romance language * Catalans, an ethnic group formed by the people from, or with origins in, Northern or southern Catalonia Places * 13178 Catalan, asteroid ...
, Galician,
French French (french: français(e), link=no) may refer to: * Something of, from, or related to France ** French language, which originated in France, and its various dialects and accents ** French people, a nation and ethnic group identified with Franc ...
(where ç appears in the name of the language itself, '), Ligurian, Occitan, and Portuguese. In Occitan, Friulian and Catalan ''ç'' can also be found at the beginning of a word (', ') or at the end ('). It represents the voiceless postalveolar affricate (as in English "church") in Albanian,
Azerbaijani Azerbaijani may refer to: * Something of, or related to Azerbaijan * Azerbaijanis * Azerbaijani language See also * Azerbaijan (disambiguation) * Azeri (disambiguation) * Azerbaijani cuisine * Culture of Azerbaijan The culture of Azerbaijan ...
, Crimean Tatar, Friulian, Kurdish, Tatar,
Turkish Turkish may refer to: *a Turkic language spoken by the Turks * of or about Turkey ** Turkish language *** Turkish alphabet ** Turkish people, a Turkic ethnic group and nation *** Turkish citizen, a citizen of Turkey *** Turkish communities and mi ...
(as in ', ', ', '), and
Turkmen Turkmen, Türkmen, Turkoman, or Turkman may refer to: Peoples Historical ethnonym * Turkoman (ethnonym), ethnonym used for the Oghuz Turks during the Middle Ages Ethnic groups * Turkmen in Anatolia and the Levant (Seljuk and Ottoman-Turkish desc ...
. It is also sometimes used this way in
Manx Manx (; formerly sometimes spelled Manks) is an adjective (and derived noun) describing things or people related to the Isle of Man: * Manx people **Manx surnames * Isle of Man It may also refer to: Languages * Manx language, also known as Manx ...
, to distinguish it from the velar fricative. In the International Phonetic Alphabet, ⟨ç⟩ represents the voiceless palatal fricative.


S

The character "ş" represents the voiceless postalveolar fricative (as in "show") in several languages, including many belonging to the
Turkic languages The Turkic languages are a language family of over 35 documented languages, spoken by the Turkic peoples of Eurasia from Eastern Europe and Southern Europe to Central Asia, East Asia, North Asia (Siberia), and Western Asia. The Turkic l ...
, and included as a separate letter in their alphabets: *
Turkish Turkish may refer to: *a Turkic language spoken by the Turks * of or about Turkey ** Turkish language *** Turkish alphabet ** Turkish people, a Turkic ethnic group and nation *** Turkish citizen, a citizen of Turkey *** Turkish communities and mi ...
*
Azerbaijani Azerbaijani may refer to: * Something of, or related to Azerbaijan * Azerbaijanis * Azerbaijani language See also * Azerbaijan (disambiguation) * Azeri (disambiguation) * Azerbaijani cuisine * Culture of Azerbaijan The culture of Azerbaijan ...
* Crimean Tatar * Gagauz * Tatar *
Turkmen Turkmen, Türkmen, Turkoman, or Turkman may refer to: Peoples Historical ethnonym * Turkoman (ethnonym), ethnonym used for the Oghuz Turks during the Middle Ages Ethnic groups * Turkmen in Anatolia and the Levant (Seljuk and Ottoman-Turkish desc ...
* Romanian (substitution use when
S-comma S-comma (majuscule: Ș, minuscule: ș) is a letter which is part of the Romanian alphabet, used to represent the sound , the voiceless postalveolar fricative (like ''sh'' in ''shoe''). History The letter was proposed in the ''Buda Lexicon'' ...
was missing from pre-3.0
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
standards, and older standards, still frequent, but an error) * Kurdish In
HTML character entity reference In SGML, HTML and XML documents, the logical constructs known as ''character data'' and ''attribute values'' consist of sequences of characters, in which each character can manifest directly (representing itself), or can be represented by a serie ...
s Ş and ş can be used.


T

Gagauz uses Ţ (T with cedilla), one of the few languages to do so, and Ş (S with cedilla). Besides being present in some Gagauz orthographies, T with Cedilla also exists in the General Alphabet of Cameroon Languages, in the Kabyle language, in the Manjak and Mankanya languages, and possibly elsewhere. The Unicode characters for Ţ (T with cedilla) and Ş (S with cedilla) were implemented for Romanian in Windows-1250. In Windows 7, Microsoft corrected the error by replacing T-cedilla with T-comma (Ț) and S-cedilla with S-comma (Ș). In 1868, Ambroise Firmin-Didot suggested in his book ' (Observations on French Spelling) that French phonetics could be better regularized by adding a cedilla beneath the letter "t" in some words. For example, the suffix ' this letter is usually not pronounced as (or close to) in French, but as . It has to be distinctly learned that in words such as ' (but not ') it is pronounced . A similar effect occurs with other prefixes or within words. Firmin-Didot surmised that a new character could be added to French orthography. A letter of the same description T-cedilla (majuscule: Ţ, minuscule: ţ) is used in Gagauz. A similar letter, the T-comma (majuscule: Ț, minuscule: ț), does exist in Romanian, but it has a comma accent, not a cedilla.


Languages with other characters with cedillas


Latvian

Comparatively, some consider the diacritics on the palatalized Latvian consonants and formerly to be cedillas. Although their Adobe
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
names are commas, their names in the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
Standard are "g", "k", "l", "n", and "r" with a cedilla. The letters were introduced to the
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
standard before 1992, and their names cannot be altered. The uppercase equivalent "Ģ" sometimes has a regular cedilla.


Marshallese

In
Marshallese orthography The Marshallese language ( mh, Kajin M̧ajeļ, link=no or ), also known as Ebon, is a Micronesian language spoken in the Marshall Islands. Spoken by the ethnic Marshallese people, the language is spoken by nearly the country's entire populatio ...
, four letters in Marshallese have cedillas: < >. In standard printed text they are ''always'' cedillas, and their omission or the substitution of
comma below The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
and dot below diacritics are nonstandard. , many font rendering engines do not display ''any'' of these properly, for two reasons: * "" and "" usually do not display properly at all, because of the use of the cedilla in Latvian. Unicode has precombined glyphs for these letters, but most quality fonts display them with comma below diacritics to accommodate the expectations of
Latvian orthography Latvian may refer to: *Something of, from, or related to Latvia **Latvians, a Baltic ethnic group, native to what is modern-day Latvia and the immediate geographical region **Latvian language, also referred to as Lettish **Latvian cuisine **Latvi ...
. This is considered nonstandard in Marshallese. The use of a zero-width non-joiner between the letter and the diacritic can alleviate this problem: "" and "" may display properly, but may not; see below. * "" and "" do not currently exist in Unicode as precombined glyphs, and must be encoded as the plain Latin letters "" and "" with the combining cedilla diacritic. Most Unicode fonts issued with
Windows Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for ...
do not display combining diacritics properly, showing them too far to the right of the letter, as with Tahoma ("" and "") and Times New Roman ("" and ""). This mostly affects "", and may or may not affect "". But some common Unicode fonts like Arial Unicode MS ("" and ""), Cambria ("" and "") and Lucida Sans Unicode ("" and "") do not have this problem. When "" is properly displayed, the cedilla is either underneath the center of the letter, or is underneath the right-most leg of the letter, but is always directly underneath the letter wherever it is positioned. Because of these font display issues, it is not uncommon to find nonstandard ''ad hoc'' substitutes for these letters. The online version of the Marshallese-English Dictionary (the only complete Marshallese dictionary in existence) displays the letters with dot below diacritics, all of which do exist as precombined glyphs in Unicode: "", "", "" and "". The first three exist in the International Alphabet of Sanskrit Transliteration, and "" exists in the
Vietnamese alphabet The Vietnamese alphabet ( vi, chữ Quốc ngữ, lit=script of the National language) is the modern Latin writing script or writing system for Vietnamese. It uses the Latin script based on Romance languages originally developed by Portuguese m ...
, and both of these systems are supported by the most recent versions of common fonts like Arial, Courier New, Tahoma and Times New Roman. This sidesteps most of the Marshallese text display issues associated with the cedilla, but is still inappropriate for polished standard text.


Vute

Vute, a Mambiloid language from
Cameroon Cameroon (; french: Cameroun, ff, Kamerun), officially the Republic of Cameroon (french: République du Cameroun, links=no), is a country in west- central Africa. It is bordered by Nigeria to the west and north; Chad to the northeast; th ...
, uses cedilla for the nasalization of all vowel qualities (cf. the ogonek used in Polish and Navajo for the same purpose). This includes unconventional roman letters that are formalized from the IPA into the official writing system. These include <''i̧ ȩ ɨ̧ ə̧ a̧ u̧ o̧ ɔ̧>.''


Hebrew

The ISO 259 romanization of Biblical Hebrew uses Ȩ (E with cedilla) and Ḝ (E with cedilla and breve).


Letters with cedilla


Similar diacritics

Languages such as Romanian add a comma (virgula) to some letters, such as ', which looks like a cedilla, but is more precisely a diacritical comma. This is particularly confusing with letters which can take either diacritic: for example, the consonant is written as "ş" in
Turkish Turkish may refer to: *a Turkic language spoken by the Turks * of or about Turkey ** Turkish language *** Turkish alphabet ** Turkish people, a Turkic ethnic group and nation *** Turkish citizen, a citizen of Turkey *** Turkish communities and mi ...
but "ș" in Romanian, and Romanian writers will sometimes use the former instead of the latter because of insufficient computer knowledge. The Polish letters and and
Lithuanian Lithuanian may refer to: * Lithuanians * Lithuanian language * The country of Lithuania * Grand Duchy of Lithuania * Culture of Lithuania * Lithuanian cuisine * Lithuanian Jews as often called "Lithuanians" (''Lita'im'' or ''Litvaks'') by other Jew ...
letters and are not made with the cedilla either, but with the unrelated ogonek diacritic.


Encodings

Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
provides
precomposed character A precomposed character (alternatively composite character or decomposable character) is a Unicode entity that can also be defined as a sequence of one or more other characters. A precomposed character may typically represent a letter with a diacr ...
s for some Latin letters with cedillas. Others can be formed using the cedilla combining character.


References


External links


ScriptSource—Positioning the traditional cedilla

Diacritics Project—All you need to design a font with correct accents


Learn how to make world language accent marks and other diacriticals on a computer {{Latin script, , cedilla Latin-script diacritics Turkish language