Terminology
The terms ''upper case'' and ''lower case'' may be written as two consecutive words, connected with a hyphen (''upper-case'' and ''lower-case''particularly if they Compound modifier, pre-modify another noun), or as a single word (''uppercase'' and ''lowercase''). These terms originated from the common layouts of the shallow Drawer (furniture), drawers called ''type cases'' used to hold the movable type for letterpress printing. Traditionally, the capital letters were stored in a separate shallow tray or "case" that was located above the case that held the small letters. ''Majuscule'' (, less commonly ), for Paleography, palaeographers, is technically any script whose letters have very few or very short ascenders and descenders, or none at all (for example, the majuscule scripts used in the Codex Vaticanus Graecus 1209, or the Book of Kells). By virtue of their visual impact, this made the term majuscule an apt descriptor for what much later came to be more commonly referred to as uppercase letters. ''Minuscule'' refers to lower-case letters. The word is often spelled ''miniscule'', by association with the unrelated word ''miniature'' and the prefix ''mini-''. This has traditionally been regarded as a spelling mistake (since ''minuscule'' is derived from the word ''minus''), but is now so common that some dictionaries tend to accept it as a nonstandard or variant spelling. ''Miniscule'' is still less likely, however, to be used in reference to lower-case letters.Typographical considerations
The glyphs of lowercase letters can resemble smaller forms of the uppercase glyphs restricted to the base band (e.g. "C/c" and "S/s", cf. small caps) or can look hardly related (e.g. "D/d" and "G/g"). Here is a comparison of the upper and lower case variants of each letter included in the English alphabet (the exact representation will vary according to the typeface and font used): Typography, Typographically, the basic difference between the majuscules and minuscules is not that the majuscules are big and minuscules small, but that the majuscules generally have the same height (although, depending on the typeface, there may be some exceptions, particularly with ''Q'' and sometimes ''J'' having a descending element; also, various diacritics can add to the normal height of a letter).Bicameral script
Capitalisation
Capitalisation is the writing of a word with its first letter (alphabet), letter in uppercase and the remaining letters in lowercase. Capitalisation rules vary by language and are often quite complex, but in most modern languages that have capitalisation, the first word of every Sentence (linguistics), sentence is capitalised, as are all proper nouns. Capitalisation in English, in terms of the general orthographic rules independent of context (e.g. title vs. heading vs. text), is universally standardised for formality, formal writing. Capital letters are used as the first letter of a sentence, a proper noun, or a proper adjective. The names of the days of the week and the names of the months are also capitalised, as are the first-person pronoun "I" and sometimes the interjection "O" (although the latter is uncommon in modern usage, with "oh" being preferred). There are a few pairs of words of different meanings whose list of case sensitive English words, only difference is capitalisation of the first letter. Honorifics and personal titles showing rank or prestige are capitalised when used together with the name of the person (for example, "Mr. Smith", "Bishop O'Brien", "Professor Moore") or as a direct address, but normally not when used alone and in a more general sense. It can also be seen as customary to capitalise any word in some contexts even a pronoun referring to the deity of a Monotheism, monotheistic religion. Other words normally start with a lower-case letter. There are, however, situations where further capitalisation may be used to give added emphasis, for example in headings and publication titles (see below). In some traditional forms of poetry, capitalisation has conventionally been used as a marker to indicate the beginning of a Line (poetry), line of verse independent of any grammatical feature. In political writing, parody and satire, the unexpected emphasis afforded by otherwise ill-advised capitalisation is often used to great stylistic effect, such as in the case of George Orwell's Big_Brother_(Nineteen_Eighty-Four), Big Brother. Other languages vary in their use of capitals. For example, in German language, German all nouns are capitalised (this was previously common in English as well, mainly in the 17th and 18th centuries), while in Romance languages, Romance and most other European languages the names of the days of the week, the names of the months, and adjectives of nationality, religion, and so on normally begin with a lower-case letter. On the other hand, in some languages it is customary to capitalise T–V distinction, formal polite pronouns, for example ''De'', ''Dem'' (Danish language, Danish), ''Sie'', ''Ihnen'' (German), and ''Vd'' or ''Ud'' (short for ''usted'' in Spanish language, Spanish). Informal communication, such as texting, instant messaging or a handwritten sticky note, may not bother to follow the conventions concerning capitalisation, but that is because its users usually do not expect it to be formal.Exceptional letters and digraphs
* The German letter "ß" formerly existed only in lower case. The orthographical capitalisation does not concern "ß", which generally does not occur at the beginning of a word, and in the all-caps style it has traditionally been replaced by the Digraph (orthography), digraph "SS". Since June 2017, however, capital ẞ is accepted as an alternative in the all-caps style. * The Greek upper-case letter "Sigma, Σ" has two different lower-case forms: "ς" in word-final position and "σ" elsewhere. In a similar manner, the Latin upper-case letter "S" used to have two different lower-case forms: "s" in word-final position and " ſ " elsewhere. The latter form, called the long s, fell out of general use before the middle of the 19th century, except for the countries that continued to use blackletter typefaces such as Fraktur. When blackletter type fell out of general use in the mid-20th century, even those countries dropped the long s. * There is a complicated treatment of the Greek Iota subscript. * Unlike most languages that use Latin-script and link the dotless upper-case "I" with the dotted lower-case "i", Turkish language has both a dotted and dotless I, each in both upper and lower case. Each of the two pairs ("İ/i" and "I/ı") represents a distinctive phoneme. * In some languages, specific digraphs may be regarded as single letters, and in Dutch language, Dutch, the digraph "IJ (digraph), IJ/ij" is even capitalised with both components written in uppercase (for example, "IJsland" rather than "Ijsland"). In other languages, such as Welsh language, Welsh and Hungarian language, Hungarian, various digraphs are regarded as single letters for collation purposes, but the second component of the digraph will still be written in lower case even if the first component is capitalised. Similarly, in South Slavic languages whose orthography is coordinated between the Cyrillic and Latin scripts, the Latin digraphs "Lj (letter), Lj/lj", "Nj (digraph), Nj/nj" and "Dž (digraph), Dž/dž" are each regarded as a single letter (like their Cyrillic equivalents "Lje, Љ/љ", "Nje, Њ/њ" and "Dzhe, Џ/џ", respectively), but only in all-caps style should both components be in upper case (e.g. Ljiljan–LJILJAN, Njonja–NJONJA, Džidža–DŽIDŽA). Unicode designates a Digraph (orthography)#In Unicode, single character for each case variant (i.e., upper case, title case and lower case) of the three digraphs. * In the Hawaiian language, Hawaiian orthography, the ʻOkina, okina is a phonemic symbol that visually resembles a left single quotation mark. Representing the glottal stop, the okina can be characterised as either a letter or a diacritic. As a unicase letter, the okina is unaffected by capitalisation; it is the following letter that is capitalised instead. According to the Unicode standard, the okina is formally encoded as , but it is not uncommon to substitute this with a similar punctuation character, such as the left single quotation mark or an apostrophe.Related phenomena
Similar orthographic and graphostylistic conventions are used for emphasis or following language-specific or other rules, including: * Font effects such as italic type or oblique type, boldface, and choice of serif vs. sans-serif. * Typographical conventions in mathematical formulae include the Greek letters used in mathematics, science, and engineering, use of Greek letters and the Latin letters used in mathematics, use of Latin letters with special formatting such as blackboard bold and blackletter. * Some letters of the Arabic alphabet, Arabic and Hebrew alphabets and some Hangul consonant and vowel tables, jamo of the Korean hangul have different forms depending on placement within a word, but these rules are strict and the different forms cannot be used for emphasis. **In the Arabic and Arabic-based alphabets, letters in a word are connected, except for several that cannot connect to the following letter. Letters may have distinct forms depending on whether they are initial (connected only to the following letter), medial (connected to both neighboring letters), final (connected only to the preceding letter), or isolated (connected to neither a preceding nor a following letter). **In the Hebrew alphabet, five letters have a distinct form (see Final form) that is used when they are word-final. * In Georgian alphabet, Georgian, some authors use isolated letters from the ancient Asomtavruli alphabet within a text otherwise written in the modern Georgian alphabet#Mkhedruli, Mkhedruli in a fashion that is reminiscent of the usage of upper-case letters in the Latin, Greek, and Cyrillic alphabets. * In the Japanese writing system, an author has the option of switching between kanji, hiragana, katakana, and rōmaji. In particular, every hiragana character has an equivalent katakana character, and vice versa. Romanised Japanese sometimes uses lowercase letters to represent words that would be written in hiragana, and uppercase letters to represent words that would be written in katakana. Some kana characters are written in smaller type when they modify or combine with the preceding sign (''yōon'') or the following sign (''sokuon'').Stylistic or specialised usage
Headings and publication titles
In English-language publications, various conventions are used for the capitalisation of words in Title (publishing), publication titles and headlines, including chapter and section headings. The rules differ substantially between individual house styles. The convention followed by many British publishers (including scientific publishers, like ''Nature (journal), Nature'', magazines, like ''The Economist'' and ''New Scientist'', and newspapers, like ''The Guardian'' and ''The Times'') and also U.S. newspapers, is sentence-style capitalisation in headlines, i.e. capitalisation follows the same rules that apply for sentences. This convention is usually called ''sentence case''. It may also be applied to publication titles, especially in bibliographic references and library catalogues. An example of a global publisher whose English-language house style prescribes sentence-case titles and headings is the International Organization for Standardization (ISO). For publication titles it is, however, a common typographic practice among both British and U.S. publishers to capitalise significant words (and in the United States, this is often applied to headings, too). This family of typographic conventions is usually called ''title case''. For example, R. M. Ritter's ''Oxford Manual of Style'' (2002) suggests capitalising "the first word and all nouns, pronouns, adjectives, verbs and adverbs, but generally not articles, conjunctions and short prepositions". This is an old form of Emphasis (typography), emphasis, similar to the more modern practice of using a larger or boldface font for titles. The rules which prescribe which words to capitalise are not based on any grammatically inherent correct–incorrect distinction and are not universally standardised; they differ between style guides, although most style guides tend to follow a few strong conventions, as follows: * Most styles capitalise all words except for short Closed class, closed-class words (certain Part of speech, parts of speech, namely, articles, prepositions, and conjunctions); but the first word (always) and last word (in many styles) are also capitalised, regardless of their part of speech. Many styles capitalise longer prepositions such as "between" and "throughout", but not shorter ones such as "for" and "with". Typically, a preposition is considered short if it has up to three or four letters. * A few styles capitalise all words in title case (the so-called ''start case''), which has the advantage of being easy to implement and hard to get "wrong" (that is, "not edited to style"). Because of this rule's simplicity, software #Case folding, case-folding routines can handle 95% or more of the editing, especially if they are programmed for desired exceptions (such as "FBI" rather than "Fbi"). * As for whether hyphenated words are capitalised not only at the beginning but also after the hyphen, there is no universal standard; variation occurs Linguistic description, in the wild and among house styles (e.g., "The Letter-''C''ase Rule in My Book"; "Short-''t''erm Follow-''u''p Care for Burns"). Traditional copyediting makes a distinction between ''temporary compounds'' (such as many Nonce word, nonce [novel instance] compound modifiers), in which every part of the hyphenated word is capitalised (e.g. "How This Particular Author Chose to Style His ''A''utumn-''A''pple-''P''icking Heading"), and ''permanent compounds'', which are terms that, although compound and hyphenated, are so well established that dictionaries enter them as headwords (e.g., "Short-''t''erm Follow-''u''p Care for Burns"). Title case is widely used in many English-language publications, especially in the United States. However, its conventions are sometimes not followed strictly especially in informal writing. In creative typography, such as music record covers and other artistic material, all styles are commonly encountered, including all-lowercase letters and special case styles, such as studly caps (see below). For example, in the wordmarks of video games it is not uncommon to use stylised upper-case letters at the beginning and end of a title, with the intermediate letters in small caps or lower case (e.g., Arcania: Gothic 4, , ARMA: Armed Assault, , and DmC: Devil May Cry, DmC).Multi-word proper nouns
Single-word proper nouns are capitalised in formal written English, unless the name is intentionally stylised to break this rule (such as the first or last name of danah boyd). Multi-word proper nouns include names of organisations, publications, and people. Often the rules for "title case" (described in the previous section) are applied to these names, so that non-initial articles, conjunctions, and short prepositions are lowercase, and all other words are uppercase. For example, the short preposition "of" and the article "the" are lowercase in "Steering Committee of the Finance Department". Usually only capitalised words are used to form an acronym variant of the name, though there is some variation in this. With personal names, this practice can vary (sometimes all words are capitalised, regardless of length or function), but is not limited to English names. Examples include the English names Tamar of Georgia and Catherine the Great, "van (Dutch), van" and "der" in Dutch names, "von" and "zu" in German name, German, "de", "los", and "y" in Spanish naming customs, Spanish names, "de" or "d'" in French names, and "ibn" in Arabic names. Some surname prefixes also affect the capitalisation of the following internal letter or word, for example "Mac" in Surname#Celtic compound names, Celtic names and "Al" in Arabic names.Unit symbols and prefixes in the metric system
Use within programming languages
Some case styles are not used in standard English, but are common in computer programming, product branding, or other specialised fields.Camel case
Camel case: "theQuickBrownFoxJumpsOverTheLazyDog" or "TheQuickBrownFoxJumpsOverTheLazyDog"Snake case
Snake case: "the_quick_brown_fox_jumps_over_the_lazy_dog"Kebab case
Kebab case: "the-quick-brown-fox-jumps-over-the-lazy-dog"Studly caps
Studly caps: e.g. "tHeqUicKBrOWnFoXJUmpsoVeRThElAzydOG"Case folding and case conversion
In the character sets developed for computing, each upper- and lower-case letter is encoded as a separate character. In order to enable case folding and case conversion, the software needs to link together the two characters representing the case variants of a letter. (Some old character-encoding systems, such as the Baudot code, are restricted to one set of letters, usually represented by the upper-case variants.) Case sensitivity, Case-insensitive operations can be said to fold case, from the idea of folding the character code table so that upper- and lower-case letters coincide. The conversion of letter case in a String (computer science), string is common practice in computer applications, for instance to make case-insensitive comparisons. Many high-level programming languages provide simple methods for case conversion, at least for the ASCII character set. Whether or not the case variants are treated as equivalent to each other varies depending on the computer system and context. For example, user passwords are generally case sensitive in order to allow more diversity and make them more difficult to break. In contrast, case is often ignored in keyword searches in order to ignore insignificant variations in keyword capitalisation both in queries and queried material.Unicode case folding and script identification
Unicode defines case folding through the three case-mapping properties of each Character (computing), character: upper case, lower case, and title case (in this context, "title case" relates to Typographic ligature, ligatures and Digraph (orthography), digraphs encoded as mixed-case Digraph (orthography)#In Unicode, single characters, in which the first component is in upper case and the second component in lower case). These properties relate all characters in scripts with differing cases to the other case variants of the character. As briefly discussed in Unicode Technical Note #26, "In terms of implementation issues, any attempt at a unification of Latin, Greek, and Cyrillic would wreak havoc [and] make casing operations an unholy mess, in effect making all casing operations context sensitive […]". In other words, while the shapes of letters like A, B, E, H, K, M, O, P, T, X, Y and so on are shared between the Latin, Greek, and Cyrillic alphabets (and small differences in their canonical forms may be considered to be of a merely Typography, typographical nature), it would still be problematic for a multilingual character set or a font to provide only a ''single'' code point for, say, uppercase letter B, as this would make it quite difficult for a wordprocessor to change that single uppercase letter to one of the three different choices for the lower-case letter, the Latin b (U+0062), Greek β (U+03B2) or Cyrillic в (U+0432). Therefore, the corresponding Latin, Greek and Cyrillic upper-case letters (U+0042, U+0392 and U+0412, respectively) are also encoded as separate characters, despite their appearance being basically identical. Without letter case, a "unified European alphabet"such as ABБCГDΔΕЄЗFΦGHIИJ...Z, with an appropriate subset for each languageis feasible; but considering letter case, it becomes very clear that these alphabets are rather distinct sets of symbols.Methods in word processing
Most modern word processors provide automated case conversion with a simple click or keystroke. For example, in Microsoft Office Word, there is a dialog box for toggling the selected text through UPPERCASE, then lowercase, then Title Case (actually start caps; exception words must be lowercased individually). The keystroke does the same thing.Methods in programming
In some forms of BASIC there are two methods for case conversion:History
Originally alphabets were written entirely in majuscule letters, spaced between well-defined upper and lower bounds. When written quickly with a pen, these tended to turn into rounder and much simpler forms. It is from these that the first minuscule hands developed, the half-uncials and cursive minuscule, which no longer stayed bound between a pair of lines. These in turn formed the foundations for the Carolingian minuscule script, developed by Alcuin for use in the court of Charlemagne, which quickly spread across Europe. The advantage of the minuscule over majuscule was improved, faster readability. In Latin, Papyrus, papyri from Herculaneum dating before 79 CE (when it was destroyed) have been found that have been written in old Roman cursive, where the early forms of minuscule letters "d", "h" and "r", for example, can already be recognised. According to papyrologist Knut Kleve, "The theory, then, that the lower-case letters have been developed from the fifth century uncials and the ninth century Carolingian minuscules seems to be wrong." Both majuscule and minuscule letters existed, but the difference between the two variants was initially stylistic rather than orthographic and the writing system was still basically unicameral: a given handwritten document could use either one style or the other but these were not mixed. European languages, except for Ancient Greek and Latin, did not make the case distinction before about 1300. The timeline of writing in Western Europe can be divided into four eras: *Greek majuscule (9th–3rd century BCE) in contrast to the Greek uncial script (3rd century BCE – 12th century CE) and the later Greek minuscule *Roman square capitals, Roman majuscule (7th century BCE – 4th century CE) in contrast to the Roman uncial (4th–8th century CE), Roman half uncial, and minuscule *Carolingian majuscule (4th–8th century CE) in contrast to the Carolingian minuscule (around 780 – 12th century) *Blackletter, Gothic majuscule (13th and 14th century), in contrast to the early Gothic (end of 11th to 13th century), Gothic (14th century), and late Gothic (16th century) minuscules. Traditionally, certain letters were rendered differently according to a set of rules. In particular, those letters that began sentences or nouns were made larger and often written in a distinct script. There was no fixed capitalisation system until the early 18th century. The English language eventually dropped the rule for nouns, while the German language keeps it. Similar developments have taken place in other alphabets. The lower-case script for the Greek alphabet has its origins in the 7th century and acquired its quadrilinear form (that is, characterised by ascenders and descenders) in the 8th century. Over time, uncial letter forms were increasingly mixed into the script. The earliest dated Greek lower-case text is the Uspenski Gospels (MS 461) in the year 835.The earliest known biblical manuscript is a palimpsest of Isajah in Syriac, written in 459/460. Bruce M. Metzger & Bart D. Ehrman, ''The Text of the New Testament'' (Oxford University Press: 2005), p. 92. The modern practice of capitalising the first letter of every sentence seems to be imported (and is rarely used when printing Ancient Greek materials even today).Type cases
The individual type blocks used in hand typesetting are stored in shallow wooden or metal drawers known as "type cases". Each is subdivided into a number of compartments ("boxes") for the storage of different individual letters. The ''Oxford Universal Dictionary on Historical Advanced Proportional Principles'' (reprinted 1952) indicates that ''case'' in this sense (referring to the box or frame used by a compositor in the printing trade) was first used in English in 1588. Originally one large case was used for each typeface, then "divided cases", pairs of cases for majuscules and minuscules, were introduced in the region of today's Belgium by 1563, England by 1588, and France before 1723. The terms ''upper'' and ''lower'' case originate from this division. By convention, when the two cases were taken out of the storage rack and placed on a rack on the Compositor (typesetting), compositor's desk, the case containing the capitals and small capitals stood at a steeper angle at the back of the desk, with the case for the small letters, punctuation, and spaces being more easily reached at a shallower angle below it to the front of the desk, hence upper and lower case. Though pairs of cases were used in English-speaking countries and many European countries in the seventeenth century, in Germany and Scandinavia the single case continued in use. Various patterns of cases are available, often with the compartments for lower-case letters varying in size according to the frequency of use of letters, so that the commonest letters are grouped together in larger boxes at the centre of the case. The compositor takes the letter blocks from the compartments and places them in a composing stick, working from left to right and placing the letters upside down with the nick to the top, then sets the assembled type in a Galley proof, galley.See also
* All caps * CamelCase * Capitalisation * Drop cap * Punctuation * Roman cursive * Roman square capitals * Shift key * Small caps * Studly caps * Text figures * UnicaseNotes
References
External links
* * One of this paper's conclusions: all-caps is harder to read. * {{Typography terms Alphabets Capitalization Orthography Typography Articles with example C code