Typographic approximation
   HOME

TheInfoList



OR:

A typographic approximation is a replacement of an element of the
writing system A writing system is a method of visually representing verbal communication, based on a script and a set of rules regulating its use. While both writing and speech are useful in conveying messages, writing differs in also being a reliable fo ...
(usually a
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
) with another glyph or glyphs. The replacement may be a nearly homographic character, a digraph, or a character string. An approximation is different from a
typographical error A typographical error (often shortened to typo), also called a misprint, is a mistake (such as a spelling mistake) made in the typing of printed (or electronic) material. Historically, this referred to mistakes in manual type-setting (typography) ...
in that an approximation is intentional and aims to preserve the visual appearance of the original. The concept of approximation also applies to the
World Wide Web The World Wide Web (WWW), commonly known as the Web, is an information system enabling documents and other web resources to be accessed over the Internet. Documents and downloadable media are made available to the network through web ...
and other forms of textual information available via digital media, though usually at the level of
characters Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
, not glyphs. Historically, the main cause of typographic approximation was a low quantity of glyphs (such as
letterform A letterform, letter-form or letter form, is a term used especially in typography, palaeography, calligraphy and epigraphy to mean a letter's shape. A letterform is a type of glyph, which is a specific, concrete way of writing an abstract c ...
s and
symbol A symbol is a mark, sign, or word that indicates, signifies, or is understood as representing an idea, object, or relationship. Symbols allow people to go beyond what is known or seen by creating linkages between otherwise very different conc ...
s) available for printing. In the age of World Wide Web and
digital typesetting Typesetting is the composition of text by means of arranging physical ''type'' (or ''sort'') in mechanical systems or ''glyphs'' in digital systems representing ''characters'' (letters and other symbols).Dictionary.com Unabridged. Random H ...
, especially after the advent of
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard, ...
and enormous amount of
computer font A computer font is implemented as a digital data file containing a set of graphically related glyphs. A computer font is designed and created using a font editor. A computer font specifically designed for the computer screen, and not for print ...
s, typographic approximations are usually caused either by low ability of humans to distinguish and find needed symbols or by inadequate replacement patterns in
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Early word processors were stand-alone devices dedicated to the function, but current ...
s, rather than by lack of available characters.


Typewriter and line printer approximations


Merger of characters

On
typewriter A typewriter is a mechanical or electromechanical machine for typing characters. Typically, a typewriter has an array of keys, and each one causes a different single character to be produced on paper by striking an inked ribbon selectivel ...
, several characters were merged due to limited size of glyph repertoire. Several modern computing
characters Character or Characters may refer to: Arts, entertainment, and media Literature * ''Character'' (novel), a 1936 Dutch novel by Ferdinand Bordewijk * ''Characters'' (Theophrastus), a classical Greek set of character sketches attributed to The ...
appeared by merger of different symbols, such as the "typewriter"
apostrophe The apostrophe ( or ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes: * The marking of the omission of one ...
, ', which can denote an apostrophe proper, ’, a single
quotation mark Quotation marks (also known as quotes, quote marks, speech marks, inverted commas, or talking marks) are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase. The pair consists of an ...
, or the
prime symbol The prime symbol , double prime symbol , triple prime symbol , and quadruple prime symbol are used to designate units and for other purposes in mathematics, science, linguistics and music. Although the characters differ little in appearance fr ...
.


Non-spacing modifiers

Some typewriters have ''non-spacing'' keys for use as
diacritic A diacritic (also diacritical mark, diacritical point, diacritical sign, or accent) is a glyph added to a letter or to a basic glyph. The term derives from the Ancient Greek (, "distinguishing"), from (, "to distinguish"). The word ''diacrit ...
al marks. After the typist pushes, say, acute accent ◌́ the caret does not move. This allows the typist to overstrike this mark by a spacing letter, say, e and obtain é, an accented letter. Due to geometrical restrictions of a
monospaced font A monospaced font, also called a fixed-pitch, fixed-width, or non-proportional font, is a font whose letters and characters each occupy the same amount of horizontal space. This contrasts with variable-width fonts, where the letters and spaci ...
, the result could not always be perfect. For example, overstriking unlikely was a feasible method to produce
uppercase Letter case is the distinction between the letters that are in larger uppercase or capitals (or more formally ''majuscule'') and smaller lowercase (or more formally ''minuscule'') in the written representation of certain languages. The writing ...
accented letters, such as É. Overstrike was used on
line printer A line printer prints one entire line of text before advancing to another line. Most early line printers were impact printers. Line printers are mostly associated with unit record equipment and the early days of digital computing, but the ...
s for the same function. This contributed to standardization of such characters as . Overstrike of the same letter was used to simulate
boldface In typography, emphasis is the strengthening of words in a text with a font in a different style from the rest of the text, to highlight them. It is the equivalent of prosody stress in speech. Methods and use The most common methods in W ...
letters on line printers.


ASCII approximations

The
US-ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
character set and other variants of
ISO/IEC 646 ISO/IEC 646 is a set of ISO/IEC standards, described as ''Information technology — ISO 7-bit coded character set for information interchange'' and developed in cooperation with ASCII at least since 1964. Since its first edition in ...
contains 95  graphic characters. It is comparable with a (Latin script) typewriter and insufficient for a quality
typography Typography is the art and technique of arranging type to make written language legible, readable and appealing when displayed. The arrangement of type involves selecting typefaces, point sizes, line lengths, line-spacing ( leading), an ...
. But high availability and robustness of ASCII character encoding prompted computer users to invent ASCII substitutes for various glyphs. The following ASCII characters are used to approximate certain characters. Note that there are many Latin letters that are homographic to letters of other scripts, however those Latin letters are not listed below. * (space): alignment and justification. * : various type of
double quotes Quotation marks (also known as quotes, quote marks, speech marks, inverted commas, or talking marks) are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase. The pair consists of an ...
, double prime  . * : sharp symbol  . * : various type of single quotes,
apostrophe The apostrophe ( or ) is a punctuation mark, and sometimes a diacritical mark, in languages that use the Latin alphabet and some other alphabets. In English, the apostrophe is used for two basic purposes: * The marking of the omission of one ...
 ’, prime  . * Parentheses : encircled characters, such as for Copyright symbol  ©. * – see
Asterisk The asterisk ( ), from Late Latin , from Ancient Greek , ''asteriskos'', "little star", is a typographical symbol. It is so called because it resembles a conventional image of a heraldic star. Computer scientists and mathematicians often voc ...
. * – various symbols with strokes extending to left, up, right and down. * – probably an ASCII character the most used for approximations. A conventional representation of
hyphen The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes ( figure ...
, an approximation of
dash The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
(especially as and ), minus sign  and line drawing horizontal line ─ (see the image). * : various dot-like symbols, see
Full stop The full stop (Commonwealth English), period (North American English), or full point , is a punctuation mark. It is used for several purposes, most often to mark the end of a declarative sentence (as distinguished from a question or exclamatio ...
. * – see
Slash (punctuation) The slash is the oblique slanting line punctuation mark . Also known as a stroke, a solidus or several other historical or technical names including oblique and virgule. Once used to mark periods and commas, the slash is now used to repres ...
. * : Turkish dotless ı, Cyrillic palochka Ӏ. * : IPA reversed epsilon  ɜ, Cyrillic letter  З. * : Cyrillic letter  Ч. * : various non-Latin letters and symbols with similar grapheme. * – see
Colon (punctuation) The colon is a punctuation mark consisting of two equally sized dots aligned vertically. A colon often precedes an explanation, a list, or a quoted sentence. It is also used between hours and minutes in time, between certain elements in medica ...
. * and : chevrons ⟨ ⟩, angle quotes ‹ ›, horizontal arrows (especially as digraphs and ). * : line drawing horizontal double line ═ (see the image),
double hyphen In Latin script, the double hyphen is a punctuation mark that consists of two parallel hyphens. It was a development of the earlier , which developed from a Central European variant of the virgule slash, originally a form of scratch comma. S ...
. * – although not an approximation, the question sign sometimes replaces unrepresented and unrecognized characters. * – see
At sign The at sign, , is normally read aloud as "at"; it is also commonly called the at symbol, commercial at, or address sign. It is used as an accounting and invoice abbreviation meaning "at a rate of" (e.g. 7 widgets @ £2 per widget = £14), but ...
. * :
Numero sign The numero sign or numero symbol, №, (also represented as Nº, No, No. or no.), is a typographic abbreviation of the word ''number''(''s'') indicating ordinal numeration, especially in names and titles. For example, using the numero sign, t ...
 №. * : various symbols with strokes extending to left, right and down, but not up. * :
set union In set theory, the union (denoted by ∪) of a collection of sets is the set of all elements in the collection. It is one of the fundamental operations through which sets can be combined and related to each other. A refers to a union of ze ...
 ∪. * : . * :
X mark An X mark (also known as an ex mark or a cross mark or simply an X or ex or a cross) is used to indicate the concept of negation (for example "no, this has not been verified", "no, that is not the correct answer" or "no, I do not agree") as well ...
 ✗. * and :
checkbox A checkbox (check box, tickbox, tick box) is a graphical widget that permits the user to make a binary choice, i.e. a choice between one of two possible mutually exclusive options. For example, the user may have to answer 'yes' (checked) or 'n ...
and similar
rectangular In Euclidean plane geometry, a rectangle is a quadrilateral with four right angles. It can also be defined as: an equiangular quadrilateral, since equiangular means that all of its angles are equal (360°/4 = 90°); or a parallelogram containin ...
pictograms. * : , upwards arrow ↑, and similar symbols with the wedge at the top. * – see
Underscore An underscore, ; also called an underline, low line, or low dash; is a line drawn under a segment of text. In proofreading, underscoring is a convention that says "set this text in italic type", traditionally used on manuscript or typescript ...
. * – opening single quote ‘. * - flat symbol  * :
bullets A bullet is a kinetic projectile, a component of firearm ammunition that is shot from a gun barrel. Bullets are made of a variety of materials, such as copper, lead, steel, polymer, rubber and even wax. Bullets are made in various shapes and co ...
and various
circle A circle is a shape consisting of all points in a plane that are at a given distance from a given point, the centre. Equivalently, it is the curve traced out by a point that moves in a plane so that its distance from a given point is cons ...
-like symbols such as  and ∞ (using two consecutive characters). * : μ — SI prefix ''
micro- ''Micro'' (Greek letter μ ( U+03BC) or the legacy symbol µ (U+00B5)) is a unit prefix in the metric system denoting a factor of 10−6 (one millionth). Confirmed in 1960, the prefix comes from the Greek ('), meaning "small". The symbol for ...
'' or lowercase Greek letter mu * : downwards arrow ↓, and similar symbols with the wedge at the bottom. * :
multiplication sign The multiplication sign, also known as the times sign or the dimension sign, is the symbol , used in mathematics to denote the multiplication operation and its resulting product. While similar to a lowercase X (), the form is properly a four- ...
 ×. * (on the image, this ASCII character is rendered as a ''broken bar'' ¦): line drawing vertical symbols. * – see
Tilde The tilde () or , is a grapheme with several uses. The name of the character came into English from Spanish, which in turn came from the Latin '' titulus'', meaning "title" or "superscription". Its primary use is as a diacritic (accent) i ...
.


Approximation of non-glyphs

There exist various approximation for
typographic alignment In typesetting and page layout, alignment or range is the setting of text flow or image placement relative to a page, column (measure), table cell, or tab (and often to an image above it or under it). The type alignment setting is sometimes ref ...
. For example, justification may be emulated with inserting of spaces, and flush-right alignment may be done by padding with spaces. There are various techniques for approximation of
tables Table may refer to: * Table (furniture), a piece of furniture with a flat surface and one or more legs * Table (landform), a flat area of land * Table (information), a data arrangement with rows and columns * Table (database), how the table d ...
(historically used for
text mode Text mode is a computer display mode in which content is internally represented on a computer screen in terms of characters rather than individual pixels. Typically, the screen consists of a uniform rectangular grid of ''character cells'', each ...
displays), such as
box-drawing character Box-drawing characters, also known as line-drawing characters, are a form of semigraphics widely used in text user interfaces to draw various geometric frames and boxes. Box-drawing characters typically only work well with monospaced fonts. ...
s.


Modern situation

In digital technologies, there are still some conditions where typographic approximations are appropriate. Some devices, such as
mobile phone A mobile phone, cellular phone, cell phone, cellphone, handphone, hand phone or pocket phone, sometimes shortened to simply mobile, cell, or just phone, is a portable telephone that can make and receive calls over a radio frequency link whi ...
s, cannot support huge
character set Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values tha ...
s and power
text formatting Typesetting is the composition of text by means of arranging physical ''type'' (or ''sort'') in mechanical systems or ''glyphs'' in digital systems representing '' characters'' (letters and other symbols).Dictionary.com Unabridged. Random H ...
tools, which are ubiquitous on desktop computers of the 2000s.


References

{{reflist Typography Typesetting