2020–21 Newcastle United F
   HOME

TheInfoList



OR:

The dash is a
punctuation Punctuation (or sometimes interpunction) is the use of spacing, conventional signs (called punctuation marks), and certain typographical devices as aids to the understanding and correct reading of written text, whether read silently or aloud. An ...
mark consisting of a long horizontal line. It is similar in appearance to the
hyphen The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes (figure d ...
but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen but shorter than the
minus sign The plus and minus signs, and , are mathematical symbols used to represent the notions of positive and negative, respectively. In addition, represents the operation of addition, which results in a sum, while represents subtraction, resulti ...
; the emdash , longer than either the en dash or the minus sign; and the horizontalbar , whose length varies across
typeface A typeface (or font family) is the design of lettering that can include variations in size, weight (e.g. bold), slope (e.g. italic), width (e.g. condensed), and so on. Each of these variations of the typeface is a font. There are list of type ...
s but tends to be between those of the en and em dashes.


History

In the early 1600s, in Okes-printed
plays Play most commonly refers to: * Play (activity), an activity done for enjoyment * Play (theatre), a work of drama Play may refer also to: Computers and technology * Google Play, a digital content service * Play Framework, a Java framework * P ...
of
William Shakespeare William Shakespeare ( 26 April 1564 – 23 April 1616) was an English playwright, poet and actor. He is widely regarded as the greatest writer in the English language and the world's pre-eminent dramatist. He is often called England's nation ...
, dashes are attested that indicate a thinking pause, interruption, mid-speech realization, or change of subject. The dashes are variously longer (as in
King Lear ''King Lear'' is a tragedy written by William Shakespeare. It is based on the mythological Leir of Britain. King Lear, in preparation for his old age, divides his power and land between two of his daughters. He becomes destitute and insane an ...
reprinted 1619) or composed of hyphens (as in
Othello ''Othello'' (full title: ''The Tragedy of Othello, the Moor of Venice'') is a tragedy written by William Shakespeare, probably in 1603, set in the contemporary Ottoman–Venetian War (1570–1573) fought for the control of the Island of Cypru ...
printed 1622); moreover, the dashes are often, but not always, prefixed by a comma, colon, or semicolon. In 1733, in
Jonathan Swift Jonathan Swift (30 November 1667 – 19 October 1745) was an Anglo-Irish Satire, satirist, author, essayist, political pamphleteer (first for the Whig (British political party), Whigs, then for the Tories (British political party), Tories), poe ...
's ''On Poetry'', the terms ''break'' and ''dash'' are attested for and marks:
Blot out, correct, insert, refine, Enlarge, diminish, interline; Be mindful, when Invention fails; To scratch your Head, and bite your Nails. Your poem finish'd, next your Care Is needful, to transcribe it fair. In modern Wit all printed Trash, is Set off with num'rous ''Breaks''⸺and ''Dashes''—


Types of dash

Usage varies both within English and within other languages, but the usual conventions for the most common dashes in printed English text are these: * An (unspaced) em dash or a spaced en dash can be used to mark a break in a sentence, and a pair can be used to set off a
parenthetical A bracket is either of two tall fore- or back-facing punctuation marks commonly used to isolate a segment of text or data from its surroundings. Typically deployed in symmetric pairs, an individual bracket may be identified as a 'left' or 'r ...
statement. For example: * An en dash, but not an em dash, indicates spans or differentiation, where it may replace "and", "to", or "through". For example: * An em dash or horizontal bar, but not an en dash, is used to set off the source of a direct
quotation A quotation is the repetition of a sentence, phrase, or passage from speech or text that someone has said or written. In oral speech, it is the representation of an utterance (i.e. of something that a speaker actually said) that is introduced by ...
. For example: * A horizontal bar (also called ''quotation dash'') or the em dash, but not the en dash, introduces quoted text.


Figure dash

The figure dash (, ) has the same width as a numerical digit; most
font In metal typesetting, a font is a particular size, weight and style of a typeface. Each font is a matched set of type, with a piece (a "sort") for each glyph. A typeface consists of a range of such fonts that shared an overall design. In mod ...
s have digits of equal width. It is used within numbers (e.g., the phone number 555‒0199), especially in columns, for maintaining alignment. In contrast, the en dash is generally used for a range of values. The
minus sign The plus and minus signs, and , are mathematical symbols used to represent the notions of positive and negative, respectively. In addition, represents the operation of addition, which results in a sum, while represents subtraction, resulti ...
()
glyph A glyph () is any kind of purposeful mark. In typography, a glyph is "the specific shape, design, or representation of a character". It is a particular graphical representation, in a particular typeface, of an element of written language. A g ...
is generally set a little higher, so as to be level with the
plus sign The plus and minus signs, and , are mathematical symbols used to represent the notions of positive and negative, respectively. In addition, represents the operation of addition, which results in a sum, while represents subtraction, result ...
. In informal usage, the
hyphen-minus The hyphen-minus is the most commonly used type of hyphen, widely used in digital documents. It is the only character that looks like a minus sign or a dash in many character sets such as ASCII or on most keyboards, so it is also used as such. ...
(), provided as standard on most keyboards, is often used instead of the figure dash. In
TeX Tex may refer to: People and fictional characters * Tex (nickname), a list of people and fictional characters with the nickname * Joe Tex (1933–1982), stage name of American soul singer Joseph Arrington Jr. Entertainment * ''Tex'', the Italian ...
, the standard fonts have no figure dash; however, the digits normally all have the same width as the en dash, so an en dash can be substituted. In
XeLaTeX XeTeX ( or ; see also Pronouncing and writing "TeX") is a TeX typesetting engine using Unicode and supporting modern font technologies such as OpenType, Graphite and Apple Advanced Typography (AAT). It was originally written by Jonathan Kew ...
, one can use \char"2012. The
Linux Libertine Linux Libertine is a digital typeface created by the Libertine Open Fonts Project, which aims to create free and open alternatives to proprietary typefaces such as Times New Roman. It is developed with the free font editor FontForge and is licen ...
font also has the figure dash glyph.


En dash

The en dash, en rule, or nut dash is traditionally half the width of an
em dash The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen b ...
. In modern fonts, the length of the en dash is not standardized, and the en dash is often more than half the width of the em dash. The widths of en and em dashes have also been specified as being equal to those of the upper-case letters N and M, respectively, and at other times to the widths of the lower-case letters.


Usage

The three main uses of the en dash are # to connect symmetric items, such as the two ends of a range or two competitors or alternatives # as a substitute for a hyphen in a compound when one of the connected items is more complex than a single word # as an interruptor at sentence level, substituting for a pair of commas, parentheses, or to indicate a rhetorical pause. It is usually held that, when used as an interruptor, the en dash should be "open"spaced on both sidesin contrast to the em dash, which is usually closed; a common exception is in
newspapers A newspaper is a periodical publication containing written information about current events and is often typed in black ink with a white or gray background. Newspapers can cover a wide variety of fields such as politics, business, sports ...
.


Ranges of values

The en dash is commonly used to indicate a closed range of valuesa range with clearly defined and finite upper and lower boundariesroughly signifying what might otherwise be communicated by the word "through" in American English, or "to" in International English. This may include ranges such as those between dates, times, or numbers. Various
style guide A style guide or manual of style is a set of standards for the writing, formatting, and design of documents. It is often called a style sheet, although that term also has multiple other meanings. The standards can be applied either for gene ...
s restrict this range indication style to only parenthetical or tabular matter, requiring "to" or "through" in running text. Preference for hyphen vs. en dash in ranges varies. For example, the
APA style APA style (also known as APA format) is a writing style and format for academic documents such as scholarly journal articles and books. It is commonly used for citing sources within the field of behavioral and social sciences, including sociol ...
(named after the American Psychological Association) uses an en dash in ranges, but the
AMA style ''AMA Manual of Style: A Guide for Authors and Editors'' is the style guide of the American Medical Association. It is written by the editors of ''JAMA'' (''Journal of the American Medical Association'') and the JAMA Network journals and is most ...
(named after the American Medical Association) uses a hyphen: Some style guides (including the ''Guide for the Use of the International System of Units ( SI)'' and the ''
AMA Manual of Style ''AMA Manual of Style: A Guide for Authors and Editors'' is the style guide of the American Medical Association. It is written by the editors of ''JAMA (journal), JAMA'' (''Journal of the American Medical Association'') and the JAMA Network jour ...
'') recommend that, when a number range might be misconstrued as subtraction, the word "to" should be used instead of an en dash. For example, "a voltage of 50 V to 100 V" is preferable to using "a voltage of 50–100 V". Relatedly, in ranges that include negative numbers, "to" is used to avoid ambiguity or awkwardness (for example, "temperatures ranged from −18°C to −34°C"). It is also considered poor style (best avoided) to use the en dash in place of the words "to" or "and" in phrases that follow the forms ''from X to Y'' and ''between X and Y''.


Relationships and connections

The en dash is used to contrast values or illustrate a relationship between two things. Examples of this usage include: * Australia beat American Samoa 31–0. * Radical–Unionist coalition * Boston–Hartford route * New York–London flight (however, it may be argued that ''New York–to-London flight'' is more appropriate because New York is a single name composed of two valid words; with a single en dash, the phrase is ambiguous and could mean either ''Flight from New York to London'' or ''New flight from York to London''; such ambiguity is assuaged when used mid-sentence, though, because of the capital N in "New" indicating it is a special noun). If dash–hyphen use becomes too unwieldy or difficult to understand, the sentence can be rephrased for clarity and readability; for example, "The flight from New York to London was a pleasant experience". * Mother–daughter relationship * The
Supreme Court A supreme court is the highest court within the hierarchy of courts in most legal jurisdictions. Other descriptions for such courts include court of last resort, apex court, and high (or final) court of appeal. Broadly speaking, the decisions of ...
voted 5–4 to uphold the decision. A distinction is often made between "simple" attributive compounds (written with a hyphen) and other subtypes (written with an en dash); at least one authority considers name pairs, where the paired elements carry equal weight, as in the
Taft–Hartley Act The Labor Management Relations Act of 1947, better known as the Taft–Hartley Act, is a United States federal law that restricts the activities and power of labor unions. It was enacted by the 80th United States Congress over the veto of Preside ...
to be "simple", while others consider an en dash appropriate in instances such as these to represent the parallel relationship, as in the McCain–Feingold bill or
Bose–Einstein statistics In quantum statistics, Bose–Einstein statistics (B–E statistics) describes one of two possible ways in which a collection of non-interacting, indistinguishable particles may occupy a set of available discrete energy states at thermodynamic e ...
. When an act of the U.S. Congress is named using the surnames of the senator and representative who sponsored it, the hyphen-minus is used in the
short title In certain jurisdictions, including the United Kingdom and other Westminster-influenced jurisdictions (such as Canada or Australia), as well as the United States and the Philippines, primary legislation has both a short title and a long title. Th ...
; thus, the short title of Public Law 111–203 is "The Dodd-Frank Wall Street Reform and Consumer Protection Act", with a
hyphen-minus The hyphen-minus is the most commonly used type of hyphen, widely used in digital documents. It is the only character that looks like a minus sign or a dash in many character sets such as ASCII or on most keyboards, so it is also used as such. ...
rather than an en dash between "Dodd" and "Frank". However, there is a difference between something named for a parallel/coordinate relationship between two people for example,
Satyendra Nath Bose Satyendra Nath Bose (; 1 January 1894 – 4 February 1974) was a Bengali mathematician and physicist specializing in theoretical physics. He is best known for his work on quantum mechanics in the early 1920s, in developing the foundation for ...
and
Albert Einstein Albert Einstein ( ; ; 14 March 1879 – 18 April 1955) was a German-born theoretical physicist, widely acknowledged to be one of the greatest and most influential physicists of all time. Einstein is best known for developing the theory ...
and something named for a single person who had a
compound surname In some cultures, a surname, family name, or last name is the portion of one's personal name that indicates one's family, tribe or community. Practices vary by culture. The family name may be placed at either the start of a person's full name, ...
, which may be written with a hyphen or a space but not an en dashfor example, the
Lennard-Jones potential The Lennard-Jones potential (also termed the LJ potential or 12-6 potential) is an intermolecular pair potential. Out of all the intermolecular potentials, the Lennard-Jones potential is probably the one that has been the most extensively studied ...
yphenis named after one person (
John Lennard-Jones Sir John Edward Lennard-Jones (27 October 1894 – 1 November 1954) was a British mathematician and professor of theoretical physics at the University of Bristol, and then of theoretical science at the University of Cambridge. He was an imp ...
), as are
Bence Jones protein Bence Jones protein is a monoclonal globulin protein or immunoglobulin light chain found in the urine, with a molecular weight of 22–24 kDa. Detection of Bence Jones protein may be suggestive of multiple myeloma or Waldenström's macroglobuli ...
s and Hughlings Jackson syndrome. Copyeditors use dictionaries (general, medical, biographical, and geographical) to confirm the
eponym An eponym is a person, a place, or a thing after whom or which someone or something is, or is believed to be, named. The adjectives which are derived from the word eponym include ''eponymous'' and ''eponymic''. Usage of the word The term ''epon ...
ity (and thus the styling) for specific terms, given that no one can know them all offhand. Preference for an en dash instead of a hyphen in these coordinate/relationship/connection types of terms is a matter of style, not inherent orthographic "correctness"; both are equally "correct", and each is the preferred style in some style guides. For example, ''
the American Heritage Dictionary of the English Language ''The American Heritage Dictionary of the English Language'' (''AHD'') is an American English, American dictionary of English published by Boston publisher Houghton Mifflin Harcourt, Houghton Mifflin, the first edition of which appeared in 1969. ...
'', the ''
AMA Manual of Style ''AMA Manual of Style: A Guide for Authors and Editors'' is the style guide of the American Medical Association. It is written by the editors of ''JAMA (journal), JAMA'' (''Journal of the American Medical Association'') and the JAMA Network jour ...
'', and
Dorland's medical reference works ''Dorland's'' is the brand name of a family of medical reference works (including dictionaries, spellers and word books, and spell-check software) in various media spanning printed books, CD-ROMs, and online content. The flagship products are ''Do ...
use hyphens, not en dashes, in coordinate terms (such as " blood-brain barrier"), in
eponym An eponym is a person, a place, or a thing after whom or which someone or something is, or is believed to be, named. The adjectives which are derived from the word eponym include ''eponymous'' and ''eponymic''. Usage of the word The term ''epon ...
s (such as " Cheyne-Stokes respiration", " Kaplan-Meier method"), and so on.


Attributive compounds

In English, the en dash is usually used instead of a
hyphen The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes (figure d ...
in compound (phrasal) attributives in which one or both elements is itself a compound, especially when the compound element is an open compound, meaning it is not itself hyphenated. This manner of usage may include such examples as: * The hospital–nursing home connection (the connection between the ''hospital'' and the ''nursing home'', not a ''home connection'' between the ''hospital'' and ''nursing'') * A nursing home–home care policy (a policy about the ''nursing home'' and ''home care'') * Pre–Civil War era *
Pulitzer Prize The Pulitzer Prize () is an award for achievements in newspaper, magazine, online journalism, literature, and musical composition within the United States. It was established in 1917 by provisions in the will of Joseph Pulitzer, who had made h ...
–winning novel *
New York–style pizza New York–style pizza is pizza made with a characteristically large hand-tossed thin crust, often sold in wide Pizza by the slice, slices Take-out, to go. The crust is thick and crisp only along its edge, yet soft, thin, and pliable enough be ...
* The non–San Francisco part of the world * The post–World War II era ** (Compare ''
post-war In Western usage, the phrase post-war era (or postwar era) usually refers to the time since the end of World War II. More broadly, a post-war period (or postwar period) is the interval immediately following the end of a war. A post-war period c ...
era'', which, if not fully compounded (''postwar''), takes a hyphen, not an en dash. The difference is that ''war'' is not an open compound, whereas ''World War II'' is.) *
Trans–New Guinea languages Trans–New Guinea (TNG) is an extensive family of Papuan languages spoken on the island of New Guinea and neighboring islands ‒ corresponding to the country Papua New Guinea as well as parts of Indonesia. Trans–New Guinea is the third-la ...
* The ex–prime minister * a long–focal length camera * water ice–based bedrock * The pro-conscription–anti-conscription debate * Public-school–private-school rivalries The disambiguating value of the en dash in these patterns was illustrated by Strunk and White in ''
The Elements of Style ''The Elements of Style'' is an American English writing style guide in numerous editions. The original was written by William Strunk Jr. in 1918, and published by Harcourt in 1920, comprising eight "elementary rules of usage", ten "elementary p ...
'' with the following example: When ''Chattanooga News'' and ''Chattanooga Free Press'' merged, the joint company was inaptly named ''
Chattanooga News-Free Press The ''Chattanooga Times Free Press'' is a daily broadsheet newspaper published in Chattanooga, Tennessee, and is distributed in the metropolitan Chattanooga region of southeastern Tennessee and northwestern Georgia. It is one of Tennessee's majo ...
'' (using a hyphen), which could be interpreted as meaning that their newspapers were news-free. An exception to the use of en dashes is usually made when
prefix A prefix is an affix which is placed before the Word stem, stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy'' ...
ing an already- hyphenated compound; an en dash is generally avoided as a distraction in this case. Examples of this include: * non-English-speaking air traffic controllers * semi-labor-intensive industries *
Proto-Indo-European language Proto-Indo-European (PIE) is the reconstructed common ancestor of the Indo-European language family. Its proposed features have been derived by linguistic reconstruction from documented Indo-European languages. No direct record of Proto-Indo-E ...
* The post-
MS-DOS MS-DOS ( ; acronym for Microsoft Disk Operating System, also known as Microsoft DOS) is an operating system for x86-based personal computers mostly developed by Microsoft. Collectively, MS-DOS, its rebranding as IBM PC DOS, and a few ope ...
era * non-government-owned corporations An en dash can be retained to avoid ambiguity, but whether any ambiguity is plausible is a judgment call.
AMA style ''AMA Manual of Style: A Guide for Authors and Editors'' is the style guide of the American Medical Association. It is written by the editors of ''JAMA'' (''Journal of the American Medical Association'') and the JAMA Network journals and is most ...
retains the en dashes in the following examples: * non–self-governing * non–English-language journals * non–group-specific blood * non–Q-wave myocardial infarction * non–brain-injured subjects


Differing recommendations

As discussed above, the en dash is sometimes recommended instead of a hyphen in
compound adjectives A compound modifier (also called a compound adjective, phrasal adjective, or adjectival phrase) is a compound of two or more attributive words: that is, two or more words that collectively modify a noun. Compound modifiers are grammatically equival ...
where neither part of the adjective modifies the other—that is, when each modifies the noun, as in ''
love–hate relationship A love–hate relationship is an interpersonal relationship involving simultaneous or alternating emotions of love and hate—something particularly common when emotions are intense. The term is used frequently in psychology, popular writing and ...
''. ''
The Chicago Manual of Style ''The Chicago Manual of Style'' (abbreviated in writing as ''CMOS'' or ''CMS'', or sometimes as ''Chicago'') is a style guide for American English published since 1906 by the University of Chicago Press. Its 17 editions have prescribed writi ...
'' (''CMOS''), however, limits the use of the en dash to two main purposes: * First, use it to indicate ranges of time, money, or other amounts, or in certain other cases where it replaces the word "to". * Second, use it in place of a hyphen in a compound adjective when one of the elements of the adjective is an open compound, or when two or more of its elements are compounds, open or hyphenated. That is, the ''CMOS'' favors hyphens in instances where some other guides suggest en dashes, with the 16th edition explaining that "Chicago's sense of the en dash does not extend to ''between''", to rule out its use in "US–Canadian relations". In these two uses, en dashes normally do not have spaces around them. Some make an exception when they believe avoiding spaces may cause confusion or look odd. For example, compare with . However, other authorities disagree and state there should be no space between an en dash and adjacent text. These authorities would not use a space in, for example, or .


Parenthetic and other uses at the sentence level

En dashes can be used instead of pairs of commas that mark off a nested clause or phrase. They can also be used around parenthetical expressions such as this one rather than the em dashes preferred by some publishers. The en dash can also signify a rhetorical pause. For example, an
opinion piece An opinion piece is an article, usually published in a newspaper or magazine, that mainly reflects the author's opinion about a subject. Opinion pieces are featured in many periodicals. Editorials Opinion pieces may take the form of an editorial, ...
from ''
The Guardian ''The Guardian'' is a British daily newspaper. It was founded in 1821 as ''The Manchester Guardian'', and changed its name in 1959. Along with its sister papers ''The Observer'' and ''The Guardian Weekly'', ''The Guardian'' is part of the Gu ...
'' is entitled: :Who is to blame for the sweltering weather? My kids say it's boomersand me In these situations, en dashes must have a single space on each side.


= Itemization mark

= Either the en dash or the em dash may be used as a
bullet A bullet is a kinetic projectile, a component of firearm ammunition that is shot from a gun barrel. Bullets are made of a variety of materials, such as copper, lead, steel, polymer, rubber and even wax. Bullets are made in various shapes and co ...
at the start of each item in a bulleted list. (This is a matter of
graphic design Graphic design is a profession, academic discipline and applied art whose activity consists in projecting visual communications intended to transmit specific messages to social groups, with specific objectives. Graphic design is an interdiscipli ...
rather than
orthography An orthography is a set of conventions for writing a language, including norms of spelling, hyphenation, capitalization, word breaks, emphasis, and punctuation. Most transnational languages in the modern period have a writing system, and mos ...
.)


Typography


Spacing

In most uses of en dashes, such as when used in indicating ranges, they are closed up to the joined words. It is only when en dashes are used in setting off parenthetical expressionssuch as this onethat they take spaces around them. For more on the choice of em versus en in this context, see En dash versus em dash.


Encoding and substitution

When an en dash is unavailable in a particular
character encoding Character encoding is the process of assigning numbers to Graphics, graphical character (computing), characters, especially the written characters of Language, human language, allowing them to be Data storage, stored, Data communication, transmi ...
environment—as in the
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
character set—there are some conventional substitutions. Often two consecutive hyphens are the substitute. The en dash is encoded in Unicode as U+2013 (decimal 8211) and represented in HTML by the named character entity –. The en dash is sometimes used as a substitute for the
minus sign The plus and minus signs, and , are mathematical symbols used to represent the notions of positive and negative, respectively. In addition, represents the operation of addition, which results in a sum, while represents subtraction, resulti ...
, when the minus sign character is not available since the en dash is usually the same width as a plus sign and is often available when the minus sign is not; see
below Below may refer to: *Earth *Ground (disambiguation) *Soil *Floor *Bottom (disambiguation) Bottom may refer to: Anatomy and sex * Bottom (BDSM), the partner in a BDSM who takes the passive, receiving, or obedient role, to that of the top or ...
. For example, the original 8-bit
Macintosh Character Set Mac OS Roman is a character encoding created by Apple Computer, Inc. for use by Macintosh computers. It is suitable for representing text in English and several other Western languages. Mac OS Roman encodes 256 characters, the first 128 of which ...
had an en dash, useful for the minus sign, years before Unicode with a dedicated minus sign was available. The hyphen-minus is usually too narrow to make a typographically acceptable minus sign. However, the en dash cannot be used for a minus sign in
programming language A programming language is a system of notation for writing computer programs. Most programming languages are text-based formal languages, but they may also be graphical. They are a kind of computer language. The description of a programming ...
s because the syntax usually requires a hyphen-minus.


Em dash

The em dash, em rule, or mutton dash is longer than an
en dash The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen b ...
. The character is called an ''em dash'' because it is one em wide, a length that varies depending on the font size. One em is the same length as the font's height (which is typically measured in points). So in 9-point type, an em dash is nine points wide, while in 24-point type the em dash is 24 points wide. By comparison, the en dash, with its width, is in most
fonts In metal typesetting, a font is a particular size, weight and style of a typeface. Each font is a matched set of type, with a piece (a "sort") for each glyph. A typeface consists of a range of such fonts that shared an overall design. In mode ...
either a half-em wide or the width of an upper-case "N". The em dash is encoded in Unicode as U+2014 (decimal 8212) and represented in HTML by the named character entity —.


Usage

The em dash is used in several ways. It is primarily used in places where a set of
parentheses A bracket is either of two tall fore- or back-facing punctuation marks commonly used to isolate a segment of text or data from its surroundings. Typically deployed in symmetric pairs, an individual bracket may be identified as a 'left' or 'r ...
or a colon might otherwise be used, and it can also show an abrupt change in thought (or an interruption in speech) or be used where a
full stop The full stop (Commonwealth English), period (North American English), or full point , is a punctuation mark. It is used for several purposes, most often to mark the end of a declarative sentence (as distinguished from a question or exclamation ...
(period) is too strong and a
comma The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline ...
too weak. Em dashes are also used to set off summaries or definitions. Common uses and definitions are cited below with examples.


''Colon-like use''


= Simple equivalence (or near-equivalence) of colon and em dash

= * ''Three alkali metals are the usual substituents: sodium, potassium, and lithium.'' * ''Three alkali metals are the usual substituents—sodium, potassium, and lithium.''


= Inversion of the function of a colon

= * These are the colors of the flag: red, white, and blue. * Red, white, and blue—these are the colors of the flag.


''Parenthesis-like use''


= Simple equivalence (or near-equivalence) of paired parenthetical marks

= * Compare parentheses with em dashes: ** ''Three alkali metals (sodium, potassium, and lithium) are the usual substituents.'' ** ''Three alkali metals—sodium, potassium, and lithium—are the usual substituents.'' * Compare commas, em dashes and parentheses (respectively) when no internal commas intervene: ** ''The food, which was delicious, reminded me of home.'' ** ''The food—which was delicious—reminded me of home.'' ** ''The food (which was delicious) reminded me of home.''


= Subtle differences in punctuation

= It may indicate an interpolation stronger than that demarcated by parentheses, as in the following from
Nicholson Baker Nicholson Baker (born January 7, 1957) is an American novelist and essayist. His fiction generally de-emphasizes narrative in favor of careful description and characterization. His early novels such as ''The Mezzanine'' and ''Room Temperature'' we ...
's ''
The Mezzanine ''The Mezzanine'' (1988) is the first novel by American writer Nicholson Baker. It narrates what goes through a man's mind during a modern lunch break. Concept On the surface, the novel deals with a man's lunchtime trip up an escalator in the mez ...
'' (the degree of difference is subjective). * "At that age I once stabbed my best friend, Fred, with a pair of pinking shears in the base of the neck, enraged because he had been given the comprehensive sixty-four-crayon Crayola box—including the gold and silver crayons—and would not let me look closely at the box to see how Crayola had stabilized the built-in crayon sharpener under the tiers of crayons."


''Interruption of a speaker''


= Interruption by someone else

= * "But I'm trying to explain that I—"
"I'm aware of your mitigating circumstances, but your negative attitude was excessive." In a related use, it may visually indicate the shift between speakers when they overlap in speech. For example, the em dash is used this way in
Joseph Heller Joseph Heller (May 1, 1923 – December 12, 1999) was an American author of novels, short stories, plays, and screenplays. His best-known work is the 1961 novel ''Catch-22'', a satire on war and bureaucracy, whose title has become a synonym for ...
's ''
Catch-22 ''Catch-22'' is a satirical war novel by American author Joseph Heller. He began writing it in 1953; the novel was first published in 1961. Often cited as one of the most significant novels of the twentieth century, it uses a distinctive non-chr ...
'': * He was Cain, Ulysses, the Flying Dutchman; he was Lot in Sodom, Deirdre of the Sorrows, Sweeney in the nightingales among trees. He was the miracle ingredient Z-147. He was—
"Crazy!" Clevinger interrupted, shrieking. "That's what you are! Crazy!"
"—immense. I'm a real, slam-bang, honest-to-goodness, three-fisted humdinger. I'm a bona fide supraman."


= Self-interruption

= * Simple revision of a statement as one's thoughts evolve on the fly: ** "I believe I shall—''no'', I'm going to do it." * Contemplative or emotional trailing off (usually in
dialogue Dialogue (sometimes spelled dialog in American English) is a written or spoken conversational exchange between two or more people, and a literary and theatrical form that depicts such an exchange. As a philosophical or didactic device, it is c ...
or in
first person narrative A first-person narrative is a mode of storytelling in which a storyteller recounts events from their own point of view using the first person It may be narrated by a first-person protagonist (or other focal character), first-person re-teller ...
): ** "I sense something; a presence I've not felt since—" in '' Star Wars Episode IV: A New Hope''. ** "Get out or else—" : Either an
ellipsis The ellipsis (, also known informally as dot dot dot) is a series of dots that indicates an intentional omission of a word, sentence, or whole section from a text without altering its original meaning. The plural is ellipses. The term origin ...
or an em dash can indicate
aposiopesis Aposiopesis (; Classical Greek: ἀποσιώπησις, "becoming silent") is a figure of speech wherein a sentence is deliberately broken off and left unfinished, the ending to be supplied by the imagination, giving an impression of unwillingness ...
, the
rhetorical device In rhetoric, a rhetorical device, persuasive device, or stylistic device is a technique that an author or speaker uses to convey to the listener or reader a meaning with the goal of persuading them towards considering a topic from a perspective, ...
by which a sentence is stopped short not because of interruption, but because the speaker is too emotional or pensive to continue. Because the ellipsis is the more common choice, an em dash for this purpose may be ambiguous in expository text, as many readers would assume interruption, although it may be used to indicate great emotion in dramatic
monologue In theatre, a monologue (from el, μονόλογος, from μόνος ''mónos'', "alone, solitary" and λόγος ''lógos'', "speech") is a speech presented by a single character, most often to express their thoughts aloud, though sometimes a ...
. * Long pause: ** In
Early Modern English Early Modern English or Early New English (sometimes abbreviated EModE, EMnE, or ENE) is the stage of the English language from the beginning of the Tudor period to the English Interregnum and Restoration, or from the transition from Middle E ...
texts and afterward, em dashes have been used to add long pauses (as noted in Joseph Robertson's 1785 ''An Essay On Punctuation''): ::


''Quotation''


= Quotation mark–like use

= This is a
quotation dash Quotation marks (also known as quotes, quote marks, speech marks, inverted commas, or talking marks) are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase. The pair consists of an ...
. It may be distinct from an em dash in its coding (see
Horizontal bar The horizontal bar, also known as the high bar, is an apparatus used by male gymnasts in artistic gymnastics. It traditionally consists of a cylindrical metal (typically steel) bar that is rigidly held above and parallel to the floor by a syste ...
). It may be used to indicate turns in a dialogue, in which case each dash starts a paragraph. It replaces other quotation marks and was preferred by authors such as
James Joyce James Augustine Aloysius Joyce (2 February 1882 – 13 January 1941) was an Irish novelist, poet, and literary critic. He contributed to the modernist avant-garde movement and is regarded as one of the most influential and important writers of ...
: : ―O saints above! miss Douce said, sighed above her jumping rose. I wished I hadn't laughed so much. I feel all wet. : ―O, miss Douce! miss Kennedy protested. You horrid thing!


= Attribution of quote source

= * Inline quotes: ** ''A penny saved is a penny earned.'' —
Benjamin Franklin Benjamin Franklin ( April 17, 1790) was an American polymath who was active as a writer, scientist, inventor, statesman, diplomat, printer, publisher, and political philosopher. Encyclopædia Britannica, Wood, 2021 Among the leading inte ...
* Block quotes:


Redaction

An em dash may be used to indicate omitted letters in a word redacted to an initial or single letter or to
fillet Fillet may refer to: *Annulet (architecture), part of a column capital, also called a fillet *Fillet (aircraft), a fairing smoothing the airflow at a joint between two components *Fillet (clothing), a headband *Fillet (cut), a piece of meat *Fille ...
a word, by leaving the start and end letters whilst replacing the middle letters with a dash or dashes (for the purposes of
censorship Censorship is the suppression of speech, public communication, or other information. This may be done on the basis that such material is considered objectionable, harmful, sensitive, or "inconvenient". Censorship can be conducted by governments ...
or simply
data anonymization Data anonymization is a type of information sanitization whose intent is privacy protection. It is the process of removing personally identifiable information from data sets, so that the people whom the data describe remain anonymous. Overv ...
). In this use, it is sometimes doubled. * ''It was alleged that D⸺ had been threatened with blackmail.'' Three em dashes might be used to indicate a completely missing word.


Itemization mark

Either the en dash or the em dash may be used as a
bullet A bullet is a kinetic projectile, a component of firearm ammunition that is shot from a gun barrel. Bullets are made of a variety of materials, such as copper, lead, steel, polymer, rubber and even wax. Bullets are made in various shapes and co ...
at the start of each item in a bulleted list, but a plain hyphen is more commonly used.


Repetition

Three em dashes one after another can be used in a footnote, endnote, or another form of bibliographic entry to indicate repetition of the same author's name as that of the previous work, Note that his ''Pete's Guide'' website has an updated version
Version 2.0—May 27, 2002
.
which is similar to the use of


Typographic details


Spacing and substitution

According to most American sources (such as ''
The Chicago Manual of Style ''The Chicago Manual of Style'' (abbreviated in writing as ''CMOS'' or ''CMS'', or sometimes as ''Chicago'') is a style guide for American English published since 1906 by the University of Chicago Press. Its 17 editions have prescribed writi ...
'') and some British sources (such as ''
The Oxford Guide to Style ''Hart's Rules for Compositors and Readers at the University Press, Oxford''—today published under the short title ''New Hart's Rules''—is an authoritative reference book and style guide published in England by Oxford University Press (OUP ...
''), an em dash should always be set closed, meaning it should not be surrounded by spaces. But the practice in some parts of the English-speaking world, including the style recommended by ''
The New York Times Manual of Style and Usage ''The New York Times Manual of Style and Usage: The Official Style Guide Used by the Writers and Editors of the World's Most Authoritative Newspaper'' is a style guide first published in 1950 by editors at the newspaper and revised in 1974, 1999, ...
'' for printed newspapers and the ''
AP Stylebook The ''AP Stylebook'', also known by its full name ''The Associated Press Stylebook and Briefing on Media Law'', is an American English grammar style and usage guide created by American journalists working for or connected with the Associated Pr ...
'', sets it open, separating it from its surrounding words by using spaces or
hair space In computer programming, whitespace is any character or series of characters that represent horizontal or vertical space in typography. When rendered, a whitespace character does not correspond to a visible mark, but typically does occupy an area ...
s (U+200A) when it is being used parenthetically. The ''AP Stylebook'' rejects the use of the open em dash to set off introductory items in lists. However, the "space, en dash, space" sequence is the predominant style in German and French
typography Typography is the art and technique of arranging type to make written language legible, readable and appealing when displayed. The arrangement of type involves selecting typefaces, point sizes, line lengths, line-spacing ( leading), and ...
. (See En dash versus em dash below.) In Canada, ''The Canadian Style: A Guide to Writing and Editing'', ''The Oxford Canadian A to Z of Grammar, Spelling & Punctuation: Guide to Canadian English Usage'' (2nd ed.), ''Editing Canadian English'', and the ''Canadian Oxford Dictionary'' all specify that an em dash should be set closed when used between words, a word and numeral, or two numerals. The Australian government's ''Style Manual for Authors, Editors and Printers'' (6th ed.), also specifies that em dashes inserted between words, a word and numeral, or two numerals, should be set closed. A section on the 2-em rule (⸺) also explains that the 2-em can be used to mark an abrupt break in direct or reported speech, but a space is used before the 2-em if a complete word is missing, while no space is used if part of a word exists before the sudden break. Two examples of this are as follows: * I distinctly heard him say, "Go away or I'll ⸺". * It was alleged that D⸺ had been threatened with blackmail.


Approximating the em dash with two or three hyphens

When an em dash is unavailable in a particular
character encoding Character encoding is the process of assigning numbers to Graphics, graphical character (computing), characters, especially the written characters of Language, human language, allowing them to be Data storage, stored, Data communication, transmi ...
environment—as in the
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of ...
character set—it has usually been approximated as consecutive double (--) or triple (---) hyphen-minuses. The two-hyphen em dash proxy is perhaps more common, being a widespread convention in the
typewriting Typing is the process of writing or inputting text by pressing keys on a typewriter, computer keyboard, mobile phone or calculator. It can be distinguished from other means of text input, such as handwriting and speech recognition. Text can b ...
era. (It is still described for hard copy manuscript preparation in the ''Chicago Manual of Style'' as of the 16th edition, although the manual conveys that typewritten manuscript and copyediting on paper are now dated practices.) The three-hyphen em dash proxy was popular with various publishers because the sequence of one, two, or three hyphens could then correspond to the hyphen, en dash, and em dash, respectively. Because early comic book
letterer A letterer is a member of a team of comic book creators responsible for drawing the comic book's text. The letterer's use of typefaces, calligraphy, letter size, and layout all contribute to the impact of the comic. The letterer crafts the comi ...
s were not aware of the typographic convention of replacing a typewritten double hyphen with an em dash, the double hyphen became traditional in American comics. This practice has continued despite the development of computer lettering.


En dash versus em dash

The en dash is wider than the
hyphen The hyphen is a punctuation mark used to join words and to separate syllables of a single word. The use of hyphens is called hyphenation. ''Son-in-law'' is an example of a hyphenated word. The hyphen is sometimes confused with dashes (figure d ...
but not as wide as the em dash. An em width is defined as the point size of the currently used font, since the M character is not always the width of the point size. In running text, various dash conventions are employed: an em dash—like so—or a spaced em dash — like so — or a spaced en dashlike socan be seen in contemporary publications. Various style guides and national varieties of languages prescribe different guidance on dashes. Dashes have been cited as being treated differently in the US and the UK, with the former preferring the use of an em dash with no additional spacing and the latter preferring a spaced en dash. As examples of the US style, ''The Chicago Manual of Style'' and '' The Publication Manual of the American Psychological Association'' recommend unspaced em dashes. Style guides outside the US are more variable. For example, ''
The Elements of Typographic Style ''The Elements of Typographic Style'' is a book on typography and style by Canadian typographer, poet and translator Robert Bringhurst. Originally published in 1992 by Hartley & Marks Publishers, it was revised in 1996, 2001 (v2.4), 2002 (v2.5), ...
'' by Canadian typographer
Robert Bringhurst Robert Bringhurst Appointments to the Order of Canada (2013). (born 16 October 1946) is a Canadian poet, typographer and author. He has translated substantial works from Haida and Navajo and from classical Greek and Arabic. He wrote ''The Eleme ...
recommends the spaced en dashlike soand argues that the length and visual magnitude of an em dash "belongs to the padded and corseted aesthetic of Victorian typography". In the United Kingdom, the spaced en dash is the house style for certain major publishers, including the
Penguin Group Penguin Group is a British trade book publisher and part of Penguin Random House, which is owned by the German media conglomerate Bertelsmann. The new company was created by a merger that was finalised on 1 July 2013, with Bertelsmann initial ...
, the
Cambridge University Press Cambridge University Press is the university press of the University of Cambridge. Granted letters patent by Henry VIII of England, King Henry VIII in 1534, it is the oldest university press A university press is an academic publishing hou ...
, and
Routledge Routledge () is a British multinational publisher. It was founded in 1836 by George Routledge, and specialises in providing academic books, journals and online resources in the fields of the humanities, behavioural science, education, law, and ...
. However, this convention is not universal. The ''Oxford Guide to Style'' (2002, section 5.10.10) acknowledges that the spaced en dash is used by "other British publishers" but states that the
Oxford University Press Oxford University Press (OUP) is the university press of the University of Oxford. It is the largest university press in the world, and its printing history dates back to the 1480s. Having been officially granted the legal right to print books ...
, like "most US publishers", uses the unspaced em dash. The en dashalways with spaces in running text when, as discussed in this section, indicating a parenthesis or pauseand the spaced em dash both have a certain technical advantage over the unspaced em dash. Most typesetting and word processing expects word spacing to vary to support full justification. Alone among punctuation that marks pauses or logical relations in text, the unspaced em dash disables this for the words it falls between. This can cause uneven spacing in the text, but can be mitigated by the use of
thin space In typography, a thin space is a space character whose width is usually or of an em. It is used to add a narrow space, such as between nested quotation marks or to separate glyphs that interfere with one another. It is not as narrow as the hair ...
s,
hair space In computer programming, whitespace is any character or series of characters that represent horizontal or vertical space in typography. When rendered, a whitespace character does not correspond to a visible mark, but typically does occupy an area ...
s, or even
zero-width space The zero-width space , abbreviated ZWSP, is a non-printing character used in computerized typesetting to indicate word boundaries to text-processing systems in scripts that do not use explicit spacing, or after characters (such as the slash) that a ...
s on the sides of the em dash. This provides the appearance of an unspaced em dash, but allows the words and dashes to break between lines. The spaced em dash risks introducing excessive separation of words. In full justification, the adjacent spaces may be stretched, and the separation of words further exaggerated. En dashes may also be preferred to em dashes when text is set in narrow columns, such as in newspapers and similar publications, since the en dash is smaller. In such cases, its use is based purely on space considerations and is not necessarily related to other typographical concerns. On the other hand, a spaced en dash may be ambiguous when it is also used for ranges, for example, in dates or between geographical locations with internal spaces.


Horizontal bar

The horizontal bar (), also known as a quotation dash, is used to introduce quoted text. This is the standard method of printing
dialogue Dialogue (sometimes spelled dialog in American English) is a written or spoken conversational exchange between two or more people, and a literary and theatrical form that depicts such an exchange. As a philosophical or didactic device, it is c ...
in some languages. The em dash is equally suitable if the quotation dash is unavailable or is contrary to the house style being used. There is no support in the standard TeX fonts, but one can use \hbox\kern-.5em--- or an em dash.


Swung dash

The swung dash () resembles a lengthened
tilde The tilde () or , is a grapheme with several uses. The name of the character came into English from Spanish, which in turn came from the Latin '' titulus'', meaning "title" or "superscription". Its primary use is as a diacritic (accent) in ...
and is used to separate alternatives or approximates. In
dictionaries A dictionary is a listing of lexemes from the lexicon of one or more specific languages, often arranged alphabetically (or by radical and stroke for ideographic languages), which may include information on definitions, usage, etymologies, p ...
, it is frequently used to stand in for the term being defined. A dictionary entry providing an example for the term ''henceforth'' might employ the swung dash as follows: :''henceforth'' (adv.) from this time forth; from now on; " she will be known as Mrs. Wales"


Typing the characters

Typewriters and computers often have no key that produces a dash. In consequence, it became common to use the hyphen. It is common for a single hyphen surrounded by spaces to represent an en dash, and for two hyphens to represent an em dash. (A hyphen surrounded by other characters is a hyphen, with a space before it or with digits it is a minus sign.) Modern word-processing software typically has support for many more characters and is usually capable of rendering both the en and em dashes correctly—albeit sometimes with an inconvenient input method. Techniques for generating em and en dashes in various
operating system An operating system (OS) is system software that manages computer hardware, software resources, and provides common services for computer programs. Time-sharing operating systems schedule tasks for efficient use of the system and may also in ...
s,
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Word processor (electronic device), Early word processors were stand-alone devices ded ...
s and
markup language Markup language refers to a text-encoding system consisting of a set of symbols inserted in a text document to control its structure, formatting, or the relationship between its parts. Markup is often used to control the display of the document ...
s are provided in the following table:


Unicode


In other languages

In many languages, such as
Polish Polish may refer to: * Anything from or related to Poland, a country in Europe * Polish language * Poles, people from Poland or of Polish descent * Polish chicken *Polish brothers (Mark Polish and Michael Polish, born 1970), American twin screenwr ...
, the em dash is used as an opening
quotation mark Quotation marks (also known as quotes, quote marks, speech marks, inverted commas, or talking marks) are punctuation marks used in pairs in various writing systems to set off direct speech, a quotation, or a phrase. The pair consists of an ...
. There is no matching closing quotation mark; typically a new paragraph will be started, introduced by a dash, for each turn in the dialogue.
Corpus Corpus is Latin for "body". It may refer to: Linguistics * Text corpus, in linguistics, a large and structured set of texts * Speech corpus, in linguistics, a large set of speech audio files * Corpus linguistics, a branch of linguistics Music * ...
studies indicate that em dashes are more commonly used in Russian than in English. In Russian, the em dash is used for the present copula (meaning "am"/"is"/"are"), which is unpronounced in spoken Russian. In French, em or en dashes can be used as
parentheses A bracket is either of two tall fore- or back-facing punctuation marks commonly used to isolate a segment of text or data from its surroundings. Typically deployed in symmetric pairs, an individual bracket may be identified as a 'left' or 'r ...
(brackets), but the use of a second dash as a closing parenthesis is optional. When a closing dash is not used, the sentence is ended with a period (full-stop) as usual. Dashes are, however, much less common than parentheses. In
Spanish Spanish might refer to: * Items from or related to Spain: **Spaniards are a nation and ethnic group indigenous to Spain **Spanish language, spoken in Spain and many Latin American countries **Spanish cuisine Other places * Spanish, Ontario, Cana ...
, em dashes can be used to mark off parenthetical phrases. Unlike in English, the em dashes are spaced like brackets, i.e., there is a space between main sentence and dash, but not between parenthetical phrase and dash.Raya
". In: ''Diccionario panhispánico de dudas''. Madrid: Real Academia Española, 2005.


See also

*
Leiden Conventions The Leiden Conventions or Leiden system is an established set of rules, symbols, and brackets used to indicate the condition of an epigraphic or papyrological text in a modern edition. In previous centuries of classical scholarship, scholars who ...
– rules to indicate conditions in texts (usage of " — —) * Signature dashes – signature delimiter in emails (usage of "-- " in a single line) *
Whitespace characters In computer programming, whitespace is any character or series of characters that represent horizontal or vertical space in typography. When rendered, a whitespace character does not correspond to a visible mark, but typically does occupy an area ...
– spaces of equivalent sizes to dashes


Explanatory notes


References


External links

* Wiktionary list of English phrases with em dash
Dashes and Hyphens

Commonly confused characters
{{Authority control Punctuation Typography