HOME

TheInfoList



OR:

An index (: usually indexes, more rarely indices) is a list of words or phrases ('headings') and associated pointers ('locators') to where useful material relating to that heading can be found in a document or collection of documents. Examples are an index in the back matter of a
book A book is a structured presentation of recorded information, primarily verbal and graphical, through a medium. Originally physical, electronic books and audiobooks are now existent. Physical books are objects that contain printed material, ...
and an index that serves as a
library catalog A library catalog (or library catalogue in British English) is a register of all bibliography, bibliographic items found in a library or group of libraries, such as a network of libraries at several locations. A catalog for a group of libra ...
. An index differs from a word index, or ''concordance'', in focusing on the subject of the text rather than the exact words in a text, and it differs from a
table of contents A table of contents (or simply contents, abbreviated as TOC), is a list usually part of the Book design#Front matter, front matter preceding the main text of a book or other written work containing the titles of the text's sections, sometimes with ...
because the index is ordered by subject, regardless of whether it is early or late in the book, while the listed items in a table of contents is placed in the same order as the book. In a traditional ''back-of-the-book index'', the headings will include names of people, places, events, and concepts selected as being relevant and of interest to a possible reader of the book. The indexer performing the selection may be the author, the editor, or a professional indexer working as a third party. The pointers are typically page numbers, paragraph numbers or section numbers. In a
library catalog A library catalog (or library catalogue in British English) is a register of all bibliography, bibliographic items found in a library or group of libraries, such as a network of libraries at several locations. A catalog for a group of libra ...
the words are authors, titles, subject headings, etc., and the pointers are
call number A library classification is a system used within a library to organize materials, including books, sound and video recordings, electronic materials, etc., both on shelves and in catalogs and indexes. Each item is typically assigned a call number ...
s. Internet
search engines Search engines, including web search engines, selection-based search engines, metasearch engines, desktop search tools, and web portals and vertical market websites have a search facility for online databases. By content/topic Gene ...
(such as
Google Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
) and full-text searching help provide access to information but are not as selective as an index, as they provide non-relevant links, and may miss relevant information if it is not phrased in exactly the way they expect. Perhaps the most advanced investigation of problems related to book indexes is made in the development of
topic maps A topic map is a standard for the representation and interchange of knowledge, with an emphasis on the findability of information. Topic maps were originally developed in the late 1990s as a way to represent back-of-the-book index structures ...
, which started as a way of representing the knowledge structures inherent in traditional back-of-the-book indexes. The
concept A concept is an abstract idea that serves as a foundation for more concrete principles, thoughts, and beliefs. Concepts play an important role in all aspects of cognition. As such, concepts are studied within such disciplines as linguistics, ...
embodied by book indexes lent its name to
database index A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure. Indexes are used to quickly locate data withou ...
es, which similarly provide an abridged way to look up information in a larger collection, albeit one for computer use rather than human use.


Earliest examples in English

In the English language, indexes have been referred to as early as 1593, as can be seen from lines in
Christopher Marlowe Christopher Marlowe ( ; Baptism, baptised 26 February 156430 May 1593), also known as Kit Marlowe, was an English playwright, poet, and translator of the Elizabethan era. Marlowe is among the most famous of the English Renaissance theatre, Eli ...
's ''
Hero and Leander Hero and Leander (, ) is the Greek myth relating the story of Hero (, ''Hērṓ''; ), a priestess of Aphrodite (Venus in Roman mythology) who dwelt in a tower in Sestos on the European side of the Hellespont, and Leander (, ''Léandros''; ...
'' of that year: ''Therefore, even as an index to a book''
''So to his mind was young Leander's look.'' A similar reference to indexes is in
Shakespeare William Shakespeare ( 23 April 1564 – 23 April 1616) was an English playwright, poet and actor. He is widely regarded as the greatest writer in the English language and the world's pre-eminent dramatist. He is often called England's natio ...
's lines from ''
Troilus and Cressida ''The Tragedy of Troilus and Cressida'', often shortened to ''Troilus and Cressida'' ( or ), is a play by William Shakespeare, probably written in 1602. At Troy during the Trojan War, Troilus and Cressida begin a love affair. Cressida is forc ...
'' (I.3.344), written nine years later: ''And in such indexes, although small pricks''
''To their subsequent volumes, there is seen''
''The baby figure of the giant mass''
''Of things to come at large.'' But according to G. Norman Knight, "at that period, as often as not, by an 'index to a book' was meant what we should now call a table of contents."Knight, G. Norman (1979) ''Indexing, the Art of: A Guide to the Indexing of Books and Periodicals'' (HarperCollins), pp. 17–18 Until about the end of the nineteenth century, books, fiction as well as non-fiction, sometimes had very detailed chapter titles, which could be several sentences long. Among the first indexes – in the modern sense – to a book in the English language was Leonard Mascall's "A booke of the arte and maner how to plant and graffe all sortes of trees" printed in 1575. Another was one in
Plutarch Plutarch (; , ''Ploútarchos'', ; – 120s) was a Greek Middle Platonist philosopher, historian, biographer, essayist, and priest at the Temple of Apollo (Delphi), Temple of Apollo in Delphi. He is known primarily for his ''Parallel Lives'', ...
's ''
Parallel Lives * Culture of ancient Greece Culture of ancient Rome Ancient Greek biographical works Ethics literature History books about ancient Rome Cultural depictions of Gaius Marius Cultural depictions of Mark Antony Cultural depictions of Cicero ...
'', in Sir Thomas North's 1595 translation. A section entitled "An Alphabetical Table of the most material contents of the whole book" may be found in
Henry Scobell Henry Scobell (baptised 1610; died 1660) was an English Parliamentary official, and editor of official publications. He was clerk to the Long Parliament, and wrote on parliamentary procedure and precedents. Life Initially under-clerk of t ...
's ''Acts and Ordinances of Parliament'' of 1658. This section comes after "An index of the general titles comprised in the ensuing Table". Both of these indexes predate the index to
Alexander Cruden Alexander Cruden (31 May 16991 November 1770) was the Scottish author of an early Bible concordance, a proofreader and publisher, and self-styled Corrector of the nation's morals. Early life and career Alexander Cruden was born in Aberdeen in ...
's ''Concordance'' (1737), which is erroneously held to be the earliest index found in an English book.


Etymology and plural

The word is derived from
Latin Latin ( or ) is a classical language belonging to the Italic languages, Italic branch of the Indo-European languages. Latin was originally spoken by the Latins (Italic tribe), Latins in Latium (now known as Lazio), the lower Tiber area aroun ...
, in which ''index'' means "one who points out", an "indication", or a " forefinger". In Latin, the plural form of the word is ''indices''. In English, the plural "indices" is commonly used in
mathematical Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and computing contexts, and sometimes in bibliographical contexts – for example, in the 17-volume '' Women in World History: A Biographical Encyclopedia'' (1999–2002). However, this form is now seen as an archaism by many writers and commentators, who prefer the anglicised plural "indexes". "Indexes" is widely used in the publishing industry; in the International Standard
ISO 999 ISO 999 (Information and documentation—Guidelines for the content, organization and presentation of indexes) is an ISO standard which provides the information industry with guidelines for the content, organisation and presentation of indexes ...
, ''Information and documentation – Guidelines for the content, organization and presentation of indexes''; and is preferred by the ''
Oxford Style Manual ''Hart's Rules'' is the oldest continuously updated style guide in the English language, providing advice on topics such as punctuation, citation, and typography. Printer and biographer Horace Hart first issued the work in 1893 for the composi ...
''. ''
The Chicago Manual of Style ''The Chicago Manual of Style'' (''CMOS'') is a style guide for American English published since 1906 by the University of Chicago Press. Its 18 editions (the most recent in 2024) have prescribed writing and citation styles widely used in publ ...
'' allows both forms. G. Norman Knight quotes
Shakespeare William Shakespeare ( 23 April 1564 – 23 April 1616) was an English playwright, poet and actor. He is widely regarded as the greatest writer in the English language and the world's pre-eminent dramatist. He is often called England's natio ...
's lines from ''
Troilus and Cressida ''The Tragedy of Troilus and Cressida'', often shortened to ''Troilus and Cressida'' ( or ), is a play by William Shakespeare, probably written in 1602. At Troy during the Trojan War, Troilus and Cressida begin a love affair. Cressida is forc ...
'' (I.3.344) – "And in such indexes ..." – and comments: "But the real importance of this passage is that it establishes for all time the correct literary plural; we can leave the Latin form "indices" to the mathematicians (and similarly "appendices" to the anatomists)."


Indexing process


Conventional indexing

The indexer reads through the text, identifying indexable concepts (those for which the text provides useful information and which will be of relevance for the text's readership). The indexer creates index headings to represent those concepts, which are phrased such that they can be found when in alphabetical order (so, for example, one would write 'indexing process' rather than 'how to create an index'). These headings and their associated locators (indicators to position in the text) are entered into specialist
indexing software Indexing software consists of computer application software, applications that help to build an index (publishing), index (like this one:Index of branches of science). Features There are several methodologies for indexing: * Standalone indexin ...
which handles the formatting of the index and facilitates the editing phase. The index is then edited to impose consistency throughout the index. Indexers must analyze the text to enable presentation of concepts and ideas in the index that may not be named within the text. The index is intended to help the reader, researcher, or information professional, rather than the author, find information, so the professional indexer must act as a liaison between the text and its ultimate user. In the United States, according to tradition, the index for a non-fiction book is the responsibility of the author, but most authors do not actually do it. Most indexing is done by freelancers hired by authors, publishers or an independent business which manages the production of a book, publishers or book packagers. Some publishers and database companies employ indexers. Before indexing software existed, indexes were created using slips of paper or, later,
index card An index card (or record card in British English and system cards in Australian English) consists of card stock (heavy paper) cut to a standard size, used for recording and storing small amounts of discrete data. A collection of such cards ei ...
s. After hundreds of such slips or cards were filled out (as the indexer worked through the pages of the book proofs), they could then be shuffled by hand into alphabetical order, at which point they served as
manuscript A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand or typewritten, as opposed to mechanically printed or reproduced in some indirect or automated way. More recently, the term has ...
to be typeset into the printed index.


Indexing software

Software is available to aid the indexer in building a book index. There are several dedicated indexing software programs available to assist with the special sorting and copying needs involved in index preparation.


Embedded indexing

Embedded indexing involves including the index headings in the midst of the text itself, but surrounded by codes so that they are not normally displayed. A usable index is then generated automatically from the embedded text using the position of the embedded headings to determine the locators. Thus, when the pagination is changed the index can be regenerated with the new locators.
LaTeX Latex is an emulsion (stable dispersion) of polymer microparticles in water. Latices are found in nature, but synthetic latices are common as well. In nature, latex is found as a wikt:milky, milky fluid, which is present in 10% of all floweri ...
documents support embedded indexes primarily through the MakeIndex package. Several widely used
XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing data. It defines a set of rules for encoding electronic document, documents in a format that is both human-readable and Machine-r ...
DTDs, including
DocBook DocBook is a Semantics (computer science), semantic markup language for technical documentation. It was originally intended for writing technical documents related to computer hardware and software, but it can be used for any other sort of docume ...
and TEI, have elements that allow index creation directly in the XML files. Most word processing software, such as StarWriter/ OpenOffice.org Writer,
Microsoft Word Microsoft Word is a word processor program, word processing program developed by Microsoft. It was first released on October 25, 1983, under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platf ...
, and
WordPerfect WordPerfect (WP) is a word processing application, now owned by Alludo, with a long history on multiple personal computer platforms. At the height of its popularity in the 1980s and early 1990s, it was the market leader of word processors, disp ...
, as well as some desktop publishing software (for example,
FrameMaker Adobe FrameMaker is a document processor designed for writing and editing large or complex documents, including structured documents. It was originally developed by Frame Technology Corporation, which was bought by Adobe. Overview FrameMake ...
and InDesign), as well as other tools (for example, MadCap Software's Flare), have some facility for embedded indexing as well. TExtract and IndexExploit support embedded indexing of Microsoft Word documents. An embedded index requires more time to create than a conventional static index; however, an embedded index can save time in the long run when the material is updated or repaginated. This is because, with a static index, if even a few pages change, the entire index must be revised or recreated while, with an embedded index, only the pages that changed need updating or indexing.


Purpose

Indexes are also designed to help the reader find information quickly and easily. A complete and truly useful index is not simply a list of the words and phrases used in a publication (which is properly called a concordance), but an organized map of its contents, including
cross-reference The term cross-reference (abbreviation: xref) can refer to either: * An instance within a document which refers to related information elsewhere in the same document. In both printed and online dictionaries cross-references are important because ...
s, grouping of like concepts, and other useful intellectual analysis. Sample back-of-the-book index excerpt: :sage, 41–42. ''See also'' Herbs ← directing the reader to related terms :Scarlet Sages. ''See'' ''Salvia coccinea'' ← redirecting the reader to term used in the text :shade plants ← grouping term (may not appear in the text; may be generated by indexer) ::hosta, 93 ← subentries ::myrtle, 46 ::Solomon's :sunflower, 47 ← regular entry In books, indexes are usually placed near the end (this is commonly known as "BoB" or back-of-book indexing). They complement the
table of contents A table of contents (or simply contents, abbreviated as TOC), is a list usually part of the Book design#Front matter, front matter preceding the main text of a book or other written work containing the titles of the text's sections, sometimes with ...
by enabling access to information by specific subject, whereas contents listings enable access through broad divisions of the text arranged in the order they occur. It has been remarked that, while " first glance the driest part of the book, on closer inspection the index may provide both interest and amusement from time to time."


Index quality

Some principles of good indexing include: * Ensuring each topic/section includes a variety of relevant index entries; use two or three entries per topic * Understanding the audience and understand what kind of index entries they're likely to look for * Use the same form throughout (singular vs. plural, capitalisation, etc.), using standard indexing conventions Indexing pitfalls: * Significant topics with no index entries at all * Indexing 'mere mentions': "But John Major was no Winston Churchill..." indexed under 'Churchill, Winston' * Circular cross-references: 'Felidae. ''See'' Cats'; 'Cats. ''See'' Felidae' * References to discussions of a single topic scattered among several main headings: 'Cats, 50–62'; 'Felidae, 175–183' * Inconsistently indexing similar topics * Confusing similar names: Henry V of England, Henri V of France * Incorrect alphabetization: 'α-Linolenic acid' under 'A' instead of 'L' * Inappropriate inversions: 'processors, word' for 'word processors' * Inappropriate subheadings: 'processors: food, 213–6; word, 33–7' * Computer indexing from section headings: e.g. 'Getting to know your printer' under 'G'


Indexer roles

Some indexers specialize in specific formats, such as scholarly books, microforms,
web indexing Web indexing, or Internet indexing, comprises methods for indexing the contents of a website or of the Internet as a whole. Individual websites or intranets may use a back-of-the-book index, while search engines usually use keywords and Metadata ( ...
(the application of a back-of-book-style index to a
website A website (also written as a web site) is any web page whose content is identified by a common domain name and is published on at least one web server. Websites are typically dedicated to a particular topic or purpose, such as news, educatio ...
or
intranet An intranet is a computer network for sharing information, easier communication, collaboration tools, operational systems, and other computing services within an organization, usually to the exclusion of access by outsiders. The term is used in ...
),
search engine indexing Search engine indexing is the collecting, parsing, and storing of data to facilitate fast and accurate information retrieval. Index design incorporates interdisciplinary concepts from linguistics, cognitive psychology, mathematics, informatics, an ...
, database indexing (the application of a pre-defined
controlled vocabulary A controlled vocabulary provides a way to organize knowledge for subsequent retrieval. Controlled vocabularies are used in subject indexing schemes, subject headings, thesauri, taxonomies and other knowledge organization systems. Controlled v ...
such as
MeSH Medical Subject Headings (MeSH) is a comprehensive controlled vocabulary for the purpose of indexing journal articles and books in the life sciences. It serves as a thesaurus of index terms that facilitates searching. Created and updated by th ...
to articles for inclusion in a database), and periodical indexing (indexing of newspapers, journals, magazines). Some indexers with expertise in controlled vocabularies also work as taxonomists and
ontologists Ontology is the philosophical study of existence, being. It is traditionally understood as the subdiscipline of metaphysics focused on the most general features of reality. As one of the most fundamental concepts, being encompasses all of realit ...
. Some indexers specialize in particular subject areas, such as anthropology, business, computers, economics, education, government documents, history, law, mathematics, medicine, psychology, and technology. An indexer can be found for any subject.


References in popular culture

In " The Library of Babel", a short story by
Jorge Luis Borges Jorge Francisco Isidoro Luis Borges Acevedo ( ; ; 24 August 1899 – 14 June 1986) was an Argentine short-story writer, essayist, poet and translator regarded as a key figure in Spanish literature, Spanish-language and international literatur ...
, there is an index of indexes that catalogues all of the books in the library, which contains all possible books.
Kurt Vonnegut Kurt Vonnegut ( ; November 11, 1922 – April 11, 2007) was an American author known for his Satire, satirical and darkly humorous novels. His published work includes fourteen novels, three short-story collections, five plays, and five nonfict ...
's novel ''
Cat's Cradle ''Cat's Cradle'' is a satirical postmodern novel, with science fiction elements, by American writer Kurt Vonnegut. Vonnegut's fourth novel, it was first published on March 18, 1963, exploring and satirizing issues of science, technology, the p ...
'' includes a character who is a professional indexer and believes that "indexing sa thing that only the most amateurish author ndertakesto do for his own book." She claims to be able to read an author's character through the index he created for his own history text, and warns the narrator, an author, "Never index your own book."
Vladimir Nabokov Vladimir Vladimirovich Nabokov ( ; 2 July 1977), also known by the pen name Vladimir Sirin (), was a Russian and American novelist, poet, translator, and entomologist. Born in Imperial Russia in 1899, Nabokov wrote his first nine novels in Rus ...
's novel ''
Pale Fire ''Pale Fire'' is a 1962 novel by Vladimir Nabokov. The novel is presented as a 999-line poem titled "Pale Fire", written by the fictional poet John Shade, with a foreword, lengthy commentary and index written by Shade's neighbor and academic co ...
'' includes a parody of an index, reflecting the insanity of the narrator. Mark Danielewski's novel '' House of Leaves'' contains an exhaustive 41 page index of words in the novel, including even large listings for inconsequential words such as ''the'', ''and'', and ''in''. J. G. Ballard's "The Index" is a short story told through the form of an index to an "unpublished and perhaps suppressed" autobiography. Barbara Pym's novel '' No Fond Return of Love'' involves two characters, Dulcie Mainwaring and Viola Dace, who work as professional indexers. Indexing is described throughout the novel as a "thankless task."


Standards

*
ISO 999 ISO 999 (Information and documentation—Guidelines for the content, organization and presentation of indexes) is an ISO standard which provides the information industry with guidelines for the content, organisation and presentation of indexes ...
:1996 Guidelines for the Content, Organization, and Presentation of Indexes (this is also the national standard in the UK, Australia, and New Zealand)
ANSI/NISO X39.4-2021 Criteria for Indexes


Societies

Th
American Society for Indexing, Inc.
(ASI) is a national association founded in 1968 to promote excellence in indexing and increase awareness of the value of well-designed indexes. ASI serves indexers, librarians, abstractors, editors, publishers, database producers, data searchers, product developers, technical writers, academic professionals, researchers and readers, and others concerned with indexing. It is the only professional organization in the United States devoted solely to the advancement of indexing, abstracting and related methods of information retrieval. Other similar societies include: * Association of Southern African Indexers and Bibliographers (ASAIB) * Australian and New Zealand Society of Indexers * British Record Society * China Society of Indexers * German Network of Indexers/Deutsches Netzwerk der Indexer (DNI) * Indexing Society of Canada/Société canadienne d'indexation (ISC/SCI) * Nederlands Indexers Netwerk (NIN) * Society of Indexers


See also

*
Concordance (publishing) A concordance is an alphabetical list of the principal words used in a book or body of work, listing every instance of each word with its immediate context (language use)#Verbal context, context. Historically, concordances have been compiled only ...
*
Indexing and abstracting service An abstracting service is a service that provides abstracts of publications, often on a subject or group of related subjects, usually on a subscription basis. An indexing service is a service that assigns descriptors and other kinds of access po ...
* '' Index, A History of the'' *
Subject (documents) In library and information science documents (such as books, articles and pictures) are classified and searched by subject – as well as by other attributes such as author, genre and document type. This makes "subject" a fundamental term in this ...
*
Subject indexing Subject indexing is the act of describing or classifying a document A document is a writing, written, drawing, drawn, presented, or memorialized representation of thought, often the manifestation of nonfiction, non-fictional, as well as ...
*
Table of contents A table of contents (or simply contents, abbreviated as TOC), is a list usually part of the Book design#Front matter, front matter preceding the main text of a book or other written work containing the titles of the text's sections, sometimes with ...
*
Web indexing Web indexing, or Internet indexing, comprises methods for indexing the contents of a website or of the Internet as a whole. Individual websites or intranets may use a back-of-the-book index, while search engines usually use keywords and Metadata ( ...
*
Codex The codex (: codices ) was the historical ancestor format of the modern book. Technically, the vast majority of modern books use the codex format of a stack of pages bound at one edge, along the side of the text. But the term ''codex'' is now r ...


References


Further reading

* * * * * * *


External links


The Indexer (the international journal)



Usability studies for indexes



"The Definite Article: Acknowledging 'The' in Index Entries
", Glenda Browne, The Indexer, vol. 22, no. 3 April 2001, pp. 119–22.
The role of indexing in technical communication






{{Authority control Book publishing Book terminology Book design Information science Publishing