MARC (machine-readable cataloging) standards are a set of
digital formats for the description of items catalogued by libraries, such as books, DVDs, and digital resources. Computerized library catalogs and library management software need to structure their catalog records as per an industry-wide standard, which is MARC, so that bibliographic information can be shared freely between computers. The structure of bibliographic records almost universally follows the MARC standard. Other standards work in conjunction with MARC, for example,
Anglo-American Cataloguing Rules
Anglo-Americans are people who are English-speaking inhabitants of Anglo-America. It typically refers to the nations and ethnic groups in the Americas that speak English as a native language, making up the majority of people in the world who spe ...
(AACR)/
Resource Description and Access
Resource Description and Access (RDA) is a standard for descriptive cataloging initially released in June 2010, providing instructions and guidelines on formulating bibliographic data. Intended for use by libraries and other cultural organization ...
(RDA) provide guidelines on formulating bibliographic data into the MARC record structure, while the
International Standard Bibliographic Description The International Standard Bibliographic Description (ISBD) is a set of rules produced by the International Federation of Library Associations and Institutions (IFLA) to create a bibliographic description in a standard, human-readable form, especial ...
(ISBD) provides guidelines for displaying MARC records in a standard, human-readable form.
History
Working with the
Library of Congress
The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The library is ...
, American computer scientist
Henriette Avram
Henriette Davidson Avram (October 7, 1919 – April 22, 2006) was a computer programmer and systems analyst who developed the MARC format (Machine Readable Cataloging), the international data standard for bibliographic and holdings information ...
developed MARC during 1965–1968 to create records that could be read by computers and shared among libraries. By 1971, MARC formats had become the US national standard for dissemination of
bibliographic data. Two years later, they became the
international standard
international standard is a technical standard developed by one or more international standards organizations. International standards are available for consideration and use worldwide. The most prominent such organization is the International Or ...
. There are several versions of MARC in use around the world, the most predominant being
MARC 21, created in 1999 as a result of the harmonization of U.S. and Canadian MARC formats, and UNIMARC. UNIMARC is maintained by the Permanent UNIMARC Committee of the
(IFLA), and is widely used in Europe.
The MARC 21 family of standards now includes formats for
authority record
In information science, authority control is a process that organizes information, for example in library catalogs, by using a single, distinct spelling of a name (heading) or a numeric identifier for each topic. The word ''authority'' in ''aut ...
s, holdings records,
classification schedules, and community information, in addition to the format for
bibliographic record A bibliographic record is an entry in a bibliographic index (or a library catalog) which represents and describes a specific resource. A bibliographic record contains the data elements necessary to help users identify and retrieve that resource, as ...
s.
Record structure and field designations
The MARC standards define three aspects of a MARC record: the field designations within each record, the structure of the record, and the actual content of the record itself.
Field designations
Each field in a MARC record provides particular information about the item the record is describing, such as the author, title, publisher, date, language, media type, etc. Since it was first developed at a time when computing power was low, and space precious, MARC uses a simple three-digit numeric code (from 001-999) to identify each field in the record. MARC defines field 100 as the primary author of a work, field 245 as the title and field 260 as the publisher, for example.
Fields above 008 are further divided into subfields using a single letter or number designation. The 260, for example, is further divided into subfield "a" for the place of publication, "b" for the name of the publisher, and "c" for the date of publication.
Record structure
MARC records are typically stored and transmitted as binary files, usually with several MARC records concatenated together into a single file. MARC uses the
ISO 2709
ISO 2709 is an ISO standard for bibliographic descriptions, titled ''Information and documentation—Format for information exchange''.
It is maintained by the Technical Committee for Information and Documentation ( TC 9846).
History
In the lat ...
standard to define the structure of each record. This includes a marker to indicate where each record begins and ends, as well as a set of characters at the beginning of each record that provide a directory for locating the fields and subfields within the record.
In 2002, the Library of Congress developed the MARCXML schema as an alternative record structure, allowing MARC records to be represented in
XML
Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. T ...
; the fields remain the same, but those fields are expressed in the record in XML
markup. Libraries typically expose their records as MARCXML via a
web service, often following the
SRU SRU may refer to:
Organizations
* Salve Regina University, Newport, Rhode Island, US
* Scottish Rugby Union
* Shri Rawatpura Sarkar University, Chhattisgarh, India
* Slippery Rock University of Pennsylvania, US
* Sri Ramachandra University, Chennai ...
or
OAI-PMH The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI- ...
standards.
Content
MARC encodes information about a bibliographic item, not information about the content of that item; this means it is a
metadata
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive metadata – the descriptive ...
transmission standard, not a content standard. The actual content that a cataloger places in each MARC field is usually governed and defined by standards outside of MARC, except for a handful of fixed fields defined by the MARC standards themselves.
Resource Description and Access
Resource Description and Access (RDA) is a standard for descriptive cataloging initially released in June 2010, providing instructions and guidelines on formulating bibliographic data. Intended for use by libraries and other cultural organization ...
, for example, defines how the physical characteristics of books and other items should be expressed. The
Library of Congress Subject Headings
The Library of Congress Subject Headings (LCSH) comprise a thesaurus (in the information science sense, a controlled vocabulary) of subject headings, maintained by the United States Library of Congress, for use in bibliographic records. LC Subject ...
(LCSH) are a list of authorized subject terms used to describe the main subject content of the work. Other cataloging rules and classification schedules can also be used.
MARC formats
MARC 21
MARC 21 was designed to redefine the original MARC record format for the 21st century and to make it more accessible to the international community. MARC 21 has formats for the following five types of data: Bibliographic Format, Authority Format, Holdings Format, Community Format, and Classification Data Format.
Currently MARC 21 has been implemented successfully by The
British Library
The British Library is the national library of the United Kingdom and is one of the largest libraries in the world. It is estimated to contain between 170 and 200 million items from many countries. As a legal deposit library, the British ...
, the European Institutions and the major library institutions in the United States, and
Canada
Canada is a country in North America. Its ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world's second-largest country by tot ...
.
MARC 21 is a result of the combination of the United States and Canadian MARC formats (USMARC and CAN/MARC). MARC 21 is based on the
NISO/
ANSI
The American National Standards Institute (ANSI ) is a private non-profit organization that oversees the development of voluntary consensus standards for products, services, processes, systems, and personnel in the United States. The organi ...
standard
Z39.2, which allows users of different software products to communicate with each other and to exchange data.
[ Joudrey and Taylor, Organization of Information, p. 262]
MARC 21 allows the use of two
character sets
Character encoding is the process of assigning numbers to graphical characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using digital computers. The numerical values that ...
, either
MARC-8 or
Unicode
Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
encoded as
UTF-8
UTF-8 is a variable-width encoding, variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit'' ...
. MARC-8 is based on
ISO 2022
ISO/IEC 2022 ''Information technology—Character code structure and extension techniques'', is an ISO/ IEC standard (equivalent to the ECMA standard ECMA-35, the ANSI standard ANSI X3.41 and the Japanese Industrial Standard JIS X 0202) in the ...
and allows the use of Hebrew, Cyrillic, Arabic, Greek, and East Asian scripts. MARC 21 in UTF-8 format allows all the languages supported by Unicode.
MARCXML
MARCXML is an
XML schema
An XML schema is a description of a type of Extensible Markup Language, XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntactical constraints imposed ...
based on the common MARC 21 standards.
MARCXML was developed by the
Library of Congress
The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The library is ...
and adopted by it and others as a means of facilitating the sharing of, and networked access to, bibliographic information.
Being easy to parse by various systems allows it to be used as an aggregation format, as it is in software packages such as
MetaLib MetaLib is a federated search system developed by Ex Libris Group. MetaLib conducts simultaneous searches in multiple, and often heterogeneous, information resources such as library catalogs, journal articles, newspapers and selected quality Interne ...
, though that package merges it into a wider
DTD specification.
The MARCXML primary design goals included:
* Simplicity of the schema
* Flexibility and extensibility
* Lossless and reversible conversion from MARC
* Data presentation through XML stylesheets
* MARC records updates and data conversions through XML transformations
* Existence of validation tools
Future
The future of the MARC formats is a matter of some debate among libraries. On the one hand, the storage formats are quite complex and are based on outdated technology. On the other, there is no alternative bibliographic format with an equivalent degree of granularity. The billions of MARC records in tens of thousands of individual libraries (including over 50,000,000 records belonging to the
OCLC consortium alone) create inertia. The Library of Congress has launched the
Bibliographic Framework Initiative BIBFRAME (Bibliographic Framework) is a data model for bibliographic description. BIBFRAME was designed to replace the MARC standards, and to use linked data principles to make bibliographic data more useful both within and outside the library com ...
(BIBFRAME), which aims at providing a replacement for MARC that provides greater granularity and easier re-use of the data expressed in multiple catalogs. Beginning in 2013, OCLC Research exposed data detailing how various MARC elements have been used by libraries in the 400 million MARC records (as of early 2018) contained in WorldCat. The MARC formats are managed by the MARC Steering Group, which is advised by the MARC Advisory Committee.
Proposals for changes to MARC are submitted to the MARC Advisory Committee and discussed in public at the American Library Association (ALA) Midwinter and ALA Annual meetings.
See also
*
Cataloging
In library and information science, cataloging ( US) or cataloguing ( UK) is the process of creating metadata representing information resources, such as books, sound recordings, moving images, etc. Cataloging provides information such as auth ...
*
International Standard Bibliographic Description The International Standard Bibliographic Description (ISBD) is a set of rules produced by the International Federation of Library Associations and Institutions (IFLA) to create a bibliographic description in a standard, human-readable form, especial ...
(ISBD)
*
ISO 2709
ISO 2709 is an ISO standard for bibliographic descriptions, titled ''Information and documentation—Format for information exchange''.
It is maintained by the Technical Committee for Information and Documentation ( TC 9846).
History
In the lat ...
*
JACKPHY In library automation the initialism JACKPHY refers to a group of language scripts not based on Roman characters, specifically: Japanese, Arabic, Chinese, Korean, Persian, Hebrew, and Yiddish. Focus on these seven writing systems by Library of C ...
* (MAB)
*
Metadata
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive metadata – the descriptive ...
and
metadata standards
A metadata standard is a requirement which is intended to establish a common understanding of the meaning or semantics of the data, to ensure correct and proper use and interpretation of the data by its owners and users. To achieve this common unde ...
*
Z39.50
*
ONIX for Books
References
* Reitz, J. M. (2004
Online Dictionary for Library and Information Science
Further reading
*
*
*
*
*
External links
Understanding MARC Bibliographic Machine Readable Cataloging a good introduction
*
*
*
Amazon to MARC Converter
NISO/ANSI Z39.2Library of Congress: MARCXMLInterpreting MARC: Where’s the Bibliographic Data?by Jason Thomale
Code4Lib Journal
The ''Code4Lib Journal'' is a quarterly journal that publishes articles about libraries and technology. It was founded by the Code4Lib community in 2007. Code4Lib publishes under a US CC-BY licence. Code4Lib Journal is also open peer reviewed.
...
Issue 11, 2010-09-21
{{Authority control
Bibliography file formats
Library automation
Library cataloging and classification
Library of Congress
Metadata publishing
XML-based standards
Country codes
Metadata standards