Encoded Archival Description (EAD) is a standard for encoding descriptive information regarding archival records.
[Pitti, D (2012). "Encoded Archival Description (EAD)." In Bates, Marcia J., (ed.) ''Understanding Information Retrieval Systems: Management, Types, and Standards.'' pp. 685–697. London: Auerbach Publications.]
Overview
Archival records differ from the items in a library collection because they are unique, usually unpublished and unavailable elsewhere, and because they exist as part of a collection that unifies them.
[Eastwood, T. "A Contested Realm: The Nature of Archives and the Orientation of Archival Science." In ''Currents of Archival Thinking'', Terry Eastwood and Heather MacNeil, eds. (Libraries Unlimited, 2017): 3–23.] For these reasons, archival description involves a hierarchical and progressive analysis that emphasizes the intellectual structure and content of the collection and does not always extend to the level of individual items within it.
Following the development of technologies in the middle to late 1980s that enabled the descriptive encoding of machine-readable findings, it became possible to consider the development of digital
finding aids for archives.
Work on an encoding standard for archival description began in 1992 at the
University of California, Berkeley
The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant u ...
, and in 1998 the first version of EAD was released. A second version was released in 2002, and the latest version, EAD3, was released in August 2015. The
Society of American Archivists
The Society of American Archivists is the oldest and largest archivist association in North America, serving the educational and informational needs of more than 5,000 individual archivist and institutional members. Established in 1936, the organi ...
and the
Library of Congress
The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The library is ...
are jointly responsible for the maintenance and development of EAD.
EAD is now used around the world by archives, libraries, museums, national libraries and historical societies.
Through a standardized system for encoding the descriptions of archival finding aids, EAD allows users to locate primary sources that are geographically remote.
At its highest level, an EAD finding aid includes control information about the description as well as a description of the collection itself. EAD3 was revised in 2018 to address concerns relating to the ease of access to archival descriptions and its ability to interface with other systems.
[Ferro, N. and Silvello, G. (2016). "From Users to Systems: Identifying and Overcoming Barriers to Efficiently Access Archival Data." In Accessing Cultural Heritage at Scale, Proceedings of the First International Workshop on Accessing Cultural Heritage at Scale, co-located with Joint Conference on Digital Libraries 2016 (JCDL 2016), Newark, USA, June 22, 2016.]
Background and need
Archives by their very nature are different from libraries. While libraries contain individual items, such as books and journals, of which multiple, identical copies exist, archives contain records that are both unique and interrelated.
Archives represent the activities of a person, family or organization that are created and accumulated naturally in the course of their ordinary activities.
In contrast to the items in a library, therefore, all the items in an archival collection share a relationship.
The entire body of the records of an organization, family or individual have been created and accumulated as a byproduct of the organization or individual’s existence, and therefore share a common origin, which is referred to by archivists as its ''provenance''; ''provenance'' refers to both the origin of an item or collection as well as its custody and ownership.
Archivists refer to the entire body or records of an individual or organization as its ''
fonds''; the ''fonds'' is thus a conceptual whole that reflects the process of the production or accumulation of records that share a common function or activity and exhibit a natural unity.
A ''fonds'' may contain anywhere from one item to millions of items, and may consist in any form, including manuscripts, charts, drawings, maps, audio, video or electronic records.
Because published materials differ in significant and fundamental ways from the collections of interrelated and unique materials found in archives, there are significant differences in bibliographic and archival description.
A bibliographic description represents an individual published item, is based on and derived from the physical item, and is thus considered item-level.
Archival description, by contrast, represents a collection, or a fonds, often containing individual items of various media, sharing a common origin, or provenance. The description of archival materials, therefore, involves a complex hierarchical and progressive analysis.
It begins by describing the whole, then moves down to subcomponents; the description frequently does not extend to the item level.
In this way archival description focuses on the intellectual structure and content of the collection rather than its physical characteristics.
A finding aid is a tool that helps users to find materials within an archive through the description of its contents.
Most findings aids provide similar types of information, including, at a minimum, a title that connects the finding aid to the creator of the collection; a summary of the material contained in the finding aid; background and context of the collection, including major figures involved; and information about the custody of the collection as well as any conditions or restrictions regarding its use.
The unique nature of archival records and the geographic distribution of individual collections has presented a challenge for those wishing to locate and access them for over 150 years.
With the advent of international networked computing and online catalogs, however, the potential emerged for making archival collections searchable online.
History
EAD originated at the 1993 Society of American Archivists annual meeting in
and was headed by Daniel Pitti at the
University of California, Berkeley
The University of California, Berkeley (UC Berkeley, Berkeley, Cal, or California) is a public land-grant research university in Berkeley, California. Established in 1868 as the University of California, it is the state's first land-grant u ...
. The project's goal was to create a data standard for describing archives, similar to the
MARC standards
MARC (machine-readable cataloging) standards are a set of digital formats for the description of items catalogued by libraries, such as books, DVDs, and digital resources. Computerized library catalogs and library management software need to st ...
for describing bibliographic materials. The initial EAD Version 1.0 was released in the fall of 1998. Such a standard enables
archives
An archive is an accumulation of historical records or materials – in any medium – or the physical facility in which they are located.
Archives contain primary source documents that have accumulated over the course of an individual or ...
,
museum
A museum ( ; plural museums or, rarely, musea) is a building or institution that cares for and displays a collection of artifacts and other objects of artistic, cultural, historical, or scientific importance. Many public museums make these ...
s,
libraries, and manuscript repositories to list and describe their holdings in a manner that would be machine-readable and therefore easy to search, maintain and exchange.
Since its inception, many
archives
An archive is an accumulation of historical records or materials – in any medium – or the physical facility in which they are located.
Archives contain primary source documents that have accumulated over the course of an individual or ...
and
special collections have adopted it.
In addition to the development and maintenance work done by the Society of American Archivists and the Library of Congress, the
Research Libraries Group The Research Libraries Group (RLG) was a U.S.-based library consortium that existed from 1974 until its merger with the OCLC library consortium in 2006. RLG developed the Eureka interlibrary search engine, the RedLightGreen database of bibliographi ...
(RLG) has developed and published a set of "Best Practice" implementation guidelines for EAD, which lays out mandatory, recommended, and optional elements and attributes. RLG has also provided a kind of clearinghouse for finding aids in EAD format, known as
ArchiveGrid. Member libraries provide RLG the URL for their finding aids; RLG automatically
harvests data from the finding aids, indexes it, and provides a search interface for the index, thus giving researchers the ability to search across several hundred institutions' collections with a single query. RLG also has developed the "RLG Report Card," an automated quality-checking program that will analyze an EAD instance and report any areas where it diverges from the best practices guidelines.
SAA's Technical Subcommittee for Encoded Archival Description, which include international representation, embarked on a revision of the EAD standard in 2010. The latest version, EAD3, was released in August 2015.
Adoption
A number of repositories in the
United States
The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country primarily located in North America. It consists of 50 states, a federal district, five major unincorporated territorie ...
,
Canada
Canada is a country in North America. Its ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world's second-largest country by tot ...
, the
United Kingdom
The United Kingdom of Great Britain and Northern Ireland, commonly known as the United Kingdom (UK) or Britain, is a country in Europe, off the north-western coast of the continental mainland. It comprises England, Scotland, Wales and North ...
,
France
France (), officially the French Republic ( ), is a country primarily located in Western Europe. It also comprises of Overseas France, overseas regions and territories in the Americas and the Atlantic Ocean, Atlantic, Pacific Ocean, Pac ...
,
Australia
Australia, officially the Commonwealth of Australia, is a Sovereign state, sovereign country comprising the mainland of the Australia (continent), Australian continent, the island of Tasmania, and numerous List of islands of Australia, sma ...
and elsewhere have adopted and implemented EAD with varying levels of technical sophistication. One of the most ambitious efforts is th
Online Archive of California a
union catalog
A union catalog is a combined library catalog describing the collections of a number of libraries. Union catalogs have been created in a range of media, including book format, microform, cards and more recently, networked electronic databases. Pr ...
of over 5,000 EAD finding aids covering manuscripts and images from institutions across the state. Th
French National Library Francois Mitterrandpublishes more than 90,000 EAD finding aids covering archives and manuscripts.
EAD element set
The EAD standard's
XML schema
An XML schema is a description of a type of Extensible Markup Language, XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntactical constraints imposed ...
specifies the elements to be used to describe a manuscript collection as well as the arrangement of those elements (for example, which elements are required, or which are permitted inside which other elements). The EAD tag set has 146 elements and is used both to describe a collection as a whole, and also to encode a detailed multi-level inventory of the collection. Many EAD elements have been, or can be, mapped to content standards (such as
DACS and
ISAD(G)) and other structural standards (such as MARC or
Dublin Core), increasing the flexibility and
interoperability of the data.
EAD 1.0 was an
SGML
The Standard Generalized Markup Language (SGML; ISO 8879:1986) is a standard for defining generalized markup languages for documents. ISO 8879 Annex A.1 states that generalized markup is "based on two postulates":
* Declarative: Markup should des ...
document type definition
A document type definition (DTD) is a set of ''markup declarations'' that define a ''document type'' for an SGML-family markup language ( GML, SGML, XML, HTML).
A DTD defines the valid building blocks of an XML document. It defines the document ...
(DTD). EAD 2002, the second incarnation of EAD, was finalized in December 2002 and made available as an
XML
Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding documents in a format that is both human-readable and machine-readable. T ...
DTD. The latest version of EAD, EAD3, is available as both an
XML schema
An XML schema is a description of a type of Extensible Markup Language, XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntactical constraints imposed ...
and a DTD.
Parts of an EAD finding aid
''Note: Examples in this section are EAD2, and may not be valid against the EAD3 schema.''
eadheader
''Note: In the current release of EAD3 1.0, the eadheader element has been replaced with the control element.''
The first section of an EAD-encoded finding aid is the
eadheader
. This section contains the
title
and optional
subtitle
of the collection and detailed information about the finding aid itself: who created it, when it was created, its
revision history, the language the finding aid is written in, and so on. The
eadheader
itself has a number of required attributes that map to various
ISO
ISO is the most common abbreviation for the International Organization for Standardization.
ISO or Iso may also refer to: Business and finance
* Iso (supermarket), a chain of Danish supermarkets incorporated into the SuperBest chain in 2007
* Iso ...
standards such as
ISO 3166-1
ISO 3166-1 (''Codes for the representation of names of countries and their subdivisions – Part 1: Country codes'') is a standard defining codes for the names of countries, dependent territories, and special areas of geographical interest. It ...
for country codes and
ISO 8601 for date formats.
The
eadheader
and its child elements can be mapped to other standards for easy interchange of information. They are often mapped to Dublin Core elements such as Creator, Author, Language. For example, in the excerpt below the
relatedencoding="DC"
attribute of the
eadheader
element specifies that child elements will be mapped to Dublin Core; the child element
indicates that the EAD element
maps to the Dublin Core element
.
Example of an
eadheader
:
bachrach_lf
Louis Fabian Bachrach Papers
An inventory of his papers at Blank University
Mary Smith
Blank University
1981
John Jones
13 Sep 2006
English
archdesc
The
archdesc
section contains the description of the collection material itself. First, the ''Descriptive Identification'' or
did
element contains a description of the collection as a whole, including the creator (which may be an individual or an organization), size (usually given in linear feet), inclusive dates, language(s), and an
abstract or brief description. As with the
eadheader
above, elements may be mapped to corresponding standards; elements in this section are usually mapped to MARC elements. For example, in the excerpt below the
relatedencoding="MARC21"
attribute of the
archdesc
element specifies that child elements will be mapped to MARC21; the child element
indicates that the
unittitle
element maps to MARC field 245, subfield a.
Example:
Overview of the Collection
Blank University
Brightman, Samuel C. (Samuel Charles), 1911-1992
Samuel C. Brightman Papers
1932-1992
6 linear ft.
Papers of the American journalist including some war correspondence,
political and political humor writings, and adult education material
2458163
English
Several additional descriptive elements may follow the
did
including:
*
bioghist
- biographic description of the person or organization
*
scopecontent
- a detailed
narrative
A narrative, story, or tale is any account of a series of related events or experiences, whether nonfictional (memoir, biography, news report, documentary, travel literature, travelogue, etc.) or fictional (fairy tale, fable, legend, thriller (ge ...
description of the collection material
*
relatedmaterial
- description of items which the repository acquired separately but which are related to this collection, and which a researcher might want to be aware of
*
separatedmaterial
- items which the repository acquired as part of this collection but which have been separated from it, perhaps for special treatment, storage needs, or cataloging
*
controlaccess
- a list of subject headings or keywords for the collection, usually drawn from an
authoritative source such as
Library of Congress Subject Headings
The Library of Congress Subject Headings (LCSH) comprise a thesaurus (in the information science sense, a controlled vocabulary) of subject headings, maintained by the United States Library of Congress, for use in bibliographic records. LC Subject ...
or the
Art and Architecture Thesaurus
*
accessrestrict
and
userestrict
- statement concerning any restrictions on the material in the collection
*
arrangement
- the way in which the materials in the collection are arranged
The second, and usually largest, section of the
archdesc
is the
dsc
, which contains a full inventory of the collection broken down into progressively smaller intellectual chunks. EAD offers two options: the
c
element which can be nested within itself to an unlimited level, and a set of numbered container elements
c01
through
c12
which can only be nested numerically (i.e. a
c01
can contain only a
c02
; a
c02
can contain only a
c03
, and so on). Note that the
c
and
c0#
elements refer to ''intellectual'' subdivisions of the material; the actual physical container is specified using the
container
element. The inventory may go down to as detailed a level as desired. The example below shows an inventory to the folder level.
Example of an inventory:
Inventory
Correspondence
Adams, Martha
1962-1967
1
1
Barnett, Richard
1965
1
2
...etc
Writings
Short stories
1959-1979
5
1-9
Citing EAD
There have been some studies about how to cite EAD files with variable granularity. In particular, Buneman and Silvello proposed a rule-based system to automatically create citation snippets to be used as references when citing XML data; a case study is based on EAD. Furthermore, Silvello proposed a framework, which learning from examples, automatically creates references at a different level of coarseness for XML files. This framework has been tested on the Library of Congress collection of EAD files.
Criticism
A user study analyzing the user interaction patterns with finding aids highlighted that "
hey
Hey or Hey! may refer to:
Music
* Hey (band), a Polish rock band
Albums
* ''Hey'' (Andreas Bourani album) or the title song (see below), 2014
* ''Hey!'' (Julio Iglesias album) or the title song, 1980
* ''Hey!'' (Jullie album) or the title s ...
focus on rules for description rather than on facilitating access to and use of the materials they list and describe", and that many archive users have serious issues using finding aids. Common and frequent user interaction patterns with finding aids are navigational and thus they require to browse the archival hierarchy to make sense of the archival data.
Some critics claim that EAD constrains researcher interaction because several operations are either impossible or inefficient. For example, it is problematic to:
* let the user access a specific item on-the-fly, since it requires defining fixed access points to the archival hierarchy;
* let the user reconstruct the context of an item without browsing the whole archival hierarchy;
* present the user with only selected items from an archive, since the finding aid presents a given collection as a whole.
Furthermore, EAD allows for several degrees of freedom in tagging practice, which may turn out to be problematic in the automatic processing of EAD files, since it is difficult to know in advance how an institution will use the hierarchical elements. It has been underlined that only EAD files meeting stringent best practice guidelines are shareable and searchable.
[Prom, C.J., Rishel, C.A., Schwartz, S.W., Fox, K.J. (2007). "A Unified Platform for Archival Description and Access". In: Proc. 7th ACM/IEEE Joint Conference on Digital Libraries (JCDL 2007), pp. 157–166. ACM Press, New York (2007)]
See also
*
Encoded Archival Context Encoded Archival Context – Corporate bodies, Persons and Families (EAC-CPF) is an XML standard for encoding information about the creators of archival materials – i.e., a corporate body, person or family -- including their relationships to (a) r ...
(EAC)
*
Metadata
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive metadata – the descriptive ...
*
Metadata standards
*
Extensible Markup Language (XML)
References
External links
ArchiveGridEAD official home pageat the
Library of Congress
The Library of Congress (LOC) is the research library that officially serves the United States Congress and is the ''de facto'' national library of the United States. It is the oldest federal cultural institution in the country. The library is ...
(tag library, links to schema and DTD, etc.)
EAD schema and other files for download via GitHubRLG Best Practices Guidelines for Encoded Archival DescriptionRLG EAD Report Cardan
documentation(EAD2 only)
Society of American ArchivistsSAA Encoded Archival Standards Section
{{Authority control
1998 introductions
Archival science
ISO standards
Markup languages
Technical communication
Metadata standards