Digital curation
   HOME

TheInfoList



OR:

Digital curation is the selection,
preservation Preservation may refer to: Heritage and conservation * Preservation (library and archival science), activities aimed at prolonging the life of a record while making as few changes as possible * ''Preservation'' (magazine), published by the Nat ...
, maintenance, collection and
archiving An archive is an accumulation of historical records or materials – in any medium – or the physical facility in which they are located. Archives contain primary source documents that have accumulated over the course of an individual or ...
of
digital Digital usually refers to something using discrete digits, often binary digits. Technology and computing Hardware *Digital electronics, electronic circuits which operate using digital signals ** Digital camera, which captures and stores digital ...
assets. Digital curation establishes, maintains and adds value to repositories of digital data for present and future use. This is often accomplished by
archivist An archivist is an information professional who assesses, collects, organizes, preserves, maintains control over, and provides access to records and archives determined to have long-term value. The records maintained by an archivist can consi ...
s, librarians, scientists, historians, and scholars. Enterprises are starting to use digital curation to improve the quality of information and data within their operational and strategic processes. Successful digital curation will mitigate digital obsolescence, keeping the information accessible to users indefinitely. Digital curation includes digital asset management, data curation, digital preservation, and electronic records management.


Word History

Much like the word ''archive'' has layered meanings and uses, the word ''curation'' is both a noun and a verb used originally in the field of
museology Museology or museum studies is the study of museums. It explores the history of museums and their role in society, as well as the activities they engage in, including curating, preservation, public programming, and education. Terminology The w ...
to represent a wide range of activities, most often associated with collection care, long-term preservation, and exhibition design.  ''Curation'' can be a reference to physical repositories that store cultural heritage or natural resource collections (e.g., a curatorial repository) or a representation of varied policies and processes involved with the long-term care and management of heritage collections, digital archives, and research data (e.g, curatorial/collections management plans, curation life-cycle, and data curation). Yet ''curation'' is also associated with short-term objectives and processes of selection and interpretation for the purposes of presentation, such as for gallery exhibitions and websites, which contribute to knowledge creation.  It has also been applied to interaction with
social media Social media are interactive media technologies that facilitate the creation and sharing of information, ideas, interests, and other forms of expression through virtual communities and networks. While challenges to the definition of ''social me ...
including compiling digital images, web links and movie files. The term ''curation'' entered the legal framework through federal historic preservation laws, starting with the
National Historic Preservation Act The National Historic Preservation Act (NHPA; Public Law 89-665; 54 U.S.C. 300101 ''et seq.'') is legislation intended to preserve historic and archaeological sites in the United States of America. The act created the National Register of Historic ...
of 1966, and was further defined and coded into federal regulations through 36 CFR Part 79: ''Curation of Federally-owned and Administered Archaeological Collections.'' ''Curation'' has since permeated into an array of disciplines but remains closely tied to heritage and information management.


Core Principles and Activities

The term “digital curation” was first used in the
e-science E-Science or eScience is computationally intensive science that is carried out in highly distributed network environments, or science that uses immense data sets that require grid computing; the term sometimes includes technologies that enable dist ...
and
biological science Biology is the scientific study of life. It is a natural science with a broad scope but has several unifying themes that tie it together as a single, coherent field. For instance, all organisms are made up of cells that process hereditary ...
fields as a means of differentiating the additional suite of activities ordinarily employed by library and museum curators to add value to their collections and enable its reuse from the smaller subtask of simply preserving the data, a significantly more concise archival task. Additionally, the historical understanding of the term “
curator A curator (from la, cura, meaning "to take care") is a manager or overseer. When working with cultural organizations, a curator is typically a "collections curator" or an "exhibitions curator", and has multifaceted tasks dependent on the parti ...
” demands more than simple care of the collection. A
curator A curator (from la, cura, meaning "to take care") is a manager or overseer. When working with cultural organizations, a curator is typically a "collections curator" or an "exhibitions curator", and has multifaceted tasks dependent on the parti ...
is expected to command academic mastery of the subject matter as a requisite part of appraisal and selection of assets and any subsequent adding of value to the collection through application of metadata.


Principles

There are five commonly accepted principles that govern the occupation of digital curation: * Manage the complete birth-to-retirement life cycle of the digital asset. * Evaluate and cull assets for inclusion in the collection. * Apply preservation methods to strengthen the asset’s integrity and reusability for future users. * Act proactively throughout the asset life cycle to add value to both the digital asset and the collection. * Facilitate the appropriate degree of access to users.


Methodology

The Digital Curation Center offers the following step-by-step life cycle procedures for putting the above principles into practice: Sequential Actions: * Conceptualize: Consider what digital material you will be creating and develop storage options. Take into account websites, publications, email, among other types of digital output. * Create: Produce digital material and attach all relevant
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
, typically the more metadata the more accessible the information. * Appraise and select: Consult the mission statement of the institution or private collection and determine what digital data is relevant. There may also be legal guidelines in place that will guide the decision process for a particular collection. *Ingest: Send digital material to the predetermined storage solution. This may be an archive, repository or other facility. *Preservation action: Employ measures to maintain the integrity of the digital material. *Store: Secure data within the predetermined storage facility. * Access, use and reuse: Determine the level of accessibility for the range of digital material created. Some material may be accessible only by password and other material may be freely accessible to the public. Routinely check that material is still accessible for the intended audience and that the material has not been compromised through multiple uses. * Transform: If desirable or necessary the material may be transferred into a different digital format. Occasional Actions: * Dispose: Discard any digital material that is not deemed necessary to the institution. *Reappraise: Reevaluate material to ensure that is it still relevant and is true to its original form. *Migrate: Migrate data to another format in order to protect data for using better in the future.


Related terms

The term "digital curation" is sometimes used interchangeably with terms such as "
digital preservation In library and archival science, digital preservation is a formal endeavor to ensure that digital information of continuing value remains accessible and usable. It involves planning, resource allocation, and application of preservation methods and ...
" and "
digital archiving A document management system (DMS) is usually a computerized system used to store, share, track and manage files or documents. Some systems include history tracking where a log of the various versions created and modified by different users is r ...
". While digital preservation does focus a significant degree of energy on optimizing reusability, preservation remains a subtask to the concept of digital archiving, which is in turn a subtask of digital curation. For example, archiving is a part of curation, but so are subsequent tasks such as themed collection-building, which is not considered an archival task. Similarly, preservation is a part of archiving, as are the tasks of selection and appraisal that are not necessarily part of preservation. Data curation is another term that is often used interchangeably with digital curation, however common usage of the two terms differs. While “
data In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpret ...
” is a more all-encompassing term that can be used generally to indicate anything recorded in binary form, the term “data curation” is most common in scientific parlance and usually refers to accumulating and managing information relative to the process of research. Data-driven research of education request the role of information professional gradually develop tradition of digital service to data curation particularly at the management of digital research data. So, while documents and other discrete digital assets are technically a subset of the broader concept of data, in the context of scientific vernacular digital curation represents a broader purview of responsibilities than data curation due to its interest in preserving and adding value to digital assets of any kind.


Challenges


Rate of creation of new data and data sets

The ever lowering cost, and increasing prevalence of entirely new categories of technology has led to a quickly growing flow of new data sets. These come from well established sources such as business and government, but the trend is also driven by new styles of sensors becoming embedded in more areas of modern life. This is particularly true of consumers, whose production of digital assets is no longer relegated strictly to work. Consumers now create wider ranges of digital assets, including videos, photos, location data, purchases, and fitness tracking data, just to name a few, and share them in wider ranges of social platforms. Additionally, the advance of technology has introduced new ways of working with data. Some examples of this are international partnerships that leverage astronomical data to create “virtual observatories”, and similar partnerships have also leveraged data resulting from research at the
Large Hadron Collider The Large Hadron Collider (LHC) is the world's largest and highest-energy particle collider. It was built by the European Organization for Nuclear Research (CERN) between 1998 and 2008 in collaboration with over 10,000 scientists and hundr ...
at
CERN The European Organization for Nuclear Research, known as CERN (; ; ), is an intergovernmental organization that operates the largest particle physics laboratory in the world. Established in 1954, it is based in a northwestern suburb of Gen ...
and the database of protein structures at the
Protein Data Bank The Protein Data Bank (PDB) is a database for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids. The data, typically obtained by X-ray crystallography, NMR spectroscopy, or, increasingly, cr ...
.


Storage format evolution and obsolescence

By comparison, archiving of analog assets is notably passive in nature, often limited to simply ensuring a suitable storage environment. Digital preservation requires a more proactive approach. Today’s artifacts of cultural significance are notably transient in nature and prone to obsolescence when social trends or dependent technologies change. This rapid progression of technology occasionally makes it necessary to migrate digital asset holdings from one file format to another in order to mitigate the dangers of hardware and software obsolescence which would render the asset unusable.


Underestimation of human labor costs

Modern tools for program planning often underestimate the amount of human labor costs required for adequate digital curation of large collections. As a result cost-benefit assessments often paint an inaccurate picture of both the amount of work involved, and the true cost to the institution for both successful outcomes and failures. The concept of cost in business field would be more obvious. Varieties of business systems are running for daily operations. For example, human resources systems deal with recruitment and payroll, communication systems manage internal and external email, and administration systems handle finance, marketing and other aspects.  However, business systems in institutions are not designed for long-term information preservation initially.Katuu, Shadrack. (2012). Enterprise Content Management and Digital Curation Applications Maturity Model Connections. In some instances, business systems are revised to become Digital Curation systems for preserving transaction information due to cost consideration. The example of business systems are Enterprise Content Management (ECM) applications, which are used by designated group people such as business executives, customers for information management that support key processes organizationally. In the long run, to transfer digital content from ECM applications to Digital Curation (DC) applications would be a trend in large organizations domestically or internationally.  The improvement of maturity models of ECM and DC may add value to information that request cost deduction and extensive use for further modification.


Standardization and coordination between institutions

An absence of coordination across different sectors of society and industry in areas such as the standardization of
semantic Semantics (from grc, σημαντικός ''sēmantikós'', "significant") is the study of reference, meaning, or truth. The term can be used to refer to subfields of several distinct disciplines, including philosophy, linguistics and comput ...
and
ontological In metaphysics, ontology is the philosophical study of being, as well as related concepts such as existence, becoming, and reality. Ontology addresses questions like how entities are grouped into categories and which of these entities exi ...
definitions, and in forming partnerships for proper stewardship of assets has resulted in a lack of interoperability between institutions, and a partial breakdown in digital curation practice from the standpoint of the ordinary user. The example of coordination is Open Archival Information System (OAIS). OAIS Reference Model allows professionals and many other organizations and individuals to contribute efforts to the OAIS open forums for developing international standards of archival information in long-term access.


Digitization of analog materials

The curation of digital objects is not limited to strictly born-digital assets. Many institutions have engaged in monumental efforts to digitize analog holdings in an effort to increase access to their collections. Examples of these materials are books, photographs, maps, audio recordings, and more. The process of converting printed resources into digital collections has been epitomized to some degree by librarians and related specialists. For example, The
Digital Curation Centre The Digital Curation Centre (DCC) was established to help solve the extensive challenges of digital preservation and digital curation and to lead research, development, advice, and support services for higher education institutions in the Unite ...
is claimed to be a "world leading centre of expertise in digital information curation" that assists higher education research institutions in such conversions.


Material Types


Manuscript A manuscript (abbreviated MS for singular and MSS for plural) was, traditionally, any document written by hand – or, once practical typewriters became available, typewritten – as opposed to mechanically printed or reproduced i ...
s


Built

Cultural Heritage Cultural heritage is the heritage of tangible and intangible heritage assets of a group or society that is inherited from past generations. Not all heritages of past generations are "heritage"; rather, heritage is a product of selection by soc ...


= Artifacts

=


=

Monument A monument is a type of structure that was explicitly created to commemorate a person or event, or which has become relevant to a social group as a part of their remembrance of historic times or cultural heritage, due to its artistic, hist ...
s or Architectural Assets

= Nowadays, with the development in ICT and computer-based visualisation, curators benefit from the 3D Reconstruction methods and Digital Twin to not only represent their updated and authentic cultural heritage data sets but also assist conservation architects and the other experts in further practices on the assets.


Intangible Cultural Heritage An intangible cultural heritage (ICH) is a practice, representation, expression, knowledge, or skill considered by UNESCO to be part of a place's cultural heritage. Buildings, historic places, monuments, and artifacts are cultural property. I ...


=

Folklore Folklore is shared by a particular group of people; it encompasses the traditions common to that culture, subculture or group. This includes oral traditions such as tales, legends, proverbs and jokes. They include material culture, rangin ...

=


New representational formats

For some topics, knowledge is embodied in forms that have not been conducive to print, such as how choreography of dance or of the motion of skilled workers or artisans is difficult to encode. New digital approaches such as 3D
hologram Holography is a technique that enables a wavefront to be recorded and later re-constructed. Holography is best known as a method of generating real three-dimensional images, but it also has a wide range of other applications. In principle, i ...
s and other computer-programmed expressions are developing. For mathematics, it seems possible for a new common language to be developed that would express mathematical ideas in ways that can be digitally stored, linked, and made accessible. The Global Digital Mathematics Library is a project to define and develop such a language.


Accessibility

The ability of the intended user community to access the repository’s holdings is of equal importance to all the preceding curatorial tasks. This must take into account not only the user community’s format and communication preferences, but also a consideration of communities that should not have access for various legal or privacy reasons. Access can be increased by providing information about
open access Open access (OA) is a set of principles and a range of practices through which research outputs are distributed online, free of access charges or other barriers. With open access strictly defined (according to the 2001 definition), or libre op ...
status with
open data Open data is data that is openly accessible, exploitable, editable and shared by anyone for any purpose. Open data is licensed under an open license. The goals of the open data movement are similar to those of other "open(-source)" movements ...
and
open source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized so ...
methods such as the
OAI-PMH The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a protocol developed for harvesting metadata descriptions of records in an archive so that services can be built using metadata from many archives. An implementation of OAI- ...
endpoints of an open archive, which are then aggregated by databases and search engines like BASE, CORE and
Unpaywall OurResearch, formerly known as ImpactStory, is a nonprofit organization which creates and distributes tools and services for libraries, institutions and researchers. The organization follows open practices with their data (to the extent allowed by ...
for academic papers.


Responses to challenges

* Specialized research institutions There are three elements for essential needs of institutions dealing with issues of digital curation: Leadership, Resources, and Collaboration. Three elements related to the role of advance-guards for librarians and archivists working with open approaches to technology, standardized process and scholarly communication. The archivist with leadership, who needs to be a dynamic and active role to embrace technology, standardized process, and scholarly communication. In addition, Archivist leader might adopt the business concept and methods to deal with their workflow such as raise funds, invest technology system, and comply with industry standards, in order to obtain more resources. Collaboration in archives and digital curation community could provide and share training, technologies, standards and tools to help institutions on challengeable issues of digital curation. Digital Preservation Coalition (DPC), the Open Preservation Foundation or novel partnerships offer collaboration opportunity to institutions facing similar challenges in digital curation issues. * Academic courses Information field especially in libraries, archives, and museums significantly need to bring knowledge of new technologies. Traditional graduate school education is not enough to meet that demand; training program for current staffs in cultural repository would be an efficient supplement for that request, such as professional workshops, and MOOCs (Massively Open Online Courses) in data curation and management. * Dedicated symposia International Digital Curation Conference (IDCC) is an established annual event since 2005, aiming to collaborate with individuals, organizations and institutions facing challenges, supporting development, and exchanging ideas in the field. * Peer reviewed technical and industry journals The International Journal of Digital Curation (IJDC) is administered by IJDC Editorial Board including the Editor-in-Chief, Digital Curation Center (DCC), and the following members.  IJDC dedicate to provide scholarly platform for sharing, discussing, and improving knowledge and information of digital curation within the worldwide community. IJDC has two types of submission under editorial guidelines, which are peer-reviewed papers and general articles base on original research, the field information and relevant events in digital curation. IJDC is published by the University of Edinburgh for the
Digital Curation Centre The Digital Curation Centre (DCC) was established to help solve the extensive challenges of digital preservation and digital curation and to lead research, development, advice, and support services for higher education institutions in the Unite ...
in electronic form on a rolling basis two times a year. The open access to the public supports knowledge exchangeable in digital curation worldwide.


Approaches

Many approaches to digital curation exist, and have evolved over time in response to the changing technological landscape. Two examples of this are sheer curation and channelization. ''Sheer curation'' is an approach to digital curation where curation activities are quietly integrated into the normal work flow of those creating and managing data and other digital assets. The word sheer is used to emphasize the lightweight and virtually transparent nature of these curation activities. The term ''sheer curation'' was coined by Alistair Miles in the ImageStore project, and the UK Digital Curation Centre's SCARP project. The approach depends on curators having close contact or 'immersion' in data creators' working practices. An example is the case study of a neuroimaging research group by Whyte et al., which explored ways of building its digital curation capacity around the apprenticeship style of learning of neuroimaging researchers, through which they share access to datasets and re-use experimental procedures. Sheer curation depends on the hypothesis that good data and digital asset management at the point of creation and primary use is also good practice in preparation for sharing, publication and/or long-term preservation of these assets. Therefore, sheer curation attempts to identify and promote tools and good practices in local data and digital asset management in specific domains, where those tools and practices add immediate value to the creators and primary users of those assets. Curation can best be supported by identifying existing practices of sharing, stewardship and re-use that add value, and augmenting them in ways that both have short-term benefits, and in the longer term reduce risks to digital assets or provide new opportunities to sustain their long-term accessibility and re-use value. The aim of sheer curation is to establish a solid foundation for other curation activities which may not directly benefit the creators and primary users of digital assets, especially those required to ensure long-term preservation. By providing this foundation, further curation activities may be carried out by specialists at appropriate institutional and organisation levels, whilst causing the minimum of interference to others. A similar idea is ''curation at source'' used in the context of Laboratory Information Management Systems LIMS. This refers more specifically to automatic recording of metadata or information about data at the point of capture, and has been developed to apply semantic web techniques to integrate laboratory instrumentation and documentation systems.Frey, J
'Sharing and Collaboration' keynote presentation at UK e-Science All Hands Meeting
, 8–11 September 2008, Edinburgh
Sheer curation and curation-at-source can be contrasted with post hoc
digital preservation In library and archival science, digital preservation is a formal endeavor to ensure that digital information of continuing value remains accessible and usable. It involves planning, resource allocation, and application of preservation methods and ...
, where a project is initiated to preserve a collection of digital assets that have already been created and are beyond the period of their primary use. ''Channelization'' is curation of digital assets on the web, often by brands and media companies, into continuous flows of content, turning the user experience from a lean-forward interactive medium, to a lean-back passive medium. The curation of content can be done by an independent third party, that selects media from any number of on-demand outlets from across the globe and adds them to a playlist to offer a digital "channel" dedicated to certain subjects, themes, or interests so that the end user would see and/or hear a continuous stream of content.


See also

*
Digital preservation In library and archival science, digital preservation is a formal endeavor to ensure that digital information of continuing value remains accessible and usable. It involves planning, resource allocation, and application of preservation methods and ...
* Data curation * Digital asset management * Digital Forensics * Data format management * Digital artifactual value *
Digital obsolescence Digital obsolescence is the risk of data loss because of inabilities to access digital assets, due to the hardware or software required for information retrieval being repeatedly replaced by newer devices and systems, resulting in increasingly in ...
*
Curator A curator (from la, cura, meaning "to take care") is a manager or overseer. When working with cultural organizations, a curator is typically a "collections curator" or an "exhibitions curator", and has multifaceted tasks dependent on the parti ...
*
Biocurator Biocuration is the field of life sciences dedicated to organizing biomedical data, information and knowledge into structured formats, such as spreadsheets, tables and knowledge graphs. The biocuration of biomedical knowledge is made possible by th ...


References

{{reflist


External links


Animations introducing digital preservation and curationDigital Curation CentreGoogle Arts & Culture
Online platform of high-resolution images and videos of artworks and cultural artifacts from cultural organizations throughout the world.
DigitalCurator.art/
Project allows users to explore the art collections of European museums and search for artworks based on specific motifs detected by computer vision.
Digital Curation and Trusted Repositories: Steps Toward SuccessDigCurV
A project funded by the European Commission to establish a curriculum framework for vocational training in digital curation. Archival science Databases Knowledge representation Digital preservation