Digitization
[Tech Target. (2011, April). Definition: digitization. ''WhatIs.com''. Retrieved December 15, 2021, from https://whatis.techtarget.com/definition/digitization] is the process of converting information into a
digital
Digital usually refers to something using discrete digits, often binary digits.
Technology and computing Hardware
*Digital electronics, electronic circuits which operate using digital signals
**Digital camera, which captures and stores digital i ...
(i.e. computer-readable) format.
[Collins Dictionary. (n.d.). Definition of 'digitize'. Retrieved December 15, 2021, from https://www.collinsdictionary.com/dictionary/english/digitize] The result is the representation of an object,
image
An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensiona ...
,
sound,
document, or
signal (usually an
analog signal
An analog signal or analogue signal (see spelling differences) is any continuous signal representing some other quantity, i.e., ''analogous'' to another quantity. For example, in an analog audio signal, the instantaneous signal voltage varies c ...
) obtained by generating a series of numbers that describe a discrete set of points or
samples. The result is called ''
digital
Digital usually refers to something using discrete digits, often binary digits.
Technology and computing Hardware
*Digital electronics, electronic circuits which operate using digital signals
**Digital camera, which captures and stores digital i ...
representation
Representation may refer to:
Law and politics
*Representation (politics), political activities undertaken by elected representatives, as well as other theories
** Representative democracy, type of democracy in which elected officials represent a ...
'' or, more specifically, a ''
digital image
A digital image is an image composed of picture elements, also known as ''pixels'', each with ''finite'', '' discrete quantities'' of numeric representation for its intensity or gray level that is an output from its two-dimensional functions ...
'', for the object, and ''digital form'', for the signal. In modern practice, the digitized data is in the form of
binary numbers
A binary number is a number expressed in the base-2 numeral system or binary numeral system, a method of mathematical expression which uses only two symbols: typically "0" (zero) and "1" ( one).
The base-2 numeral system is a positional notation ...
, which facilitates processing by
digital computers and other operations, but digitizing simply means "the conversion of analog source material into a numerical format"; the
decimal
The decimal numeral system (also called the base-ten positional numeral system and denary or decanary) is the standard system for denoting integer and non-integer numbers. It is the extension to non-integer numbers of the Hindu–Arabic numeral ...
or any other
number system can be used instead.
Digitization is of crucial importance to data processing, storage, and transmission, because it "allows information of all kinds in all formats to be carried with the same efficiency and also intermingled." Though analog data is typically more stable, digital data has the potential to be more easily shared and accessed and, in theory, can be propagated indefinitely without generation loss, provided it is
migrated to new, stable formats as needed.
[Brown, A. (2013). ''Practical digital preservation: A how-to guide for organizations of any size''. Neal Schuman.] This potential has led to institutional digitization projects designed to improve access and the rapid growth of the digital preservation field.
[Daigle, B. J. (2012). The digital transformation of special collections. ''Journal of Library Administration, 52''(3-4), 244-254. https://doi.org/10.1080/01930826.2012.684504]
Sometimes digitization and digital preservation are mistaken for the same thing. They are different, but digitization is often a vital first step in digital preservation.
[Snawder, K. (2011, July 15). Digitization is different than digital preservation: help prevent digital orphans! ''The Signal''. https://blogs.loc.gov/thesignal/2011/07/digitization-is-different-than-digital-preservation-help-prevent-digital-orphans/] Libraries, archives, museums, and other memory institutions digitize items to preserve fragile materials and create more access points for patrons.
[Riley-Reid, T.D. (2015, July 6). The hidden cost of digitization: Things to consider. ''Collection Building. 34''(3), 89-93. DOI 10.1108/CB-01-2015-0001] Doing this creates challenges for information professionals and solutions can be as varied as the institutions that implement them.
[Potgieter, A. & Mabe, K. (2018). The future of accessing our past: Collaboration and digitization in libraries, archives and museums. ''Proceedings of business and management conferences. 6809039.'' https://scholar.google.com/citations?view_op=view_citation&hl=en&user=3phltK0AAAAJ&citation_for_view=3phltK0AAAAJ:d1gkVwhDpl0C] Some analog materials, such as audio and video tapes, are nearing the end of their life-cycle, and it is important to digitize them before equipment obsolescence and media deterioration makes the data irretrievable.
There are challenges and implications surrounding digitization including time, cost, cultural history concerns, and creating an equitable platform for historically marginalized voices.
[Hughes-Watkins, L. (2018). Moving toward a reparative archive: A roadmap for a holistic approach to disrupting homogenous histories in academic repositories and creating inclusive spaces for marginalized voices. ''Journal of Contemporary Archival Studies 5,''article 6. https://elischolar.library.yale.edu/jcas/vol5/iss1/6] Many digitizing institutions develop their own solutions to these challenges.
[Riley-Reid, T.D. (2015, July 6). The hidden cost of digitization: Things to consider. ''Collection Building. 34''(3), 89-93. DOI 10.1108/CB-01-2015-0001]
Mass digitization projects have had mixed results over the years, but some institutions have had success even if not in the traditional Google Books model.
[Verheusen, A. (2008). Mass digitization by libraries: Issues concerning organisation, quality and efficiency. ''Liber Quarterly'', 18(1), 28-38.]
Technological changes can happen often and quickly, so digitization standards are difficult to keep updated. Professionals in the field can attend conferences and join organizations and working groups to keep their knowledge current and add to the conversation.
[Northeast Document Conservation Center. (n.d.). ''Session 7: Reformatting and digitization''. Preservation 101. Retrieved December 15, 2021, from https://www.nedcc.org/preservation101/session-7/7digitization]
Process
The term digitization is often used when diverse forms of information, such as an object, text, sound, image, or voice, are converted into a single
binary code
A binary code represents text, computer processor instructions, or any other data using a two-symbol system. The two-symbol system used is often "0" and "1" from the binary number system. The binary code assigns a pattern of binary digits, also ...
. The core of the process is the compromise between the capturing device and the player device so that the rendered result represents the original source with the most possible fidelity, and the advantage of digitization is the speed and accuracy in which this form of information can be transmitted with no degradation compared with analog information.
Digital information exists as one of two digits, either 0 or 1. These are known as
bits (a contraction of ''binary digits'') and the sequences of 0s and 1s that constitute information are called
bytes.
Analog signals are
continuously variable, both in the number of possible values of the signal ''at'' a given
time, as well as in the number of points in the signal ''in'' a given period of time. However, digital signals are
discrete in both of those respects – generally a finite sequence of integers – therefore a digitization can, in practical terms, only ever be an
approximation
An approximation is anything that is intentionally similar but not exactly equality (mathematics), equal to something else.
Etymology and usage
The word ''approximation'' is derived from Latin ''approximatus'', from ''proximus'' meaning ''very ...
of the signal it represents.
Digitization occurs in two parts:
;Discretization: The reading of an analog signal ''A'', and, at regular time intervals (
frequency),
sampling the value of the signal at the point. Each such reading is called a ''sample'' and may be considered to have infinite precision at this stage;
;Quantization: Samples are rounded to a fixed set of numbers (such as integers), a process known as
quantization.
In general, these can occur at the same time, though they are conceptually distinct.
A series of digital integers can be transformed into an analog output that approximates the original analog signal. Such a transformation is called a
DA conversion
In electronics, a digital-to-analog converter (DAC, D/A, D2A, or D-to-A) is a system that converts a digital signal into an analog signal. An analog-to-digital converter (ADC) performs the reverse function.
There are several DAC architec ...
. The
sampling rate and the number of bits used to represent the integers combine to determine how close such an approximation to the analog signal a digitization will be.
Examples
The term is used to describe, for example, the
scanning of analog sources (such as printed
photos or taped
videos) into computers for editing, 3D scanning that creates
3D modeling
In 3D computer graphics, 3D modeling is the process of developing a mathematical coordinate-based representation of any surface of an object (inanimate or living) in three dimensions via specialized software by manipulating edges, vertices, an ...
of an object's surface, and
audio (where sampling rate is often measured in
kilohertz
The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), equivalent to one event (or cycle) per second. The hertz is an SI derived unit whose expression in terms of SI base units is s−1, meaning that one h ...
) and
texture map transformations. In this last case, as in normal photos, the sampling rate refers to the
resolution of the image, often measured in
pixels per inch.
Digitizing is the primary way of storing images in a form suitable for
transmission
Transmission may refer to:
Medicine, science and technology
* Power transmission
** Electric power transmission
** Propulsion transmission, technology allowing controlled application of power
*** Automatic transmission
*** Manual transmission
*** ...
and
computer
A computer is a machine that can be programmed to Execution (computing), carry out sequences of arithmetic or logical operations (computation) automatically. Modern digital electronic computers can perform generic sets of operations known as C ...
processing, whether scanned from two-dimensional analog originals or captured using an
image sensor-equipped device such as a
digital camera,
tomographical instrument such as a
CAT scan
A computed tomography scan (CT scan; formerly called computed axial tomography scan or CAT scan) is a medical imaging technique used to obtain detailed internal images of the body. The personnel that perform CT scans are called radiographers ...
ner, or acquiring precise dimensions from a real-world object, such as a
car
A car or automobile is a motor vehicle with wheels. Most definitions of ''cars'' say that they run primarily on roads, seat one to eight people, have four wheels, and mainly transport people instead of goods.
The year 1886 is regarded as ...
, using a
3D scanning
3D scanning is the process of analyzing a real-world object or environment to collect data on its shape and possibly its appearance (e.g. color). The collected data can then be used to construct digital 3D models.
A 3D scanner can be based on ...
device.
Digitizing is central to making digital representations of geographical features, using raster or vector images, in a
geographic information system
A geographic information system (GIS) is a type of database containing Geographic data and information, geographic data (that is, descriptions of phenomena for which location is relevant), combined with Geographic information system software, sof ...
, i.e., the creation of
electronic maps, either from various geographical and satellite imaging (raster) or by digitizing traditional paper
maps or
graphs (vector).
"Digitization" is also used to describe the process of populating
databases with files or data. While this usage is technically inaccurate, it originates with the previously proper use of the term to describe that part of the process involving digitization of analog sources, such as printed pictures and brochures, before uploading to target databases.
Digitizing may also be used in the field of apparel, where an image may be recreated with the help of
embroidery digitizing software tools and saved as
embroidery machine
Machine embroidery is an embroidery process whereby a sewing machine or embroidery machine is used to create patterns on textiles. It is used commercially in product branding, corporate advertising, and uniform adornment. It is also used in the ...
code. This machine code is fed into an embroidery machine and applied to the fabric. The most supported format is DST file. Apparel companies also digitize clothing patterns.
History
* 1957 The Standards Electronic Automatic Computer (SEAC) was invented.
[Roemer, C. (n.d.). What is the history of digitization? ''Aperture: A Kodak Digitizing Blog''. Retrieved November 11, 2021, from https://kodakdigitizing.com/blogs/news/what-is-the-history-of-digitization] That same year,
Russell Kirsch
Russell A. Kirsch (June 20, 1929August 11, 2020) was an American engineer at the National Bureau of Standards (now known as the National Institute of Standards and Technology). He was recognized as the developer of the first digital image scanne ...
used a rotating drum scanner and photomultiplier connected to SEAC to create the first digital image (176x176 pixels) from a photo of his infant son.
[Kirsch, R. A. (2001, January). Computer development at the National Bureau of Standards. ''A Century of Excellence in Measurements, Standards, and Technology: A Chronicle of Selected NBS/NIST Publications, 1901-2000.'' https://nistdigitalarchives.contentdm.oclc.org/digital/collection/p15421coll5/id/1386] This image was stored in SEAC memory via a staticizer and viewed via a cathode ray oscilloscope.
* 1971 Invention of Charge-Coupled Devices that made conversion from analog data to a digital format easy.
* 1986 work started on the
JPEG
JPEG ( ) is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and imag ...
format.
* 1990s Libraries began scanning collections to provide access via the world wide web.
[Verheusen, A. (2008). Mass digitization by libraries: Issues concerning organisation, quality and efficiency. ''Liber Quarterly'', 18(1), 28-38.]
Analog signals to digital
Analog signals are continuous electrical signals; digital signals are non-continuous. Analog signals can be converted to digital signals by using an
analog-to-digital converter.
The process of converting analog to digital consists of two parts: sampling and quantizing. Sampling measures wave amplitudes at regular intervals, splits them along the vertical axis, and assigns them a numerical value, while quantizing looks for measurements that are between binary values and rounds them up or down.
Nearly all recorded music has been digitized, and about 12 percent of the 500,000+ movies listed on the
Internet Movie Database are digitized and were released on
DVD.
Digitization of
home movies
A home movie is a short amateur film or video typically made just to preserve a visual record of family activities, a vacation, or a special event, and intended for viewing at home by family and friends. Originally, home movies were made on ph ...
,
slides, and
photographs is a popular method of preserving and sharing personal multimedia. Slides and photographs may be scanned quickly using an
image scanner, but analog video requires a video tape player to be connected to a computer while the item plays in real time. Slides can be digitized quicker with a slide scanner such as the
Nikon Coolscan 5000ED.
Another example of digitization is the
VisualAudio process developed by the Swiss ''Fonoteca Nazionale'' in
Lugano, by scanning a high resolution photograph of a record, they are able to extract and reconstruct the sound from the processed image.
Digitization of analog tapes before they degrade, or after damage has already occurred, can rescue the only copies of local and traditional cultural music for future generations to study and enjoy.
[Breeding, M. (2014, November). Ongoing challenges in digitization. ''Computers in Libraries, 34''(9), 16-18. https://librarytechnology.org/document/20128]
Analog texts to digital
Academic and public libraries, foundations, and private companies like
Google are scanning older print books and applying
optical character recognition (OCR) technologies so they can be keyword searched, but as of 2006, only about 1 in 20 texts had been digitized. Librarians and archivists are working to increase this statistic and in 2019 began digitizing 480,000 books published between 1923 and 1964 that had entered the public domain.
Unpublished manuscripts and other rare papers and documents housed in special collections are being digitized by
libraries and
archives, but backlogs often slow this process and keep materials with enduring historical and research value hidden from most users (see
digital libraries). Digitization has not completely replaced other archival imaging options, such as
microfilming
Microforms are scaled-down reproductions of documents, typically either photographic film, films or paper, made for the purposes of transmission, storage, reading, and printing. Microform images are commonly reduced to about 4% or of the origin ...
which is still used by institutions such as the National Archives and Records Administration (
NARA) to provide preservation and access to these resources.
While digital versions of analog texts can potentially be accessed from anywhere in the world, they are not as stable as most print materials or manuscripts and are unlikely to be accessible decades from now without further preservation efforts, while many books manuscripts and scrolls have already been around for centuries.
[Breeding, M. (2014, November). Ongoing challenges in digitization. ''Computers in Libraries, 34''(9), 16-18. https://librarytechnology.org/document/20128] However, for some materials that have been damaged by water, insects, or catastrophes, digitization might be the only option for continued use.
[Breeding, M. (2014, November). Ongoing challenges in digitization. ''Computers in Libraries, 34''(9), 16-18. https://librarytechnology.org/document/20128]
Library preservation
In the context of libraries, archives, and museums, digitization is a means of creating digital surrogates of analog materials, such as books, newspapers,
microfilm and videotapes, offers a variety of benefits, including increasing access, especially for patrons at a distance; contributing to collection development, through collaborative initiatives; enhancing the potential for research and education; and supporting preservation activities. Digitization can provide a means of preserving the content of the materials by creating an accessible facsimile of the object in order to put less strain on already fragile originals. For sounds, digitization of legacy analog recordings is essential insurance against technological obsolescence. A fundamental aspect of planning digitization projects is to ensure that the digital files themselves are preserved and remain accessible; the term "
digital preservation," in its most basic sense, refers to an array of activities undertaken to maintain access to digital materials over time.
The prevalent
Brittle Books issue facing libraries across the world is being addressed with a digital solution for long term book preservation. Since the mid-1800s, books were printed on
wood-pulp paper, which turns acidic as it decays. Deterioration may advance to a point where a book is completely unusable. In theory, if these widely circulated titles are not treated with de-acidification processes, the materials upon those acid pages will be lost. As digital technology evolves, it is increasingly preferred as a method of preserving these materials, mainly because it can provide easier access points and significantly reduce the need for physical storage space.
Cambridge University Library is working on the
Cambridge Digital Library, which will initially contain digitised versions of many of its most important works relating to science and religion. These include examples such as Isaac Newton's personally annotated first edition of his
Philosophiæ Naturalis Principia Mathematica
(English: ''Mathematical Principles of Natural Philosophy'') often referred to as simply the (), is a book by Isaac Newton that expounds Newton's laws of motion and his law of universal gravitation. The ''Principia'' is written in Latin and ...
as well as college notebooks and other papers, and some Islamic manuscripts such as a
Quran from Tipu Sahib's library.
Google, Inc. has taken steps towards attempting to digitize every title with "
Google Book Search". While some academic libraries have been contracted by the service, issues of copyright law violations threaten to derail the project. However, it does provide – at the very least – an online consortium for libraries to exchange information and for researchers to search for titles as well as review the materials.
Digitization versus digital preservation
Digitizing something is not the same as digitally preserving it.
[Snawder, K. (2011, July 15). Digitization is different than digital preservation: help prevent digital orphans! ''The Signal''. https://blogs.loc.gov/thesignal/2011/07/digitization-is-different-than-digital-preservation-help-prevent-digital-orphans/] To digitize something is to create a digital surrogate (copy or format) of an existing analog item (book, photograph, or record) and is often described as converting it from analog to digital, however both copies remain. An example would be scanning a photograph and having the original piece in a photo album and a digital copy saved to a computer. This is essentially the first step in digital preservation which is to maintain the digital copy over a long period of time and making sure it remains authentic and accessible.
[Brown, A. (2013). ''Practical digital preservation: A how-to guide for organizations of any size''. Neal Schuman.]
Digitization is done once with the technology currently available, while digital preservation is more complicated because technology changes so quickly that a once popular storage format may become obsolete before it breaks.
An example is a 5 1/4" floppy drive, computers are no longer made with them and obtaining the hardware to convert a file stored on 5 1/4" floppy disc can be expensive. To combat this risk, equipment must be upgraded as newer technology becomes affordable (about 2 to 5 years), but before older technology becomes unobtainable (about 5 to 10 years).
Digital preservation can also apply to born-digital material, such as a Microsoft Word document or a social media post. In contrast, digitization only applies exclusively to analog materials. Born-digital materials present a unique challenge to digital preservation not only due to technological obsolescence but also because of the inherently unstable nature of digital storage and maintenance.
Most websites last between 2.5 and 5 years, depending on the purpose for which they were designed.
The Library of Congress provides numerous resources and tips for individuals looking to practice digitization and digital preservation for their personal collections.
Digital reformatting
Digital reformatting is the process of converting analog materials into a digital format as a surrogate of the original. The digital surrogates perform a preservation function by reducing or eliminating the use of the original. Digital reformatting is guided by established best practices to ensure that materials are being converted at the highest quality.
Digital reformatting at the Library of Congress
The
Library of Congress has been actively reformatting materials for its
American Memory project and developed best standards and practices pertaining to book handling during the digitization process, scanning resolutions, and preferred file formats. Some of these standards are:
*The use of
ISO 16067-1 and ISO 16067-2 standards for
resolution requirements.
*Recommended 400
ppi resolution for
OCR'ed printed text.
*The use of
24-bit color
In computer architecture, 4-bit integers, or other data units are those that are 4 bits wide. Also, 4-bit central processing unit (CPU) and arithmetic logic unit (ALU) architectures are those that are based on registers, or data buses of that siz ...
when color is an important attribute of a document.
*The use of the scanning device's maximum resolution for digitally reproducing photographs
*
TIFF as the standard file format.
*Attachment of descriptive, structural, and technical
metadata
Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including:
* Descriptive metadata – the descriptive ...
to all digitized documents.
A list of archival standards for digital preservation can be found on the
ARL website.
The Library of Congress has constituted a Preservation Digital Reformatting Program. The Three main components of the program include:
*Selection Criteria for digital reformatting
*Digital reformatting principles and specifications
*Life cycle management of LC digital data
Audio digitization and reformatting
Audio media offers a rich source of historic ethnographic information, with the earliest forms of recorded sound dating back to 1890.
According to the
International Association of Sound and Audiovisual Archives
International is an adjective (also used as a noun) meaning "between nations".
International may also refer to:
Music Albums
* ''International'' (Kevin Michael album), 2011
* ''International'' (New Order album), 2002
* ''International'' (The T ...
(IASA), these sources of audio data, as well as the aging technologies used to play them back, are in imminent danger of permanent loss due to degradation and obsolescence. These primary sources are called “carriers” and exist in a variety of formats, including wax cylinders, magnetic tape, and flat discs of grooved media, among others. Some formats are susceptible to more severe, or quicker, degradation than others. For instance, lacquer discs suffer from
delamination. Analog tape may deteriorate due to
sticky shed syndrome.
Archival workflow and file standardization have been developed to minimize loss of information from the original carrier to the resulting digital file as digitization is underway. For most at-risk formats (magnetic tape, grooved cylinders, etc.), a similar workflow can be observed. Examination of the source carrier will help determine what, if any, steps need to be taken to repair material prior to transfer. A similar inspection must be undertaken for the playback machines. If satisfactory conditions are met for both carrier and playback machine, the transfer can take place, moderated by an
analog-to-digital converter. The digital signal is then represented visually for the transfer engineer by a
digital audio workstation, like Audacity, WaveLab, or Pro Tools. Reference access copies can be made at smaller sample rates. For archival purposes, it is standard to transfer at a sample rate of 96 kHz and a bit depth of 24 bits per channel.
Challenges
Many libraries, archives, museums, and other memory institutions, struggle with catching up and staying current regarding digitization and the expectation that everything should already be online.
[Greene, M. A. (2010). MPLP: It's not just for processing anymore. ''The American Archivist, 73''(1), 175-203.][Lampert, C. (2018, January 3). Ramping up: Evaluating large-scale digitization potential with small-scale resources''. Digital Library Perspectives, 34''(1), 45-59. http://dx.doi.org/10.1108/DLP-06-2017-0020] The time spent planning, doing the work, and processing the digital files along with the expense and fragility of some materials are some of the most common.
Time spent
Digitization is a time-consuming process, even more so when the condition or format of the analog resources requires special handling. Deciding what part of a collection to digitize can sometimes take longer than digitizing it in its entirety.
[Erway, R. (2008, December). Supply and demand: Special collections and digitisation. ''Liber Quarterly, 18''(3/4), 324-336.] Each digitization project is unique and workflows for one will be different from every other project that goes through the process, so time must be spent thoroughly studying and planning each one to create the best plan for the materials and the intended audience.
Expense
Cost of equipment, staff time, metadata creation, and digital storage media make large scale digitization of collections expensive for all types of
cultural institutions.
[Sutton, S. C. (2017, April 10). Balancing boutique-level quality and large-scale production: The impact of "More Product, Less Process" on digitization in archives and special collections. ''RBM: A Journal of Rare Books, Manuscripts, and Cultural Heritage, 13''(1), 50-63. https://doi.org/10.5860/rbm.13.1.369]
Ideally all institutions want their digital copies to have the best image quality so a high-quality copy can be maintained over time.
However, smaller institutions may not be able to afford such equipment or manpower, which limits how much material can be digitized, so archivists and librarians must know what their patrons need and prioritize digitization of those items.
[Northeast Document Conservation Center. (n.d.) ''6.6 preservation and selection for digitization''. Free Resources. Retrieved October 24, 2021, from https://www.nedcc.org/free-resources/preservation-leaflets/6.-reformatting/6.6-preservation-and-selection-for-digitization] Often the cost of time and expertise involved with describing materials and adding metadata is more than the digitization process.
[Breeding, M. (2014, November). Ongoing challenges in digitization. ''Computers in Libraries, 34''(9), 16-18. https://librarytechnology.org/document/20128]
Fragility of materials
Some materials, such as brittle books, are so fragile that undergoing the process of digitization could damage them irreparably.
Despite potential damage, one reason for digitizing fragile materials is because they are so heavily used that creating a digital surrogate will help preserve the original copy long past its expected lifetime and increase access to the item.
[Riley-Reid, T.D. (2015, July 6). The hidden cost of digitization: Things to consider. ''Collection Building, 34''(3), 89-93. DOI 10.1108/CB-01-2015-0001]
Copyright
Copyright is not only a problem faced by projects like
Google Books, but by institutions that may need to contact private citizens or institutions mentioned in archival documents for permission to scan the items for digital collections.
It can be time consuming to make sure all potential copyright holders have given permission, but if copyright cannot be determined or cleared, it may be necessary to restrict even digital materials to in library use.
Solutions
Institutions can make digitization more cost-effective by planning before a project begins, including outlining what they hope to accomplish and the minimum amount of equipment, time, and effort that can meet those goals.
[Riley-Reid, T.D. (2015, July 6). The hidden cost of digitization: Things to consider. ''Collection Building, 34''(3), 89-93. DOI 10.1108/CB-01-2015-0001] If a budget needs more money to cover the cost of equipment or staff, an institution might investigate if grants are available.
[Sutton, S. C. (2017, April 10). Balancing boutique-level quality and large-scale production: The impact of "More Product, Less Process" on digitization in archives and special collections. ''RBM: A Journal of Rare Books, Manuscripts, and Cultural Heritage, 13''(1), 50-63. https://doi.org/10.5860/rbm.13.1.369]
Collaboration
Collaborations between institutions have the potential to save money on equipment, staff, and training as individual members share their equipment, manpower, and skills rather than pay outside organizations to provide these services.
[Potgieter, A. & Mabe, K. (2018). The future of accessing our past: Collaboration and digitization in libraries, archives and museums. ''Proceedings of business and management conferences. 6809039.'' https://scholar.google.com/citations?view_op=view_citation&hl=en&user=3phltK0AAAAJ&citation_for_view=3phltK0AAAAJ:d1gkVwhDpl0C] Collaborations with donors can build long-term support of current and future digitization projects.
[Lampert, C. (2018, January 3). Ramping up: Evaluating large-scale digitization potential with small-scale resources''. Digital Library Perspectives, 34''(1), 45-59. http://dx.doi.org/10.1108/DLP-06-2017-0020]
Outsourcing
Outsourcing can be an option if an institution does not want to invest in equipment but since most vendors require an inventory and basic metadata for materials, this is not an option for institutions hoping to digitize without processing.
Non-traditional staffing
Many institutions have the option of using volunteers, student employees, or temporary employees on projects. While this saves on staffing costs, it can add costs elsewhere such as on training or having to re-scan items due to poor quality.
MPLP
One way to save time and resources is by using the
More Product, Less Process (MPLP) method to digitize materials while they are being processed.
[Greene, M. A. (2010). MPLP: It's not just for processing anymore. ''The American Archivist, 73''(1), 175-203.] Since
GLAM (Galleries, Libraries, Archives, and Museums) institutions are already committed to preserving analog materials from special collections, digital access copies do not need to be high-resolution preservation copies, just good enough to provide access to rare materials.
[Erway, R. (2008, December). Supply and demand: Special collections and digitisation. ''Liber Quarterly, 18''(3/4), 324-336.] Sometimes institutions can get by with 300 dpi JPGs rather than a 600 dpi TIFF for images, and a 300 dpi grayscale scan of a document rather than a color one at 600 dpi.
Digitizing marginalized voices
Digitization can be used to highlight voices of historically marginalized peoples and add them to the greater body of knowledge. Many projects, some community archives created by members of those groups, are doing this in a way that supports the people, values their input and collaboration, and gives them a sense of ownership of the collection.
[Manzuch, Z. (2017). Ethical issues in digitization of cultural heritage. ''Journal of Contemporary Archival Studies, 4(''2), article 4. http://elischolar.library.yale.edu/jcas/vol4/iss2/4][Hughes-Watkins, L. (2018). Moving toward a reparative archive: A roadmap for a holistic approach to disrupting homogenous histories in academic repositories and creating inclusive spaces for marginalized voices. ''Journal of Contemporary Archival Studies 5,''article 6. https://elischolar.library.yale.edu/jcas/vol5/iss1/6] Examples of projects are Gi-gikinomaage-min and the
South Asian American Digital Archive
The South Asian American Digital Archive (SAADA) is a 501(c)(3) not-for-profit organization that archives materials associated with the history of South Asian Americans.
History
SAADA was established in 2008 to preserve, document, and share the re ...
(SAADA).
Gi-gikinomaage-min
Gi-gikinomaage-min is
Anishinaabemowin for "We are all teachers" and its main purpose is "to document the history of Native Americans in Grand Rapids, Michigan."
[Shell-Weiss, M. Benefiel, A. & McKee, K. (2017). We are all teachers: A collaborative approach to digital collection development. ''Collection Management'', 42(3-4), 317-337. https://doi.org/10.1080/01462679.2017.1344597] It combines new audio and video oral histories with digitized flyers, posters, and newsletters from
Grand Valley State University
Grand Valley State University (GVSU, GV, or Grand Valley) is a public university in Allendale, Michigan. It was established in 1960 as Grand Valley State College. Its main campus is situated on approximately west of Grand Rapids. The universit ...
's analog collections.
Although not entirely a newly digitized project, what was created also added item-level metadata to enhance context. At the start, collaboration between several university departments and the Native American population was deemed important and remained strong throughout the project.
SAADA
The
South Asian American Digital Archive
The South Asian American Digital Archive (SAADA) is a 501(c)(3) not-for-profit organization that archives materials associated with the history of South Asian Americans.
History
SAADA was established in 2008 to preserve, document, and share the re ...
(SAADA) has no physical building, is entirely digital and everything is handled by volunteers.
[Caswell, M. (2015, April 24). Community-centered collecting: finding out what communities want from community archives. ''Proceedings of the American Society for Information Science and Technology,'' 51(1), 1-9. https://doi.org/10.1002/meet.2014.14505101027] This archive was started by Michelle Caswell and Samip Mallick and collects a broad variety of materials "created by or about people residing in the United States who trace their heritage to Bangladesh, Bhutan, India, Maldives, Nepal, Pakistan, Sri Lanka, and the many South Asian diaspora communities across the globe."
(Caswell, 2015, 2). The collection of digitized items includes private, government, and university held materials.
Black Campus Movement Collection (BCM)
Kent State University began its BCM collection when it acquired the papers of African American alumnus Lafayette Tolliver, which included about 1,000 photographs that chronicled the black student experience at Kent State from 1968-1971.
The collection continues to add materials from the 1960s up to and including the current student body and several oral histories have been added since it debuted.
When digitizing the items, it was necessary to work with alumni to create descriptions for the images. This collaboration created changes in local controlled vocabularies the libraries used to create metadata for the images.
Mass digitization
The expectation that everything should be online has led to mass digitization practices, but it is an ongoing process with obstacles that have led to alternatives.
[Erway, R. (2008, December). Supply and demand: Special collections and digitisation. ''Liber Quarterly, 18''(3/4), 324-336.] As new technology makes automated scanning of materials safer for materials and decreases need for cropping and de-skewing, mass digitization should be able to increase.
Obstacles
Digitization can be a physically slow process involving selection and preparation of collections that can take years if materials need to be compared for completeness or are vulnerable to damage.
[Verheusen, A. (2008). Mass digitization by libraries: Issues concerning organisation, quality and Efficiency. ''Liber Quarterly'', 18(1), 28-38.] Price of specialized equipment, storage costs, website maintenance, quality control, and retrieval system limitations all add to the problems of working on a large scale.
Successes
Digitization on demand
Scanning materials as users ask for them, provides copies for others to use and cuts down on repeated copying of popular items. If one part of a folder, document, or book is asked for, scanning the entire object can save time in the future by already having the material access if someone else needs the material.
Digitizing on demand can increase volume because time spent on selection and prep has been used on scanning instead.
Google Books
From the start, Google has concentrated on text rather than images or special collections.
Although criticized in the past for poor image quality, selection practices, and lacking long-term preservation plans, their focus on quantity over quality has enabled Google to digitize more books than other digitizers.
Standards
Digitization is not a static field and standards change with new technology, so it is up to digitization managers to stay current with new developments.
[Northeast Document Conservation Center. (n.d.). ''Session 7: Reformatting and digitization''. Preservation 101. Retrieved December 15, 2021, from https://www.nedcc.org/preservation101/session-7/7digitization] Although each digitization project is different, common standards in formats, metadata, quality, naming, and file storage should be used to give the best chance of interoperability and patron access. As digitization is often the first step in digital preservation, questions about how to handle digital files should be addressed in institutional standards.
[Daigle, B. J. (2012). The digital transformation of special collections. ''Journal of Library Administration, 52''(3-4), 244-254. https://doi.org/10.1080/01930826.2012.684504]
A standard for still images adapted from the Smithsonian digitization standards might include the following:
[Smithsonian Institution Archives. (n.d.). ''Digitizing collections.'' Retrieved October 10, 2021, from https://siarchives.si.edu/what-we-do/digital-curation/digitizing-collections]
Resources to create local standards are available from the
Society of American Archivists, the
Smithsonian, and the
Northeast Document Conservation Center
The Northeast Document Conservation Center (NEDCC) is the first non-profit conservation center in the United States to specialize in the preservation of paper-based library and archival materials, founded in 1973.
The Center was initiated by the s ...
.
Implications
Cultural heritage concerns
Digitization of community archives by indigenous and other marginalized people has led to traditional memory institutions reassessing how they digitize and handle objects in their collections that may have ties to these groups.
The topics they are rethinking are varied and include how items are chosen for digitization projects, what metadata to use to convey proper context to be retrievable by the groups they represent, and whether an item should be accessed by the world or just those who the groups originally intended to have access, such as elders.
[Manzuch, Z. (2017). Ethical issues in digitization of cultural heritage. ''Journal of Contemporary Archival Studies, 4(''2), article 4. http://elischolar.library.yale.edu/jcas/vol4/iss2/4] Many navigate these concerns by collaborating with the communities they seek to represent through their digitized collections.
Lean philosophy
The broad use of internet and the increasing popularity of
lean philosophy has also increased the use and meaning of "digitizing" to describe improvements in the efficiency of organizational processes. Lean philosophy refers to the approach which considers any use of time and resources, which does not lead directly to creating a product, as waste and therefore a target for elimination.
This will often involve some kind of Lean process in order to simplify process activities, with the aim of implementing new "lean and mean" processes by digitizing data and activities. Digitization can help to eliminate time waste by introducing wider access to data, or by the implementation of enterprise resource planning systems.
Fiction
Works of science-fiction often include the term digitize as the act of transforming people into
digital signal
A digital signal is a signal that represents data as a sequence of discrete values; at any given time it can only take on, at most, one of a finite number of values. This contrasts with an analog signal, which represents continuous values; at ...
s and sending them into
digital technology. When that happens, the people disappear from the
real world and appear in a
virtual world (as featured in the
cult film
A cult film or cult movie, also commonly referred to as a cult classic, is a film that has acquired a cult following. Cult films are known for their dedicated, passionate fanbase which forms an elaborate subculture, members of which engage ...
''
Tron'', the
animated series ''
Code: Lyoko'', or the late 1980s live-action series ''
Captain Power and the Soldiers of the Future''). In the
video game ''
Beyond Good & Evil
''Beyond Good & Evil'' is a 2003 action-adventure video game developed and published by Ubisoft for the PlayStation 2, Microsoft Windows, Xbox and GameCube platforms. The story follows the adventures of Jade, an investigative reporter and m ...
'', the
protagonist's holographic friend digitizes the player's inventory
items
Item may refer to:
Organizations
* '' Instituto del Tercer Mundo'' (ITeM), the Third World Institute
* ITEM club, an economic forecasting group based in the United Kingdom
Newspapers
* '' The Item'', an American independent, morning newspap ...
. One
Super Friends cartoon episode showed
Wonder Woman and
Jayna freeing the world's men (including the male super heroes) onto computer tape by the female villainess Medula.
[The Mind Maidens. Aired Nov. 5 1977 on the ABC Network along with other segments.]
See also
*
Book scanning
*
Digital audio
Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, sa ...
*
Digital library
*
Digital television
*
Economics of digitization
*
ENUMERATE
*
Frame grabber
*
Graphics tablet
*
Newspaper digitization
*
Optical character recognition
*
Raster graphics
upright=1, The Smiley, smiley face in the top left corner is a raster image. When enlarged, individual pixels appear as squares. Enlarging further, each pixel can be analyzed, with their colors constructed through combination of the values for ...
*
Raster image
*
Raster to vector
*
Scannebago The Scannebago concept can be attributed to Emily Gore, currently the Director for Content for the Digital Public Library of America. The premise behind the "Scannebago" is to work with donors and other funders to create a group of mobile scan cent ...
*
Vector graphics
References
Further reading
*Anderson, Cokie G.; Maxwell, David C, ''Starting a Digitization Center'', Chandos Publishing, 2004,
*Bulow, Anna; Ahmon, Jess, ''Preparing Collections for Digitization'', Facet Publishing, 2010,
*Perrin, Joy, ‘’Digitization of Flat Media: Principles and Practices’’, Rowman & Littlefield Publishers, 2015,
*Piepenburg, Scott, "Digitizing Audiovisual and Nonprint Materials: the Innovative Librarian's Guide", Libraries Unlimited, 2015,
*Robinson, Peter, ''Digitization of Primary Textual Sources'', Office for Humanities Communication, 1993,
*S Ross; I Anderson; C Duffy; M Economou; A Gow; P McKinney; R Sharp; The NINCH Working Group on Best Practices
Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials Washington DC: NINCH, 2002.
*Speranski, V
Challenges in AV Digitization and Digital Preservation'The Library of Congress National Recording Preservation Plan'
{{Authority control
Data transmission
Mass digitization
Digital preservation