
Digital humanities (DH) is an area of scholarly activity at the intersection of
computing
Computing is any goal-oriented activity requiring, benefiting from, or creating computing machinery. It includes the study and experimentation of algorithmic processes, and development of both hardware and software. Computing has scientific, ...
or
digital technologies and the disciplines of the
humanities
Humanities are academic disciplines that study aspects of human society and culture. In the Renaissance, the term contrasted with divinity and referred to what is now called classics, the main area of secular study in universities at th ...
. It includes the systematic use of digital resources in the humanities, as well as the analysis of their application.
DH can be defined as new ways of doing scholarship that involve collaborative, transdisciplinary, and computationally engaged research, teaching, and publishing.
It brings digital tools and methods to the study of the humanities with the recognition that the printed word is no longer the main medium for knowledge production and distribution.
By producing and using new applications and techniques, DH makes new kinds of teaching possible, while at the same time studying and critiquing how these impact cultural heritage and digital culture.
DH is also applied in research. Thus, a distinctive feature of DH is its cultivation of a two-way relationship between the humanities and the digital: the field both employs technology in the pursuit of humanities research and subjects technology to humanistic questioning and interrogation, often simultaneously.
Definition
The definition of the digital humanities is being continually formulated by scholars and practitioners. Since the field is constantly growing and changing, specific definitions can quickly become outdated or unnecessarily limit future potential. The second volume of ''Debates in the Digital Humanities'' (2016) acknowledges the difficulty in defining the field: "Along with the digital archives, quantitative analyses, and tool-building projects that once characterized the field, DH now encompasses a wide range of methods and practices: visualizations of large image sets, 3D modeling of historical artifacts, 'born digital' dissertations,
hashtag activism
Hashtag activism refers to the use of Twitter's hashtags for Internet activism. The hashtag, has become one of the many ways that social media contributes to civic engagement and social movements. The use of the hashtag on social media provides u ...
and the analysis thereof,
alternate reality game
An alternate reality game (ARG) is an interactive networked narrative that uses the real world as a platform and employs transmedia storytelling to deliver a story that may be altered by players' ideas or actions.
The form is defined by inten ...
s, mobile makerspaces, and more. In what has been called 'big tent' DH, it can at times be difficult to determine with any specificity what, precisely, digital humanities work entails."
Historically, the digital humanities developed out of humanities computing and has become associated with other fields, such as humanistic computing, social computing, and media studies. In concrete terms, the digital humanities embraces a variety of topics, from curating online collections of primary sources (primarily textual) to the
data mining of large cultural data sets to
topic modeling. Digital humanities incorporates both digitized (remediated) and
born-digital
The term born-digital refers to materials that originate in a digital form.NDIIPP"Preserving Digital Culture,"Library of Congress. This is in contrast to digital reformatting, through which analog materials become digital, as in the case of fil ...
materials and combines the methodologies from traditional humanities disciplines (such as
rhetoric
Rhetoric () is the art of persuasion, which along with grammar and logic (or dialectic), is one of the three ancient arts of discourse. Rhetoric aims to study the techniques writers or speakers utilize to inform, persuade, or motivate par ...
,
history
History (derived ) is the systematic study and the documentation of the human activity. The time period of event before the History of writing#Inventions of writing, invention of writing systems is considered prehistory. "History" is an umbr ...
,
philosophy,
linguistics
Linguistics is the scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure. Lingu ...
,
literature
Literature is any collection of written work, but it is also used more narrowly for writings specifically considered to be an art form, especially prose fiction, drama, and poetry. In recent centuries, the definition has expanded to inclu ...
,
art
Art is a diverse range of human activity, and resulting product, that involves creative or imaginative talent expressive of technical proficiency, beauty, emotional power, or conceptual ideas.
There is no generally agreed definition of wha ...
,
archaeology
Archaeology or archeology is the scientific study of human activity through the recovery and analysis of material culture. The archaeological record consists of Artifact (archaeology), artifacts, architecture, biofact (archaeology), biofacts ...
,
music
Music is generally defined as the The arts, art of arranging sound to create some combination of Musical form, form, harmony, melody, rhythm or otherwise Musical expression, expressive content. Exact definition of music, definitions of mu ...
, and
cultural studies) and social sciences,
with tools provided by
computing
Computing is any goal-oriented activity requiring, benefiting from, or creating computing machinery. It includes the study and experimentation of algorithmic processes, and development of both hardware and software. Computing has scientific, ...
(such as
hypertext
Hypertext is text displayed on a computer display or other electronic devices with references ( hyperlinks) to other text that the reader can immediately access. Hypertext documents are interconnected by hyperlinks, which are typicall ...
,
hypermedia
Hypermedia, an extension of the term hypertext, is a nonlinear medium of information that includes graphics, audio, video, plain text and hyperlinks. This designation contrasts with the broader term ''multimedia'', which may include non-interact ...
,
data visualisation
Data and information visualization (data viz or info viz) is an interdisciplinary field that deals with the graphic representation of data and information. It is a particularly efficient way of communicating when the data or information is num ...
,
information retrieval, data mining,
statistics,
text mining
Text mining, also referred to as ''text data mining'', similar to text analytics, is the process of deriving high-quality information from text. It involves "the discovery by computer of new, previously unknown information, by automatically extract ...
,
digital mapping
Digital mapping (also called digital or computer cartography) is the process by which a collection of spatial data is compiled and formatted into a virtual image on a computer. The primary function of this technology is to produce maps that give a ...
), and
digital publishing
Electronic publishing (also referred to as publishing, digital publishing, or online publishing) includes the digital publication of e-books, digital magazines, and the development of digital libraries and catalogues. It also includes the editin ...
. Related subfields of digital humanities have emerged like
software studies
Software studies is an emerging interdisciplinary research field, which studies software systems and their social and cultural effects. The implementation and use of software has been studied in recent fields such as cyberculture, Internet st ...
, platform studies, and
critical code studies
Critical code studies (CCS) is an emerging academic subfield, related to software studies, digital humanities, cultural studies, computer science, human–computer interface, and the do-it-yourself maker culture. Its primary focus is on the cu ...
. Fields that parallel the digital humanities include
new media studies
New media studies is an academic discipline that explores the intersections of computing, science, the humanities, and the visual and performing arts. Janet Murray, a prominent researcher in the discipline, describes this intersection as "a sing ...
and
information science
Information science (also known as information studies) is an academic field which is primarily concerned with analysis, collection, classification, manipulation, storage, retrieval, movement, dissemination, and protection of information. ...
as well as
media theory of composition Commonly called new media theory or media-centered theory of composition, stems from the rise of computers as word processing tools. Media theorists now also examine the rhetorical strengths and weakness of different media, and the implications the ...
,
game studies
Game studies, also known as ludology (from ''ludus'', "game", and ''-logia'', "study", "research"), is the study of games, the act of playing them, and the players and cultures surrounding them. It is a field of cultural studies that deals with ...
, particularly in areas related to digital humanities project design and production, and
cultural analytics Cultural analytics refers to the use of computational, visualization, and big data methods for the exploration of contemporary and historical cultures. While digital humanities research has focused on text data, cultural analytics has a particular ...
. Each disciplinary field and each country has its own unique history of digital humanities.

Berry and Fagerjord have suggested that a way to reconceptualise digital humanities could be through a "digital humanities stack". They argue that "this type of diagram is common in computation and computer science to show how technologies are 'stacked' on top of each other in increasing levels of abstraction. Here,
hey
Hey or Hey! may refer to:
Music
* Hey (band), a Polish rock band
Albums
* ''Hey'' (Andreas Bourani album) or the title song (see below), 2014
* ''Hey!'' (Julio Iglesias album) or the title song, 1980
* ''Hey!'' (Jullie album) or the title s ...
use the method in a more illustrative and creative sense of showing the range of activities, practices, skills, technologies and structures that could be said to make up the digital humanities, with the aim of providing a high-level map." Indeed, the "diagram can be read as the bottom levels indicating some of the fundamental elements of the digital humanities stack, such as computational thinking and knowledge representation, and then other elements that later build on these. "
In practical terms, a major distinction within digital humanities is the focus on the data being processed. For processing textual data, digital humanities builds on a long and extensive history of
digital edition
A digital edition is an online magazine or online newspaper delivered in electronic form which is formatted identically to the print version. Digital editions are often called digital facsimiles to underline the likeness to the print version. Dig ...
,
computational linguistics
Computational linguistics is an Interdisciplinarity, interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, comput ...
and
natural language processing
Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to proc ...
and developed an independent and highly specialized technology stack (largely cumulating in the specifications of the
Text Encoding Initiative
The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and main ...
). This part of the field is sometimes thus set apart from Digital Humanities in general as 'digital philology' or 'computational philology'. For the creation and analysis of digital editions of objects or artifacts, digital philologists have access to digital practices, methods, and technologies such as
optical character recognition
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a sc ...
that are providing opportunities to adapt the field to the digital age.
History
Digital humanities descends from the field of humanities computing, whose origins reach back to 1940s and 50s, in the pioneering work of Jesuit scholar
Roberto Busa
Roberto Busa (November 28, 1913 – August 9, 2011) was an Italian Jesuit priest and one of the pioneers in the usage of computers for linguistic and literary analysis. He was the author of the '' Index Thomisticus'', a complete lemmatization of ...
, which began in 1946, and of English professor
Josephine Miles, beginning in the early 1950s.
In collaboration with
IBM, Busa and his team created a computer-generated concordance to
Thomas Aquinas
Thomas Aquinas, OP (; it, Tommaso d'Aquino, lit=Thomas of Aquino; 1225 – 7 March 1274) was an Italian Dominican friar and priest who was an influential philosopher, theologian and jurist in the tradition of scholasticism; he is known wi ...
' writings known as the ''
Index Thomisticus
The ''Index Thomisticus'' was a digital humanities project begun in the 1940s that created a concordance to 179 texts centering around Thomas Aquinas. Led by Roberto Busa, the project indexed 10,631,980 words over the course of 34 years, initial ...
''.
Busa's works have been collected and translated by
Julianne Nyhan and Marco Passarotti. Other scholars began using mainframe computers to automate tasks like word-searching, sorting, and counting, which was much faster than processing information from texts with handwritten or typed index cards.
Similar first advances were made by Gerhard Sperl in
Austria
Austria, , bar, Östareich officially the Republic of Austria, is a country in the southern part of Central Europe, lying in the Eastern Alps. It is a federation of nine states, one of which is the capital, Vienna, the most populous ...
using computers by
Zuse for Digital
Assyriology
Assyriology (from Greek , ''Assyriā''; and , '' -logia'') is the archaeological, anthropological, and linguistic study of Assyria and the rest of ancient Mesopotamia (a region that encompassed what is now modern Iraq, northeastern Syria, southe ...
.
In the decades which followed archaeologists, classicists, historians, literary scholars, and a broad array of humanities researchers in other disciplines applied emerging computational methods to transform humanities scholarship.
As Tara McPherson has pointed out, the digital humanities also inherit practices and perspectives developed through many artistic and theoretical engagements with electronic screen culture beginning the late 1960s and 1970s. These range from research developed by organizations such as
SIGGRAPH
SIGGRAPH (Special Interest Group on Computer Graphics and Interactive Techniques) is an annual conference on computer graphics (CG) organized by the ACM SIGGRAPH, starting in 1974. The main conference is held in North America; SIGGRAPH Asia ...
to creations by artists such as
Charles and Ray Eames
Charles Eames ( Charles Eames, Jr) and Ray Eames ( Ray-Bernice Eames) were an American married couple of industrial designers who made significant historical contributions to the development of modern architecture and furniture through the work of ...
and the members of
E.A.T. (Experiments in Art and Technology). The Eames and E.A.T. explored nascent computer culture and intermediality in creative works that dovetailed technological innovation with art.
The first specialized journal in the digital humanities was ''Computers and the Humanities'', which debuted in 1966. The
Computer Applications and Quantitative Methods in Archaeology (CAA) association was founded in 1973. The Association for Literary and Linguistic Computing (ALLC) and the Association for Computers and the Humanities (ACH) were then founded in 1977 and 1978, respectively.
Soon, there was a need for a standardized protocol for tagging digital texts, and the
Text Encoding Initiative
The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and main ...
(TEI) was developed.
The TEI project was launched in 1987 and published the first full version of the ''TEI Guidelines'' in May 1994.
TEI helped shape the field of electronic textual scholarship and led to
Extensible Markup Language
Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing arbitrary data. It defines a set of rules for encoding electronic document, documents in a format that is both Human-readable med ...
(XML), which is a tag scheme for digital editing. Researchers also began experimenting with databases and hypertextual editing, which are structured around links and nodes, as opposed to the standard linear convention of print.
In the nineties, major digital text and image archives emerged at centers of humanities computing in the U.S. (e.g. the ''
Women Writers Project The Northeastern University Women Writers Project (formerly the Brown University Women Writers Project) or WWP, founded in 1986 at Brown University, is a long-term research and publication project which focuses on making texts from early modern wome ...
'', the ''Rossetti Archive'', and ''
The William Blake Archive
The William Blake Archive is a digital humanities project started in 1994, a first version of the website was launched in 1996.{{cite journal, last1=Crawford, first1=Kendal, last2=Levy, first2=Michelle, journal=RIDE: A Review Journal for Digital E ...
''), which demonstrated the sophistication and robustness of text-encoding for literature. The advent of personal computing and the World Wide Web meant that Digital Humanities work could become less centered on text and more on design. The multimedia nature of the internet has allowed Digital Humanities work to incorporate audio, video, and other components in addition to text.
The terminological change from "humanities computing" to "digital humanities" has been attributed to
John Unsworth, Susan Schreibman, and Ray Siemens who, as editors of the anthology ''A Companion to Digital Humanities'' (2004), tried to prevent the field from being viewed as "mere digitization".
Consequently, the hybrid term has created an overlap between fields like rhetoric and composition, which use "the methods of contemporary humanities in studying digital objects",
and digital humanities, which uses "digital technology in studying traditional humanities objects".
The use of computational systems and the study of computational media within the
humanities, arts and social sciences more generally has been termed the 'computational turn'.
In 2006 the
National Endowment for the Humanities
The National Endowment for the Humanities (NEH) is an independent federal agency of the U.S. government, established by thNational Foundation on the Arts and the Humanities Act of 1965(), dedicated to supporting research, education, preserv ...
(NEH) launched the Digital Humanities Initiative (renamed Office of Digital Humanities in 2008), which made widespread adoption of the term "digital humanities" in the United States.
Digital humanities emerged from its former niche status and became "big news"
at the 2009
MLA convention in Philadelphia, where digital humanists made "some of the liveliest and most visible contributions" and had their field hailed as "the first 'next big thing' in a long time."
In November 2018, the 10th Global Peter Drucker Forum was about the theme: “Management. The human dimension”. Among the articles presented the one that left its mark in the field of digital humanities was:
Values and methods
Although digital humanities projects and initiatives are diverse, they often reflect common values and methods.
These can help in understanding this hard-to-define field.
Values
* Critical and theoretical
* Iterative and experimental
* Collaborative and distributed
* Multimodal and performative
* Open and accessible
Methods
* Enhanced critical curation
* Augmented editions and fluid textuality
* Scale: the law of large numbers
* Distant/close, macro/micro, surface/depth
* Cultural analytics, aggregation, and data-mining
* Visualization and data design
* Locative investigation and thick mapping
* The animated archive
* Distributed knowledge production and performative access
* Humanities gaming
* Code, software, and platform studies
* Database documentaries
* Repurposable content and remix culture
* Pervasive infrastructure
* Ubiquitous scholarship
In keeping with the value of being open and accessible, many digital humanities projects and journals are
open access
Open access (OA) is a set of principles and a range of practices through which research outputs are distributed online, free of access charges or other barriers. With open access strictly defined (according to the 2001 definition), or libre o ...
and/or under
Creative Commons
Creative Commons (CC) is an American non-profit organization and international network devoted to educational access and expanding the range of creative works available for others to build upon legally and to share. The organization has releas ...
licensing, showing the field's "commitment to
open standards
An open standard is a standard that is openly accessible and usable by anyone. It is also a prerequisite to use open license, non-discrimination and extensibility. Typically, anybody can participate in the development. There is no single definitio ...
and
open source
Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
." Open access is designed to enable anyone with an internet-enabled device and internet connection to view a website or read an article without having to pay, as well as share content with the appropriate permissions.
Digital humanities scholars use computational methods either to answer existing research questions or to challenge existing theoretical paradigms, generating new questions and pioneering new approaches. One goal is to systematically integrate computer technology into the activities of humanities scholars,
as is done in contemporary empirical
social sciences
Social science is one of the branches of science, devoted to the study of society, societies and the Social relation, relationships among individuals within those societies. The term was formerly used to refer to the field of sociology, the o ...
. Yet despite the significant trend in digital humanities towards networked and multimodal forms of knowledge, a substantial amount of digital humanities focuses on documents and text in ways that differentiate the field's work from digital research in
media studies,
information studies
Information science (also known as information studies) is an academic field which is primarily concerned with analysis, collection, classification, manipulation, storage, retrieval, movement, dissemination, and protection of information. P ...
,
communication studies
Communication studies or communication science is an academic discipline that deals with processes of human communication and behavior, patterns of communication in interpersonal relationships, social interactions and communication in diffe ...
, and
sociology
Sociology is a social science that focuses on society, human social behavior, patterns of social relationships, social interaction, and aspects of culture associated with everyday life. It uses various methods of empirical investigation and ...
. Another goal of digital humanities is to create scholarship that transcends textual sources. This includes the integration of
multimedia
Multimedia is a form of communication that uses a combination of different content forms such as text, audio, images, animations, or video into a single interactive presentation, in contrast to tradi ...
,
metadata, and dynamic environments (see
The Valley of the Shadow
The Valley of the Shadow is a digital history project about the American Civil War, launched in 1993 and hosted by the University of Virginia. It details the experiences of Confederate soldiers from Augusta County, Virginia and Union soldiers fr ...
project at the
University of Virginia
The University of Virginia (UVA) is a public research university in Charlottesville, Virginia. Founded in 1819 by Thomas Jefferson, the university is ranked among the top academic institutions in the United States, with College admission ...
, the
at
University of Southern California
, mottoeng = "Let whoever earns the palm bear it"
, religious_affiliation = Nonsectarian—historically Methodist
, established =
, accreditation = WSCUC
, type = Private research university
, academic_affiliations =
, endowment = $8. ...
, or Digital Pioneers projects at Harvard). A growing number of researchers in digital humanities are using computational methods for the analysis of large cultural data sets such as the
Google Books
Google Books (previously known as Google Book Search, Google Print, and by its code-name Project Ocean) is a service from Google Inc. that searches the full text of books and magazines that Google has scanned, converted to text using optical ...
corpus.
[Roth, S. (2014), "Fashionable functions. A Google n-gram view of trends in functional differentiation (1800-2000)", ''International Journal of Technology and Human Interaction'', Band 10, Nr. 2, S. 34-58 (online: http://ssrn.com/abstract=2491422).] Examples of such projects were highlighted by the Humanities High Performance Computing competition sponsored by the Office of Digital Humanities in 2008, and also by the Digging Into Data challenge organized in 2009 and 2011 by NEH in collaboration with NSF,
and in partnership with
JISC
Jisc is a United Kingdom not-for-profit company that provides network and IT services and digital resources in support of further and higher education institutions and research as well as not-for-profits and the public sector.
History
T ...
in the UK, and
SSHRC
The Social Sciences and Humanities Research Council of Canada (SSHRC; french: Conseil de recherches en sciences humaines du Canada, CRSH) is a Canadian federal research-funding agency that promotes and supports post-secondary research and traini ...
in Canada. In addition to books, historical newspapers can also be analyzed with big data methods. The analysis of vast quantities of historical newspaper content has showed how periodic structures can be automatically discovered, and a similar analysis was performed on social media. As part of the big data revolution,
gender bias
Sexism is prejudice or discrimination based on one's sex or gender. Sexism can affect anyone, but it primarily affects women and girls.There is a clear and broad consensus among academic scholars in multiple fields that sexism refers primaril ...
,
readability
Readability is the ease with which a reader can understand a written text. In natural language, the readability of text depends on its content (the complexity of its vocabulary and syntax) and its presentation (such as typographic aspects t ...
, content similarity, reader preferences, and even mood have been analyzed based on
text mining
Text mining, also referred to as ''text data mining'', similar to text analytics, is the process of deriving high-quality information from text. It involves "the discovery by computer of new, previously unknown information, by automatically extract ...
methods over millions of documents
and historical documents written in literary Chinese.
[Bol, P. K., C.-L. Liu, and H. Wang. (2015) "Mining and discovering biographical information in Difangzhi with a language-model-based approach", ''Proceedings of the 2015 International Conference on Digital Humanities''. (https://arxiv.org/abs/1504.02148)]
Digital humanities is also involved in the creation of software, providing "environments and tools for producing, curating, and interacting with knowledge that is 'born digital' and lives in various digital contexts." In this context, the field is sometimes known as computational humanities.
Tools
Digital humanities scholars use a variety of digital tools for their research, which may take place in an environment as small as a mobile device or as large as a
virtual reality
Virtual reality (VR) is a simulated experience that employs pose tracking and 3D near-eye displays to give the user an immersive feel of a virtual world. Applications of virtual reality include entertainment (particularly video games), e ...
lab. Environments for "creating, publishing and working with digital scholarship include everything from personal equipment to institutes and software to cyberspace." Some scholars use advanced programming languages and databases, while others use less complex tools, depending on their needs. DiRT (Digital Research Tools Directory) offers a registry of digital research tools for scholars. TAPoR (Text Analysis Portal for Research) is a gateway to text analysis and retrieval tools. An accessible, free example of an online textual analysis program is
Voyant Tools, which only requires the user to copy and paste either a body of text or a URL and then click the 'reveal' button to run the program. There is also an online list of online or downloadable Digital Humanities tools that are largely free, aimed toward helping students and others who lack access to funding or institutional servers. Free, open source web publishing platforms like
WordPress
WordPress (WP or WordPress.org) is a free and open-source software, free and open-source content management system (CMS) written in PHP, hypertext preprocessor language and paired with a MySQL or MariaDB database with supported secure hypert ...
and
Omeka are also popular tools.
Projects
Digital humanities projects are more likely than traditional humanities work to involve a team or a lab, which may be composed of faculty, staff, graduate or undergraduate students, information technology specialists, and partners in galleries, libraries, archives, and museums. Credit and authorship are often given to multiple people to reflect this collaborative nature, which is different from the sole authorship model in the traditional humanities (and more like the natural sciences).
There are thousands of digital humanities projects, ranging from small-scale ones with limited or no funding to large-scale ones with multi-year financial support. Some are continually updated while others may not be due to loss of support or interest, though they may still remain online in either a
beta version
A software release life cycle is the sum of the stages of development and maturity for a piece of computer software ranging from its initial development to its eventual release, and including updated versions of the released version to help impr ...
or a finished form. The following are a few examples of the variety of projects in the field:
Digital archives
The
Women Writers Project The Northeastern University Women Writers Project (formerly the Brown University Women Writers Project) or WWP, founded in 1986 at Brown University, is a long-term research and publication project which focuses on making texts from early modern wome ...
(begun in 1988) is a long-term research project to make pre-Victorian women writers more accessible through an electronic collection of rare texts. The Walt Whitman Archive (begun in the 1990s) sought to create a hypertext and scholarly edition of
Whitman's works and now includes photographs, sounds, and the only comprehensive current bibliography of Whitman criticism. The Emily Dickinson Archive (begun in 2013) is a collection of high-resolution images of
Dickinson's poetry manuscripts as well as a searchable lexicon of over 9,000 words that appear in the poems.

The Slave Societies Digital Archive (formerly Ecclesiastical and Secular Sources for Slave Societies), directed by Jane Landers and hosted at Vanderbilt University, preserves endangered ecclesiastical and secular documents related to Africans and African-descended peoples in slave societies. This Digital Archive currently holds 500,000 unique images, dating from the 16th to the 20th centuries, and documents the history of between 6 and 8 million individuals. They are the most extensive serial records for the history of Africans in the Atlantic World and also include valuable information on the indigenous, European, and Asian populations who lived alongside them.
The involvement of librarians and archivists plays an important part in digital humanities projects because of the recent expansion of their role so that it now covers
digital curation Digital curation is the selection, preservation, maintenance, collection and archiving of digital assets.
Digital curation establishes, maintains and adds value to repositories of digital data for present and future use. This is often accomplish ...
, which is critical in the preservation, promotion, and access to digital collections, as well as the application of scholarly orientation to digital humanities projects. A specific example involves the case of initiatives where archivists help scholars and academics build their projects through their experience in evaluating, implementing, and customizing metadata schemas for library collections.
The initiatives at the
National Autonomous University of Mexico
The National Autonomous University of Mexico ( es, Universidad Nacional Autónoma de México, UNAM) is a public research university in Mexico. It is consistently ranked as one of the best universities in Latin America, where it's also the bigge ...
is another example of a digital humanities project. These include the digitization of 17th-century manuscripts, an electronic corpus of Mexican history from the 16th to 19th century, and the visualization of pre-Hispanic archaeological sites in
3-D
3-D, 3D, or 3d may refer to:
Science, technology, and mathematics Relating to three-dimensionality
* Three-dimensional space
** 3D computer graphics, computer graphics that use a three-dimensional representation of geometric data
** 3D film, a ...
.
Cultural analytics
"Cultural analytics" refers to the use of computational method for exploration and analysis of large visual collections and also contemporary digital media. The concept was developed in 2005 by
Lev Manovich
Lev Manovich ( ) is an author of books on digital culture and new media, and professor of Computer Science at the Graduate Center, City University of New York. Manovich's current research and teaching focuses on digital humanities, social comput ...
who then established the Cultural Analytics Lab in 2007 at Qualcomm Institute at California Institute for Telecommunication and Information (Calit2). The lab has been using methods from the field of computer science called Computer Vision many types of both historical and contemporary visual media—for example, all covers of ''Time'' magazine published between 1923 and 2009, 20,000 historical art photographs from the collection in Museum of Modern Art (MoMA) in New York, one million pages from Manga books, and 16 million images shared on Instagram in 17 global cities. Cultural analytics also includes using methods from media design and data visualization to create interactive visual interfaces for exploration of large visual collections e.g., Selfiecity and On Broadway.
Cultural analytics research is also addressing a number of theoretical questions. How can we "observe" giant cultural universes of both user-generated and professional media content created today, without reducing them to averages, outliers, or pre-existing categories? How can work with large cultural data help us question our stereotypes and assumptions about cultures? What new theoretical cultural concepts and models are required for studying global digital culture with its new mega-scale, speed, and connectivity?
The term "cultural analytics" (or "culture analytics") is now used by many other researchers, as exemplified by two academic symposiums, a four-month long research program at UCLA that brought together 120 leading researchers from university and industry labs, an academic peer-review ''Journal of Cultural Analytics: CA'' established in 2016, and academic job listings.
Textual mining, analysis, and visualization
WordHoard (begun in 2004) is a free application that enables scholarly but non-technical users to read and analyze, in new ways, deeply-tagged texts, including the canon of Early Greek epic,
Chaucer
Geoffrey Chaucer (; – 25 October 1400) was an English poet, author, and civil servant best known for '' The Canterbury Tales''. He has been called the "father of English literature", or, alternatively, the "father of English poetry". He w ...
,
Shakespeare
William Shakespeare ( 26 April 1564 – 23 April 1616) was an English playwright, poet and actor. He is widely regarded as the greatest writer in the English language and the world's pre-eminent dramatist. He is often called England's natio ...
, and
Spenser. The Republic of Letters (begun in 2008) seeks to visualize the social network of Enlightenment writers through an interactive map and visualization tools. Network analysis and data visualization is also used for reflections on the field itself – researchers may produce network maps of social media interactions or infographics from data on digital humanities scholars and projects.

Document in Context of its Time (DICT) analysis style and an onlin
demo toolallow in an interactive way let users know whether the vocabulary used by an author of an input text was frequent at the time of text creation, whether the author used anachronisms or neologisms, and enables detecting terms in text that underwent considerable semantic change.
Analysis of macroscopic trends in cultural change
Culturomics
Culturomics is a form of computational lexicology that studies human behavior and cultural trends through the quantitative analysis of digitized texts. Researchers data mine large digital archives to investigate cultural phenomena reflected in la ...
is a form of
computational lexicology
Computational lexicology is a branch of computational linguistics, which is concerned with the use of computers in the study of lexicon. It has been more narrowly described by some scholars (Amsler, 1980) as the use of computers in the study of '' ...
that studies
human behavior
Human behavior is the potential and expressed capacity ( mentally, physically, and socially) of human individuals or groups to respond to internal and external stimuli throughout their life. Kagan, Jerome, Marc H. Bornstein, and Richard ...
and
cultural trends
The bandwagon effect is the tendency for people to adopt certain behaviors, styles, or attitudes simply because others are doing so. More specifically, it is a cognitive bias by which public opinion or behaviours can alter due to particular acti ...
through the
quantitative analysis
Quantitative analysis may refer to:
* Quantitative research, application of mathematics and statistics in economics and marketing
* Quantitative analysis (chemistry), the determination of the absolute or relative abundance of one or more substanc ...
of digitized texts. Researchers
data mine large
digital archive
An archive is an accumulation of historical records or materials – in any medium – or the physical facility in which they are located.
Archives contain primary source documents that have accumulated over the course of an individual or ...
s to investigate cultural phenomena reflected in language and word usage. The term is an American
neologism
A neologism Greek νέο- ''néo''(="new") and λόγος /''lógos'' meaning "speech, utterance"] is a relatively recent or isolated term, word, or phrase that may be in the process of entering common use, but that has not been fully accepted int ...
first described in a 2010 ''
Science (journal), Science'' article called ''Quantitative Analysis of Culture Using Millions of Digitized Books'', co-authored by Harvard researchers Jean-Baptiste Michel and
Erez Lieberman Aiden
Erez Lieberman Aiden (born 1980, né Erez Lieberman) is an American research scientist active in multiple fields related to applied mathematics. He is an assistant professor at the Baylor College of Medicine, and formerly a fellow at the Harvard ...
.
A 2017 study
published in the
compared the trajectory of n-grams over time in both digitised books from the 2010
Science (journal), Science article
with those found in a large corpus of regional newspapers from the United Kingdom over the course of 150 years. The study further went on to use more advanced
natural language processing
Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to proc ...
techniques to discover macroscopic trends in history and culture, including gender bias, geographical focus, technology, and politics, along with accurate dates for specific events.
The applications of digital humanities may be used along with other non humanities subject areas such as pure sciences, agriculture, management etc. to produce great variants of practical solutions to solve issues in industry as well as society.
Online publishing
The
Stanford Encyclopedia of Philosophy
The ''Stanford Encyclopedia of Philosophy'' (''SEP'') combines an online encyclopedia of philosophy with peer-reviewed publication of original papers in philosophy, freely accessible to Internet users. It is maintained by Stanford University. E ...
(begun in 1995) is a dynamic reference work of terms, concepts, and people from philosophy maintained by scholars in the field. MLA Commons offers an open peer-review site (where anyone can comment) for their ongoing curated collection of teaching artifacts in ''Digital Pedagogy in the Humanities: Concepts, Models, and Experiments'' (2016). The ''Debates in the Digital Humanities'' platform contains volumes of the open-access book of the same title (2012 and 2016 editions) and allows readers to interact with material by marking sentences as interesting or adding terms to a crowdsourced index.
Wikimedia projects
Some research institutions work with the
Wikimedia Foundation
The Wikimedia Foundation, Inc., or Wikimedia for short and abbreviated as WMF, is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California and registered as a charitable foundation under local laws. Best kno ...
or volunteers of the community, for example, to make freely licensed media files available via
Wikimedia Commons
Wikimedia Commons (or simply Commons) is a media repository of free-to-use images, sounds, videos and other media. It is a project of the Wikimedia Foundation.
Files from Wikimedia Commons can be used across all of the Wikimedia projects in ...
or to link or load data sets with
Wikidata
Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, can use under the CC0 public domain licen ...
. Text analysis has been performed on the contribution history of articles on
Wikipedia
Wikipedia is a multilingual free online encyclopedia written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and using a wiki-based editing system. Wikipedia is the largest and most-read ref ...
or its sister projects.
Criticism
In 2012, Matthew K. Gold identified a range of perceived criticisms of the field of digital humanities: "'a lack of attention to issues of race, class, gender, and sexuality; a preference for research-driven projects over pedagogical ones; an absence of political commitment; an inadequate level of diversity among its practitioners; an inability to address texts under copyright; and an institutional concentration in well-funded research universities".
Similarly Berry and Fagerjord have argued that a digital humanities should "focus on the need to think critically about the implications of computational imaginaries, and raise some questions in this regard. This is also to foreground the importance of the politics and norms that are embedded in digital technology, algorithms and software. We need to explore how to negotiate between close and distant readings of texts and how micro-analysis and macro-analysis can be usefully reconciled in humanist work."
Alan Liu has argued, "while digital humanists develop tools, data, and metadata critically, therefore (e.g., debating the 'ordered hierarchy of content objects' principle; disputing whether computation is best used for truth finding or, as Lisa Samuels and Jerome McGann put it, 'deformance'; and so on) rarely do they extend their critique to the full register of society, economics, politics, or culture."
Some of these concerns have given rise to the emergent subfield of Critical Digital Humanities (CDH):
Some key questions include: how do we make the invisible become visible in the study of software? How is knowledge transformed when mediated through code and software? What are the critical approaches to Big Data, visualization, digital methods, etc.? How does computation create new disciplinary boundaries and gate-keeping functions? What are the new hegemonic representations of the digital – 'geons', 'pixels', 'waves', visualization, visual rhetorics, etc.? How do media changes create epistemic changes, and how can we look behind the 'screen essentialism' of computational interfaces? Here we might also reflect on the way in which the practice of making-visible also entails the making-invisible – computation involves making choices about what is to be captured.
Negative publicity
Lauren F. Klein and Gold note that many appearances of the digital humanities in public media are often in a critical fashion. Armand Leroi, writing in ''
The New York Times
''The New York Times'' (''the Times'', ''NYT'', or the Gray Lady) is a daily newspaper based in New York City with a worldwide readership reported in 2020 to comprise a declining 840,000 paid print subscribers, and a growing 6 million paid ...
'', discusses the contrast between the algorithmic analysis of themes in literary texts and the work of Harold Bloom, who qualitatively and phenomenologi