The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data. Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (e.g., DOI or handle).
A Dataverse repository hosts multiple dataverses. Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data).
In 2019, Dataverse won the Duke's Choice Award for university and higher education.
Background
The Dataverse Project is housed and developed by the Dataverse Team at the Institute for Quantitative Social Science (IQSS) at Harvard University. Coding of the Dataverse (previously known as Dataverse Network) software began in 2006 under the leadership of Mercè Crosas and Gary King. The earlier Virtual Data Center (VDC) project, which spanned 1999-2006, was organized by Micah Altman, Gary King, and Sidney Verba as a collaboration between the Harvard-MIT Data Center (now part of IQSS) and the Harvard University Library. Precursors to the VDC date to 1987, comprising such entities as a stand-alone software guide to local data, preweb software, and tools to transfer cataloging information by FTP to other sites across campus automatically at designated times.
Installations
Harvard Dataverse
The Harvard Dataverse is a collaboration among the Institute for Quantitative Social Science (IQSS), the Harvard Library, and Harvard University Information Technology (HUIT). It is a repository for sharing, citing, analyzing, and preserving research data, and it is open to all scientific data from all disciplines worldwide.
Dataverse in Europe
Dataverse is also installed in countries of the European Union to preserve data collected by research communities in the Netherlands, Germany, France and Finland. The largest Dataverse repository, DataverseNL, is located in the Netherlands and provides data management services for 11 Dutch universities. A similar service is established in Norway (cf. DataverseNO).
Dataverse in Canada
In Canada, Borealis is a national instance of the Dataverse repository hosted by OCUL's Scholars Portal at the University of Toronto. Borealis allows institutions to offer a Dataverse service without operating and maintaining the software themselves. Most academic institutions offering a Dataverse service in Canada subscribe to the Borealis service. The associated community of practice is organized through the Digital Research Alliance of Canada's Network of Experts via the Dataverse North Expert Group, a coordination, collaboration and communication instance.
Dataverse installations around the world
Several other Dataverse repositories are installed at universities and organizations around the world. Some of them are:
*The Austrian Social Science Data Archive (AUSSDA)
*Odum Institute
*Dutch Universities (dataverse.nl operated by DANS)
*Fudan University
*University of Alberta Libraries
*Department of Cross Cultural and Regional Studies, University of Copenhagen (ToRS)
*ABACUS - British Columbia Research Libraries' Data Services
*Borealis, the Canadian Dataverse Repository - Scholars Portal - Ontario Council of University Libraries (OCUL)
*HeiDATA - Heidelberg University
*DataverseNO (Norwegian universities)
*CIRAD Dataverse (France)
*DataSuds (France)
*The Australian Data Archive
*Florida International University (Research Data Portal)
APIs and interoperability
The Dataverse currently has multiple open APIs available, which allow for searching, depositing and accessing data.
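As an illustration of the searching and accessing cases, the following is a minimal sketch that only builds request URLs and sends nothing over the network. The endpoint paths (`/api/search`, `/api/datasets/:persistentId/`, `/api/access/datafile/{id}`) follow the Dataverse API guides as commonly documented and are not described in this article, so they should be checked against the version an installation runs. The base URL and the DOI are placeholders, and the helper function names are hypothetical.

```python
from urllib.parse import urlencode


def build_search_url(base_url, query, item_type="dataset", per_page=10):
    """Build a Search API URL (assumed path: /api/search). No request is sent."""
    params = {"q": query, "type": item_type, "per_page": per_page}
    return f"{base_url.rstrip('/')}/api/search?{urlencode(params)}"


def build_dataset_url(base_url, persistent_id):
    """Build a URL for dataset metadata looked up by persistent identifier
    (assumed path: /api/datasets/:persistentId/)."""
    params = {"persistentId": persistent_id}
    return f"{base_url.rstrip('/')}/api/datasets/:persistentId/?{urlencode(params)}"


def build_file_access_url(base_url, file_id):
    """Build a URL for downloading one data file by numeric id
    (assumed path: /api/access/datafile/{id})."""
    return f"{base_url.rstrip('/')}/api/access/datafile/{int(file_id)}"


if __name__ == "__main__":
    base = "https://demo.dataverse.org"  # placeholder installation URL
    print(build_search_url(base, "climate change"))
    print(build_dataset_url(base, "doi:10.5072/FK2/EXAMPLE"))  # made-up DOI
    print(build_file_access_url(base, 42))  # made-up file id
```

The resulting URLs can be fetched with any HTTP client. Depositing data normally requires an API token issued by the installation, which is why it is left out of this sketch.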
Alternatives and similar projects
DSpace is often compared with Dataverse and is used for storing scientific data.
CKAN provides similar functions and is widely used for open data.
See also
*Data citation
*Data sharing