DataONE
   HOME

TheInfoList



OR:

DataONE is a network of interoperable data repositories facilitating
data sharing Data sharing is the practice of making data used for scholarly research available to other investigators. Many funding agencies, institutions, and publication venues have policies regarding data sharing because transparency and openness are consid ...
, data discovery, and open science. Originally supported by $21.2 million in funding from the US
National Science Foundation The National Science Foundation (NSF) is an independent agency of the United States government that supports fundamental research and education in all the non-medical fields of science and engineering. Its medical counterpart is the National ...
as one of the initial DataNet programs in 2009, funding was renewed in 2014 through 2020 with an additional $15 million. DataONE helps preserve, access, use, and reuse of multi-discipline scientific data through the construction of primary cyberinfrastructure and an education and outreach program. DataONE provides
scientific data archiving Research data archiving is the long-term storage of scholarly research data, including the natural sciences, social sciences, and life sciences. The various academic journals have differing policies regarding how much of their data and methods res ...
for ecological and environmental data produced by scientists. DataONE's goal is to preserve and provide access to multi-scale, multi-discipline, and multi-national data. Users include scientists, ecosystem managers, policy makers, students, educators, librarians, and the public. DataONE links together existing
cyberinfrastructure United States federal research funders use the term cyberinfrastructure to describe research environments that support advanced data acquisition, data storage, data management, data integration, data mining, data visualization and other computing ...
to provide a distributed framework, management, and technologies that enable long-term preservation of multi-scale, multi-discipline, and multi-national observational data. The distributed framework is composed of Coordinating Nodes located at the Oak Ridge Campus at Tennessee, University of California Santa Barbara, and
University of New Mexico The University of New Mexico (UNM; es, Universidad de Nuevo México) is a public research university in Albuquerque, New Mexico. Founded in 1889, it is the state's flagship academic institution and the largest by enrollment, with over 25,400 ...
, and member nodes. DataONE also provides resources including tools for accessing and using it.


Coordinating nodes

The three coordinating nodes provide network-wide services to member nodes. They are geographically replicated, with mirrored content and full copies of science
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
. William Michener of the
University of New Mexico The University of New Mexico (UNM; es, Universidad de Nuevo México) is a public research university in Albuquerque, New Mexico. Founded in 1889, it is the state's flagship academic institution and the largest by enrollment, with over 25,400 ...
(UNM) directed the project, and UNM is one of the coordinating nodes. Coordinating nodes are UNM, Oak Ridge Campus (partnership of Oak Ridge National Laboratory ( ORNL) and
University of Tennessee The University of Tennessee (officially The University of Tennessee, Knoxville; or UT Knoxville; UTK; or UT) is a public land-grant research university in Knoxville, Tennessee. Founded in 1794, two years before Tennessee became the 16th sta ...
), and the
University of California, Santa Barbara The University of California, Santa Barbara (UC Santa Barbara or UCSB) is a public land-grant research university in Santa Barbara, California with 23,196 undergraduates and 2,983 graduate students enrolled in 2021–2022. It is part of the U ...
.


Member nodes

Member nodes consist of Earth observing institutions, projects, and networks. They provide resources for their own data and replicated data, and focus on serving their specific constituencies. These member nodes are geographically distributed and include: * Cornell Lab of Ornithology
eBird eBird is an online database of bird observations providing scientists, researchers and amateur naturalists with real-time data about bird distribution and abundance. Originally restricted to sightings from the Western Hemisphere, the project ...
*
Dryad A dryad (; el, Δρυάδες, ''sing''.: ) is a tree nymph or tree spirit in Greek mythology. ''Drys'' (δρῦς) signifies " oak" in Greek, and dryads were originally considered the nymphs of oak trees specifically, but the term has evolved t ...
* Earth Data Analysis Center (EDAC) * Environmental Data for the Oak Ridge Area (EDORA) *
Ecological Society of America The Ecological Society of America (ESA) is a professional organization of ecological scientists. Based in the United States and founded in 1915, ESA publications include peer-reviewed journals, newsletters, fact sheets, and teaching resources. I ...
(ESA) Data Registry * Europe Long-Term Ecosystem Research Network (LTER Europe) *
Global Lake Ecological Observatory Network Global Lake Ecological Observatory Network (GLEON) is an international grass-roots, voluntary network of researchers, educators, and community groups interested in making and utilizing time series of high-frequency observations made on and in lakes ...
(GLEON) * Gulf of Alaska Data Portal *
International Arctic Research Center The International Arctic Research Center, or IARC, established in 1999, is a research institution focused on integrating and coordinating study of Climate change in the Arctic. The primary partners in IARC are Japan and the United States. Parti ...
(IARC) Data Archive * Knowledge Network for Biocomplexity *
Long Term Ecological Research Network The Long Term Ecological Research Network (LTER) consists of a group of over 1800 scientists and students studying ecological processes Ecosystem ecology is the integrated study of living ( biotic) and non-living (abiotic) components of ecosyst ...
(LTER) * Merritt Repository *
Minnesota Population Center The Minnesota Population Center (MPC) is a university-wide interdisciplinary research center at the University of Minnesota. MPC was established in 2000, absorbing two earlier population research organizations. The primary goals of the center are to ...
(MPC) * Montana IoE Data Repository * Nevada Research Data Center * New Mexico Experimental Program to Stimulate Competitive Research (NM EPSCoR) * NOAA National Centers for Environmental Information (NCEI) Oceanographic Dat
Archive
* ONEShare Repository * ORNL Distributed Active Archive Center * Partnership for Interdisciplinary Studies of Coastal Oceans (PISCO) * Program for Research on Biodiversity (PPPBio) * Regional and Global Biogeochemical Dynamics Data (RGD) * SANParks Data Repository * SEAD Virtual Archive *
Taiwan Forestry Research Institute The Taiwan Forestry Research Institute (TFRI; ) a research institute under the Council of Agriculture of the Taiwan (ROC) dealing with forest. History Empire of Japan TRFI was originally established as a nursery on 6 January 1896 during the J ...
* Terrestrial Ecosystem Research Network (TERN) * University of Kansas - Biodiversity Institute * USA National Phenology Network * USGS Science Data Catalog (SDC)


Investigator Tool Kit

The Tool Kit provides tools for researchers to access DataONE. These are both general purpose and discipline-specific tools, and developers adapt existing tools where possible. The tool kit includes
Java Java (; id, Jawa, ; jv, ꦗꦮ; su, ) is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea to the north. With a population of 151.6 million people, Java is the world's mo ...
and Python libraries, an R programming language plug-in for analysis, extensions for Excel, the
VisTrails VisTrails is a scientific workflow management system developed at the Scientific Computing and Imaging Institute at the University of Utah that provides support for data exploration and visualization. It is written in Python and employs Qt via ...
scientific workflow, and the Kepler scientific workflow system.


Data management

DataONE provides a place for scientists to store data and its associated
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
. The metadata makes this data searchable and accessible to other scientists. Data management practices include * Data management planning * Data acquisition (techniques, protocols, methods) * Data protection (backing up) * Data entry and manipulation (naming files, organization)
Matlab MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementat ...
, R * Quality control on data * Data analysis * Workflow tools (
VisTrails VisTrails is a scientific workflow management system developed at the Scientific Computing and Imaging Institute at the University of Utah that provides support for data exploration and visualization. It is written in Python and employs Qt via ...
, Kepler scientific workflow system) * Data documentation (
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
) * Data sharing, citation, and discovery * Data preservation and curation Some of the additional data management planning resources include: a primer for best practices, a database for best practices in data management, educational modules and tutorials, webinars, and an investigator toolkit. These have been used or adapted for use under Creative Commons license by organizations and institutions that seek to educate other communities about data and research management. Understanding different audiences of users led to the development of possible user personas as models for users such as early-career researchers, science data librarians, citizen scientists or K-12 educators.


Collaborations

DataONE collaborates with other institutions to bring together tools that help with
data management Data management comprises all disciplines related to handling data as a valuable resource. Concept The concept of data management arose in the 1980s as technology moved from sequential processing (first punched cards, then magnetic tape) to ...
practices. One of those tools, developed in collaboration with other organizations and hosted by the University of California Digital Curation Center, is the DMPTool for
data management plan A data management plan or DMP is a formal document that outlines how data are to be handled both during a research project, and after the project is completed. The goal of a data management plan is to consider the many aspects of data management, me ...
ning. The DMP Tool is used by and referenced by many research data management plans and institutions in the US and around the world. Another recent collaboration in this area is the shared construction of a Data Management Training Clearinghouse for Earth sciences, in partnership with USGS and the Community for Data Integration (CDI).


Community

The DataONE community includes research networks, professional societies, libraries, academic institutions, data centers, data repositories, environmental observatory networks, educators, scientists, policy makers, administrators, citizen scientists, international organizations, NGOs, ecosystem managers, students, private companies and the public. DataONE has a users group that meets yearly to provide feedback.


References


External links


Nature.com


{{Authority control National Science Foundation Ecological databases Biological databases Ecological data