Climate and Forecast Metadata Conventions
   HOME

TheInfoList



OR:

The Climate and Forecast (CF) metadata conventions are conventions for the description of
Earth sciences Earth science or geoscience includes all fields of natural science related to the planet Earth. This is a branch of science dealing with the physical, chemical, and biological complex constitutions and synergistic linkages of Earth's four spheres ...
data, intended to promote the processing and sharing of
data file A data file is a computer file which stores data to be used by a computer application or system, including input and output data. A data file usually does not contain instructions or code to be executed (that is, a computer program). Most of the ...
s. The metadata defined by the CF conventions are generally included in the same file as the data, thus making the file "self-describing". The conventions provide a definitive description of what the data values found in each
netCDF NetCDF (Network Common Data Form) is a set of software libraries and self-describing, machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. The project homepage is hosted by the Unidat ...
variable represent, and of the spatial and temporal properties of the data, including information about grids, such as grid cell bounds and cell averaging methods. This enables users of files from different sources to decide which variables are comparable, and is a basis for building software applications with powerful
data extraction Data extraction is the act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage ( data migration). The import into the intermediate extracting system is thus usua ...
, grid remapping, data analysis, and data visualization capabilities.


History and evolution

The CF conventions were introduced in 2003, after several years of development by a collaboration that included staff from U.S. and European climate and weather laboratories. The conventions contained generalizations and extensions to the earlier Cooperative Ocean/Atmosphere Research Data Service (COARDS) conventions and the Gregory/Drach/Tett (GDT) conventions. As the scope of the CF conventions grew along with its user base, the CF community adopted an open governance model. In December 2008 the trio of standards, netCDF+CF+
OPeNDAP OPeNDAP is an acronym for "Open-source Project for a Network Data Access Protocol," an endeavor focused on enhancing the retrieval of remote, structured data through a Web-based architecture and a discipline-neutral Data Access Protocol (DAP). Widel ...
, was adopted by IOOS as a recommended standard (number 08-012) for the representation and transport of gridded data. The CF conventions are being considered by the
NASA The National Aeronautics and Space Administration (NASA ) is an independent agencies of the United States government, independent agency of the US federal government responsible for the civil List of government space agencies, space program ...
Standards Process Group (SPG) and others as more broadly applicable standards.


Applications and user base

The CF conventions have been adopted by a wide variety of national and international programs and activities in the Earth sciences. For example, they were required for the
climate model Numerical climate models use quantitative methods to simulate the interactions of the important drivers of climate, including atmosphere, oceans, land surface and ice. They are used for a variety of purposes from study of the dynamics of the c ...
output data collected for
Coupled model intercomparison project In climatology, the Coupled Model Intercomparison Project (CMIP) is a collaborative framework designed to improve knowledge of climate change. It was organized in 1995 by the Working Group on Coupled Modelling (WGCM) of the World Climate Research ...
s, which are widely used for the Intergovernmental Panel on Climate Change assessment reports. They are promoted as an important element of scientific community coordination by the
World Climate Research Programme The World Climate Research Programme (WCRP) is an international programme that helps to coordinate global climate research. The WCRP was established in 1980, under the joint sponsorship of the World Meteorological Organization (WMO) and the Inter ...
. They are also used as a technical foundation for a number of software packages and data systems, including the Climate Model Output Rewriter (CMOR), which is post processing software for climate model data, and the
Earth System Grid The Earth System Grid (ESG) is a data distribution portal whose development is funded mainly by the United States Department of Energy. It is the portal through which the Program for Climate Model Diagnosis and Intercomparison at Lawrence Livermor ...
, which distributes climate and other data. The CF conventions have also been used to describe the physical fields transferred between individual Earth system model
software components Component-based software engineering (CBSE), also called component-based development (CBD), is a branch of software engineering that emphasizes the separation of concerns with respect to the wide-ranging functionality available throughout a give ...
, such as atmosphere and ocean components, as the model runs .


Supported data types

CF is intended for use with
state estimation In control theory, a state observer or state estimator is a system that provides an estimate of the internal state of a given real system, from measurements of the input and output of the real system. It is typically computer-implemented, and pro ...
and
forecasting Forecasting is the process of making predictions based on past and present data. Later these can be compared (resolved) against what happens. For example, a company might estimate their revenue in the next year, then compare it against the actual ...
data, in the atmosphere, ocean, and other physical domains. It was designed primarily to address gridded data types such as
numerical weather prediction Numerical weather prediction (NWP) uses mathematical models of the atmosphere and oceans to predict the weather based on current weather conditions. Though first attempted in the 1920s, it was not until the advent of computer simulation in th ...
model outputs and
climatology Climatology (from Greek , ''klima'', "place, zone"; and , ''-logia'') or climate science is the scientific study of Earth's climate, typically defined as weather conditions averaged over a period of at least 30 years. This modern field of study ...
data in which
data binning Data binning, also called data discrete binning or data bucketing, is a data pre-processing technique used to reduce the effects of minor observation errors. The original data values which fall into a given small interval, a '' bin'', are replaced b ...
is used to impose a regular structure. However, the CF conventions are also applicable to many classes of
observational data In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample to a population where the independent variable is not under the control of the researcher because of ethical concern ...
and have been adopted by a number of groups for such applications.


Supported data formats

CF originated as a standard for data written in
netCDF NetCDF (Network Common Data Form) is a set of software libraries and self-describing, machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. The project homepage is hosted by the Unidat ...
, but its structure is general and it has been adapted for use with other data formats. For example, using the CF conventions with
Hierarchical Data Format Hierarchical Data Format (HDF) is a set of file formats (HDF4, HDF5) designed to store and organize large amounts of data. Originally developed at the U.S. National Center for Supercomputing Applications, it is supported by The HDF Group, a non-p ...
data has been explored.


Design principles

Several principles guide the development of CF conventions: * Data should be self-describing, without external tables needed for interpretation. * Conventions should be developed only as needed, rather than anticipating possible needs. * Conventions should not be onerous to use for either data-writers or data-readers. * Metadata should be readable by humans as well as interpretable by programs. * Redundancy should be avoided to prevent inconsistencies when writing data. Specific CF metadata descriptors use values of attributes to represent * Data provenance: title, institution, contact, source (e.g. model), history (audit trail of operations), references, comment * Description of associated activity: project, experiment * Description of data: units, standard_name, long_name, auxiliary_variables, missing_value, valid_range, flag_values, flag_meanings * Description of coordinates: coordinates, bounds, grid_mapping (with formula_terms); time specified with reference_time ("time since T0") and calendar attributes. * Meaning of grid cells: cell_methods, cell_measures, and climatological statistics. A central element of the CF Conventions is the CF Standard Name Table. The CF Standard Name Table uniquely associates a standard name with each geophysical parameter in a data set, where each name provides a precise description of physical quantities being represented. Note that this is the string value of the standard_name attribute, not the name of the parameter. The CF standard name table identifies over 1,000 physical quantities, each with a precise description and associated canonical units. Guidelines for construction of CF standard names are documented on the conventions web site. As an example of the information provided by CF standard names, the entry for sea-level atmospheric pressure includes: * standard name: air_pressure_at_sea_level * description: sea_level means mean
sea level Mean sea level (MSL, often shortened to sea level) is an average surface level of one or more among Earth's coastal bodies of water from which heights such as elevation may be measured. The global MSL is a type of vertical datuma standardise ...
, which is close to the
geoid The geoid () is the shape that the ocean surface would take under the influence of the gravity of Earth, including gravitational attraction and Earth's rotation, if other influences such as winds and tides were absent. This surface is extended ...
in sea areas.
Air pressure Atmospheric pressure, also known as barometric pressure (after the barometer), is the pressure within the atmosphere of Earth. The standard atmosphere (symbol: atm) is a unit of pressure defined as , which is equivalent to 1013.25 millibars ...
at sea level is the quantity often abbreviated as MSLP or PMSL. * canonical units: Pa


Software


NetCDF-Java Library
parses CF Conventions and create

objects from them * OriginPro version 2021b supports netCDF CF Convention. Averaging can be performed during import to allow handling of large datasets in a GUI software. * Th
xarray
Python library parses and decodes data stored according to CF Conventions. * Th

Python library "draws heavily from the NetCDF CF Metadata Conventions as a source for its data model".


References

{{Reflist


External links


CF Metadata Home PageNASA Standards Process GroupStandard for the CF Metadata Conventions (Marine Metadata Interoperability Project page)Ocean Data Standards on Metadata
Metadata Earth sciences metadata conventions Meteorological data and networks Science software