Minimum Information About A Proteomics Experiment
   HOME

TheInfoList



OR:

The Proteomics Standards Initiative (PSI) is a working group of the
Human Proteome Organization The Human Proteome Organization (HUPO) is an international consortium of national proteomics research associations, government researchers, academic institutions, and industry partners. The organization was launched in February 2001, and it promotes ...
. It aims to define data standards for
proteomics Proteomics is the large-scale study of proteins. Proteins are vital parts of living organisms, with many functions such as the formation of structural fibers of muscle tissue, enzymatic digestion of food, or synthesis and replication of DNA. In ...
to facilitate
data comparison In computing, file comparison is the calculation and display of the differences and similarities between data objects, typically text files such as source code. The methods, implementations, and results are typically called a diff, after the Un ...
, exchange and verification. The Proteomics Standards Initiative focuses on the following subjects: minimum information about a proteomics experiment defines the metadata that should be provided along with a proteomics experiment. a data
markup language Markup language refers to a text-encoding system consisting of a set of symbols inserted in a text document to control its structure, formatting, or the relationship between its parts. Markup is often used to control the display of the document ...
for encoding the data, and metadata ontologies for consistent annotation and representation.


Minimum information about a proteomics experiment

Minimum information about a proteomics experiment (MIAPE) is a minimum information standard, created by the Proteomics Standards Initiative of the
Human Proteome Organization The Human Proteome Organization (HUPO) is an international consortium of national proteomics research associations, government researchers, academic institutions, and industry partners. The organization was launched in February 2001, and it promotes ...
, for reporting proteomics experiments. You can't just introduce the results of an analysis, it is intended to specify all the information necessary to interpret the experiment results unambiguously and to potentially reproduce the experiment. While the MIAPE guidelines define the content required for compliant reports, it does not specify the format in which this data should be presented (which is left to the corresponding *ML format, also defined by PSI), nor does it define how to perform experiments.


Working groups

Several working groups work on several documents covering the different areas of
proteomics Proteomics is the large-scale study of proteins. Proteins are vital parts of living organisms, with many functions such as the formation of structural fibers of muscle tissue, enzymatic digestion of food, or synthesis and replication of DNA. In ...
: The
gel electrophoresis Gel electrophoresis is a method for separation and analysis of biomacromolecules ( DNA, RNA, proteins, etc.) and their fragments, based on their size and charge. It is used in clinical chemistry to separate proteins by charge or size (IEF ...
working group defined reporting requirements for gel electrophoresis experiments. The document is at the stage of a recommendation and has been published. The corresponding data exchange format is called ''GelML,'' and a stable version was released in late 2007. The gel electrophoresis working group also focuses on image analysis with the gel image informatics recommendation that is currently in the public review phase while the corresponding exchange format is only a draft (as of April 2009). The sample processing working group defines requirements concerning all the
sample Sample or samples may refer to: Base meaning * Sample (statistics), a subset of a population – complete data set * Sample (signal), a digital discrete sample of a continuous analog signal * Sample (material), a specimen or small quantity of s ...
pre-processing steps that are carried out before gel electrophoresis or
mass spectrometry Mass spectrometry (MS) is an analytical technique that is used to measure the mass-to-charge ratio of ions. The results are presented as a ''mass spectrum'', a plot of intensity as a function of the mass-to-charge ratio. Mass spectrometry is use ...
is applied. Two documents concerning column chromatography and
capillary electrophoresis Capillary electrophoresis (CE) is a family of electrokinetic separation methods performed in submillimeter diameter capillaries and in micro- and nanofluidic channels. Very often, CE refers to capillary zone electrophoresis (CZE), but other electr ...
are in the early draft stages and the ''Sample preparation and handling'' is still a project (as of April 2009). The data exchange format (spML) is also under development. Mass spectrometry and mass spectrometry informatics documents have been published as recommendations by the mass spectrometry working group. The working group has released several data exchange format: the mzML, for the capture of data generated by a mass spectrometer, which is a merge of the previous mzData (developed by PSI) and mzXML (developed at the Seattle Proteome Center at the Institute for Systems Biology); mzIdentML, for Mass spectra informatics analysis that capture the results of the identification of proteins and peptides from mass spectrometry data; and TraML, for
selected reaction monitoring Selected reaction monitoring (SRM), also called Multiple reaction monitoring, (MRM), is a method used in tandem mass spectrometry in which an ion of a particular mass is selected in the first stage of a tandem mass spectrometer and an ion product ...
input file. Finally, they develop ''MS CV'', a controlled vocabulary to use with the previous file formats. The molecular interactions working group of PSI only works on ''PSI MI XML'', a data exchange format, and on its corresponding ontologies. They have published the MIMIx guidelines (minimum information about a molecular interaction experiment) Study design and sample generation and statistical analysis of data MIAPE recommendations are also being planned or drafted.


Standard-compliant proteomics repositories

Several standard-compliant proteomics repositories exist, allowing researchers to publish their data while enforcing MIAPE guidelines. For example: MIAPEGelDB (for gel electrophoresis data),
PRIDE Pride is defined by Merriam-Webster as "reasonable self-esteem" or "confidence and satisfaction in oneself". A healthy amount of pride is good, however, pride sometimes is used interchangeably with "conceit" or "arrogance" (among other words) wh ...
(for mass spectrometry data), an
ProteoRed
MIAPE Generator tool (for gel electrophoresis and mass spectrometry data) It is expected that journal editors will eventually request authors to publish all their data to such repositories before publication.


Similar initiatives

There are similar initiatives that try to define minimal requirements. For microarrays the
MGED Society The Functional GEnomics Data Society (FGED) (formerly known as the MGED Society) was a non-profit, volunteer-run international organization of biologists, computer scientists, and data analysts that aims to facilitate biological and biomedical di ...
defined the
minimum information about a microarray experiment Minimum information about a microarray experiment (MIAME) is a standard created by the FGED Society for reporting microarray experiments. MIAME is intended to specify all the information necessary to interpret the results of the experiment unam ...
(MIAME). The
standards for reporting of diagnostic accuracy Standard may refer to: Symbols * Colours, standards and guidons, kinds of military signs * Standard (emblem), a type of a large symbol or emblem used for identification Norms, conventions or requirements * Standard (metrology), an object th ...
(STARD) is available for studies reporting
medical diagnosis Medical diagnosis (abbreviated Dx, Dx, or Ds) is the process of determining which disease or condition explains a person's symptoms and signs. It is most often referred to as diagnosis with the medical context being implicit. The information re ...
accuracies.


References

{{reflist


External links


PSI web site (psidev.info)
Bioinformatics software Minimum Information Standards Proteomics