Genomic Standards Consortium
   HOME

TheInfoList



OR:

The Genomic Standards Consortium (GSC) is an initiative working towards richer descriptions of our collection of genomes, metagenomes and marker genes. Established in September 2005, this international community includes representatives from a range of major
sequencing In genetics and biochemistry, sequencing means to determine the primary structure (sometimes incorrectly called the primary sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succ ...
and
bioinformatics Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combi ...
centres (including
NCBI The National Center for Biotechnology Information (NCBI) is part of the United States National Library of Medicine (NLM), a branch of the National Institutes of Health (NIH). It is approved and funded by the government of the United States. The ...
, EMBL,
DDBJ The DNA Data Bank of Japan (DDBJ) is a biological database that collects DNA sequences. It is located at the National Institute of Genetics (NIG) in the Shizuoka prefecture of Japan. It is also a member of the International Nucleotide Sequence D ...
, JCVI, JGI,
EBI Ebrahim Hamedi ( fa, اِبراهیم حامدی, also Romanized as "Ebrāhim Hāmedi"; born 1949), better known by his stage name Ebi (Persian: ), is an Iranian pop singer who first started his career in Tehran, gaining fame as part of a ban ...
, Sanger, FIG) and research institutions. The goal of the GSC is to promote mechanisms for standardizing the description of (meta)genomes, including the exchange and integration of (meta)genomic data. The number and pace of genomic and metagenomic sequencing projects will only increase as the use of ultra-high-throughput methods becomes common place and standards are vital to scientific progress and
data sharing Data sharing is the practice of making data used for scholarly research available to other investigators. Many funding agencies, institutions, and publication venues have policies regarding data sharing because transparency and openness are consid ...
.


Mission

Community-driven standards have the best chance of success if developed within the auspices of international working groups. Participants in the GSC include biologists, computer scientists, those building genomic databases and conducting large-scale comparative genomic analyses, and those with experience of building community-based standards. The mission of the GSC is to work with the wider community towards: * the implementation of a new genomic standards * methods of capturing and exchanging metadata * harmonization of metadata collection and analysis efforts across the wider genomics community Fulfilling this mission by holding face-to-face meetings, forming working groups, and building consensus products that can be widely used in this community. Bringing together investigators working in different systems to work on a common problem.


MIGS/MIMS/MIMARKS and other projects

The GSC has published a “Minimum Information about a (Meta)Genome Sequence” specification and has now completed a "Minimum Information about an ENvironmental Sequence" specification. MIGS/MIMS/MIMARKS provides an extension of the minimum information already captured by the primary nucleotide sequence archives
INSDC
or DDBJ/ENA/GenBank). The development of any checklist must be an open and iterative process that involves a balanced group of participants. Further, this development process must be supported by providing mechanisms for achieving compliance if a checklist is to be adopted as a tool for the standardization of a particular area of knowledge. Work towards this goal has spawned a set of interlocking projects that are described in more detail here
GSC projects
These include The Genomic Contextual Data Markup Language (GCDML), Genomic Rosetta Stone (GRS), Habitat-Lite. Newer projects include the M5 project.


Linkages to other groups

The GSC is interested in making and building links with other communities. As stated above, the GSC is engaged in ontology development within th
OBO Foundry
The GSC is also a founding member community of th
Minimum Information about a Biomedical or Biological Investigation (MIBBI)
an umbrella community for supporting and co-ordinating the development of checklists describing
Minimum Information Standards Minimum information standards are sets of guidelines and formats for reporting data derived by specific high-throughput methods. Their purpose is to ensure the data generated by these methods can be easily verified, analysed and interpreted by the ...
. GSC and the
Earth Microbiome Project The 'Earth Microbiome Project'' (EMP) is an initiative founded by Janet Jansson, Jack Gilbert and Rob Knight in 2010 to collect natural samples and to analyze the microbial community around the globe. Microbes are highly abundant, diverse, and h ...
maintain the Biological Observation Matrix (BIOM) file format, an open
JSON JSON (JavaScript Object Notation, pronounced ; also ) is an open standard file format and data interchange format that uses human-readable text to store and transmit data objects consisting of attribute–value pairs and arrays (or other ser ...
-based file format for representing arbitrary observation by sample contingency tables with associated sample and observation metadata.


Publications

The GSC maintains a list of publications on its wiki
GSC Publications
This list includes reports from all workshops, articles from the special issue of the journal OMICS on data standards, and the publications describing the MIGS/MIMS and MIMARKS specifications in the journal ''Nature Biotechnology'' (May 2008 and May 2011 respectively). The GSC has also published a series of papers "Genomic Standards Consortium and Beyond" in the journal ''
GigaScience ''GigaScience'' is a peer-reviewed scientific journal that was established in 2012. It covers research and large data-sets that result from work in the biomedical and life sciences. The editor-in-chief is Scott Edmunds. Originally, the journal wa ...
''.


References


External links


GSC home page

GSC FAQ

GSC projects
{{Authority control Genomics organizations International scientific organizations Scientific organizations established in 2005