GWAS Central

GWAS Central (previously HGBASE, HGVbase and HGVbaseG2P) is a publicly available database of summary-level findings from genetic association studies in humans, including genome-wide association studies (GWAS). It is funded through the GEN2PHEN project by the European Union under its Seventh Framework Programme.


Scope

GWAS Central contains the most comprehensive collection of summary-level p-value GWAS data. The web resource employs graphical and text-based data presentation methods for discovering, simultaneously visualising and co-examining many studies, at genome-wide and region-specific levels. Studies of interest can be identified using chromosomal regions, genes and markers; researchers can also view their own data alongside selected studies. Current content includes 'top' p-values from collections, supplementary data, direct researcher submissions and publicly available data. Consequently, the database now hosts >21 million p-values and 708 studies (vs 3,948 p-values and 798 studies in the NHGRI GWAS Catalog), representing ~5% of all such data yet produced. GWAS Central makes parts of its data freely available for download by the research community; the whole database content can be accessed as part of a collaboration.


History

The Human Genome Bi-Allelic SEquence (HGBASE) database was the first version of what is now GWAS Central. It was first released in August 1998, focusing on providing a centralized collection of known human single nucleotide polymorphisms and other simple DNA variants. It was the first publicly available SNP database. The project was expanded over the next year by a consortium including the Karolinska Institute, the European Bioinformatics Institute and the European Molecular Biology Laboratory. Corporate support was provided by Pfizer and GlaxoSmithKline.

The version released in November 2001 was renamed the Human Genome Variation database (HGVbase), as this better reflected the scope of the database and its emphasis on collection from many different laboratories. The new name also highlighted its new role as a central repository for data collection efforts in collaboration with the Human Genome Variation Society. HGVbase was scaled back in 2004 to simply provide an alternative representation of the full marker list from dbSNP, but development continued on its successor: the Human Genome Variation Genotype-to-Phenotype database (HGVbaseG2P), in many ways the natural evolution of HGVbase into a central database for summary-level genetic association data. The work was originally funded by GlaxoSmithKline, the University of Leicester, and the European Community's Sixth Framework Programme ('INFOBIOMED' Network of Excellence), but the GEN2PHEN project became the main source of funds in 2008.

Early work in the project involved devising a powerful way of modeling phenotype and genotype-phenotype data, which was itself adopted and adapted to become the global standard Phenotype And Genotype Experiment Object model. In 2008 HGVbaseG2P went live and extended the project's content and scope by adding a far broader and more comprehensive range of markers (i.e., SNPs, structural variants, and STSs), along with association data from many genetic association studies. In February 2010, the project was once again renamed, this time to GWAS Central, to reflect the growing focus on genome-wide association studies.

GWAS Central is a core component of the GEN2PHEN project and intends to provide an operational model, plus an open-source software package, so that others can create similar databases across the world. These will be hosted by institutes, consortia, and even individual laboratories, providing those groups with a toolkit for publicising and publishing their genetic association findings on the web and for examining their data alongside others' data from similar resources.




External links


GWAS Central

GEN2PHEN project

