FlyBase is an online bioinformatics database and the primary repository of genetic and molecular data for the insect family Drosophilidae. For the most extensively studied species and model organism, ''Drosophila melanogaster'', a wide range of data are presented in different formats.
Information in FlyBase originates from a variety of sources ranging from large-scale genome projects to the primary research literature. These data types include mutant phenotypes, molecular characterization of mutant alleles and other deviations, cytological maps, wild-type expression patterns, anatomical images, transgenic constructs and insertions, sequence-level gene models, and molecular classification of gene product functions. Query tools allow navigation of FlyBase through DNA or protein sequence, by gene or mutant name, or through terms from the several ontologies used to capture functional, phenotypic, and anatomical data. These tools are intended to provide efficient access to the available data and to facilitate the discovery of significant relationships within the database. Links between FlyBase and external databases, such as BDGP or modENCODE, provide opportunities for further exploration into other model organism databases and other resources of biological and molecular information. The FlyBase project is carried out by a consortium of ''Drosophila'' researchers and computer scientists at Harvard University and Indiana University in the United States, and the University of Cambridge in the United Kingdom.
FlyBase is one of the organizations contributing to the Generic Model Organism Database (GMOD).
The FlyBase home page requested a website access fee of US$150.00 per person per year, stating that "The NHGRI has reduced the funding of FlyBase by 50%".
Background
''Drosophila melanogaster'' has been an experimental organism since the early 1900s and has since been at the forefront of many areas of research. As the field grew and became global, researchers working on the same problems needed a way to communicate and monitor progress. This niche was initially filled by community newsletters such as the Drosophila Information Service (DIS), which dates back to 1934, when the field was starting to spread from Thomas Hunt Morgan's lab. These newsletters presented regular 'catalogs' of mutations and bibliographies of the ''Drosophila'' literature. As computer infrastructure developed in the 1980s and 1990s, the newsletters gave way to, and merged with, internet mailing lists, which eventually became online resources and data. In 1992, data on the genetics and genomics of ''D. melanogaster'' and related species were electronically available over the Internet through the funded FlyBase, BDGP (Berkeley Drosophila Genome Project), and EDGP (European Drosophila Genome Project) informatics groups. These groups recognized that most genome project and community data types overlapped, and decided it would be of value to present the scientific community with an integrated view of the data. In October 1992, the National Center for Human Genome Research of the NIH funded the FlyBase project with the objective of designing, building and releasing a database of genetic and molecular information concerning ''Drosophila melanogaster''. FlyBase also receives support from the Medical Research Council, London. In 1998, the FlyBase consortium integrated the information into a single ''Drosophila'' genomics server. The FlyBase project was carried out by a consortium of ''Drosophila'' researchers and computer scientists at Harvard University, the University of Cambridge (UK), Indiana University and the University of New Mexico.
Contents
FlyBase contains a complete annotation of the ''Drosophila melanogaster'' genome that is updated several times per year.
It also included a searchable bibliography of research on ''Drosophila'' genetics in the last century. Information on current researchers, and a partial pedigree of relationships between current researchers, was searchable, based on registration of the participating scientists. The site also provides a large database of images illustrating the full genome, and several movies detailing embryogenesis (ImageBrowser). The two major tributaries to the database are the large multispecies data sets deposited by the Drosophila 12 Genomes Consortium (Clark et al. 2007) and by Crosby et al. (2007).
Search strategies: Gene reports for genes from all twelve sequenced ''Drosophila'' genomes are available in FlyBase. There are four main ways these data can be browsed: Precomputed Files, BLAST, Gbrowse, and Gene Report Pages. Gbrowse and precomputed files are suited to genome-wide analysis, bioinformatics, and comparative genomics. BLAST and gene report pages are suited to examining a specific gene, protein, or region across the species.
When looking for cytology, there are two main tools available. Use Cytosearch when looking for cytologically mapped genes or deficiencies that have not been molecularly mapped to the sequence. Use Gbrowse when looking for molecularly mapped sequences, insertions, or Affymetrix probes.
There are two main query tools in FlyBase. The first, Jump to Gene (J2G), is found at the top right of the blue navigation bar on every FlyBase page; it is useful when you know exactly what you are looking for and want to go straight to the corresponding report page. The second, QuickSearch, is located on the FlyBase homepage; it is most useful for quickly looking up something you may know only a little about. Searches can be restricted to ''D. melanogaster'' or run across all species, and data other than genes can be searched using the ‘data class’ menu.
Related research
The following provides two examples of research that is related to or uses FlyBase:
* The first is a study of expressed genes from alate (meaning "having wings") ''Toxoptera citricida'', more commonly known as the brown citrus aphid. The brown citrus aphid is considered the primary vector of citrus tristeza virus, a severe pathogen that causes losses to citrus industries worldwide. The winged form of this aphid can fly long distances with the wind, enabling it to spread citrus tristeza virus across citrus-growing regions. To better understand the biology of the brown citrus aphid and the emergence of genes expressed during wing development, researchers undertook a large-scale 5′ end sequencing project of cDNA clones from winged aphids. Similar large-scale expressed sequence tag (EST) sequencing projects from other insects have provided a vehicle for answering biological questions relating to development and physiology. Although there is a growing database of insect ESTs in GenBank, most are from ''Drosophila melanogaster'', with relatively few derived specifically from aphids. The researchers produced a large data set of ESTs from the alate brown citrus aphid and have begun to analyze this resource, drawing on the ''D. melanogaster'' information in FlyBase. Putative sequence identity was determined using BLAST searches. Sequence matches with E-value scores ≤ −10 were considered significant and were categorized according to the Gene Ontology (GO) classification system based on annotation of the 5 ‘best hit’ matches in BLASTX searches. All ''D. melanogaster'' matches were cataloged using FlyBase, and nearly all of these ‘best hit’ matches were characterized with respect to the functionally annotated genes in ''D. melanogaster''. Genetic information is crucial to advancing the understanding of aphid biology, and will play a major role in the development of future non-chemical, gene-based control strategies against these insect pests.
* Enhancing ''Drosophila'' Gene Ontology annotation: What gene products do and where they do it are important questions for biologists. The Gene Ontology project was established 13 years ago in order to summarize this information consistently across different databases by using a common set of defined vocabulary terms; the vocabularies also encode relationships between terms. The Gene Ontology project is a major bioinformatics initiative with the aim of standardizing the representation of gene and gene product attributes across species and databases. The project also provides gene product annotation data from GO Consortium members. FlyBase was one of the three founding members of the Gene Ontology Consortium. A GO annotation comprises at least three components: a GO term that describes molecular function, biological role, or subcellular location; an "evidence code" that describes the type of analysis used to support the GO term; and an attribution to a specific reference. GO annotation is useful for both small-scale and large-scale analyses. It can provide a first indication of the nature of a gene product and, in conjunction with evidence codes, point directly to papers with pertinent experimental data. The current priorities for annotation are homologs of human disease genes, genes that are highly conserved across species, genes involved in biochemical or signaling pathways, and topical genes shown to be of significant interest in recent publications. FlyBase has been contributing GO annotations to the project since it started in August 2006. GO annotations appear on the Gene Report page in FlyBase, and GO data are searchable in FlyBase using both TermLink and QueryBuilder. The GO is dynamic and can change on a daily basis, for example through the addition of new terms. To keep up, FlyBase loads a new version of the GO every one or two FlyBase releases. The GO annotation set is submitted to the GOC at the same time as a new version of FlyBase is released.
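The three-part structure of a GO annotation described above (term, evidence code, reference) can be illustrated with a minimal Python sketch. This is not FlyBase's or the GO Consortium's data model: the class name `GOAnnotation`, the helper `experimental_references`, and every gene name, GO identifier and reference in `EXAMPLE` are hypothetical placeholders. The evidence codes listed are the experimental codes used by the Gene Ontology; "ISS" appears only as an example of a non-experimental code.

```python
from dataclasses import dataclass
from typing import Iterable, List

# Evidence codes that GO classes as experimental; other codes
# (for example "ISS", inferred from sequence similarity) are not.
EXPERIMENTAL_CODES = frozenset({"EXP", "IDA", "IPI", "IMP", "IGI", "IEP"})


@dataclass(frozen=True)
class GOAnnotation:
    """One annotation: a GO term, an evidence code, and a reference."""
    gene: str           # gene symbol or identifier (placeholder values below)
    go_term: str        # GO term identifier
    evidence_code: str  # type of analysis supporting the term
    reference: str      # attribution to a specific paper or record

    @property
    def is_experimental(self) -> bool:
        return self.evidence_code in EXPERIMENTAL_CODES


def experimental_references(annotations: Iterable[GOAnnotation], gene: str) -> List[str]:
    """Return the sorted, de-duplicated references that back a gene's
    annotations with experimental evidence."""
    refs = {
        a.reference
        for a in annotations
        if a.gene == gene and a.is_experimental
    }
    return sorted(refs)


# Made-up records, for illustration only.
EXAMPLE = [
    GOAnnotation("geneA", "GO:9000001", "IDA", "REF:0001"),
    GOAnnotation("geneA", "GO:9000002", "ISS", "REF:0002"),
    GOAnnotation("geneA", "GO:9000003", "IMP", "REF:0003"),
    GOAnnotation("geneB", "GO:9000001", "IDA", "REF:0004"),
]

if __name__ == "__main__":
    print(experimental_references(EXAMPLE, "geneA"))
```

Filtering on the evidence code is what allows an annotation set to "point directly to papers with pertinent experimental data": the term says what is claimed, the code says how it was established, and the reference says where.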
See also
* List of Drosophila databases
* Model Organism Databases
* WormBase
* Xenbase
Notes and references
External links
Official Site