MutationTaster
   HOME

TheInfoList



OR:

MutationTaster is a free web-based application to evaluate DNA sequence variants for their disease-causing potential. The software performs a battery of ''in silico'' tests to estimate the impact of the variant on the gene product / protein. Tests are made on both, protein and DNA level, MutationTaster is hence not limited to substitutions of single amino acids but can also handle
synonymous A synonym is a word, morpheme, or phrase that means exactly or nearly the same as another word, morpheme, or phrase in a given language. For example, in the English language, the words ''begin'', ''start'', ''commence'', and ''initiate'' are all ...
or
intron An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene. ...
ic variants.


Background

Many
genetic disorder A genetic disorder is a health problem caused by one or more abnormalities in the genome. It can be caused by a mutation in a single gene (monogenic) or multiple genes (polygenic) or by a chromosomal abnormality. Although polygenic disorders ...
s can be caused by mutation of a single gene. New sequencing techniques however, have shown that a single individual can have up to 3.5 million alterations in the whole genome, most of which do not have a detrimental health effect. The challenge of prediction tools is thus to filter harmless mutations from disease-causing ones. It is important to note that these tools are not designed to predict sources of complex diseases such as cancer. The latter usually do not have a monogenic causation but are caused by multiple gene defects that develop cumulatively into a disease.


Approach and tests

Mutation Taster is written in
Perl Perl is a family of two high-level, general-purpose, interpreted, dynamic programming languages. "Perl" refers to Perl 5, but from 2000 to 2019 it also referred to its redesigned "sister language", Perl 6, before the latter's name was offici ...
and can process
Next-generation sequencing Massive parallel sequencing or massively parallel sequencing is any of several high-throughput approaches to DNA sequencing using the concept of massively parallel processing; it is also called next-generation sequencing (NGS) or second-generation s ...
data of all major platforms (Roche 454, Illumina Genome Analyzer and ABI SOLiD). The program first discards mutations that are known, harmless polymorphisms by comparison with the integrated databases. The remaining SNPs (
Single-nucleotide polymorphism In genetics, a single-nucleotide polymorphism (SNP ; plural SNPs ) is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently lar ...
) are tested according to the gene alteration they are causing: * Silent synonymous or intronic alterations that do not lead to an amino acid exchange * Mutations that affect a single amino acid * Mutations causing complex changes in the amino acid sequence (such as
indel Indel is a molecular biology term for an insertion or deletion of bases in the genome of an organism. It is classified among small genetic variations, measuring from 1 to 10 000 base pairs in length, including insertion and deletion events that ...
s) Multiple tests are performed to determine the nature of the given SNP. These tests comprise (among others): * amino acid substitution(s) * conservation of affected amino acid(s) * potential loss of functional protein domains * length of protein * effect on splicing * conservation on DNA level (phastCons / phyloP) * potential abrogation of regulatory elements (such as transcription factor binding sites) Integrated data sources (among others): * Ensembl * UniProt * ClinVar * ExAC * 1000 Genomes Project * phyloP * phastCons The single results are then assessed by a
Naive Bayes classifier In statistics, naive Bayes classifiers are a family of simple "probabilistic classifiers" based on applying Bayes' theorem with strong (naive) independence assumptions between the features (see Bayes classifier). They are among the simplest Baye ...
which decides whether or not their combined effect might be deleterious for the protein. The 'raw' accuracy of MutationTaster is about 90%, with the inclusion of knowledge about common (harmless) polymorphisms and known disease mutations, the actual rate of correct classifications is much higher. The test output explains if the alteration is a known or predicted harmless or disease-causing mutation and gives detailed information about the mutation. Importantly, the predictions of clinical effects of mutations suffer from a lack of specificity, which appears to be the common constraint of all recently used prediction methods, including those named above. Despite that, predictions mediated by these methods are associated with nearly absolute sensitivity. The outcomes of prediction methods are often uncritically used, particularly by non-specialists in the field.


Development

Development of MutationTaster has started in 2007, the software is available online since 2009. MutationTaster is hosted at the Charité Berlin and its current developers are Olivia Ebner, Daniela Hombach, Markus Schülke, Jana Marie Schwarz, Dominik Seelow. Current efforts are focused on integrating mutations that do not directly alter protein coding genes but have an effect on gene regulation and expression.


References

{{Reflist


External links

* Official websit

Charité Genetics databases Genetics software