Chimera (EST)
   HOME

TheInfoList



OR:

In
molecular biology Molecular biology is the branch of biology that seeks to understand the molecular basis of biological activity in and between cells, including biomolecular synthesis, modification, mechanisms, and interactions. The study of chemical and physi ...
, and more importantly high-throughput DNA sequencing, a chimera is a single DNA sequence originating when multiple transcripts or DNA sequences get joined. It can occur in various contexts. Chimeric reads are generally considered artifacts in sequencing applications (such a
amplicon sequencing
and are filtered out from the data during processing to prevent spurious inferences of biological variation. In a different context, the deliberate creation of artificial chimeras can also be a useful tool in the molecular biology. For example, in
protein engineering Protein engineering is the process of developing useful or valuable proteins. It is a young discipline, with much research taking place into the understanding of protein folding and recognition for protein design principles. It has been used to imp ...
, "chimeragenesis (forming chimeras between proteins that are encoded by homologous
cDNAs In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA (e.g., messenger RNA (mRNA) or microRNA (miRNA)) template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to express a spe ...
)" p. 424 is one of the "two major techniques used to manipulate cDNA sequences". For gene fusions that occur through natural processes, see chimeric genes and
fusion genes A fusion gene is a hybrid gene formed from two previously independent genes. It can occur as a result of translocation, interstitial deletion, or chromosomal inversion. Fusion genes have been found to be prevalent in all main types of human neopla ...
.


Description


Transcript chimera

A chimera can occur as a single cDNA sequence originating from two transcripts. It is usually considered to be a contaminant in transcript and
expressed sequence tag In genetics, an expressed sequence tag (EST) is a short sub-sequence of a cDNA sequence. ESTs may be used to identify gene transcripts, and were instrumental in gene discovery and in gene-sequence determination. The identification of ESTs has proc ...
(which results in the moniker of EST chimera) databases. It is estimated that approximately 1% of all transcripts in the
National Center for Biotechnology Information The National Center for Biotechnology Information (NCBI) is part of the United States National Library of Medicine (NLM), a branch of the National Institutes of Health (NIH). It is approved and funded by the government of the United States. The ...
's Unigene database contain a "chimeric sequence".


PCR chimera

A chimera can also be an artifact of PCR amplification. It occurs when the extension of an
amplicon In molecular biology, an amplicon is a piece of DNA or RNA that is the source and/or product of amplification (molecular biology), amplification or DNA replication, replication events. It can be formed artificially, using various methods including ...
is aborted, and the aborted product functions as a
primer Primer may refer to: Arts, entertainment, and media Films * ''Primer'' (film), a 2004 feature film written and directed by Shane Carruth * ''Primer'' (video), a documentary about the funk band Living Colour Literature * Primer (textbook), a t ...
in the next PCR cycle. The aborted product anneals to the wrong template and continues to extend, thereby synthesizing a single sequence sourced from two different templates. PCR chimeras are an important issue to take into account during
metabarcoding Metabarcoding is the barcoding of DNA/RNA (or eDNA/ eRNA) in a manner that allows for the simultaneous identification of many taxa within the same sample. The main difference between barcoding and metabarcoding is that metabarcoding does n ...
, where DNA sequences from environmental samples are used to determine biodiversity. A chimera is a novel sequence that will most probably not match to any known organism. Hence, it might be interpreted as a new species thereby overinflating the diversity.


Chimeric read

A chimeric read is a digital DNA sequence (i.e. a string of letters in a file that can be read as a DNA sequence) that originates from an actual chimera (i.e. a physical DNA sequence in a sample) ''or'' produced due to misreading the sample. The latter is known to occur with sequencing of
electrophoresis Electrophoresis, from Ancient Greek ἤλεκτρον (ḗlektron, "amber") and φόρησις (phórēsis, "the act of bearing"), is the motion of dispersed particles relative to a fluid under the influence of a spatially uniform electric fie ...
gels. Chimeric reads are common with amplicon sequencing applications such as 16S rRNA gene sequencing, since closely related sequences are amplified. The most common mechanism is that incomplete extension during the PCR results in partial sequence strands that can act as primers in subsequent PCR cycles on similar but non identical sequences. Extension of such hybrid priming events causes the formation of chimeric sequences. Some computational methods have been devised to detect and remove chimeras, like: * CHECK_CHIMERA of the Ribosomal Database Project * ChimeraSlayer in QIIME * uchime in usearch * removeBimeraDenovo() in dada2 * Bellerophon * CATCh * DECIPHER


Examples

* "The first mRNA transcript isolated for..." the human gene
C2orf3 GC-rich sequence DNA-binding factor is a protein that in humans is encoded by the ''GCFC2'' gene. The first mRNA transcript isolated for this gene was part of an artificial chimera derived from two distinct gene transcripts and a primer used in ...
"...was part of an artificial chimera..." * CYP2C17 was thought to be a human gene, but "...is now considered an artefact based on a chimera of
CYP2C18 Cytochrome P450 2C18 is a protein that in humans is encoded by the ''CYP2C18'' gene. Function This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactio ...
and CYP2C19." * Researchers have created receptor chimeras in their studies of
Oncostatin M Oncostatin M, also known as OSM, is a protein that in humans is encoded by the ''OSM'' gene. OSM is a pleiotropic cytokine that belongs to the interleukin 6 group of cytokines. Of these cytokines it most closely resembles leukemia inhibitory fact ...
.


See also

*
Ribosome Ribosomes ( ) are macromolecular machines, found within all cells, that perform biological protein synthesis (mRNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA (mRNA) molecules to ...
*
Transgene A transgene is a gene that has been transferred naturally, or by any of a number of genetic engineering techniques, from one organism to another. The introduction of a transgene, in a process known as transgenesis, has the potential to change th ...
*
Trans-splicing ''Trans''-splicing is a special form of RNA processing where exons from two different primary RNA transcripts are joined end to end and ligated. It is usually found in eukaryotes and mediated by the spliceosome, although some bacteria and archa ...
*
Chimera (genetics) A genetic chimerism or chimera ( ) is a single organism composed of cells with more than one distinct genotype. In animals, this means an individual derived from two or more zygotes, which can include possessing blood cells of different blood ...
* chimeric gene *
fusion gene A fusion gene is a hybrid gene formed from two previously independent genes. It can occur as a result of translocation, interstitial deletion, or chromosomal inversion. Fusion genes have been found to be prevalent in all main types of human neopla ...


References

Genetics {{genetics-stub