
A frameshift mutation (also called a framing error or a reading frame shift) is a genetic mutation caused by indels (insertions or deletions) of a number of nucleotides in a DNA sequence that is not divisible by three. Due to the triplet nature of gene expression by codons, the insertion or deletion can change the reading frame (the grouping of the codons), resulting in a completely different translation from the original. The earlier in the sequence the deletion or insertion occurs, the more altered the protein. A frameshift mutation is not the same as a single-nucleotide polymorphism, in which a nucleotide is replaced rather than inserted or deleted. A frameshift mutation will in general cause the codons after the mutation to code for different amino acids. It will also change which stop codon ("UAA", "UGA" or "UAG") is encountered first in the sequence. The polypeptide being created could be abnormally short or abnormally long, and will most likely not be functional.

Frameshift mutations are apparent in severe genetic diseases such as Tay–Sachs disease; they increase susceptibility to certain cancers and classes of familial hypercholesterolaemia; in 1997, a frameshift mutation was linked to resistance to infection by the HIV retrovirus. Frameshift mutations have been proposed as a source of biological novelty, as with the alleged creation of nylonase; however, this interpretation is controversial. A study by Negoro et al. (2006) found that a frameshift mutation was unlikely to have been the cause and that rather a two amino acid substitution in the active site of an ancestral esterase resulted in nylonase.


Background

The information contained in DNA determines protein function in the cells of all organisms. Transcription and translation allow this information to be communicated into making proteins. However, an error in reading this information can cause incorrect protein function and eventually disease, even though the cell incorporates a variety of corrective measures.


Central dogma

In 1956 Francis Crick described the flow of genetic information from DNA to a specific amino acid arrangement for making a protein as the central dogma. For a cell to function properly, proteins must be produced accurately for structural and catalytic activities. An incorrectly made protein can have detrimental effects on cell viability and in most cases causes the higher organism to become unhealthy through abnormal cellular functions. To ensure that the genome successfully passes the information on, proofreading mechanisms such as exonucleases and mismatch repair systems are incorporated in DNA replication.


Transcription and translation

After DNA replication, a selected section of genetic information is read by transcription. The nucleotides carrying the genetic information are then on a single-stranded messenger template called mRNA. The mRNA associates with a subunit of the ribosome and interacts with an rRNA. The genetic information carried in the codons of the mRNA is then read (decoded) by the anticodons of the tRNA. As each codon (triplet) is read, amino acids are joined until a stop codon (UAG, UGA or UAA) is reached. At this point the polypeptide (protein) has been synthesised and is released. For every 1000 amino acids incorporated into the protein, no more than one is incorrect. This fidelity of codon recognition, which maintains the proper reading frame, is accomplished by proper base pairing at the ribosome A site, the GTP hydrolysis activity of EF-Tu (a form of kinetic stability), and a proofreading mechanism as EF-Tu is released.

Frameshifting may also occur during translation, producing different proteins from overlapping open reading frames, such as the gag-pol-env retroviral proteins. This is fairly common in viruses and also occurs in bacteria and yeast (Farabaugh, 1996). Reverse transcriptase, as opposed to RNA polymerase II, is thought to be a stronger cause of frameshift mutations. In experiments only 3–13% of all frameshift mutations occurred because of RNA polymerase II. In prokaryotes the error rate inducing frameshift mutations is only somewhere between 0.00001 and 0.0001.

There are several biological processes that help to counteract frameshift mutations. Reverse mutations can change the mutated sequence back to the original wild type sequence. Another possibility for mutation correction is a suppressor mutation, which offsets the effect of the original mutation by creating a secondary mutation that shifts the sequence so that the correct amino acids are read. Guide RNA can also be used to insert or delete uridine in the mRNA after transcription, which restores the correct reading frame.


Codon-triplet importance

A codon is a set of three nucleotides, a triplet that codes for a certain amino acid. The first codon establishes the reading frame, which determines where each subsequent codon begins. A protein's amino acid backbone sequence is defined by contiguous triplets. Codons are key to the translation of genetic information for the synthesis of proteins. The reading frame is set when translation of the mRNA begins and is maintained as the ribosome reads from one triplet to the next. The reading of the genetic code is subject to three rules that govern codons in mRNA. First, codons are read in a 5' to 3' direction. Second, codons are nonoverlapping and the message has no gaps. Third, as stated above, the message is translated in a fixed reading frame.
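
The three reading rules can be illustrated with a minimal Python sketch. The function name `codons_in_frame` and the example message are assumptions made for illustration only; they do not come from any cited source or library.

```python
def codons_in_frame(mrna: str, start: int = 0) -> list[str]:
    """Split an mRNA string (written 5' to 3') into consecutive triplets.

    Hypothetical helper for illustration: `start` selects the reading frame.
    """
    mrna = mrna.upper()
    # Stepping by three keeps codons nonoverlapping and leaves no gaps;
    # a trailing fragment shorter than three bases is not a codon and is dropped.
    return [mrna[i:i + 3] for i in range(start, len(mrna) - 2, 3)]


message = "AUGGCUUACGGAUAA"  # made-up example message
for frame in range(3):
    print(frame, codons_in_frame(message, frame))
```

Each of the three possible starting offsets groups the same nucleotides into a different set of triplets, which is why the frame fixed at the start of translation matters for everything downstream.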


Mechanism

Frameshift mutations can occur randomly or be caused by an external stimulus, and they can be detected by several different methods. Frameshifts are just one type of mutation that can lead to incomplete or incorrect proteins, but they account for a significant percentage of errors in DNA.

In an unaltered gene, codons (triplets of nucleotides) are interpreted sequentially, with each codon encoding a specific amino acid. This is known as the standard reading frame. In a frameshift mutation, one or more nucleotides (a number not divisible by three) are inserted into or deleted from the DNA sequence, which shifts the reading frame because of the triplet nature of the genetic code. For instance, the addition of an extra "A" shifts the sequence so that an entirely different set of codons is read from that point on. The ribosome, which reads the mRNA for protein synthesis, therefore misinterprets the genetic data and produces an entirely different series of amino acids, resulting in an altered protein sequence. In most instances, the new reading frame soon encounters a stop codon, leading to a shortened and usually inactive protein. Such a stop codon is termed a premature (early) stop codon; its truncating effect resembles that of a nonsense mutation, although a nonsense mutation proper is a point mutation.
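
As a rough illustration of the extra "A" case, the following Python sketch translates a short made-up mRNA before and after a single-base insertion. The sequence, the insertion position, and the `translate` helper are assumptions chosen for the example; the codon table is deliberately partial and holds only the standard-code entries that the example needs.

```python
# Partial codon table (standard genetic code), limited to this example's codons.
CODON_TABLE = {
    "AUG": "M", "UUU": "F", "GAC": "D", "CAU": "H", "GGA": "G", "AUU": "I",
}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def translate(mrna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    peptide = []
    for i in range(0, len(mrna) - 2, 3):
        codon = mrna[i:i + 3]
        if codon in STOP_CODONS:
            break
        # "?" marks a codon that is missing from the partial table above.
        peptide.append(CODON_TABLE.get(codon, "?"))
    return "".join(peptide)


original = "AUGUUUGACCAUGGAUAA"             # AUG UUU GAC CAU GGA UAA
mutant = original[:3] + "A" + original[3:]  # one extra A after the start codon

print(translate(original))  # MFDHG
print(translate(mutant))    # MI (the shifted frame reads AUG AUU UGA)
```

In this made-up case the shifted frame reaches a UGA stop codon after only two amino acids, which is the shortened-protein outcome described above.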


Genetic or environmental

This is a genetic mutation at the level of nucleotide bases. Why and how frameshift mutations occur is still under investigation. One environmental study examined the production of UV-induced frameshift mutations by DNA polymerases deficient in 3′ → 5′ exonuclease activity. The normal sequence 5′ GTC GTT TTA CAA 3′ was changed to GTC GTT T TTA CAA (MIDT) or GTC GTT C TTA CAA (MIDC) to study frameshifts. E. coli pol I Kf and T7 DNA polymerase mutant enzymes devoid of 3′ → 5′ exonuclease activity produced UV-induced revertants at higher frequency than did their exonuclease-proficient counterparts. The data indicate that loss of proofreading activity increases the frequency of UV-induced frameshifts.
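
To make the MIDT and MIDC constructs concrete, here is a small Python sketch that locates the inserted base by comparing each construct with the normal sequence. The function `find_insertion` is a hypothetical helper written for this illustration and is not part of the study's method.

```python
def find_insertion(reference: str, mutant: str):
    """Return (index, inserted_bases) if mutant equals reference plus one
    contiguous insertion, otherwise None. Hypothetical helper."""
    extra = len(mutant) - len(reference)
    if extra <= 0:
        return None
    # Length of the prefix shared by both sequences.
    p = 0
    while p < len(reference) and reference[p] == mutant[p]:
        p += 1
    # Past the inserted bases, the mutant must match the rest of the reference.
    if mutant[p + extra:] != reference[p:]:
        return None
    return p, mutant[p:p + extra]


normal = "GTCGTTTTACAA"   # 5' GTC GTT TTA CAA 3'
midt = "GTCGTTTTTACAA"    # GTC GTT T TTA CAA
midc = "GTCGTTCTTACAA"    # GTC GTT C TTA CAA

for name, seq in (("MIDT", midt), ("MIDC", midc)):
    index, bases = find_insertion(normal, seq)
    shifts_frame = len(bases) % 3 != 0
    print(name, index, bases, shifts_frame)
```

Because the extra T in MIDT sits inside a run of T bases, its exact position is ambiguous; this sketch reports the rightmost equivalent position (index 8), whereas the C in MIDC is located unambiguously at index 6.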


Detection


Fluorescence

The effects of neighboring bases and secondary structure on the frequency of frameshift mutations have been investigated in depth using fluorescence. Fluorescently tagged DNA, by means of base analogues, permits one to study the local changes of a DNA sequence. Studies on the effects of the length of the primer strand reveal that an equilibrium mixture of four hybridization conformations was observed when template bases looped out as a bulge, i.e. a structure flanked on both sides by duplex DNA. In contrast, a double-loop structure with an unusual unstacked DNA conformation at its downstream edge was observed when the extruded bases were positioned at the primer–template junction, showing that misalignments can be modified by neighboring DNA secondary structure.


Sequencing

Sanger sequencing and pyrosequencing are two methods that have been used to detect frameshift mutations; however, the data generated are unlikely to be of the highest quality. Even so, 1.96 million indels that do not overlap with other databases have been identified through Sanger sequencing. When a frameshift mutation is observed, it is compared against the Human Gene Mutation Database (HGMD) to determine whether the mutation has a damaging effect. This is done by looking at four features: first, the ratio between the affected and conserved DNA; second, the location of the mutation relative to the transcript; third, the ratio of conserved and affected amino acids; and finally, the distance of the indel to the end of the exon.

Massively parallel sequencing is a newer method that can be used to detect mutations. Using this method, up to 17 gigabases can be sequenced at once, as opposed to reads of only about 1 kilobase for Sanger sequencing. Several technologies are available to perform this test, and it is being considered for clinical applications. When testing for different carcinomas, current methods only allow for looking at one gene at a time. Massively parallel sequencing can test for a variety of cancer-causing mutations at once, as opposed to several specific tests. An experiment to determine the accuracy of this newer sequencing method tested 21 genes and had no false positive calls for frameshift mutations.


Diagnosis

A US patent (5,958,684) in 1999 by Leeuwen details methods and reagents for the diagnosis of diseases caused by or associated with a gene having a somatic mutation giving rise to a frameshift mutation. The methods include providing a tissue or fluid sample and conducting gene analysis for a frameshift mutation or for a protein resulting from this type of mutation. The nucleotide sequence of the suspected gene is obtained from published gene sequences or from cloning and sequencing of the suspect gene. The amino acid sequence encoded by the gene is then predicted. Several laboratory techniques can be used to find the underlying insertions or deletions:

DNA sequencing: Sanger sequencing or next-generation sequencing (NGS) can be used to sequence the DNA directly and identify insertions or deletions.

Polymerase chain reaction (PCR): PCR can be used to amplify the specific region containing the mutation for subsequent analysis.

Multiplex ligation-dependent probe amplification (MLPA): MLPA is a technique used to detect copy number variations and small insertions or deletions.

Comparative genomic hybridization (CGH): CGH is used to detect chromosomal imbalances, which may include large insertions or deletions.
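
Once sequencing has reported an insertion or deletion, whether it shifts the reading frame depends only on the net length change. The Python sketch below is a simplified illustration: the function `classify_indel` is hypothetical, it assumes the variant lies entirely within a coding region and is given as reference and alternate allele strings, and it ignores splice sites and other complications that a real variant annotator has to handle.

```python
def classify_indel(ref: str, alt: str) -> str:
    """Classify a coding-region variant from its reference and alternate alleles.

    Hypothetical helper: only the net length change is considered.
    """
    delta = len(alt) - len(ref)
    if delta == 0:
        return "substitution"
    kind = "insertion" if delta > 0 else "deletion"
    # A net change that is a multiple of three keeps the downstream frame intact.
    frame = "in-frame" if delta % 3 == 0 else "frameshift"
    return f"{frame} {kind}"


print(classify_indel("A", "AC"))    # frameshift insertion
print(classify_indel("AT", "A"))    # frameshift deletion
print(classify_indel("ACTT", "A"))  # in-frame deletion
print(classify_indel("A", "G"))     # substitution
```

The example alleles are made up; the point is only that a three-base deletion removes whole codons, whereas a one-base insertion or deletion changes every codon downstream.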


Frequency

Despite the rules that govern the genetic code and the various mechanisms present in a cell to ensure the correct transfer of genetic information during DNA replication as well as during translation, mutations do occur; frameshift mutation is not the only type. There are at least two other recognized types of mutation, both point mutations: missense mutation and nonsense mutation. A frameshift mutation can drastically change the coding capacity (genetic information) of the message. Small insertions or deletions (those less than 20 base pairs) make up 24% of mutations that manifest in currently recognized genetic disease.

Frameshift mutations are found to be more common in repeat regions of DNA. One reason is slippage of the polymerase enzyme in repeat regions, which allows mutations to enter the sequence. Experiments can be run to determine the frequency of the frameshift mutation by adding or removing a pre-set number of nucleotides. Experiments have been run by adding four base pairs, called the +4 experiments, but a team from Emory University looked at the difference in frequency of the mutation by both adding and deleting a base pair. It was shown that there was no difference in frequency between the addition and deletion of a base pair. There is, however, a difference in the resulting protein.

Huntington's disease is one of the nine codon reiteration disorders caused by polyglutamine expansion mutations, which include spinocerebellar ataxia (SCA) 1, 2, 3, 6 and 7, spinobulbar muscular atrophy and dentatorubral-pallidoluysian atrophy. There may be a link between diseases caused by polyglutamine and polyalanine expansion mutations, in the form of frameshifting of the original SCA3 gene product from the CAG/polyglutamine-encoding frame to the GCA/polyalanine-encoding frame. Ribosomal slippage during translation of the SCA3 protein has been proposed as the mechanism resulting in the shift from the polyglutamine-encoding to the polyalanine-encoding frame. A dinucleotide deletion or single nucleotide insertion within the polyglutamine tract of huntingtin exon 1 would shift the CAG, polyglutamine-encoding frame by +1 (+1 frameshift) to the GCA, polyalanine-encoding frame and introduce a novel epitope to the C terminus of Htt exon 1 (APAAAPAATRPGCG).
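
The shift from the CAG frame to the GCA frame can be checked on a toy repeat tract. In the Python sketch below, the tract length, the position of the change, and the choice of "C" as the inserted base are all assumptions made for illustration; the amino acid assignments are from the standard genetic code.

```python
# Standard genetic code entries for the codons that appear in this example.
AMINO_ACID = {"CAG": "Gln", "GCA": "Ala", "CCA": "Pro"}


def read_codons(seq: str) -> list[str]:
    """Split a sequence into consecutive triplets starting at the first base."""
    return [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]


tract = "CAG" * 6                       # toy polyglutamine-encoding tract
inserted = tract[:6] + "C" + tract[6:]  # single-nucleotide insertion after two codons
deleted = tract[:6] + tract[8:]         # dinucleotide deletion after two codons

for label, seq in (("original", tract), ("insertion", inserted), ("deletion", deleted)):
    codons = read_codons(seq)
    print(label, codons, [AMINO_ACID.get(c, "?") for c in codons])
```

In both altered tracts the codons downstream of the change read GCA (alanine) instead of CAG (glutamine), consistent with the frame change described above.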


Diseases

Several diseases have frameshift mutations as at least part of the cause. Knowing prevalent mutations can also aid in the diagnosis of the disease. Currently there are attempts to use frameshift mutations beneficially in the treatment of diseases by changing the reading frame.


Cancer

Frameshift mutations are known to be a factor in colorectal cancer as well as other cancers with microsatellite instability. As stated previously, frameshift mutations are more likely to occur in a region of repeat sequence. When DNA mismatch repair does not fix the addition or deletion of bases, these mutations are more likely to be pathogenic. This may be in part because the tumor is not told to stop growing. Experiments in yeast and bacteria help to show characteristics of microsatellites that may contribute to defective DNA mismatch repair. These include the length of the microsatellite, the makeup of the genetic material and how pure the repeats are. Based on experimental results, longer microsatellites have a higher rate of frameshift mutations. The flanking DNA can also contribute to frameshift mutations.

In prostate cancer a frameshift mutation changes the open reading frame (ORF) and prevents apoptosis from occurring. This leads to unregulated growth of the tumor. While there are environmental factors that contribute to the progression of prostate cancer, there is also a genetic component. During testing of coding regions to identify mutations, 116 genetic variants were discovered, including 61 frameshift mutations. There are over 500 mutations in the BRCA1 gene on chromosome 17 that seem to play a role in the development of breast and ovarian cancer, many of which are frameshifts.


Crohn's disease

Crohn's disease has an association with the NOD2 gene. The mutation is an insertion of a cytosine at position 3020 (3020insC). This shifts the reading frame and leads to a premature stop codon, shortening the protein that would otherwise be translated. When the protein forms normally, it responds to bacterial lipopolysaccharides, whereas the 3020insC mutation prevents the protein from being responsive.
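The effect of a single-base insertion on the reading frame can be sketched in a few lines of Python. The sequence below is a made-up toy example, not the NOD2 sequence, and the helper names `translate` and `insert_base` are illustrative assumptions; only the standard genetic code table is taken as given.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from position 0 until the first stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def insert_base(dna, position, base):
    """Return a copy of dna with one base inserted before the given 0-based position."""
    return dna[:position] + base + dna[position:]


# Hypothetical toy coding sequence: ATG GCT CTT GAC AAG GTT TAA
wild_type = "ATGGCTCTTGACAAGGTTTAA"
# Insert a single C right after the start codon, shifting every downstream codon.
mutant = insert_base(wild_type, 3, "C")

print(translate(wild_type))  # MALDKV
print(translate(mutant))     # MRS (the shifted frame reads TGA as an early stop)
```

In this toy case the shifted frame reaches a stop codon after only three residues, so the product is both altered and truncated, which is the general pattern described for 3020insC.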


Cystic fibrosis

Cystic fibrosis (CF) is a disease based on mutations in the CF transmembrane conductance regulator (CFTR) gene. Over 1500 mutations have been identified, but not all cause the disease. Most cases of cystic fibrosis result from the ∆F508 mutation, an in-frame deletion of three nucleotides that removes a single amino acid (phenylalanine at position 508). Two frameshift mutations are of interest in diagnosing CF: CF1213delT and CF1154-insTC. Both commonly occur in tandem with at least one other mutation. They both lead to a small decrease in lung function and occur in about 1% of patients tested. These mutations were identified through Sanger sequencing.
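Whether an insertion or deletion shifts the reading frame depends only on whether its length is divisible by three. A minimal sketch, assuming the indel lengths can be read off the mutation names (F508del, another name for ∆F508, removes three bases; 1213delT removes one; 1154insTC adds two); the function name `is_frameshift` is hypothetical:

```python
def is_frameshift(indel_length):
    """An indel shifts the reading frame unless its length is a multiple of three."""
    if indel_length <= 0:
        raise ValueError("indel length must be a positive number of bases")
    return indel_length % 3 != 0


# Lengths inferred from the mutation names; treat them as illustrative inputs.
examples = {
    "F508del (3-base deletion)": 3,
    "1213delT (1-base deletion)": 1,
    "1154insTC (2-base insertion)": 2,
}

for name, length in examples.items():
    label = "frameshift" if is_frameshift(length) else "in-frame"
    print(name, "->", label)
```

The same check explains why a 3 base pair deletion changes or removes one amino acid while leaving the downstream codons intact.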


HIV

CCR5 is one of the cell entry co-factors associated with HIV. It is most frequently involved with nonsyncytium-inducing strains and is most apparent in HIV patients as opposed to AIDS patients. A 32 base pair deletion in CCR5 has been identified as a mutation that greatly reduces the likelihood of an HIV infection. Because 32 is not divisible by three, the deletion shifts the open reading frame (ORF) and leads to a premature stop codon, which results in the loss of HIV-coreceptor function in vitro. CCR5-1 is considered the wild type and CCR5-2 the mutant allele. Those heterozygous for the CCR5 mutation were less susceptible to the development of HIV. In one study, despite high exposure to HIV, no one homozygous for the CCR5 mutation tested positive for HIV.


Tay–Sachs disease

Tay–Sachs disease is a fatal disease affecting the central nervous system. It is most frequently found in infants and small children. Disease progression begins in the womb, but symptoms do not appear until approximately 6 months of age. There is no cure for the disease. Mutations in the β-hexosaminidase A (Hex A) gene are known to affect the onset of Tay–Sachs, with 78 mutations of different types described, 67 of which are known to cause disease. Most of the observed mutations (65 of 78) are single base substitutions, or SNPs; the remainder are 11 deletions (1 large and 10 small) and 2 insertions. Eight of the observed mutations are frameshifts: 6 deletions and 2 insertions. A 4 base pair insertion in exon 11 is observed in 80% of Tay–Sachs disease cases in the Ashkenazi Jewish population. The frameshift mutations lead to an early stop codon, which is known to play a role in the disease in infants. Delayed-onset disease appears to be caused by 4 different mutations, one being a 3 base pair deletion.


Smith–Magenis syndrome

Smith–Magenis syndrome (SMS) is a complex syndrome involving intellectual disabilities, sleep disturbance, behavioural problems, and a variety of craniofacial, skeletal, and visceral anomalies. The majority of SMS cases harbor an ~3.5 Mb common deletion that encompasses the retinoic acid induced-1 (''RAI1'') gene. Other cases illustrate variability in the SMS phenotype not previously shown for RAI1 mutation, including hearing loss, self-abusive behaviours, and mild global delays. Sequencing of RAI1 revealed mutation of a heptameric C-tract (CCCCCCC) in exon 3, resulting in frameshift mutations. Of the seven reported frameshift mutations occurring in poly C-tracts in RAI1, four (~57%) occur at this heptameric C-tract. The results indicate that this heptameric C-tract is a preferential hotspot for single-nucleotide insertions/deletions (SNindels) and therefore a primary target for analysis in patients suspected of having mutations in RAI1.
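Homopolymer runs such as the heptameric C-tract can be located with a simple pattern scan. The following is a sketch only: the sequence is a made-up toy string rather than RAI1 exon 3, and the function name `find_c_tracts` and its `min_len` parameter are assumptions chosen for illustration.

```python
import re


def find_c_tracts(dna, min_len=7):
    """Return (start, length) for every run of at least min_len consecutive C bases."""
    pattern = re.compile("C{%d,}" % min_len)
    return [(m.start(), len(m.group())) for m in pattern.finditer(dna.upper())]


# Hypothetical toy sequence with a 7-C run, a short 3-C run, and an 8-C run.
toy = "ATGGA" + "C" * 7 + "GTTA" + "C" * 3 + "AGG" + "C" * 8 + "TGA"

print(find_c_tracts(toy))  # [(5, 7), (22, 8)]
```

Positions are 0-based offsets into the string; the short 3-C run is ignored because it falls below the chosen threshold.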


Hypertrophic cardiomyopathy

Hypertrophic cardiomyopathy is the most common cause of sudden death in young people, including trained athletes, and is caused by mutations in genes encoding proteins of the cardiac sarcomere. Mutations in the troponin C gene (''TNNC1'') are a rare genetic cause of hypertrophic cardiomyopathy. One study indicated that a frameshift mutation (c.363dupG or p.Gln122AlafsX30) in troponin C was the cause of hypertrophic cardiomyopathy (and sudden cardiac death) in a 19-year-old male.


Cures

Cures for diseases caused by frameshift mutations are rare, and research is ongoing. One example is primary immunodeficiency (PID), an inherited condition which can lead to an increase in infections. There are 120 genes and 150 mutations that play a role in primary immunodeficiencies. The standard treatment is currently gene therapy, but this is a highly risky treatment and can often lead to other diseases, such as leukemia. Gene therapy procedures include modifying the zinc finger nuclease fusion protein, cleaving both ends of the mutation, which in turn removes it from the sequence.

Antisense-oligonucleotide mediated exon skipping is another possibility, used for Duchenne muscular dystrophy. This process passes over the mutation so that the rest of the sequence remains in frame and the function of the protein stays intact. It does not cure the disease, however; it only treats symptoms, and it is practical only in structural proteins or other repetitive genes.

A third form of repair is revertant mosaicism, which occurs naturally when a reverse mutation or a mutation at a second site corrects the reading frame. This reversion may happen by intragenic recombination, mitotic gene conversion, second-site DNA slipping or site-specific reversion. It is possible in several diseases, such as X-linked severe combined immunodeficiency (SCID), Wiskott–Aldrich syndrome, and Bloom syndrome. There are no drugs or other pharmacogenomic methods that help with PIDs.

A 2003 European patent (EP1369126A1) by Bork records a method for the prevention of cancers and for the curative treatment of cancers and precancers such as DNA-mismatch repair deficient (MMR) sporadic tumours and HNPCC associated tumours. The idea is to use immunotherapy with combinatorial mixtures of tumour-specific frameshift mutation-derived peptides to elicit a cytotoxic T-cell response specifically directed against tumour cells (European Patent EP1369126A1, December 10, 2003, "Use of coding microsatellite region frameshift mutation-derived peptides for treating cancer" by Bork ''et al'').
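Both second-site reversion and exon skipping come down to the same arithmetic: the reading frame downstream is intact when the net change in length is a multiple of three. The sketch below illustrates that reasoning under simplifying assumptions; the function names and the example lengths are hypothetical, and it ignores whether the residues between two compensating mutations are still functional.

```python
def net_frame_shift(indels):
    """Sum signed indel lengths (+ for insertions, - for deletions) modulo 3."""
    return sum(indels) % 3


def frame_restored(indels):
    """True if the combined indels leave the downstream reading frame unchanged."""
    return net_frame_shift(indels) == 0


def skipping_keeps_frame(wild_type_exon_length):
    """Skipping a whole exon keeps later exons in frame only if the exon,
    measured in the unmutated gene, is a multiple of three bases long."""
    return wild_type_exon_length % 3 == 0


# A +1 insertion alone shifts the frame; a later -1 deletion compensates for it.
print(frame_restored([+1]))        # False
print(frame_restored([+1, -1]))    # True
# Two single-base deletions do not cancel out; a third one would.
print(frame_restored([-1, -1]))    # False
# Hypothetical exon lengths for the exon-skipping case.
print(skipping_keeps_frame(120))   # True
print(skipping_keeps_frame(121))   # False
```

This is also why exon skipping is limited in practice: the skipped exon must be dispensable for function as well as a multiple of three bases long.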


See also

* Translational frameshift
* Mutation
* Transcription (genetics)
* Translation (biology)
* codon
* protein
* reading frame
* point mutation
* Crohn's disease
* Tay–Sachs disease




External links

* NCBI dbSNP database - "a central repository for both single base nucleotide substitutions and short deletion and insertion polymorphisms"
* A tool that aligns a protein against a DNA sequence, allowing frameshifts and introns
* FastY - compares a DNA sequence to a protein sequence database, allowing gaps and frameshifts
* Path - tool that compares two frameshift proteins (back-translation principle)
* HGMD - Human Gene Mutation Database