RNA Splicing
   HOME

TheInfoList



OR:

RNA splicing is a process in molecular biology where a newly-made precursor messenger RNA (pre- mRNA) transcript is transformed into a
mature messenger RNA Mature messenger RNA, often abbreviated as mature mRNA is a eukaryotic RNA transcript that has been spliced and processed and is ready for translation in the course of protein synthesis. Unlike the eukaryotic RNA immediately after transcription ...
( mRNA). It works by removing all the
intron An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene. ...
s (non-coding regions of RNA) and ''splicing'' back together
exon An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term ''exon'' refers to both the DNA sequence within a gene and to the corresponding sequen ...
s (coding regions). For nuclear-encoded genes, splicing occurs in the
nucleus Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to: *Atomic nucleus, the very dense central region of an atom *Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA Nucle ...
either during or immediately after
transcription Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including: Genetics * Transcription (biology), the copying of DNA into RNA, the fir ...
. For those eukaryotic genes that contain introns, splicing is usually needed to create an mRNA molecule that can be translated into protein. For many eukaryotic introns, splicing occurs in a series of reactions which are catalyzed by the spliceosome, a complex of small nuclear ribonucleoproteins ( snRNPs). There exist self-splicing introns, that is, ribozymes that can catalyze their own excision from their parent RNA molecule. The process of transcription, splicing and translation is called
gene expression Gene expression is the process by which information from a gene is used in the synthesis of a functional gene product that enables it to produce end products, protein or non-coding RNA, and ultimately affect a phenotype, as the final effect. The ...
, the
central dogma of molecular biology The central dogma of molecular biology is an explanation of the flow of genetic information within a biological system. It is often stated as "DNA makes RNA, and RNA makes protein", although this is not its original meaning. It was first stated by ...
.


Splicing pathways

Several methods of RNA splicing occur in nature; the type of splicing depends on the structure of the spliced intron and the catalysts required for splicing to occur.


Spliceosomal complex


Introns

The word ''intron'' is derived from the terms ''intragenic region'', and ''intracistron'', that is, a segment of DNA that is located between two exons of a gene. The term intron refers to both the DNA sequence within a gene and the corresponding sequence in the unprocessed RNA transcript. As part of the RNA processing pathway, introns are removed by RNA splicing either shortly after or concurrent with
transcription Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including: Genetics * Transcription (biology), the copying of DNA into RNA, the fir ...
. Introns are found in the genes of most organisms and many viruses. They can be located in a wide range of genes, including those that generate
proteins Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
, ribosomal RNA (rRNA), and
transfer RNA Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ac ...
(tRNA). Within introns, a donor site (5' end of the intron), a branch site (near the 3' end of the intron) and an acceptor site (3' end of the intron) are required for splicing. The splice donor site includes an almost invariant sequence GU at the 5' end of the intron, within a larger, less highly conserved region. The splice acceptor site at the 3' end of the intron terminates the intron with an almost invariant AG sequence. Upstream (5'-ward) from the AG there is a region high in
pyrimidine Pyrimidine (; ) is an aromatic, heterocyclic, organic compound similar to pyridine (). One of the three diazines (six-membered heterocyclics with two nitrogen atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring. The other ...
s (C and U), or polypyrimidine tract. Further upstream from the polypyrimidine tract is the branchpoint, which includes an adenine nucleotide involved in lariat formation. The
consensus sequence In molecular biology and bioinformatics, the consensus sequence (or canonical sequence) is the calculated order of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It serves as a simplified r ...
for an intron (in IUPAC nucleic acid notation) is: G-G- utG-U-R-A-G-U (donor site) ... intron sequence ... Y-U-R-A-C (branch sequence 20-50 nucleotides upstream of acceptor site) ... Y-rich-N-C-A-G- utG (acceptor site). However, it is noted that the specific sequence of intronic splicing elements and the number of nucleotides between the branchpoint and the nearest 3’ acceptor site affect splice site selection. Also, point mutations in the underlying DNA or errors during transcription can activate a ''cryptic splice site'' in part of the transcript that usually is not spliced. This results in a
mature messenger RNA Mature messenger RNA, often abbreviated as mature mRNA is a eukaryotic RNA transcript that has been spliced and processed and is ready for translation in the course of protein synthesis. Unlike the eukaryotic RNA immediately after transcription ...
with a missing section of an exon. In this way, a point mutation, which might otherwise affect only a single amino acid, can manifest as a
deletion Deletion or delete may refer to: Computing * File deletion, a way of removing a file from a computer's file system * Code cleanup, a way of removing unnecessary variables, data structures, cookies, and temporary files in a programming language * ...
or truncation in the final protein.


Formation and activity

Splicing is catalyzed by the spliceosome, a large RNA-protein complex composed of five small nuclear ribonucleoproteins ( snRNPs). Assembly and activity of the spliceosome occurs during transcription of the pre-mRNA. The RNA components of snRNPs interact with the intron and are involved in catalysis. Two types of spliceosomes have been identified (major and minor) which contain different snRNPs. *The major spliceosome splices introns containing GU at the 5' splice site and AG at the 3' splice site. It is composed of the U1, U2, U4, U5, and U6 snRNPs and is active in the nucleus. In addition, a number of proteins including
U2 small nuclear RNA auxiliary factor 1 Splicing factor U2AF 35 kDa subunit is a protein that in humans is encoded by the ''U2AF1'' gene. Function This gene belongs to the splicing factor SR family of genes . U2AF1 is a subunit of the U2 Auxiliary Factor complex alongside a larger ...
(U2AF35), U2AF2 (U2AF65) and SF1 are required for the assembly of the spliceosome. The spliceosome forms different complexes during the splicing process: :*Complex E :**The U1 snRNP binds to the GU sequence at the 5' splice site of an intron; :** Splicing factor 1 binds to the intron branch point sequence; :**U2AF1 binds at the 3' splice site of the intron; :** U2AF2 binds to the polypyrimidine tract; :*Complex A (pre-spliceosome) :**The U2 snRNP displaces SF1 and binds to the branch point sequence and ATP is hydrolyzed; :*Complex B (pre-catalytic spliceosome) :**The U5/U4/U6 snRNP trimer binds, and the U5 snRNP binds exons at the 5' site, with U6 binding to U2; :*Complex B* :**The U1 snRNP is released, U5 shifts from exon to intron, and the U6 binds at the 5' splice site; :*Complex C (catalytic spliceosome) :**U4 is released, U6/U2 catalyzes transesterification, making the 5'-end of the intron ligate to the A on intron and form a lariat, U5 binds exon at 3' splice site, and the 5' site is cleaved, resulting in the formation of the lariat; :*Complex C* (post-spliceosomal complex) :**U2/U5/U6 remain bound to the lariat, and the 3' site is cleaved and exons are ligated using ATP hydrolysis. The spliced RNA is released, the lariat is released and degraded, and the snRNPs are recycled. :This type of splicing is termed ''canonical splicing'' or termed the ''lariat pathway'', which accounts for more than 99% of splicing. By contrast, when the intronic flanking sequences do not follow the GU-AG rule, ''noncanonical splicing'' is said to occur (see "minor spliceosome" below). *The
minor spliceosome The minor spliceosome is a ribonucleoprotein complex that catalyses the removal ( splicing) of an atypical class of spliceosomal introns (U12-type) from messenger RNAs in some clades of eukaryotes. This process is called noncanonical splicing, as op ...
is very similar to the major spliceosome, but instead it splices out rare introns with different splice site sequences. While the minor and major spliceosomes contain the same U5 snRNP, the minor spliceosome has different but functionally analogous snRNPs for U1, U2, U4, and U6, which are respectively called U11, U12, U4atac, and U6atac.


Recursive splicing

In most cases, splicing removes introns as single units from precursor mRNA transcripts. However, in some cases, especially in mRNAs with very long introns, splicing happens in steps, with part of an intron removed and then the remaining intron is spliced out in a following step. This has been found first in the ''
Ultrabithorax Ultrabithorax (Ubx) is a homeobox gene found in insects, and is used in the regulation of patterning in morphogenesis. There are many possible products of this gene, which function as transcription factors. Ubx is used in the specification of se ...
'' (''Ubx'') gene of the fruit fly, '' Drosophila melanogaster'', and a few other ''Drosophila'' genes, but cases in humans have been reported as well.


Trans-splicing

Trans-splicing is a form of splicing that removes introns or
outron An outron is a nucleotide sequence at the 5' end of the primary transcript of a gene that is removed by a special form of RNA splicing during maturation of the final RNA product. Whereas intron sequences are located inside the gene, outron sequ ...
s, and joins two exons that are not within the same RNA transcript. Trans-splicing can occur between two different endogenous pre-mRNAs or between an endogenous and an exogenous (such as from viruses) or artificial RNAs.


Self-splicing

Self-splicing occurs for rare introns that form a ribozyme, performing the functions of the spliceosome by RNA alone. There are three kinds of self-splicing introns, '' Group I'', ''
Group II A group is a number of persons or things that are located, gathered, or classed together. Groups of people * Cultural group, a group whose members share the same cultural identity * Ethnic group, a group whose members share the same ethnic ide ...
'' and '' Group III''. Group I and II introns perform splicing similar to the spliceosome without requiring any protein. This similarity suggests that Group I and II introns may be evolutionarily related to the spliceosome. Self-splicing may also be very ancient, and may have existed in an RNA world present before protein. Two transesterifications characterize the mechanism in which group I introns are spliced: # 3'OH of a free guanine nucleoside (or one located in the intron) or a nucleotide cofactor (GMP, GDP, GTP) attacks phosphate at the 5' splice site. # 3'OH of the 5' exon becomes a nucleophile and the second transesterification results in the joining of the two exons. The mechanism in which group II introns are spliced (two transesterification reaction like group I introns) is as follows: # The 2'OH of a specific adenosine in the intron attacks the 5' splice site, thereby forming the ''lariat'' # The 3'OH of the 5' exon triggers the second transesterification at the 3' splice site, thereby joining the exons together.


tRNA splicing

tRNA (also tRNA-like) splicing is another rare form of splicing that usually occurs in tRNA. The splicing reaction involves a different biochemistry than the spliceosomal and self-splicing pathways. In the yeast '' Saccharomyces cerevisiae'', a yeast tRNA splicing
endonuclease Endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain. Some, such as deoxyribonuclease I, cut DNA relatively nonspecifically (without regard to sequence), while many, typically called restriction endonucleases ...
heterotetramer, composed of TSEN54, TSEN2, TSEN34, and
TSEN15 tRNA-splicing endonuclease subunit Sen15 is an enzyme that in humans is encoded by the ''TSEN15'' gene. tRNA splicing is a fundamental process for cell growth and division. SEN15 is a subunit of the tRNA splicing endonuclease, which catalyzes the ...
, cleaves pre-tRNA at two sites in the acceptor loop to form a 5'-half tRNA, terminating at a 2',3'-cyclic phosphodiester group, and a 3'-half tRNA, terminating at a 5'-hydroxyl group, along with a discarded intron. Yeast tRNA kinase then phosphorylates the 5'-hydroxyl group using adenosine triphosphate. Yeast tRNA cyclic phosphodiesterase cleaves the cyclic phosphodiester group to form a 2'-phosphorylated 3' end. Yeast tRNA ligase adds an adenosine monophosphate group to the 5' end of the 3'-half and joins the two halves together. NAD-dependent 2'-phosphotransferase then removes the 2'-phosphate group.


Evolution

Splicing occurs in all the kingdoms or
domain Domain may refer to: Mathematics *Domain of a function, the set of input values for which the (total) function is defined **Domain of definition of a partial function **Natural domain of a partial function **Domain of holomorphy of a function * Do ...
s of life, however, the extent and types of splicing can be very different between the major divisions.
Eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
s splice many protein-coding
messenger RNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein. mRNA is created during the p ...
s and some non-coding RNAs. Prokaryotes, on the other hand, splice rarely and mostly non-coding RNAs. Another important difference between these two groups of organisms is that prokaryotes completely lack the spliceosomal pathway. Because spliceosomal introns are not conserved in all species, there is debate concerning when spliceosomal splicing evolved. Two models have been proposed: the intron late and intron early models (see intron evolution).


Biochemical mechanism

Spliceosomal splicing and self-splicing involve a two-step biochemical process. Both steps involve transesterification reactions that occur between RNA nucleotides. tRNA splicing, however, is an exception and does not occur by transesterification. Spliceosomal and self-splicing transesterification reactions occur via two sequential transesterification reactions. First, the 2'OH of a specific ''branchpoint'' nucleotide within the intron, defined during spliceosome assembly, performs a nucleophilic attack on the first nucleotide of the intron at the 5' splice site, forming the ''lariat intermediate''. Second, the 3'OH of the released 5' exon then performs an nucleophilic attack at the first nucleotide following the last nucleotide of the intron at the 3' splice site, thus joining the exons and releasing the intron lariat.


Alternative splicing

In many cases, the splicing process can create a range of unique proteins by varying the exon composition of the same mRNA. This phenomenon is then called
alternative splicing Alternative splicing, or alternative RNA splicing, or differential splicing, is an alternative splicing process during gene expression that allows a single gene to code for multiple proteins. In this process, particular exons of a gene may be ...
. Alternative splicing can occur in many ways. Exons can be extended or skipped, or introns can be retained. It is estimated that 95% of transcripts from multiexon genes undergo alternative splicing, some instances of which occur in a tissue-specific manner and/or under specific cellular conditions. Development of high throughput mRNA sequencing technology can help quantify the expression levels of alternatively spliced isoforms. Differential expression levels across tissues and cell lineages allowed computational approaches to be developed to predict the functions of these isoforms. Given this complexity, alternative splicing of pre-mRNA transcripts is regulated by a system of trans-acting proteins (activators and repressors) that bind to cis-acting sites or "elements" (enhancers and silencers) on the pre-mRNA transcript itself. These proteins and their respective binding elements promote or reduce the usage of a particular splice site. The binding specificity comes from the sequence and structure of the cis-elements, e.g. in HIV-1 there are many donor and acceptor splice sites. Among the various splice sites, ssA7, which is 3' acceptor site, folds into three stem loop structures, i.e. Intronic splicing silencer (ISS), Exonic splicing enhancer (ESE), and Exonic splicing silencer (ESSE3). Solution structure of Intronic splicing silencer and its interaction to host protein hnRNPA1 give insight into specific recognition. However, adding to the complexity of alternative splicing, it is noted that the effects of regulatory factors are many times position-dependent. For example, a splicing factor that serves as a splicing activator when bound to an intronic enhancer element may serve as a repressor when bound to its splicing element in the context of an exon, and vice versa. In addition to the position-dependent effects of enhancer and silencer elements, the location of the branchpoint (i.e., distance upstream of the nearest 3’ acceptor site) also affects splicing. The secondary structure of the pre-mRNA transcript also plays a role in regulating splicing, such as by bringing together splicing elements or by masking a sequence that would otherwise serve as a binding element for a splicing factor.


Role of splicing/alternative splicing in HIV-integration

The process of splicing is linked with
HIV integration AIDS ("acquired immune deficiency syndrome") is caused by the human immunodeficiency virus (HIV). Individuals with HIV have what is referred to as a "HIV infection". When infected semen, vaginal secretions, or blood come in contact with the mucou ...
, as HIV-1 targets highly spliced genes.


Splicing response to DNA damage

DNA damage DNA repair is a collection of processes by which a cell identifies and corrects damage to the DNA molecules that encode its genome. In human cells, both normal metabolic activities and environmental factors such as radiation can cause DNA da ...
affects splicing factors by altering their post-translational modification, localization, expression and activity. Furthermore, DNA damage often disrupts splicing by interfering with its coupling to
transcription Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including: Genetics * Transcription (biology), the copying of DNA into RNA, the fir ...
. DNA damage also has an impact on the splicing and alternative splicing of genes intimately associated with DNA repair. For instance, DNA damages modulate the alternative splicing of the DNA repair genes '' Brca1'' and '' Ercc1''.


Experimental manipulation of splicing

Splicing events can be experimentally altered by binding steric-blocking antisense oligos, such as Morpholinos or Peptide nucleic acids to snRNP binding sites, to the branchpoint nucleotide that closes the lariat, or to splice-regulatory element binding sites.


Splicing errors and variation

It has been suggested that one third of all disease-causing mutations impact on splicing. Common errors include: *Mutation of a splice site resulting in loss of function of that site. Results in exposure of a premature stop codon, loss of an exon, or inclusion of an intron. *Mutation of a splice site reducing specificity. May result in variation in the splice location, causing insertion or deletion of amino acids, or most likely, a disruption of the reading frame. *Displacement of a splice site, leading to inclusion or exclusion of more RNA than expected, resulting in longer or shorter exons. Although many splicing errors are safeguarded by a cellular quality control mechanism termed
nonsense-mediated mRNA decay Nonsense-mediated mRNA decay (NMD) is a surveillance pathway that exists in all eukaryotes. Its main function is to reduce errors in gene expression by eliminating mRNA transcripts that contain premature stop codons. Translation of these aberrant ...
(NMD), a number of splicing-related diseases also exist, as suggested above. Allelic differences in mRNA splicing are likely to be a common and important source of phenotypic diversity at the molecular level, in addition to their contribution to genetic disease susceptibility. Indeed, genome-wide studies in humans have identified a range of genes that are subject to allele-specific splicing. In plants, variation for flooding stress tolerance correlated with stress-induced alternative splicing of transcripts associated with gluconeogenesis and other processes.


Protein splicing

In addition to RNA, proteins can undergo splicing. Although the biomolecular mechanisms are different, the principle is the same: parts of the protein, called inteins instead of introns, are removed. The remaining parts, called exteins instead of exons, are fused together. Protein splicing has been observed in a wide range of organisms, including bacteria,
archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaebac ...
, plants, yeast and humans.


See also

*
cDNA In genetics, complementary DNA (cDNA) is DNA synthesized from a single-stranded RNA (e.g., messenger RNA (mRNA) or microRNA (miRNA)) template in a reaction catalyzed by the enzyme reverse transcriptase. cDNA is often used to express a speci ...
* DBASS3/5 *
Exon junction complex An exon junction complex (EJC) is a protein complex which forms on a pre-messenger RNA strand at the junction of two exons which have been joined together during RNA splicing. The EJC has major influences on translation, surveillance and localiza ...
* mRNA capping *
Polyadenylation Polyadenylation is the addition of a poly(A) tail to an RNA transcript, typically a messenger RNA (mRNA). The poly(A) tail consists of multiple adenosine monophosphates; in other words, it is a stretch of RNA that has only adenine bases. In euk ...
* Post-transcriptional modification * RNA editing *
SWAP protein domain In molecular biology, the protein domain SWAP is derived from the term Suppressor-of-White-APricot, a splicing regulator from the model organism ''Drosophila melanogaster''. The protein domain is found in regulators that control splicing. It is f ...
, a splicing regulator


References


External links


Virtual Cell Animation Collection: mRNA Splicing
* {{DEFAULTSORT:Rna Splicing Gene expression Spliceosome