snRNPs (pronounced "snurps"), or
small
nuclear
ribo
nucleo
proteins, are
RNA-
protein complex
A protein complex or multiprotein complex is a group of two or more associated polypeptide chains. Protein complexes are distinct from multienzyme complexes, in which multiple catalytic domains are found in a single polypeptide chain.
Protein ...
es that combine with unmodified
pre-mRNA
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
and various other proteins to form a
spliceosome
A spliceosome is a large ribonucleoprotein (RNP) complex found primarily within the nucleus of eukaryotic cells. The spliceosome is assembled from small nuclear RNAs ( snRNA) and numerous proteins. Small nuclear RNA (snRNA) molecules bind to specif ...
, a large RNA-protein molecular complex upon which
splicing of
pre-mRNA
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
occurs. The action of snRNPs is essential to the removal of
introns
An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word ''intron'' is derived from the term ''intragenic region'', i.e. a region inside a gene."The notion of the cistron .e., gene ...
from
pre-mRNA
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
, a critical aspect of
post-transcriptional modification
Transcriptional modification or co-transcriptional modification is a set of biological processes common to most eukaryotic cells by which an RNA primary transcript is chemically altered following transcription from a gene to produce a mature, fu ...
of RNA, occurring only in the
nucleus of
eukaryotic cells
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bact ...
.
Additionally, ''
U7 snRNP'' is not involved in splicing at all, as U7 snRNP is responsible for processing the 3′ stem-loop of histone pre-mRNA.
The two essential components of snRNPs are
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
molecules and
RNA. The RNA found within each snRNP particle is known as ''small nuclear RNA'', or
snRNA, and is usually about 150
nucleotides
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules with ...
in length. The snRNA component of the snRNP gives specificity to individual introns by "
recognizing" the sequences of critical splicing signals at the 5' and 3' ends and
branch site of introns. The snRNA in snRNPs is similar to
ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
in that it directly incorporates both an enzymatic and a structural role.
SnRNPs were discovered by
Michael R. Lerner and
Joan A. Steitz.
Thomas R. Cech and
Sidney Altman
Sidney Altman (May 7, 1939 – April 5, 2022) was a Canadian-American molecular biologist, who was the Sterling Professor of Molecular, Cellular, and Developmental Biology and Chemistry at Yale University. In 1989, he shared the Nobel Prize in ...
also played a role in the discovery, winning the Nobel Prize for Chemistry in 1989 for their independent discoveries that RNA can act as a catalyst in cell development.
Types
At least five different kinds of snRNPs join the spliceosome to participate in
splicing. They can be visualized by
gel electrophoresis and are known individually as: U1, U2, U4, U5, and U6. Their snRNA components are known, respectively, as:
U1 snRNA
U1 spliceosomal RNA is the small nuclear RNA (snRNA) component of U1 snRNP (''small nuclear ribonucleoprotein''), an RNA-protein complex that combines with other snRNPs, unmodified pre-mRNA, and various other proteins to assemble a spliceosome, ...
,
U2 snRNA
U2 spliceosomal snRNAs are a species of small nuclear RNA (Small nuclear RNA, snRNA) molecules found in the major spliceosomal (Sm) machinery of virtually all eukaryotic organisms. ''In vivo'', U2 snRNA along with its associated polypeptides assem ...
,
U4 snRNA
The U4 small nuclear Ribo-Nucleic Acid (U4 snRNA) is a non-coding RNA component of the major or U2-dependent spliceosome – a eukaryotic molecular machine involved in the splicing of pre-messenger RNA (pre-mRNA). It forms a duplex with U6, ...
,
U5 snRNA
U5 snRNA is a small nuclear RNA (snRNA) that participates in RNA splicing as a component of the spliceosome. It forms the U5 snRNP (''small nuclear ribonucleoprotein'') by associating with several proteins including Prp8 - the largest and most co ...
, and
U6 snRNA
U6 snRNA is the non-coding small nuclear RNA (snRNA) component of U6 snRNP (''small nuclear ribonucleoprotein''), an RNA-protein complex that combines with other snRNPs, unmodified pre-mRNA, and various other proteins to assemble a spliceosome, a ...
.
In the mid-1990s, it was discovered that a variant class of snRNPs exists to help in the splicing of a class of introns found only in
metazoans
Animals are multicellular, eukaryotic organisms in the biological kingdom Animalia. With few exceptions, animals consume organic material, breathe oxygen, are able to move, can reproduce sexually, and go through an ontogenetic stage in ...
, with highly conserved 5' splice sites and branch sites. This variant class of snRNPs includes:
U11 snRNA
The U11 snRNA ( small nuclear ribonucleic acid) is an important non-coding RNA in the minor spliceosome protein complex, which activates the alternative splicing mechanism. The minor spliceosome is associated with similar protein components as ...
,
U12 snRNA
U12 minor spliceosomal RNA is formed from U12 small nuclear (snRNA), together with U4atac/ U6atac, U5, and U11 snRNAs and associated proteins, forms a spliceosome that cleaves a divergent class of low-abundance pre-mRNA introns. Although the U ...
,
U4atac snRNA, and
U6atac snRNA. While different, they perform the same functions as do
U1,
U2,
U4, and
U6, respectively.
Additionally, U7 snRNP is made of
U7 small nuclear RNA
The U7 small nuclear RNA (U7 snRNA) is an RNA molecule and a component of the small nuclear ribonucleoprotein complex (U7 snRNP). The U7 snRNA is required for histone pre-mRNA processing.
The 5' end of the U7 snRNA binds the HDE (histone downstre ...
and associated proteins and is involved in the processing of the 3′ stem-loop of histone pre-mRNA.
Biogenesis
Small nuclear ribonucleoproteins (snRNPs) assemble in a tightly orchestrated and regulated process that involves both the
cell nucleus
The cell nucleus (pl. nuclei; from Latin or , meaning ''kernel'' or ''seed'') is a membrane-bound organelle found in eukaryotic cells. Eukaryotic cells usually have a single nucleus, but a few cell types, such as mammalian red blood cells, h ...
and
cytoplasm
In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. The ...
.
Synthesis and export of RNA in the nucleus
The
RNA polymerase II
RNA polymerase II (RNAP II and Pol II) is a multiprotein complex that transcribes DNA into precursors of messenger RNA (mRNA) and most small nuclear RNA (snRNA) and microRNA. It is one of the three RNAP enzymes found in the nucleus of eukaryoti ...
transcribes
U1,
U2,
U4,
U5 and the less abundant
U11,
U12 and U4atac (
snRNAs) acquire a m7G-cap which serves as an export signal. Nuclear export is mediated by CRM1.
Synthesis and storage of Sm proteins in the cytoplasm
The
Sm proteins are synthesized in the cytoplasm by
ribosomes
Ribosomes ( ) are macromolecular machines, found within all cells, that perform biological protein synthesis (mRNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA (mRNA) molecules to f ...
translating Sm
messenger RNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the p ...
, just like any other protein. These are stored in the cytoplasm in the form of three partially assembled rings complexes all associated with the pICln protein. They are a 6S pentamer complex of SmD1, SmD2, SmF, SmE and SmG with pICln, a 2-4S complex of SmB, possibly with SmD3 and pICln and the 20S methylosome, which is a large complex of SmD3, SmB, SmD1, pICln and the arginine methyltransferase-5 (PRMT5) protein. SmD3, SmB and SmD1 undergo
post-translational modification
Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosome ...
in the methylosome. These three Sm proteins have repeated
arginine
Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the am ...
-
glycine
Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid (carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogeni ...
motifs in the
C-terminal end
The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH). When the protein is ...
s of SmD1, SmD3 and SmB, and the arginine side chains are symmetrically dimethylated to ω-N
G, N
G'-dimethyl-arginine. It has been suggested that pICln, which occurs in all three precursor complexes but is absent in the mature snRNPs, acts as a specialized
chaperone, preventing premature assembly of Sm proteins.
Assembly of core snRNPs in the SMN complex
The
snRNAs (U1, U2, U4, U5, and the less abundant U11, U12 and U4atac) quickly interact with the
SMN (survival of motor neuron protein); encoded by ''
SMN1
Survival of motor neuron 1 (''SMN1''), also known as component of gems 1 or ''GEMIN1'', is a gene that encodes the SMN protein in humans.
Gene
''SMN1'' is the telomeric copy of the gene encoding the SMN protein; the centromeric copy is term ...
'' gene) and Gemins 2-8 (Gem-associated proteins:
GEMIN2,
GEMIN3
Probable ATP-dependent RNA helicase DDX20, also known as DEAD-box helicase 20 and gem-associated protein 3 (GEMIN3), is an enzyme that in humans is encoded by the ''DDX20'' gene.
Function
DEAD box proteins, characterized by the conserved motif ...
,
GEMIN4
Gem-associated protein 4 is a protein that in humans is encoded by the ''GEMIN4'' gene.
Function
The product of this gene is part of the SMN protein complex localized to the cytoplasm, nucleoli, and to discrete nuclear bodies called Gemini b ...
,
GEMIN5
Gem-associated protein 5 is a protein that in humans is encoded by the ''GEMIN5'' gene.
Function
Gem-associated protein 5 is part of the survival of motor neuron protein, SMN a large protein complex localized to both the cytoplasm and the cell ...
,
GEMIN6,
GEMIN7,
GEMIN8) forming the SMN complex. It is here that the snRNA binds to the SmD1-SmD2-SmF-SmE-SmG pentamer, followed by addition of the SmD3-SmB dimer to complete the Sm ring around the so-called
Sm site of the snRNA. This Sm site is a conserved sequence of nucleotides in these snRNAs, typically AUUUGUGG (where A, U and G represent the
nucleosides
Nucleosides are glycosylamines that can be thought of as nucleotides without a phosphate group. A nucleoside consists simply of a nucleobase (also termed a nitrogenous base) and a five-carbon sugar (ribose or 2'-deoxyribose) whereas a nucleotide ...
adenosine
Adenosine ( symbol A) is an organic compound that occurs widely in nature in the form of diverse derivatives. The molecule consists of an adenine attached to a ribose via a β-N9-glycosidic bond. Adenosine is one of the four nucleoside building ...
,
uridine
Uridine (symbol U or Urd) is a glycosylated pyrimidine analog containing uracil attached to a ribose ring (or more specifically, a ribofuranose) via a β-N1-glycosidic bond. The analog is one of the five standard nucleosides which make up nuclei ...
and
guanosine
Guanosine (symbol G or Guo) is a purine nucleoside comprising guanine attached to a ribose (ribofuranose) ring via a β-N9-glycosidic bond. Guanosine can be phosphorylated to become guanosine monophosphate (GMP), cyclic guanosine monophosphate (c ...
, respectively). After assembly of the Sm ring around the snRNA, the 5' terminal nucleoside (already modified to a
7-methylguanosine
7-Methylguanosine (m7G) is a modified purine nucleoside. It is a methylated version of guanosine and when found in human urine, it may be a biomarker of some types of cancer. In the RNAs, 7-methylguanosine have been used to study and examine the ...
cap) is hyper-methylated to 2,2,7-trimethylguanosine and the other (3') end of the snRNA is trimmed. This modification, and the presence of a complete Sm ring, is recognized by the snurportin 1 protein.
Final assembly of the snRNPs in the nucleus
The completed core snRNP-snurportin 1 complex is transported into the nucleus via the protein importin β. Inside the nucleus, the core snRNPs appear in the
Cajal bodies
Cajal bodies (CBs) also coiled bodies, are spherical nuclear bodies of 0.3–1.0 µm in diameter found in the nucleus of proliferative cells like embryonic cells and tumor cells, or metabolically active cells like neurons. CBs are membrane ...
, where final assembly of the snRNPs take place. This consists of additional proteins and other modifications specific to the particular snRNP (U1, U2, U4, U5). The biogenesis of the U6 snRNP occurs in the nucleus, although large amounts of free U6 are found in the cytoplasm. The
LSm
In molecular biology, LSm proteins are a family of RNA-binding proteins found in virtually every cellular organism. LSm is a contraction of 'like Sm', because the first identified members of the LSm protein family were the Sm proteins. LSm pro ...
ring may assemble first, and then associate with the
U6 snRNA
U6 snRNA is the non-coding small nuclear RNA (snRNA) component of U6 snRNP (''small nuclear ribonucleoprotein''), an RNA-protein complex that combines with other snRNPs, unmodified pre-mRNA, and various other proteins to assemble a spliceosome, a ...
.
Disassembly of snRNPs
The snRNPs are very long-lived, but are assumed to be eventually disassembled and degraded. Little is known about the degradation process.
Defective assembly
Defective function of the
survival of motor neuron
Survival of motor neuron or survival motor neuron (SMN) is a protein that in humans is encoded by the '' SMN1'' and '' SMN2'' genes.
SMN is found in the cytoplasm of all animal cells and also in the nuclear gems. It functions in transcriptional r ...
(SMN) protein in snRNP biogenesis, caused by a genetic defect in the ''
SMN1
Survival of motor neuron 1 (''SMN1''), also known as component of gems 1 or ''GEMIN1'', is a gene that encodes the SMN protein in humans.
Gene
''SMN1'' is the telomeric copy of the gene encoding the SMN protein; the centromeric copy is term ...
'' gene which codes for SMN, may account for the motor neuron pathology observed in the genetic disorder
spinal muscular atrophy
Spinal muscular atrophy (SMA) is a rare neuromuscular disorder that results in the loss of motor neurons and progressive muscle wasting. It is usually diagnosed in infancy or early childhood and if left untreated it is the most common genetic ...
.
Structures, function and organization
Several human and yeast snRNP structures were determined by the
cryo-electron microscopy
Cryogenic electron microscopy (cryo-EM) is a cryomicroscopy technique applied on samples cooled to cryogenic temperatures. For biological specimens, the structure is preserved by embedding in an environment of vitreous ice. An aqueous sample s ...
and successive
single particle analysis
Single particle analysis is a group of related computerized image processing techniques used to analyze images from transmission electron microscopy (TEM). These methods were developed to improve and extend the information obtainable from TEM imag ...
.
Recently, the human U1 snRNP core structure was determined by
X-ray crystallography
X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
(3CW1, 3PGW), followed by a structure of the U4 core snRNP (2Y9A), which yielded first insights into atomic contacts, especially the binding mode of the Sm proteins to the Sm site. The structure of U6 UsnRNA was solved in complex with a specific protein Prp24 (4N0T), as well as a structure of its 3'-
nucleotides
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules with ...
bound to the special Lsm2-8 protein ring (4M7A). The
PDB codes for the respective structures are mentioned in parenthesis.
The structures determined by single particle electron microscopy analysis are: human U1 snRNP,
human U11/U12 di-snRNP,
human U5 snRNP, U4/U6 di-snRNP, U4/U6∙U5 tri-snRNP.
The further progress determining the structures and functions of snRNPs and spliceosomes continues.
Anti-snRNP antibodies
Autoantibodies
An autoantibody is an antibody (a type of protein) produced by the immune system that is directed against one or more of the individual's own proteins. Many autoimmune diseases (notably lupus erythematosus) are associated with such antibodies.
Pr ...
may be produced against the body's own snRNPs, most notably the
anti-Sm antibodies targeted against the
Sm protein
In molecular biology, LSm proteins are a family of RNA-binding proteins found in virtually every cellular organism. LSm is a contraction of 'like Sm', because the first identified members of the LSm protein family were the Sm proteins. LSm pro ...
type of snRNP specifically in
systemic lupus erythematosus
Lupus, technically known as systemic lupus erythematosus (SLE), is an autoimmune disease in which the body's immune system mistakenly attacks healthy tissue in many parts of the body. Symptoms vary among people and may be mild to severe. Comm ...
(SLE).
References
External links
Joan Steitz's Short Talk: "SNURPs and Serendipity"*
{{Ribonucleoproteins
Molecular biology
RNA
Spliceosome
RNA splicing