Polyadenylation is the addition of a poly(A) tail to an RNA transcript, typically a messenger RNA (mRNA). The poly(A) tail consists of multiple adenosine monophosphates; in other words, it is a stretch of RNA that has only adenine bases. In eukaryotes, polyadenylation is part of the process that produces mature mRNA for translation. In many bacteria, the poly(A) tail promotes degradation of the mRNA. It therefore forms part of the larger process of gene expression.
The process of polyadenylation begins as the transcription of a gene terminates. The 3′-most segment of the newly made pre-mRNA is first cleaved off by a set of proteins; these proteins then synthesize the poly(A) tail at the RNA's 3′ end. In some genes these proteins add a poly(A) tail at one of several possible sites. Therefore, polyadenylation can produce more than one transcript from a single gene (alternative polyadenylation), similar to alternative splicing.
The poly(A) tail is important for the nuclear export, translation and stability of mRNA. The tail is shortened over time, and, when it is short enough, the mRNA is enzymatically degraded. However, in a few cell types, mRNAs with short poly(A) tails are stored for later activation by re-polyadenylation in the cytosol. In contrast, when polyadenylation occurs in bacteria, it promotes RNA degradation. This is also sometimes the case for eukaryotic non-coding RNAs.
mRNA molecules in both prokaryotes and eukaryotes have polyadenylated 3′ ends, but prokaryotic poly(A) tails are generally shorter and fewer prokaryotic mRNA molecules are polyadenylated.
Background on RNA
RNAs are a type of large biological molecule whose individual building blocks are called nucleotides. The name ''poly(A) tail'' (for polyadenylic acid tail) reflects the way RNA nucleotides are abbreviated, with a letter for the base the nucleotide contains (A for adenine, C for cytosine, G for guanine and U for uracil). RNAs are produced (''transcribed'') from a DNA template. By convention, RNA sequences are written in a 5′ to 3′ direction. The 5′ end is the part of the RNA molecule that is transcribed first, and the 3′ end is transcribed last. The 3′ end is also where the poly(A) tail is found on polyadenylated RNAs.
Messenger RNA (mRNA) is RNA that has a coding region that acts as a template for protein synthesis (''translation''). The rest of the mRNA, the ''untranslated regions'', tunes how active the mRNA is. There are also many RNAs that are not translated, called non-coding RNAs. Like the untranslated regions, many of these non-coding RNAs have regulatory roles.
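The 5′ to 3′ writing convention means that a poly(A) tail appears as a run of A's at the right-hand end of a written sequence. The following is a minimal sketch of that idea; the function name and the example sequence are made up for illustration, and real tail-length measurements rely on sequencing methods rather than simple string inspection.

```python
def poly_a_tail_length(rna: str) -> int:
    """Count the consecutive A residues at the 3' end of an RNA written 5'->3'."""
    rna = rna.upper()
    # Stripping trailing A's and comparing lengths gives the size of the 3' run.
    return len(rna) - len(rna.rstrip("A"))


# Hypothetical transcript body followed by a 12-nucleotide poly(A) tail.
example = "GGCUAAUAAAGCUUCGAUC" + "A" * 12
print(poly_a_tail_length(example))  # 12
```

Internal A's (such as those in the AAUAAA motif of the example) are not counted, because only the uninterrupted run at the 3′ end is measured.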
Nuclear polyadenylation
Function
In nuclear polyadenylation, a poly(A) tail is added to an RNA at the end of transcription. On mRNAs, the poly(A) tail protects the mRNA molecule from enzymatic degradation in the cytoplasm and aids in transcription termination, export of the mRNA from the nucleus, and translation. Almost all eukaryotic mRNAs are polyadenylated, with the exception of animal replication-dependent histone mRNAs. These are the only mRNAs in eukaryotes that lack a poly(A) tail, ending instead in a stem-loop structure followed by a purine-rich sequence, termed the histone downstream element, that directs where the RNA is cut so that the 3′ end of the histone mRNA is formed.
Many eukaryotic non-coding RNAs are always polyadenylated at the end of transcription. In some small RNAs, most notably microRNAs, the poly(A) tail is seen only in intermediary forms and not in the mature RNA, because the ends are removed during processing. For many long noncoding RNAs, however (a seemingly large group of regulatory RNAs that includes, for example, Xist, which mediates X chromosome inactivation), a poly(A) tail is part of the mature RNA.
Mechanism
The processive polyadenylation complex in the nucleus of eukaryotes works on products of RNA polymerase II, such as precursor mRNA. Here, a multi-protein complex ''(see components on the right)'' cleaves the 3′-most part of a newly produced RNA and polyadenylates the end produced by this cleavage. The cleavage is catalysed by the enzyme CPSF and occurs 10–30 nucleotides downstream of its binding site. This site often has the polyadenylation signal sequence AAUAAA on the RNA, but variants of it that bind more weakly to CPSF exist. Two other proteins add specificity to the binding to an RNA: CstF and CFI. CstF binds to a GU-rich region further downstream of CPSF's site. CFI recognises a third site on the RNA (a set of UGUAA sequences in mammals) and can recruit CPSF even if the AAUAAA sequence is missing. The polyadenylation signal – the sequence motif recognised by the RNA cleavage complex – varies between groups of eukaryotes. Most human polyadenylation sites contain the AAUAAA sequence, but this sequence is less common in plants and fungi.
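The relationship between the AAUAAA signal and the cleavage position can be illustrated with a short sketch that scans a sequence for the canonical hexamer and reports the window 10–30 nucleotides downstream. The function name is hypothetical, and measuring the offset from the 3′ end of the hexamer is an assumption made for this example; variant signals, the GU-rich CstF site and the CFI site are not modelled.

```python
import re

PAS = "AAUAAA"  # canonical polyadenylation signal on the RNA


def candidate_cleavage_windows(rna, min_offset=10, max_offset=30):
    """Return (signal_start, window_start, window_end) for each AAUAAA found.

    Coordinates are 0-based; the window is the half-open interval
    [window_start, window_end) and is clipped to the sequence length.
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as a convenience
    windows = []
    # A lookahead pattern also reports overlapping occurrences of the motif.
    for match in re.finditer(f"(?={PAS})", rna):
        signal_end = match.start() + len(PAS)
        start = min(signal_end + min_offset, len(rna))
        end = min(signal_end + max_offset, len(rna))
        windows.append((match.start(), start, end))
    return windows


# Hypothetical sequence: 5 nt, the signal, then 40 nt of downstream sequence.
print(candidate_cleavage_windows("G" * 5 + PAS + "C" * 40))  # [(5, 21, 41)]
```

A sequence without the hexamer yields an empty list, which in a real transcript would not rule out polyadenylation, since weaker signal variants and CFI-dependent recruitment exist.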
The RNA is typically cleaved before transcription termination, as CstF also binds to RNA polymerase II. Through a poorly understood mechanism (as of 2002), it signals for RNA polymerase II to slip off the transcript. Cleavage also involves the protein CFII, though it is unknown how. The cleavage site associated with a polyadenylation signal can vary by up to some 50 nucleotides.
When the RNA is cleaved, polyadenylation starts, catalysed by polyadenylate polymerase. Polyadenylate polymerase builds the poly(A) tail by adding adenosine monophosphate units from adenosine triphosphate to the RNA, cleaving off pyrophosphate. Another protein, PAB2, binds to the new, short poly(A) tail and increases the affinity of polyadenylate polymerase for the RNA. When the poly(A) tail is approximately 250 nucleotides long the enzyme can no longer bind to CPSF and polyadenylation stops, thus determining the length of the poly(A) tail. CPSF is in contact with RNA polymerase II, allowing it to signal the polymerase to terminate transcription. When RNA polymerase II reaches a "termination sequence" (5′-TTTATT-3′ on the DNA template and 5′-AAUAAA-3′ on the primary transcript), the end of transcription is signaled. The polyadenylation machinery is also physically linked to the spliceosome, a complex that removes introns from RNAs.
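The two forms of the termination sequence are related by base pairing: the transcript is complementary and antiparallel to the template strand, so TTTATT on the template (read 5′ to 3′) yields AAUAAA on the RNA (also read 5′ to 3′). A minimal sketch of that relationship follows; the function name is made up for illustration.

```python
# Template DNA base -> RNA base that pairs with it during transcription.
TEMPLATE_TO_RNA = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe_template(template_5to3: str) -> str:
    """Return the RNA (written 5'->3') made from a template strand written 5'->3'."""
    # The polymerase reads the template 3'->5', so the template is reversed
    # before each base is replaced by its pairing partner.
    return "".join(TEMPLATE_TO_RNA[base] for base in reversed(template_5to3.upper()))


print(transcribe_template("TTTATT"))  # AAUAAA
```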
Downstream effects
The poly(A) tail acts as the binding site for poly(A)-binding protein. Poly(A)-binding protein promotes export from the nucleus and translation, and inhibits degradation. This protein binds to the poly(A) tail prior to mRNA export from the nucleus and in yeast also recruits poly(A) nuclease, an enzyme that shortens the poly(A) tail and allows the export of the mRNA. Poly(A)-binding protein is exported to the cytoplasm with the RNA. mRNAs that are not exported are degraded by the exosome. Poly(A)-binding protein can also bind to, and thus recruit, several proteins that affect translation; one of these is initiation factor-4G, which in turn recruits the 40S ribosomal subunit. However, a poly(A) tail is not required for the translation of all mRNAs. Further, poly(A) tailing (oligo-adenylation) can determine the fate of RNA molecules that are usually not poly(A)-tailed (such as small non-coding RNAs and snRNAs) and thereby induce their decay.
Deadenylation
In eukaryotic somatic cells, the poly(A) tails of most mRNAs in the cytoplasm gradually get shorter, and mRNAs with shorter poly(A) tails are translated less and degraded sooner. However, it can take many hours before an mRNA is degraded. This deadenylation and degradation process can be accelerated by microRNAs complementary to the 3′ untranslated region of an mRNA. In immature egg cells, mRNAs with shortened poly(A) tails are not degraded, but are instead stored and translationally inactive. These short-tailed mRNAs are activated by cytoplasmic polyadenylation after fertilisation, during egg activation.
In animals, poly(A) ribonuclease (PARN) can bind to the 5′ cap and remove nucleotides from the poly(A) tail. The level of access to the 5′ cap and poly(A) tail is important in controlling how soon the mRNA is degraded. PARN deadenylates less if the RNA is bound by the initiation factors 4E (at the 5′ cap) and 4G (at the poly(A) tail), which is why translation reduces deadenylation. The rate of deadenylation may also be regulated by RNA-binding proteins. Additionally, RNA triple helix structures and RNA motifs such as the poly(A) tail 3′ end binding pocket slow deadenylation and inhibit poly(A) tail removal. Once the poly(A) tail is removed, the decapping complex removes the 5′ cap, leading to degradation of the RNA. Several other proteins are involved in deadenylation in budding yeast and human cells, most notably the CCR4-Not complex.
Cytoplasmic polyadenylation
There is polyadenylation in the cytosol of some animal cell types, namely in the germ line, during early embryogenesis and in post-synaptic sites of nerve cells. This lengthens the poly(A) tail of an mRNA with a shortened poly(A) tail, so that the mRNA will be translated. These shortened poly(A) tails are often less than 20 nucleotides, and are lengthened to around 80–150 nucleotides.
In the early mouse embryo, cytoplasmic polyadenylation of maternal RNAs from the egg cell allows the cell to survive and grow even though transcription does not start until the middle of the 2-cell stage (4-cell stage in human). In the brain, cytoplasmic polyadenylation is active during learning and could play a role in long-term potentiation, which is the strengthening of signal transmission from one nerve cell to another in response to nerve impulses and is important for learning and memory formation.
Cytoplasmic polyadenylation requires the RNA-binding proteins CPSF and CPEB, and can involve other RNA-binding proteins like Pumilio. Depending on the cell type, the polymerase can be the same type of polyadenylate polymerase (PAP) that is used in the nuclear process, or the cytoplasmic polymerase GLD-2.
Alternative polyadenylation
Many protein-coding genes have more than one polyadenylation site, so a gene can code for several mRNAs that differ in their 3′ end. The 3′ region of a transcript contains many polyadenylation signals (PAS). When more proximal PAS sites (closer to the 5′ end) are used, the 3′ untranslated region (3′ UTR) of the transcript is shorter. Studies in both humans and flies have shown tissue-specific alternative polyadenylation (APA), with neuronal tissues preferring distal PAS usage, leading to longer 3′ UTRs, and testis tissues preferring proximal PAS usage, leading to shorter 3′ UTRs. Studies have also shown a correlation between a gene's conservation level and its tendency to undergo alternative polyadenylation, with highly conserved genes exhibiting more APA. Highly expressed genes follow the same pattern. Ribo-sequencing data (sequencing of only mRNAs inside ribosomes) has shown that mRNA isoforms with shorter 3′ UTRs are more likely to be translated.
Since alternative polyadenylation changes the length of the 3′ UTR, it can also change which binding sites are available for microRNAs in the 3′ UTR. MicroRNAs tend to repress translation and promote degradation of the mRNAs they bind to, although there are examples of microRNAs that stabilise transcripts. Alternative polyadenylation can also shorten the coding region, thus making the mRNA code for a different protein, but this is much less common than just shortening the 3′ untranslated region.
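The effect of poly(A) site choice on microRNA binding sites can be sketched with a toy model in which each site is a single coordinate in the 3′ UTR, and an isoform keeps only the sites that lie upstream of its cleavage position. All names and coordinates below are hypothetical and chosen purely for illustration.

```python
def retained_sites(cleavage_pos, sites):
    """Return the names of binding sites located upstream of the cleavage position.

    `sites` maps a site name to its 0-based position within the 3' UTR;
    anything at or beyond `cleavage_pos` is absent from that isoform.
    """
    return sorted(name for name, pos in sites.items() if pos < cleavage_pos)


# Hypothetical 3' UTR with two microRNA binding sites and two poly(A) sites.
mirna_sites = {"site_a": 150, "site_b": 800}
poly_a_sites = {"proximal": 300, "distal": 1200}

for label, position in poly_a_sites.items():
    print(label, position, retained_sites(position, mirna_sites))
# proximal 300 ['site_a']
# distal 1200 ['site_a', 'site_b']
```

In this toy case the proximal isoform loses site_b and would escape regulation through it, which is the kind of difference the paragraph above describes.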
The choice of poly(A) site can be influenced by extracellular stimuli and depends on the expression of the proteins that take part in polyadenylation. For example, the expression of CstF-64, a subunit of cleavage stimulatory factor (CstF), increases in macrophages in response to lipopolysaccharides (a group of bacterial compounds that trigger an immune response). This results in the selection of weak poly(A) sites and thus shorter transcripts. This removes regulatory elements in the 3′ untranslated regions of mRNAs for defense-related products like lysozyme and TNF-α. These mRNAs then have longer half-lives and produce more of these proteins. RNA-binding proteins other than those in the polyadenylation machinery can also affect whether a polyadenylation site is used, as can DNA methylation near the polyadenylation signal.
Tagging for degradation in eukaryotes
For many non-coding RNAs, including tRNA, rRNA, snRNA, and snoRNA, polyadenylation is a way of marking the RNA for degradation, at least in yeast. This polyadenylation is done in the nucleus by the TRAMP complex, which maintains a tail of around 4 nucleotides at the 3′ end. The RNA is then degraded by the exosome. Poly(A) tails have also been found on human rRNA fragments, in the form of both homopolymeric (A only) and heteropolymeric (mostly A) tails.
In prokaryotes and organelles
In many bacteria, both mRNAs and non-coding RNAs can be polyadenylated. This poly(A) tail promotes degradation by the degradosome, which contains two RNA-degrading enzymes: polynucleotide phosphorylase and RNase E. Polynucleotide phosphorylase binds to the 3′ end of RNAs, and the 3′ extension provided by the poly(A) tail allows it to bind to RNAs whose secondary structure would otherwise block the 3′ end. Successive rounds of polyadenylation and degradation of the 3′ end by polynucleotide phosphorylase allow the degradosome to overcome these secondary structures. The poly(A) tail can also recruit RNases that cut the RNA in two. These bacterial poly(A) tails are about 30 nucleotides long.
In groups as different as animals and trypanosomes, the mitochondria contain both stabilising and destabilising poly(A) tails. Destabilising polyadenylation targets both mRNA and noncoding RNAs. The poly(A) tails are 43 nucleotides long on average. The stabilising ones start at the stop codon, and without them the stop codon (UAA) is not complete, as the genome only encodes the U or UA part. Plant mitochondria have only destabilising polyadenylation. Mitochondrial polyadenylation has never been observed in either budding or fission yeast.
While many bacteria and mitochondria have polyadenylate polymerases, they also have another type of polyadenylation, performed by polynucleotide phosphorylase itself. This enzyme is found in bacteria, mitochondria, plastids and as a constituent of the archaeal exosome (in those archaea that have an exosome). It can synthesise a 3′ extension where the vast majority of the bases are adenines. As in bacteria, polyadenylation by polynucleotide phosphorylase promotes degradation of the RNA in plastids and likely also in archaea.
Evolution
Although polyadenylation is seen in almost all organisms, it is not universal. However, the wide distribution of this modification and its presence in organisms from all three domains of life imply that the last universal common ancestor of all living organisms presumably had some form of polyadenylation system. A few organisms do not polyadenylate mRNA, which implies that they have lost their polyadenylation machineries during evolution. Although no examples of eukaryotes that lack polyadenylation are known, mRNAs from the bacterium ''Mycoplasma gallisepticum'' and the salt-tolerant archaean ''Haloferax volcanii'' lack this modification.
The most ancient polyadenylating enzyme is polynucleotide phosphorylase. This enzyme is part of both the bacterial degradosome and the archaeal exosome, two closely related complexes that recycle RNA into nucleotides. This enzyme degrades RNA by attacking the bond between the 3′-most nucleotides with a phosphate, breaking off a diphosphate nucleotide. This reaction is reversible, and so the enzyme can also extend RNA with more nucleotides. The heteropolymeric tail added by polynucleotide phosphorylase is very rich in adenine. The predominance of adenine is most likely the result of ADP concentrations being higher than those of other nucleotides, a consequence of using ATP as an energy currency, which made ADP more likely to be incorporated into this tail in early lifeforms. It has been suggested that the involvement of adenine-rich tails in RNA degradation prompted the later evolution of polyadenylate polymerases (the enzymes that produce poly(A) tails with no other nucleotides in them).
Polyadenylate polymerases are not as ancient. They have separately evolved in both bacteria and eukaryotes from CCA-adding enzyme, which is the enzyme that completes the 3′ ends of tRNAs. Its catalytic domain is homologous to that of other polymerases. It is presumed that the horizontal transfer of bacterial CCA-adding enzyme to eukaryotes allowed the archaeal-like CCA-adding enzyme to switch function to a poly(A) polymerase. Some lineages, like archaea and cyanobacteria, never evolved a polyadenylate polymerase.
Polyadenylate tails are observed in several RNA viruses, including Influenza A, Coronavirus, Alfalfa mosaic virus, and Duck Hepatitis A. Some viruses, such as HIV-1 and Poliovirus, inhibit the cell's poly(A)-binding protein (PABPC1) in order to favor expression of their own genes over the host cell's.
History
Poly(A) polymerase was first identified in 1960 as an enzymatic activity in extracts made from cell nuclei that could polymerise ATP, but not ADP, into polyadenine. Although identified in many types of cells, this activity had no known function until 1971, when poly(A) sequences were found in mRNAs. The only function of these sequences was thought at first to be protection of the 3′ end of the RNA from nucleases, but later the specific roles of polyadenylation in nuclear export and translation were identified. The polymerases responsible for polyadenylation were first purified and characterized in the 1960s and 1970s, but the large number of accessory proteins that control this process were discovered only in the early 1990s.
See also
* Simian virus 40 late polyadenylation signal (SVLPA)