Ribonucleic acid (RNA) is a
polymer
A polymer (; Greek '' poly-'', "many" + ''-mer'', "part")
is a substance or material consisting of very large molecules called macromolecules, composed of many repeating subunits. Due to their broad spectrum of properties, both synthetic a ...
ic molecule essential in various biological roles in
coding,
decoding,
regulation
Regulation is the management of complex systems according to a set of rules and trends. In systems theory, these types of rules exist in various fields of biology and society, but the term has slightly different meanings according to context. Fo ...
and
expression
Expression may refer to:
Linguistics
* Expression (linguistics), a word, phrase, or sentence
* Fixed expression, a form of words with a specific meaning
* Idiom, a type of fixed expression
* Metaphorical expression, a particular word, phrase, o ...
of
gene
In biology, the word gene (from , ; "... Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a b ...
s. RNA and deoxyribonucleic acid (
DNA) are
nucleic acids. Along with
lipid
Lipids are a broad group of naturally-occurring molecules which includes fats, waxes, sterols, fat-soluble vitamins (such as vitamins A, D, E and K), monoglycerides, diglycerides, phospholipids, and others. The functions of lipids includ ...
s,
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
s, and
carbohydrate
In organic chemistry, a carbohydrate () is a biomolecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms, usually with a hydrogen–oxygen atom ratio of 2:1 (as in water) and thus with the empirical formula (where ''m'' may or m ...
s, nucleic acids constitute one of the four major
macromolecules essential for all known forms of
life
Life is a quality that distinguishes matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes, from that which does not, and is defined by the capacity for Cell growth, growth, reaction to Stimu ...
. Like DNA, RNA is assembled as a chain of
nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecule ...
s, but unlike DNA, RNA is found in nature as a single strand folded onto itself, rather than a paired double strand. Cellular organisms use
messenger RNA (''mRNA'') to convey genetic information (using the
nitrogenous bases of
guanine
Guanine () ( symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine (uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
,
uracil
Uracil () (symbol U or Ura) is one of the four nucleobases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced b ...
,
adenine
Adenine () ( symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its deri ...
, and
cytosine
Cytosine () ( symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine (uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached (an ...
, denoted by the letters G, U, A, and C) that directs synthesis of specific proteins. Many
virus
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea.
Since Dmitri Ivanovsk ...
es encode their genetic information using an RNA
genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding g ...
.
Some RNA molecules play an active role within cells by catalyzing biological reactions, controlling
gene expression, or sensing and communicating responses to cellular signals. One of these active processes is
protein synthesis, a universal function in which RNA molecules direct the synthesis of proteins on
ribosomes. This process uses
transfer RNA (''tRNA'') molecules to deliver
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
s to the
ribosome, where
ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
(''rRNA'') then links amino acids together to form coded proteins.
Comparison with DNA
The chemical structure of RNA is very similar to that of
DNA, but differs in three primary ways:
* Unlike double-stranded DNA, RNA is usually a single-stranded molecule (ssRNA) in many of its biological roles and consists of much shorter chains of nucleotides. However,
double-stranded RNA
Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohydr ...
(dsRNA) can form and (moreover) a single RNA molecule can, by complementary base pairing, form intrastrand double helixes, as in tRNA.
* While the sugar-phosphate "backbone" of DNA contains ''
deoxyribose
Deoxyribose, or more precisely 2-deoxyribose, is a monosaccharide with idealized formula H−(C=O)−(CH2)−(CHOH)3−H. Its name indicates that it is a deoxy sugar, meaning that it is derived from the sugar ribose by loss of a hydroxy group. D ...
'', RNA contains ''
ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally-occurring form, , is a component of the ribonucleotides from which RNA is built, and so this compo ...
'' instead. Ribose has a
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydro ...
group attached to the pentose ring in the
2' position, whereas deoxyribose does not. The hydroxyl groups in the ribose backbone make RNA more chemically
labile
Lability refers to something that is constantly undergoing change or is likely to undergo change.
Biochemistry
In reference to biochemistry, this is an important concept as far as kinetics is concerned in metalloproteins. This can allow for th ...
than DNA by lowering the
activation energy
In chemistry and physics, activation energy is the minimum amount of energy that must be provided for compounds to result in a chemical reaction. The activation energy (''E''a) of a reaction is measured in joules per mole (J/mol), kilojoules p ...
of
hydrolysis
Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution, elimination, and solvation reactions in which water is the nucleophile.
Biological hydrolys ...
.
* The complementary base to
adenine
Adenine () ( symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its deri ...
in DNA is
thymine
Thymine () ( symbol T or Thy) is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidi ...
, whereas in RNA, it is
uracil
Uracil () (symbol U or Ura) is one of the four nucleobases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced b ...
, which is an
unmethylated form of thymine.
[
]
Like DNA, most biologically active RNAs, including
mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
,
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ...
,
rRNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosoma ...
,
snRNA
Small nuclear RNA (snRNA) is a class of small RNA molecules that are found within the splicing speckles and Cajal bodies of the cell nucleus in eukaryotic cells. The length of an average snRNA is approximately 150 nucleotides. They are transcri ...
s, and other
non-coding RNA
A non-coding RNA (ncRNA) is a functional RNA molecule that is not Translation (genetics), translated into a protein. The DNA sequence from which a functional non-coding RNA is transcribed is often called an RNA gene. Abundant and functionally im ...
s, contain self-complementary sequences that allow parts of the RNA to fold and pair with itself to form double helices. Analysis of these RNAs has revealed that they are highly structured. Unlike DNA, their structures do not consist of long double helices, but rather collections of short helices packed together into structures akin to proteins.
In this fashion, RNAs can achieve chemical
catalysis
Catalysis () is the process of increasing the rate of a chemical reaction by adding a substance known as a catalyst (). Catalysts are not consumed in the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recyc ...
(like enzymes). For instance, determination of the structure of the ribosome—an RNA-protein complex that catalyzes peptide bond formation—revealed that its active site is composed entirely of RNA.
Structure
Each
nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecule ...
in RNA contains a
ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally-occurring form, , is a component of the ribonucleotides from which RNA is built, and so this compo ...
sugar, with carbons numbered 1' through 5'. A base is attached to the 1' position, in general,
adenine
Adenine () ( symbol A or Ade) is a nucleobase (a purine derivative). It is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The three others are guanine, cytosine and thymine. Its deri ...
(A),
cytosine
Cytosine () ( symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine (uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached (an ...
(C),
guanine
Guanine () ( symbol G or Gua) is one of the four main nucleobases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine (uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside is c ...
(G), or
uracil
Uracil () (symbol U or Ura) is one of the four nucleobases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced b ...
(U). Adenine and guanine are
purine
Purine is a heterocyclic aromatic organic compound that consists of two rings ( pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted purines ...
s, cytosine and uracil are
pyrimidines. A
phosphate
In chemistry, a phosphate is an anion, salt, functional group or ester derived from a phosphoric acid. It most commonly means orthophosphate, a derivative of orthophosphoric acid .
The phosphate or orthophosphate ion is derived from phosph ...
group is attached to the 3' position of one ribose and the 5' position of the next. The phosphate groups have a negative charge each, making RNA a charged molecule (polyanion). The bases form
hydrogen bonds between cytosine and guanine, between adenine and uracil and between guanine and uracil.
However, other interactions are possible, such as a group of adenine bases binding to each other in a bulge,
or the GNRA
tetraloop that has a guanine–adenine base-pair.
An important structural component of RNA that distinguishes it from DNA is the presence of a
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydro ...
group at the 2' position of the
ribose sugar. The presence of this functional group causes the helix to mostly take the
A-form geometry, although in single strand dinucleotide contexts, RNA can rarely also adopt the B-form most commonly observed in DNA. The A-form geometry results in a very deep and narrow major groove and a shallow and wide minor groove. A second consequence of the presence of the 2'-hydroxyl group is that in conformationally flexible regions of an RNA molecule (that is, not involved in formation of a double helix), it can chemically attack the adjacent phosphodiester bond to cleave the backbone.
RNA is transcribed with only four bases (adenine, cytosine, guanine and uracil), but these bases and attached sugars can be modified in numerous ways as the RNAs mature.
Pseudouridine
Pseudouridine (abbreviated by the Greek letter psi- Ψ) is an isomer of the nucleoside uridine in which the uracil is attached via a carbon-carbon instead of a nitrogen-carbon glycosidic bond. (In this configuration, uracil is sometimes referred ...
(Ψ), in which the linkage between uracil and ribose is changed from a C–N bond to a C–C bond, and
ribothymidine (T) are found in various places (the most notable ones being in the TΨC loop of
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ...
). Another notable modified base is
hypoxanthine
Hypoxanthine is a naturally occurring purine derivative. It is occasionally found as a constituent of nucleic acids, where it is present in the anticodon of tRNA in the form of its nucleoside inosine. It has a tautomer known as 6-hydroxypurine. ...
, a deaminated adenine base whose
nucleoside
Nucleosides are glycosylamines that can be thought of as nucleotides without a phosphate group. A nucleoside consists simply of a nucleobase (also termed a nitrogenous base) and a five-carbon sugar (ribose or 2'-deoxyribose) whereas a nucleoti ...
is called
inosine
Inosine is a nucleoside that is formed when hypoxanthine is attached to a ribose ring (also known as a ribofuranose) via a β-N9- glycosidic bond. It was discovered in 1965 in analysis of RNA transferase.
Inosine is commonly found in tRNAs and is ...
(I). Inosine plays a key role in the
wobble hypothesis
A wobble base pair is a pairing between two nucleotides in RNA molecules that does not follow Watson-Crick base pair rules. The four main wobble base pairs are guanine-uracil (G-U), hypoxanthine-uracil (I-U), hypoxanthine-adenine (I-A), and hypo ...
of the
genetic code
The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
.
There are more than 100 other naturally occurring modified nucleosides. The greatest structural diversity of modifications can be found in
tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ...
, while pseudouridine and nucleosides with
2'-O-methylribose often present in rRNA are the most common. The specific roles of many of these modifications in RNA are not fully understood. However, it is notable that, in ribosomal RNA, many of the post-transcriptional modifications occur in highly functional regions, such as the peptidyl transferase center and the subunit interface, implying that they are important for normal function.
The functional form of single-stranded RNA molecules, just like proteins, frequently requires a specific
tertiary structure
Protein tertiary structure is the three dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains may i ...
. The scaffold for this structure is provided by
secondary structural elements that are hydrogen bonds within the molecule. This leads to several recognizable "domains" of secondary structure like
hairpin loops, bulges, and
internal loops. In order create, i.e., design, a RNA for any given secondary structure, two or three bases would not be enough, but four bases are enough. This is likely why nature has "chosen" a four base alphabet: less than four does not allow to create all structures, while more than four bases are not necessary. Since RNA is charged, metal ions such as
Mg2+ are needed to stabilise many secondary and
tertiary structures
Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a molecule of protein, DNA, or RNA, and that is important to its function. The structure of these molecules may be considered at any of several length s ...
.
The naturally occurring
enantiomer of RNA is
D-RNA composed of
D-ribonucleotides. All chirality centers are located in the
D-ribose. By the use of
L-ribose or rather
L-ribonucleotides,
L-RNA can be synthesized.
L-RNA is much more stable against degradation by
RNase
Ribonuclease (commonly abbreviated RNase) is a type of nuclease that catalyzes the degradation of RNA into smaller components. Ribonucleases can be divided into endoribonucleases and exoribonucleases, and comprise several sub-classes within t ...
.
Like other structured
biopolymers such as proteins, one can define topology of a folded RNA molecule. This is often done based on arrangement of intra-chain contacts within a folded RNA, termed as
circuit topology
The circuit topology of a folded linear polymer refers to the arrangement of its intra-molecular contacts. Examples of linear polymers with intra-molecular contacts are nucleic acids and proteins. Proteins fold via formation of contacts of variou ...
.
Synthesis
Synthesis of RNA is usually catalyzed by an enzyme—
RNA polymerase—using DNA as a template, a process known as
transcription
Transcription refers to the process of converting sounds (voice, music etc.) into letters or musical notes, or producing a copy of something in another medium, including:
Genetics
* Transcription (biology), the copying of DNA into RNA, the fir ...
. Initiation of transcription begins with the binding of the enzyme to a
promoter sequence in the DNA (usually found "upstream" of a gene). The DNA double helix is unwound by the
helicase
Helicases are a class of enzymes thought to be vital to all organisms. Their main function is to unpack an organism's genetic material. Helicases are motor proteins that move directionally along a nucleic acid phosphodiester backbone, separatin ...
activity of the enzyme. The enzyme then progresses along the template strand in the 3’ to 5’ direction, synthesizing a complementary RNA molecule with elongation occurring in the 5’ to 3’ direction. The DNA sequence also dictates where termination of RNA synthesis will occur.
Primary transcript
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
RNAs are often
modified by enzymes after transcription. For example, a
poly(A) tail
Polyadenylation is the addition of a poly(A) tail to an RNA transcript, typically a messenger RNA (mRNA). The poly(A) tail consists of multiple adenosine monophosphates; in other words, it is a stretch of RNA that has only adenine bases. In eu ...
and a
5' cap are added to eukaryotic
pre-mRNA
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
and
introns are removed by the
spliceosome
A spliceosome is a large ribonucleoprotein (RNP) complex found primarily within the nucleus of eukaryotic cells. The spliceosome is assembled from small nuclear RNAs ( snRNA) and numerous proteins. Small nuclear RNA (snRNA) molecules bind to specif ...
.
There are also a number of
RNA-dependent RNA polymerases that use RNA as their template for synthesis of a new strand of RNA. For instance, a number of
RNA viruses (such as poliovirus) use this type of enzyme to replicate their genetic material. Also, RNA-dependent RNA polymerase is part of the
RNA interference
RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
pathway in many organisms.
Types of RNA
Overview
Messenger RNA (mRNA) is the RNA that carries information from DNA to the
ribosome, the sites of protein synthesis (
translation
Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''transla ...
) in the cell. The mRNA is a copy of DNA. The coding sequence of the mRNA determines the
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha a ...
sequence in the
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, res ...
that is produced.
[ However, many RNAs do not code for protein (about 97% of the transcriptional output is non-protein-coding in eukaryotes).
These so-called ]non-coding RNA
A non-coding RNA (ncRNA) is a functional RNA molecule that is not Translation (genetics), translated into a protein. The DNA sequence from which a functional non-coding RNA is transcribed is often called an RNA gene. Abundant and functionally im ...
s ("ncRNA") can be encoded by their own genes (RNA genes), but can also derive from mRNA introns.[ The most prominent examples of non-coding RNAs are transfer RNA (tRNA) and ]ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
(rRNA), both of which are involved in the process of translation. There are also non-coding RNAs involved in gene regulation, RNA processing
Transcriptional modification or co-transcriptional modification is a set of biological processes common to most eukaryotic cells by which an RNA primary transcript is chemically altered following transcription from a gene to produce a mature, f ...
and other roles. Certain RNAs are able to catalyse
Catalysis () is the process of increasing the rate of a chemical reaction by adding a substance known as a catalyst (). Catalysts are not consumed in the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recyc ...
chemical reactions such as cutting and ligating other RNA molecules, and the catalysis of peptide bond formation in the ribosome;[ these are known as ]ribozyme
Ribozymes (ribonucleic acid enzymes) are RNA molecules that have the ability to catalyze specific biochemical reactions, including RNA splicing in gene expression, similar to the action of protein enzymes. The 1982 discovery of ribozymes demons ...
s.
In length
According to the length of RNA chain, RNA includes small RNA
Small RNA (sRNA) are polymeric RNA molecules that are less than 200 nucleotides in length, and are usually non-coding. RNA silencing is often a function of these molecules, with the most common and well-studied example being RNA interference (RN ...
and long RNA. Usually, small RNA
Small RNA (sRNA) are polymeric RNA molecules that are less than 200 nucleotides in length, and are usually non-coding. RNA silencing is often a function of these molecules, with the most common and well-studied example being RNA interference (RN ...
s are shorter than 200 nt in length, and long RNAs are greater than 200 nt long. Long RNAs, also called large RNAs, mainly include long non-coding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as mi ...
(lncRNA) and mRNA
In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein.
mRNA is created during the ...
. Small RNAs mainly include 5.8S ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
(rRNA), 5S rRNA
The 5S ribosomal RNA (5S rRNA) is an approximately 120 nucleotide-long ribosomal RNA molecule with a mass of 40 kDa. It is a structural and functional component of the large subunit of the ribosome in all domains of life (bacteria, archaea, and e ...
, transfer RNA (tRNA), microRNA
MicroRNA (miRNA) are small, single-stranded, non-coding RNA molecules containing 21 to 23 nucleotides. Found in plants, animals and some viruses, miRNAs are involved in RNA silencing and post-transcriptional regulation of gene expression. mi ...
(miRNA), small interfering RNA
Small interfering RNA (siRNA), sometimes known as short interfering RNA or silencing RNA, is a class of double-stranded RNA at first non-coding RNA molecules, typically 20-24 (normally 21) base pairs in length, similar to MicroRNA, miRNA, and op ...
(siRNA), small nucleolar RNA (snoRNAs), Piwi-interacting RNA
Piwi-interacting RNA (piRNA) is the largest class of small non-coding RNA, non-coding RNA molecules expressed in animal cells. piRNAs form RNA-protein complexes through interactions with piwi-subfamily Argonaute proteins. These piRNA complexes are ...
(piRNA), tRNA-derived small RNA (tsRNA) and small rDNA-derived RNA (srRNA).
There are certain exceptions as in the case of the 5S rRNA
The 5S ribosomal RNA (5S rRNA) is an approximately 120 nucleotide-long ribosomal RNA molecule with a mass of 40 kDa. It is a structural and functional component of the large subunit of the ribosome in all domains of life (bacteria, archaea, and e ...
of the members of the genus Halococcus ( Archaea), which have an insertion, thus increasing its size.
In translation
Messenger RNA (mRNA) carries information about a protein sequence to the ribosomes, the protein synthesis factories in the cell. It is coded so that every three nucleotides (a codon) corresponds to one amino acid. In eukaryotic
Eukaryotes () are organisms whose Cell (biology), cells have a cell nucleus, nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the ...
cells, once precursor mRNA (pre-mRNA) has been transcribed from DNA, it is processed to mature mRNA. This removes its introns—non-coding sections of the pre-mRNA. The mRNA is then exported from the nucleus to the cytoplasm
In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. ...
, where it is bound to ribosomes and translated
Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text. The English language draws a terminological distinction (which does not exist in every language) between ''transla ...
into its corresponding protein form with the help of tRNA
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ...
. In prokaryotic cells, which do not have nucleus and cytoplasm compartments, mRNA can bind to ribosomes while it is being transcribed from DNA. After a certain amount of time, the message degrades into its component nucleotides with the assistance of ribonuclease
Ribonuclease (commonly abbreviated RNase) is a type of nuclease that catalyzes the degradation of RNA into smaller components. Ribonucleases can be divided into endoribonucleases and exoribonucleases, and comprise several sub-classes within ...
s.[
Transfer RNA (tRNA) is a small RNA chain of about 80 ]nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecule ...
s that transfers a specific amino acid to a growing polypeptide chain at the ribosomal site of protein synthesis during translation. It has sites for amino acid attachment and an anticodon
Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ...
region for codon recognition that binds to a specific sequence on the messenger RNA chain through hydrogen bonding.[
]Ribosomal RNA
Ribosomal ribonucleic acid (rRNA) is a type of non-coding RNA which is the primary component of ribosomes, essential to all cells. rRNA is a ribozyme which carries out protein synthesis in ribosomes. Ribosomal RNA is transcribed from ribosom ...
(rRNA) is the catalytic component of the ribosomes. The rRNA is the component of the ribosome that hosts translation. Eukaryotic ribosomes contain four different rRNA molecules: 18S, 5.8S, 28S and 5S rRNA. Three of the rRNA molecules are synthesized in the nucleolus
The nucleolus (, plural: nucleoli ) is the largest structure in the nucleus of eukaryotic cells. It is best known as the site of ribosome biogenesis, which is the synthesis of ribosomes. The nucleolus also participates in the formation of ...
, and one is synthesized elsewhere. In the cytoplasm, ribosomal RNA and protein combine to form a nucleoprotein called a ribosome. The ribosome binds mRNA and carries out protein synthesis. Several ribosomes may be attached to a single mRNA at any time. Nearly all the RNA found in a typical eukaryotic cell is rRNA.
Transfer-messenger RNA (tmRNA) is found in many bacteria
Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one Cell (biology), biological cell. They constitute a large domain (biology), domain of prokaryotic microorganisms. Typically a few micrometr ...
and plastids. It tags proteins encoded by mRNAs that lack stop codons for degradation and prevents the ribosome from stalling.
Regulatory RNA
The earliest known regulators of gene expression were proteins known as repressors and activators – regulators with specific short binding sites within enhancer regions near the genes to be regulated. Later studies have shown that RNAs also regulate genes. There are several kinds of RNA-dependent processes in eukaryotes regulating the expression of genes at various points, such as RNAi
RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
repressing genes post-transcriptionally, long non-coding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as mi ...
s shutting down blocks of chromatin
Chromatin is a complex of DNA and protein found in eukaryotic cells. The primary function is to package long DNA molecules into more compact, denser structures. This prevents the strands from becoming tangled and also plays important roles in r ...
epigenetically
In biology, epigenetics is the study of stable phenotypic changes (known as ''marks'') that do not involve alterations in the DNA sequence. The Greek prefix '' epi-'' ( "over, outside of, around") in ''epigenetics'' implies features that are "o ...
, and enhancer RNA
Enhancer RNAs (eRNAs) represent a class of relatively long non-coding RNA molecules (50-2000 nucleotides) transcribed from the DNA sequence of enhancer regions. They were first detected in 2010 through the use of genome-wide techniques such as RNA ...
s inducing increased gene expression. Bacteria and archaea have also been shown to use regulatory RNA systems such as bacterial small RNAs and CRISPR. Fire and Mello were awarded the 2006 Nobel Prize in Physiology or Medicine
The Nobel Prize in Physiology or Medicine is awarded yearly by the Nobel Assembly at the Karolinska Institute for outstanding discoveries in physiology or medicine. The Nobel Prize is not a single prize, but five separate prizes that, accord ...
for discovering microRNA
MicroRNA (miRNA) are small, single-stranded, non-coding RNA molecules containing 21 to 23 nucleotides. Found in plants, animals and some viruses, miRNAs are involved in RNA silencing and post-transcriptional regulation of gene expression. mi ...
s (miRNAs), specific short RNA molecules that can base-pair with mRNAs.["The Nobel Prize in Physiology or Medicine 2006". ''Nobelprize.org.'' Nobel Media AB 2014. Web. 6 Aug 2018. http://www.nobelprize.org/nobel_prizes/medicine/laureates/2006]
RNA interference by miRNAs
Post-transcriptional expression levels of many genes can be controlled by RNA interference
RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
, in which miRNA
MicroRNA (miRNA) are small, single-stranded, non-coding RNA molecules containing 21 to 23 nucleotides. Found in plants, animals and some viruses, miRNAs are involved in RNA silencing and post-transcriptional regulation of gene expression. miR ...
s, specific short RNA molecules, pair with mRNA regions and target them for degradation. This antisense
In molecular biology and genetics, the sense of a nucleic acid molecule, particularly of a strand of DNA or RNA, refers to the nature of the roles of the strand and its complement in specifying a sequence of amino acids. Depending on the context ...
-based process involves steps that first process the RNA so that it can base-pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
with a region of its target mRNAs. Once the base pairing occurs, other proteins direct the mRNA to be destroyed by nucleases.
Long non-coding RNAs
Next to be linked to regulation were Xist and other long noncoding RNA
Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as m ...
s associated with X chromosome inactivation
X-inactivation (also called Lyonization, after English geneticist Mary Lyon) is a process by which one of the copies of the X chromosome is inactivated in therian female mammals. The inactive X chromosome is silenced by being packaged into a ...
. Their roles, at first mysterious, were shown by Jeannie T. Lee and others to be the silencing of blocks of chromatin via recruitment of Polycomb
Polycomb-group proteins (PcG proteins) are a family of protein complexes first discovered in fruit flies that can remodel chromatin such that epigenetic silencing of genes takes place. Polycomb-group proteins are well known for silencing Hox genes ...
complex so that messenger RNA could not be transcribed from them. Additional lncRNAs, currently defined as RNAs of more than 200 base pairs that do not appear to have coding potential, have been found associated with regulation of stem cell pluripotency Pluripotency: These are the cells that can generate into any of the three Germ layers which imply Endodermal, Mesodermal, and Ectodermal cells except tissues like the placenta.
According to Latin terms, Pluripotentia means the ability for many thin ...
and cell division
Cell division is the process by which a parent cell divides into two daughter cells. Cell division usually occurs as part of a larger cell cycle in which the cell grows and replicates its chromosome(s) before dividing. In eukaryotes, there ar ...
.
Enhancer RNAs
The third major group of regulatory RNAs is called enhancer RNA
Enhancer RNAs (eRNAs) represent a class of relatively long non-coding RNA molecules (50-2000 nucleotides) transcribed from the DNA sequence of enhancer regions. They were first detected in 2010 through the use of genome-wide techniques such as RNA ...
s. It is not clear at present whether they are a unique category of RNAs of various lengths or constitute a distinct subset of lncRNAs. In any case, they are transcribed from enhancers
In genetics, an enhancer is a short (50–1500 bp) region of DNA that can be bound by proteins ( activators) to increase the likelihood that transcription of a particular gene will occur. These proteins are usually referred to as transcriptio ...
, which are known regulatory sites in the DNA near genes they regulate. They up-regulate the transcription of the gene(s) under control of the enhancer from which they are transcribed.
Regulatory RNA in prokaryotes
At first, regulatory RNA was thought to be a eukaryotic phenomenon, a part of the explanation for why so much more transcription in higher organisms was seen than had been predicted. But as soon as researchers began to look for possible RNA regulators in bacteria, they turned up there as well, termed as small RNA (sRNA). Currently, the ubiquitous nature of systems of RNA regulation of genes has been discussed as support for the RNA World
The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existen ...
theory. Bacterial small RNAs generally act via antisense
In molecular biology and genetics, the sense of a nucleic acid molecule, particularly of a strand of DNA or RNA, refers to the nature of the roles of the strand and its complement in specifying a sequence of amino acids. Depending on the context ...
pairing with mRNA to down-regulate its translation, either by affecting stability or affecting cis-binding ability. Riboswitch
In molecular biology, a riboswitch is a regulatory segment of a messenger RNA molecule that binds a small molecule, resulting in a change in production of the proteins encoded by the mRNA. Thus, an mRNA that contains a riboswitch is directly in ...
es have also been discovered. They are cis-acting regulatory RNA sequences acting allosterically
In biochemistry, allosteric regulation (or allosteric control) is the regulation of an enzyme by binding an effector molecule at a site other than the enzyme's active site.
The site to which the effector binds is termed the ''allosteric site ...
. They change shape when they bind metabolites so that they gain or lose the ability to bind chromatin to regulate expression of genes.
Archaea also have systems of regulatory RNA. The CRISPR system, recently being used to edit DNA ''in situ'', acts via regulatory RNAs in archaea and bacteria to provide protection against virus invaders.
In RNA processing
Many RNAs are involved in modifying other RNAs.
Introns are spliced
Spliced may refer to:
*Spliced, the result of rope splicing
Rope splicing in ropework is the forming of a semi-permanent joint between two ropes or two parts of the same rope by partly untwisting and then interweaving their strands. Splices ca ...
out of pre-mRNA
A primary transcript is the single-stranded ribonucleic acid ( RNA) product synthesized by transcription of DNA, and processed to yield various mature RNA products such as mRNAs, tRNAs, and rRNAs. The primary transcripts designated to be mRNAs ...
by spliceosome
A spliceosome is a large ribonucleoprotein (RNP) complex found primarily within the nucleus of eukaryotic cells. The spliceosome is assembled from small nuclear RNAs ( snRNA) and numerous proteins. Small nuclear RNA (snRNA) molecules bind to specif ...
s, which contain several small nuclear RNA
Small nuclear RNA (snRNA) is a class of small RNA molecules that are found within the splicing speckles and Cajal bodies of the cell nucleus in eukaryotic cells. The length of an average snRNA is approximately 150 nucleotides. They are transcribe ...
s (snRNA),[ or the introns can be ribozymes that are spliced by themselves.
RNA can also be altered by having its nucleotides modified to nucleotides other than A, C, G and U.
In eukaryotes, modifications of RNA nucleotides are in general directed by small nucleolar RNAs (snoRNA; 60–300 nt),] found in the nucleolus
The nucleolus (, plural: nucleoli ) is the largest structure in the nucleus of eukaryotic cells. It is best known as the site of ribosome biogenesis, which is the synthesis of ribosomes. The nucleolus also participates in the formation of ...
and cajal bodies
Cajal bodies (CBs) also coiled bodies, are spherical nuclear bodies of 0.3–1.0 µm in diameter found in the nucleus of proliferative cells like embryonic cells and tumor cells, or metabolically active cells like neurons. CBs are membrane ...
. snoRNAs associate with enzymes and guide them to a spot on an RNA by basepairing to that RNA. These enzymes then perform the nucleotide modification. rRNAs and tRNAs are extensively modified, but snRNAs and mRNAs can also be the target of base modification. RNA can also be methylated.
RNA genomes
Like DNA, RNA can carry genetic information. RNA viruses have genome
In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding g ...
s composed of RNA that encodes a number of proteins. The viral genome is replicated by some of those proteins, while other proteins protect the genome as the virus particle moves to a new host cell. Viroid
Viroids are small single-stranded, circular RNAs that are infectious pathogens. Unlike viruses, they have no protein coating. All known viroids are inhabitants of angiosperms (flowering plants), and most cause diseases, whose respective economi ...
s are another group of pathogens, but they consist only of RNA, do not encode any protein and are replicated by a host plant cell's polymerase.
In reverse transcription
Reverse transcribing viruses replicate their genomes by reverse transcribing DNA copies from their RNA; these DNA copies are then transcribed to new RNA. Retrotransposon
Retrotransposons (also called Class I transposable elements or transposons via RNA intermediates) are a type of genetic component that copy and paste themselves into different genomic locations (transposon) by converting RNA back into DNA through ...
s also spread by copying DNA and RNA from one another, and telomerase
Telomerase, also called terminal transferase, is a ribonucleoprotein that adds a species-dependent telomere repeat sequence to the 3' end of telomeres. A telomere is a region of repetitive sequences at each end of the chromosomes of most euka ...
contains an RNA that is used as template for building the ends of eukaryotic chromosomes.
Double-stranded RNA
Double-stranded RNA (dsRNA) is RNA with two complementary strands, similar to the DNA found in all cells, but with the replacement of thymine by uracil and the adding of one oxygen atom. dsRNA forms the genetic material of some virus
A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea.
Since Dmitri Ivanovsk ...
es (double-stranded RNA viruses
Double-stranded RNA viruses (dsRNA viruses) are a Polyphyly, polyphyletic group of viruses that have Nucleic acid double helix, double-stranded genomes made of ribonucleic acid. The double-stranded genome is used to transcribe a Sense (molecular ...
). Double-stranded RNA, such as viral RNA or siRNA
Small interfering RNA (siRNA), sometimes known as short interfering RNA or silencing RNA, is a class of double-stranded RNA at first non-coding RNA molecules, typically 20-24 (normally 21) base pairs in length, similar to miRNA, and operating ...
, can trigger RNA interference
RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
in eukaryotes, as well as interferon response in vertebrate
Vertebrates () comprise all animal taxa within the subphylum Vertebrata () (chordates with backbones), including all mammals, birds, reptiles, amphibians, and fish. Vertebrates represent the overwhelming majority of the phylum Chordata, with c ...
s. In Eukaryotes, Double-stranded RNA (dsRNA) plays a role in the activation of the innate immune system against viral infections.
Circular RNA
In the late 1970s, it was shown that there is a single stranded covalently closed, i.e. circular form of RNA expressed throughout the animal and plant kingdom (see circRNA
Circular RNA (or circRNA) is a type of single-stranded RNA which, unlike linear RNA, forms a covalently closed continuous loop. In circular RNA, the 3' and 5' ends normally present in an RNA molecule have been joined together. This feature confe ...
). circRNAs are thought to arise via a "back-splice" reaction where the spliceosome
A spliceosome is a large ribonucleoprotein (RNP) complex found primarily within the nucleus of eukaryotic cells. The spliceosome is assembled from small nuclear RNAs ( snRNA) and numerous proteins. Small nuclear RNA (snRNA) molecules bind to specif ...
joins a upstream 3' acceptor to a downstream 5' donor splice site. So far the function of circRNAs is largely unknown, although for few examples a microRNA sponging activity has been demonstrated.
Key discoveries in RNA biology
Research on RNA has led to many important biological discoveries and numerous Nobel Prize
The Nobel Prizes ( ; sv, Nobelpriset ; no, Nobelprisen ) are five separate prizes that, according to Alfred Nobel's will of 1895, are awarded to "those who, during the preceding year, have conferred the greatest benefit to humankind." Alfr ...
s. Nucleic acids were discovered in 1868 by Friedrich Miescher
Johannes Friedrich Miescher (13 August 1844 – 26 August 1895) was a Swiss physician and biologist. He was the first scientist to isolate nucleic acid in 1869. He also identified protamine and made a number of other discoveries.
Miescher had i ...
, who called the material 'nuclein' since it was found in the nucleus
Nucleus ( : nuclei) is a Latin word for the seed inside a fruit. It most often refers to:
*Atomic nucleus, the very dense central region of an atom
* Cell nucleus, a central organelle of a eukaryotic cell, containing most of the cell's DNA
Nucl ...
. It was later discovered that prokaryotic cells, which do not have a nucleus, also contain nucleic acids. The role of RNA in protein synthesis was suspected already in 1939. Severo Ochoa
Severo Ochoa de Albornoz (; 24 September 1905 – 1 November 1993) was a Spanish physician and biochemist, and winner of the 1959 Nobel Prize in Physiology or Medicine together with Arthur Kornberg for their discovery of "the mechanisms in ...
won the 1959 Nobel Prize in Medicine
The Nobel Prize in Physiology or Medicine is awarded yearly by the Nobel Assembly at the Karolinska Institute for outstanding discoveries in physiology or medicine. The Nobel Prize is not a single prize, but five separate prizes that, according ...
(shared with Arthur Kornberg
Arthur Kornberg (March 3, 1918 – October 26, 2007) was an American biochemist who won the Nobel Prize in Physiology or Medicine 1959 for the discovery of "the mechanisms in the biological synthesis of ribonucleic acid and deoxyribonucleic ac ...
) after he discovered an enzyme that can synthesize RNA in the laboratory. However, the enzyme discovered by Ochoa (polynucleotide phosphorylase
Polynucleotide Phosphorylase (PNPase) is a bifunctional enzyme with a phosphorolytic 3' to 5' exoribonuclease activity and a 3'-terminal oligonucleotide polymerase activity. That is, it dismantles the RNA chain starting at the 3' end and working ...
) was later shown to be responsible for RNA degradation, not RNA synthesis. In 1956 Alex Rich and David Davies hybridized two separate strands of RNA to form the first crystal of RNA whose structure could be determined by X-ray crystallography.
The sequence of the 77 nucleotides of a yeast tRNA was found by Robert W. Holley
Robert William Holley (January 28, 1922 – February 11, 1993) was an American biochemist. He shared the Nobel Prize in Physiology or Medicine in 1968 (with Har Gobind Khorana and Marshall Warren Nirenberg) for describing the structure of alani ...
in 1965, winning Holley the 1968 Nobel Prize in Medicine (shared with Har Gobind Khorana
Har Gobind Khorana (9 January 1922 – 9 November 2011) was an Indian American biochemist. While on the faculty of the University of Wisconsin–Madison, he shared the 1968 Nobel Prize for Physiology or Medicine with Marshall W. Nirenberg and ...
and Marshall Nirenberg
Marshall Warren Nirenberg (April 10, 1927 – January 15, 2010) was an American biochemist and geneticist. He shared a Nobel Prize in Physiology or Medicine in 1968 with Har Gobind Khorana and Robert W. Holley for "breaking the genetic code" ...
).
In the early 1970s, retroviruses and reverse transcriptase were discovered, showing for the first time that enzymes could copy RNA into DNA (the opposite of the usual route for transmission of genetic information). For this work, David Baltimore
David Baltimore (born March 7, 1938) is an American biologist, university administrator, and 1975 Nobel laureate in Physiology or Medicine. He is President Emeritus and Distinguished Professor of Biology at the California Institute of Technol ...
, Renato Dulbecco and Howard Temin
Howard Martin Temin (December 10, 1934 – February 9, 1994) was an American geneticist and virologist. He discovered reverse transcriptase in the 1970s at the University of Wisconsin–Madison, for which he shared the 1975 Nobel Prize in Phy ...
were awarded a Nobel Prize in 1975.
In 1976, Walter Fiers
Walter Fiers (31 January 1931 in Ypres, West Flanders – 28 July 2019 in Destelbergen) was a Belgian molecular biologist.
He obtained a degree of Engineer for Chemistry and Agricultural Industries at the University of Ghent in 1954, and started ...
and his team determined the first complete nucleotide sequence of an RNA virus genome, that of bacteriophage MS2.
In 1977, introns and RNA splicing were discovered in both mammalian viruses and in cellular genes, resulting in a 1993 Nobel to Philip Sharp and Richard Roberts.
Catalytic RNA molecules (ribozyme
Ribozymes (ribonucleic acid enzymes) are RNA molecules that have the ability to catalyze specific biochemical reactions, including RNA splicing in gene expression, similar to the action of protein enzymes. The 1982 discovery of ribozymes demons ...
s) were discovered in the early 1980s, leading to a 1989 Nobel award to Thomas Cech
Thomas Robert Cech (born December 8, 1947) is an American chemist who shared the 1989 Nobel Prize in Chemistry with Sidney Altman, for their discovery of the catalytic properties of RNA. Cech discovered that RNA could itself cut strands of RNA, ...
and Sidney Altman
Sidney Altman (May 7, 1939 – April 5, 2022) was a Canadian-American molecular biologist, who was the Sterling Professor of Molecular, Cellular, and Developmental Biology and Chemistry at Yale University. In 1989, he shared the Nobel Prize in ...
. In 1990, it was found in ''Petunia
''Petunia'' is genus of 20 species of flowering plants of South American origin. The popular flower of the same name derived its epithet from the French, which took the word ''petun'', meaning "tobacco," from a Tupi–Guarani language. A tende ...
'' that introduced genes can silence similar genes of the plant's own, now known to be a result of RNA interference
RNA interference (RNAi) is a biological process in which RNA molecules are involved in sequence-specific suppression of gene expression by double-stranded RNA, through translational or transcriptional repression. Historically, RNAi was known by ...
.
At about the same time, 22 nt long RNAs, now called microRNA
MicroRNA (miRNA) are small, single-stranded, non-coding RNA molecules containing 21 to 23 nucleotides. Found in plants, animals and some viruses, miRNAs are involved in RNA silencing and post-transcriptional regulation of gene expression. mi ...
s, were found to have a role in the development
Development or developing may refer to:
Arts
*Development hell, when a project is stuck in development
*Filmmaking, development phase, including finance and budgeting
*Development (music), the process thematic material is reshaped
* Photograph ...
of ''C. elegans
''Caenorhabditis elegans'' () is a free-living transparent nematode about 1 mm in length that lives in temperate soil environments. It is the type species of its genus. The name is a blend of the Greek ''caeno-'' (recent), ''rhabditis'' (r ...
''.
Studies on RNA interference gleaned a Nobel Prize for Andrew Fire
Andrew Zachary Fire (born April 27, 1959) is an American biologist and professor of pathology and of genetics at the Stanford University School of Medicine. He was awarded the 2006 Nobel Prize in Physiology or Medicine, along with Craig C. Mello ...
and Craig Mello
Craig Cameron Mello (born October 18, 1960) is an American biologist and professor of molecular medicine at the University of Massachusetts Medical School in Worcester, Massachusetts. He was awarded the 2006 Nobel Prize for Physiology or Medicine, ...
in 2006, and another Nobel was awarded for studies on the transcription of RNA to Roger Kornberg
Roger David Kornberg (born April 24, 1947) is an American biochemist and professor of structural biology at Stanford University School of Medicine. Kornberg was awarded the Nobel Prize in Chemistry in 2006 for his studies of the process by wh ...
in the same year. The discovery of gene regulatory RNAs has led to attempts to develop drugs made of RNA, such as siRNA
Small interfering RNA (siRNA), sometimes known as short interfering RNA or silencing RNA, is a class of double-stranded RNA at first non-coding RNA molecules, typically 20-24 (normally 21) base pairs in length, similar to miRNA, and operating ...
, to silence genes. Adding to the Nobel prizes awarded for research on RNA in 2009 it was awarded for the elucidation of the atomic structure of the ribosome to Venki Ramakrishnan
Venkatraman Ramakrishnan (born 1952) is an Indian-born British and American structural biologist who shared the 2009 Nobel Prize in Chemistry with Thomas A. Steitz and Ada Yonath, "for studies of the structure and function of the ribosome" ...
, Thomas A. Steitz, and Ada Yonath
Ada E. Yonath ( he, עדה יונת, ; born 22 June 1939) is an Israeli crystallographer best known for her pioneering work on the structure of ribosomes. She is the current director of the Helen and Milton A. Kimmelman Center for Biomolecular ...
.
Relevance for prebiotic chemistry and abiogenesis
In 1968, Carl Woese
Carl Richard Woese (; July 15, 1928 – December 30, 2012) was an American microbiologist and biophysicist. Woese is famous for defining the Archaea (a new domain of life) in 1977 through a pioneering phylogenetic taxonomy of 16S ribosomal RNA, ...
hypothesized that RNA might be catalytic and suggested that the earliest forms of life (self-replicating molecules) could have relied on RNA both to carry genetic information and to catalyze biochemical reactions—an RNA world
The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existen ...
. In May 2022, scientists reported that RNA was found to be formed spontaneously on prebiotic basalt lava glass which is presumed to have been abundantly available on the early Earth
The early Earth is loosely defined as Earth in its first one billion years, or gigayear (Ga, 109y). The “early Earth” encompasses approximately the first gigayear in the evolution of our planet, from its initial formation in the young Solar Sy ...
.
In March 2015, complex DNA and RNA nucleotide
Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecule ...
s, including uracil
Uracil () (symbol U or Ura) is one of the four nucleobases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via two hydrogen bonds. In DNA, the uracil nucleobase is replaced b ...
, cytosine
Cytosine () ( symbol C or Cyt) is one of the four nucleobases found in DNA and RNA, along with adenine, guanine, and thymine (uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attached (an ...
and thymine
Thymine () ( symbol T or Thy) is one of the four nucleobases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidi ...
, were reportedly formed in the laboratory under outer space
Outer space, commonly shortened to space, is the expanse that exists beyond Earth and its atmosphere and between celestial bodies. Outer space is not completely empty—it is a near-perfect vacuum containing a low density of particles, pred ...
conditions, using starter chemicals, such as pyrimidine, an organic compound
In chemistry, organic compounds are generally any chemical compounds that contain carbon-hydrogen or carbon-carbon bonds. Due to carbon's ability to catenate (form chains with other carbon atoms), millions of organic compounds are known. T ...
commonly found in meteorites. Pyrimidine, like polycyclic aromatic hydrocarbons
A polycyclic aromatic hydrocarbon (PAH) is a class of organic compounds that is composed of multiple aromatic rings. The simplest representative is naphthalene, having two aromatic rings and the three-ring compounds anthracene and phenanthrene. ...
(PAHs), is one of the most carbon-rich compounds found in the Universe
The universe is all of space and time and their contents, including planets, stars, galaxies, and all other forms of matter and energy. The Big Bang theory is the prevailing cosmological description of the development of the universe. ...
and may have been formed in red giants or in interstellar dust
Cosmic dust, also called extraterrestrial dust, star dust or space dust, is dust which exists in outer space, or has fallen on Earth. Most cosmic dust particles measure between a few molecules and 0.1 mm (100 micrometers). Larger particles are c ...
and gas clouds. In July 2022, astronomers reported the discovery of massive amounts of prebiotic molecules, including possible RNA precursors, in the Galactic Center
The Galactic Center or Galactic Centre is the rotational center, the barycenter, of the Milky Way galaxy. Its central massive object is a supermassive black hole of about 4 million solar masses, which is called Sagittarius A*, a compact ra ...
of the Milky Way Galaxy
The Milky Way is the galaxy that includes our Solar System, with the name describing the galaxy's appearance from Earth: a hazy band of light seen in the night sky formed from stars that cannot be individually distinguished by the naked eye. ...
.
See also
* Biomolecular structure
Biomolecular structure is the intricate folded, three-dimensional shape that is formed by a molecule of protein, DNA, or RNA, and that is important to its function. The structure of these molecules may be considered at any of several length s ...
* RNA virus
* DNA
* History of RNA Biology
Numerous key discoveries in biology have emerged from studies of RNA (ribonucleic acid), including seminal work in the fields of biochemistry, genetics, microbiology, molecular biology, molecular evolution and structural biology. As of 2010, 30 sc ...
* List of RNA Biologists
For related information, see the articles on History of RNA Biology, History of Molecular Biology, and History of Genetics.
{, class="wikitable"
, -
! scope="col" , Name
! scope="col" , Dates
! scope="col" , Institution
! scope="col" , Awards ...
* RNA Society
The RNA Society is a non-profit, international scientific society that serves to facilitate the dissemination of scientific results and concepts in ribonucleic acid (RNA) research. The society was founded in 1993 and currently consists of over 1 ...
* Macromolecule
* RNA-based evolution
RNA-based evolution is a theory that posits that RNA is not merely an intermediate between Watson and Crick model of the DNA molecule and proteins, but rather a far more dynamic and independent role-player in determining phenotype. By regulating ...
* Aptamer
Aptamers are short sequences of artificial DNA, RNA, XNA, or peptide that bind a specific target molecule, or family of target molecules. They exhibit a range of affinities ( KD in the pM to μM range), with little or no off-target bindin ...
* RNA origami
* Transcriptome
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The t ...
* RNA world hypothesis
The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existenc ...
References
External links
RNA World website
Link collection (structures, sequences, tools, journals)
Nucleic Acid Database
Images of DNA, RNA and complexes.
Anna Marie Pyle's Seminar: RNA Structure, Function, and Recognition
{{DEFAULTSORT:Rna
RNA splicing
Molecular biology
Biotechnology
Nucleic acids