
Nucleotide bases (also nucleobases, nitrogenous bases) are
nitrogen
Nitrogen is a chemical element; it has Symbol (chemistry), symbol N and atomic number 7. Nitrogen is a Nonmetal (chemistry), nonmetal and the lightest member of pnictogen, group 15 of the periodic table, often called the Pnictogen, pnictogens. ...
-containing biological compounds that form
nucleosides
Nucleosides are glycosylamines that can be thought of as nucleotides without a phosphate group. A nucleoside consists simply of a nucleobase (also termed a nitrogenous base) and a five-carbon sugar (ribose or 2'-deoxyribose) whereas a nucleotide ...
, which, in turn, are components of
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
s, with all of these
monomer
A monomer ( ; ''mono-'', "one" + '' -mer'', "part") is a molecule that can react together with other monomer molecules to form a larger polymer chain or two- or three-dimensional network in a process called polymerization.
Classification
Chemis ...
s constituting the basic building blocks of
nucleic acids
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nucleic a ...
. The ability of nucleobases to form
base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
s and to stack one upon another leads directly to long-chain helical structures such as
ribonucleic acid
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins ( messenger RNA). RNA and deoxyr ...
(RNA) and
deoxyribonucleic acid (DNA). Five nucleobases—
adenine
Adenine (, ) (nucleoside#List of nucleosides and corresponding nucleobases, symbol A or Ade) is a purine nucleotide base that is found in DNA, RNA, and Adenosine triphosphate, ATP. Usually a white crystalline subtance. The shape of adenine is ...
(A),
cytosine
Cytosine () (symbol C or Cyt) is one of the four nucleotide bases found in DNA and RNA, along with adenine, guanine, and thymine ( uracil in RNA). It is a pyrimidine derivative, with a heterocyclic aromatic ring and two substituents attac ...
(C),
guanine
Guanine () (symbol G or Gua) is one of the four main nucleotide bases found in the nucleic acids DNA and RNA, the others being adenine, cytosine, and thymine ( uracil in RNA). In DNA, guanine is paired with cytosine. The guanine nucleoside ...
(G),
thymine
Thymine () (symbol T or Thy) is one of the four nucleotide bases in the nucleic acid of DNA that are represented by the letters G–C–A–T. The others are adenine, guanine, and cytosine. Thymine is also known as 5-methyluracil, a pyrimidine ...
(T), and
uracil
Uracil () (nucleoside#List of nucleosides and corresponding nucleobases, symbol U or Ura) is one of the four nucleotide bases in the nucleic acid RNA. The others are adenine (A), cytosine (C), and guanine (G). In RNA, uracil binds to adenine via ...
(U)—are called ''primary'' or ''canonical''. They function as the fundamental units of the
genetic code
Genetic code is a set of rules used by living cell (biology), cells to Translation (biology), translate information encoded within genetic material (DNA or RNA sequences of nucleotide triplets or codons) into proteins. Translation is accomplished ...
, with the bases A, G, C, and T being found in DNA while A, G, C, and U are found in RNA. Thymine and uracil are distinguished by merely the presence or absence of a methyl group on the fifth carbon (C5) of these heterocyclic six-membered rings.
In addition, some viruses have
aminoadenine (Z) instead of adenine. It differs in having an extra
amine
In chemistry, amines (, ) are organic compounds that contain carbon-nitrogen bonds. Amines are formed when one or more hydrogen atoms in ammonia are replaced by alkyl or aryl groups. The nitrogen atom in an amine possesses a lone pair of elec ...
group, creating a more stable bond to thymine.
Adenine and guanine have a
fused-ring skeletal structure derived of
purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
, hence they are called purine bases. The purine nitrogenous bases are characterized by their single
amino group (), at the C6 carbon in adenine and C2 in guanine.
Similarly, the simple-ring structure of cytosine, uracil, and thymine is derived of
pyrimidine
Pyrimidine (; ) is an aromatic, heterocyclic, organic compound similar to pyridine (). One of the three diazines (six-membered heterocyclics with two nitrogen atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring. The oth ...
, so those three bases are called the pyrimidine bases.
Each of the base pairs in a typical double-
helix DNA comprises a purine and a pyrimidine: either an A paired with a T or a C paired with a G. These purine-pyrimidine pairs, which are called
''base complements'', connect the two strands of the helix and are often compared to the rungs of a ladder. Only pairing purine with pyrimidine ensures a constant width for the DNA. The A–T pairing is based on two
hydrogen bond
In chemistry, a hydrogen bond (H-bond) is a specific type of molecular interaction that exhibits partial covalent character and cannot be described as a purely electrostatic force. It occurs when a hydrogen (H) atom, Covalent bond, covalently b ...
s, while the C–G pairing is based on three. In both cases, the hydrogen bonds are between the
amine
In chemistry, amines (, ) are organic compounds that contain carbon-nitrogen bonds. Amines are formed when one or more hydrogen atoms in ammonia are replaced by alkyl or aryl groups. The nitrogen atom in an amine possesses a lone pair of elec ...
and
carbonyl groups on the complementary bases.
Nucleobases such as adenine, guanine,
xanthine,
hypoxanthine
Hypoxanthine is a naturally occurring purine derivative. It is occasionally found as a constituent of nucleic acids, where it is present in the anticodon of tRNA in the form of its nucleoside inosine. It has a tautomer known as 6-hydroxypurine. Hyp ...
, purine,
2,6-diaminopurine, and 6,8-diaminopurine may have formed in outer space as well as on earth.
The origin of the term ''
base'' reflects these compounds' chemical properties in
acid–base reaction
In chemistry, an acid–base reaction is a chemical reaction that occurs between an acid and a base. It can be used to determine pH via titration. Several theoretical frameworks provide alternative conceptions of the reaction mechanisms an ...
s, but those properties are not especially important for understanding most of the biological functions of nucleobases.
Structure

At the sides of nucleic acid structure, phosphate molecules successively connect the two sugar-rings of two adjacent nucleotide monomers, thereby creating a long chain
biomolecule
A biomolecule or biological molecule is loosely defined as a molecule produced by a living organism and essential to one or more typically biological processes. Biomolecules include large macromolecules such as proteins, carbohydrates, lipids ...
. These chain-joins of phosphates with sugars (
ribose
Ribose is a simple sugar and carbohydrate with molecular formula C5H10O5 and the linear-form composition H−(C=O)−(CHOH)4−H. The naturally occurring form, , is a component of the ribonucleotides from which RNA is built, and so this comp ...
or
deoxyribose
Deoxyribose, or more precisely 2-deoxyribose, is a monosaccharide with idealized formula H−(C=O)−(CH2)−(CHOH)3−H. Its name indicates that it is a deoxy sugar, meaning that it is derived from the sugar ribose by loss of a hydroxy group. D ...
) create the "backbone" strands for a single- or double helix biomolecule. In the double helix of DNA, the two strands are oriented chemically in opposite directions, which permits base pairing by providing
complementarity between the two bases, and which is essential for
replication of or
transcription of the encoded information found in DNA.
Modified nucleobases
DNA and RNA also contain other (non-primary) bases that have been modified after the nucleic acid chain has been formed. In DNA, the most common modified base is
5-methylcytosine (m
5C). In RNA, there are many modified bases, including those contained in the nucleosides
pseudouridine (Ψ),
dihydrouridine (D),
inosine (I), and
7-methylguanosine (m
7G).
Hypoxanthine
Hypoxanthine is a naturally occurring purine derivative. It is occasionally found as a constituent of nucleic acids, where it is present in the anticodon of tRNA in the form of its nucleoside inosine. It has a tautomer known as 6-hydroxypurine. Hyp ...
and
xanthine are two of the many bases created through
mutagen presence, both of them through
deamination
Deamination is the removal of an amino group from a molecule. Enzymes that catalysis, catalyse this reaction are called deaminases.
In the human body, deamination takes place primarily in the liver; however, it can also occur in the kidney. In s ...
(replacement of the amine-group with a carbonyl-group). Hypoxanthine is produced from adenine, xanthine from guanine,
and uracil results from deamination of cytosine.
Modified purine nucleobases
These are examples of modified adenosine or guanosine.
Modified pyrimidine nucleobases
These are examples of modified cytidine, thymidine or uridine.
Artificial nucleobases
A vast number of nucleobase analogues exist.
The most common applications are used as fluorescent probes, either directly or indirectly, such as
aminoallyl nucleotide, which are used to label cRNA or cDNA in
microarrays. Several groups are working on alternative "extra" base pairs to extend the genetic code, such as
isoguanine and
isocytosine or the fluorescent
2-amino-6-(2-thienyl)purine and
pyrrole-2-carbaldehyde.
In medicine, several
nucleoside analogues are used as anticancer and antiviral agents. The viral polymerase incorporates these compounds with non-canonical bases. These compounds are activated in the cells by being converted into nucleotides; they are administered as
nucleosides as charged nucleotides cannot easily cross cell membranes. At least one set of new base pairs has been announced as of May 2014.
Prebiotic condensation of nucleobases with ribose
In order to understand
how life arose, knowledge is required of chemical pathways that permit formation of the key building blocks of
life
Life, also known as biota, refers to matter that has biological processes, such as Cell signaling, signaling and self-sustaining processes. It is defined descriptively by the capacity for homeostasis, Structure#Biological, organisation, met ...
under plausible
prebiotic conditions. According to the
RNA world hypothesis, free-floating
ribonucleotides were present in the
primordial soup. These were the fundamental molecules that combined in series to form
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
. Molecules as complex as RNA must have arisen from small molecules whose reactivity was governed by physico-chemical processes. RNA is composed of
purine
Purine is a heterocyclic aromatic organic compound that consists of two rings (pyrimidine and imidazole) fused together. It is water-soluble. Purine also gives its name to the wider class of molecules, purines, which include substituted puri ...
and
pyrimidine
Pyrimidine (; ) is an aromatic, heterocyclic, organic compound similar to pyridine (). One of the three diazines (six-membered heterocyclics with two nitrogen atoms in the ring), it has nitrogen atoms at positions 1 and 3 in the ring. The oth ...
nucleotides, both of which are necessary for reliable information transfer, and thus
Darwinian evolution
Evolution is the change in the heritable Phenotypic trait, characteristics of biological populations over successive generations. It occurs when evolutionary processes such as natural selection and genetic drift act on genetic variation, re ...
. Nam et al. demonstrated the direct condensation of nucleobases with ribose to give ribonucleosides in aqueous microdroplets, a key step leading to RNA formation. Similar results were obtained by Becker et al.
See also
*
*
*
*
References
External links
Base pairing in DNA Double Helix (shows specific hydrogen bonds)
{{Authority control
DNA
RNA