Directionality, in molecular biology and biochemistry, is the end-to-end chemical orientation of a single strand of nucleic acid. In a single strand of DNA or RNA, the chemical convention of naming carbon atoms in the nucleotide pentose-sugar-ring means that there will be a 5′ end (usually pronounced "five-prime end"), which frequently contains a phosphate group attached to the 5′ carbon of the ribose ring, and a 3′ end (usually pronounced "three-prime end"), which typically is unmodified from the ribose −OH substituent. In a DNA double helix, the strands run in opposite directions to permit base pairing between them, which is essential for replication or transcription of the encoded information.
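Because the two strands of a double helix run in opposite directions, the partner of a strand written 5′-to-3′ is obtained by complementing each base and then reversing the result. The following is a minimal sketch of that idea, assuming plain uppercase or lowercase DNA letters with no ambiguity codes; the function name `reverse_complement` is illustrative and not taken from any particular library.

```python
# Watson-Crick partners for the four DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(strand_5_to_3: str) -> str:
    """Return the base-paired partner strand, also written 5'-to-3'."""
    # Walk the input from its 3' end to its 5' end and complement each base,
    # so the output reads 5'-to-3' on the opposite (antiparallel) strand.
    return "".join(COMPLEMENT[base] for base in reversed(strand_5_to_3.upper()))


print(reverse_complement("ATGGCC"))  # GGCCAT
```

Applying the function twice returns the original strand, which reflects the symmetry of the pairing.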
Nucleic acids can only be synthesized in vivo in the 5′-to-3′ direction, as the polymerases that assemble various types of new strands generally rely on the energy produced by breaking nucleoside triphosphate bonds to attach new nucleoside monophosphates to the 3′-hydroxyl (−OH) group, via a phosphodiester bond. The relative positions of structures along strands of nucleic acid, including genes and various protein binding sites, are usually noted as being either ''upstream'' (towards the 5′-end) or ''downstream'' (towards the 3′-end). (See also upstream and downstream.)
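The upstream/downstream convention depends on which strand a feature lies on, which is easy to get wrong when positions are stored as plain numbers. The sketch below assumes a hypothetical coordinate scheme in which positions are counted along one reference strand (labelled "+") in its 5′-to-3′ direction; the function name and parameters are illustrative only.

```python
def relative_position(site: int, feature_start: int, feature_end: int, strand: str) -> str:
    """Classify a site as upstream of, downstream of, or within a feature.

    Coordinates are hypothetical positions counted along the "+" strand in
    its 5'-to-3' direction, with feature_start <= feature_end.
    strand is "+" or "-" and names the strand the feature is read from.
    """
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    if feature_start <= site <= feature_end:
        return "within"
    lower_coordinate = site < feature_start
    if strand == "+":
        # On the "+" strand, lower coordinates lie toward the 5' end.
        return "upstream" if lower_coordinate else "downstream"
    # On the "-" strand the 5'-to-3' direction is reversed.
    return "downstream" if lower_coordinate else "upstream"


print(relative_position(50, 100, 200, "+"))  # upstream
print(relative_position(50, 100, 200, "-"))  # downstream
```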
Directionality is related to, but different from, sense. Transcription of single-stranded RNA from a double-stranded DNA template requires the selection of one strand of the DNA as the template strand, which interacts directly with the nascent RNA through complementary base pairing. The other strand is not copied directly, but its sequence is necessarily similar to that of the RNA.
Transcription initiation sites generally occur on both strands of an organism's DNA, and specify the location, direction, and circumstances under which transcription will occur. If the transcript encodes one or (rarely) more proteins, translation of each protein by the ribosome will proceed in a 5′-to-3′ direction, and will extend the protein from its N terminus toward its C terminus. For example, in a typical gene a start codon (5′-ATG-3′) is a DNA sequence within the sense strand. Transcription begins at an upstream site (relative to the sense strand), and as it proceeds through the region it copies the 3′-TAC-5′ from the template strand to produce 5′-AUG-3′ within a messenger RNA (mRNA). The mRNA is scanned by the ribosome from the 5′ end, where the start codon directs the incorporation of a methionine (bacteria, mitochondria, and plastids use ''N''-formylmethionine instead) at the N terminus of the protein. By convention, single strands of DNA and RNA sequences are written in a 5′-to-3′ direction except as needed to illustrate the pattern of base pairing.
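The start-codon example can be traced in a few lines of code. This is a simplified sketch, not a model of the polymerase: it assumes the template strand is supplied written 3′-to-5′, so that pairing it base by base yields the mRNA in 5′-to-3′ order. The names `transcribe` and `sense_to_mrna` are illustrative.

```python
# RNA base that pairs with each DNA template base.
RNA_PARTNER = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_3_to_5: str) -> str:
    """Build the mRNA (5'-to-3') from a template strand written 3'-to-5'."""
    # The template is read 3'-to-5' while the RNA grows 5'-to-3',
    # so the i-th template base pairs with the i-th RNA base.
    return "".join(RNA_PARTNER[base] for base in template_3_to_5.upper())


def sense_to_mrna(sense_5_to_3: str) -> str:
    """The mRNA matches the sense strand, with U in place of T."""
    return sense_5_to_3.upper().replace("T", "U")


print(transcribe("TAC"))     # AUG
print(sense_to_mrna("ATG"))  # AUG
```

Both routes give the same mRNA, which is why the sense strand, although not copied directly, resembles the transcript.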
5′-end
The 5′-end (pronounced "five prime end") designates the end of the DNA or RNA strand that has the fifth carbon in the sugar-ring of the deoxyribose or ribose at its terminus. A phosphate group attached to the 5′-end permits ligation of two nucleotides, i.e., the covalent binding of a 5′-phosphate to the 3′-hydroxyl group of another nucleotide, to form a phosphodiester bond. Removal of the 5′-phosphate prevents ligation. To prevent unwanted nucleic acid ligation (e.g. self-ligation of a plasmid vector in DNA cloning), molecular biologists commonly remove the 5′-phosphate with a phosphatase.
The 5′-end of nascent messenger RNA is the site at which post-transcriptional capping occurs, a process which is vital to producing mature messenger RNA. Capping increases the stability of the messenger RNA while it undergoes translation, providing resistance to the degradative effects of exonucleases. The cap consists of a methylated nucleotide (methylguanosine) attached to the messenger RNA in a rare 5′-to-5′ triphosphate linkage.
The 5′-''flanking'' region of a gene often denotes a region of DNA which is not transcribed into RNA. The 5′-flanking region contains the gene promoter, and may also contain enhancers or other protein binding sites.
The 5′-''untranslated'' region (5′-UTR) is a region of a gene which is transcribed into mRNA, and is located at the 5′-end of the mRNA. This region of an mRNA may or may not be translated, but is usually involved in the regulation of translation. The 5′-untranslated region is the portion of the DNA starting from the cap site and extending to the base just before the AUG translation initiation codon of the main coding sequence. This region may have sequences, such as the ribosome binding site and Kozak sequence, which determine the translation efficiency of the mRNA, or which may affect the stability of the mRNA.
3′-end
The 3′-end (three prime end) of a strand is so named because it terminates at the hydroxyl group of the third carbon in the sugar-ring, and is known as the ''tail end''. The 3′-hydroxyl is necessary in the synthesis of new nucleic acid molecules as it is ligated (joined) to the 5′-phosphate of a separate nucleotide, allowing the formation of strands of linked nucleotides.
Molecular biologists can use nucleotides that lack a 3′-hydroxyl (dideoxyribonucleotides) to interrupt the replication of DNA. This technique is known as the dideoxy chain-termination method or the Sanger method, and is used to determine the order of nucleotides in DNA.
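The logic of chain termination can be illustrated with a toy model. The sketch below assumes four idealized reactions, one per dideoxynucleotide, in which every possible terminated fragment is produced and all fragments share the same 5′ start; primer length, incomplete termination, and the gel or capillary readout are ignored. The function names are illustrative.

```python
from typing import Dict, List


def chain_termination_fragments(new_strand_5_to_3: str) -> Dict[str, List[int]]:
    """For each dideoxy reaction (A, C, G, T), list terminated fragment lengths."""
    fragments: Dict[str, List[int]] = {base: [] for base in "ACGT"}
    for length, base in enumerate(new_strand_5_to_3.upper(), start=1):
        # A dideoxy base lacks the 3'-hydroxyl, so synthesis stops right
        # after it is incorporated, giving a fragment of this length.
        fragments[base].append(length)
    return fragments


def read_sequence(fragments: Dict[str, List[int]]) -> str:
    """Recover the sequence by ordering all fragments from shortest to longest."""
    base_by_length = {
        length: base for base, lengths in fragments.items() for length in lengths
    }
    return "".join(base_by_length[length] for length in sorted(base_by_length))


fragments = chain_termination_fragments("GATTACA")
print(fragments["A"])            # [2, 5, 7]
print(read_sequence(fragments))  # GATTACA
```

Sorting by length works because each fragment ends at the one position where a chain-terminating base was incorporated, so the shortest fragment reports the base nearest the 5′ end.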
The 3′-end of nascent messenger RNA is the site of post-transcriptional polyadenylation, which attaches a chain of 50 to 250 adenosine residues to produce mature messenger RNA. This chain helps in determining how long the messenger RNA lasts in the cell, influencing how much protein is produced from it.
The 3′-''flanking'' region is a region of DNA that is not copied into the mature mRNA, but which is present adjacent to the 3′-end of the gene. It was originally thought that the 3′-flanking DNA was not transcribed at all, but it was later discovered to be transcribed into RNA and quickly removed during processing of the primary transcript to form the mature mRNA. The 3′-flanking region often contains sequences that affect the formation of the 3′-end of the message. It may also contain enhancers or other sites to which proteins may bind.
The 3′-''untranslated'' region (3′-UTR) is a region of the DNA which ''is'' transcribed into mRNA and becomes the 3′-end of the message, but which does not contain protein coding sequence. Everything between the stop codon and the polyA tail is considered to be 3′-untranslated. The 3′-untranslated region may affect the translation efficiency of the mRNA or the stability of the mRNA. It also has sequences which are required for the addition of the poly(A) tail to the message, including the hexanucleotide AAUAAA.
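The layout of an mRNA into 5′-UTR, coding sequence, and 3′-UTR can be sketched as a simple string split. This is a deliberately naive illustration: it assumes translation starts at the first AUG and ends at the first in-frame stop codon, ignores the cap, the poly(A) tail, and sequence context such as the Kozak sequence, and groups the stop codon with the coding segment. The example sequence and the function name `split_mrna` are made up for illustration.

```python
from typing import Tuple

STOP_CODONS = {"UAA", "UAG", "UGA"}


def split_mrna(mrna_5_to_3: str) -> Tuple[str, str, str]:
    """Split an mRNA into (5'-UTR, coding sequence with stop codon, 3'-UTR)."""
    mrna = mrna_5_to_3.upper()
    start = mrna.find("AUG")  # simplification: first AUG is the start codon
    if start == -1:
        raise ValueError("no AUG start codon found")
    # Step through the reading frame one codon (three bases) at a time.
    for i in range(start, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            end = i + 3
            return mrna[:start], mrna[start:end], mrna[end:]
    raise ValueError("no in-frame stop codon found")


# Made-up example: 5'-UTR + coding sequence + 3'-UTR carrying AAUAAA.
utr5, cds, utr3 = split_mrna("GGAGCCAUGGCUUAACCAAUAAAGG")
print(utr5)                 # GGAGCC
print(cds)                  # AUGGCUUAA
print(utr3)                 # CCAAUAAAGG
print(utr3.find("AAUAAA"))  # 2
```

The final line locates the AAUAAA hexanucleotide within the 3′-UTR; a result of -1 would mean the signal is absent from this toy sequence.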
See also
* Sense (molecular biology)
External links
A Molecular Biology Glossary