
In
biochemistry
Biochemistry, or biological chemistry, is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology, a ...
, a nuclease (also archaically known as nucleodepolymerase or polynucleotidase) is an
enzyme
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
capable of cleaving the
phosphodiester bond
In chemistry, a phosphodiester bond occurs when exactly two of the hydroxyl groups () in phosphoric acid react with hydroxyl groups on other molecules to form two ester bonds. The "bond" involves this linkage . Discussion of phosphodiesters is d ...
s that link
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
s together to form
nucleic acid
Nucleic acids are large biomolecules that are crucial in all cells and viruses. They are composed of nucleotides, which are the monomer components: a pentose, 5-carbon sugar, a phosphate group and a nitrogenous base. The two main classes of nuclei ...
s. Nucleases variously affect
single and
double
Double, The Double or Dubble may refer to:
Mathematics and computing
* Multiplication by 2
* Double precision, a floating-point representation of numbers that is typically 64 bits in length
* A double number of the form x+yj, where j^2=+1
* A ...
stranded breaks in their target molecules. In living organisms, they are essential machinery for many aspects of
DNA repair
DNA repair is a collection of processes by which a cell (biology), cell identifies and corrects damage to the DNA molecules that encode its genome. A weakened capacity for DNA repair is a risk factor for the development of cancer. DNA is cons ...
. Defects in certain nucleases can cause
genetic instability or
immunodeficiency
Immunodeficiency, also known as immunocompromise, is a state in which the immune system's ability to fight infectious diseases and cancer is compromised or entirely absent. Most cases are acquired ("secondary") due to extrinsic factors that aff ...
.
Nucleases are also extensively used in
molecular cloning
Molecular cloning is a set of experimental methods in molecular biology that are used to assemble recombinant DNA molecules and to direct their DNA replication, replication within Host (biology), host organisms. The use of the word ''cloning'' re ...
.
There are two primary classifications based on the locus of activity.
Exonucleases digest nucleic acids from the ends.
Endonuclease
In molecular biology, endonucleases are enzymes that cleave the phosphodiester bond within a polynucleotide chain (namely DNA or RNA). Some, such as deoxyribonuclease I, cut DNA relatively nonspecifically (with regard to sequence), while man ...
s act on regions in the ''middle'' of target molecules. They are further subcategorized as
deoxyribonucleases and
ribonucleases. The former acts on
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
, the latter on
RNA
Ribonucleic acid (RNA) is a polymeric molecule that is essential for most biological functions, either by performing the function itself (non-coding RNA) or by forming a template for the production of proteins (messenger RNA). RNA and deoxyrib ...
.
History
In the late 1960s, scientists
Stuart Linn and
Werner Arber isolated examples of the two types of enzymes responsible for phage growth restriction in
Escherichia coli
''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
(
E. coli) bacteria.
One of these enzymes added a
methyl group
In organic chemistry, a methyl group is an alkyl derived from methane, containing one carbon atom bonded to three hydrogen atoms, having chemical formula (whereas normal methane has the formula ). In formulas, the group is often abbreviated a ...
to the DNA, generating
methylated DNA, while the other cleaved unmethylated DNA at a wide variety of locations along the length of the molecule. The first type of enzyme was called a "
methylase" and the other a "
restriction nuclease". These enzymatic tools were important to scientists who were gathering the tools needed to "
cut and paste" DNA molecules. What was then needed was a tool that would cut DNA at specific sites, rather than at random sites along the length of the molecule, so that scientists could cut
DNA
Deoxyribonucleic acid (; DNA) is a polymer composed of two polynucleotide chains that coil around each other to form a double helix. The polymer carries genetic instructions for the development, functioning, growth and reproduction of al ...
molecules in a predictable and reproducible way.
An important development came when
H.O. Smith, K.W. Wilcox, and
T.J. Kelly, working at
Johns Hopkins University
The Johns Hopkins University (often abbreviated as Johns Hopkins, Hopkins, or JHU) is a private university, private research university in Baltimore, Maryland, United States. Founded in 1876 based on the European research institution model, J ...
in 1968, isolated and characterized the first
restriction nuclease whose functioning depended on a specific DNA
nucleotide
Nucleotides are Organic compound, organic molecules composed of a nitrogenous base, a pentose sugar and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both o ...
sequence. Working with
Haemophilus influenzae
''Haemophilus influenzae'' (formerly called Pfeiffer's bacillus or ''Bacillus influenzae'') is a Gram-negative, Motility, non-motile, Coccobacillus, coccobacillary, facultative anaerobic organism, facultatively anaerobic, Capnophile, capnophili ...
bacteria, this group isolated an enzyme, called
''Hind''II, that always cut DNA molecules at a particular point within a specific sequence of six base pairs. They found that the ''Hind''II enzyme always cuts directly in the center of this sequence (between the 3rd and 4th base pairs).
Numerical Classification System
Most nucleases are classified by the
Enzyme Commission number
The Enzyme Commission number (EC number) is a numerical classification scheme for enzymes, based on the chemical reactions they catalyze. As a system of enzyme nomenclature, every EC number is associated with a recommended name for the correspon ...
of the "Nomenclature Committee of the
International Union of Biochemistry and Molecular Biology
The International Union of Biochemistry and Molecular Biology (IUBMB) is an international non-governmental organisation concerned with biochemistry and molecular biology. Formed in 1955 as the International Union of Biochemistry (IUB), the union ...
" as
hydrolase
In biochemistry, hydrolases constitute a class of enzymes that commonly function as biochemical catalysts that use water to break a chemical bond:
:\ce \quad \xrightarrowtext\quad \ce
This typically results in dividing a larger molecule into s ...
s (EC-number 3). The nucleases belong just like
phosphodiesterase,
lipase
In biochemistry, lipase ( ) refers to a class of enzymes that catalyzes the hydrolysis of fats. Some lipases display broad substrate scope including esters of cholesterol, phospholipids, and of lipid-soluble vitamins and sphingomyelinases; howe ...
and
phosphatase
In biochemistry, a phosphatase is an enzyme that uses water to cleave a phosphoric acid Ester, monoester into a phosphate ion and an Alcohol (chemistry), alcohol. Because a phosphatase enzyme catalysis, catalyzes the hydrolysis of its Substrate ...
to the
esterases (EC-number 3.1), a subgroup of the hydrolases. The esterases to which nucleases belong are classified with the EC-numbers 3.1.11 - EC-number 3.1.31.
Structure
Nuclease
primary structure
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthe ...
is by and large poorly conserved and minimally conserved at active sites, the surfaces of which primarily comprise acidic and basic amino acid residues. Nucleases can be classified into folding families.
Site recognition
A nuclease must associate with a nucleic acid before it can cleave the molecule. That entails a degree of recognition. Nucleases variously employ both nonspecific and specific associations in their modes of recognition and binding. Both modes play important roles in living organisms, especially in DNA repair.
Nonspecific endonucleases involved in DNA repair can scan DNA for
target sequences or
damage. Such a nuclease diffuses along DNA until it encounters a target, upon which the
residues of its
active site interact with the chemical groups of the DNA. In the case of endonucleases such as
EcoRV,
BamHI, and PvuII, this nonspecific binding involves electrostatic interactions between minimal surface area of the protein and the DNA. This weak association leaves the overall shape of the DNA undeformed, remaining in
B-form.
A forms far stronger associations by contrast. It draws DNA into the deep groove of its
DNA-binding domain
A DNA-binding domain (DBD) is an independently folded protein domain that contains at least one structural motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence (a recognition sequence) or have a gener ...
. This results in significant deformation of the DNA
tertiary structure
Protein tertiary structure is the three-dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains and the ...
and is accomplished with a surfaces rich in
basic
Basic or BASIC may refer to:
Science and technology
* BASIC, a computer programming language
* Basic (chemistry), having the properties of a base
* Basic access authentication, in HTTP
Entertainment
* Basic (film), ''Basic'' (film), a 2003 film
...
(positively charged) residues. It engages in extensive electrostatic interaction with the DNA.
Some nucleases involved in DNA repair exhibit partial sequence-specificity. However most are nonspecific, instead recognizing structural abnormalities produced in the DNA
backbone by
base pair
A base pair (bp) is a fundamental unit of double-stranded nucleic acids consisting of two nucleobases bound to each other by hydrogen bonds. They form the building blocks of the DNA double helix and contribute to the folded structure of both DNA ...
mismatches.
Structure specific nuclease
For details see
flap endonuclease.
Sequence specific nuclease
There are more than 900 restriction enzymes, some sequence specific and some not, have been isolated from over 230 strains of bacteria since the initial discovery of ''Hind''II. These restriction enzymes generally have names that reflect their origin—The first letter of the name comes from the genus and the second two letters come from the species of the prokaryotic cell from which they were isolated. For example,
''Eco''RI comes from
Escherichia coli
''Escherichia coli'' ( )Wells, J. C. (2000) Longman Pronunciation Dictionary. Harlow ngland Pearson Education Ltd. is a gram-negative, facultative anaerobic, rod-shaped, coliform bacterium of the genus '' Escherichia'' that is commonly fo ...
RY13 bacteria, while HindII comes from
Haemophilus influenzae
''Haemophilus influenzae'' (formerly called Pfeiffer's bacillus or ''Bacillus influenzae'') is a Gram-negative, Motility, non-motile, Coccobacillus, coccobacillary, facultative anaerobic organism, facultatively anaerobic, Capnophile, capnophili ...
strain Rd. Numbers following the nuclease names indicate the order in which the enzymes were isolated from single strains of bacteria:
''Eco''RI,
''Eco''RII.
Endonucleases
A restriction endonuclease functions by "scanning" the length of a DNA molecule. Once it encounters its particular specific recognition sequence, it will bind to the DNA molecule and makes one cut in each of the two sugar-phosphate backbones. The positions of these two cuts, both in relation to each other, and to the recognition sequence itself, are determined by the identity of the restriction endonuclease. Different endonucleases yield different sets of cuts, but one endonuclease will always cut a particular base sequence the same way, no matter what DNA molecule it is acting on. Once the cuts have been made, the DNA molecule will break into fragments.
Staggered cutting
Not all restriction endonucleases cut symmetrically and leave blunt ends like ''Hind''II described above. Many endonucleases cleave the DNA backbones in positions that are not directly opposite each other, creating overhangs. For example, the nuclease ''Eco''RI has the recognition sequence
5'—GAATTC—3'
.
When the enzyme encounters this sequence, it cleaves each backbone between the G and the closest A base residues. Once the cuts have been made, the resulting fragments are held together only by the relatively weak hydrogen bonds that hold the complementary bases to each other. The weakness of these bonds allows the DNA fragments to separate from each other. Each resulting fragment has a protruding 5' end composed of unpaired bases. Other enzymes create cuts in the DNA backbone which result in protruding 3' ends. Protruding ends—both 3' and 5'—are sometimes called "
sticky ends" because they tend to bond with complementary sequences of bases. In other words, if an unpaired length of bases
5'—AATT—3'
encounters another unpaired length with the sequence
3'—TTAA—5'
they will bond to each other—they are "sticky" for each other.
Ligase enzyme is then used to join the phosphate backbones of the two molecules. The cellular origin, or even the species origin, of the sticky ends does not affect their stickiness. Any pair of complementary sequences will tend to bond, even if one of the sequences comes from a length of human DNA, and the other comes from a length of bacterial DNA. In fact, it is this quality of stickiness that allows production of recombinant DNA molecules, molecules which are composed of DNA from different sources, and which has given birth to the
genetic engineering
Genetic engineering, also called genetic modification or genetic manipulation, is the modification and manipulation of an organism's genes using technology. It is a set of Genetic engineering techniques, technologies used to change the genet ...
technology.
Role in nature
DNA repair
With all cells depending on DNA as the medium of genetic information, genetic quality control is an essential function of all organisms.
DNA replication
In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all life, living organisms, acting as the most essential part of heredity, biolog ...
is an error prone process, and DNA molecules themselves are vulnerable to modification by many metabolic and environmental stressors. Ubiquitous examples include
reactive oxygen species
In chemistry and biology, reactive oxygen species (ROS) are highly Reactivity (chemistry), reactive chemicals formed from diatomic oxygen (), water, and hydrogen peroxide. Some prominent ROS are hydroperoxide (H2O2), superoxide (O2−), hydroxyl ...
, near
ultraviolet
Ultraviolet radiation, also known as simply UV, is electromagnetic radiation of wavelengths of 10–400 nanometers, shorter than that of visible light, but longer than X-rays. UV radiation is present in sunlight and constitutes about 10% of ...
, and
ionizing radiation
Ionizing (ionising) radiation, including Radioactive decay, nuclear radiation, consists of subatomic particles or electromagnetic waves that have enough energy per individual photon or particle to ionization, ionize atoms or molecules by detaching ...
. Many nucleases participate in DNA repair by recognizing damage sites and cleaving them from the surrounding DNA. These enzymes function independently or in
complexes. Most nucleases involved in DNA repair are not sequence-specific. They recognize damage sites through deformation of double stranded DNA (dsDNA) secondary structure.
Replication proofreading
During
DNA replication
In molecular biology, DNA replication is the biological process of producing two identical replicas of DNA from one original DNA molecule. DNA replication occurs in all life, living organisms, acting as the most essential part of heredity, biolog ...
,
DNA polymerase
A DNA polymerase is a member of a family of enzymes that catalyze the synthesis of DNA molecules from nucleoside triphosphates, the molecular precursors of DNA. These enzymes are essential for DNA replication and usually work in groups to create t ...
s
elongate new strands of DNA against complementary template strands. Most DNA polymerases comprise two different
enzymatic domains: a
polymerase
In biochemistry, a polymerase is an enzyme (Enzyme Commission number, EC 2.7.7.6/7/19/48/49) that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by ...
and a
proofreading
Proofreading is a phase in the process of publishing where galley proofs are compared against the original manuscripts or graphic artworks, to identify transcription errors in the typesetting process. In the past, proofreaders would place corr ...
exonuclease. The polymerase elongates the new strand in the 5' → 3' direction. The exonuclease removes erroneous nucleotides from the same strand in the 3’ → 5’ direction. This exonuclease activity is essential for a DNA polymerase's ability to proofread.
Deletions inactivating or removing these nucleases increase rates of mutation and mortality in affected
microbe
A microorganism, or microbe, is an organism of microscopic size, which may exist in its single-celled form or as a colony of cells. The possible existence of unseen microbial life was suspected from antiquity, with an early attestation in ...
s and cancer in mice.
Halted replication fork
Many forms of
DNA damage stop progression of the
replication fork, causing the DNA polymerases and associated machinery to abandon the fork. It must then be processed by fork-specific proteins. The most notable is
MUS81. Deletions of which causes UV or methylation damage sensitivity in
yeast
Yeasts are eukaryotic, single-celled microorganisms classified as members of the fungus kingdom (biology), kingdom. The first yeast originated hundreds of millions of years ago, and at least 1,500 species are currently recognized. They are est ...
, in addition to meiotic defects.
Okazaki fragment processing
A ubiquitous task in cells is the removal of
Okazaki fragment RNA primers from replication. Most such primers are excised from newly synthesized
lagging strand DNA by endonucleases of the family
RNase H. In
eukaryote
The eukaryotes ( ) constitute the Domain (biology), domain of Eukaryota or Eukarya, organisms whose Cell (biology), cells have a membrane-bound cell nucleus, nucleus. All animals, plants, Fungus, fungi, seaweeds, and many unicellular organisms ...
s and in
archaea
Archaea ( ) is a Domain (biology), domain of organisms. Traditionally, Archaea only included its Prokaryote, prokaryotic members, but this has since been found to be paraphyletic, as eukaryotes are known to have evolved from archaea. Even thou ...
, the
flap endonuclease FEN1 also participates in the processing of Okazaki fragments.
Mismatch repair
DNA mismatch repair in any given organism is effected by a suite of mismatch-specific endonucleases. In prokaryotes, this role is primarily filled by MutSLH and
very short patch repair (VSP repair) associated proteins.
The MutSLH system (comprising
MutS, MutL, and MutH) corrects
point mutation
A point mutation is a genetic mutation where a single nucleotide base is changed, inserted or deleted from a DNA or RNA sequence of an organism's genome. Point mutations have a variety of effects on the downstream protein product—consequences ...
s and small
turns. MutS recognizes and binds to mismatches, where it recruits MutL and MutH. MutL mediates the interaction between MutS and MutH, and enhances the endonucleasic activity of the latter. MutH recognizes hemimethylated
5'—GATC—3'
sites and cleaves next to the
G
of the non-methylated strand (the more recently synthesized strand).
VSP repair is initiated by the endonuclease Vsr. It corrects a specific
T/G
mismatch caused by the spontaneous
deamination
Deamination is the removal of an amino group from a molecule. Enzymes that catalysis, catalyse this reaction are called deaminases.
In the human body, deamination takes place primarily in the liver; however, it can also occur in the kidney. In s ...
of
methylated cytosines to thymines. Vsr recognizes the sequence
5'—CT WGG—3'
,
where it nicks the DNA strand on the 5' side of the mismatched thymine (underlined in the previous sequence). One of the exonucleases RecJ,
ExoVII, or
ExoI then degrades the site before DNA polymerase resynthesizes the gap in the strand.
Base excision repair
AP site formation is a common occurrence in dsDNA. It is the result of spontaneous hydrolysis and the activity of
DNA glycosylases as an intermediary step in
base excision repair. These AP sites are removed by
AP endonucleases, which effect single strand breaks around the site.
Nucleotide excision repair
Nucleotide excision repair, not to be confused with base excision repair, involves the removal and replacement of damaged nucleotides. Instances of
crosslinking,
adducts, and
lesions (generated by ultraviolet light or
reactive oxygen species
In chemistry and biology, reactive oxygen species (ROS) are highly Reactivity (chemistry), reactive chemicals formed from diatomic oxygen (), water, and hydrogen peroxide. Some prominent ROS are hydroperoxide (H2O2), superoxide (O2−), hydroxyl ...
) can trigger this repair pathway. Short stretches of single stranded DNA containing such damaged nucleotide are removed from duplex DNA by separate endonucleases effecting nicks upstream and downstream of the damage. Deletions or mutations which affect these nucleases instigate increased sensitivity to ultraviolet damage and carcinogenesis. Such abnormalities can even impinge neural development.
In bacteria, both cuts executed by the
UvrB-UvrC complex. In budding yeast, Rad2 and the Rad1-Rad10 complex make the 5' and 3' cuts, respectively. In mammals, the homologs
XPG and
XPF-
ERCC1 affect the same respective nicks.
Double-strand break repair
Double-strand breaks, both intentional and unintentional, regularly occur in cells. Unintentional breaks are commonly generated by
ionizing radiation
Ionizing (ionising) radiation, including Radioactive decay, nuclear radiation, consists of subatomic particles or electromagnetic waves that have enough energy per individual photon or particle to ionization, ionize atoms or molecules by detaching ...
, various exogenous and endogenous chemical agents, and halted replication forks. Intentional breaks are generated as intermediaries in
meiosis
Meiosis () is a special type of cell division of germ cells in sexually-reproducing organisms that produces the gametes, the sperm or egg cells. It involves two rounds of division that ultimately result in four cells, each with only one c ...
and
V(D)J recombination
V(D)J recombination (variable–diversity–joining rearrangement) is the mechanism of somatic recombination that occurs only in developing lymphocytes during the early stages of T and B cell maturation. It results in the highly diverse repertoire ...
, which are primarily repaired through
homologous recombination
Homologous recombination is a type of genetic recombination in which genetic information is exchanged between two similar or identical molecules of double-stranded or single-stranded nucleic acids (usually DNA as in Cell (biology), cellular organi ...
and
non-homologous end joining
Non-homologous end joining (NHEJ) is a pathway that repairs double-strand breaks in DNA. It is called "non-homologous" because the break ends are directly ligated without the need for a homologous template, in contrast to homology directed repair ...
. Both cases require the ends in double strand breaks be processed by nucleases before repair can take place. One such nuclease is
Mre11 complexed with
Rad50
DNA repair protein RAD50, also known as RAD50, is a protein that in humans is encoded by the ''RAD50'' gene.
Function
The protein encoded by this gene is highly similar to ''Saccharomyces cerevisiae'' Rad50, a protein involved in DNA double- ...
. Mutations of Mre11 can precipitate
ataxia-telangiectasia-like disorder.
V(D)J recombination involves opening
stem-loop
Stem-loops are nucleic acid Biomolecular structure, secondary structural elements which form via intramolecular base pairing in single-stranded DNA or RNA. They are also referred to as hairpins or hairpin loops. A stem-loop occurs when two regi ...
s structures associated with double-strand breaks and subsequently joining both ends. The
Artemis-DNAPKcs complex participates in this reaction. Although Artemis exhibits 5' → 3' ssDNA exonuclease activity when alone, its complexing with
DNA-PKcs allows for endonucleasic processing of the stem-loops. Defects of either protein confers severe immunodeficiency.
Homologous recombination, on the other hand, involves two
homologous DNA duplexes connected by
D-loops or
Holliday junctions. In bacteria, endonucleases like
RuvC ''resolve'' Holliday junctions into two separate dsDNAs by cleaving the junctions at two symmetrical sites near the junction centre. In eukaryotes,
FEN1,
XPF-
ERCC1, and
MUS81 cleave the D-loops, and
Cce1/
Ydc2 processes Holliday junctions in mitochondria.
Meganucleases
The frequency at which a particular nuclease will cut a given DNA molecule depends on the complexity of the DNA and the length of the nuclease's recognition sequence; due to the statistical likelihood of finding the bases in a particular order by chance, a longer recognition sequence will result in less frequent digestion. For example, a given four-base sequence (corresponding to the recognition site for a hypothetical nuclease) would be predicted to occur every 256 base pairs on average (where 4^4=256), but any given six-base sequence would be expected to occur once every 4,096 base pairs on average (4^6=4096).
One unique family of nucleases is the
meganucleases, which are characterized by having larger, and therefore less common, recognition sequences consisting of 12 to 40 base pairs. These nucleases are particularly useful for genetic engineering and
Genome engineering applications in complex organisms such as plants and mammals, where typically larger genomes (numbering in the billions of base pairs) would result in frequent and deleterious site-specific digestion using traditional nucleases.
See also
*
HindIII
*
Ligase
*
Micrococcal nuclease
*
Nuclease protection assay
*
P1 nuclease
*
PIN domain
*
Polymerase
In biochemistry, a polymerase is an enzyme (Enzyme Commission number, EC 2.7.7.6/7/19/48/49) that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by ...
*
Serratia marcescens nuclease (benzonase)
*
S1 nuclease
Nuclease S1 () is an endonuclease enzyme that splits single-stranded DNA (ssDNA) and RNA into oligo- or mononucleotides. This enzyme catalysis, catalyses the following chemical reaction
: Endonucleolytic cleavage to 5'-phosphomononucleotide and ...
References
External links
Examples of Restriction Enzymes Chart
{{Authority control
EC 3.1