
A catalytic triad is a set of three coordinated
amino acid residues
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid m ...
that can be found in the
active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
of some
enzymes
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as pro ...
.
Catalytic triads are most commonly found in
hydrolase
In biochemistry, hydrolases constitute a class of enzymes that commonly function as biochemical catalysts that use water to break a chemical bond:
:\ce \quad \xrightarrowtext\quad \ce
This typically results in dividing a larger molecule into s ...
and
transferase
In biochemistry, a transferase is any one of a class of enzymes that catalyse the transfer of specific functional groups (e.g. a methyl or glycosyl group) from one molecule (called the donor) to another (called the acceptor). They are involved ...
enzymes (e.g.
protease
A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalysis, catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products ...
s,
amidase
In enzymology, an amidase (, ''acylamidase'', ''acylase (misleading)'', ''amidohydrolase (ambiguous)'', ''deaminase (ambiguous)'', ''fatty acylamidase'', ''N-acetylaminohydrolase (ambiguous)'') is an enzyme that catalysis, catalyzes the hydrolysis ...
s,
esterase
In biochemistry, an esterase is a class of enzyme that splits esters into an acid and an alcohol in a chemical reaction with water called hydrolysis (and as such, it is a type of hydrolase).
A wide range of different esterases exist that differ ...
s,
acylases,
lipase
In biochemistry, lipase ( ) refers to a class of enzymes that catalyzes the hydrolysis of fats. Some lipases display broad substrate scope including esters of cholesterol, phospholipids, and of lipid-soluble vitamins and sphingomyelinases; howe ...
s and
β-lactamase
Beta-lactamases (β-lactamases) are enzymes () produced by bacteria that provide Multiple drug resistance, multi-resistance to beta-lactam antibiotics such as penicillins, cephalosporins, cephamycins, monobactams and carbapenems (ertapenem ...
s). An
acid
An acid is a molecule or ion capable of either donating a proton (i.e. Hydron, hydrogen cation, H+), known as a Brønsted–Lowry acid–base theory, Brønsted–Lowry acid, or forming a covalent bond with an electron pair, known as a Lewis ...
-
base-
nucleophile
In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
triad is a common motif for generating a nucleophilic residue for
covalent catalysis. The residues form a charge-relay network to polarise and activate the nucleophile, which attacks the
substrate
Substrate may refer to:
Physical layers
*Substrate (biology), the natural environment in which an organism lives, or the surface or medium on which an organism grows or is attached
** Substrate (aquatic environment), the earthy material that exi ...
, forming a
covalent
A covalent bond is a chemical bond that involves the sharing of electrons to form electron pairs between atoms. These electron pairs are known as shared pairs or bonding pairs. The stable balance of attractive and repulsive forces between atom ...
intermediate which is then
hydrolysed to release the
product and regenerate free enzyme. The nucleophile is most commonly a
serine
Serine
(symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
or
cysteine
Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
, but occasionally
threonine
Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
or even
selenocysteine
Selenocysteine (symbol Sec or U, in older publications also as Se-Cys) is the 21st proteinogenic amino acid. Selenoproteins contain selenocysteine residues. Selenocysteine is an analogue of the more common cysteine with selenium in place of the ...
. The
3D structure of the enzyme brings together the triad residues in a precise orientation, even though they may be far apart in the sequence (
primary structure
Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthe ...
).
As well as
divergent evolution
Divergent evolution or divergent selection is the accumulation of differences between closely related populations within a species, sometimes leading to speciation. Divergent evolution is typically exhibited when two populations become separate ...
of function (and even the triad's nucleophile), catalytic triads show some of the best examples of
convergent evolution
Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last comm ...
. Chemical constraints on catalysis have led to the same catalytic solution independently evolving in at least 23 separate
superfamilies.
Their
mechanism of action
In pharmacology, the term mechanism of action (MOA) refers to the specific biochemical Drug interaction, interaction through which a Medication, drug substance produces its pharmacological effect. A mechanism of action usually includes mention o ...
is consequently one of the best studied in
biochemistry
Biochemistry, or biological chemistry, is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology, a ...
.
History
In the 1950s, a serine residue was identified as the catalytic nucleophile of
trypsin
Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the dig ...
and
chymotrypsin
Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenu ...
(first purified in the 1930s) by
diisopropyl fluorophosphate
Diisopropyl fluorophosphate (DFP) or Isoflurophate is an oily, colorless liquid with the chemical formula C6H14FO3P. It is used in medicine and as an organophosphorus insecticide. It is stable, but undergoes hydrolysis when subjected to moisture ...
modification. The structure of chymotrypsin was solved by
X-ray crystallography
X-ray crystallography is the experimental science of determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to Diffraction, diffract in specific directions. By measuring th ...
in the 1960s, showing the orientation of the catalytic triad in the
active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
. Other proteases were sequenced and aligned to reveal a family of related proteases, now called the S1 family. Simultaneously, the structures of the evolutionarily unrelated
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wi ...
and
subtilisin
Subtilisin is a protease (a protein-digesting enzyme) initially obtained from ''Bacillus subtilis''.
Subtilisins belong to subtilases, a group of serine proteases that – like all serine proteases – initiate the nucleophilic attack on the ...
proteases were found to contain analogous triads. The 'charge-relay' mechanism for the activation of the nucleophile by the other triad members was proposed in the late 1960s. As more protease structures were solved by
X-ray crystallography
X-ray crystallography is the experimental science of determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to Diffraction, diffract in specific directions. By measuring th ...
in the 1970s and 80s,
homologous (such as
TEV protease) and analogous (such as papain) triads were found. The
MEROPS
MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibito ...
classification system in the 1990s and 2000s began classing proteases into
structurally related enzyme
superfamilies and so acts as a database of the convergent evolution of triads in over 20 superfamilies.
Understanding how chemical constraints on evolution led to the convergence of so many enzyme families on the same triad
geometries
Geometry is a branch of mathematics concerned with questions of shape, size, relative position of figures, and the properties of space. Geometry is one of the oldest mathematical sciences.
Types, methodologies, and terminologies of geometry. ...
has developed in the 2010s.
Researchers have since conducted increasingly detailed investigations of the triad's exact catalytic mechanism. Of particular contention in the 1990s and 2000s was whether
low-barrier hydrogen bonding contributed to catalysis, or whether ordinary
hydrogen bonding
In chemistry, a hydrogen bond (H-bond) is a specific type of molecular interaction that exhibits partial covalent character and cannot be described as a purely electrostatic force. It occurs when a hydrogen (H) atom, Covalent bond, covalently b ...
is sufficient to explain the mechanism.
The massive body of work on the charge-relay, covalent catalysis used by catalytic triads has led to the mechanism being the best characterised in all of biochemistry.
Function
Enzymes that contain a catalytic triad use it for one of two reaction types: either to
split
Split(s) or The Split may refer to:
Places
* Split, Croatia, the largest coastal city in Croatia
* Split Island, Canada, an island in the Hudson Bay
* Split Island, Falkland Islands
* Split Island, Fiji, better known as Hạfliua
Arts, enter ...
a substrate (
hydrolase
In biochemistry, hydrolases constitute a class of enzymes that commonly function as biochemical catalysts that use water to break a chemical bond:
:\ce \quad \xrightarrowtext\quad \ce
This typically results in dividing a larger molecule into s ...
s) or to transfer one portion of a substrate over to a second substrate (
transferase
In biochemistry, a transferase is any one of a class of enzymes that catalyse the transfer of specific functional groups (e.g. a methyl or glycosyl group) from one molecule (called the donor) to another (called the acceptor). They are involved ...
s). Triads are an inter-dependent set of residues in the
active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
of an enzyme and act in concert with other residues (e.g.
binding site
In biochemistry and molecular biology, a binding site is a region on a macromolecule such as a protein that binds to another molecule with specificity. The binding partner of the macromolecule is often referred to as a ligand. Ligands may includ ...
and
oxyanion hole
An oxyanion hole is a pocket in the active site of an enzyme that stabilizes transition state negative charge on a deprotonation, deprotonated oxygen or alkoxide. The pocket typically consists of backbone amides or positively charged residues. Sta ...
) to achieve
nucleophilic catalysis. These triad residues act together to make the
nucleophile
In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
member highly
reactive, generating a covalent intermediate with the substrate that is then resolved to complete catalysis.
Mechanism
Catalytic triads perform
covalent catalysis using a residue as a nucleophile. The reactivity of the nucleophilic residue is increased by the
functional group
In organic chemistry, a functional group is any substituent or moiety (chemistry), moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions r ...
s of the other triad members. The nucleophile is polarised and oriented by the base, which is itself bound and stabilised by the acid.
Catalysis is performed in two stages. First, the activated nucleophile attacks the
carbonyl
In organic chemistry, a carbonyl group is a functional group with the formula , composed of a carbon atom double bond, double-bonded to an oxygen atom, and it is divalent at the C atom. It is common to several classes of organic compounds (such a ...
carbon and forces the carbonyl oxygen to accept an electron pair, leading to a
tetrahedral intermediate
A tetrahedral intermediate is a reaction intermediate in which the bond arrangement around an initially double-bonded carbon atom has been transformed from trigonal to tetrahedral. Tetrahedral intermediates result from nucleophilic addition to a c ...
. The resulting build-up of negative charge is typically stabilized by an oxyanion hole within the active site. The intermediate then collapses back to a carbonyl, ejecting the first half of the substrate, but leaving the second half still covalently bound to the enzyme as an
acyl-enzyme intermediate. Although general-acid catalysis for breakdown of the First and Second tetrahedral intermediate may occur by the path shown in the diagram, evidence supporting such a mechanism with chymotrypsin has been controverted.
The second stage of catalysis is the resolution of the acyl-enzyme intermediate by the attack of a second substrate. If the substrate is water then hydrolysis results; if it is an organic molecule then that molecule is transferred onto the first substrate. Attack by the second substrate forms a new tetrahedral intermediate, which resolves by ejecting the enzyme's nucleophile, releasing the second product and regenerating free enzyme.
Identity of triad members
Nucleophile
The side-chain of the nucleophilic residue performs covalent catalysis on the
substrate
Substrate may refer to:
Physical layers
*Substrate (biology), the natural environment in which an organism lives, or the surface or medium on which an organism grows or is attached
** Substrate (aquatic environment), the earthy material that exi ...
. The
lone pair
In chemistry, a lone pair refers to a pair of valence electrons that are not shared with another atom in a covalent bondIUPAC ''Gold Book'' definition''lone (electron) pair''/ref> and is sometimes called an unshared pair or non-bonding pair. Lone ...
of electrons present on the oxygen or sulfur attacks the electropositive
carbonyl
In organic chemistry, a carbonyl group is a functional group with the formula , composed of a carbon atom double bond, double-bonded to an oxygen atom, and it is divalent at the C atom. It is common to several classes of organic compounds (such a ...
carbon.
The 20
naturally occurring biological amino acids do not contain any sufficiently nucleophilic functional groups for many difficult
catalytic reactions. Embedding the nucleophile in a triad increases its reactivity for efficient catalysis. The most commonly used nucleophiles are the
hydroxyl
In chemistry, a hydroxy or hydroxyl group is a functional group with the chemical formula and composed of one oxygen atom covalently bonded to one hydrogen atom. In organic chemistry, alcohols and carboxylic acids contain one or more hydroxy ...
(OH) of serine and the
thiol
In organic chemistry, a thiol (; ), or thiol derivative, is any organosulfur compound of the form , where R represents an alkyl or other organic substituent. The functional group itself is referred to as either a thiol group or a sulfhydryl grou ...
/thiolate ion (SH/S
−) of cysteine.
Alternatively,
threonine protease
Threonine proteases are a family of proteolytic enzymes harbouring a threonine (Thr) residue within the active site. The prototype members of this class of enzymes are the catalytic subunits of the proteasome, however, the acyltransferases conve ...
s use the
secondary hydroxyl of threonine, however due to
steric hindrance
Steric effects arise from the spatial arrangement of atoms. When atoms come close together there is generally a rise in the energy of the molecule. Steric effects are nonbonding interactions that influence the shape ( conformation) and reactivi ...
of the side chain's extra
methyl group
In organic chemistry, a methyl group is an alkyl derived from methane, containing one carbon atom bonded to three hydrogen atoms, having chemical formula (whereas normal methane has the formula ). In formulas, the group is often abbreviated a ...
, such proteases use their
''N''-terminal amide as the base rather than a separate amino acid.
Use of oxygen or sulfur as the nucleophilic atom causes minor differences in catalysis. Compared to
oxygen
Oxygen is a chemical element; it has chemical symbol, symbol O and atomic number 8. It is a member of the chalcogen group (periodic table), group in the periodic table, a highly reactivity (chemistry), reactive nonmetal (chemistry), non ...
,
sulfur
Sulfur ( American spelling and the preferred IUPAC name) or sulphur ( Commonwealth spelling) is a chemical element; it has symbol S and atomic number 16. It is abundant, multivalent and nonmetallic. Under normal conditions, sulfur atoms ...
's extra
d orbital makes it larger (by 0.4 Ã…)
and softer, allows it to form longer bonds (d
C-X and d
X-H by 1.3-fold), and gives it a lower p''K''
a (by 5 units).
Serine is therefore more dependent than cysteine on optimal orientation of the acid-base triad members to reduce its p''K''
a in order to achieve concerted
deprotonation
Deprotonation (or dehydronation) is the removal (transfer) of a proton (or hydron, or hydrogen cation), (H+) from a Brønsted–Lowry acid in an acid–base reaction.Henry Jakubowski, Biochemistry Online Chapter 2A3, https://employees.csbsju.ed ...
with catalysis.
The low p''K''
a of cysteine works to its disadvantage in the resolution of the first tetrahedral intermediate as unproductive reversal of the original nucleophilic attack is the more favourable breakdown product.
The triad base is therefore preferentially oriented to
protonate the leaving group amide to ensure that it is ejected to leave the enzyme sulfur covalently bound to the substrate N-terminus. Finally, resolution of the acyl-enzyme (to release the substrate C-terminus) requires serine to be re-protonated whereas cysteine can leave as S
−.
Sterically, the sulfur of cysteine also forms longer bonds and has a bulkier
van der Waals radius
The van der Waals radius, ''r'', of an atom is the radius of an imaginary hard sphere representing the distance of closest approach for another atom.
It is named after Johannes Diderik van der Waals, winner of the 1910 Nobel Prize in Physics ...
and if
mutated
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA replication, DNA or viral rep ...
to serine can be trapped in unproductive orientations in the active site.
Very rarely, the
selenium
Selenium is a chemical element; it has symbol (chemistry), symbol Se and atomic number 34. It has various physical appearances, including a brick-red powder, a vitreous black solid, and a grey metallic-looking form. It seldom occurs in this elem ...
atom of the uncommon amino acid
selenocysteine
Selenocysteine (symbol Sec or U, in older publications also as Se-Cys) is the 21st proteinogenic amino acid. Selenoproteins contain selenocysteine residues. Selenocysteine is an analogue of the more common cysteine with selenium in place of the ...
is used as a nucleophile.
The deprotonated Se
− state is strongly favoured when in a catalytic triad.
Base
Since no natural amino acids are strongly nucleophilic, the base in a catalytic triad
polarises and
deprotonate
Deprotonation (or dehydronation) is the removal (transfer) of a proton (or hydron, or hydrogen cation), (H+) from a Brønsted–Lowry acid in an acid–base reaction.Henry Jakubowski, Biochemistry Online Chapter 2A3, https://employees.csbsju.ed ...
s the nucleophile to increase its reactivity.
Additionally, it
protonates the first
product to aid leaving group departure.
The base is most commonly histidine since its p''K''
a allows for effective base catalysis, hydrogen bonding to the acid residue, and deprotonation of the nucleophile residue.
β-lactamase
Beta-lactamases (β-lactamases) are enzymes () produced by bacteria that provide Multiple drug resistance, multi-resistance to beta-lactam antibiotics such as penicillins, cephalosporins, cephamycins, monobactams and carbapenems (ertapenem ...
s such as
TEM-1 use a
lysine
Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. Lysine contains an α-amino group (which is in the protonated form when the lysine is dissolved in water at physiological pH), an α-carboxylic acid group ( ...
residue as the base. Because lysine's p''K''
a is so high (p''K''
a=11), a glutamate and several other residues act as the acid to stabilise its deprotonated state during the catalytic cycle. Threonine proteases use their ''N''-terminal amide as the base, since steric crowding by the catalytic threonine's methyl prevents other residues from being close enough.
Acid
The acidic triad member forms a hydrogen bond with the basic residue, leading to mutual alignment via restriction of the basic residue's side-chain rotation. The positive charge on the basic residue is simultaneously stabilised, leading to its polarisation.
Two amino acids have acidic
side chains at physiological pH (aspartate or glutamate) and so are the most common members of the acidic triad residue.
Cytomegalovirus
''Cytomegalovirus'' (CMV) (from ''cyto-'' 'cell' via Greek - 'container' + 'big, megalo-' + -''virus'' via Latin 'poison') is a genus of viruses in the order '' Herpesvirales'', in the family '' Herpesviridae'', in the subfamily '' Betaherp ...
protease uses a pair of histidines, one as the base, as usual, and one as the acid.
The second histidine is not as effective an acid as the more common aspartate or glutamate, leading to a lower catalytic efficiency.
Examples of triads
Ser-His-Asp
The Serine-Histidine-Aspartate motif is one of the most thoroughly characterised catalytic motifs in biochemistry.
The triad is exemplified by
chymotrypsin
Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenu ...
, a model serine protease from the
PA superfamily which uses its triad to hydrolyse protein backbones. The aspartate is hydrogen bonded to the histidine, increasing the p''K''
a of its imidazole nitrogen from 7 to around 12. Histidine is thus able to act as a powerful general base, activating the serine nucleophile. The histidine base aids the first leaving group by donating a proton, and also activates the hydrolytic water substrate by abstracting a proton as the remaining OH
− attacks the acyl-enzyme intermediate.
[Rauwerdink, Alissa, and Romas J. Kazlauskas. "How the same core catalytic machinery catalyzes 17 different reactions: the serine-histidine-aspartate catalytic triad of α/β-hydrolase fold enzymes." ACS catalysis 5.10 (2015): 6153-6176.]
The same triad has also convergently evolved in
α/β hydrolases such as some
lipase
In biochemistry, lipase ( ) refers to a class of enzymes that catalyzes the hydrolysis of fats. Some lipases display broad substrate scope including esters of cholesterol, phospholipids, and of lipid-soluble vitamins and sphingomyelinases; howe ...
s and
esterase
In biochemistry, an esterase is a class of enzyme that splits esters into an acid and an alcohol in a chemical reaction with water called hydrolysis (and as such, it is a type of hydrolase).
A wide range of different esterases exist that differ ...
s, however
orientation
Orientation may refer to:
Positioning in physical space
* Map orientation, the relationship between directions on a map and compass directions
* Orientation (housing), the position of a building with respect to the sun, a concept in building des ...
of the triad members is reversed.
Additionally, brain
acetyl hydrolase (which has the same fold as a small
G-protein
G proteins, also known as guanine nucleotide-binding proteins, are a family of proteins that act as molecular switches inside cells, and are involved in transmitting signals from a variety of stimuli outside a cell to its interior. Their ...
) has also been found to have this triad.
Cys-His-Asp
The second most studied triad is the Cysteine-Histidine-Aspartate motif.
Several families of
cysteine protease
Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad.
Discovered by Gopal Chu ...
s use this triad set, for example TEV protease and
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wi ...
. The triad acts similarly to serine protease triads, with a few notable differences. Due to cysteine's low p''K''
a, the importance of the Asp to catalysis varies and several cysteine proteases are effectively Cys-His dyads (e.g.
hepatitis A virus protease), whilst in others the cysteine is already deprotonated before catalysis begins (e.g. papain).
This triad is also used by some amidases, such as
''N''-glycanase to hydrolyse non-peptide C-N bonds.
Ser-His-His
The triad of
cytomegalovirus
''Cytomegalovirus'' (CMV) (from ''cyto-'' 'cell' via Greek - 'container' + 'big, megalo-' + -''virus'' via Latin 'poison') is a genus of viruses in the order '' Herpesvirales'', in the family '' Herpesviridae'', in the subfamily '' Betaherp ...
protease uses histidine as both the acid and base triad members. Removing the acid histidine results in only a 10-fold activity loss (compared to >10,000-fold when aspartate is removed from chymotrypsin). This triad has been interpreted as a possible way of generating a less active enzyme to control cleavage rate.
Ser-Glu-Asp
An unusual triad is found in
sedolisin proteases. The low p''K''
a of the glutamate carboxylate group means that it only acts as a base in the triad at very low pH. The triad is hypothesised to be an
adaptation
In biology, adaptation has three related meanings. Firstly, it is the dynamic evolutionary process of natural selection that fits organisms to their environment, enhancing their evolutionary fitness. Secondly, it is a state reached by the p ...
to specific environments like
acidic
An acid is a molecule or ion capable of either donating a proton (i.e. hydrogen cation, H+), known as a Brønsted–Lowry acid, or forming a covalent bond with an electron pair, known as a Lewis acid.
The first category of acids are the ...
hot spring
A hot spring, hydrothermal spring, or geothermal spring is a Spring (hydrology), spring produced by the emergence of Geothermal activity, geothermally heated groundwater onto the surface of the Earth. The groundwater is heated either by shallow ...
s (e.g.
kumamolysin) or cell
lysosome
A lysosome () is a membrane-bound organelle that is found in all mammalian cells, with the exception of red blood cells (erythrocytes). There are normally hundreds of lysosomes in the cytosol, where they function as the cell’s degradation cent ...
(e.g.
tripeptidyl peptidase).
Cys-His-Ser
The
endothelial
The endothelium (: endothelia) is a single layer of squamous endothelial cells that line the interior surface of blood vessels and lymphatic vessels. The endothelium forms an interface between circulating blood or lymph in the lumen and the res ...
protease
vasohibin uses a cysteine as the nucleophile, but a serine to coordinate the histidine base.
Despite the serine being a poor acid, it is still effective in orienting the histidine in the catalytic triad.
Some homologues alternatively have a threonine instead of serine at the acid location.
Thr-Nter, Ser-Nter and Cys-Nter
Threonine proteases, such as the
proteasome
Proteasomes are essential protein complexes responsible for the degradation of proteins by proteolysis, a chemical reaction that breaks peptide bonds. Enzymes that help such reactions are called proteases. Proteasomes are found inside all e ...
protease subunit and
ornithine acyltransferases use the secondary hydroxyl of threonine in a manner analogous to the use of the serine
primary hydroxyl.
However, due to the steric interference of the extra methyl group of threonine, the base member of the triad is the ''N''-terminal amide which polarises an ordered water which, in turn,
deprotonates the catalytic hydroxyl to increase its reactivity.
Similarly, there exist equivalent 'serine only' and 'cysteine only' configurations such as
penicillin acylase G and
penicillin acylase V which are evolutionarily related to the proteasome proteases. Again, these use their ''N''-terminal amide as a base.
Ser-''cis''Ser-Lys
This unusual triad occurs only in one superfamily of amidases. Here, the lysine acts to polarise the middle serine.
The middle serine then forms two strong hydrogen bonds to the nucleophilic serine to activate it (one with the side chain hydroxyl and the other with the backbone amide). The middle serine is held in an unusual ''
cis'' orientation to facilitate precise contacts with the other two triad residues. The triad is further unusual in that the lysine and ''cis''-serine both act as the base in activating the catalytic serine, but the same lysine also performs the role of the acid member as well as making key structural contacts.
Sec-His-Glu
The rare, but naturally occurring amino acid
selenocysteine
Selenocysteine (symbol Sec or U, in older publications also as Se-Cys) is the 21st proteinogenic amino acid. Selenoproteins contain selenocysteine residues. Selenocysteine is an analogue of the more common cysteine with selenium in place of the ...
(Sec), can also be found as the nucleophile in some catalytic triads.
Selenocysteine is similar to cysteine, but contains a
selenium
Selenium is a chemical element; it has symbol (chemistry), symbol Se and atomic number 34. It has various physical appearances, including a brick-red powder, a vitreous black solid, and a grey metallic-looking form. It seldom occurs in this elem ...
atom instead of a sulfur. A selenocysteine residue is found in the active site of
thioredoxin reductase
Thioredoxin reductases (TR, TrxR) () are enzymes that reduce thioredoxin (Trx). Two classes of thioredoxin reductase have been identified: one class in bacteria and some eukaryotes and one in animals. Bacterial TrxR also catalyzes the reduction ...
, which uses the
selenol
Selenols are organic compounds that contain the functional group with the connectivity . Selenols are sometimes also called selenomercaptans and selenothiols. Selenols are one of the principal classes of organoselenium compounds. A well-known sel ...
group for reduction of disulfide in thioredoxin.
Engineered triads
In addition to naturally occurring types of catalytic triads,
protein engineering has been used to create enzyme variants with non-native amino acids, or entirely synthetic amino acids. Catalytic triads have also been inserted into otherwise non-catalytic proteins, or protein mimics.
Subtilisin (a serine protease) has had its oxygen nucleophile replaced with each of sulfur,
selenium
Selenium is a chemical element; it has symbol (chemistry), symbol Se and atomic number 34. It has various physical appearances, including a brick-red powder, a vitreous black solid, and a grey metallic-looking form. It seldom occurs in this elem ...
,
or
tellurium
Tellurium is a chemical element; it has symbol Te and atomic number 52. It is a brittle, mildly toxic, rare, silver-white metalloid. Tellurium is chemically related to selenium and sulfur, all three of which are chalcogens. It is occasionally fou ...
.
Cysteine and selenocysteine were inserted by
mutagenesis
Mutagenesis () is a process by which the genetic information of an organism is changed by the production of a mutation. It may occur spontaneously in nature, or as a result of exposure to mutagens. It can also be achieved experimentally using lab ...
, whereas the non-natural amino acid,
tellurocysteine
Tellurocysteine (in some publications referred to as Te-Cys) is an amino acid with the formula . It is the heavy analogue of serine, cysteine, and selenocysteine. Tellurol (RTeH) is a rare and fragile functional group, especially alkyl derivative ...
, was inserted using
auxotrophic cells fed with synthetic tellurocysteine. These elements are all in the 16th
periodic table column (
chalcogen
The chalcogens (ore forming) ( ) are the chemical elements in group 16 of the periodic table. This group is also known as the oxygen family. Group 16 consists of the elements oxygen (O), sulfur (S), selenium (Se), tellurium (Te), and the rad ...
s), so have similar properties. In each case, changing the nucleophile lowered the enzyme's protease activity, but increased a new activity. A sulfur nucleophile improved the enzymes
transferase
In biochemistry, a transferase is any one of a class of enzymes that catalyse the transfer of specific functional groups (e.g. a methyl or glycosyl group) from one molecule (called the donor) to another (called the acceptor). They are involved ...
activity (sometimes called subtiligase). Selenium and tellurium nucleophiles converted the enzyme into an
oxidoreductase
In biochemistry, an oxidoreductase is an enzyme that catalyzes the transfer of electrons from one molecule, the reductant, also called the electron donor, to another, the oxidant, also called the electron acceptor. This group of enzymes usually ut ...
.
When the nucleophile of TEV protease was converted from cysteine to serine, its protease activity was strongly reduced, but it was possible to restore it by
directed evolution
Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to steer proteins or nucleic acids toward a user-defined goal. It consists of subjecting a gene to iterative rounds of mutagenesis (cre ...
.
Non-catalytic proteins have been used as scaffolds, having catalytic triads inserted into them which were then improved by directed evolution. The Ser-His-Asp triad has been inserted into an antibody, as well as a range of other proteins. Similarly, catalytic triad mimics have been created in
small organic molecules like diaryl diselenide, and displayed on larger polymers like
Merrifield resins, and
self-assembling short peptide nanostructures.
Divergent evolution
The sophistication of the active site network causes residues involved in catalysis (and residues in contact with these) to be highly
evolutionarily conserved. However, many examples of divergent evolution in catalytic triads exist, both in the reaction catalysed, and the residues used in catalysis. The triad remains the core of the active site, but it is evolutionarily
adapted to serve different functions. Some proteins, called
pseudoenzymes, have non-catalytic functions (e.g. regulation by inhibitory binding) and have accumulated mutations that inactivate their catalytic triad.
Reaction changes
Catalytic triads perform
covalent catalysis via an acyl-enzyme intermediate. If the intermediate is resolved by water, the result is hydrolysis of the substrate. However, if the intermediate is resolved by attack by a second substrate, then the enzyme acts as a
transferase
In biochemistry, a transferase is any one of a class of enzymes that catalyse the transfer of specific functional groups (e.g. a methyl or glycosyl group) from one molecule (called the donor) to another (called the acceptor). They are involved ...
. For example, attack by an
acyl
In chemistry, an acyl group is a moiety derived by the removal of one or more hydroxyl groups from an oxoacid, including inorganic acids. It contains a double-bonded oxygen atom and an organyl group () or hydrogen in the case of formyl grou ...
group results in an
acyltransferase
Acyltransferase is a type of transferase enzyme that acts upon acyl
In chemistry, an acyl group is a moiety derived by the removal of one or more hydroxyl groups from an oxoacid, including inorganic acids. It contains a double-bonded oxygen ...
reaction. Several families of transferase enzymes have evolved from hydrolases by adaptation to exclude water and favour attack of a second substrate. In different members of the α/β-hydrolase superfamily, the Ser-His-Asp triad is tuned by surrounding residues to perform at least 17 different reactions.
Some of these reactions are also achieved with mechanisms that have altered formation, or resolution of the acyl-enzyme intermediate, or that don't proceed via an acyl-enzyme intermediate.
Additionally, an alternative transferase mechanism has been evolved by
amidophosphoribosyltransferase, which has two active sites. In the first active site, a cysteine triad hydrolyses a
glutamine
Glutamine (symbol Gln or Q) is an α-amino acid that is used in the biosynthesis of proteins. Its side chain is similar to that of glutamic acid, except the carboxylic acid group is replaced by an amide. It is classified as a charge-neutral ...
substrate to release free ammonia. The ammonia then diffuses though an
internal tunnel in the enzyme to the second active site, where it is transferred to a second substrate.
Nucleophile changes

Divergent evolution of active site residues is slow, due to strong chemical constraints. Nevertheless, some protease
superfamilies have evolved from one nucleophile to another. This can be inferred when a superfamily (with the same
fold) contains
families
Family (from ) is a group of people related either by consanguinity (by recognized birth) or affinity (by marriage or other relationship). It forms the basis for social order. Ideally, families offer predictability, structure, and safety as ...
that use different nucleophiles.
Such nucleophile switches have occurred several times during evolutionary history, however the mechanisms by which this happen are still unclear.
Within protease superfamilies that contain a mixture of nucleophiles (e.g. the
PA clan), families are designated by their catalytic nucleophile (C=cysteine proteases, S=serine proteases).
Pseudoenzymes
A further subclass of catalytic triad variants are
pseudoenzymes, which have triad mutations that make them catalytically inactive, but able to function as binding or structural proteins.
For example, the
heparin
Heparin, also known as unfractionated heparin (UFH), is a medication and naturally occurring glycosaminoglycan. Heparin is a blood anticoagulant that increases the activity of antithrombin. It is used in the treatment of myocardial infarction, ...
-binding protein
Azurocidin is a member of the PA clan, but with a glycine in place of the nucleophile and a serine in place of the histidine. Similarly,
RHBDF1 is a homolog of the S54 family
rhomboid protease
The rhomboid proteases are a family of enzymes that exist in almost all species. They are proteases: they cut the polypeptide chain of other proteins. This proteolytic cleavage is irreversible in cell (biology), cells, and an important type of cel ...
s with an alanine in the place of the nucleophilic serine. In some cases, pseudoenzymes may still have an intact catalytic triad but mutations in the rest of the protein remove catalytic activity. The CA clan contains catalytically inactive members with mutated triads (
calpamodulin has lysine in place of its cysteine nucleophile) and with intact triads but inactivating mutations elsewhere (rat testin retains a Cys-His-Asn triad).
Convergent evolution
The
enzymology
An enzyme () is a protein that acts as a biological catalyst by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different mol ...
of proteases provides some of the clearest known examples of
convergent evolution
Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last comm ...
at a molecular level. The same geometric arrangement of triad residues occurs in over 20 separate enzyme
superfamilies. Each of these superfamilies is the result of convergent evolution for the same triad arrangement within a different
structural fold. This is because there are limited productive ways to arrange three triad residues, the enzyme backbone and the substrate. These examples reflect the intrinsic chemical and physical constraints on enzymes, leading evolution to repeatedly and independently converge on equivalent solutions.
Cysteine and serine hydrolases
The same triad geometries been converged upon by serine proteases such as the
chymotrypsin
Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duodenu ...
and
subtilisin
Subtilisin is a protease (a protein-digesting enzyme) initially obtained from ''Bacillus subtilis''.
Subtilisins belong to subtilases, a group of serine proteases that – like all serine proteases – initiate the nucleophilic attack on the ...
superfamilies. Similar convergent evolution has occurred with cysteine proteases such as viral
C3 protease and
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wi ...
superfamilies. These triads have converged to almost the same arrangement due to the mechanistic similarities in cysteine and serine
proteolysis
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
mechanisms.
Families of cysteine proteases
Families of serine proteases
Threonine proteases
Threonine proteases use the amino acid threonine as their catalytic nucleophile. Unlike cysteine and serine, threonine is a secondary hydroxyl (i.e. has a methyl group). This methyl group greatly restricts the possible orientations of triad and substrate as the methyl clashes with either the enzyme backbone or histidine base.
When the nucleophile of a serine protease was mutated to threonine, the methyl occupied a mixture of positions, most of which prevented substrate binding.
Consequently, the
catalytic residue of a threonine protease is located at its ''N''-terminus.
Two evolutionarily independent
enzyme superfamilies
A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence simila ...
with different protein folds are known to use the ''N''-terminal residue as a nucleophile: Superfamily PB (proteasomes using the Ntn fold)
and Superfamily PE (
acetyltransferase
An acetyltransferase (also referred to as a transacetylase) is any of a class of transferase enzymes that transfers an acetyl group in a reaction called acetylation. In biological organisms, post-translational modification of a protein via acetyl ...
s using the DOM fold)
This commonality of
active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the ''binding s ...
structure in completely different protein folds indicates that the active site evolved convergently in those superfamilies.
Families of threonine proteases
See also
*
Enzyme catalysis
Enzyme catalysis is the increase in the rate of a process by an "enzyme", a biological molecule. Most enzymes are proteins, and most such processes are chemical reactions. Within the enzyme, generally catalysis occurs at a localized site, calle ...
*
Enzyme superfamily
References
Notes
Citations
{{Use British English, date=August 2015
Molecular biology
Catalysis
Evolution