PA clan
   HOME

TheInfoList



OR:

The PA clan ( Proteases of mixed nucleophile,
superfamily SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, str ...
A) is the largest group of
protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
s with common ancestry as identified by structural homology. Members have a
chymotrypsin Chymotrypsin (, chymotrypsins A and B, alpha-chymar ophth, avazyme, chymar, chymotest, enzeon, quimar, quimotrase, alpha-chymar, alpha-chymotrypsin A, alpha-chymotrypsin) is a digestive enzyme component of pancreatic juice acting in the duod ...
-like fold and similar
proteolysis Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called protease ...
mechanisms but can have identity of <10%. The clan contains both
cysteine Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. When present as a deprotonated catalytic residue, some ...
and serine proteases (different nucleophiles). PA clan proteases can be found in plants,
animals Animals are multicellular, eukaryotic organisms in the biological kingdom Animalia. With few exceptions, animals consume organic material, breathe oxygen, are able to move, can reproduce sexually, and go through an ontogenetic stage in ...
,
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately fr ...
,
eubacteria Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were amon ...
,
archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaeba ...
and
viruses A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsky's ...
. The common use of the
catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, li ...
for hydrolysis by multiple clans of proteases, including the PA clan, represents an example of
convergent evolution Convergent evolution is the independent evolution of similar features in species of different periods or epochs in time. Convergent evolution creates analogous structures that have similar form or function but were not present in the last com ...
. The differences in the catalytic triad within the PA clan is also an example of divergent evolution of
active site In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate ( binding site) ...
s in enzymes.


History

In the 1960s, the sequence similarity of several proteases indicated that they were evolutionarily related. These were grouped into the chymotrypsin-like serine proteases (now called the
S1 family S1, S01, S.I, S-1, S.1, Š-1 or S 1 may refer to: Biology and chemistry * S1 nuclease, an enzyme that digests singled-stranded DNA and RNA * S1: Keep locked up, a safety phrase in chemistry * Primary somatosensory cortex, also known as S1 * Tegaf ...
). As the structures of these, and other proteases were solved by
X-ray crystallography X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
in the 1970s and 80s, it was noticed that several viral proteases such as Tobacco Etch Virus protease showed structural homology despite no discernible sequence similarity and even a different nucleophile. Based on structural homology, a
superfamily SUPERFAMILY is a database and search platform of structural and functional annotation for all proteins and genomes. It classifies amino acid sequences into known structural domains, especially into SCOP superfamilies. Domains are functional, str ...
was defined and later named the PA clan (by the
MEROPS MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitors ...
classification system). As more structures are solved, more protease families have been added to the PA clan superfamily.


Etymology

The ''P'' refers to ''P''roteases of mixed nucleophile. The ''A'' indicates that it was the first such clan to be identified (there also exist the PB, PC, PD and PE clans).


Structure

Despite retaining as little as 10% sequence identity, PA clan members isolated from viruses, prokaryotes and eukaryotes show structural homology and can be aligned by structural similarity (e.g. with DALI).


Double β-barrel

PA clan proteases all share a core motif of two β-barrels with covalent catalysis performed by an acid-histidine-nucleophile
catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, li ...
motif. The barrels are arranged perpendicularly beside each other with hydrophobic residues holding them together as the core scaffold for the enzyme. The triad residues are split between the two barrels so that
catalysis Catalysis () is the process of increasing the rate of a chemical reaction by adding a substance known as a catalyst (). Catalysts are not consumed in the reaction and remain unchanged after it. If the reaction is rapid and the catalyst recyc ...
takes place at their interface.


Viral protease loop

In addition to the double β-barrel core, some viral proteases (such as TEV protease) have a long, flexible C-terminal loop that forms a lid that completely covers the substrate and create a binding tunnel. This tunnel contains a set of tight binding pockets such that each side chain of the substrate peptide (P6 to P1’) is bound in a complementary site (S6 to S1’) and specificity is endowed by the large contact area between enzyme and substrate. Conversely, cellular proteases that lack this loop, such as
trypsin Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting these long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the d ...
have broader specificity.


Evolution and function


Catalytic activity

Structural homology indicates that the PA clan members are descended from a common ancestor of the same fold. Although PA clan proteases use a catalytic triad perform 2-step
nucleophilic catalysis Enzyme catalysis is the increase in the rate of a process by a biological molecule, an "enzyme". Most enzymes are proteins, and most such processes are chemical reactions. Within the enzyme, generally catalysis occurs at a localized site, calle ...
, some families use
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − for ...
as the
nucleophile In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they ar ...
whereas others use
cysteine Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. When present as a deprotonated catalytic residue, some ...
. The superfamily is therefore an extreme example of divergent enzyme evolution since during evolutionary history, the core catalytic residue of the enzyme has switched in different families. In addition to their structural similarity,
directed evolution Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to steer proteins or nucleic acids toward a user-defined goal. It consists of subjecting a gene to iterative rounds of mutagenesis ...
has been shown to be able to convert a cysteine protease into an active serine protease. All cellular PA clan proteases are serine proteases, however there are both serine and
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal ...
families of viral proteases. The majority are endopeptidases, with the exception being the S46 family of
exopeptidases An exopeptidase is any peptidase that catalyzes the cleavage of the terminal (or the penultimate) peptide bond; the process releases a single amino acid, dipeptide or a tripeptide from the peptide chain. Depending on whether the amino acid is re ...
.


Biological role and substrate specificity

In addition to divergence in their core catalytic machinery, the PA clan proteases also show wide divergent evolution in function. Members of the PA clan can be found in
eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacter ...
s,
prokaryote A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Con ...
s and
virus A virus is a submicroscopic infectious agent that replicates only inside the living cells of an organism. Viruses infect all life forms, from animals and plants to microorganisms, including bacteria and archaea. Since Dmitri Ivanovsk ...
es and encompass a wide range of functions. In mammals, some are involved in
blood clotting Coagulation, also known as clotting, is the process by which blood changes from a liquid to a gel, forming a blood clot. It potentially results in hemostasis, the cessation of blood loss from a damaged vessel, followed by repair. The mechan ...
(e.g. thrombin) and so have high substrate specificity as well as
digestion Digestion is the breakdown of large insoluble food molecules into small water-soluble food molecules so that they can be absorbed into the watery blood plasma. In certain organisms, these smaller substances are absorbed through the small intest ...
(e.g.
trypsin Trypsin is an enzyme in the first section of the small intestine that starts the digestion of protein molecules by cutting these long chains of amino acids into smaller pieces. It is a serine protease from the PA clan superfamily, found in the d ...
) with broad substrate specificity. Several snake venoms are also PA clan proteases, such as
pit viper The Crotalinae, commonly known as pit vipers,Mehrtens JM (1987). ''Living Snakes of the World in Color''. New York: Sterling Publishers. 480 pp. . crotaline snakes (from grc, κρόταλον ''krotalon'' castanet), or pit adders, are a subfa ...
haemotoxin and interfere with the victim's blood clotting cascade. Additionally, bacteria such as ''
Staphylococcus aureus ''Staphylococcus aureus'' is a Gram-positive spherically shaped bacterium, a member of the Bacillota, and is a usual member of the microbiota of the body, frequently found in the upper respiratory tract and on the skin. It is often posit ...
'' secrete exfoliative toxin which digest and damage the host's tissues. Many viruses express their
genome In the fields of molecular biology and genetics, a genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding ...
as a single, massive polyprotein and use a PA clan protease to cleave this into functional units (e.g.
polio Poliomyelitis, commonly shortened to polio, is an infectious disease caused by the poliovirus. Approximately 70% of cases are asymptomatic; mild symptoms which can occur include sore throat and fever; in a proportion of cases more severe sy ...
,
norovirus Norovirus, sometimes referred to as the winter vomiting disease, is the most common cause of gastroenteritis. Infection is characterized by non-bloody diarrhea, vomiting, and stomach pain. Fever or headaches may also occur. Symptoms usually devel ...
, and TEV proteases). There are also several pseudoenzymes in the superfamily, where the catalytic triad residues have been mutated and so function as binding proteins. For example, the
heparin Heparin, also known as unfractionated heparin (UFH), is a medication and naturally occurring glycosaminoglycan. Since heparins depend on the activity of antithrombin, they are considered anticoagulants. Specifically it is also used in the treat ...
-binding protein Azurocidin has a glycine in place of the nucleophile and a serine in place of the histidine.


Families

Within the PA clan (P=proteases of mixed nucleophiles), families are designated by their catalytic nucleophile (C=
cysteine protease Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad. Discovered by Gopal ...
s, S= serine proteases). Despite the lack of sequence homology for the PA clan as a whole, individual families within it can be identified by sequence similarity.


See also

*
Protease A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes (increases reaction rate or "speeds up") proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the ...
** cysteine- ** serine- ** threonine- ** aspartic- ** metallo- *
Catalytic triad A catalytic triad is a set of three coordinated amino acids that can be found in the active site of some enzymes. Catalytic triads are most commonly found in hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, li ...
*
Homology (biology) In biology, homology is similarity due to shared ancestry between a pair of structures or genes in different taxa. A common example of homologous structures is the forelimbs of vertebrates, where the Bat wing development, wings of bats and Ori ...
*
MEROPS MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitors ...
*
Protein family A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be ...
*
Protein superfamily A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if no sequence similar ...
*
Protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monom ...
*
Structural alignment Structural alignment attempts to establish homology between two or more polymer structures based on their shape and three-dimensional conformation. This process is usually applied to protein tertiary structures but can also be used for large R ...


References


External links


MEROPS
- Comprehensive protease database
Superfamily
- A database of protein folds {{Enzymes EC 3.4 Molecular evolution Proteases Protein superfamilies