Papain-like proteases (or papain-like (cysteine) peptidases; abbreviated PLP or PLCP) are a large
protein family
A protein family is a group of evolutionarily related proteins. In many cases, a protein family has a corresponding gene family, in which each gene encodes a corresponding protein with a 1:1 relationship. The term "protein family" should not be c ...
of
cysteine protease
Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad.
Discovered by Gopal Chund ...
enzyme
Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as products. A ...
s that share
structural
A structure is an arrangement and organization of interrelated elements in a material object or system, or the object or system so organized. Material structures include man-made objects such as buildings and machines and natural objects such ...
and
enzymatic
Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrates, and the enzyme converts the substrates into different molecules known as products. ...
properties with the group's namesake member,
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wide ...
. They are found in all
domains of life. In animals, the group is often known as cysteine
cathepsins
Cathepsins (Ancient Greek ''kata-'' "down" and ''hepsein'' "boil"; abbreviated CTS) are proteases (enzymes that degrade proteins) found in all animals as well as other organisms. There are approximately a dozen members of this family, which are di ...
or, in older literature,
lysosomal
A lysosome () is a membrane-bound organelle found in many animal Cell (biology), cells. They are spherical Vesicle (biology and chemistry), vesicles that contain Hydrolysis, hydrolytic enzymes that can break down many kinds of biomolecules. A ly ...
peptidases.
In the
MEROPS
MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitor ...
protease enzyme classification system, papain-like proteases form Clan CA.
Papain-like proteases share a common
catalytic dyad active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate (binding site) a ...
featuring a
cysteine
Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile.
When present as a deprotonated catalytic residue, sometime ...
amino acid residue that acts as a
nucleophile
In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
.
The
human genome
The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the n ...
encodes eleven cysteine cathepsins which have a broad range of physiological functions.
In some
parasite
Parasitism is a close relationship between species, where one organism, the parasite, lives on or inside another organism, the host, causing it some harm, and is adapted structurally to this way of life. The entomologist E. O. Wilson has ...
s papain-like proteases have roles in
host
A host is a person responsible for guests at an event or for providing hospitality during it.
Host may also refer to:
Places
* Host, Pennsylvania, a village in Berks County
People
*Jim Host (born 1937), American businessman
* Michel Host ...
invasion, such as
cruzipain
Cruzipain is a cysteine protease expressed by ''Trypanosoma cruzi''.
It is classified under .
Cruzipain is expressed by all strains and developmental forms of ''Trypanosoma cruzi''. It is secreted and can be found in the membrane of the parasite ...
from ''
Trypanosoma cruzi
''Trypanosoma cruzi'' is a species of parasitic euglenoids. Among the protozoa, the trypanosomes characteristically bore tissue in another organism and feed on blood (primarily) and also lymph. This behaviour causes disease or the likelihood of ...
''.
In plants, they are involved in host defense and in development.
Studies of papain-like proteases from
prokaryote
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
s have lagged their
eukaryotic
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
counterparts.
In cellular organisms they are synthesized as
preproenzyme A preproenzyme is an enzyme with two additional characteristics: "pre" refers to a signal sequence (signal peptide) which directs the enzyme to a specific organelle or subcellular localization; "pro" indicates that the enzyme is present in an inacti ...
s that are not enzymatically active until mature, and their activities are tightly regulated, often by the presence of endogenous
protease inhibitors such as
cystatin
The cystatins are a family of cysteine protease inhibitors which share a sequence homology and a common tertiary structure of an alpha helix lying on top of an anti-parallel beta sheet. The family is subdivided as described below.
Cystatins sho ...
s.
In many
RNA virus
An RNA virus is a virusother than a retrovirusthat has ribonucleic acid (RNA) as its genetic material. The nucleic acid is usually single-stranded RNA ( ssRNA) but it may be double-stranded (dsRNA). Notable human diseases caused by RNA viruses ...
es, including significant human
pathogen
In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
s such as the
coronavirus
Coronaviruses are a group of related RNA viruses that cause diseases in mammals and birds. In humans and birds, they cause respiratory tract infections that can range from mild to lethal. Mild illnesses in humans include some cases of the com ...
es
SARS-CoV
Severe acute respiratory syndrome coronavirus 1 (SARS-CoV-1; or Severe acute respiratory syndrome coronavirus, SARS-CoV) is a strain of coronavirus that causes severe acute respiratory syndrome (SARS), the respiratory illness responsible for ...
and
SARS-CoV-2
Severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) is a strain of coronavirus that causes COVID-19 (coronavirus disease 2019), the respiratory illness responsible for the ongoing COVID-19 pandemic. The virus previously had a ...
, papain-like protease
protein domain
In molecular biology, a protein domain is a region of a protein's polypeptide chain that is self-stabilizing and that folds independently from the rest. Each domain forms a compact folded three-dimensional structure. Many proteins consist of s ...
s often have roles in processing of
polyprotein
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called protease ...
s into mature
viral nonstructural protein In virology, a nonstructural protein is a protein encoded by a virus but that is not part of the viral particle. They typically include the various enzymes and transcription factors the virus uses to replicate itself, such as a viral protease ( 3CL ...
s.
Many papain-like proteases are considered potential
drug target
A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
s.
Classification
The
MEROPS
MEROPS is an online database for peptidases (also known as proteases, proteinases and proteolytic enzymes) and their inhibitors. The classification scheme for peptidases was published by Rawlings & Barrett in 1993, and that for protein inhibitor ...
system of protease enzyme classification defines clan CA as containing the papain-like proteases. They are thought to have a shared
evolution
Evolution is change in the heritable characteristics of biological populations over successive generations. These characteristics are the expressions of genes, which are passed on from parent to offspring during reproduction. Variation ...
ary origin. As of 2021, the clan contained 45 families.
Structure
The structure of papain was among the earliest
protein structure
Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer ma ...
s experimentally determined by
X-ray crystallography
X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
.
Many papain-like protease enzymes function as
monomer
In chemistry, a monomer ( ; ''mono-'', "one" + '' -mer'', "part") is a molecule that can react together with other monomer molecules to form a larger polymer chain or three-dimensional network in a process called polymerization.
Classification
Mo ...
s, though a few, such as
cathepsin C
Cathepsin C (CTSC) also known as dipeptidyl peptidase I (DPP-I) is a lysosomal exo-cysteine protease belonging to the peptidase C1 protein family, a subgroup of the cysteine cathepsins. In humans, it is encoded by the ''CTSC'' gene.
Function ...
(Dipeptidyl-peptidase I), are
homotetramers. The mature monomer structure is characteristically divided into two lobes or subdomains, known as the L-domain (
N-terminal
The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the ami ...
) and the R-domain (
C-terminal
The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH). When the protein is ...
), where the
active site
In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate (binding site) a ...
is located between them.
The L-domain is primarily
helical
Helical may refer to:
* Helix, the mathematical concept for the shape
* Helical engine, a proposed spacecraft propulsion drive
* Helical spring, a coilspring
* Helical plc, a British property company, once a maker of steel bar stock
* Helicoil
A t ...
while the R-domain contains
beta-sheet
The beta sheet, (β-sheet) (also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a g ...
s in a
beta-barrel
In protein structures, a beta barrel is a beta sheet composed of tandem repeats that twists and coils to form a closed toroidal structure in which the first strand is bonded to the last strand (hydrogen bond). Beta-strands in many beta-barrels are ...
-like shape, surrounded by a helix.
The
enzyme substrate
In chemistry, the term substrate is highly context-dependent. Broadly speaking, it can refer either to a chemical species being observed in a chemical reaction, or to a surface on which other chemical reactions or microscopy are performed.
In the ...
interacts with both domains in an extended conformation.
Papain-like proteases are often
synthesized as
preproenzyme A preproenzyme is an enzyme with two additional characteristics: "pre" refers to a signal sequence (signal peptide) which directs the enzyme to a specific organelle or subcellular localization; "pro" indicates that the enzyme is present in an inacti ...
s, or enzymatically inactive precursors. A
signal peptide
A signal peptide (sometimes referred to as signal sequence, targeting signal, localization signal, localization sequence, transit peptide, leader sequence or leader peptide) is a short peptide (usually 16-30 amino acids long) present at the N-ter ...
at the
N-terminus
The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) is the start of a protein or polypeptide, referring to the free amine group (-NH2) located at the end of a polypeptide. Within a peptide, the ami ...
, which serves as a
subcellular localization The cells of eukaryotic organisms are elaborately subdivided into functionally-distinct membrane-bound compartments. Some major constituents of eukaryotic cells are: extracellular space, plasma membrane, cytoplasm, nucleus, mitochondria, Golgi ap ...
signal, is cleaved by
signal peptidase
Signal peptidases are enzymes that convert secretory and some membrane proteins to their mature or pro forms by cleaving their signal peptides from their N-termini.
Signal peptidases were initially observed in endoplasmic reticulum (ER)-deri ...
to form a
zymogen
In biochemistry, a zymogen (), also called a proenzyme (), is an inactive precursor of an enzyme. A zymogen requires a biochemical change (such as a hydrolysis reaction revealing the active site, or changing the configuration to reveal the active ...
.
Post-translational modification
Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosome ...
in the form of
N-linked glycosylation
''N''-linked glycosylation, is the attachment of an oligosaccharide, a carbohydrate consisting of several sugar molecules, sometimes also referred to as glycan, to a nitrogen atom (the amide nitrogen of an asparagine (Asn) residue of a protein), ...
also occurs in parallel.
The zymogen is still inactive due to the presence of a
propeptide which functions as an inhibitor blocking access to the active site. The propeptide is removed by
proteolysis
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called protease ...
to form the mature enzyme.
Catalytic mechanism
Papain-like proteases have a
catalytic dyad consisting of a
cysteine
Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile.
When present as a deprotonated catalytic residue, sometime ...
and a
histidine
Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the de ...
residue, which form an
ion pair
In chemistry, ion association is a chemical reaction whereby ions of opposite electric charge come together in solution to form a distinct chemical entity. Ion associates are classified, according to the number of ions that associate with each o ...
through their charged
thiolate
In organic chemistry, a thiol (; ), or thiol derivative, is any organosulfur compound of the form , where R represents an alkyl or other organic substituent. The functional group itself is referred to as either a thiol group or a sulfhydryl grou ...
and
imidazolium
Imidazole (ImH) is an organic compound with the formula C3N2H4. It is a white or colourless solid that is soluble in water, producing a mildly alkaline solution. In chemistry, it is an aromatic heterocycle, classified as a diazole, and has non-ad ...
side chains. The negatively charged cysteine thiolate functions as a
nucleophile
In chemistry, a nucleophile is a chemical species that forms bonds by donating an electron pair. All molecules and ions with a free pair of electrons or at least one pi bond can act as nucleophiles. Because nucleophiles donate electrons, they are ...
.
Additional neighboring residues -
aspartate
Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
,
asparagine
Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
, or
glutamine
Glutamine (symbol Gln or Q) is an α-amino acid that is used in the biosynthesis of proteins. Its side chain is similar to that of glutamic acid, except the carboxylic acid group is replaced by an amide. It is classified as a charge-neutral, ...
- position the catalytic residues;
in papain, the required catalytic residues cysteine, histidine, and aspartate are sometimes called the catalytic triad (similar to
serine protease
Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site.
They are found ubiquitously in both eukaryotes and prokaryotes. ...
s).
Papain-like proteases are usually
endopeptidase
Endopeptidase or endoproteinase are proteolytic peptidases that break peptide bonds of nonterminal amino acids (i.e. within the molecule), in contrast to exopeptidases, which break peptide bonds from end-pieces of terminal amino acids. For this ...
s, but some members of the group are also, or even exclusively,
exopeptidase
An exopeptidase is any peptidase that catalyzes the cleavage of the terminal (or the penultimate) peptide bond; the process releases a single amino acid, dipeptide or a tripeptide from the peptide chain. Depending on whether the amino acid is rel ...
s.
Some viral papain-like proteases, including those of
coronavirus
Coronaviruses are a group of related RNA viruses that cause diseases in mammals and birds. In humans and birds, they cause respiratory tract infections that can range from mild to lethal. Mild illnesses in humans include some cases of the com ...
es, can also cleave
isopeptide bonds and can function as
deubiquitinases.
Function
Eukaryotes
Mammals
In animals, especially in mammalian biology, members of the papain-like protease family are usually referred to as cysteine cathepsins - that is, the
cysteine protease
Cysteine proteases, also known as thiol proteases, are hydrolase enzymes that degrade proteins. These proteases share a common catalytic mechanism that involves a nucleophilic cysteine thiol in a catalytic triad or dyad.
Discovered by Gopal Chund ...
members of the group of proteases known as
cathepsin
Cathepsins (Ancient Greek ''kata-'' "down" and ''hepsein'' "boil"; abbreviated CTS) are proteases (enzymes that degrade proteins) found in all animals as well as other organisms. There are approximately a dozen members of this family, which are di ...
s (which includes cysteine,
serine
Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
, and
aspartic protease
Aspartic proteases are a catalytic type of protease enzymes that use an activated water molecule bound to one or more aspartate residues for catalysis of their peptide substrates. In general, they have two highly conserved aspartates in the activ ...
s). In humans, there are 11 cysteine cathepsins:
B,
C,
F,
H,
K,
L,
O,
S,
V,
X, and
W. Most cathepsins are
expressed throughout the body, but some have narrower
tissue distribution.
Although historically known as
lysosomal
A lysosome () is a membrane-bound organelle found in many animal Cell (biology), cells. They are spherical Vesicle (biology and chemistry), vesicles that contain Hydrolysis, hydrolytic enzymes that can break down many kinds of biomolecules. A ly ...
proteases and studied mainly for their role in protein
catabolism
Catabolism () is the set of metabolic pathways that breaks down molecules into smaller units that are either oxidized to release energy or used in other anabolic reactions. Catabolism breaks down large molecules (such as polysaccharides, lipids, ...
, cysteine cathepsins have since been identified playing major roles in a number of physiological processes and disease states. As part of normal physiological processes, they are involved in key steps of
antigen presentation
Antigen presentation is a vital immune process that is essential for T cell immune response triggering. Because T cells recognize only fragmented antigens displayed on cell surfaces, antigen processing must occur before the antigen fragment, now ...
as part of the
adaptive immune system
The adaptive immune system, also known as the acquired immune system, is a subsystem of the immune system that is composed of specialized, systemic cells and processes that eliminate pathogens or prevent their growth. The acquired immune system ...
, remodeling of the
extracellular matrix
In biology, the extracellular matrix (ECM), also called intercellular matrix, is a three-dimensional network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide stru ...
,
differentiation of
keratinocyte
Keratinocytes are the primary type of Cell (biology), cell found in the epidermis (skin), epidermis, the outermost layer of the skin. In humans, they constitute 90% of epidermal skin cells.
Basal cells in the stratum basale, basal layer (''str ...
s, and processing of
peptide hormone
Peptide hormones or protein hormones are hormones whose molecules are peptide, or proteins, respectively. The latter have longer amino acid chain lengths than the former. These hormones have an effect on the endocrine system of animals, including h ...
s.
Cysteine cathepsins have been associated with
cancer
Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
and
tumor progression
Tumor progression is the third and last phase in tumor development. This phase is characterised by increased growth speed and invasiveness of the tumor cells. As a result of the progression, phenotypical changes occur and the tumor becomes more agg ...
,
cardiovascular disease
Cardiovascular disease (CVD) is a class of diseases that involve the heart or blood vessels. CVD includes coronary artery diseases (CAD) such as angina and myocardial infarction (commonly known as a heart attack). Other CVDs include stroke, h ...
,
autoimmune disease
An autoimmune disease is a condition arising from an abnormal immune response to a functioning body part. At least 80 types of autoimmune diseases have been identified, with some evidence suggesting that there may be more than 100 types. Nearly a ...
, and other human health conditions.
Cathepsin K
Cathepsin K, abbreviated CTSK, is an enzyme that in humans is encoded by the ''CTSK'' gene.
Function
The protein encoded by this gene is a cysteine cathepsin, a lysosomal cysteine protease involved in bone remodeling and resorption. This prote ...
has a role in
bone resorption
Bone resorption is resorption of bone tissue, that is, the process by which osteoclasts break down the tissue in bones and release the minerals, resulting in a transfer of calcium from bone tissue to the blood.
The osteoclasts are multi-nucleated ...
and has been studied as a
drug target
A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
for
osteoporosis
Osteoporosis is a systemic skeletal disorder characterized by low bone mass, micro-architectural deterioration of bone tissue leading to bone fragility, and consequent increase in fracture risk. It is the most common reason for a broken bone ...
.
Parasites
A number of
parasite
Parasitism is a close relationship between species, where one organism, the parasite, lives on or inside another organism, the host, causing it some harm, and is adapted structurally to this way of life. The entomologist E. O. Wilson has ...
s, including
helminth
Parasitic worms, also known as helminths, are large macroparasites; adults can generally be seen with the naked eye. Many are intestinal worms that are soil-transmitted and infect the gastrointestinal tract. Other parasitic worms such as schi ...
s (parasitic worms), use papain-like proteases as mechanisms for invasion of their
hosts
A host is a person responsible for guests at an event or for providing hospitality during it.
Host may also refer to:
Places
*Host, Pennsylvania, a village in Berks County
People
*Jim Host (born 1937), American businessman
*Michel Host ( ...
. Examples include ''
Toxoplasma gondii
''Toxoplasma gondii'' () is an obligate intracellular parasitic protozoan (specifically an apicomplexan) that causes toxoplasmosis. Found worldwide, ''T. gondii'' is capable of infecting virtually all warm-blooded animals, but felids, such as d ...
'' and ''
Giardia lamblia
''Giardia duodenalis'', also known as ''Giardia intestinalis'' and ''Giardia lamblia'', is a flagellated parasitic microorganism of the genus '' Giardia'' that colonizes the small intestine, causing a diarrheal condition known as giardiasis. ...
''. In many flatworms, there are very high levels of expression of cysteine cathepsins; in the
liver fluke
Liver fluke is a collective name of a polyphyletic group of parasitic trematodes under the phylum Platyhelminthes.
They are principally parasites of the liver of various mammals, including humans. Capable of moving along the blood circulation, t ...
''
Fasciola hepatica
''Fasciola hepatica'', also known as the common liver fluke or sheep liver fluke, is a parasitic trematode (fluke or flatworm, a type of helminth) of the class Trematoda, phylum Platyhelminthes. It infects the livers of various mammals, inc ...
'',
gene duplication
Gene duplication (or chromosomal duplication or gene amplification) is a major mechanism through which new genetic material is generated during molecular evolution. It can be defined as any duplication of a region of DNA that contains a gene. ...
s have produced over 20
paralog
Sequence homology is the biological homology between DNA, RNA, or protein sequences, defined in terms of shared ancestry in the evolutionary history of life. Two segments of DNA can have shared ancestry because of three phenomena: either a sp ...
s of a
cathepsin L-like enzyme.
Cysteine cathepsins are also part of the normal life cycle of the unicellular parasite ''
Leishmania
''Leishmania'' is a parasitic protozoan, a single-celled organism of the genus '' Leishmania'' that are responsible for the disease leishmaniasis. They are spread by sandflies of the genus ''Phlebotomus'' in the Old World, and of the genus '' ...
'', where they function as
virulence factor
Virulence factors (preferably known as pathogenicity factors or effectors in plant science) are cellular structures, molecules and regulatory systems that enable microbial pathogens (bacteria, viruses, fungi, and protozoa) to achieve the following ...
s.
The enzyme and potential
drug target
A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
cruzipain
Cruzipain is a cysteine protease expressed by ''Trypanosoma cruzi''.
It is classified under .
Cruzipain is expressed by all strains and developmental forms of ''Trypanosoma cruzi''. It is secreted and can be found in the membrane of the parasite ...
is important for the life cycle of the parasite ''
Trypanosoma cruzi
''Trypanosoma cruzi'' is a species of parasitic euglenoids. Among the protozoa, the trypanosomes characteristically bore tissue in another organism and feed on blood (primarily) and also lymph. This behaviour causes disease or the likelihood of ...
'', which causes
Chagas' disease.
Plants
Members of the papain-like protease family play a number of important roles in
plant development Important structures in plant development are buds, shoots, roots, leaves, and flowers; plants produce these tissues and structures throughout their life from meristems located at the tips of organs, or between mature tissues. Thus, a living plant a ...
, including
seed germination
Germination is the process by which an organism grows from a seed or spore. The term is applied to the sprouting of a seedling from a seed of an flowering plant, angiosperm or gymnosperm, the growth of a sporeling from a spore, such as the spor ...
,
leaf senescence, and responding to
abiotic stress
Abiotic stress is the negative impact of non-living factors on the living organisms in a specific environment. The non-living variable must influence the environment beyond its normal range of variation to adversely affect the population performan ...
. Papain-like proteases are involved in regulation of
programmed cell death
Programmed cell death (PCD; sometimes referred to as cellular suicide) is the death of a cell as a result of events inside of a cell, such as apoptosis or autophagy. PCD is carried out in a biological process, which usually confers advantage durin ...
in plants, for example in
tapetum during development of
pollen
Pollen is a powdery substance produced by seed plants. It consists of pollen grains (highly reduced microgametophytes), which produce male gametes (sperm cells). Pollen grains have a hard coat made of sporopollenin that protects the gametophyt ...
. They are also important in
plant immunity providing defense against
pests
PESTS was an anonymous American activist group formed in 1986 to critique racism, tokenism, and exclusion in the art world. PESTS produced newsletters, posters, and other print material highlighting examples of discrimination in gallery represent ...
and
pathogens
In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
.
The relationship between plant papain-like proteases and pathogen responses - such as
cystatin
The cystatins are a family of cysteine protease inhibitors which share a sequence homology and a common tertiary structure of an alpha helix lying on top of an anti-parallel beta sheet. The family is subdivided as described below.
Cystatins sho ...
inhibitors - have been described as an
evolutionary arms race
In evolutionary biology, an evolutionary arms race is an ongoing struggle between competing sets of co-evolving genes, phenotypic and behavioral traits that develop escalating adaptations and counter-adaptations against each other, resembling an a ...
.
Some PLP family members in plants have culinary and commercial applications. The family's namesake member,
papain
Papain, also known as papaya proteinase I, is a cysteine protease () enzyme present in papaya (''Carica papaya'') and mountain papaya (''Vasconcellea cundinamarcensis''). It is the namesake member of the papain-like protease family.
It has wide ...
, is a protease derived from
papaya
The papaya (, ), papaw, () or pawpaw () is the plant species ''Carica papaya'', one of the 21 accepted species in the genus ''Carica'' of the family Caricaceae. It was first domesticated in Mesoamerica, within modern-day southern Mexico and ...
, used as a
meat tenderizer
A meat tenderizer, or meat pounder is a hand-powered tool used to tenderize slabs of meat in the preparation for cooking. Although a meat tenderizer can be made out of virtually any object, there are three types manufactured specifically for ...
.
Similar but less widely used plant products include
bromelain
Bromelain is an enzyme extract derived from the stems of pineapples, although it exists in all parts of the fresh pineapple. The extract has a history of folk medicine use. As an ingredient, it is used in cosmetics, as a topical medication, and as ...
from
pineapple
The pineapple (''Ananas comosus'') is a tropical plant with an edible fruit; it is the most economically significant plant in the family Bromeliaceae. The pineapple is indigenous to South America, where it has been cultivated for many centuri ...
and
ficin
Ficain also known as ficin, debricin, or higueroxyl delabarre () is a proteolytic enzyme extracted from the latex sap from the stems, leaves, and unripe fruit of the American wild fig tree ''Ficus insipida''.
Ficain was originally called ficin, a ...
from
fig
The fig is the edible fruit of ''Ficus carica'', a species of small tree in the flowering plant family Moraceae. Native to the Mediterranean and western Asia, it has been cultivated since ancient times and is now widely grown throughout the world ...
s.
Prokaryotes
Although papain-like proteases are found in all
domains of life, they have been less well-studied in
prokaryote
A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
s than in
eukaryote
Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
s.
Only a few prokaryotic PLP enzymes have been characterized by
X-ray crystallography
X-ray crystallography is the experimental science determining the atomic and molecular structure of a crystal, in which the crystalline structure causes a beam of incident X-rays to diffract into many specific directions. By measuring the angles ...
or enzymatic studies, mostly from pathogenic bacteria, including
streptopain from ''
Streptococcus pyogenes
''Streptococcus pyogenes'' is a species of Gram-positive, aerotolerant bacteria in the genus ''Streptococcus''. These bacteria are extracellular, and made up of non-motile and non-sporing cocci (round cells) that tend to link in chains. They are ...
'';
xylellain, from the plant pathogen ''
Xylella fastidiosa
''Xylella fastidiosa'' is an aerobic, Gram-negative bacterium, Gram-negative bacterium of the genus ''Xylella''. It is a plant pathogen, that grows in the water transport tissues of plants (Xylem, xylem vessels) and is transmitted exclusively by ...
'';
Cwp84 from ''
Clostridium difficile'';
and
Lpg2622 from ''
Legionella pneumophila
''Legionella pneumophila'' is a thin, aerobic, pleomorphic, flagellated, non-spore-forming, Gram-negative bacterium of the genus ''Legionella''. ''L. pneumophila'' is the primary human pathogenic bacterium in this group and is the causative age ...
''.
Viruses
The papain-like protease family includes a number of
protein domain
In molecular biology, a protein domain is a region of a protein's polypeptide chain that is self-stabilizing and that folds independently from the rest. Each domain forms a compact folded three-dimensional structure. Many proteins consist of s ...
s that are found in large
polyprotein
Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Uncatalysed, the hydrolysis of peptide bonds is extremely slow, taking hundreds of years. Proteolysis is typically catalysed by cellular enzymes called protease ...
s expressed by
RNA virus
An RNA virus is a virusother than a retrovirusthat has ribonucleic acid (RNA) as its genetic material. The nucleic acid is usually single-stranded RNA ( ssRNA) but it may be double-stranded (dsRNA). Notable human diseases caused by RNA viruses ...
es.
Among the best studied viral PLPs are
nidoviral papain-like protease domains from
nidovirus
''Nidovirales'' is an order of enveloped, positive-strand RNA viruses which infect vertebrates and invertebrates. Host organisms include mammals, birds, reptiles, amphibians, fish, arthropods, molluscs, and helminths. The order includes the f ...
es, particularly those from
coronavirus
Coronaviruses are a group of related RNA viruses that cause diseases in mammals and birds. In humans and birds, they cause respiratory tract infections that can range from mild to lethal. Mild illnesses in humans include some cases of the com ...
es. These PLPs are responsible for several cleavage events that process a large polyprotein into
viral nonstructural protein In virology, a nonstructural protein is a protein encoded by a virus but that is not part of the viral particle. They typically include the various enzymes and transcription factors the virus uses to replicate itself, such as a viral protease ( 3CL ...
s, although they perform fewer cleavages than the
3C-like protease (also known as the main protease).
Coronavirus PLPs are multifunctional enzymes that can also act as
deubiquitinases (cleaving the
isopeptide bond to
ubiquitin
Ubiquitin is a small (8.6 kDa) regulatory protein found in most tissues of eukaryotic organisms, i.e., it is found ''ubiquitously''. It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Fo ...
) and "deISGylating enzymes" with analogous activity against the
ubiquitin-like protein
Ubiquitin-like proteins (UBLs) are a family of small proteins involved in post-translational modification of other proteins in a cell, usually with a regulatory function. The UBL protein family derives its name from the first member of the class ...
ISG15
Interferon-stimulated gene 15 (ISG15) is a 17 kDA secreted protein that in humans is encoded by the ''ISG15'' gene. ISG15 is induced by type I interferon (IFN) and serves many functions, acting both as an extracellular cytokine and an intracellul ...
.
In human pathogens including
SARS-CoV
Severe acute respiratory syndrome coronavirus 1 (SARS-CoV-1; or Severe acute respiratory syndrome coronavirus, SARS-CoV) is a strain of coronavirus that causes severe acute respiratory syndrome (SARS), the respiratory illness responsible for ...
,
MERS-CoV
''Middle East respiratory syndrome–related coronavirus'' (''MERS-CoV''), or EMC/2012 ( HCoV-EMC/2012), is the virus that causes Middle East respiratory syndrome (MERS). It is a species of coronavirus which infects humans, bats, and camels. Th ...
, and
SARS-CoV-2
Severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) is a strain of coronavirus that causes COVID-19 (coronavirus disease 2019), the respiratory illness responsible for the ongoing COVID-19 pandemic. The virus previously had a ...
, the PLP domain is
essential for
viral replication
Viral replication is the formation of biological viruses during the infection process in the target host cells. Viruses must first get into the cell before viral replication can occur. Through the generation of abundant copies of its genome an ...
and is therefore considered a
drug target
A biological target is anything within a living organism to which some other entity (like an endogenous ligand or a drug) is directed and/or binds, resulting in a change in its behavior or function. Examples of common classes of biological targets ...
for the development of
antiviral drug
Antiviral drugs are a class of medication used for treating viral infections. Most antivirals target specific viruses, while a broad-spectrum antiviral is effective against a wide range of viruses. Unlike most antibiotics, antiviral drugs do n ...
s.
References
{{reflist, 30em
Proteases