
In
molecular biology
Molecular biology is a branch of biology that seeks to understand the molecule, molecular basis of biological activity in and between Cell (biology), cells, including biomolecule, biomolecular synthesis, modification, mechanisms, and interactio ...
, protein fold classes are broad categories of
protein tertiary structure
Protein tertiary structure is the three-dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains and the b ...
topology. They describe groups of
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s that share similar
amino acid
Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
and
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
proportions. Each class contains multiple, independent
protein superfamilies
A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology (biology), homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if n ...
(i.e. are not necessarily
evolutionarily related to one another).
Generally recognised classes
Four large classes of protein that are generally agreed upon by the two main structure classification databases (
SCOP
A ( or ) was a poet as represented in Old English poetry. The scop is the Old English counterpart of the Old Norse ', with the important difference that "skald" was applied to historical persons, and scop is used, for the most part, to designat ...
and
CATH).
all-α
All-α proteins are a class of
structural domain
In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s in which the
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
is composed entirely of
α-helices, with the possible exception of a few isolated
β-sheets on the periphery.
Common examples include the
bromodomain, the
globin fold and the
homeobox
A homeobox is a Nucleic acid sequence, DNA sequence, around 180 base pairs long, that regulates large-scale anatomical features in the early stages of embryonic development. Mutations in a homeobox may change large-scale anatomical features of ...
.
all-β
All-β proteins are a class of
structural domain
In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s in which the
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
is composed entirely of
β-sheets, with the possible exception of a few isolated
α-helices on the periphery.
Common examples include the
SH3 domain
The SRC Homology 3 Domain (or SH3 domain) is a small protein domain of about 60 amino acid residues. Initially, SH3 was described as a conserved sequence in the viral adaptor protein v-Crk. This domain is also present in the molecules of ph ...
, the
beta-propeller domain, the
immunoglobulin fold and
B3 DNA binding domain.
α+β
α+β proteins are a class of
structural domain
In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s in which the
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
is composed of
α-helices and
β-strands that occur separately along the
backbone. The
β-strands are therefore mostly ''antiparallel''.
Common examples include the
ferredoxin fold,
ribonuclease A, and the
SH2 domain.
α/β
α/β proteins are a class of
structural domain
In molecular biology, a protein domain is a region of a protein's Peptide, polypeptide chain that is self-stabilizing and that Protein folding, folds independently from the rest. Each domain forms a compact folded Protein tertiary structure, thre ...
s in which the
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
is composed of alternating
α-helices and
β-strands along the backbone. The
β-strands are therefore mostly ''parallel''.
Common examples include the
flavodoxin fold, the
TIM barrel and leucine-rich-repeat (LRR) proteins such as
ribonuclease inhibitor.
Additional classes
Membrane proteins
Membrane protein
Membrane proteins are common proteins that are part of, or interact with, biological membranes. Membrane proteins fall into several broad categories depending on their location. Integral membrane proteins are a permanent part of a cell membrane ...
s interact with
biological membrane
A biological membrane, biomembrane or cell membrane is a selectively permeable membrane that separates the interior of a cell from the external environment or creates intracellular compartments by serving as a boundary between one part of th ...
s either by inserting into it, or being tethered via a covalently attached lipid. They are one of the common types of protein along with soluble
globular proteins,
fibrous proteins, and
disordered proteins.
They are targets of over 50% of all modern medicinal drugs.
It is estimated that 20–30% of all
gene
In biology, the word gene has two meanings. The Mendelian gene is a basic unit of heredity. The molecular gene is a sequence of nucleotides in DNA that is transcribed to produce a functional RNA. There are two types of molecular genes: protei ...
s in most
genome
A genome is all the genetic information of an organism. It consists of nucleotide sequences of DNA (or RNA in RNA viruses). The nuclear genome includes protein-coding genes and non-coding genes, other functional regions of the genome such as ...
s encode membrane proteins.
Intrinsically disordered proteins
Intrinsically disordered proteins lack a fixed or ordered
three-dimensional structure.
IDPs cover a spectrum of states from fully unstructured to partially structured and include
random coil
In polymer chemistry, a random coil is a conformation of polymers where the monomer subunits are oriented randomly while still being bonded to adjacent units. It is not one specific shape, but a statistical distribution of shapes for all the cha ...
s, (pre-)
molten globules, and large multi-domain proteins connected by flexible linkers. They constitute one of the main types of protein (alongside
globular,
fibrous and
membrane protein
Membrane proteins are common proteins that are part of, or interact with, biological membranes. Membrane proteins fall into several broad categories depending on their location. Integral membrane proteins are a permanent part of a cell membrane ...
s).
Coiled coil proteins
Coiled coil proteins form long, insoluble
fibers
Fiber (spelled fibre in British English; from ) is a natural or artificial substance that is significantly longer than it is wide. Fibers are often used in the manufacture of other materials. The strongest engineering materials often inco ...
involved in the
extracellular matrix
In biology, the extracellular matrix (ECM), also called intercellular matrix (ICM), is a network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide structural and bio ...
. There are many
scleroprotein superfamilies including
keratin
Keratin () is one of a family of structural fibrous proteins also known as ''scleroproteins''. It is the key structural material making up Scale (anatomy), scales, hair, Nail (anatomy), nails, feathers, horn (anatomy), horns, claws, Hoof, hoove ...
,
collagen
Collagen () is the main structural protein in the extracellular matrix of the connective tissues of many animals. It is the most abundant protein in mammals, making up 25% to 35% of protein content. Amino acids are bound together to form a trip ...
,
elastin
Elastin is a protein encoded by the ''ELN'' gene in humans and several other animals. Elastin is a key component in the extracellular matrix of gnathostomes (jawed vertebrates). It is highly Elasticity (physics), elastic and present in connective ...
, and
fibroin
Fibroin is an insoluble protein present in silk produced by numerous insects, such as the larvae of ''Bombyx mori'', and other moth genera such as ''Antheraea'', ''Cricula trifenestrata, Cricula'', ''Samia (moth), Samia'' and ''Gonometa''. Sil ...
. The roles of such proteins include protection and support, forming
connective tissue
Connective tissue is one of the four primary types of animal tissue, a group of cells that are similar in structure, along with epithelial tissue, muscle tissue, and nervous tissue. It develops mostly from the mesenchyme, derived from the mesod ...
,
tendon
A tendon or sinew is a tough band of fibrous connective tissue, dense fibrous connective tissue that connects skeletal muscle, muscle to bone. It sends the mechanical forces of muscle contraction to the skeletal system, while withstanding tensi ...
s,
bone matrices, and
muscle fiber
A muscle cell, also known as a myocyte, is a mature contractile cell in the muscle of an animal. In humans and other vertebrates there are three types: skeletal, smooth, and cardiac (cardiomyocytes). A skeletal muscle cell is long and threadl ...
.
Small proteins
Small proteins typically have a tertiary structure that is maintained by
disulphide bridges (
cysteine-rich proteins),
metal ligands (
metal-binding proteins), and or
cofactors such as
heme
Heme (American English), or haem (Commonwealth English, both pronounced /Help:IPA/English, hi:m/ ), is a ring-shaped iron-containing molecule that commonly serves as a Ligand (biochemistry), ligand of various proteins, more notably as a Prostheti ...
.
Designed proteins
Numerous protein structures are the result of
rational design
In chemical biology and biomolecular engineering, rational design (RD) is an umbrella term which invites the strategy of creating new molecules with a certain functionality, based upon the ability to predict how the molecule's structure (specific ...
and do not exist in nature. Proteins can be designed from scratch (''de novo'' design) or by making calculated variations on a known protein structure and its sequence (known as ''protein redesign''). Rational protein design approaches make protein-sequence predictions that will fold to specific structures. These predicted sequences can then be validated experimentally through methods such as
peptide synthesis,
site-directed mutagenesis, or
Artificial gene synthesis.
See also
*
Protein superfamily
A protein superfamily is the largest grouping (clade) of proteins for which common ancestry can be inferred (see homology (biology), homology). Usually this common ancestry is inferred from structural alignment and mechanistic similarity, even if n ...
*
SCOP
A ( or ) was a poet as represented in Old English poetry. The scop is the Old English counterpart of the Old Norse ', with the important difference that "skald" was applied to historical persons, and scop is used, for the most part, to designat ...
database
*
CATH database
*
FSSP database
References
{{reflist
Protein folds