
In
molecular biology
Molecular biology is a branch of biology that seeks to understand the molecule, molecular basis of biological activity in and between Cell (biology), cells, including biomolecule, biomolecular synthesis, modification, mechanisms, and interactio ...
, protein aggregation is a phenomenon in which
intrinsically-disordered or mis-folded
protein
Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s aggregate (i.e., accumulate and clump together) either intra- or extracellularly.
Protein aggregates have been implicated in a wide variety of diseases known as
amyloidoses, including
ALS
Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or—in the United States—Lou Gehrig's disease (LGD), is a rare, terminal neurodegenerative disorder that results in the progressive loss of both upper and low ...
,
Alzheimer's,
Parkinson's and
prion
A prion () is a Proteinopathy, misfolded protein that induces misfolding in normal variants of the same protein, leading to cellular death. Prions are responsible for prion diseases, known as transmissible spongiform encephalopathy (TSEs), w ...
disease.
After synthesis, proteins typically
fold into a particular
three-dimensional conformation that is the most
thermodynamically favorable: their
native state
In biochemistry, the native state of a protein or nucleic acid is its properly Protein folding, folded and/or assembled form, which is operative and functional. The native state of a biomolecule may possess all four levels of biomolecular structu ...
. This folding process is driven by the
hydrophobic effect
The hydrophobic effect is the observed tendency of nonpolar substances to aggregate in an aqueous solution and to be excluded by water. The word hydrophobic literally means "water-fearing", and it describes the segregation of water and nonpola ...
: a tendency for
hydrophobic
In chemistry, hydrophobicity is the chemical property of a molecule (called a hydrophobe) that is seemingly repelled from a mass of water. In contrast, hydrophiles are attracted to water.
Hydrophobic molecules tend to be nonpolar and, thu ...
(water-fearing) portions of the protein to shield themselves from the
hydrophilic
A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press.
In contrast, hydrophobes are n ...
(water-loving) environment of the cell by burying into the interior of the protein. Thus, the exterior of a protein is typically hydrophilic, whereas the interior is typically hydrophobic.
Protein structures are stabilized by
non-covalent interactions
In chemistry, a non-covalent interaction differs from a covalent bond in that it does not involve the sharing of electrons, but rather involves more dispersed variations of electromagnetic interactions between molecules or within a molecule. The ...
and
disulfide bonds
In chemistry, a disulfide (or disulphide in British English) is a compound containing a functional group or the anion. The linkage is also called an SS-bond or sometimes a disulfide bridge and usually derived from two thiol groups.
In in ...
between two
cysteine
Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
residues. The non-covalent interactions include
ionic interactions and weak
van der Waals interactions. Ionic interactions form between an anion and a cation and form
salt bridge
In electrochemistry, a salt bridge or ion bridge is an essential laboratory device discovered over 100 years ago. It contains an electrolyte solution, typically an inert solution, used to connect the Redox, oxidation and reduction Half cell, ...
s that help stabilize the protein. Van der Waals interactions include
nonpolar
In chemistry, polarity is a separation of electric charge leading to a molecule or its chemical groups having an electric dipole moment, with a negatively charged end and a positively charged end.
Polar molecules must contain one or more polar ...
interactions (i.e.
London dispersion force
London dispersion forces (LDF, also known as dispersion forces, London forces, instantaneous dipole–induced dipole forces, fluctuating induced dipole bonds or loosely as van der Waals forces) are a type of intermolecular force acting between at ...
) and polar interactions (i.e.
hydrogen bond
In chemistry, a hydrogen bond (H-bond) is a specific type of molecular interaction that exhibits partial covalent character and cannot be described as a purely electrostatic force. It occurs when a hydrogen (H) atom, Covalent bond, covalently b ...
s,
dipole-dipole bond). These play an important role in a protein's
secondary structure
Protein secondary structure is the local spatial conformation of the polypeptide backbone excluding the side chains. The two most common Protein structure#Secondary structure, secondary structural elements are alpha helix, alpha helices and beta ...
, such as forming an
alpha helix
An alpha helix (or α-helix) is a sequence of amino acids in a protein that are twisted into a coil (a helix).
The alpha helix is the most common structural arrangement in the Protein secondary structure, secondary structure of proteins. It is al ...
or a
beta sheet
The beta sheet (β-sheet, also β-pleated sheet) is a common motif of the regular protein secondary structure. Beta sheets consist of beta strands (β-strands) connected laterally by at least two or three backbone hydrogen bonds, forming a gene ...
, and tertiary structure. Interactions between amino acid residues in a specific protein are very important in that protein's final structure.
When there are changes in the non-covalent interactions, as may happen with a change in the amino acid sequence, the protein is susceptible to misfolding or unfolding. In these cases, if the cell does not assist the protein in re-folding, or
degrade the unfolded protein, the unfolded/misfolded protein may aggregate, in which the exposed hydrophobic portions of the protein may interact with the exposed hydrophobic patches of other proteins.
There are three main types of protein aggregates that may form: amorphous aggregates,
oligomer
In chemistry and biochemistry, an oligomer () is a molecule that consists of a few repeating units which could be derived, actually or conceptually, from smaller molecules, monomers.Quote: ''Oligomer molecule: A molecule of intermediate relativ ...
s, and
amyloid
Amyloids are aggregates of proteins characterised by a fibrillar morphology of typically 7–13 nm in diameter, a β-sheet secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the human ...
fibrils.
Causes
Protein aggregation can occur due to a variety of causes. There are four classes that these causes can be categorized into, which are detailed below.
Mutations
Mutation
In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, ...
s that occur in the DNA sequence may or may not affect the amino acid sequence of the protein. When the sequence is affected, a different amino acid may change the interactions between the side chains that affect the folding of the protein. This can lead to exposed hydrophobic regions of the protein that aggregate with the same misfolded/unfolded protein or a different protein.
In addition to mutations in the affected proteins themselves, protein aggregation could also be caused indirectly through mutations in proteins in regulatory pathways such as the refolding pathway (molecular
chaperones) or the
ubiquitin-proteasome pathway (ubiquitin ligases).
Chaperones help with protein refolding by providing a safe environment for the protein to fold. Ubiquitin ligases target proteins for degradation through ubiquitin modification.
Problems with protein synthesis
Protein aggregation can be caused by problems that occur during
transcription or
translation
Translation is the communication of the semantics, meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The English la ...
. During transcription, DNA is copied into mRNA, forming a strand of pre-mRNA that undergoes
RNA processing to form mRNA.
During translation,
ribosome
Ribosomes () are molecular machine, macromolecular machines, found within all cell (biology), cells, that perform Translation (biology), biological protein synthesis (messenger RNA translation). Ribosomes link amino acids together in the order s ...
s and
tRNA
Transfer ribonucleic acid (tRNA), formerly referred to as soluble ribonucleic acid (sRNA), is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes). In a cell, it provides the physical link between the gene ...
help translate the mRNA sequence into an amino acid sequence.
If problems arise during either step, making an incorrect mRNA strand and/or an incorrect amino acid sequence, this can cause the protein to misfold, leading to protein aggregation.
Environmental stresses
Environmental stresses such as extreme temperatures and pH or
oxidative stress
Oxidative stress reflects an imbalance between the systemic manifestation of reactive oxygen species and a biological system's ability to readily detoxify the reactive intermediates or to repair the resulting damage. Disturbances in the normal ...
can also lead to protein aggregation.
One such disease is
cryoglobulinemia.
Extreme temperatures can weaken and destabilize the non-covalent interactions between the amino acid residues. pHs outside of the protein's pH range can change the protonation state of the amino acids, which can increase or decrease the non-covalent interactions. This can also lead to less stable interactions and result in protein unfolding.
Oxidative stress can be caused by radicals such as
reactive oxygen species
In chemistry and biology, reactive oxygen species (ROS) are highly Reactivity (chemistry), reactive chemicals formed from diatomic oxygen (), water, and hydrogen peroxide. Some prominent ROS are hydroperoxide (H2O2), superoxide (O2−), hydroxyl ...
(ROS). These unstable radicals can attack the amino acid residues, leading to oxidation of side chains (e.g.
aromatic
In organic chemistry, aromaticity is a chemical property describing the way in which a conjugated system, conjugated ring of unsaturated bonds, lone pairs, or empty orbitals exhibits a stabilization stronger than would be expected from conjugati ...
side chains,
methionine
Methionine (symbol Met or M) () is an essential amino acid in humans.
As the precursor of other non-essential amino acids such as cysteine and taurine, versatile compounds such as SAM-e, and the important antioxidant glutathione, methionine play ...
side chains) and/or cleavage of the polypeptide bonds. This can affect the non-covalent interactions that hold the protein together correctly, which can cause protein destabilization, and may cause the protein to unfold.
Aging
Cells have mechanisms that can refold or degrade protein aggregates. However, as cells age, these control mechanisms are weakened and the cell is less able to resolve the aggregates.
The hypothesis that protein aggregation is a causative process in aging is testable now since some models of delayed aging are in hand. If the development of protein aggregates was an aging independent process, slowing down aging will show no
effect on the rate of
proteotoxicity over time. However, if aging is associated
with decline in the activity of protective mechanisms against proteotoxicity,
the slow aging models would show reduced aggregation and proteotoxicity. To
address this problem several toxicity assays have been done in ''C. elegans''.
These studies indicated that reducing the activity of insulin/IGF signaling
(IIS), a prominent aging regulatory pathway protects from
neurodegeneration-linked toxic protein aggregation. The validity of this approach
has been tested and confirmed in mammals as reducing the activity of the IGF-1
signaling pathway protected Alzheimer's model mice from the behavioral and
biochemical impairments associated with the disease.
Aggregate localization
Several studies have shown that cellular responses to protein aggregation are well-regulated and organized. Protein aggregates localize to specific areas in the cell, and research has been done on these localizations in prokaryotes (E.coli) and eukaryotes (yeast, mammalian cells). From the macroscopic point of view,
positron emission tomography
Positron emission tomography (PET) is a functional imaging technique that uses radioactive substances known as radiotracers to visualize and measure changes in metabolic processes, and in other physiological activities including blood flow, r ...
tracers are used for certain misfolded proitein. Recently, a team of researchers led by Dr. Alessandro Crimi has proposed a
machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
method to predict future deposition in the brain.
Bacteria
The aggregates in bacteria asymmetrically end up at one of the poles of the cell, the "older pole." After the cell divides, the daughter cells with the older pole gets the protein aggregate and grows more slowly than daughter cells without the aggregate. This provides a natural selection mechanism for reducing protein aggregates in the bacterial population.
Yeast

Most of the protein aggregates in yeast cells get refolded by molecular chaperones. However, some aggregates, such as the oxidatively damaged proteins or the proteins marked for degradation, cannot be refolded. Rather, there are two compartments that they can end up in. Protein aggregates can be localized at the Juxtanuclear quality-control compartment (
JUNQ), which is near the nuclear membrane, or at the Insoluble Protein deposit (
IPOD
The iPod is a series of portable media players and multi-purpose mobile devices that were designed and marketed by Apple Inc. from 2001 to 2022. The iPod Classic#1st generation, first version was released on November 10, 2001, about mon ...
), near the vacuole in yeast cells.
Protein aggregates localize at JUNQ when they are ubiquitinated and targeted for degradation. The aggregated and insoluble proteins localize at IPOD as a more permanent deposition. There is evidence that the proteins here may be removed by autophagy.
These two pathways work together in that the proteins tend to come to the IPOD when the proteasome pathway is being overworked.
Mammalian cells
In mammalian cells, these protein aggregates are termed "aggresomes" and they are formed when the cell is diseased. This is because aggregates tend to form when there are
heterologous proteins present in the cell, which can arise when the cell is mutated. Different mutates of the same protein may form aggresomes of different morphologies, ranging from diffuse dispersion of soluble species to large puncta, which in turn bear different pathogenicity. The E3 ubiquitin ligase is able to recognize misfolded proteins and ubiquinate them. HDAC6 can then bind to the ubiquitin and the motor protein
dynein to bring the marked aggregates to the microtubule organizing center (
MTOC). There, they pack together into a sphere that surrounds the MTOC. They bring over chaperones and proteasomes and activate autophagy.
Elimination
There are two main protein quality control systems in the cell that are responsible for eliminating protein aggregates. Misfolded proteins can get refolded by the bi-chaperone system or degraded by the ubiquitin proteasome system or autophagy.
Refolding
The bi-chaperone system utilizes the Hsp70 (DnaK-DnaJ-GrpE in E. coli and Ssa1-Ydj1/Sis1-Sse1/Fe1 in yeast) and Hsp100 (ClpB in E. coli and Hsp104 in yeast) chaperones for protein disaggregation and refolding.
Hsp70 interacts with the protein aggregates and recruits Hsp100. Hsp70 stabilizes an activated Hsp100. Hsp100 proteins have aromatic pore loops that are used for threading activity to disentangle single polypeptides. This threading activity can be initiated at the N-terminus, C-terminus or in the middle of the polypeptide. The polypeptide gets translocated through Hsp100 in a series of steps, utilizing an ATP at each step.
The polypeptide unfolds and is then allowed to refold either by itself or with the help of heat shock proteins.
Degradation
Misfolded proteins can be eliminated through the ubiquitin-proteasome system (
UPS). This consists of an E1-E2-E3 pathway that ubiquinates proteins to mark them for degradation. In eukaryotes, the proteins get degraded by the 26S proteasome. In mammalian cells, the E3 ligase, carboxy-terminal Hsp70 interacting protein (CHIP), targets Hsp70-bound proteins. In yeast, the E3 ligases Doa10 and Hrd1 have similar functions on
endoplasmic reticulum
The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
proteins.
On the molecular level, degradation rate of aggregates vary from protein to protein due to their different internal environments, and thus different accessibility for protease molecules.

Misfolded proteins can also be eliminated through autophagy, in which the protein aggregates are delivered to the lysosome.
Toxicity
Although it has been thought that the mature protein aggregates themselves are toxic, evidence suggests that it is in fact immature protein aggregates that are most toxic.
The hydrophobic patches of these aggregates can interact with other components of the cell and damage them. The hypotheses are that the toxicity of protein aggregates is related to mechanisms of the sequestration of cellular components, the generation of reactive oxygen species and the binding to specific receptors in the membrane or through the disruption of membranes. A quantitative assay has been used to determine that higher molecular weight species are responsible for the membrane permeation.
It is known that protein aggregates in vitro can destabilize artificial
phospholipid bilayer
The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes form a continuous barrier around all cells. The cell membranes of almost all organisms and many viruses are made of a l ...
s, leading to permeabilization of the membrane.
In biomanufacturing
Protein aggregation is also a common phenomenon in the biopharmaceutical manufacturing process, which may pose risks to patients via generating adverse immune responses.
[{{cite journal , vauthors = Vázquez-Rey M, Lang DA , title = Aggregates in monoclonal antibody manufacturing processes , journal = Biotechnology and Bioengineering , volume = 108 , issue = 7 , pages = 1494–1508 , date = July 2011 , pmid = 21480193 , doi = 10.1002/bit.23155 , s2cid = 33285577 ]
See also
*
Amyloid
Amyloids are aggregates of proteins characterised by a fibrillar morphology of typically 7–13 nm in diameter, a β-sheet secondary structure (known as cross-β) and ability to be stained by particular dyes, such as Congo red. In the human ...
*
Proteopathy
*
Amyloidosis
Amyloidosis is a group of diseases in which abnormal proteins, known as amyloid fibrils, build up in tissue. There are several non-specific and vague signs and symptoms associated with amyloidosis. These include fatigue, peripheral edema, weigh ...
*
JUNQ and IPOD
*
Protein aggregation predictors
References
Alzheimer's disease
Neurodegenerative disorders
Neurological disorders
Structural proteins