O-glycan Biosynthesis
   HOME

TheInfoList



OR:

''O''-linked glycosylation is the attachment of a
sugar Sugar is the generic name for sweet-tasting, soluble carbohydrates, many of which are used in food. Simple sugars, also called monosaccharides, include glucose, fructose, and galactose. Compound sugars, also called disaccharides or double ...
molecule to the
oxygen Oxygen is the chemical element with the symbol O and atomic number 8. It is a member of the chalcogen group in the periodic table, a highly reactive nonmetal, and an oxidizing agent that readily forms oxides with most elements as wel ...
atom of
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
(Ser) or
threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), a carboxyl group (which is in the deprotonated −COO ...
(Thr) residues in a protein. ''O''-glycosylation is a
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosome ...
that occurs after the protein has been synthesised. In
eukaryote Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
s, it occurs in the
endoplasmic reticulum The endoplasmic reticulum (ER) is, in essence, the transportation system of the eukaryotic cell, and has many other important functions such as protein folding. It is a type of organelle made up of two subunits – rough endoplasmic reticulum ( ...
,
Golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic cells. Part of the endomembrane system in the cytoplasm, it packages proteins into membrane-bound vesicles ins ...
and occasionally in the
cytoplasm In cell biology, the cytoplasm is all of the material within a eukaryotic cell, enclosed by the cell membrane, except for the cell nucleus. The material inside the nucleus and contained within the nuclear membrane is termed the nucleoplasm. The ...
; in
prokaryote A prokaryote () is a single-celled organism that lacks a nucleus and other membrane-bound organelles. The word ''prokaryote'' comes from the Greek πρό (, 'before') and κάρυον (, 'nut' or 'kernel').Campbell, N. "Biology:Concepts & Connec ...
s, it occurs in the cytoplasm. Several different sugars can be added to the serine or threonine, and they affect the protein in different ways by changing protein stability and regulating protein activity. O-glycans, which are the sugars added to the serine or threonine, have numerous functions throughout the body, including trafficking of cells in the immune system, allowing recognition of foreign material, controlling cell
metabolism Metabolism (, from el, μεταβολή ''metabolē'', "change") is the set of life-sustaining chemical reactions in organisms. The three main functions of metabolism are: the conversion of the energy in food to energy available to run cell ...
and providing cartilage and tendon flexibility. Because of the many functions they have, changes in O-glycosylation are important in many diseases including
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
,
diabetes Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by a high blood sugar level ( hyperglycemia) over a prolonged period of time. Symptoms often include frequent urination, increased thirst and increased ap ...
and
Alzheimer's Alzheimer's disease (AD) is a neurodegenerative disease that usually starts slowly and progressively worsens. It is the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in remembering recent events. As t ...
. O-glycosylation occurs in all domains of life, including
eukaryotes Eukaryotes () are organisms whose cells have a nucleus. All animals, plants, fungi, and many unicellular organisms, are Eukaryotes. They belong to the group of organisms Eukaryota or Eukarya, which is one of the three domains of life. Bacte ...
,
archaea Archaea ( ; singular archaeon ) is a domain of single-celled organisms. These microorganisms lack cell nuclei and are therefore prokaryotes. Archaea were initially classified as bacteria, receiving the name archaebacteria (in the Archaebac ...
and a number of
pathogenic In biology, a pathogen ( el, πάθος, "suffering", "passion" and , "producer of") in the oldest and broadest sense, is any organism or agent that can produce disease. A pathogen may also be referred to as an infectious agent, or simply a germ ...
bacteria including ''Burkholderia cenocepacia'', ''Neisseria gonorrhoeae'' and ''Acinetobacter baumannii''.


Common types of ''O''-glycosylation


''O''-''N''-acetylgalactosamine (''O''-GalNAc)

Addition of ''N''-acetylgalactosamine (GalNAc) to a serine or threonine occurs in the
Golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic cells. Part of the endomembrane system in the cytoplasm, it packages proteins into membrane-bound vesicles ins ...
, after the protein has been folded. The process is performed by
enzymes Enzymes () are proteins that act as biological catalysts by accelerating chemical reactions. The molecules upon which enzymes may act are called substrate (chemistry), substrates, and the enzyme converts the substrates into different molecule ...
known as GalNAc transferases (GALNTs), of which there are 20 different types. The initial ''O''-GalNAc structure can be modified by the addition of other sugars, or other compounds such as methyl and acetyl groups. These modifications produce 8 core structures known to date. Different cells have different enzymes that can add further sugars, known as
glycosyltransferases Glycosyltransferases (GTFs, Gtfs) are enzymes ( EC 2.4) that establish natural glycosidic linkages. They catalyze the transfer of saccharide moieties from an activated nucleotide sugar (also known as the "glycosyl donor") to a nucleophilic glycos ...
, and structures therefore change from cell to cell. Common sugars added include galactose, ''N''-acetylglucosamine, fucose and
sialic acid Sialic acids are a class of alpha-keto acid sugars with a nine-carbon backbone. The term "sialic acid" (from the Greek for saliva, - ''síalon'') was first introduced by Swedish biochemist Gunnar Blix in 1952. The most common member of this gr ...
. These sugars can also be modified by the addition of sulfates or acetyl groups.


Biosynthesis

GalNAc is added onto a serine or threonine residue from a precursor molecule, through the activity of a GalNAc transferase enzyme. This precursor is necessary so that the sugar can be transported to where it will be added to the protein. The specific residue onto which GalNAc will be attached is not defined, because there are numerous enzymes that can add the sugar and each one will favour different residues. However, there are often proline (Pro) residues near the threonine or serine. Once this initial sugar has been added, other glycosyltransferases can catalyse the addition of additional sugars. Two of the most common structures formed are Core 1 and Core 2. Core 1 is formed by the addition of a galactose sugar onto the initial GalNAc. Core 2 consists of a Core 1 structure with an additional ''N''-acetylglucosamine (GlcNAc) sugar. A poly-''N''-acetyllactosamine structure can be formed by the alternating addition of GlcNAc and galactose sugars onto the GalNAc sugar. Terminal sugars on O-glycans are important in recognition by lectins and play a key role in the immune system. Addition of fucose sugars by fucosyltransferases forms Lewis epitopes and the scaffold for blood group determinants. Addition of a fucose alone creates the H-antigen, present in people with blood type O. By adding a galactose onto this structure, the B-antigen of blood group B is created. Alternatively, adding a GalNAc sugar will create the A-antigen for blood group A.


Functions

''O''-GalNAc sugars are important in a variety of processes, including
leukocyte White blood cells, also called leukocytes or leucocytes, are the cells of the immune system that are involved in protecting the body against both infectious disease and foreign invaders. All white blood cells are produced and derived from mult ...
circulation during an immune response, fertilisation, and protection against invading microbes. ''O''-GalNAc sugars are common on membrane glycoproteins, where they help increase rigidity of the region close to the membrane so that the protein extends away from the surface. For example, the low-density lipoprotein receptor (LDL) is projected from the cell surface by a region rigidified by O-glycans. In order for leukocytes of the immune system to move into infected cells, they have to interact with these cells through receptors. Leukocytes express
ligands In coordination chemistry, a ligand is an ion or molecule (functional group) that binds to a central metal atom to form a coordination complex. The bonding with the metal generally involves formal donation of one or more of the ligand's electro ...
on their cell surface to allow this interaction to occur. P-selectin glycoprotein ligand-1 (PSGL-1) is such a ligand, and contains a lot of O-glycans that are necessary for its function. O-glycans near the membrane maintain the elongated structure and a terminal sLex epitope is necessary for interactions with the receptor. Mucins are a group of heavily O-glycosylated proteins that line the gastrointestinal and respiratory tracts to protect these regions from infection. Mucins are negatively charged, which allows them to interact with water and prevent it from evaporating. This is important in their protective function as it lubricates the tracts so bacteria cannot bind and infect the body. Changes in mucins are important in numerous diseases, including
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
and
inflammatory bowel disease Inflammatory bowel disease (IBD) is a group of inflammation, inflammatory conditions of the colon (anatomy), colon and small intestine, Crohn's disease and ulcerative colitis being the principal types. Crohn's disease affects the small intestine a ...
. Absence of O-glycans on mucin proteins changes their 3D shape dramatically and often prevents correct function.


''O''-''N''-acetylglucosamine (''O''-GlcNAc)

Addition of ''N''-acetylglucosamine (O-GlcNAc) to serine and threonine residues usually occurs on cytoplasmic and nuclear proteins that remain in the cell, compared to ''O''-GalNAc modifications which usually occur on proteins that will be secreted. O-GlcNAc modifications were only recently discovered, but the number of proteins with known O-GlcNAc modifications is increasing rapidly. It is the first example of glycosylation that does not occur on secretory proteins. ''O''-GlcNAcylation differs from other O-glycosylation processes because there are usually no sugars added onto the core structure and because the sugar can be attached or removed from a protein several times. This addition and removal occurs in cycles and is performed by two very specific enzymes. O-GlcNAc is added by
O-GlcNAc transferase Protein ''O''-GlcNAc transferase also known as OGT or O-linked N-acetylglucosaminyltransferase is an enzyme () that in humans is encoded by the ''OGT'' gene. OGT catalyzes the addition of the ''O''-GlcNAc post-translational modification to prote ...
(OGT) and removed by
O-GlcNAcase Protein ''O''-GlcNAcase (, OGA, glycoside hydrolase ''O''-GlcNAcase, ''O''-GlcNAcase, BtGH84, ''O''-GlcNAc hydrolase) is an enzyme with List of enzymes, systematic name (protein)-3-''O''-(''N''-acetyl-D-glucosaminyl)-L-serine/threonine ''N''-acet ...
(OGA). Because there are only two enzymes that affect this specific modification, they are very tightly regulated and depend on a lot of other factors. Because O-GlcNAc can be added and removed, it is known as a dynamic modification and has a lot of similarities to
phosphorylation In chemistry, phosphorylation is the attachment of a phosphate group to a molecule or an ion. This process and its inverse, dephosphorylation, are common in biology and could be driven by natural selection. Text was copied from this source, wh ...
. O-GlcNAcylation and phosphorylation can occur on the same threonine and serine residues, suggesting a complex relationship between these modifications that can affect many functions of the cell. The modification affects processes like the cells response to cellular stress, the cell cycle, protein stability and protein turnover. It may be implicated in neurodegenerative diseases like Parkinson’s and late-onset
Alzheimer’s Alzheimer's disease (AD) is a neurodegenerative disease that usually starts slowly and progressively worsens. It is the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in remembering recent events. As t ...
and has been found to play a role in
diabetes Diabetes, also known as diabetes mellitus, is a group of metabolic disorders characterized by a high blood sugar level ( hyperglycemia) over a prolonged period of time. Symptoms often include frequent urination, increased thirst and increased ap ...
. Additionally, O-GlcNAcylation can enhance the Warburg Effect, which is defined as the change that occurs in the
metabolism Metabolism (, from el, μεταβολή ''metabolē'', "change") is the set of life-sustaining chemical reactions in organisms. The three main functions of metabolism are: the conversion of the energy in food to energy available to run cell ...
of cancer cells to favour their growth. Because both O-GlcNAcylation and phosphorylation can affect specific residues and therefore both have important functions in regulating signalling pathways, both of these processes provide interesting targets for cancer therapy.


''O''-Mannose (''O''-Man)

O-mannosylation involves the transfer of a
mannose Mannose is a sugar monomer of the aldohexose series of carbohydrates. It is a C-2 epimer of glucose. Mannose is important in human metabolism, especially in the glycosylation of certain proteins. Several congenital disorders of glycosylation ...
from a dolichol-''P''-mannose donor molecule onto the serine or threonine residue of a protein. Most other O-glycosylation processes use a sugar nucleotide as a donor molecule. A further difference from other O-glycosylations is that the process is initiated in the endoplasmic reticulum of the cell, rather than the Golgi apparatus. However, further addition of sugars occurs in the Golgi. Until recently, it was believed that the process is restricted to
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately from ...
, however it occurs in all domains of life; eukaryotes, (eu)bacteria and archae(bacteri)a. The best characterised O-mannosylated human protein is α-dystroglycan. O-Man sugars separate two domains of the protein, required to connect the extracellular and intracellular regions to anchor the cell in position.
Ribitol Ribitol, or adonitol, is a crystalline pentose alcohol (C5H12O5) formed by the reduction of ribose. It occurs naturally in the plant ''Adonis vernalis'' as well as in the cell walls of some Gram-positive bacteria, in the form of ribitol phosphat ...
,
xylose Xylose ( grc, ξύλον, , "wood") is a sugar first isolated from wood, and named for it. Xylose is classified as a monosaccharide of the aldopentose type, which means that it contains five carbon atoms and includes an aldehyde functional gro ...
and
glucuronic acid Glucuronic acid (from Greek γλεῦκος "''wine, must''" and οὖρον "''urine''") is a uronic acid that was first isolated from urine (hence the name). It is found in many gums such as gum arabic (c. 18%), xanthan, and kombucha tea and ...
can be added to this structure in a complex modification that forms a long sugar chain. This is required to stabilise the interaction between α-dystroglycan and the extracellular basement membrane. Without these modifications, the glycoprotein cannot anchor the cell which leads to
congenital muscular dystrophy Congenital muscular dystrophies are autosomal recessively-inherited muscle diseases. They are a group of heterogeneous disorders characterized by muscle weakness which is present at birth and the different changes on muscle biopsy that ranges fro ...
(CMD), characterised by severe brain malformations.


''O''-Galactose (''O''-Gal)

O-galactose is commonly found on
lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. It contains an α-amino group (which is in the protonated form under biological conditions), an α-carboxylic acid group (which is in the deprotonated −C ...
residues in
collagen Collagen () is the main structural protein in the extracellular matrix found in the body's various connective tissues. As the main component of connective tissue, it is the most abundant protein in mammals, making up from 25% to 35% of the whole ...
, which often have a hydroxyl group added to form
hydroxylysine Hydroxylysine (Hyl) is an amino acid with the molecular formula C6H14N2O3. It was first discovered in 1921 by Donald Van Slyke as the 5-hydroxylysine form. It arises from a post-translational hydroxy modification of lysine. It is most widely kno ...
. Because of this addition of an oxygen, hydroxylysine can then be modified by O-glycosylation. Addition of a galactose to the hydroxyl group is initiated in the endoplasmic reticulum, but occurs predominantly in the Golgi apparatus and only on hydroxylysine residues in a specific sequence. While this O-galactosylation is necessary for correct function in all collagens, it is especially common in collagen types IV and V. In some cases, a glucose sugar can be added to the core galactose.


O-Fucose (O-Fuc)

Addition of fucose sugars to serine and threonine residues is an unusual form of O-glycosylation that occurs in the endoplasmic reticulum and is catalysed by two fucosyltransferases. These were discovered in ''Plasmodium falciparum'' and ''Toxoplasma gondii''. Several different enzymes catalyse the elongation of the core fucose, meaning that different sugars can be added to the initial fucose on the protein. Along with O-glucosylation, O-fucosylation is mainly found on
epidermal growth factor Epidermal growth factor (EGF) is a protein that stimulates cell growth and differentiation by binding to its receptor, EGFR. Human EGF is 6-k Da and has 53 amino acid residues and three intramolecular disulfide bonds. EGF was originally descr ...
(EGF) domains found in proteins. O-fucosylation on EGF domains occurs between the second and third conserved
cysteine Cysteine (symbol Cys or C; ) is a semiessential proteinogenic amino acid with the formula . The thiol side chain in cysteine often participates in enzymatic reactions as a nucleophile. When present as a deprotonated catalytic residue, sometime ...
residues in the protein sequence. Once the core O-fucose has been added, it is often elongated by addition of GlcNAc, galactose and sialic acid. Notch is an important protein in development, with several EGF domains that are O-fucosylated. Changes in the elaboration of the core fucose determine what interactions the protein can form, and therefore which genes will be transcribed during development. O-fucosylation might also play a role in protein breakdown in the liver.


O-Glucose (O-Glc)

Similarly to O-fucosylation, O-glucosylation is an unusual O-linked modification as it occurs in the endoplasmic reticulum, catalysed by O-glucosyltransferases, and also requires a defined sequence in order to be added to the protein. O-glucose is often attached to serine residues between the first and second conserved cysteine residues of EGF domains, for example in clotting factors VII and IX. O-glucosylation also appears to be necessary for the proper folding of EGF domains in the Notch protein.


Proteoglycans

Proteoglycans consist of a protein with one or more sugar side chains, known as
glycosaminoglycans Glycosaminoglycans (GAGs) or mucopolysaccharides are long, linear polysaccharides consisting of repeating disaccharide units (i.e. two-sugar units). The repeating two-sugar unit consists of a uronic sugar and an amino sugar, except in the case ...
(GAGs), attached to the oxygen of serine and threonine residues. GAGs consist of long chains of repeating sugar units. Proteoglycans are usually found on the cell surface and in the
extracellular matrix In biology, the extracellular matrix (ECM), also called intercellular matrix, is a three-dimensional network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide stru ...
(ECM), and are important for the strength and flexibility of cartilage and tendons. Absence of proteoglycans is associated with heart and respiratory failure, defects in skeletal development and increased tumor metastasis. Different types of proteoglycans exist, depending on the sugar that is linked to the oxygen atom of the residue in the protein. For example, the GAG
heparan sulphate Heparan sulfate (HS) is a linear polysaccharide found in all animal tissues. It occurs as a proteoglycan (HSPG, i.e. Heparan Sulfate ProteoGlycan) in which two or three HS chains are attached in close proximity to cell surface or extracellular ...
is attached to a protein serine residue through a
xylose Xylose ( grc, ξύλον, , "wood") is a sugar first isolated from wood, and named for it. Xylose is classified as a monosaccharide of the aldopentose type, which means that it contains five carbon atoms and includes an aldehyde functional gro ...
sugar. The structure is extended with several ''N''-acetyllactosamine repeating sugar units added onto the xylose. This process is unusual and requires specific xylosyltransferases.
Keratan sulphate Keratan sulfate (KS), also called keratosulfate, is any of several sulfated glycosaminoglycans (structural carbohydrates) that have been found especially in the cornea, cartilage, and bone. It is also synthesized in the central nervous system ...
attaches to a serine or threonine residue through GalNAc, and is extended with two galactose sugars, followed by repeating units of glucuronic acid (GlcA) and GlcNAc. Type II keratan sulphate is especially common in cartilage.


Lipids

Galactose or glucose sugars can be attached to a hydroxyl group of ceramide lipids in a different form of O-glycosylation, as it does not occur on proteins. This forms
glycosphingolipids Glycosphingolipids are a subtype of glycolipids containing the amino alcohol sphingosine. They may be considered as sphingolipids with an attached carbohydrate. Glycosphingolipids are a group of lipids (more specifically, sphingolipids) and are a p ...
, which are important for the localisation of receptors in membranes. Incorrect breakdown of these lipids leads to a group of diseases known as
sphingolipidoses Sphingolipidoses are a class of lipid storage disorders or degenerative storage disorders caused by deficiency of an enzyme that is required for the catabolism of lipids that contain ceramide,Lynn, D. Joanne, Newton, Herbert B. and Rae-Grant, Alex ...
, which are often characterised by neurodegeneration and developmental disabilities. Because both galactose and glucose sugars can be added to the ceramide lipid, we have two groups of glycosphingolipids. Galactosphingolipids are generally very simple in structure and the core galactose is not usually modified. Glucosphingolipids, however, are often modified and can become a lot more complex. Biosynthesis of galacto- and glucosphingolipids occurs differently. Glucose is added onto ceramide from its precursor in the endoplasmic reticulum, before further modifications occur in the Golgi apparatus. Galactose, on the other hand, is added to ceramide already in the Golgi apparatus, where the galactosphingolipid formed is often sulfated by addition of sulfate groups.


Glycogenin

One of the first and only examples of O-glycosylation on
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
, rather than on serine or threonine residues, is the addition of glucose to a tyrosine residue in glycogenin. Glycogenin is a glycosyltransferase that initiates the conversion of glucose to glycogen, present in muscle and liver cells.


Clinical significance

All forms of O-glycosylation are abundant throughout the body and play important roles in many cellular functions. Lewis epitopes are important in determining blood groups, and allow the generation of an immune response if we detect foreign organs. Understanding them is important in
organ transplants Organ transplantation is a medical procedure in which an organ is removed from one body and placed in the body of a recipient, to replace a damaged or missing organ. The donor and recipient may be at the same location, or organs may be transpo ...
. Hinge regions of immunoglobulins contain highly O-glycosylated regions between individual domains to maintain their structure, allow interactions with foreign antigens and protect the region from proteolytic cleavage.
Alzheimer’s Alzheimer's disease (AD) is a neurodegenerative disease that usually starts slowly and progressively worsens. It is the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in remembering recent events. As t ...
may be affected by O-glycosylation. Tau, the protein that accumulates to cause neurodegeneration in Alzheimer’s, contains O-GlcNAc modifications which may be implicated in disease progression. Changes in O-glycosylation are extremely common in
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
. O-glycan structures, and especially the terminal Lewis epitopes, are important in allowing tumor cells to invade new tissues during metastasis. Understanding these changes in O-glycosylation of cancer cells can lead to new diagnostic approaches and therapeutic opportunities.


See also

*
Glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not al ...
* ''N''-linked glycosylation


References

{{Metabolism


External links


GlycoEP
In silico Platform for Prediction of ''N''-, O- and ''C''-Glycosites in Eukaryotic Protein Sequences Post-translational modification