Glycoprotein Hormones, Alpha Polypeptide
   HOME

TheInfoList



OR:

Glycoproteins are
protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
s which contain
oligosaccharide An oligosaccharide (; ) is a carbohydrate, saccharide polymer containing a small number (typically three to ten) of monosaccharides (simple sugars). Oligosaccharides can have many functions including Cell–cell recognition, cell recognition and ce ...
(sugar) chains covalently attached to
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
side-chains. The
carbohydrate A carbohydrate () is a biomolecule composed of carbon (C), hydrogen (H), and oxygen (O) atoms. The typical hydrogen-to-oxygen atomic ratio is 2:1, analogous to that of water, and is represented by the empirical formula (where ''m'' and ''n'' ...
is attached to the protein in a cotranslational or
posttranslational modification In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA ...
. This process is known as
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not ...
. Secreted extracellular proteins are often glycosylated. In proteins that have segments extending extracellularly, the extracellular segments are also often glycosylated. Glycoproteins are also often important integral membrane proteins, where they play a role in cell–cell interactions. It is important to distinguish endoplasmic reticulum-based glycosylation of the secretory system from reversible cytosolic-nuclear glycosylation. Glycoproteins of the
cytosol The cytosol, also known as cytoplasmic matrix or groundplasm, is one of the liquids found inside cells ( intracellular fluid (ICF)). It is separated into compartments by membranes. For example, the mitochondrial matrix separates the mitochondri ...
and nucleus can be modified through the reversible addition of a single GlcNAc residue that is considered reciprocal to phosphorylation and the functions of these are likely to be an additional regulatory mechanism that controls phosphorylation-based signalling. In contrast, classical secretory glycosylation can be structurally essential. For example, inhibition of asparagine-linked, i.e. N-linked, glycosylation can prevent proper glycoprotein folding and full inhibition can be toxic to an individual cell. In contrast, perturbation of glycan processing (enzymatic removal/addition of carbohydrate residues to the glycan), which occurs in both the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
and
Golgi apparatus The Golgi apparatus (), also known as the Golgi complex, Golgi body, or simply the Golgi, is an organelle found in most eukaryotic Cell (biology), cells. Part of the endomembrane system in the cytoplasm, it protein targeting, packages proteins ...
, is dispensable for isolated cells (as evidenced by survival with glycosides inhibitors) but can lead to human disease (congenital disorders of glycosylation) and can be lethal in animal models. It is therefore likely that the fine processing of glycans is important for endogenous functionality, such as cell trafficking, but that this is likely to have been secondary to its role in host-pathogen interactions. A famous example of this latter effect is the
ABO blood group system The ABO blood group system is used to denote the presence of one, both, or neither of the A and B antigens on erythrocytes (red blood cells). For human blood transfusions, it is the most important of the 47 different blood type (or group) c ...
. Though there are different types of glycoproteins, the most common are ''N''-linked and ''O''-linked glycoproteins. These two types of glycoproteins are distinguished by structural differences that give them their names. Glycoproteins vary greatly in composition, making many different compounds such as antibodies or hormones. Due to the wide array of functions within the body, interest in glycoprotein synthesis for medical use has increased. There are now several methods to synthesize glycoproteins, including recombination and glycosylation of proteins. Glycosylation is also known to occur on nucleo
cytoplasm The cytoplasm describes all the material within a eukaryotic or prokaryotic cell, enclosed by the cell membrane, including the organelles and excluding the nucleus in eukaryotic cells. The material inside the nucleus of a eukaryotic cell a ...
ic proteins in the form of ''O''-GlcNAc.


Types of glycosylation

There are several types of glycosylation, although the first two are the most common. * In ''N''-glycosylation, sugars are attached to nitrogen, typically on the
amide In organic chemistry, an amide, also known as an organic amide or a carboxamide, is a chemical compound, compound with the general formula , where R, R', and R″ represent any group, typically organyl functional group, groups or hydrogen at ...
side-chain of asparagine. * In ''O''-glycosylation, sugars are attached to oxygen, typically on
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
or
threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
, but also on
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a conditionally essential amino acid with a polar side group. The word "tyrosine" is ...
or non-canonical amino acids such as hydroxylysine and hydroxyproline. * In ''P''-glycosylation, sugars are attached to
phosphorus Phosphorus is a chemical element; it has Chemical symbol, symbol P and atomic number 15. All elemental forms of phosphorus are highly Reactivity (chemistry), reactive and are therefore never found in nature. They can nevertheless be prepared ar ...
on a phosphoserine. * In ''C''-glycosylation, sugars are attached directly to carbon, such as in the addition of mannose to
tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α-carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromat ...
. * In ''S''-glycosylation, a beta- GlcNAc is attached to the sulfur atom of a
cysteine Cysteine (; symbol Cys or C) is a semiessential proteinogenic amino acid with the chemical formula, formula . The thiol side chain in cysteine enables the formation of Disulfide, disulfide bonds, and often participates in enzymatic reactions as ...
residue. * In glypiation, a GPI glycolipid is attached to the
C-terminus The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal tail, carboxy tail, C-terminal end, or COOH-terminus) is the end of an amino acid chain (protein Proteins are large biomolecules and macromolecules that comp ...
of a
polypeptide Peptides are short chains of amino acids linked by peptide bonds. A polypeptide is a longer, continuous, unbranched peptide chain. Polypeptides that have a molecular mass of 10,000 Da or more are called proteins. Chains of fewer than twenty ...
, serving as a membrane anchor. * In
glycation Glycation (non-enzymatic glycosylation) is the covalent bond, covalent attachment of a sugar to a protein, lipid or nucleic acid molecule. Typical sugars that participate in glycation are glucose, fructose, and their derivatives. Glycation is th ...
, also known as non-enzymatic glycosylation, sugars are covalently bonded to a protein or lipid molecule, without the controlling action of an enzyme, but through a Maillard reaction.


Monosaccharides

Monosaccharides commonly found in eukaryotic glycoproteins include: The sugar group(s) can assist in
protein folding Protein folding is the physical process by which a protein, after Protein biosynthesis, synthesis by a ribosome as a linear chain of Amino acid, amino acids, changes from an unstable random coil into a more ordered protein tertiary structure, t ...
, improve proteins' stability and are involved in cell signalling.


Structure

The critical structural element of all glycoproteins is having
oligosaccharide An oligosaccharide (; ) is a carbohydrate, saccharide polymer containing a small number (typically three to ten) of monosaccharides (simple sugars). Oligosaccharides can have many functions including Cell–cell recognition, cell recognition and ce ...
s bonded covalently to a protein. There are 10 common monosaccharides in mammalian
glycan The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate ...
s including:
glucose Glucose is a sugar with the Chemical formula#Molecular formula, molecular formula , which is often abbreviated as Glc. It is overall the most abundant monosaccharide, a subcategory of carbohydrates. It is mainly made by plants and most algae d ...
(Glc), fucose (Fuc), xylose (Xyl), mannose (Man),
galactose Galactose (, ''wikt:galacto-, galacto-'' + ''wikt:-ose#Suffix 2, -ose'', ), sometimes abbreviated Gal, is a monosaccharide sugar that is about as sweetness, sweet as glucose, and about 65% as sweet as sucrose. It is an aldohexose and a C-4 epime ...
(Gal), N- acetylglucosamine (GlcNAc),
glucuronic acid Glucuronic acid (GCA, from ) is a uronic acid that was first isolated from urine (hence the name "uronic acid"). It is found in many natural gum, gums such as gum arabic ( 18%), xanthan, and kombucha tea and is important for the metabolism of ...
(GlcA), iduronic acid (IdoA), N-acetylgalactosamine (GalNAc), sialic acid, and 5- N-acetylneuraminic acid (Neu5Ac). These glycans link themselves to specific areas of the protein
amino acid Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although over 500 amino acids exist in nature, by far the most important are the 22 α-amino acids incorporated into proteins. Only these 22 a ...
chain. The two most common linkages in glycoproteins are ''N''-linked and ''O''-linked glycoproteins. An ''N''-linked glycoprotein has glycan bonds to the nitrogen containing an asparagine amino acid within the protein sequence. An ''O''-linked glycoprotein has the sugar is bonded to an oxygen atom of a
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
or
threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
amino acid in the protein. Glycoprotein size and composition can vary largely, with carbohydrate composition ranges from 1% to 70% of the total mass of the glycoprotein. Within the cell, they appear in the blood, the
extracellular matrix In biology, the extracellular matrix (ECM), also called intercellular matrix (ICM), is a network consisting of extracellular macromolecules and minerals, such as collagen, enzymes, glycoproteins and hydroxyapatite that provide structural and bio ...
, or on the outer surface of the plasma membrane, and make up a large portion of the proteins secreted by eukaryotic cells. They are very broad in their applications and can function as a variety of chemicals from antibodies to hormones.


Glycomics

Glycomics is the study of the carbohydrate components of cells. Though not exclusive to glycoproteins, it can reveal more information about different glycoproteins and their structure. One of the purposes of this field of study is to determine which proteins are glycosylated and where in the amino acid sequence the glycosylation occurs. Historically, mass spectrometry has been used to identify the structure of glycoproteins and characterize the carbohydrate chains attached.


Examples

The unique interaction between the oligosaccharide chains have different applications. First, it aids in quality control by identifying misfolded proteins. The oligosaccharide chains also change the solubility and polarity of the proteins that they are bonded to. For example, if the oligosaccharide chains are negatively charged, with enough density around the protein, they can repulse proteolytic enzymes away from the bonded protein. The diversity in interactions lends itself to different types of glycoproteins with different structures and functions. One example of glycoproteins found in the body is
mucin Mucins () are a family of high molecular weight, heavily glycosylated proteins ( glycoconjugates) produced by epithelial tissues in most animals. Mucins' key characteristic is their ability to form gels; therefore they are a key component in ...
s, which are secreted in the mucus of the respiratory and digestive tracts. The sugars when attached to mucins give them considerable water-holding capacity and also make them resistant to
proteolysis Proteolysis is the breakdown of proteins into smaller polypeptides or amino acids. Protein degradation is a major regulatory mechanism of gene expression and contributes substantially to shaping mammalian proteomes. Uncatalysed, the hydrolysis o ...
by digestive enzymes. Glycoproteins are important for
white blood cell White blood cells (scientific name leukocytes), also called immune cells or immunocytes, are cells of the immune system that are involved in protecting the body against both infectious disease and foreign entities. White blood cells are genera ...
recognition. Examples of glycoproteins in the
immune system The immune system is a network of biological systems that protects an organism from diseases. It detects and responds to a wide variety of pathogens, from viruses to bacteria, as well as Tumor immunology, cancer cells, Parasitic worm, parasitic ...
are: * molecules such as
antibodies An antibody (Ab) or immunoglobulin (Ig) is a large, Y-shaped protein belonging to the immunoglobulin superfamily which is used by the immune system to identify and neutralize antigens such as bacteria and viruses, including those that caus ...
(immunoglobulins), which interact directly with
antigen In immunology, an antigen (Ag) is a molecule, moiety, foreign particulate matter, or an allergen, such as pollen, that can bind to a specific antibody or T-cell receptor. The presence of antigens in the body may trigger an immune response. ...
s. * molecules of the ''
major histocompatibility complex The major histocompatibility complex (MHC) is a large Locus (genetics), locus on vertebrate DNA containing a set of closely linked polymorphic genes that code for Cell (biology), cell surface proteins essential for the adaptive immune system. The ...
'' (or MHC), which are expressed on the surface of cells and interact with
T cell T cells (also known as T lymphocytes) are an important part of the immune system and play a central role in the adaptive immune response. T cells can be distinguished from other lymphocytes by the presence of a T-cell receptor (TCR) on their cell ...
s as part of the adaptive immune response. * sialyl Lewis X antigen on the surface of leukocytes. H antigen of the ABO blood compatibility antigens. Other examples of glycoproteins include: * gonadotropins (luteinizing hormone and follicle-stimulating hormone) * glycoprotein IIb/IIIa, an integrin found on
platelet Platelets or thrombocytes () are a part of blood whose function (along with the coagulation#Coagulation factors, coagulation factors) is to react to bleeding from blood vessel injury by clumping to form a thrombus, blood clot. Platelets have no ...
s that is required for normal platelet aggregation and adherence to the
endothelium The endothelium (: endothelia) is a single layer of squamous endothelial cells that line the interior surface of blood vessels and lymphatic vessels. The endothelium forms an interface between circulating blood or lymph in the lumen and the r ...
. * components of the
zona pellucida The ''zona pellucida'' (Latin meaning "transparent zone") is the specialized area surrounding mammalian oocytes (eggs). It is also known as an egg coat. The ''zona pellucida'' is essential for oocyte growth and fertilization. The ''zona pelluc ...
, which surrounds the
oocyte An oocyte (, oöcyte, or ovocyte) is a female gametocyte or germ cell involved in reproduction. In other words, it is an immature ovum, or egg cell. An oocyte is produced in a female fetus in the ovary during female gametogenesis. The female ger ...
, and is important for
sperm Sperm (: sperm or sperms) is the male reproductive Cell (biology), cell, or gamete, in anisogamous forms of sexual reproduction (forms in which there is a larger, female reproductive cell and a smaller, male one). Animals produce motile sperm ...
-egg interaction. * structural glycoproteins, which occur in
connective tissue Connective tissue is one of the four primary types of animal tissue, a group of cells that are similar in structure, along with epithelial tissue, muscle tissue, and nervous tissue. It develops mostly from the mesenchyme, derived from the mesod ...
. These help bind together the fibers, cells, and ground substance of
connective tissue Connective tissue is one of the four primary types of animal tissue, a group of cells that are similar in structure, along with epithelial tissue, muscle tissue, and nervous tissue. It develops mostly from the mesenchyme, derived from the mesod ...
. They may also help components of the tissue bind to inorganic substances, such as
calcium Calcium is a chemical element; it has symbol Ca and atomic number 20. As an alkaline earth metal, calcium is a reactive metal that forms a dark oxide-nitride layer when exposed to air. Its physical and chemical properties are most similar to it ...
in
bone A bone is a rigid organ that constitutes part of the skeleton in most vertebrate animals. Bones protect the various other organs of the body, produce red and white blood cells, store minerals, provide structure and support for the body, ...
. * Glycoprotein-41 ( gp41) and glycoprotein-120 ( gp120) are HIV viral coat proteins. Soluble glycoproteins often show a high
viscosity Viscosity is a measure of a fluid's rate-dependent drag (physics), resistance to a change in shape or to movement of its neighboring portions relative to one another. For liquids, it corresponds to the informal concept of ''thickness''; for e ...
, for example, in
egg white Egg white is the clear liquid (also called the albumen or the glair/glaire) contained within an egg. In chickens, it is formed from the layers of secretions of the anterior section of the hen's oviduct during the passage of the egg. It forms a ...
and
blood plasma Blood plasma is a light Amber (color), amber-colored liquid component of blood in which blood cells are absent, but which contains Blood protein, proteins and other constituents of whole blood in Suspension (chemistry), suspension. It makes up ...
. *
Miraculin Miraculin is a taste modifier, a glycoprotein extracted from the fruit of '' Synsepalum dulcificum''. The berry, also known as the miracle fruit, was documented by explorer Chevalier des Marchais, who searched for many different fruits during ...
, is a glycoprotein extracted from '' Synsepalum dulcificum'' a
berry A berry is a small, pulpy, and often edible fruit. Typically, berries are juicy, rounded, brightly colored, sweet, sour or tart, and do not have a stone or pit although many pips or seeds may be present. Common examples of berries in the cul ...
which alters human tongue receptors to recognize sour foods as sweet. Variable surface glycoproteins allow the sleeping sickness ''Trypanosoma'' parasite to escape the immune response of the host. The viral spike of the human immunodeficiency virus is heavily glycosylated. Approximately half the mass of the spike is glycosylation and the glycans act to limit antibody recognition as the glycans are assembled by the host cell and so are largely 'self'. Over time, some patients can evolve antibodies to recognise the HIV glycans and almost all so-called 'broadly neutralising antibodies (bnAbs) recognise some glycans. This is possible mainly because the unusually high density of glycans hinders normal glycan maturation and they are therefore trapped in the premature, high-mannose, state. This provides a window for immune recognition. In addition, as these glycans are much less variable than the underlying protein, they have emerged as promising targets for vaccine design. P-glycoproteins are critical for antitumor research due to its ability block the effects of antitumor drugs. P-glycoprotein, or multidrug transporter (MDR1), is a type of ABC transporter that transports compounds out of cells. This transportation of compounds out of cells includes drugs made to be delivered to the cell, causing a decrease in drug effectiveness. Therefore, being able to inhibit this behavior would decrease P-glycoprotein interference in drug delivery, making this an important topic in drug discovery. For example, P-Glycoprotein causes a decrease in anti-cancer drug accumulation within tumor cells, limiting the effectiveness of chemotherapies used to treat cancer.


Hormones

Hormone A hormone (from the Ancient Greek, Greek participle , "setting in motion") is a class of cell signaling, signaling molecules in multicellular organisms that are sent to distant organs or tissues by complex biological processes to regulate physio ...
s that are glycoproteins include: *
Follicle-stimulating hormone Follicle-stimulating hormone (FSH) is a gonadotropin, a glycoprotein polypeptide hormone. FSH is synthesized and secreted by the gonadotropic cells of the anterior pituitary gland and regulates the development, growth, puberty, pubertal maturat ...
* Luteinizing hormone * Thyroid-stimulating hormone * Human chorionic gonadotropin * Alpha-fetoprotein * Erythropoietin (EPO)


Distinction between glycoproteins and proteoglycans


Functions


Analysis

A variety of methods used in detection, purification, and structural analysis of glycoproteins are


Synthesis

The glycosylation of proteins has an array of different applications from influencing cell to cell communication to changing the thermal stability and the folding of proteins. Due to the unique abilities of glycoproteins, they can be used in many therapies. By understanding glycoproteins and their synthesis, they can be made to treat cancer,
Crohn's Disease Crohn's disease is a type of inflammatory bowel disease (IBD) that may affect any segment of the gastrointestinal tract. Symptoms often include abdominal pain, diarrhea, fever, abdominal distension, and weight loss. Complications outside of the ...
, high cholesterol, and more. The process of glycosylation (binding a carbohydrate to a protein) is a
post-translational modification In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translation (biolog ...
, meaning it happens after the production of the protein. Glycosylation is a process that roughly half of all human proteins undergo and heavily influences the properties and functions of the protein. Within the cell, glycosylation occurs in the
endoplasmic reticulum The endoplasmic reticulum (ER) is a part of a transportation system of the eukaryote, eukaryotic cell, and has many other important functions such as protein folding. The word endoplasmic means "within the cytoplasm", and reticulum is Latin for ...
.


Recombination

There are several techniques for the assembly of glycoproteins. One technique utilizes recombination. The first consideration for this method is the choice of host, as there are many different factors that can influence the success of glycoprotein recombination such as cost, the host environment, the efficacy of the process, and other considerations. Some examples of host cells include E. coli, yeast, plant cells, insect cells, and mammalian cells. Of these options, mammalian cells are the most common because their use does not face the same challenges that other host cells do such as different glycan structures, shorter half life, and potential unwanted immune responses in humans. Of mammalian cells, the most common cell line used for recombinant glycoprotein production is the Chinese hamster ovary line. However, as technologies develop, the most promising cell lines for recombinant glycoprotein production are human cell lines.


Glycosylation

The formation of the link between the glycan and the protein is key element of the synthesis of glycoproteins. The most common method of glycosylation of N-linked glycoproteins is through the reaction between a protected glycan and a protected Asparagine. Similarly, an O-linked glycoprotein can be formed through the addition of a glycosyl donor with a protected
Serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α- amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − ...
or
Threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form when dissolved in water), a carboxyl group (which is in the deprotonated −COO− ...
. These two methods are examples of natural linkage. However, there are also methods of unnatural linkages. Some methods include ligation and a reaction between a serine-derived sulfamidate and thiohexoses in water. Once this linkage is complete, the amino acid sequence can be expanded upon using solid-phase peptide synthesis.


See also

* Ero1 *
Female sperm storage Female sperm storage is a biological process and often a type of sexual selection in which spermatozoa, sperm cells transferred to a female during mating are temporarily retained within a specific part of the reproductive tract before the oocyte, ...
*
Glycocalyx The glycocalyx (: glycocalyces or glycocalyxes), also known as the pericellular matrix and cell coat, is a layer of glycoproteins and glycolipids which surround the cell membranes of bacteria, epithelial cells, and other cells. Animal epithe ...
* Glycome * Glycopeptide * Gp120 * Gp41 *
Miraculin Miraculin is a taste modifier, a glycoprotein extracted from the fruit of '' Synsepalum dulcificum''. The berry, also known as the miracle fruit, was documented by explorer Chevalier des Marchais, who searched for many different fruits during ...
* P-glycoprotein *
Proteoglycan Proteoglycans are proteins that are heavily glycosylated. The basic proteoglycan unit consists of a "core protein" with one or more covalently attached glycosaminoglycan (GAG) chain(s). The point of attachment is a serine (Ser) residue to w ...
* Ribophorin *
Glycan The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate ...
*
Protein Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residue (biochemistry), residues. Proteins perform a vast array of functions within organisms, including Enzyme catalysis, catalysing metab ...
* Monosaccharides


Notes and references


Further reading

* *


External links

* * * * * {{Portal bar, Biology Carbohydrate chemistry