HOME

TheInfoList



OR:

Glycan-Protein interactions represent a class of biomolecular interactions that occur between free or protein-bound
glycans The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate p ...
and their cognate binding partners. Intramolecular glycan-protein (protein-glycan) interactions occur between glycans and proteins that they are covalently attached to. Together with protein-protein interactions, they form a mechanistic basis for many essential
cell Cell most often refers to: * Cell (biology), the functional basic unit of life Cell may also refer to: Locations * Monastic cell, a small room, hut, or cave in which a religious recluse lives, alternatively the small precursor of a monastery w ...
processes, especially for cell-cell interactions and host-cell interactions. For instance,
SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) is a strain of coronavirus that causes COVID-19 (coronavirus disease 2019), the respiratory illness responsible for the ongoing COVID-19 pandemic. The virus previously had a ...
, the causative agent of
COVID-19 Coronavirus disease 2019 (COVID-19) is a contagious disease caused by a virus, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The first known case was COVID-19 pandemic in Hubei, identified in Wuhan, China, in December ...
, employs its extensively
glycosylated Glycosylation is the reaction in which a carbohydrate (or 'glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not alw ...
spike (S) protein to bind to the
ACE2 Angiotensin-converting enzyme 2 (ACE2) is an enzyme that can be found either attached to the membrane of cells (mACE2) in the intestines, kidney, testis, gallbladder, and heart or in a soluble form (sACE2). Both membrane bound and soluble ACE2 a ...
receptor, allowing it to enter host cells. The spike protein is a trimeric structure, with each
subunit Subunit may refer to: *Subunit HIV vaccine, a class of HIV vaccine *Protein subunit, a protein molecule that assembles with other protein molecules *Monomer, a molecule that may bind chemically to other molecules to form a polymer *Sub-subunit, a ...
containing 22
N-glycosylation ''N''-linked glycosylation, is the attachment of an oligosaccharide, a carbohydrate consisting of several sugar molecules, sometimes also referred to as glycan, to a nitrogen atom (the amide nitrogen of an asparagine (Asn) residue of a protein), ...
sites, making it an attractive target for
vaccine A vaccine is a biological Dosage form, preparation that provides active acquired immunity to a particular infectious disease, infectious or cancer, malignant disease. The safety and effectiveness of vaccines has been widely studied and verifie ...
search. Glycosilation, i.e., the addition of glycans (a generic name for
monosaccharides Monosaccharides (from Greek ''monos'': single, '' sacchar'': sugar), also called simple sugars, are the simplest forms of sugar and the most basic units (monomers) from which all carbohydrates are built. They are usually colorless, water-solu ...
and
oligosaccharides An oligosaccharide (/ˌɑlɪgoʊˈsækəˌɹaɪd/; from the Greek ὀλίγος ''olígos'', "a few", and σάκχαρ ''sácchar'', "sugar") is a saccharide polymer containing a small number (typically two to ten) of monosaccharides (simple sugar ...
) to a protein, is one of the major
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosome ...
of
proteins Proteins are large biomolecules and macromolecules that comprise one or more long chains of amino acid residues. Proteins perform a vast array of functions within organisms, including catalysing metabolic reactions, DNA replication, respo ...
contributing to the enormous biological complexity of life. Indeed, three different
hexoses In chemistry, a hexose is a monosaccharide (simple sugar) with six carbon atoms. The chemical formula for all hexoses is C6H12O6, and their molecular weight is 180.156 g/mol. Hexoses exist in two forms, open-chain or cyclic, that easily conver ...
could theoretically produce from 1056 to 27,648 unique trisaccharides in contrast to only 6
peptides Peptides (, ) are short chains of amino acids linked by peptide bonds. Long chains of amino acids are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides. A p ...
or
oligonucleotides Oligonucleotides are short DNA or RNA molecules, oligomers, that have a wide range of applications in genetic testing, research, and forensics. Commonly made in the laboratory by solid-phase chemical synthesis, these small bits of nucleic acids c ...
formed from 3
amino acids Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...
or 3
nucleotides Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules w ...
respectively. In contrast to template-driven
protein biosynthesis Protein biosynthesis (or protein synthesis) is a core biological process, occurring inside cells, balancing the loss of cellular proteins (via degradation or export) through the production of new proteins. Proteins perform a number of critical ...
, the "language" of glycosylation is still unknown, making
glycobiology Defined in the narrowest sense, glycobiology is the study of the structure, biosynthesis, and biology of saccharides (sugar chains or glycans) that are widely distributed in nature. Sugars or saccharides are essential components of all living thing ...
a hot topic of current research given their prevalence in living organisms. The study of glycan-protein interactions provides insight into the mechanisms of cell-signaling and allows to create better-diagnosing tools for many diseases, including
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
. Indeed, there are no known types of cancer that do not involve erratic patterns of protein
glycosylation Glycosylation is the reaction in which a carbohydrate (or ' glycan'), i.e. a glycosyl donor, is attached to a hydroxyl or other functional group of another molecule (a glycosyl acceptor) in order to form a glycoconjugate. In biology (but not al ...
.


Thermodynamics of Binding

The binding of glycan-binding proteins (GBPs) to glycans could be modeled with simple equilibrium. Denoting glycans as G and proteins as P: Protein (P) + Glycan (G) \rightleftharpoons PG With an associated
equilibrium constant The equilibrium constant of a chemical reaction is the value of its reaction quotient at chemical equilibrium, a state approached by a dynamic chemical system after sufficient time has elapsed at which its composition has no measurable tendency ...
of K_a = \frac Which is rearranged to give
dissociation constant In chemistry, biochemistry, and pharmacology, a dissociation constant (K_D) is a specific type of equilibrium constant that measures the propensity of a larger object to separate (dissociate) reversibly into smaller components, as when a complex fa ...
K_d following biochemical conventions: K_d = \frac Given that many GBPs exhibit multivalency, this model may be expanded to account for multiple equilibria: P + G \rightleftharpoons PG PG + G \rightleftharpoons PG_2 \dots PG_ + G \rightleftharpoons PG_n Denoting cumulative equilibrium of binding with i ligands as P + iG \rightleftharpoons PG_i With corresponding equilibrium constant: \beta_i = \frac And writing
material balance In physics, a mass balance, also called a material balance, is an application of conservation of mass to the analysis of physical systems. By accounting for material entering and leaving a system, mass flows can be identified which might have bee ...
for protein (c_P denotes the total
concentration In chemistry, concentration is the abundance of a constituent divided by the total volume of a mixture. Several types of mathematical description can be distinguished: '' mass concentration'', ''molar concentration'', ''number concentration'', an ...
of protein): c_P = + G+ \dots + G_n Expressing the terms through an equilibrium constant, a final result is found: c_P = 1 + \beta_1 + \dots + \beta_n n The concentration of free protein is, thus: = \frac If n=1, i.e. there is only one carbohydrate receptor domain, the equation reduces to = \frac With increasing i the concentration of free protein decreases; hence, the apparent K_D decreases too.


Binding with aromatic rings

The chemical intuition suggests that the glycan-binding sites may be enriched in polar amino acid residues that form
non-covalent interactions In chemistry, a non-covalent interaction differs from a covalent bond in that it does not involve the sharing of electrons, but rather involves more dispersed variations of electromagnetic interactions between molecules or within a molecule. The c ...
, such as
hydrogen bonds In chemistry, a hydrogen bond (or H-bond) is a primarily electrostatic force of attraction between a hydrogen (H) atom which is covalently bound to a more electronegative "donor" atom or group (Dn), and another electronegative atom bearing a ...
, with
polar Polar may refer to: Geography Polar may refer to: * Geographical pole, either of two fixed points on the surface of a rotating body or planet, at 90 degrees from the equator, based on the axis around which a body rotates * Polar climate, the c ...
carbohydrates. Indeed, a statistical analysis of carbohydrate-binding pockets shows that
aspartic acid Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
and
asparagine Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
residues are present twice as often as would be predicted by chance. Surprisingly, there is an even stronger preference for
aromatic amino acids An aromatic amino acid is an amino acid that includes an aromatic ring. Among the 20 standard amino acids, the following are classically considered aromatic: phenylalanine, tryptophan and tyrosine. Although histidine contains an aromatic ring, ...
:
tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α- carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromatic ...
has a 9-fold increase in prevalence,
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
a 3-fold one, and
histidine Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the de ...
a 2-fold increase. It has been shown that the underlying force is the CH-\pi interaction between the aromatic \pi system and the C-H in carbohydrate as shown in ''Figure 1''. The CH-\pi interaction is identified if the \theta \leqslant 40°, the CH-\pi distance (distance from C to X) is less than 4.5Å.


Effects of stereochemistry

This CH-\pi interaction strongly depends on the
stereochemistry Stereochemistry, a subdiscipline of chemistry, involves the study of the relative spatial arrangement of atoms that form the structure of molecules and their manipulation. The study of stereochemistry focuses on the relationships between stereois ...
of the
carbohydrate In organic chemistry, a carbohydrate () is a biomolecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms, usually with a hydrogen–oxygen atom ratio of 2:1 (as in water) and thus with the empirical formula (where ''m'' may or ma ...
molecule. For example, consider the top (\beta) and bottom (\alpha) faces of \beta-D-Glucose and \beta-D-Galactose. It has been shown that a single change in the stereochemistry at C4 carbon shifts preference for aromatic residues from \beta side (2.7 fold preference for glucose) to the \alpha side (14 fold preference for galactose).


Effects of electronics

The comparison of electrostatic surface
potentials Potential generally refers to a currently unrealized ability, in a wide variety of fields from physics to the social sciences. Mathematics and physics * Scalar potential, a scalar field whose gradient is a given vector field * Vector potential ...
(ESPs) of
aromatic In chemistry, aromaticity is a chemical property of cyclic ( ring-shaped), ''typically'' planar (flat) molecular structures with pi bonds in resonance (those containing delocalized electrons) that gives increased stability compared to satur ...
rings in
tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α- carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromatic ...
,
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
,
phenylalanine Phenylalanine (symbol Phe or F) is an essential α-amino acid with the formula . It can be viewed as a benzyl group substituted for the methyl group of alanine, or a phenyl group in place of a terminal hydrogen of alanine. This essential amino a ...
, and
histidine Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the de ...
suggests that electronic effects also play a role in the binding to glycans (see ''Figure 2''). After normalizing the electron densities for surface area, the tryptophan still remains the most electron rich acceptor of CH-\pi interactions, suggesting a possible reason for its 9-fold prevalence in carbohydrate binding pockets. Overall, the electrostatic potential maps follow the prevalence trend of Trp >> Tyr > (Phe) > His.


Carbohydrate-binding partners

There are many proteins capable of binding to glycans, including
lectins Lectins are carbohydrate-binding proteins that are highly specific for sugar groups that are part of other molecules, so cause agglutination of particular cells or precipitation of glycoconjugates and polysaccharides. Lectins have a role in rec ...
,
antibodies An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
, microbial
adhesins Adhesins are cell-surface components or appendages of bacteria that facilitate adhesion or adherence to other cells or to surfaces, usually in the host they are infecting or living in. Adhesins are a type of virulence factor. Adherence is an essent ...
, viral
agglutinins Agglutination is the clumping of particles. The word ''agglutination'' comes from the Latin '' agglutinare'' (glueing to). Agglutination is the process that occurs if an antigen is mixed with its corresponding antibody called isoagglutinin. Th ...
, etc.


Lectins

Lectins is a generic name for proteins with carbohydrate-recognizing domains (CRD). Although it became almost synonymous with glycan-binding proteins, it does not include
antibodies An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
which also belong to the class. Lectins found in
plants Plants are predominantly Photosynthesis, photosynthetic eukaryotes of the Kingdom (biology), kingdom Plantae. Historically, the plant kingdom encompassed all living things that were not animals, and included algae and fungi; however, all curr ...
and
fungi A fungus ( : fungi or funguses) is any member of the group of eukaryotic organisms that includes microorganisms such as yeasts and molds, as well as the more familiar mushrooms. These organisms are classified as a kingdom, separately from ...
cells have been extensively used in research as a tool to detect, purify, and analyze glycans. However, useful lectins usually have sub-optimal specificities. For instance, ''
Ulex europaeus ''Ulex europaeus'', the gorse, common gorse, furze or whin, is a species of flowering plant in the family Fabaceae, native to the British Isles and Western Europe. Description Growing to tall, it is an evergreen shrub. The young stems are g ...
'' agglutinin-1 (UEA-1), a plant-extracted lectin capable of binding to human blood type O
antigen In immunology, an antigen (Ag) is a molecule or molecular structure or any foreign particulate matter or a pollen grain that can bind to a specific antibody or T-cell receptor. The presence of antigens in the body may trigger an immune response. ...
, can also bind to unrelated glycans such as 2'-fucosyllactose, GalNAcα1-4(Fucα1-2)Galβ1-4GlcNAc, and Lewis-Y antigen.


Antibodies

Although
antibodies An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
exhibit nanomolar affinities toward protein antigens, the specificity against glycans is very limited. In fact, available antibodies may bind only <4% of the 7000 mammalian glycan antigens; moreover, most of those antibodies have low affinity and exhibit cross-reactivity.


Lambodies

In contrast with jawed
vertebrates Vertebrates () comprise all animal taxa within the subphylum Vertebrata () ( chordates with backbones), including all mammals, birds, reptiles, amphibians, and fish. Vertebrates represent the overwhelming majority of the phylum Chordata, ...
whose
immunity Immunity may refer to: Medicine * Immunity (medical), resistance of an organism to infection or disease * ''Immunity'' (journal), a scientific journal published by Cell Press Biology * Immune system Engineering * Radiofrequence immunity desc ...
is based on variable, diverse, and joining gene segments (VDJs) of
immunoglobulins An antibody (Ab), also known as an immunoglobulin (Ig), is a large, Y-shaped protein used by the immune system to identify and neutralize foreign objects such as pathogenic bacteria and viruses. The antibody recognizes a unique molecule of the ...
, the jawless
invertebrates Invertebrates are a paraphyletic group of animals that neither possess nor develop a vertebral column (commonly known as a ''backbone'' or ''spine''), derived from the notochord. This is a grouping including all animals apart from the chordate ...
, such as
lamprey Lampreys (sometimes inaccurately called lamprey eels) are an ancient extant lineage of jawless fish of the order Petromyzontiformes , placed in the superclass Cyclostomata. The adult lamprey may be characterized by a toothed, funnel-like s ...
and
hagfish Hagfish, of the class Myxini (also known as Hyperotreti) and order Myxiniformes , are eel-shaped, slime-producing marine fish (occasionally called slime eels). They are the only known living animals that have a skull but no vertebral column, a ...
, create a receptor diversity by somatic DNA rearrangement of
leucine Leucine (symbol Leu or L) is an essential amino acid that is used in the biosynthesis of proteins. Leucine is an α-amino acid, meaning it contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α- ca ...
-rich repeat (LRR) modules that are incorporate in *vlr*
genes In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
(variable leukocyte receptors). Those LRR form 3D structures resembling curved
solenoids upright=1.20, An illustration of a solenoid upright=1.20, Magnetic field created by a seven-loop solenoid (cross-sectional view) described using field lines A solenoid () is a type of electromagnet formed by a helix, helical coil of wire whose ...
that selectively bind specific glycans. A study from University of Maryland has shown that lamprey antibodies (lambodies) could selectively bind to
tumor A neoplasm () is a type of abnormal and excessive growth of tissue. The process that occurs to form or produce a neoplasm is called neoplasia. The growth of a neoplasm is uncoordinated with that of the normal surrounding tissue, and persists ...
-associated carbohydrate antigens (such as Tn and TF\alpha) at nanomolar affinities. The T-nouvelle antigen (Tn) and TF\alpha are present in proteins in as much as 90% of different
cancer Cancer is a group of diseases involving abnormal cell growth with the potential to invade or spread to other parts of the body. These contrast with benign tumors, which do not spread. Possible signs and symptoms include a lump, abnormal b ...
cells after
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosome ...
, whereas in healthy cells those antigens are much more complex. A selection of lambodies that could bind to aGPA, a human
erythrocyte Red blood cells (RBCs), also referred to as red cells, red blood corpuscles (in humans or other animals not having nucleus in red blood cells), haematids, erythroid cells or erythrocytes (from Greek ''erythros'' for "red" and ''kytos'' for "holl ...
membrane A membrane is a selective barrier; it allows some things to pass through but stops others. Such things may be molecules, ions, or other small particles. Membranes can be generally classified into synthetic membranes and biological membranes. B ...
glycoprotein Glycoproteins are proteins which contain oligosaccharide chains covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glycos ...
that is covered with 16 TF\alpha moieties, through magnetic-activated cell sorting (MACS) and fluorescence-activated cell sorting (FACS) has yielded a leucine-rich lambody ''VLRB.aGPA.23''. This lambody selectively stained (over healthy samples) cells from 14 different types of
adenocarcinomas Adenocarcinoma (; plural adenocarcinomas or adenocarcinomata ) (AC) is a type of cancerous tumor that can occur in several parts of the body. It is defined as neoplasia of epithelial tissue that has glandular origin, glandular characteristics, or ...
:
bladder The urinary bladder, or simply bladder, is a hollow organ in humans and other vertebrates that stores urine from the kidneys before disposal by urination. In humans the bladder is a distensible organ that sits on the pelvic floor. Urine enters ...
,
esophagus The esophagus (American English) or oesophagus (British English; both ), non-technically known also as the food pipe or gullet, is an organ in vertebrates through which food passes, aided by peristaltic contractions, from the pharynx to the ...
,
ovary The ovary is an organ in the female reproductive system that produces an ovum. When released, this travels down the fallopian tube into the uterus, where it may become fertilized by a sperm. There is an ovary () found on each side of the body. ...
,
tongue The tongue is a muscular organ (anatomy), organ in the mouth of a typical tetrapod. It manipulates food for mastication and swallowing as part of the digestive system, digestive process, and is the primary organ of taste. The tongue's upper surfa ...
, cheek,
cervix The cervix or cervix uteri (Latin, 'neck of the uterus') is the lower part of the uterus (womb) in the human female reproductive system. The cervix is usually 2 to 3 cm long (~1 inch) and roughly cylindrical in shape, which changes during ...
,
liver The liver is a major Organ (anatomy), organ only found in vertebrates which performs many essential biological functions such as detoxification of the organism, and the Protein biosynthesis, synthesis of proteins and biochemicals necessary for ...
, nose,
nasopharynx The pharynx (plural: pharynges) is the part of the throat behind the mouth and nasal cavity, and above the oesophagus and trachea (the tubes going down to the stomach and the lungs). It is found in vertebrates and invertebrates, though its struct ...
, greater omentum, colon,
breast The breast is one of two prominences located on the upper ventral region of a primate's torso. Both females and males develop breasts from the same embryological tissues. In females, it serves as the mammary gland, which produces and secret ...
,
larynx The larynx (), commonly called the voice box, is an organ in the top of the neck involved in breathing, producing sound and protecting the trachea against food aspiration. The opening of larynx into pharynx known as the laryngeal inlet is about ...
, and
lung The lungs are the primary organs of the respiratory system in humans and most other animals, including some snails and a small number of fish. In mammals and most other vertebrates, two lungs are located near the backbone on either side of t ...
. Moreover, patients whose tissues stained positive with ''VLRB.aGPA.23'' had a significantly smaller survival rate. A close look at the crystal structure of ''VLRB.aGPA.23'' reveals a tryptophan residue at position 187 right over the carbohydrate binding pocket.


Multivalency in structure

Many glycan binding proteins (GBPs) are
oligomeric In chemistry and biochemistry, an oligomer () is a molecule that consists of a few repeating units which could be derived, actually or conceptually, from smaller molecules, monomers.Quote: ''Oligomer molecule: A molecule of intermediate relativ ...
and typically contain multiple sites for glycan binding (also called carbohydrate-recognition domains). The ability to form multivalent protein-
ligand In coordination chemistry, a ligand is an ion or molecule (functional group) that binds to a central metal atom to form a coordination complex. The bonding with the metal generally involves formal donation of one or more of the ligand's electr ...
interactions significantly enhances the strength of binding: while K_D values for individual CRD-glycan interactions may be in the mM range, the overall affinity of GBP towards glycans may reach
nanomolar Molar concentration (also called molarity, amount concentration or substance concentration) is a measure of the concentration of a chemical species, in particular of a solute in a solution, in terms of amount of substance per unit volume of solut ...
or even
picomolar Molar concentration (also called molarity, amount concentration or substance concentration) is a measure of the concentration of a chemical species, in particular of a solute in a solution, in terms of amount of substance per unit volume of solu ...
ranges. The overall strength of interactions is described as ''
avidity In biochemistry, avidity refers to the accumulated strength of ''multiple'' affinities of individual non-covalent binding interactions, such as between a protein receptor and its ligand, and is commonly referred to as functional affinity. Avidity di ...
'' K_D (in contrast with an ''
affinity Affinity may refer to: Commerce, finance and law * Affinity (law), kinship by marriage * Affinity analysis, a market research and business management technique * Affinity Credit Union, a Saskatchewan-based credit union * Affinity Equity Partn ...
'' K_D which describes single equilibrium). Sometimes the ''avidity'' is also called an ''apparent'' K_D to emphasize the non-equilibrium nature of the interaction. Common oligomerization structures of
lectins Lectins are carbohydrate-binding proteins that are highly specific for sugar groups that are part of other molecules, so cause agglutination of particular cells or precipitation of glycoconjugates and polysaccharides. Lectins have a role in rec ...
are shown below. For example, galectins are usually observed as dimers, while intelectins form trimers and
pentraxins Pentraxins (PTX), also known as pentaxins, are an evolutionary conserved family of proteins characterised by containing a pentraxin protein domain. Proteins of the pentraxin family are involved in acute immunological responses. They are a clas ...
assemble into pentamers. Larger structures, like hexameric Reg proteins, may assemble into membrane penetrating pores. Collectins may form even more bizarre complexes: bouquets of trimers or even cruciform-like structures (e.g. in
SP-D Surfactant protein D, also known as SP-D, is a lung surfactant protein part of the collagenous family of proteins called collectin. In humans, SP-D is encoded by the ''SFTPD'' gene and is part of the innate immune system. Each SP-D subunit is com ...
).


Current Research

Given the importance of glycan-protein interactions, there is an ongoing research dedicated to the a) creation of new tools to detect glycan-protein interactions and b) using those tools to decipher the so-called sugar code.


Glycan Arrays

One of the most widely used tools for probing glycan-protein interactions is glycan arrays. A glycan array usually is an NHS- or
epoxy Epoxy is the family of basic components or cured end products of epoxy resins. Epoxy resins, also known as polyepoxides, are a class of reactive prepolymers and polymers which contain epoxide groups. The epoxide functional group is also coll ...
-activated glass slides on which various
glycans The terms glycans and polysaccharides are defined by IUPAC as synonyms meaning "compounds consisting of a large number of monosaccharides linked glycosidically". However, in practice the term glycan may also be used to refer to the carbohydrate p ...
were printed using robotic printing. These commercially available arrays may contain up to 600 different glycans, specificity of which has been extensively studied. Glycan-protein interactions may be detected by testing proteins of interest (or
libraries A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vir ...
of those) that bear fluorescent tags. The structure of the glycan-binding protein may be deciphered by several analytical methods based on mass-spectrometry, including MALDI-MS, LC-MS, tandem MS-MS, and/or
2D NMR Two-dimensional nuclear magnetic resonance spectroscopy (2D NMR) is a set of nuclear magnetic resonance spectroscopy (NMR) methods which give data plotted in a space defined by two frequency axes rather than one. Types of 2D NMR include correlation ...
.


Bioinformatics driven research

Computational methods have been applied to search for parameters (e.g. residue propensity, hydrophobicity, planarity) that could distinguish glycan-binding proteins from other surface patches. For example, a model trained on 19 non-homologous carbohydrate binding structures was able to predict carbohydrate-binding domains (CRDs) with an accuracy of 65% for non-enzymatic structures and 87% for enzymatic ones. Further studies have employed calculations of Van der Waals energies of protein-probe interactions and amino acid propensities to identify CRDs with 98% specificity at 73% sensitivity. More recent methods can predict CRDs even from
protein sequences Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthes ...
, by comparing the sequence with those for which structures are already known.


Sugar code

In contrast with protein studies, where a
primary protein structure Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesi ...
is unambiguously defined by the sequence of
nucleotides Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules w ...
(the
genetic code The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
), the glycobiology still cannot explain how a certain "message" is encoded using carbohydrates or how it is "read" and "translated" by other biological entities. An interdisciplinary effort, combining chemistry, biology, and biochemistry, studies glycan-protein interactions to see how different sequences of carbohydrates initiate different cellular responses.


See also

* Protein-protein interactions *
Glycobiology Defined in the narrowest sense, glycobiology is the study of the structure, biosynthesis, and biology of saccharides (sugar chains or glycans) that are widely distributed in nature. Sugars or saccharides are essential components of all living thing ...


References

{{reflist Glycoproteins Monosaccharides Oligosaccharides Glycobiology Protein–protein interaction assays