HOME

TheInfoList



OR:

Amino acids are
organic compound In chemistry, organic compounds are generally any chemical compounds that contain carbon-hydrogen or carbon-carbon bonds. Due to carbon's ability to catenate (form chains with other carbon atoms), millions of organic compounds are known. The s ...
s that contain both
amino In chemistry, amines (, ) are compounds and functional groups that contain a basic nitrogen atom with a lone pair. Amines are formally derivatives of ammonia (), wherein one or more hydrogen atoms have been replaced by a substituent such ...
and
carboxylic acid In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group () attached to an R-group. The general formula of a carboxylic acid is or , with R referring to the alkyl, alkenyl, aryl, or other group. Carboxylic ...
functional group In organic chemistry, a functional group is a substituent or moiety in a molecule that causes the molecule's characteristic chemical reactions. The same functional group will undergo the same or similar chemical reactions regardless of the res ...
s. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha amino acids appear in the
genetic code The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
. Amino acids can be classified according to the locations of the core structural functional groups, as Alpha and beta carbon, alpha- (α-), beta- (β-), gamma- (γ-) or delta- (δ-) amino acids; other categories relate to Chemical polarity, polarity, ionization, and side chain group type (aliphatic, Open-chain compound, acyclic, aromatic, containing hydroxyl or
sulfur Sulfur (or sulphur in British English) is a chemical element with the symbol S and atomic number 16. It is abundant, multivalent and nonmetallic. Under normal conditions, sulfur atoms form cyclic octatomic molecules with a chemical formula ...
, etc.). In the form of proteins, amino acid '' residues'' form the second-largest component (
water Water (chemical formula ) is an Inorganic compound, inorganic, transparent, tasteless, odorless, and Color of water, nearly colorless chemical substance, which is the main constituent of Earth's hydrosphere and the fluids of all known living ...
being the largest) of human
muscle Skeletal muscles (commonly referred to as muscles) are organs of the vertebrate muscular system and typically are attached by tendons to bones of a skeleton. The muscle cells of skeletal muscles are much longer than in the other types of muscle ...
s and other tissues. Beyond their role as residues in proteins, amino acids participate in a number of processes such as
neurotransmitter A neurotransmitter is a signaling molecule secreted by a neuron to affect another cell across a synapse. The cell receiving the signal, any main body part or target cell, may be another neuron, but could also be a gland or muscle cell. Neurotr ...
transport and
biosynthesis Biosynthesis is a multi-step, enzyme- catalyzed process where substrates are converted into more complex products in living organisms. In biosynthesis, simple compounds are modified, converted into other compounds, or joined to form macromolecule ...
. It is thought that they played a key role in enabling life on Earth and its emergence. Amino acids are formally named by the IUPAC-IUBMB Joint Commission on Biochemical Nomenclature in terms of the fictitious "neutral" structure shown in the illustration. For example, the systematic name of alanine is 2-aminopropanoic acid, based on the formula . The Commission justified this approach as follows:
The systematic names and formulas given refer to hypothetical forms in which amino groups are unprotonated and carboxyl groups are undissociated. This convention is useful to avoid various nomenclatural problems but should not be taken to imply that these structures represent an appreciable fraction of the amino-acid molecules.


History

The first few amino acids were discovered in the early 1800s. In 1806, French chemists
Louis-Nicolas Vauquelin Prof. Louis Nicolas Vauquelin FRS(For) HFRSE (16 May 1763 – 14 November 1829) was a French pharmacist and chemist. He was the discoverer of both chromium and beryllium. Early life Vauquelin was born at Saint-André-d'Hébertot in Normandy, Fr ...
and
Pierre Jean Robiquet Pierre Jean Robiquet (13 January 1780 – 29 April 1840) was a French chemist. He laid founding work in identifying amino acids, the fundamental building blocks of proteins. He did this through recognizing the first of them, asparagine, in 180 ...
isolated a compound from asparagus that was subsequently named
asparagine Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
, the first amino acid to be discovered.
Cystine Cystine is the oxidized derivative of the amino acid cysteine and has the formula (SCH2CH(NH2)CO2H)2. It is a white solid that is poorly soluble in water. As a residue in proteins, cystine serves two functions: a site of redox reactions and a mec ...
was discovered in 1810, although its monomer, cysteine, remained undiscovered until 1884.
Glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogen ...
and
leucine Leucine (symbol Leu or L) is an essential amino acid that is used in the biosynthesis of proteins. Leucine is an α-amino acid, meaning it contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α- c ...
were discovered in 1820. The last of the 20 common amino acids to be discovered was
threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), a carboxyl group (which is in the deprotonated −COO ...
in 1935 by
William Cumming Rose William Cumming Rose (April 4, 1887 – September 25, 1985) was an American biochemist and nutritionist. He discovered the amino acid threonine, and his research determined the necessity for essential amino acids in diet and the minimum daily re ...
, who also determined the
essential amino acid An essential amino acid, or indispensable amino acid, is an amino acid that cannot be synthesized from scratch by the organism fast enough to supply its demand, and must therefore come from the diet. Of the 21 amino acids common to all life form ...
s and established the minimum daily requirements of all amino acids for optimal growth. The unity of the chemical category was recognized by Wurtz in 1865, but he gave no particular name to it. The first use of the term "amino acid" in the English language dates from 1898, while the German term, , was used earlier. Proteins were found to yield amino acids after enzymatic digestion or acid
hydrolysis Hydrolysis (; ) is any chemical reaction in which a molecule of water breaks one or more chemical bonds. The term is used broadly for substitution, elimination, and solvation reactions in which water is the nucleophile. Biological hydrolysis i ...
. In 1902,
Emil Fischer Hermann Emil Louis Fischer (; 9 October 1852 – 15 July 1919) was a German chemist and 1902 recipient of the Nobel Prize in Chemistry. He discovered the Fischer esterification. He also developed the Fischer projection, a symbolic way of dra ...
and
Franz Hofmeister Franz Hofmeister (30 August 1850, in Prague – 26 July 1922, in Würzburg) was an early protein scientist, and is famous for his studies of salts that influence the solubility and conformational stability of proteins. In 1902, Hofmeister became t ...
independently proposed that proteins are formed from many amino acids, whereby bonds are formed between the amino group of one amino acid with the carboxyl group of another, resulting in a linear structure that Fischer termed "
peptide Peptides (, ) are short chains of amino acids linked by peptide bonds. Long chains of amino acids are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides. A p ...
".


General structure

In the structure shown at the top of the page, R represents a
side chain In organic chemistry and biochemistry, a side chain is a chemical group that is attached to a core part of the molecule called the "main chain" or backbone. The side chain is a hydrocarbon branching element of a molecule that is attached to a ...
specific to each amino acid. The carbon atom next to the
carboxyl group In organic chemistry, a carboxylic acid is an organic acid that contains a carboxyl group () attached to an R-group. The general formula of a carboxylic acid is or , with R referring to the alkyl, alkenyl, aryl, or other group. Carboxylic ...
is called the α–carbon. Amino acids containing an
amino group In chemistry, amines (, ) are compounds and functional groups that contain a basic nitrogen atom with a lone pair. Amines are formally derivatives of ammonia (), wherein one or more hydrogen atoms have been replaced by a substituent such ...
bonded directly to the α-carbon are referred to as ''α-amino acids''. These include
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
and
hydroxyproline (2''S'',4''R'')-4-Hydroxyproline, or L-hydroxyproline ( C5 H9 O3 N), is an amino acid, abbreviated as Hyp or O, ''e.g.'', in Protein Data Bank. Structure and discovery In 1902, Hermann Emil Fischer isolated hydroxyproline from hydrolyzed gelatin. ...
, which are
secondary amine In chemistry, amines (, ) are compounds and functional groups that contain a basic nitrogen atom with a lone pair. Amines are formally derivatives of ammonia (), wherein one or more hydrogen atoms have been replaced by a substituent such ...
s. In the past they were often called ''imino acids'', a misnomer because they do not contain an imine grouping .Retrieved 2 April 2012 The obsolete term remains frequent.


Isomerism

The common natural forms of amino acids have the structure ( in the case of proline) and functional groups attached to the same C atom, and are thus α-amino acids. With the exception of achiral glycine, natural amino acids have the L configuration, and are the only ones found in proteins during translation in the ribosome. The L and D convention for amino acid configuration refers not to the optical activity of the amino acid itself but rather to the optical activity of the isomer of
glyceraldehyde Glyceraldehyde (glyceral) is a triose monosaccharide with chemical formula C3 H6 O3. It is the simplest of all common aldoses. It is a sweet, colorless, crystalline solid that is an intermediate compound in carbohydrate metabolism. The word comes ...
from which that amino acid can, in theory, be synthesized (D-glyceraldehyde is dextrorotatory; L-glyceraldehyde is levorotatory). An alternative convention is to use the (''S'') and (''R'') designators to specify the ''absolute configuration''. Almost all of the amino acids in proteins are (''S'') at the α carbon, with cysteine being (''R'') and glycine non-
chiral Chirality is a property of asymmetry important in several branches of science. The word ''chirality'' is derived from the Greek (''kheir''), "hand", a familiar chiral object. An object or a system is ''chiral'' if it is distinguishable from i ...
. Cysteine has its side chain in the same geometric location as the other amino acids, but the ''R''/''S'' terminology is reversed because
sulfur Sulfur (or sulphur in British English) is a chemical element with the symbol S and atomic number 16. It is abundant, multivalent and nonmetallic. Under normal conditions, sulfur atoms form cyclic octatomic molecules with a chemical formula ...
has higher atomic number compared to the carboxyl oxygen which gives the side chain a higher priority by the Cahn-Ingold-Prelog sequence rules, whereas the atoms in most other side chains give them lower priority compared to the carboxyl group. D-amino acid residues are found in some proteins, but they are rare.


Side chains

Amino acids are designated as α- when the amino nitrogen atom is attached to the α-carbon, the carbon atom adjacent to the carboxylate group. In all cases below in this section the \mathrmK_\mathrm values (if any) refer to the ionization of the groups as amino acid residues in proteins. They are not \mathrmK_\mathrm values for the free amino acids (which are of little biochemical importance).


Aliphatic side-chains

Seven (of the 21 proteinogenic) amino acids have side-chains that contain only H and C. These, therefore, do not ionize. They are as follows (with three- and one-letter symbols in parentheses): *
Glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogen ...
(Gly, G): *
Alanine Alanine (symbol Ala or A), or α-alanine, is an α-amino acid that is used in the biosynthesis of proteins. It contains an amine group and a carboxylic acid group, both attached to the central carbon atom which also carries a methyl group side ...
(Ala, A): *
Valine Valine (symbol Val or V) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α- carboxylic acid group (which is in the deprotonat ...
(Val, V): *
Isoleucine Isoleucine (symbol Ile or I) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
(Ile, I): *
Leucine Leucine (symbol Leu or L) is an essential amino acid that is used in the biosynthesis of proteins. Leucine is an α-amino acid, meaning it contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α- c ...
(Leu, L): *
Phenylalanine Phenylalanine (symbol Phe or F) is an essential α-amino acid with the formula . It can be viewed as a benzyl group substituted for the methyl group of alanine, or a phenyl group in place of a terminal hydrogen of alanine. This essential amino a ...
(Phe, F): *
Proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
(Pro, P): cyclized onto the amine


Polar neutral side-chains

Two amino acids contain alcohol side chains. These do not ionize in normal conditions, though one, serine, becomes deprotonated during the catalysis by
serine protease Serine proteases (or serine endopeptidases) are enzymes that cleave peptide bonds in proteins. Serine serves as the nucleophilic amino acid at the (enzyme's) active site. They are found ubiquitously in both eukaryotes and prokaryotes. Seri ...
s: this is an example of severe perturbation, and is not characteristic of serine residues in general. *
Serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
(Ser, S, no \mathrmK_\mathrm when not severely perturbed): *
Threonine Threonine (symbol Thr or T) is an amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), a carboxyl group (which is in the deprotonated −COO ...
(Thr, T, no \mathrmK_\mathrm): Threonine has two chiral centers, not only the L (2''S'') chiral center at the α-carbon shared by all amino acids apart from achiral glycine, but also (3''R'') at the β-carbon. The full stereochemical specification is L-threonine (2''S'',3''R'').


Amide side-chains

Two amino acids have amide side-chains, as follows: *
Asparagine Asparagine (symbol Asn or N) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated −NH form under biological conditions), an α-carboxylic acid group (which is in the depro ...
(Asn, N): *
Glutamine Glutamine (symbol Gln or Q) is an α-amino acid that is used in the biosynthesis of proteins. Its side chain is similar to that of glutamic acid, except the carboxylic acid group is replaced by an amide. It is classified as a charge-neutral, ...
(Gln, Q): These side-chains do not ionize in the normal range of pH.


Sulfur-containing side-chains

Two side-chains contain sulfur atoms, of which one ionizes in the normal range (with \mathrmK_\mathrm indicated) and the other does not: * Cysteine (Cys, C, \mathrmK_\mathrm = 8.3): *
Methionine Methionine (symbol Met or M) () is an essential amino acid in humans. As the precursor of other amino acids such as cysteine and taurine, versatile compounds such as SAM-e, and the important antioxidant glutathione, methionine plays a critical ro ...
(Met, M, no \mathrmK_\mathrm):


Aromatic side-chains

Three amino acids have aromatic ring structures as side-chains, as illustrated. Of these, tyrosine ionizes in the normal range; the other two do not). *
Phenylalanine Phenylalanine (symbol Phe or F) is an essential α-amino acid with the formula . It can be viewed as a benzyl group substituted for the methyl group of alanine, or a phenyl group in place of a terminal hydrogen of alanine. This essential amino a ...
(Phe, F, no \mathrmK_\mathrm): left in the illustration *
Tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
(Tyr, Y, \mathrmK_\mathrm = 9.6): middle in the illustration *
Tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α-carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromatic ...
(Trp, W, no \mathrmK_\mathrm): right in the illustration


Anionic side-chains

Two amino acids have side-chains that are anions at ordinary pH. These amino acids are often referred to as if carboxylic acids but are more correctly called carboxylates, as they are deprotonated at most relevant pH values. The anionic carboxylate groups behave as Brønsted bases in all circumstances except for enzymes like
pepsin Pepsin is an endopeptidase that breaks down proteins into smaller peptides. It is produced in the gastric chief cells of the stomach lining and is one of the main digestive enzymes in the digestive systems of humans and many other animals, ...
that act in environments of very low pH like the mammalian stomach. *
Aspartate Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
("aspartic acid", Asp, D, \mathrmK_\mathrm = 4.1): *
Glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
("glutamic acid", Glu, E, \mathrmK_\mathrm = 4.5):


Cationic side-chains

There are three amino acids with side-chains that are cations at neutral pH (though in one, histidine, cationic and neutral forms both exist). They are commonly called ''basic amino acids'', but this term is misleading: histidine can act both as a Brønsted acid and as a Brønsted base at neutral pH, lysine acts as a Brønsted acid, and arginine has a fixed positive charge and does not ionize in neutral conditions. The names ''histidinium, lysinium'' and ''argininium'' would be more accurate names for the structures, but have essentially no currency. *
Histidine Histidine (symbol His or H) is an essential amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated –NH3+ form under biological conditions), a carboxylic acid group (which is in the d ...
(His, H, \mathrmK_\mathrm = 6.3): Protonated and deprotonated forms in equilibrium are shown at the left of the image *
Lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. It contains an α-amino group (which is in the protonated form under biological conditions), an α-carboxylic acid group (which is in the deprotonated −CO ...
(Lys, K, \mathrmK_\mathrm = 10.4): Shown in the middle of the image *
Arginine Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the a ...
(Arg, R, \mathrmK_\mathrm > 12): Shown at the right of the image


β- and γ-amino acids

Amino acids with the structure , such as β-alanine, a component of
carnosine Carnosine (''beta''-alanyl-L-histidine) is a dipeptide molecule, made up of the amino acids beta-alanine and histidine. It is highly concentrated in muscle and brain tissues. Carnosine was discovered by Russian chemist Vladimir Gulevich. Ca ...
and a few other peptides, are β-amino acids. Ones with the structure are γ-amino acids, and so on, where X and Y are two substituents (one of which is normally H).


Zwitterions

In aqueous solution at pH close to neutrality, amino acids exist as
zwitterion In chemistry, a zwitterion ( ; ), also called an inner salt or dipolar ion, is a molecule that contains an equal number of positively- and negatively-charged functional groups. : With amino acids, for example, in solution a chemical equilibrium w ...
s, i.e. as dipolar ions with both and in charged states, so the overall structure is . At physiological pH the so-called "neutral forms" are not present to any measurable degree. Although the two charges in the zwitterion structure add up to zero it is misleading to call a species with a net charge of zero "uncharged". In strongly acidic conditions (pH below 3), the carboxylate group becomes protonated and the structure becomes an ammonio carboxylic acid, . This is relevant for enzymes like pepsin that are active in acidic environments such as the mammalian stomach and
lysosomes A lysosome () is a membrane-bound organelle found in many animal cells. They are spherical vesicles that contain hydrolytic enzymes that can break down many kinds of biomolecules. A lysosome has a specific composition, of both its membrane pro ...
, but does not significantly apply to intracellular enzymes. In highly basic conditions (pH greater than 10, not normally seen in physiological conditions), the ammonio group is deprotonated to give . Although various definitions of acids and bases are used in chemistry, the only one that is useful for chemistry in aqueous solution is that of Brønsted: an acid is a species that can donate a proton to another species, and a base is one that can accept a proton. This criterion is used to label the groups in the above illustration. Notice that aspartate and glutamate are the principal groups that act as Brønsted bases, and the common references to these as ''acidic amino acids'' (together with the C terminal) is completely wrong and misleading. Likewise the so-called ''basic amino acids'' include one (histidine) that acts as both a Brønsted acid and a base, one (lysine) that acts primarily as a Brønsted acid, and one (arginine) that is normally irrelevant to acid-base behavior as it has a fixed positive charge. In addition, tyrosine and cysteine, which act primarily as acids at neutral pH, are usually forgotten in the usual classification.


Isoelectric point

For amino acids with uncharged side-chains the zwitterion predominates at pH values between the two p''K''a values, but coexists in equilibrium with small amounts of net negative and net positive ions. At the midpoint between the two p''K''a values, the trace amount of net negative and trace of net positive ions balance, so that average net charge of all forms present is zero. This pH is known as the
isoelectric point The isoelectric point (pI, pH(I), IEP), is the pH at which a molecule carries no net electrical charge or is electrically neutral in the statistical mean. The standard nomenclature to represent the isoelectric point is pH(I). However, pI is also ...
p''I'', so p''I'' = (p''K''a1 + p''K''a2). For amino acids with charged side chains, the p''K''a of the side chain is involved. Thus for aspartate or glutamate with negative side chains, the terminal amino group is essentially entirely in the charged form , but this positive charge needs to be balanced by the state with just one C-terminal carboxylate group is negatively charged. This occurs halfway between the two carboxylate p''K''a values: p''I'' = (p''K''a1 + p''K''a(R)), where p''K''a(R) is the side chain p''K''a. Similar considerations apply to other amino acids with ionizable side-chains, including not only glutamate (similar to aspartate), but also cysteine, histidine, lysine, tyrosine and arginine with positive side chains Amino acids have zero mobility in electrophoresis at their isoelectric point, although this behaviour is more usually exploited for peptides and proteins than single amino acids. Zwitterions have minimum solubility at their isoelectric point, and some amino acids (in particular, with nonpolar side chains) can be isolated by precipitation from water by adjusting the pH to the required isoelectric point.


Physicochemical properties of amino acids

The ca. 20 canonical amino acids can be classified according to their properties. Important factors are charge,
hydrophilicity A hydrophile is a molecule or other molecular entity that is attracted to water molecules and tends to be dissolved by water.Liddell, H.G. & Scott, R. (1940). ''A Greek-English Lexicon'' Oxford: Clarendon Press. In contrast, hydrophobes are no ...
or
hydrophobicity In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water. Hydrophobic molecules tend to be nonpolar and, t ...
, size, and functional groups. These properties influence
protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer ma ...
and
protein–protein interaction Protein–protein interactions (PPIs) are physical contacts of high specificity established between two or more protein molecules as a result of biochemical events steered by interactions that include electrostatic forces, hydrogen bonding and ...
s. The water-soluble proteins tend to have their hydrophobic residues (
Leu Leu may refer to: Businesses and organisations * LEU, NYSE American stock symbol for Centrus Energy Corp. * London Ecology Unit, a former body (1986-2000) which advised London boroughs on environmental matters * Free and Equal (''LeU - Liberi e ...
,
Ile Ile may refer to: * iLe, a Puerto Rican singer * Ile District (disambiguation), multiple places * Ilé-Ifẹ̀, an ancient Yoruba city in south-western Nigeria * Interlingue (ISO 639:ile), a planned language * Isoleucine, an amino acid * Another ...
,
Val Val may refer to: Val-a Film * ''Val'' (film), an American documentary about Val Kilmer, directed by Leo Scott and Ting Poo Military equipment * Aichi D3A, a Japanese World War II dive bomber codenamed "Val" by the Allies * AS Val, a Sov ...
, Phe, and Trp) buried in the middle of the protein, whereas hydrophilic side chains are exposed to the aqueous solvent. (Note that in
biochemistry Biochemistry or biological chemistry is the study of chemical processes within and relating to living organisms. A sub-discipline of both chemistry and biology, biochemistry may be divided into three fields: structural biology, enzymology and ...
, a residue refers to a specific
monomer In chemistry, a monomer ( ; ''mono-'', "one" + ''-mer'', "part") is a molecule that can react together with other monomer molecules to form a larger polymer chain or three-dimensional network in a process called polymerization. Classification M ...
within the
polymer A polymer (; Greek '' poly-'', "many" + ''-mer'', "part") is a substance or material consisting of very large molecules called macromolecules, composed of many repeating subunits. Due to their broad spectrum of properties, both synthetic and ...
ic chain of a
polysaccharide Polysaccharides (), or polycarbohydrates, are the most abundant carbohydrates found in food. They are long chain polymeric carbohydrates composed of monosaccharide units bound together by glycosidic linkages. This carbohydrate can react with w ...
, protein or
nucleic acid Nucleic acids are biopolymers, macromolecules, essential to all known forms of life. They are composed of nucleotides, which are the monomers made of three components: a 5-carbon sugar, a phosphate group and a nitrogenous base. The two main clas ...
.) The
integral membrane protein An integral, or intrinsic, membrane protein (IMP) is a type of membrane protein that is permanently attached to the biological membrane. All ''transmembrane proteins'' are IMPs, but not all IMPs are transmembrane proteins. IMPs comprise a signif ...
s tend to have outer rings of exposed
hydrophobic In chemistry, hydrophobicity is the physical property of a molecule that is seemingly repelled from a mass of water (known as a hydrophobe). In contrast, hydrophiles are attracted to water. Hydrophobic molecules tend to be nonpolar and, t ...
amino acids that anchor them into the
lipid bilayer The lipid bilayer (or phospholipid bilayer) is a thin polar membrane made of two layers of lipid molecules. These membranes are flat sheets that form a continuous barrier around all cells. The cell membranes of almost all organisms and many vi ...
. Some
peripheral membrane protein Peripheral membrane proteins, or extrinsic membrane proteins, are membrane proteins that adhere only temporarily to the biological membrane with which they are associated. These proteins attach to integral membrane proteins, or penetrate the periph ...
s have a patch of hydrophobic amino acids on their surface that locks onto the membrane. In similar fashion, proteins that have to bind to positively charged molecules have surfaces rich with negatively charged amino acids like
glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
and
aspartate Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
, while proteins binding to negatively charged molecules have surfaces rich with positively charged chains like
lysine Lysine (symbol Lys or K) is an α-amino acid that is a precursor to many proteins. It contains an α-amino group (which is in the protonated form under biological conditions), an α-carboxylic acid group (which is in the deprotonated −CO ...
and
arginine Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the a ...
. For example, lysine and arginine are highly enriched in low-complexity regions of nucleic-acid binding proteins. There are various
hydrophobicity scale Hydrophobicity scales are values that define the relative hydrophobicity or hydrophilicity of amino acid residues. The more positive the value, the more hydrophobic are the amino acids located in that region of the protein. These scales are commonly ...
s of amino acid residues. Some amino acids have special properties such as cysteine, that can form covalent
disulfide bond In biochemistry, a disulfide (or disulphide in British English) refers to a functional group with the structure . The linkage is also called an SS-bond or sometimes a disulfide bridge and is usually derived by the coupling of two thiol groups. In ...
s to other cysteine residues,
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
that forms a cycle to the polypeptide backbone, and glycine that is more flexible than other amino acids. Furthermore, glycine and proline are highly enriched within low complexity regions of eukaryotic and prokaryotic proteins, whereas the opposite (under-represented) has been observed for highly reactive, or complex, or hydrophobic amino acids, such as cysteine, phenylalanine, tryptophan, methionine, valine, leucine, isoleucine. Many proteins undergo a range of
posttranslational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosom ...
s, whereby additional chemical groups are attached to the amino acid side chains. Some modifications can produce hydrophobic
lipoprotein A lipoprotein is a biochemical assembly whose primary function is to transport hydrophobic lipid (also known as fat) molecules in water, as in blood plasma or other extracellular fluids. They consist of a triglyceride and cholesterol center, su ...
s, or hydrophilic
glycoprotein Glycoproteins are proteins which contain oligosaccharide chains covalently attached to amino acid side-chains. The carbohydrate is attached to the protein in a cotranslational or posttranslational modification. This process is known as glyc ...
s. These types of modification allow the reversible targeting of a protein to a membrane. For example, the addition and removal of the fatty acid
palmitic acid Palmitic acid (hexadecanoic acid in IUPAC nomenclature) is a fatty acid with a 16-carbon chain. It is the most common saturated fatty acid found in animals, plants and microorganisms.Gunstone, F. D., John L. Harwood, and Albert J. Dijkstra. The L ...
to cysteine residues in some signaling proteins causes the proteins to attach and then detach from cell membranes.


Table of standard amino acid abbreviations and properties

Although one-letter symbols are included in the table, IUPAC–IUBMB recommend that "Use of the one-letter symbols should be restricted to the comparison of long sequences". Two additional amino acids are in some species coded for by
codons The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
that are usually interpreted as
stop codon In molecular biology (specifically protein biosynthesis), a stop codon (or termination codon) is a codon (nucleotide triplet within messenger RNA) that signals the termination of the translation process of the current protein. Most codons in m ...
s: In addition to the specific amino acid codes, placeholders are used in cases where chemical or
crystallographic Crystallography is the experimental science of determining the arrangement of atoms in crystalline solids. Crystallography is a fundamental subject in the fields of materials science and solid-state physics (condensed matter physics). The wor ...
analysis of a peptide or protein cannot conclusively determine the identity of a residue. They are also used to summarise conserved protein sequence motifs. The use of single letters to indicate sets of similar residues is similar to the use of abbreviation codes for degenerate bases. Unk is sometimes used instead of Xaa, but is less standard. Ter or * (from termination) is used in notation for mutations in proteins when a stop codon occurs. It correspond to no amino acid at all. In addition, many nonstandard amino acids have a specific code. For example, several peptide drugs, such as
Bortezomib Bortezomib, sold under the brand name Velcade among others, is an anti-cancer medication used to treat multiple myeloma and mantle cell lymphoma. This includes multiple myeloma in those who have and have not previously received treatment. It i ...
and MG132, are artificially synthesized and retain their
protecting group A protecting group or protective group is introduced into a molecule by chemical modification of a functional group to obtain chemoselectivity in a subsequent chemical reaction. It plays an important role in multistep organic synthesis. In many ...
s, which have specific codes. Bortezomib is Pyz–Phe–boroLeu, and MG132 is Z–Leu–Leu–Leu–al. To aid in the analysis of protein structure,
photo-reactive amino acid analog Photo-reactive amino acid analogs are artificial analogs of natural amino acids that can be used for crosslinking of protein complexes. Photo-reactive amino acid analogs may be incorporated into proteins and peptides ''in vivo'' or in ''vitro''. Pho ...
s are available. These include photoleucine (pLeu) and photomethionine (pMet).


Occurrence and functions in biochemistry

Amino acids which have the amine group attached to the (alpha-) carbon atom next to the carboxyl group have primary importance in living organisms since they participate in protein synthesis. They are known as 2-, alpha-, or α-amino acids (generic
formula In science, a formula is a concise way of expressing information symbolically, as in a mathematical formula or a ''chemical formula''. The informal use of the term ''formula'' in science refers to the general construct of a relationship betwe ...
in most cases, where R is an organic
substituent A substituent is one or a group of atoms that replaces (one or more) atoms, thereby becoming a moiety in the resultant (new) molecule. (In organic chemistry and biochemistry, the terms ''substituent'' and ''functional group'', as well as ''sid ...
known as a "
side chain In organic chemistry and biochemistry, a side chain is a chemical group that is attached to a core part of the molecule called the "main chain" or backbone. The side chain is a hydrocarbon branching element of a molecule that is attached to a ...
"); often the term "amino acid" is used to refer specifically to these. They include the 22
proteinogenic Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) amino aci ...
("protein-building") amino acids, which combine into peptide chains ("polypeptides") to form the building blocks of a vast array of proteins. These are all L-
stereoisomers In stereochemistry, stereoisomerism, or spatial isomerism, is a form of isomerism in which molecules have the same molecular formula and sequence of bonded atoms (constitution), but differ in the three-dimensional orientations of their atoms in ...
("left-handed"
enantiomer In chemistry, an enantiomer ( /ɪˈnænti.əmər, ɛ-, -oʊ-/ ''ih-NAN-tee-ə-mər''; from Ancient Greek ἐνάντιος ''(enántios)'' 'opposite', and μέρος ''(méros)'' 'part') – also called optical isomer, antipode, or optical ant ...
s), although a few D-amino acids ("right-handed") occur in bacterial envelopes, as a
neuromodulator Neuromodulation is the physiological process by which a given neuron uses one or more chemicals to regulate diverse populations of neurons. Neuromodulators typically bind to metabotropic, G-protein coupled receptors (GPCRs) to initiate a second m ...
(D-
serine Serine (symbol Ser or S) is an α-amino acid that is used in the biosynthesis of proteins. It contains an α-amino group (which is in the protonated − form under biological conditions), a carboxyl group (which is in the deprotonated − form un ...
), and in some
antibiotic An antibiotic is a type of antimicrobial substance active against bacteria. It is the most important type of antibacterial agent for fighting bacterial infections, and antibiotic medications are widely used in the treatment and prevention ...
s. Many proteinogenic and non-proteinogenic amino acids have biological functions. For example, in the
human brain The human brain is the central organ of the human nervous system, and with the spinal cord makes up the central nervous system. The brain consists of the cerebrum, the brainstem and the cerebellum. It controls most of the activities of the ...
, glutamate (standard
glutamic acid Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
) and gamma-aminobutyric acid ("GABA", nonstandard gamma-amino acid) are, respectively, the main excitatory and inhibitory neurotransmitters.
Hydroxyproline (2''S'',4''R'')-4-Hydroxyproline, or L-hydroxyproline ( C5 H9 O3 N), is an amino acid, abbreviated as Hyp or O, ''e.g.'', in Protein Data Bank. Structure and discovery In 1902, Hermann Emil Fischer isolated hydroxyproline from hydrolyzed gelatin. ...
, a major component of the connective tissue collagen, is synthesised from
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
.
Glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogen ...
is a biosynthetic precursor to
porphyrin Porphyrins ( ) are a group of heterocyclic macrocycle organic compounds, composed of four modified pyrrole subunits interconnected at their α carbon atoms via methine bridges (=CH−). The parent of porphyrin is porphine, a rare chemical compo ...
s used in
red blood cell Red blood cells (RBCs), also referred to as red cells, red blood corpuscles (in humans or other animals not having nucleus in red blood cells), haematids, erythroid cells or erythrocytes (from Greek ''erythros'' for "red" and ''kytos'' for "holl ...
s.
Carnitine Carnitine is a quaternary ammonium compound involved in metabolism in most mammals, plants, and some bacteria. In support of energy metabolism, carnitine transports long-chain fatty acids into mitochondria to be oxidized for energy production, an ...
is used in lipid transport. Nine proteinogenic amino acids are called " essential" for humans because they cannot be produced from other compounds by the human body and so must be taken in as food. Others may be conditionally essential for certain ages or medical conditions. Essential amino acids may also vary from
species In biology, a species is the basic unit of Taxonomy (biology), classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of ...
to species. Because of their biological significance, amino acids are important in nutrition and are commonly used in
nutritional supplement A dietary supplement is a manufactured product intended to supplement one's diet by taking a pill, capsule, tablet, powder, or liquid. A supplement can provide nutrients either extracted from food sources or that are synthetic in order ...
s,
fertilizer A fertilizer (American English) or fertiliser (British English; see spelling differences) is any material of natural or synthetic origin that is applied to soil or to plant tissues to supply plant nutrients. Fertilizers may be distinct from ...
s, feed, and
food technology Food technology is a branch of food science that deals with the production, preservation, quality control and research and development of the food products. Early scientific research into food technology concentrated on food preservation. Nic ...
. Industrial uses include the production of
drugs A drug is any chemical substance that causes a change in an organism's physiology or psychology when consumed. Drugs are typically distinguished from food and substances that provide nutritional support. Consumption of drugs can be via inhalati ...
,
biodegradable plastic Biodegradable plastics are plastics that can be decomposed by the action of living organisms, usually microbes, into water, carbon dioxide, and biomass. Biodegradable plastics are commonly produced with renewable raw materials, micro-organisms, ...
s, and chiral catalysts.


Proteinogenic amino acids

Amino acids are the precursors to proteins. They join by condensation reactions to form short polymer chains called peptides or longer chains called either polypeptides or proteins. These chains are linear and unbranched, with each amino acid residue within the chain attached to two neighboring amino acids. In Nature, the process of making proteins encoded by DNA/RNA genetic material is called ''
translation Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
'' and involves the step-by-step addition of amino acids to a growing protein chain by a
ribozyme Ribozymes (ribonucleic acid enzymes) are RNA molecules that have the ability to catalyze specific biochemical reactions, including RNA splicing in gene expression, similar to the action of protein enzymes. The 1982 discovery of ribozymes demonst ...
that is called a
ribosome Ribosomes ( ) are macromolecular machines, found within all cells, that perform biological protein synthesis (mRNA translation). Ribosomes link amino acids together in the order specified by the codons of messenger RNA (mRNA) molecules to f ...
. The order in which the amino acids are added is read through the
genetic code The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links ...
from an
mRNA In molecular biology, messenger ribonucleic acid (mRNA) is a single-stranded molecule of RNA that corresponds to the genetic sequence of a gene, and is read by a ribosome in the process of synthesizing a protein. mRNA is created during the p ...
template, which is an
RNA Ribonucleic acid (RNA) is a polymeric molecule essential in various biological roles in coding, decoding, regulation and expression of genes. RNA and deoxyribonucleic acid ( DNA) are nucleic acids. Along with lipids, proteins, and carbohyd ...
copy of one of the organism's
gene In biology, the word gene (from , ; "...Wilhelm Johannsen coined the word gene to describe the Mendelian units of heredity..." meaning ''generation'' or ''birth'' or ''gender'') can have several different meanings. The Mendelian gene is a ba ...
s. Twenty-two amino acids are naturally incorporated into polypeptides and are called
proteinogenic Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) amino aci ...
or natural amino acids. Of these, 20 are encoded by the universal genetic code. The remaining 2,
selenocysteine Selenocysteine (symbol Sec or U, in older publications also as Se-Cys) is the 21st proteinogenic amino acid. Selenoproteins contain selenocysteine residues. Selenocysteine is an analogue of the more common cysteine with selenium in place of the ...
and
pyrrolysine Pyrrolysine (symbol Pyl or O; encoded by the 'amber' stop codon UAG) is an α-amino acid that is used in the biosynthesis of proteins in some methanogenic archaea and bacteria; it is not present in humans. It contains an α-amino group (which ...
, are incorporated into proteins by unique synthetic mechanisms. Selenocysteine is incorporated when the mRNA being translated includes a
SECIS element In biology, the SECIS element (SECIS: ''selenocysteine insertion sequence'') is an RNA element around 60 nucleotides in length that adopts a stem-loop structure. This structural motif (pattern of nucleotides) directs the cell to translate UGA ...
, which causes the UGA codon to encode selenocysteine instead of a stop codon.
Pyrrolysine Pyrrolysine (symbol Pyl or O; encoded by the 'amber' stop codon UAG) is an α-amino acid that is used in the biosynthesis of proteins in some methanogenic archaea and bacteria; it is not present in humans. It contains an α-amino group (which ...
is used by some
methanogen Methanogens are microorganisms that produce methane as a metabolic byproduct in hypoxic conditions. They are prokaryotic and belong to the domain Archaea. All known methanogens are members of the archaeal phylum Euryarchaeota. Methanogens are co ...
ic archaea in enzymes that they use to produce
methane Methane ( , ) is a chemical compound with the chemical formula (one carbon atom bonded to four hydrogen atoms). It is a group-14 hydride, the simplest alkane, and the main constituent of natural gas. The relative abundance of methane on ...
. It is coded for with the codon UAG, which is normally a stop codon in other organisms. This UAG codon is followed by a
PYLIS downstream sequence In biology, the PYLIS downstream sequence (PYLIS: ''pyrrolysine insertion sequence'') is a stem-loop structure that appears on some mRNA sequences. This structural motif was previously thought to cause the UAG (amber) stop codon to be translated ...
. Several independent evolutionary studies have suggested that Gly, Ala, Asp, Val, Ser, Pro, Glu, Leu, Thr may belong to a group of amino acids that constituted the early genetic code, whereas Cys, Met, Tyr, Trp, His, Phe may belong to a group of amino acids that constituted later additions of the genetic code.


Standard vs nonstandard amino acids

The 20 amino acids that are encoded directly by the codons of the universal genetic code are called ''standard'' or ''canonical'' amino acids. A modified form of methionine ( ''N''-formylmethionine) is often incorporated in place of methionine as the initial amino acid of proteins in bacteria, mitochondria and chloroplasts. Other amino acids are called ''nonstandard'' or ''non-canonical''. Most of the nonstandard amino acids are also non-proteinogenic (i.e. they cannot be incorporated into proteins during translation), but two of them are proteinogenic, as they can be incorporated translationally into proteins by exploiting information not encoded in the universal genetic code. The two nonstandard proteinogenic amino acids are selenocysteine (present in many non-eukaryotes as well as most eukaryotes, but not coded directly by DNA) and
pyrrolysine Pyrrolysine (symbol Pyl or O; encoded by the 'amber' stop codon UAG) is an α-amino acid that is used in the biosynthesis of proteins in some methanogenic archaea and bacteria; it is not present in humans. It contains an α-amino group (which ...
(found only in some archaea and at least one
bacterium Bacteria (; singular: bacterium) are ubiquitous, mostly free-living organisms often consisting of one biological cell. They constitute a large domain of prokaryotic microorganisms. Typically a few micrometres in length, bacteria were among ...
). The incorporation of these nonstandard amino acids is rare. For example, 25 human proteins include selenocysteine in their primary structure, and the structurally characterized enzymes (selenoenzymes) employ selenocysteine as the catalytic
moiety Moiety may refer to: Chemistry * Moiety (chemistry), a part or functional group of a molecule ** Moiety conservation, conservation of a subgroup in a chemical species Anthropology * Moiety (kinship), either of two groups into which a society is ...
in their active sites. Pyrrolysine and selenocysteine are encoded via variant codons. For example, selenocysteine is encoded by stop codon and
SECIS element In biology, the SECIS element (SECIS: ''selenocysteine insertion sequence'') is an RNA element around 60 nucleotides in length that adopts a stem-loop structure. This structural motif (pattern of nucleotides) directs the cell to translate UGA ...
. ''N''-formylmethionine (which is often the initial amino acid of proteins in bacteria,
mitochondria A mitochondrion (; ) is an organelle found in the cells of most Eukaryotes, such as animals, plants and fungi. Mitochondria have a double membrane structure and use aerobic respiration to generate adenosine triphosphate (ATP), which is use ...
, and
chloroplast A chloroplast () is a type of membrane-bound organelle known as a plastid that conducts photosynthesis mostly in plant and algal cells. The photosynthetic pigment chlorophyll captures the energy from sunlight, converts it, and stores it in ...
s) is generally considered as a form of
methionine Methionine (symbol Met or M) () is an essential amino acid in humans. As the precursor of other amino acids such as cysteine and taurine, versatile compounds such as SAM-e, and the important antioxidant glutathione, methionine plays a critical ro ...
rather than as a separate proteinogenic amino acid. Codon–
tRNA Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino a ...
combinations not found in nature can also be used to "expand" the genetic code and form novel proteins known as
alloprotein An alloprotein is a novel synthetic protein containing one or more "non-natural" amino acids. Non-natural in the context means an amino acid either not occurring in nature (novel and synthesised amino acids),
s incorporating
non-proteinogenic amino acid In biochemistry, non-coded or non-proteinogenic amino acids are distinct from the 22 proteinogenic amino acids (21 in eukaryotesplus formylmethionine in eukaryotes with prokaryote organelles like mitochondria) which are naturally encoded in the g ...
s.


Non-proteinogenic amino acids

Aside from the 22
proteinogenic amino acid Proteinogenic amino acids are amino acids that are incorporated biosynthetically into proteins during translation. The word "proteinogenic" means "protein creating". Throughout known life, there are 22 genetically encoded (proteinogenic) amino ac ...
s, many ''non-proteinogenic'' amino acids are known. Those either are not found in proteins (for example
carnitine Carnitine is a quaternary ammonium compound involved in metabolism in most mammals, plants, and some bacteria. In support of energy metabolism, carnitine transports long-chain fatty acids into mitochondria to be oxidized for energy production, an ...
, GABA,
levothyroxine Levothyroxine, also known as -thyroxine, is a manufactured form of the thyroid hormone thyroxine (T4). It is used to treat thyroid hormone deficiency (hypothyroidism), including a severe form known as myxedema coma. It may also be used to tr ...
) or are not produced directly and in isolation by standard cellular machinery (for example,
hydroxyproline (2''S'',4''R'')-4-Hydroxyproline, or L-hydroxyproline ( C5 H9 O3 N), is an amino acid, abbreviated as Hyp or O, ''e.g.'', in Protein Data Bank. Structure and discovery In 1902, Hermann Emil Fischer isolated hydroxyproline from hydrolyzed gelatin. ...
and
selenomethionine Selenomethionine (SeMet) is a naturally occurring amino acid. The L-selenomethionine enantiomer is the main form of selenium found in Brazil nuts, cereal grains, soybeans, and grassland legumes, while ''Se''-methylselenocysteine, or its γ-glu ...
). Non-proteinogenic amino acids that are found in proteins are formed by
post-translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosom ...
, which is modification after translation during protein synthesis. These modifications are often essential for the function or regulation of a protein. For example, the
carboxylation Carboxylation is a chemical reaction in which a carboxylic acid is produced by treating a substrate with carbon dioxide. The opposite reaction is decarboxylation. In chemistry, the term carbonation is sometimes used synonymously with carboxylation ...
of
glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
allows for better binding of calcium cations, and collagen contains hydroxyproline, generated by
hydroxylation In chemistry, hydroxylation can refer to: *(i) most commonly, hydroxylation describes a chemical process that introduces a hydroxyl group () into an organic compound. *(ii) the ''degree of hydroxylation'' refers to the number of OH groups in a ...
of
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
. Another example is the formation of
hypusine Hypusine is an uncommon amino acid found in all eukaryotes and in some archaea, but not in bacteria. The only known proteins containing the hypusine residue is eukaryotic translation initiation factor 5A (eIF-5A) and a similar protein found in ar ...
in the
translation initiation factor Initiation factors are proteins that bind to the small subunit of the ribosome during the initiation of translation, a part of protein biosynthesis. Initiation factors can interact with repressors to slow down or prevent translation. They have th ...
EIF5A Eukaryotic translation initiation factor 5A-1 is a protein that in humans is encoded by the ''EIF5A'' gene. It is the only known protein to contain the unusual amino acid hypusine 'N''ε-(4-amino-2-hydroxybutyl)-lysine which is synthesized on e ...
, through modification of a lysine residue. Such modifications can also determine the localization of the protein, e.g., the addition of long hydrophobic groups can cause a protein to bind to a
phospholipid Phospholipids, are a class of lipids whose molecule has a hydrophilic "head" containing a phosphate group and two hydrophobic "tails" derived from fatty acids, joined by an alcohol residue (usually a glycerol molecule). Marine phospholipids typ ...
membrane. Some non-proteinogenic amino acids are not found in proteins. Examples include
2-aminoisobutyric acid 2-Aminoisobutyric acid (also known as α-aminoisobutyric acid, AIB, α-methylalanine, or 2-methylalanine) is the non-proteinogenic amino acid with the structural formula H2N-C(CH3)2-COOH. It is rare in nature, having been only found in meteorites, ...
and the neurotransmitter gamma-aminobutyric acid. Non-proteinogenic amino acids often occur as intermediates in the
metabolic pathway In biochemistry, a metabolic pathway is a linked series of chemical reactions occurring within a cell. The reactants, products, and intermediates of an enzymatic reaction are known as metabolites, which are modified by a sequence of chemical rea ...
s for standard amino acids – for example,
ornithine Ornithine is a non-proteinogenic amino acid that plays a role in the urea cycle. Ornithine is abnormally accumulated in the body in ornithine transcarbamylase deficiency. The radical is ornithyl. Role in urea cycle L-Ornithine is one of the produc ...
and
citrulline The organic compound citrulline is an α-amino acid. Its name is derived from ''citrullus'', the Latin word for watermelon. Although named and described by gastroenterologists since the late 19th century, it was first isolated from watermelon in ...
occur in the
urea cycle The urea cycle (also known as the ornithine cycle) is a cycle of biochemical reactions that produces urea (NH2)2CO from ammonia (NH3). Animals that use this cycle, mainly amphibians and mammals, are called ureotelic. The urea cycle converts highl ...
, part of amino acid
catabolism Catabolism () is the set of metabolic pathways that breaks down molecules into smaller units that are either oxidized to release energy or used in other anabolic reactions. Catabolism breaks down large molecules (such as polysaccharides, lipids, ...
(see below). A rare exception to the dominance of α-amino acids in biology is the β-amino acid beta alanine (3-aminopropanoic acid), which is used in plants and microorganisms in the synthesis of
pantothenic acid Pantothenic acid, also called vitamin B5 is a water-soluble B vitamin and therefore an essential nutrient. All animals require pantothenic acid in order to synthesize coenzyme A (CoA) – essential for fatty acid metabolism – as well as to, i ...
(vitamin B5), a component of
coenzyme A Coenzyme A (CoA, SHCoA, CoASH) is a coenzyme, notable for its role in the synthesis and oxidation of fatty acids, and the oxidation of pyruvate in the citric acid cycle. All genomes sequenced to date encode enzymes that use coenzyme A as a substr ...
.


In human nutrition

When taken up into the human body from the diet, the 20 standard amino acids either are used to synthesize proteins, other biomolecules, or are oxidized to
urea Urea, also known as carbamide, is an organic compound with chemical formula . This amide has two amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest amide of carbamic acid. Urea serves an important ...
and carbon dioxide as a source of energy. The oxidation pathway starts with the removal of the amino group by a
transaminase Transaminases or aminotransferases are enzymes that catalyze a transamination reaction between an amino acid and an α-keto acid. They are important in the synthesis of amino acids, which form proteins. Function and mechanism An amino acid co ...
; the amino group is then fed into the
urea cycle The urea cycle (also known as the ornithine cycle) is a cycle of biochemical reactions that produces urea (NH2)2CO from ammonia (NH3). Animals that use this cycle, mainly amphibians and mammals, are called ureotelic. The urea cycle converts highl ...
. The other product of transamidation is a
keto acid In organic chemistry, keto acids or ketoacids (also called oxo acids or oxoacids) are organic compounds that contain a carboxylic acid group () and a ketone group ().Franz Dietrich Klingler, Wolfgang Ebertz "Oxocarboxylic Acids" in Ullmann's ...
that enters the
citric acid cycle The citric acid cycle (CAC)—also known as the Krebs cycle or the TCA cycle (tricarboxylic acid cycle)—is a series of chemical reactions to release stored energy through the oxidation of acetyl-CoA derived from carbohydrates, fats, and protei ...
.
Glucogenic amino acid A glucogenic amino acid (or glucoplastic amino acid) is an amino acid that can be converted into glucose through gluconeogenesis. This is in contrast to the ketogenic amino acids, which are converted into ketone bodies. The production of glucose f ...
s can also be converted into glucose, through
gluconeogenesis Gluconeogenesis (GNG) is a metabolic pathway that results in the generation of glucose from certain non- carbohydrate carbon substrates. It is a ubiquitous process, present in plants, animals, fungi, bacteria, and other microorganisms. In vertebra ...
. Of the 20 standard amino acids, nine (
His His or HIS may refer to: Computing * Hightech Information System, a Hong Kong graphics card company * Honeywell Information Systems * Hybrid intelligent system * Microsoft Host Integration Server Education * Hangzhou International School, ...
,
Ile Ile may refer to: * iLe, a Puerto Rican singer * Ile District (disambiguation), multiple places * Ilé-Ifẹ̀, an ancient Yoruba city in south-western Nigeria * Interlingue (ISO 639:ile), a planned language * Isoleucine, an amino acid * Another ...
,
Leu Leu may refer to: Businesses and organisations * LEU, NYSE American stock symbol for Centrus Energy Corp. * London Ecology Unit, a former body (1986-2000) which advised London boroughs on environmental matters * Free and Equal (''LeU - Liberi e ...
, Lys, Met, Phe, Thr, Trp and
Val Val may refer to: Val-a Film * ''Val'' (film), an American documentary about Val Kilmer, directed by Leo Scott and Ting Poo Military equipment * Aichi D3A, a Japanese World War II dive bomber codenamed "Val" by the Allies * AS Val, a Sov ...
) are called
essential amino acid An essential amino acid, or indispensable amino acid, is an amino acid that cannot be synthesized from scratch by the organism fast enough to supply its demand, and must therefore come from the diet. Of the 21 amino acids common to all life form ...
s because the
human body The human body is the structure of a human being. It is composed of many different types of cells that together create tissues and subsequently organ systems. They ensure homeostasis and the viability of the human body. It comprises a head ...
cannot synthesize them from other compounds at the level needed for normal growth, so they must be obtained from food. In addition, cysteine,
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
, and
arginine Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the a ...
are considered semiessential amino acids, and taurine a semiessential aminosulfonic acid in children. The metabolic pathways that synthesize these monomers are not fully developed. The amounts required also depend on the age and health of the individual, so it is hard to make general statements about the dietary requirement for some amino acids. Dietary exposure to the nonstandard amino acid
BMAA β-Methylamino--alanine, or BMAA, is a non-proteinogenic amino acid produced by cyanobacteria. BMAA is a neurotoxin and its potential role in various neurodegenerative disorders is the subject of scientific research. Structure and properties ...
has been linked to human neurodegenerative diseases, including
ALS Amyotrophic lateral sclerosis (ALS), also known as motor neuron disease (MND) or Lou Gehrig's disease, is a neurodegenerative disease that results in the progressive loss of motor neurons that control voluntary muscles. ALS is the most comm ...
.


Non-protein functions

In humans, non-protein amino acids also have important roles as
metabolic intermediate Metabolic intermediates are molecules that are the precursors or metabolites of biologically significant molecules. Although these intermediates are of relatively minor direct importance to cellular function, they can play important roles in the a ...
s, such as in the biosynthesis of the
neurotransmitter A neurotransmitter is a signaling molecule secreted by a neuron to affect another cell across a synapse. The cell receiving the signal, any main body part or target cell, may be another neuron, but could also be a gland or muscle cell. Neurotr ...
gamma-aminobutyric acid (GABA). Many amino acids are used to synthesize other molecules, for example: *
Tryptophan Tryptophan (symbol Trp or W) is an α-amino acid that is used in the biosynthesis of proteins. Tryptophan contains an α-amino group, an α-carboxylic acid group, and a side chain indole, making it a polar molecule with a non-polar aromatic ...
is a precursor of the neurotransmitter
serotonin Serotonin () or 5-hydroxytryptamine (5-HT) is a monoamine neurotransmitter. Its biological function is complex and multifaceted, modulating mood, cognition, reward, learning, memory, and numerous physiological processes such as vomiting and vas ...
. *
Tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
(and its precursor phenylalanine) are precursors of the
catecholamine A catecholamine (; abbreviated CA) is a monoamine neurotransmitter, an organic compound that has a catechol (benzene with two hydroxyl side groups next to each other) and a side-chain amine. Catechol can be either a free molecule or a substi ...
neurotransmitter A neurotransmitter is a signaling molecule secreted by a neuron to affect another cell across a synapse. The cell receiving the signal, any main body part or target cell, may be another neuron, but could also be a gland or muscle cell. Neurotr ...
s
dopamine Dopamine (DA, a contraction of 3,4-dihydroxyphenethylamine) is a neuromodulatory molecule that plays several important roles in cells. It is an organic chemical of the catecholamine and phenethylamine families. Dopamine constitutes about 80% o ...
,
epinephrine Adrenaline, also known as epinephrine, is a hormone and medication which is involved in regulating visceral functions (e.g., respiration). It appears as a white microcrystalline granule. Adrenaline is normally produced by the adrenal glands and ...
and
norepinephrine Norepinephrine (NE), also called noradrenaline (NA) or noradrenalin, is an organic chemical in the catecholamine family that functions in the brain and body as both a hormone and neurotransmitter. The name "noradrenaline" (from Latin '' ad'', ...
and various
trace amine Trace amines are an endogenous group of trace amine-associated receptor 1 (TAAR1) agonists – and hence, monoaminergic neuromodulators – that are structurally and metabolically related to classical monoamine neurotransmitters. Compared to the ...
s. *
Phenylalanine Phenylalanine (symbol Phe or F) is an essential α-amino acid with the formula . It can be viewed as a benzyl group substituted for the methyl group of alanine, or a phenyl group in place of a terminal hydrogen of alanine. This essential amino a ...
is a precursor of
phenethylamine Phenethylamine (PEA) is an organic compound, natural monoamine alkaloid, and trace amine, which acts as a central nervous system stimulant in humans. In the brain, phenethylamine regulates monoamine neurotransmission by binding to trace ami ...
and tyrosine in humans. In plants, it is a precursor of various
phenylpropanoid The phenylpropanoids are a diverse family of organic compounds that are synthesized by plants from the amino acids phenylalanine and tyrosine. Their name is derived from the six-carbon, aromatic phenyl group and the three-carbon propene tail of ...
s, which are important in plant metabolism. *
Glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogen ...
is a precursor of
porphyrin Porphyrins ( ) are a group of heterocyclic macrocycle organic compounds, composed of four modified pyrrole subunits interconnected at their α carbon atoms via methine bridges (=CH−). The parent of porphyrin is porphine, a rare chemical compo ...
s such as
heme Heme, or haem (pronounced / hi:m/ ), is a precursor to hemoglobin, which is necessary to bind oxygen in the bloodstream. Heme is biosynthesized in both the bone marrow and the liver. In biochemical terms, heme is a coordination complex "consist ...
. *
Arginine Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the a ...
is a precursor of
nitric oxide Nitric oxide (nitrogen oxide or nitrogen monoxide) is a colorless gas with the formula . It is one of the principal oxides of nitrogen. Nitric oxide is a free radical: it has an unpaired electron, which is sometimes denoted by a dot in its ch ...
. *
Ornithine Ornithine is a non-proteinogenic amino acid that plays a role in the urea cycle. Ornithine is abnormally accumulated in the body in ornithine transcarbamylase deficiency. The radical is ornithyl. Role in urea cycle L-Ornithine is one of the produc ...
and ''S''-adenosylmethionine are precursors of
polyamine A polyamine is an organic compound having more than two amino groups. Alkyl polyamines occur naturally, but some are synthetic. Alkylpolyamines are colorless, hygroscopic, and water soluble. Near neutral pH, they exist as the ammonium derivatives. ...
s. *
Aspartate Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
,
glycine Glycine (symbol Gly or G; ) is an amino acid that has a single hydrogen atom as its side chain. It is the simplest stable amino acid ( carbamic acid is unstable), with the chemical formula NH2‐ CH2‐ COOH. Glycine is one of the proteinogen ...
, and
glutamine Glutamine (symbol Gln or Q) is an α-amino acid that is used in the biosynthesis of proteins. Its side chain is similar to that of glutamic acid, except the carboxylic acid group is replaced by an amide. It is classified as a charge-neutral, ...
are precursors of
nucleotide Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules with ...
s. However, not all of the functions of other abundant nonstandard amino acids are known. Some nonstandard amino acids are used as defenses against herbivores in plants. For example,
canavanine L-(+)-(''S'')-Canavanine is a non-proteinogenic amino acid found in certain leguminous plants. It is structurally related to the proteinogenic α-amino acid L-arginine, the sole difference being the replacement of a methylene bridge (-- unit) in ...
is an analogue of
arginine Arginine is the amino acid with the formula (H2N)(HN)CN(H)(CH2)3CH(NH2)CO2H. The molecule features a guanidino group appended to a standard amino acid framework. At physiological pH, the carboxylic acid is deprotonated (−CO2−) and both the a ...
that is found in many
legume A legume () is a plant in the family Fabaceae (or Leguminosae), or the fruit or seed of such a plant. When used as a dry grain, the seed is also called a pulse. Legumes are grown agriculturally, primarily for human consumption, for livestock for ...
s, and in particularly large amounts in ''
Canavalia gladiata ''Canavalia gladiata'', the sword bean or scimitar bean, is a domesticated plant species in the legume family Fabaceae. It is used as a vegetable in interior central and south central India, though not commercially farmed. The unripe pods are al ...
'' (sword bean). This amino acid protects the plants from predators such as insects and can cause illness in people if some types of legumes are eaten without processing. The non-protein amino acid
mimosine Mimosine or leucenol is a toxic non-protein amino acid chemically similar to tyrosine. It occurs in some ''Mimosa'' spp. (including '' M. pudica'') and all members of the closely related genus '' Leucaena''. This compound, also known as leucenol ...
is found in other species of legume, in particular ''
Leucaena leucocephala ''Leucaena leucocephala'' is a small fast-growing mimosoid tree native to southern Mexico and northern Central America (Belize and Guatemala) and is now naturalized throughout the tropics including parts of Asia. Common names include jumbay, ...
''. This compound is an analogue of
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
and can poison animals that graze on these plants.


Uses in industry


Fertilizer

The chelating ability of amino acids is sometimes used in fertilizers to facilitate the delivery of minerals to plants in order to correct mineral deficiencies, such as iron chlorosis. These fertilizers are also used to prevent deficiencies from occurring and to improve the overall health of the plants.


Animal feed

Amino acids are sometimes added to
animal feed Animal feed is food given to domestic animals, especially livestock, in the course of animal husbandry. There are two basic types: fodder and forage. Used alone, the word ''feed'' more often refers to fodder. Animal feed is an important input to ...
because some of the components of these feeds, such as
soybean The soybean, soy bean, or soya bean (''Glycine max'') is a species of legume native to East Asia, widely grown for its edible bean, which has numerous uses. Traditional unfermented food uses of soybeans include soy milk, from which tofu an ...
s, have low levels of some of the
essential amino acid An essential amino acid, or indispensable amino acid, is an amino acid that cannot be synthesized from scratch by the organism fast enough to supply its demand, and must therefore come from the diet. Of the 21 amino acids common to all life form ...
s, especially of lysine, methionine, threonine, and tryptophan. Likewise amino acids are used to chelate metal cations in order to improve the absorption of minerals from feed supplements.


Food

The
food industry The food industry is a complex, global network of diverse businesses that supplies most of the food consumed by the world's population. The food industry today has become highly diversified, with manufacturing ranging from small, traditional, ...
is a major consumer of amino acids, especially
glutamic acid Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
, which is used as a
flavor enhancer A flavoring (or flavouring), also known as flavor (or flavour) or flavorant, is a food additive used to improve the taste or smell of food. It changes the perceptual impression of food as determined primarily by the chemoreceptors of the gustat ...
, and
aspartame Aspartame is an artificial non-saccharide sweetener 200 times sweeter than sucrose and is commonly used as a sugar substitute in foods and beverages. It is a methyl ester of the aspartic acid/phenylalanine dipeptide with the trade nam ...
(aspartylphenylalanine 1-methyl ester), which is used as an
artificial sweetener A sugar substitute is a food additive that provides a sweetness like that of sugar while containing significantly less food energy than sugar-based sweeteners, making it a zero-calorie () or low-calorie sweetener. Artificial sweeteners may be d ...
. Amino acids are sometimes added to food by manufacturers to alleviate symptoms of mineral deficiencies, such as anemia, by improving mineral absorption and reducing negative side effects from inorganic mineral supplementation.


Pharmaceuticals and cosmetics

Similarly, some amino acids derivatives are used in pharmaceutical industry. They include
5-HTP 5-Hydroxytryptophan (5-HTP), also known as oxitriptan, is a naturally occurring amino acid and chemical precursor as well as a metabolic intermediate in the biosynthesis of the neurotransmitter serotonin. Uses 5-HTP is sold over the counter ...
(5-hydroxytryptophan) used for experimental treatment of depression, L-DOPA (L-dihydroxyphenylalanine) for
Parkinson's Parkinson's disease (PD), or simply Parkinson's, is a long-term degenerative disorder of the central nervous system that mainly affects the motor system. The symptoms usually emerge slowly, and as the disease worsens, non-motor symptoms becom ...
treatment, and
eflornithine Eflornithine, sold under the brand name Vaniqa among others, is a medication used to treat African trypanosomiasis (sleeping sickness) and excessive hair growth on the face in women. Specifically it is used for the 2nd stage of sleeping sickness ...
drug that inhibits
ornithine decarboxylase The enzyme ornithine decarboxylase (, ODC) catalyzes the decarboxylation of ornithine (a product of the urea cycle) to form putrescine. This reaction is the committed step in polyamine synthesis. In humans, this protein has 461 amino acids and fo ...
and used in the treatment of
sleeping sickness African trypanosomiasis, also known as African sleeping sickness or simply sleeping sickness, is an insect-borne parasitic infection of humans and other animals. It is caused by the species ''Trypanosoma brucei''. Humans are infected by two ty ...
. Amino acids are used in the synthesis of some
cosmetics Cosmetics are constituted mixtures of chemical compounds derived from either natural sources, or synthetically created ones. Cosmetics have various purposes. Those designed for personal care and skin care can be used to cleanse or protect ...
.


Expanded genetic code

Since 2001, 40 non-natural amino acids have been added into protein by creating a unique codon (recoding) and a corresponding transfer-RNA:aminoacyl – tRNA-synthetase pair to encode it with diverse physicochemical and biological properties in order to be used as a tool to exploring
protein structure Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers specifically polypeptides formed from sequences of amino acids, the monomers of the polymer. A single amino acid monomer ma ...
and function or to create novel or enhanced proteins.


Nullomers

Nullomers are codons that in theory code for an amino acid, however, in nature there is a selective bias against using this codon in favor of another, for example bacteria prefer to use CGA instead of AGA to code for arginine. This creates some sequences that do not appear in the genome. This characteristic can be taken advantage of and used to create new selective cancer-fighting drugs and to prevent cross-contamination of DNA samples from crime-scene investigations.


Chemical building blocks

Amino acids are important as low-cost
feedstock A raw material, also known as a feedstock, unprocessed material, or primary commodity, is a basic material that is used to produce goods, finished goods, energy, or intermediate materials that are feedstock for future finished products. As feedst ...
s. These compounds are used in
chiral pool synthesis The chiral pool is a "collection of abundant enantiopure building blocks provided by nature" used in synthesis. In other words, a chiral pool would be a large quantity of common organic enantiomers. Contributors to the chiral pool are amino acids, s ...
as enantiomerically pure building blocks. Amino acids have been investigated as precursors chiral catalysts, such as for asymmetric
hydrogenation Hydrogenation is a chemical reaction between molecular hydrogen (H2) and another compound or element, usually in the presence of a catalyst such as nickel, palladium or platinum. The process is commonly employed to reduce or saturate organic c ...
reactions, although no commercial applications exist.


Biodegradable plastics

Amino acids have been considered as components of biodegradable polymers, which have applications as
environmentally friendly Environment friendly processes, or environmental-friendly processes (also referred to as eco-friendly, nature-friendly, and green), are sustainability and marketing terms referring to goods and services, laws, guidelines and policies that clai ...
packaging and in medicine in
drug delivery Drug delivery refers to approaches, formulations, manufacturing techniques, storage systems, and technologies involved in transporting a pharmaceutical compound to its target site to achieve a desired therapeutic effect. Principles related to d ...
and the construction of prosthetic implants. An interesting example of such materials is polyaspartate, a water-soluble biodegradable polymer that may have applications in disposable
diaper A diaper /ˈdaɪpə(r)/ ( American and Canadian English) or a nappy (Australian English, British English, and Hiberno-English) is a type of underwear that allows the wearer to urinate or defecate without using a toilet, by absorbing or conta ...
s and agriculture. Due to its solubility and ability to
chelate Chelation is a type of bonding of ions and molecules to metal ions. It involves the formation or presence of two or more separate coordinate bonds between a polydentate (multiple bonded) ligand and a single central metal atom. These ligands are ...
metal ions, polyaspartate is also being used as a biodegradable anti
scaling Scaling may refer to: Science and technology Mathematics and physics * Scaling (geometry), a linear transformation that enlarges or diminishes objects * Scale invariance, a feature of objects or laws that do not change if scales of length, energ ...
agent and a
corrosion inhibitor In chemistry, a corrosion inhibitor or anti-corrosive is a chemical compound that, when added to a liquid or gas, decreases the corrosion rate of a material, typically a metal or an alloy, that comes into contact with the fluid. The effectivene ...
. In addition, the aromatic amino acid
tyrosine -Tyrosine or tyrosine (symbol Tyr or Y) or 4-hydroxyphenylalanine is one of the 20 standard amino acids that are used by cells to synthesize proteins. It is a non-essential amino acid with a polar side group. The word "tyrosine" is from the Gr ...
has been considered as a possible replacement for
phenols In organic chemistry, phenols, sometimes called phenolics, are a class of chemical compounds consisting of one or more hydroxyl groups (— O H) bonded directly to an aromatic hydrocarbon group. The simplest is phenol, . Phenolic compounds are ...
such as
bisphenol A Bisphenol A (BPA) is a chemical compound primarily used in the manufacturing of various plastics. It is a colourless solid which is soluble in most common organic solvents, but has very poor solubility in water. BPA is produced on an industrial s ...
in the manufacture of
polycarbonate Polycarbonates (PC) are a group of thermoplastic polymers containing carbonate groups in their chemical structures. Polycarbonates used in engineering are strong, tough materials, and some grades are optically transparent. They are easily work ...
s.


Synthesis


Chemical synthesis

The commercial production of amino acids usually relies on mutant bacteria that overproduce individual amino acids using glucose as a carbon source. Some amino acids are produced by enzymatic conversions of synthetic intermediates.
2-Aminothiazoline-4-carboxylic acid 2-Aminothiazoline-4-carboxylic acid (ACTA) is the organosulfur compound and a heterocycle with the formula HO2CCHCH2SCNH2N. This derivative of thiazoline is an intermediate in the industrial synthesis of L- cysteine, an amino acid. ACTA exists i ...
is an intermediate in one industrial synthesis of L-cysteine for example.
Aspartic acid Aspartic acid (symbol Asp or D; the ionic form is known as aspartate), is an α-amino acid that is used in the biosynthesis of proteins. Like all other amino acids, it contains an amino group and a carboxylic acid. Its α-amino group is in the pro ...
is produced by the addition of ammonia to
fumarate Fumaric acid is an organic compound with the formula HO2CCH=CHCO2H. A white solid, fumaric acid occurs widely in nature. It has a fruit-like taste and has been used as a food additive. Its E number is E297. The salts and esters are known as fum ...
using a lyase.


Biosynthesis

In plants, nitrogen is first assimilated into organic compounds in the form of
glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
, formed from alpha-ketoglutarate and ammonia in the mitochondrion. For other amino acids, plants use
transaminase Transaminases or aminotransferases are enzymes that catalyze a transamination reaction between an amino acid and an α-keto acid. They are important in the synthesis of amino acids, which form proteins. Function and mechanism An amino acid co ...
s to move the amino group from glutamate to another alpha-keto acid. For example, aspartate aminotransferase converts glutamate and oxaloacetate to alpha-ketoglutarate and aspartate. Other organisms use transaminases for amino acid synthesis, too. Nonstandard amino acids are usually formed through modifications to standard amino acids. For example,
homocysteine Homocysteine is a non-proteinogenic α-amino acid. It is a homologue of the amino acid cysteine, differing by an additional methylene bridge (-CH2-). It is biosynthesized from methionine by the removal of its terminal Cε methyl group. In the ...
is formed through the
transsulfuration pathway The transsulfuration pathway is a metabolic pathway involving the interconversion of cysteine and homocysteine through the intermediate cystathionine. Two transsulfurylation pathways are known: the ''forward'' and the ''reverse''. The ''forward ...
or by the demethylation of methionine via the intermediate metabolite ''S''-adenosylmethionine, while
hydroxyproline (2''S'',4''R'')-4-Hydroxyproline, or L-hydroxyproline ( C5 H9 O3 N), is an amino acid, abbreviated as Hyp or O, ''e.g.'', in Protein Data Bank. Structure and discovery In 1902, Hermann Emil Fischer isolated hydroxyproline from hydrolyzed gelatin. ...
is made by a
post translational modification Post-translational modification (PTM) is the covalent and generally enzymatic modification of proteins following protein biosynthesis. This process occurs in the endoplasmic reticulum and the golgi apparatus. Proteins are synthesized by ribosom ...
of
proline Proline (symbol Pro or P) is an organic acid classed as a proteinogenic amino acid (used in the biosynthesis of proteins), although it does not contain the amino group but is rather a secondary amine. The secondary amine nitrogen is in the prot ...
.
Microorganism A microorganism, or microbe,, ''mikros'', "small") and ''organism'' from the el, ὀργανισμός, ''organismós'', "organism"). It is usually written as a single word but is sometimes hyphenated (''micro-organism''), especially in olde ...
s and plants synthesize many uncommon amino acids. For example, some microbes make
2-aminoisobutyric acid 2-Aminoisobutyric acid (also known as α-aminoisobutyric acid, AIB, α-methylalanine, or 2-methylalanine) is the non-proteinogenic amino acid with the structural formula H2N-C(CH3)2-COOH. It is rare in nature, having been only found in meteorites, ...
and
lanthionine Lanthionine is a nonproteinogenic amino acid with the chemical formula (HOOC-CH(NH2)-CH2-S-CH2-CH(NH2)-COOH). It is typically formed by a cysteine residue and a dehydrated serine residue. Despite its name, lanthionine does not contain the element ...
, which is a sulfide-bridged derivative of alanine. Both of these amino acids are found in peptidic
lantibiotics Lantibiotics are a class of polycyclic peptide antibiotics that contain the characteristic thioether amino acids lanthionine or methyllanthionine, as well as the unsaturated amino acids dehydroalanine, and 2-aminoisobutyric acid. They belong to ...
such as alamethicin. However, in plants,
1-aminocyclopropane-1-carboxylic acid 1-Aminocyclopropane-1-carboxylic acid (ACC) is a disubstituted cyclic α-amino acid in which a cyclopropane ring is fused to the C atom of the amino acid. It is a white solid. Many cyclopropane-substituted amino acids are known, but this one occ ...
is a small disubstituted cyclic amino acid that is an intermediate in the production of the plant hormone
ethylene Ethylene ( IUPAC name: ethene) is a hydrocarbon which has the formula or . It is a colourless, flammable gas with a faint "sweet and musky" odour when pure. It is the simplest alkene (a hydrocarbon with carbon-carbon double bonds). Ethylene ...
.


Primordial synthesis

The formation of amino acids and peptides are assumed to precede and perhaps induce the emergence of life on earth. Amino acids can form from simple precursors under various conditions. Surface-based chemical metabolism of amino acids and very small compounds may have led to the build-up of amino acids, coenzymes and phosphate-based small carbon molecules. Amino acids and similar building blocks could have been elaborated into proto-
peptide Peptides (, ) are short chains of amino acids linked by peptide bonds. Long chains of amino acids are called proteins. Chains of fewer than twenty amino acids are called oligopeptides, and include dipeptides, tripeptides, and tetrapeptides. A p ...
s, with peptides being considered key players in the origin of life. In the famous Urey-Miller experiment, the passage of an electric arc through a mixture of methane, hydrogen, and ammonia produces a large number of amino acids. Since then, scientists have discovered a range of ways and components by which the potentially prebiotic formation and chemical evolution of peptides may have occurred, such as condensing agents, the design of self-replicating peptides and a number of non-enzymatic mechanisms by which amino acids could have emerged and elaborated into peptides. Several hypotheses invoke the
Strecker synthesis The Strecker amino acid synthesis, also known simply as the Strecker synthesis, is a method for the synthesis of amino acids by the reaction of an aldehyde with ammonia in the presence of potassium cyanide. The condensation reaction yields an α- ...
whereby hydrogen cyanide, simple aldehydes, ammonia, and water produce amino acids. According to a review, amino acids, and even peptides, "turn up fairly regularly in the various experimental broths that have been allowed to be cooked from simple chemicals. This is because
nucleotide Nucleotides are organic molecules consisting of a nucleoside and a phosphate. They serve as monomeric units of the nucleic acid polymers – deoxyribonucleic acid (DNA) and ribonucleic acid (RNA), both of which are essential biomolecules with ...
s are far more difficult to synthesize chemically than amino acids." For a chronological order, it suggests that there must have been a 'protein world' or at least a 'polypeptide world', possibly later followed by the '
RNA world The RNA world is a hypothetical stage in the evolutionary history of life on Earth, in which self-replicating RNA molecules proliferated before the evolution of DNA and proteins. The term also refers to the hypothesis that posits the existence ...
' and the ' DNA world'.
Codon The genetic code is the set of rules used by living cells to translate information encoded within genetic material ( DNA or RNA sequences of nucleotide triplets, or codons) into proteins. Translation is accomplished by the ribosome, which links p ...
–amino acids mappings may be the
biological Biology is the scientific study of life. It is a natural science with a broad scope but has several unifying themes that tie it together as a single, coherent field. For instance, all organisms are made up of cells that process hereditary in ...
information system at the primordial origin of life on Earth. While amino acids and consequently simple peptides must have formed under different experimentally probed geochemical scenarios, the transition from an abiotic world to the first life forms is to a large extent still unresolved.


Reactions

Amino acids undergo the reactions expected of the constituent functional groups.


Peptide bond formation

As both the amine and carboxylic acid groups of amino acids can react to form amide bonds, one amino acid molecule can react with another and become joined through an amide linkage. This
polymerization In polymer chemistry, polymerization (American English), or polymerisation (British English), is a process of reacting monomer molecules together in a chemical reaction to form polymer chains or three-dimensional networks. There are many for ...
of amino acids is what creates proteins. This
condensation reaction In organic chemistry, a condensation reaction is a type of chemical reaction in which two molecules are combined to form a single molecule, usually with the loss of a small molecule such as water. If water is lost, the reaction is also known as a ...
yields the newly formed peptide bond and a molecule of water. In cells, this reaction does not occur directly; instead, the amino acid is first activated by attachment to a
transfer RNA Transfer RNA (abbreviated tRNA and formerly referred to as sRNA, for soluble RNA) is an adaptor molecule composed of RNA, typically 76 to 90 nucleotides in length (in eukaryotes), that serves as the physical link between the mRNA and the amino ac ...
molecule through an
ester In chemistry, an ester is a compound derived from an oxoacid (organic or inorganic) in which at least one hydroxyl group () is replaced by an alkoxy group (), as in the substitution reaction of a carboxylic acid and an alcohol. Glycerides are f ...
bond. This aminoacyl-tRNA is produced in an ATP-dependent reaction carried out by an
aminoacyl tRNA synthetase An aminoacyl-tRNA synthetase (aaRS or ARS), also called tRNA-ligase, is an enzyme that attaches the appropriate amino acid onto its corresponding tRNA. It does so by catalyzing the transesterification of a specific cognate amino acid or its pre ...
. This aminoacyl-tRNA is then a substrate for the ribosome, which catalyzes the attack of the amino group of the elongating protein chain on the ester bond. As a result of this mechanism, all proteins made by ribosomes are synthesized starting at their ''N''-terminus and moving toward their ''C''-terminus. However, not all peptide bonds are formed in this way. In a few cases, peptides are synthesized by specific enzymes. For example, the tripeptide
glutathione Glutathione (GSH, ) is an antioxidant in plants, animals, fungi, and some bacteria and archaea. Glutathione is capable of preventing damage to important cellular components caused by sources such as reactive oxygen species, free radicals, per ...
is an essential part of the defenses of cells against oxidative stress. This peptide is synthesized in two steps from free amino acids. In the first step,
gamma-glutamylcysteine synthetase γ -L-Glutamyl-L-cysteine, also known as γ-glutamylcysteine (GGC), is a dipeptide found in animals, plants, fungi, some bacteria, and archaea. It has a relatively unusual γ-bond between the constituent amino acids, L-glutamic acid and L-cyst ...
condenses cysteine and
glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
through a peptide bond formed between the side chain carboxyl of the glutamate (the gamma carbon of this side chain) and the amino group of the cysteine. This dipeptide is then condensed with glycine by
glutathione synthetase Glutathione synthetase (GSS) () is the second enzyme in the glutathione (GSH) biosynthesis pathway. It catalyses the condensation of gamma-glutamylcysteine and glycine, to form glutathione. Glutathione synthetase is also a potent antioxidant ...
to form glutathione. In chemistry, peptides are synthesized by a variety of reactions. One of the most-used in solid-phase peptide synthesis uses the aromatic oxime derivatives of amino acids as activated units. These are added in sequence onto the growing peptide chain, which is attached to a solid resin support. Libraries of peptides are used in drug discovery through
high-throughput screening High-throughput screening (HTS) is a method for scientific experimentation especially used in drug discovery and relevant to the fields of biology, materials science and chemistry. Using robotics, data processing/control software, liquid handling ...
. The combination of functional groups allow amino acids to be effective polydentate ligands for metal–amino acid chelates. The multiple side chains of amino acids can also undergo chemical reactions.


Catabolism

Degradation of an amino acid often involves
deamination Deamination is the removal of an amino group from a molecule. Enzymes that catalyse this reaction are called deaminases. In the human body, deamination takes place primarily in the liver, however it can also occur in the kidney. In situations of ...
by moving its amino group to alpha-ketoglutarate, forming
glutamate Glutamic acid (symbol Glu or E; the ionic form is known as glutamate) is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is a non-essential nutrient for humans, meaning that the human body can syn ...
. This process involves transaminases, often the same as those used in amination during synthesis. In many vertebrates, the amino group is then removed through the
urea cycle The urea cycle (also known as the ornithine cycle) is a cycle of biochemical reactions that produces urea (NH2)2CO from ammonia (NH3). Animals that use this cycle, mainly amphibians and mammals, are called ureotelic. The urea cycle converts highl ...
and is excreted in the form of
urea Urea, also known as carbamide, is an organic compound with chemical formula . This amide has two amino groups (–) joined by a carbonyl functional group (–C(=O)–). It is thus the simplest amide of carbamic acid. Urea serves an important ...
. However, amino acid degradation can produce
uric acid Uric acid is a heterocyclic compound of carbon, nitrogen, oxygen, and hydrogen with the formula C5H4N4O3. It forms ions and salts known as urates and acid urates, such as ammonium acid urate. Uric acid is a product of the metabolic breakdown o ...
or ammonia instead. For example,
serine dehydratase Serine dehydratase or L-serine ammonia lyase (SDH) is in the β-family of pyridoxal phosphate-dependent (PLP) enzymes. SDH is found widely in nature, but its structural and properties vary among species. SDH is found in yeast, bacteria, and the ...
converts serine to pyruvate and ammonia. After removal of one or more amino groups, the remainder of the molecule can sometimes be used to synthesize new amino acids, or it can be used for energy by entering
glycolysis Glycolysis is the metabolic pathway that converts glucose () into pyruvate (). The free energy released in this process is used to form the high-energy molecules adenosine triphosphate (ATP) and reduced nicotinamide adenine dinucleotide (NADH) ...
or the
citric acid cycle The citric acid cycle (CAC)—also known as the Krebs cycle or the TCA cycle (tricarboxylic acid cycle)—is a series of chemical reactions to release stored energy through the oxidation of acetyl-CoA derived from carbohydrates, fats, and protei ...
, as detailed in image at right.


Complexation

Amino acids are bidentate ligands, forming transition metal amino acid complexes. :


Chemical analysis

The total nitrogen content of organic matter is mainly formed by the amino groups in proteins. The Total Kjeldahl Nitrogen (
TKN The Kjeldahl method or Kjeldahl digestion () in analytical chemistry is a method for the quantitative determination of nitrogen contained in organic substances plus the nitrogen contained in the inorganic compounds ammonia and ammonium (NH3/NH4+) ...
) is a measure of nitrogen widely used in the analysis of (waste) water, soil, food, feed and organic matter in general. As the name suggests, the
Kjeldahl method The Kjeldahl method or Kjeldahl digestion () in analytical chemistry is a method for the quantitative determination of nitrogen contained in organic substances plus the nitrogen contained in the inorganic compounds ammonia and ammonium (NH3/NH4+). ...
is applied. More sensitive methods are available.


See also

* Amino acid dating * Beta-peptide *
Degron A degron is a portion of a protein that is important in regulation of protein degradation rates. Known degrons include short amino acid sequences, structural motifs and exposed amino acids (often Lysine or Arginine) located anywhere in the prote ...
* Erepsin *
Homochirality Homochirality is a uniformity of chirality, or handedness. Objects are chiral when they cannot be superposed on their mirror images. For example, the left and right hands of a human are approximately mirror images of each other but are not their ow ...
* Hyperaminoacidemia * Leucines *
Miller–Urey experiment The Miller–Urey experiment (or Miller experiment) is a famous chemistry experiment that simulated the conditions thought at the time (1952) to be present in the atmosphere of the early, prebiotic Earth, in order to test the hypothesis of the ...
*
Nucleic acid sequence A nucleic acid sequence is a succession of bases signified by a series of a set of five different letters that indicate the order of nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. By convention, sequences are usua ...
* RNA codon table


Notes


References


Further reading

* * * *


External links

* {{DEFAULTSORT:Amino Acid Nitrogen cycle Zwitterions