In biology, histones are highly basic proteins, abundant in lysine and arginine residues, that are found in eukaryotic cell nuclei. They act as spools around which DNA winds to create structural units called nucleosomes. Nucleosomes in turn are wrapped into 30-nanometer fibers that form tightly packed chromatin. Histones prevent DNA from becoming tangled and protect it from DNA damage. In addition, histones play important roles in gene regulation and DNA replication. Without histones, unwound DNA in chromosomes would be very long. For example, each human cell has about 1.8 meters of DNA if completely stretched out; when wound about histones, this length is reduced to about 90 micrometers (0.09 mm) of 30 nm diameter chromatin fibers.

There are five families of histones, designated H1/H5 (linker histones) and H2A, H2B, H3, and H4 (core histones). The nucleosome core is formed of two H2A-H2B dimers and an H3-H4 tetramer. The tight wrapping of DNA around histones is to a large degree a result of electrostatic attraction between the positively charged histones and the negatively charged phosphate backbone of DNA.

Histones may be chemically modified through the action of enzymes to regulate gene transcription. The most common modifications are the methylation of arginine or lysine residues and the acetylation of lysine. Methylation can affect how other proteins, such as transcription factors, interact with the nucleosomes. Lysine acetylation eliminates a positive charge on lysine, thereby weakening the electrostatic attraction between histone and DNA; this results in partial unwinding of the DNA, making it more accessible for gene expression.
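
The two lengths quoted above imply a compaction ratio that is easy to check by hand. The short Python sketch below does the division using only those two figures (1.8 m of stretched DNA and 90 micrometers of 30 nm fiber); the function and constant names are made up for illustration.

```python
def compaction_ratio(stretched_m: float, packed_m: float) -> float:
    """Return how many times shorter the packed form is than the stretched form."""
    if stretched_m <= 0 or packed_m <= 0:
        raise ValueError("lengths must be positive")
    return stretched_m / packed_m


# Figures taken from the text above, converted to meters.
STRETCHED_DNA_M = 1.8       # DNA per human cell, fully stretched out
FIBER_LENGTH_M = 90e-6      # same DNA packed as 30 nm chromatin fiber

ratio = compaction_ratio(STRETCHED_DNA_M, FIBER_LENGTH_M)
print(f"about {ratio:,.0f}-fold shorter")
```

With these inputs the ratio is about 20,000-fold. It covers only packing into the 30 nm fiber; the higher-order structures described later in the article would compact the DNA further.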


Classes and variants

Five major families of histones exist: H1/H5, H2A, H2B, H3, and H4. Histones H2A, H2B, H3 and H4 are known as the core histones, while histones H1/H5 are known as the linker histones. The core histones all exist as dimers, which are similar in that they all possess the histone fold domain: three alpha helices linked by two loops. It is this helical structure that allows for interaction between distinct dimers, particularly in a head-tail fashion (also called the handshake motif). The resulting four distinct dimers then come together to form one octameric nucleosome core, approximately 63 Angstroms in diameter (a solenoid-like particle). Around 146 base pairs (bp) of DNA wrap around this core particle 1.65 times in a left-handed super-helical turn to give a particle of around 100 Angstroms across. The linker histone H1 binds the nucleosome at the entry and exit sites of the DNA, thus locking the DNA into place and allowing the formation of higher-order structure. The most basic such formation is the 10 nm fiber, or beads-on-a-string conformation. This involves the wrapping of DNA around nucleosomes with approximately 50 base pairs of DNA separating each pair of nucleosomes (also referred to as linker DNA). Higher-order structures include the 30 nm fiber (forming an irregular zigzag) and the 100 nm fiber, these being the structures found in normal cells. During mitosis and meiosis, the condensed chromosomes are assembled through interactions between nucleosomes and other regulatory proteins.

Histones are subdivided into canonical replication-dependent histones, which are expressed during the S-phase of the cell cycle, and replication-independent histone variants, which are expressed during the whole cell cycle. In animals, genes encoding canonical histones are typically clustered along the chromosome, lack introns, and use a stem-loop structure at the 3' end instead of a polyA tail. Genes encoding histone variants are usually not clustered and have introns, and their mRNAs are regulated with polyA tails. Complex multicellular organisms typically have a higher number of histone variants providing a variety of different functions. Data are accumulating about the roles of diverse histone variants, highlighting the functional links between variants and the delicate regulation of organism development. Histone variants from different organisms, their classification, and variant-specific features can be found in the "HistoneDB 2.0 - Variants" database. The following is a list of human histone proteins:


Structure

The nucleosome core is formed of two H2A-H2B dimers and an H3-H4 tetramer, forming two nearly symmetrical halves by tertiary structure (C2 symmetry; each half is related to the other by a twofold rotation). The H2A-H2B dimers and H3-H4 tetramer also show pseudodyad symmetry. The four core histones (H2A, H2B, H3 and H4) are relatively similar in structure and are highly conserved through evolution, all featuring a "helix turn helix turn helix" motif (a DNA-binding protein motif that recognizes specific DNA sequences). They also share the feature of long tails on one end of the amino acid structure; these tails are the location of post-translational modification (see below).

Archaeal histones contain only an H3-H4-like dimeric structure made out of the same protein. Such dimeric structures can stack into a tall superhelix ("hypernucleosome") onto which DNA coils in a manner similar to nucleosome spools. Only some archaeal histones have tails.

The distance between the spools around which eukaryotic cells wind their DNA has been determined to range from 59 to 70 Å.

In all, histones make five types of interactions with DNA:
* Salt bridges and hydrogen bonds between side chains of basic amino acids (especially lysine and arginine) and phosphate oxygens on DNA
* Helix dipoles in H2B, H3, and H4, which cause a net positive charge to accumulate at the point of interaction with negatively charged phosphate groups on DNA
* Hydrogen bonds between the DNA backbone and the amide group on the main chain of histone proteins
* Nonpolar interactions between the histone and deoxyribose sugars on DNA
* Non-specific minor groove insertions of the H3 and H2B N-terminal tails into two minor grooves each on the DNA molecule

The highly basic nature of histones, aside from facilitating DNA-histone interactions, contributes to their water solubility.

Histones are subject to post-translational modification by enzymes, primarily on their N-terminal tails but also in their globular domains. Such modifications include methylation, acetylation, phosphorylation, SUMOylation, ubiquitination, and ADP-ribosylation. These modifications affect their function in gene regulation.

In general, genes that are active have less bound histone, while inactive genes are highly associated with histones during interphase. It also appears that the structure of histones has been evolutionarily conserved, as any deleterious mutations would be severely maladaptive. All histones have a highly positively charged N-terminus with many lysine and arginine residues.
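
To make the charge argument concrete, the Python sketch below estimates a crude net charge for a peptide by scoring lysine (K) and arginine (R) as +1 and aspartate (D) and glutamate (E) as -1, and by treating an acetylated lysine as neutral, in line with the statement that acetylation eliminates a positive charge. The scoring rule is a deliberate simplification (it ignores histidine, the chain termini, and pH), and `TOY_TAIL` is a hypothetical sequence invented for the example, not a real histone tail.

```python
from typing import Iterable


def net_charge(sequence: str, acetylated: Iterable[int] = ()) -> int:
    """Crude net charge: K/R = +1, D/E = -1, acetylated K = 0.

    Positions in `acetylated` are 1-based, counted from the N-terminus,
    and must point at lysine residues.
    """
    seq = sequence.upper()
    acetylated_set = set(acetylated)
    for pos in acetylated_set:
        if not 1 <= pos <= len(seq) or seq[pos - 1] != "K":
            raise ValueError(f"position {pos} is not a lysine")

    charge = 0
    for pos, residue in enumerate(seq, start=1):
        if residue in ("K", "R"):
            if residue == "K" and pos in acetylated_set:
                continue  # acetylation removes the lysine's positive charge
            charge += 1
        elif residue in ("D", "E"):
            charge -= 1
    return charge


# Hypothetical tail-like peptide, made up for illustration only.
TOY_TAIL = "AKRGKSTKRAEK"

print(net_charge(TOY_TAIL))                     # unmodified peptide
print(net_charge(TOY_TAIL, acetylated={5, 8}))  # two lysines acetylated
```

Each acetylated lysine lowers the estimate by one unit, which is the effect the text associates with weaker histone-DNA attraction.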


Evolution and species distribution

Core histones are found in the nuclei of eukaryotic cells and in most archaeal phyla, but not in bacteria. However, the linker histones have homologs in bacteria. The unicellular algae known as dinoflagellates were previously thought to be the only eukaryotes that completely lack histones; later studies showed that their DNA still encodes histone genes. Unlike the core histones, lysine-rich linker histone (H1) proteins are found in bacteria, where they are known as nucleoprotein HC1/HC2.

It has been proposed that histone proteins are evolutionarily related to the helical part of the extended AAA+ ATPase domain, the C-domain, and to the N-terminal substrate recognition domain of Clp/Hsp100 proteins. Despite the differences in their topology, these three folds share a homologous helix-strand-helix (HSH) motif.

Archaeal histones may well resemble the evolutionary precursors to eukaryotic histones. Furthermore, the nucleosome (core) histones may have evolved from ribosomal proteins, with which they share much in common, both being short and basic proteins. Histone proteins are among the most highly conserved proteins in eukaryotes, emphasizing their important role in the biology of the nucleus. In contrast, mature sperm cells largely use protamines to package their genomic DNA, most likely because this allows them to achieve an even higher packaging ratio.

There are some variant forms in some of the major classes. They share amino acid sequence homology and core structural similarity with a specific class of major histones but also have features of their own that distinguish them from the major histones. These minor histones usually carry out specific functions in chromatin metabolism. For example, the histone H3-like CENPA is associated only with the centromere region of the chromosome. The histone H2A variant H2A.Z is associated with the promoters of actively transcribed genes and is also involved in preventing the spread of silent heterochromatin. Furthermore, H2A.Z has roles in chromatin for genome stability. Another H2A variant, H2A.X, is phosphorylated at S139 in regions around double-strand breaks and marks the region undergoing DNA repair. Histone H3.3 is associated with the body of actively transcribed genes.


Function


Compacting DNA strands

Histones act as spools around which DNA winds. This enables the compaction necessary to fit the large genomes of eukaryotes inside cell nuclei: the compacted molecule is 40,000 times shorter than an unpacked molecule.
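
A rough sense of how many spools this takes follows from figures quoted earlier in the article: about 146 bp wrapped around each core, about 50 bp of linker DNA between neighbors, and about 1.8 m of DNA per human cell. The Python sketch below combines them. The 0.34 nm rise per base pair is a standard textbook value for B-form DNA that the article itself does not state, so treat it, along with the constant and function names, as an assumption.

```python
CORE_BP = 146             # bp wrapped around one histone octamer (from the text)
LINKER_BP = 50            # approximate linker DNA between nucleosomes (from the text)
TOTAL_DNA_M = 1.8         # stretched DNA per human cell, in meters (from the text)
RISE_PER_BP_M = 0.34e-9   # assumption: B-form DNA rise per base pair, in meters


def estimate_nucleosomes(
    total_dna_m: float,
    core_bp: int = CORE_BP,
    linker_bp: int = LINKER_BP,
    rise_per_bp_m: float = RISE_PER_BP_M,
) -> float:
    """Estimate the nucleosome count if all DNA sat in regular core+linker repeats."""
    total_bp = total_dna_m / rise_per_bp_m   # length converted to base pairs
    repeat_bp = core_bp + linker_bp          # DNA consumed per nucleosome
    return total_bp / repeat_bp


count = estimate_nucleosomes(TOTAL_DNA_M)
print(f"roughly {count:.1e} nucleosomes per cell")
```

The result is in the tens of millions per cell. It is an order-of-magnitude estimate only, since both base-pair figures are approximate and the calculation assumes perfectly regular spacing.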


Chromatin regulation

Histones undergo
posttranslational modification Post-translational modification (PTM) refers to the covalent and generally enzyme, enzymatic modification of proteins following protein biosynthesis. Proteins are synthesized by ribosomes translation (biology), translating mRNA into polypeptide c ...
s that alter their interaction with
DNA Deoxyribonucleic acid (; DNA) is a molecule File:Pentacene on Ni(111) STM.jpg, A scanning tunneling microscopy image of pentacene molecules, which consist of linear chains of five carbon rings. A molecule is an electrically neutral gro ...

DNA
and nuclear proteins. The H3 and H4 histones have long tails protruding from the
nucleosome A nucleosome is the basic structural unit of packaging in . The structure of a nucleosome consists of a segment of DNA wound around eight   and resembles thread wrapped around a spool. The nucleosome is the fundamental subunit of . Each nucleo ...

nucleosome
, which can be
covalent A covalent bond is a chemical bond A chemical bond is a lasting attraction between atom An atom is the smallest unit of ordinary matter In classical physics and general chemistry, matter is any substance that has mass and take ...

covalent
ly modified at several places. Modifications of the tail include
methylation In the chemical sciences, methylation denotes the addition of a methyl group A methyl group is an alkyl derived from methane, containing one carbon atom chemical bond, bonded to three hydrogen atoms — CH3. In chemical formula, fo ...

methylation
,
acetylation Acetylation (or in IUPAC nomenclature The International Union of Pure and Applied Chemistry The International Union of Pure and Applied Chemistry (IUPAC ) is an international federation of National Adhering Organizations that represents chemi ...

acetylation
,
phosphorylation In chemistry Chemistry is the study of the properties and behavior of . It is a that covers the that make up matter to the composed of s, s and s: their composition, structure, properties, behavior and the changes they undergo during ...

phosphorylation
,
ubiquitination Ubiquitin is a small (8.6 ) found in most tissues of organisms, i.e., it is found . It was discovered in 1975 by Gideon Goldstein and further characterized throughout the late 1970s and 1980s. Four genes in the code for ubiquitin: , , and . ...
,
SUMOylationSmall Ubiquitin-like Modifier (or SUMO) proteins are a family of small proteins Proteins are large biomolecule , showing alpha helices, represented by ribbons. This poten was the first to have its suckture solved by X-ray crystallography by M ...
, , and ADP-ribosylation. The core of the histones H2A and H2B can also be modified. Combinations of modifications are thought to constitute a code, the so-called "
histone codeThe histone code is a hypothesis that the transcription of genetic information encoded in DNA is in part regulated by chemical modifications to histone proteins, primarily on their unstructured ends. Together with similar modifications such as DNA m ...

histone code
". Histone modifications act in diverse biological processes such as
gene regulation Regulation of gene expression, or gene regulation, includes a wide range of mechanisms that are used by cells to increase or decrease the production of specific gene products (protein or RNA). Sophisticated programs of gene expression are widely ...
,
DNA repair, chromosome condensation (mitosis) and spermatogenesis (meiosis). The common nomenclature of histone modifications is:
* The name of the histone (e.g., H3)
* The single-letter amino acid abbreviation (e.g., K for lysine) and the amino acid position in the protein
* The type of modification (Me: methyl, P: phosphate, Ac: acetyl, Ub: ubiquitin)
* The number of modifications (only Me is known to occur in more than one copy per residue; 1, 2 or 3 denotes mono-, di- or tri-methylation)

So H3K4me1 denotes the monomethylation of the 4th residue (a lysine) from the start (i.e., the N-terminus) of the H3 protein.
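The naming scheme above is regular enough to be parsed mechanically. The following Python sketch is only an illustration: the names `HistoneMark`, `parse_mark` and `describe` are hypothetical, the pattern covers only the core histones and the four modification codes listed above, and it accepts both "me" and "Me" because both spellings occur in practice. Variant histones (e.g., H2AX) and combined forms such as H3K9me2/3 are deliberately not handled.

```python
import re
from typing import NamedTuple, Optional


class HistoneMark(NamedTuple):
    histone: str          # e.g. "H3"
    residue: str          # single-letter amino acid code, e.g. "K"
    position: int         # residue position counted from the N-terminus
    modification: str     # "me", "p", "ac" or "ub" (normalised to lower case)
    count: Optional[int]  # 1, 2 or 3 for methylation, otherwise None


# Histone name, residue letter, position, modification code, optional copy number.
_PATTERN = re.compile(
    r"^(?P<histone>H(?:1|2A|2B|3|4))"
    r"(?P<residue>[A-Z])"
    r"(?P<position>\d+)"
    r"(?P<mod>[Mm]e|[Aa]c|[Uu]b|[Pp])"
    r"(?P<count>[123])?$"
)

MOD_NAMES = {
    "me": "methylation",
    "p": "phosphorylation",
    "ac": "acetylation",
    "ub": "ubiquitination",
}
PREFIXES = {1: "mono", 2: "di", 3: "tri"}


def parse_mark(name: str) -> HistoneMark:
    """Split a name such as 'H3K4me1' into its components."""
    match = _PATTERN.match(name.strip())
    if match is None:
        raise ValueError(f"not a recognised histone mark name: {name!r}")
    mod = match.group("mod").lower()
    count = int(match.group("count")) if match.group("count") else None
    if count is not None and mod != "me":
        # Only methylation is described as occurring in several copies per residue.
        raise ValueError(f"a copy number is only meaningful for methylation: {name!r}")
    return HistoneMark(
        match.group("histone"),
        match.group("residue"),
        int(match.group("position")),
        mod,
        count,
    )


def describe(mark: HistoneMark) -> str:
    """Spell a parsed mark out in words."""
    prefix = PREFIXES.get(mark.count, "")
    return (
        f"{prefix}{MOD_NAMES[mark.modification]} of "
        f"{mark.residue}{mark.position} on histone {mark.histone}"
    )


print(describe(parse_mark("H3K4me1")))  # monomethylation of K4 on histone H3
```

Keeping the copy number optional reflects the rule that only methylation takes a count, so `parse_mark("H3K27ac")` returns `count=None` while `parse_mark("H3K27ac2")` is rejected.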


Modification

A huge catalogue of histone modifications has been described, but a functional understanding of most is still lacking. Collectively, it is thought that histone modifications may underlie a histone code, whereby combinations of histone modifications have specific meanings. However, most functional data concern individual prominent histone modifications that are biochemically amenable to detailed study.


Chemistry


Lysine methylation

The addition of one, two, or three methyl groups to lysine has little effect on the chemistry of the histone; methylation leaves the charge of the lysine intact and adds a minimal number of atoms, so steric interactions are mostly unaffected. However, proteins containing Tudor, chromo or PHD domains, amongst others, can recognise lysine methylation with exquisite sensitivity and differentiate mono-, di- and tri-methyl lysine, to the extent that, for some lysines (e.g., H4K20), mono-, di- and tri-methylation appear to have different meanings. Because of this, lysine methylation tends to be a very informative mark and dominates the known histone modification functions.


Glutamine serotonylation

It has recently been shown that the addition of a serotonin group to the glutamine at position 5 of H3 occurs in serotonergic cells such as neurons. This is part of the differentiation of the serotonergic cells. This post-translational modification occurs in conjunction with the H3K4me3 modification. The serotonylation potentiates the binding of the general transcription factor TFIID to the TATA box.


Arginine methylation

What was said above of the chemistry of lysine methylation also applies to arginine methylation, and some protein domains—e.g., Tudor domains—can be specific for methyl arginine instead of methyl lysine. Arginine is known to be mono- or di-methylated, and methylation can be symmetric or asymmetric, potentially with different meanings.


Arginine citrullination

Enzymes called peptidylarginine deiminases (PADs) hydrolyze the imine group of arginines and attach a keto group, so that there is one fewer positive charge on the amino acid residue. This process has been implicated in the activation of gene expression by making the modified histones less tightly bound to DNA and thus making the chromatin more accessible. PADs can also produce the opposite effect by removing or inhibiting mono-methylation of arginine residues on histones and thus antagonizing the positive effect arginine methylation has on transcriptional activity.


Lysine acetylation

Addition of an acetyl group has a major chemical effect on lysine as it neutralises the positive charge. This reduces electrostatic attraction between the histone and the negatively charged DNA backbone, loosening the chromatin structure; highly acetylated histones form more accessible chromatin and tend to be associated with active transcription. Lysine acetylation appears to be less precise in meaning than methylation, in that histone acetyltransferases tend to act on more than one lysine; presumably this reflects the need to alter multiple lysines to have a significant effect on chromatin structure. Examples of this modification include H3K27ac.
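The charge argument can be made concrete with a rough count of formal charges. The sketch below is a simplification under stated assumptions: the function name `net_charge` is hypothetical, the sequence is taken to be the first 28 residues of the human histone H3 tail (initiator methionine removed), each unmodified lysine or arginine counts as +1, each aspartate or glutamate as -1, and an acetylated lysine as 0. Histidine, the chain termini and all other modifications are ignored.

```python
# Assumed N-terminal tail of human histone H3, residues 1-28.
H3_TAIL = "ARTKQTARKSTGGKAPRKQLATKAARKS"


def net_charge(sequence, acetylated=()):
    """Approximate formal charge of a peptide near neutral pH.

    `acetylated` holds 1-based positions of lysines carrying an acetyl
    group; those lysines are treated as neutral.
    """
    acetylated = set(acetylated)
    for position in acetylated:
        if not 1 <= position <= len(sequence) or sequence[position - 1] != "K":
            raise ValueError(f"position {position} is not a lysine")

    charge = 0
    for position, residue in enumerate(sequence, start=1):
        if residue == "K":
            if position not in acetylated:
                charge += 1  # unmodified lysine is positively charged
        elif residue == "R":
            charge += 1      # arginine is positively charged
        elif residue in "DE":
            charge -= 1      # acidic residues are negatively charged
    return charge


print(net_charge(H3_TAIL))                           # 10
print(net_charge(H3_TAIL, acetylated={9, 14, 27}))   # 7
```

Under these assumptions each acetylation removes exactly one positive charge, which is consistent with the suggestion above that several lysines may need to be modified before the overall attraction to DNA changes appreciably.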


Serine/threonine/tyrosine phosphorylation

Addition of a negatively charged phosphate group can lead to major changes in protein structure, leading to the well-characterised role of phosphorylation in controlling protein function. It is not clear what structural implications histone phosphorylation has, but histone phosphorylation has clear functions as a post-translational modification, and binding domains such as BRCT have been characterised.


Effects on transcription

Most well-studied histone modifications are involved in control of transcription.


Actively transcribed genes

Two histone modifications are particularly associated with active transcription:

''Trimethylation of H3 lysine 4 (H3K4me3)'': This trimethylation occurs at the promoter of active genes and is performed by the COMPASS complex. Despite the conservation of this complex and histone modification from yeast to mammals, it is not entirely clear what role this modification plays. However, it is an excellent mark of active promoters, and the level of this histone modification at a gene's promoter is broadly correlated with transcriptional activity of the gene. The formation of this mark is tied to transcription in a rather convoluted manner: early in transcription of a gene, RNA polymerase II undergoes a switch from initiating to elongating, marked by a change in the phosphorylation states of the RNA polymerase II C-terminal domain (CTD). The same enzyme that phosphorylates the CTD also phosphorylates the Rad6 complex, which in turn adds a ubiquitin mark to H2B K123 (K120 in mammals). H2BK123Ub occurs throughout transcribed regions, but this mark is required for COMPASS to trimethylate H3K4 at promoters.

''Trimethylation of H3 lysine 36 (H3K36me3)'': This trimethylation occurs in the body of active genes and is deposited by the methyltransferase Set2. This protein associates with elongating RNA polymerase II, and H3K36me3 is indicative of actively transcribed genes. H3K36me3 is recognised by the Rpd3 histone deacetylase complex, which removes acetyl modifications from surrounding histones, increasing chromatin compaction and repressing spurious transcription. Increased chromatin compaction prevents transcription factors from accessing DNA, and reduces the likelihood of new transcription events being initiated within the body of the gene. This process therefore helps ensure that transcription is not interrupted.


Repressed genes

Three histone modifications are particularly associated with repressed genes:

''Trimethylation of H3 lysine 27 (H3K27me3)'': This histone modification is deposited by the polycomb complex PRC2. It is a clear marker of gene repression, and is likely bound by other proteins to exert a repressive function. Another polycomb complex, PRC1, can bind H3K27me3 and adds the histone modification H2AK119Ub, which aids chromatin compaction. Based on these data it appears that PRC1 is recruited through the action of PRC2; however, recent studies show that PRC1 is recruited to the same sites in the absence of PRC2.

''Di- and tri-methylation of H3 lysine 9 (H3K9me2/3)'': H3K9me2/3 is a well-characterised marker for heterochromatin, and is therefore strongly associated with gene repression. The formation of heterochromatin has been best studied in the yeast ''Schizosaccharomyces pombe'', where it is initiated by recruitment of the RNA-induced transcriptional silencing (RITS) complex to double-stranded RNAs produced from centromeric repeats. RITS recruits the Clr4 histone methyltransferase, which deposits H3K9me2/3. This process is called histone methylation. H3K9me2/3 serves as a binding site for the recruitment of Swi6 (heterochromatin protein 1 or HP1, another classic heterochromatin marker), which in turn recruits further repressive activities including histone modifiers such as histone deacetylases and histone methyltransferases.

''Trimethylation of H4 lysine 20 (H4K20me3)'': This modification is tightly associated with heterochromatin, although its functional importance remains unclear. This mark is placed by the Suv4-20h methyltransferase, which is at least in part recruited by heterochromatin protein 1.


Bivalent promoters

Analysis of histone modifications in embryonic stem cells (and other stem cells) revealed many gene promoters carrying both H3K4me3 and H3K27me3; in other words, these promoters display both activating and repressing marks simultaneously. This peculiar combination of modifications marks genes that are poised for transcription; they are not required in stem cells, but are rapidly required after differentiation into some lineages. Once the cell starts to differentiate, these bivalent promoters are resolved to either active or repressive states depending on the chosen lineage.
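The distinction between active, repressed and bivalent promoters reduces to which of the two marks is present. A minimal Python sketch of that logic follows; the function name `promoter_state` and the label "unmarked" are illustrative assumptions, and the input is simply a set of mark names that some upstream analysis has already called as present at the promoter. Real data would first require deciding what signal level counts as "present", which is not addressed here.

```python
def promoter_state(marks):
    """Classify a promoter from the histone marks detected on it.

    `marks` is any iterable of mark names such as "H3K4me3"; matching is
    case-insensitive because both "me3" and "Me3" spellings are in use.
    """
    present = {mark.lower() for mark in marks}
    activating = "h3k4me3" in present   # mark of active promoters
    repressing = "h3k27me3" in present  # mark deposited by PRC2

    if activating and repressing:
        return "bivalent"   # poised: both marks at once
    if activating:
        return "active"
    if repressing:
        return "repressed"
    return "unmarked"       # neither mark detected


print(promoter_state({"H3K4me3", "H3K27me3"}))  # bivalent
print(promoter_state({"H3K4Me3", "H3K27ac"}))   # active
```

In this picture, resolution of a bivalent promoter during differentiation corresponds to losing one of the two marks, which moves the promoter into the "active" or "repressed" branch.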


Other functions


DNA damage

Marking sites of DNA damage is an important function for histone modifications. Histone modifications also take part in the cellular response to DNA damage, including damage caused by ultraviolet radiation from sunlight.

''Phosphorylation of H2AX at serine 139 (γH2AX)'': Phosphorylated H2AX (also known as gamma H2AX) is a marker for DNA double-strand breaks, and forms part of the response to DNA damage. H2AX is phosphorylated early after detection of a DNA double-strand break, and forms a domain extending many kilobases on either side of the damage. Gamma H2AX acts as a binding site for the protein MDC1, which in turn recruits key DNA repair proteins (this complex topic has been reviewed elsewhere) and as such, gamma H2AX forms a vital part of the machinery that ensures genome stability.

''Acetylation of H3 lysine 56 (H3K56Ac)'': H3K56Ac is required for genome stability. H3K56 is acetylated by the p300/Rtt109 complex, but is rapidly deacetylated around sites of DNA damage. H3K56 acetylation is also required to stabilise stalled replication forks, preventing dangerous replication fork collapses. Although in general mammals make far greater use of histone modifications than microorganisms, a major role of H3K56Ac in DNA replication exists only in fungi, and this has become a target for antibiotic development.


DNA repair

''Trimethylation of H3 lysine 36 (H3K36me3)'': H3K36me3 has the ability to recruit the MSH2-MSH6 (hMutSα) complex of the DNA mismatch repair pathway. Consistently, regions of the human genome with high levels of H3K36me3 accumulate fewer somatic mutations due to mismatch repair activity.


Chromosome condensation

''Phosphorylation of H3 at serine 10 (phospho-H3S10)'': The mitotic kinase aurora B phosphorylates histone H3 at serine 10, triggering a cascade of changes that mediate mitotic chromosome condensation. Condensed chromosomes therefore stain very strongly for this mark, but H3S10 phosphorylation is also present at certain chromosome sites outside mitosis, for example in pericentric heterochromatin of cells during G2. H3S10 phosphorylation has also been linked to DNA damage caused by R-loop formation at highly transcribed sites.

''Phosphorylation of H2B at serine 10/14 (phospho-H2BS10/14)'': Phosphorylation of H2B at serine 10 (yeast) or serine 14 (mammals) is also linked to chromatin condensation, but for the very different purpose of mediating chromosome condensation during apoptosis. This mark is not simply a late-acting bystander in apoptosis, as yeast carrying mutations of this residue are resistant to hydrogen peroxide-induced apoptotic cell death.


Addiction

Epigenetic modifications of histone tails in specific regions of the brain are of central importance in addictions. Once particular epigenetic alterations occur, they appear to be long-lasting "molecular scars" that may account for the persistence of addictions.

Cigarette smokers (about 15% of the US population) are usually addicted to nicotine. After 7 days of nicotine treatment of mice, acetylation of both histone H3 and histone H4 was increased at the FosB promoter in the nucleus accumbens of the brain, causing a 61% increase in FosB expression. This would also increase expression of the splice variant Delta FosB. In the nucleus accumbens of the brain, Delta FosB functions as a "sustained molecular switch" and "master control protein" in the development of an addiction.

About 7% of the US population is addicted to alcohol. In rats exposed to alcohol for up to 5 days, there was an increase in histone 3 lysine 9 acetylation in the pronociceptin promoter in the brain amygdala complex. This acetylation is an activating mark for pronociceptin. The nociceptin/nociceptin opioid receptor system is involved in the reinforcing or conditioning effects of alcohol.

Methamphetamine addiction occurs in about 0.2% of the US population. Chronic methamphetamine use causes methylation of the lysine in position 4 of histone 3 located at the promoters of the ''c-fos'' and the ''C-C chemokine receptor 2 (ccr2)'' genes, activating those genes in the nucleus accumbens (NAc). c-fos is well known to be important in addiction. The ''ccr2'' gene is also important in addiction, since mutational inactivation of this gene impairs addiction.


Synthesis

The first step of chromatin structure duplication is the synthesis of histone proteins: H1, H2A, H2B, H3, H4. These proteins are synthesized during S phase of the cell cycle. There are different mechanisms which contribute to the increase of histone synthesis.


Yeast

Yeast carry one or two copies of each histone gene, which are not clustered but rather scattered throughout chromosomes. Histone gene transcription is controlled by multiple gene regulatory proteins such as transcription factors which bind to histone promoter regions. In budding yeast, the candidate gene for activation of histone gene expression is SBF. SBF is a transcription factor that is activated in late G1 phase, when it dissociates from its repressor Whi5. This occurs when Whi5 is phosphorylated by Cdc28, which is a G1/S Cdk. Suppression of histone gene expression outside S phase is dependent on Hir proteins, which form an inactive chromatin structure at the locus of histone genes, causing transcriptional activators to be blocked.


Metazoan

In metazoans the increase in the rate of histone synthesis is due to an increase in processing of pre-mRNA to its mature form as well as a decrease in mRNA degradation; this results in an increase of active mRNA for translation of histone proteins. The mechanism for mRNA activation has been found to be the removal of a segment of the 3' end of the mRNA strand, and is dependent on association with stem-loop binding protein (SLBP). SLBP also stabilizes histone mRNAs during S phase by blocking degradation by the 3'hExo nuclease. SLBP levels are controlled by cell-cycle proteins, causing SLBP to accumulate as cells enter S phase and to be degraded as cells leave S phase. SLBP is marked for degradation by phosphorylation at two threonine residues by cyclin-dependent kinases, possibly cyclin A/Cdk2, at the end of S phase. Metazoans also have multiple copies of histone genes clustered on chromosomes, which are localized in structures called Cajal bodies as determined by genome-wide chromosome conformation capture analysis (4C-Seq).


Link between cell-cycle control and synthesis

Nuclear protein Ataxia-Telangiectasia (NPAT), also known as nuclear protein coactivator of histone transcription, is a transcription factor which activates histone gene transcription on chromosomes 1 and 6 of human cells. NPAT is also a substrate of cyclin E-Cdk2, which is required for the transition between G1 phase and S phase. NPAT activates histone gene expression only after it has been phosphorylated by the G1/S-Cdk cyclin E-Cdk2 in early S phase. This shows an important regulatory link between cell-cycle control and histone synthesis.


History

Histones were discovered in 1884 by Albrecht Kossel. The word "histone" dates from the late 19th century and is derived from the German word ''"Histon"'', a word itself of uncertain origin, perhaps from Ancient Greek ''ἵστημι'' (hístēmi, “make stand”) or ''ἱστός'' (histós, “loom”). In the early 1960s, before the types of histones were known and before histones were known to be highly conserved across taxonomically diverse organisms, James F. Bonner and his collaborators began a study of these proteins, which were known to be tightly associated with the DNA in the nucleus of higher organisms. Bonner and his postdoctoral fellow Ru Chih C. Huang showed that isolated chromatin would not support RNA transcription in the test tube, but if the histones were extracted from the chromatin, RNA could be transcribed from the remaining DNA. Their paper became a citation classic. Paul Ts'o and James Bonner called together a World Congress on Histone Chemistry and Biology in 1964, in which it became clear that there was no consensus on the number of kinds of histone and that no one knew how they would compare when isolated from different organisms (James Bonner and Paul Ts'o, 1965, ''The Nucleohistones'', Holden-Day Inc, San Francisco, London, Amsterdam). Bonner and his collaborators then developed methods to separate each type of histone, purified individual histones, compared amino acid compositions in the same histone from different organisms, and compared amino acid sequences of the same histone from different organisms in collaboration with Emil Smith from UCLA. For example, they found the Histone IV sequence to be highly conserved between peas and calf thymus. However, their work on the biochemical characteristics of individual histones did not reveal how the histones interacted with each other or with the DNA to which they were tightly bound.

Also in the 1960s, Vincent Allfrey and Alfred Mirsky had suggested, based on their analyses of histones, that acetylation and methylation of histones could provide a transcriptional control mechanism, but they did not have available the kind of detailed analysis that later investigators were able to conduct to show how such regulation could be gene-specific. Until the early 1990s, histones were dismissed by most as inert packing material for eukaryotic nuclear DNA, a view based in part on the models of Mark Ptashne and others, who believed that transcription was activated by protein-DNA and protein-protein interactions on largely naked DNA templates, as is the case in bacteria. During the 1980s, Yahli Lorch and Roger Kornberg showed that a nucleosome on a core promoter prevents the initiation of transcription in vitro, and Michael Grunstein demonstrated that histones repress transcription in vivo, leading to the idea of the nucleosome as a general gene repressor. Relief from repression is believed to involve both histone modification and the action of chromatin-remodeling complexes. Vincent Allfrey and Alfred Mirsky had earlier proposed a role of histone modification in transcriptional activation, regarded as a molecular manifestation of epigenetics. Michael Grunstein and David Allis found support for this proposal in the importance of histone acetylation for transcription in yeast and the activity of the transcriptional activator Gcn5 as a histone acetyltransferase. The discovery of the H5 histone appears to date back to the 1970s, and it is now considered an isoform of Histone H1.


See also

* Histone variants
* Chromatin
* Gene silencing
* Genetics
* Histone acetyltransferase
* Histone deacetylases
* Histone methyltransferase
* Histone-modifying enzymes
* Nucleosome
* PRMT4 pathway
* Histone H1


External links


* HistoneDB 2.0 - Database of histones and variants at the National Center for Biotechnology Information (NCBI)
* Chromatin, Histones & Cathepsin; PMAP The Proteolysis Map-animation