RNA polymerase II holoenzyme

RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells. It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.


RNA polymerase II

RNA polymerase II (also called RNAP II and Pol II) is an enzyme found in eukaryotic cells. It catalyzes the transcription of DNA to synthesize precursors of mRNA and most snRNA and microRNA. In humans, RNAP II consists of seventeen protein molecules (gene products encoded by POLR2A-L, where the proteins synthesized from ''POLR2C'', ''POLR2E'', and ''POLR2F'' form homodimers).


General transcription factors

General transcription factors (GTFs) or basal transcription factors are protein transcription factors that have been shown to be important in the transcription of class II genes to mRNA templates. Many of them are involved in the formation of a preinitiation complex, which, together with RNA polymerase II, binds to and reads the single-stranded DNA gene template. The cluster of RNA polymerase II and various transcription factors is known as a basal transcriptional complex (BTC).


Preinitiation complex

The preinitiation complex (PIC) is a large complex of proteins that is necessary for the transcription of protein-coding genes in eukaryotes and archaea. The PIC helps position RNA polymerase II over gene transcription start sites, denatures the DNA, and positions the DNA in the RNA polymerase II active site for transcription.

The typical PIC is made up of six general transcription factors: TFIIA (GTF2A1, GTF2A2), TFIIB (GTF2B), B-TFIID (BTAF1, TBP), TFIID (BTAF1, BTF3, BTF3L4, EDF1, TAF1-15, 16 total), TFIIE, TFIIF, TFIIH and TFIIJ.

The construction of the polymerase complex takes place on the gene promoter. The TATA box is one well-studied example of a promoter element that occurs in approximately 10% of genes. It is conserved in many (though not all) model eukaryotes and is found in a fraction of the promoters in these organisms. The sequence TATA (or variations) is located approximately 25 nucleotides upstream of the Transcription Start Point (TSP). In addition, there are also some weakly conserved features including the TFIIB-Recognition Element (BRE), approximately 5 nucleotides upstream (BREu) and 5 nucleotides downstream (BREd) of the TATA box.
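The positional statement above (a TATA-like sequence roughly 25 nucleotides upstream of the transcription start point) can be illustrated with a small Python sketch. Everything in it is an assumption made for illustration: the loose pattern `TATA[AT]A[AT]`, the 20 to 35 nucleotide search window, and the function name `find_tata_boxes` are hypothetical choices, not a curated consensus or a published tool.

```python
import re

# Loose TATA-like pattern; an illustrative assumption, not a curated consensus.
TATA_PATTERN = re.compile(r"TATA[AT]A[AT]")


def find_tata_boxes(sequence, tss_index, upstream_min=20, upstream_max=35):
    """Return offsets (relative to the TSS) where a TATA-like motif starts.

    sequence  -- DNA string, 5' to 3', coding strand
    tss_index -- 0-based index of the transcription start point in sequence
    Only start positions between upstream_max and upstream_min nucleotides
    upstream of the TSS are examined.
    """
    hits = []
    for offset in range(-upstream_max, -upstream_min + 1):
        pos = tss_index + offset
        if pos < 0:
            continue  # window runs off the 5' end of the sequence
        if TATA_PATTERN.match(sequence, pos):
            hits.append(offset)
    return hits
```

For a toy promoter such as `"GC" * 10 + "TATAAAA" + "GC" * 9 + "ATGGCC"` with the start point at index 45, the function reports a single hit at offset -25. Because most genes lack a TATA box, an empty result is the expected outcome for many real promoters.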


Assembly of the PIC

Although the sequence of steps involved in the assembly of the PIC can vary, assembly generally begins with binding to the promoter (step 1) and proceeds as follows:

1. The TATA-binding protein (TBP, a subunit of TFIID), TBPL1, or TBPL2 can bind the promoter or TATA box. Most genes lack a TATA box and use an initiator element (Inr) or downstream core promoter instead. Nevertheless, TBP is always involved and is forced to bind without sequence specificity. TAFs from TFIID can also be involved when the TATA box is absent. A TFIID TAF will bind sequence specifically and force the TBP to bind non-sequence specifically, bringing the remaining portions of TFIID to the promoter.
2. TFIIA interacts with the TBP subunit of TFIID and aids in the binding of TBP to TATA-box-containing promoter DNA. Although TFIIA does not recognize DNA itself, its interactions with TBP allow it to stabilize and facilitate formation of the PIC.
3. The N-terminal domain of TFIIB brings the DNA into proper position for entry into the active site of RNA polymerase II. TFIIB binds partially sequence specifically, with some preference for BRE. The TFIID-TFIIA-TFIIB (DAB)-promoter complex subsequently recruits RNA polymerase II and TFIIF.
4. TFIIF (two subunits, RAP30 and RAP74, showing some similarity to bacterial sigma factors) and Pol II enter the complex together. TFIIF helps to speed up the polymerization process.
5. TFIIE joins the growing complex and recruits TFIIH. TFIIE may be involved in DNA melting at the promoter: it contains a zinc ribbon motif that can bind single-stranded DNA. TFIIE helps to open and close Pol II's ''Jaw''-like structure, which enables movement down the DNA strand.
6. DNA may be wrapped one complete turn around the preinitiation complex, and it is TFIIF that helps keep this tight wrapping. In the process, the torsional strain on the DNA may aid in DNA melting at the promoter, forming the transcription bubble.
7. TFIIH enters the complex. TFIIH is a large protein complex that contains, among others, the CDK7/cyclin H kinase complex and a DNA helicase. TFIIH has three functions. First, it binds specifically to the template strand to ensure that the correct strand of DNA is transcribed, and it melts or unwinds the DNA (ATP-dependent) to separate the two strands using its helicase activity. Second, it has a kinase activity that phosphorylates the C-terminal domain (CTD) of Pol II at the amino acid serine, which switches the RNA polymerase to start producing RNA. Finally, it is essential for nucleotide excision repair (NER) of damaged DNA. TFIIH and TFIIE strongly interact with one another. TFIIE affects TFIIH's catalytic activity. Without TFIIE, TFIIH will not unwind the promoter.
8. TFIIH helps create the transcription bubble and may be required for transcription if the DNA template is not already denatured or if it is supercoiled.
9. Mediator then encases all the transcription factors and Pol II. It interacts with enhancers, areas very far away (upstream or downstream) that help regulate transcription.

The formation of the preinitiation complex (PIC) is analogous to the mechanism seen in bacterial initiation. In bacteria, the sigma factor recognizes and binds to the promoter sequence. In eukaryotes, the transcription factors perform this role.
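The stepwise assembly described in this section can be sketched as a simple dependency check in Python. This is a deliberately coarse model under stated assumptions: the `PREREQUISITES` table is one reading of the steps in this section (the text itself notes that the order can vary), Pol II and TFIIF are treated as a single unit because they are described as entering together, and the names `PREREQUISITES`, `CANONICAL_ORDER`, and `assemble_pic` are hypothetical.

```python
# Which components must already be bound before each one can join.
# This table is a simplified reading of the steps in this section,
# not an authoritative biochemical model.
PREREQUISITES = {
    "TFIID": set(),                                  # TBP binds the promoter first
    "TFIIA": {"TFIID"},                              # stabilizes TBP on the DNA
    "TFIIB": {"TFIID", "TFIIA"},                     # completes the DAB complex
    "Pol II + TFIIF": {"TFIID", "TFIIA", "TFIIB"},   # recruited by DAB
    "TFIIE": {"Pol II + TFIIF"},
    "TFIIH": {"TFIIE"},                              # recruited by TFIIE
    "Mediator": {"TFIIH"},
}

# Dicts preserve insertion order, so this is the order written above.
CANONICAL_ORDER = list(PREREQUISITES)


def assemble_pic(order):
    """Add components in the given order; raise if a prerequisite is missing."""
    bound = []
    for component in order:
        missing = PREREQUISITES[component] - set(bound)
        if missing:
            raise ValueError(
                f"{component} cannot join before {sorted(missing)}"
            )
        bound.append(component)
    return bound
```

Calling `assemble_pic(CANONICAL_ORDER)` returns the full list of components, whereas an order that places TFIIH before TFIIE raises a `ValueError`, mirroring the statement that TFIIE recruits TFIIH.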


Mediator complex

Mediator is a multiprotein complex that functions as a transcriptional coactivator. The Mediator complex is required for the successful transcription of nearly all class II gene promoters in yeast, and it works in the same manner in mammals. Mediator binds to the C-terminal domain (CTD) of the RNA polymerase II holoenzyme, acting as a bridge between this enzyme and transcription factors.


C-terminal domain (CTD)

The carboxy-terminal domain (CTD) of RNA polymerase II is the portion of the polymerase that is involved in the initiation of DNA transcription, the capping of the RNA transcript, and attachment to the spliceosome for RNA splicing. The CTD typically consists of up to 52 repeats (in humans) of the sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser. The CTD is essential for life: cells containing only RNAPII with none or only up to one-third of its repeats are inviable.

The CTD is an extension appended to the C terminus of RPB1, the largest subunit of RNA polymerase II. It serves as a flexible binding scaffold for numerous nuclear factors, with binding determined by the phosphorylation patterns on the CTD repeats. Each repeat contains an evolutionarily conserved heptapeptide, Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7, which is subjected to reversible phosphorylations during each transcription cycle. This domain is inherently unstructured yet evolutionarily conserved, and in eukaryotes it comprises from 25 to 52 tandem copies of the consensus repeat heptad. As the CTD is frequently not required for general transcription factor (GTF)-mediated initiation and RNA synthesis, it does not form a part of the catalytic essence of RNAPII, but performs other functions.
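The repeat structure of the CTD lends itself to a short Python sketch. It builds an idealized CTD from the consensus heptad (written in one-letter amino acid code as `YSPTSPS`) and counts in-frame consensus repeats. The function names `ideal_ctd` and `count_consensus_heptads` are hypothetical, and the idealized sequence is an assumption: real CTDs may contain repeats that deviate from the consensus, which this toy model ignores.

```python
# Tyr-Ser-Pro-Thr-Ser-Pro-Ser in one-letter amino acid code.
CONSENSUS_HEPTAD = "YSPTSPS"


def ideal_ctd(n_repeats):
    """Return an idealized CTD made of n_repeats perfect consensus heptads."""
    return CONSENSUS_HEPTAD * n_repeats


def count_consensus_heptads(sequence):
    """Count in-frame heptads that match the consensus exactly.

    The sequence is read in non-overlapping windows of seven residues
    starting at position 0; a trailing partial window is ignored.
    """
    count = 0
    for i in range(0, len(sequence) - 6, 7):
        if sequence[i:i + 7] == CONSENSUS_HEPTAD:
            count += 1
    return count
```

An idealized human-length CTD, `ideal_ctd(52)`, is 364 residues long and contains 52 consensus heptads by construction. A sequence such as `"YSPTSPK" + CONSENSUS_HEPTAD` counts as one consensus repeat, since the first heptad differs at position 7.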


CTD phosphorylation

RNAPII can exist in two forms: RNAPII0, with a highly phosphorylated CTD, and RNAPIIA, with a nonphosphorylated CTD. Phosphorylation occurs principally on Ser2 and Ser5 of the repeats, although these positions are not equivalent. The phosphorylation state changes as RNAPII progresses through the transcription cycle: the initiating RNAPII is form IIA, and the elongating enzyme is form II0. While RNAPII0 does consist of RNAPs with hyperphosphorylated CTDs, the pattern of phosphorylation on individual CTDs can vary due to differential phosphorylation of Ser2 versus Ser5 residues and/or to differential phosphorylation of repeats along the length of the CTD. The PCTD (phosphoCTD of an RNAPII0) physically links pre-mRNA processing to transcription by tethering processing factors (e.g., those for 5′-end capping, 3′-end cleavage, and polyadenylation) to elongating RNAPII.

Ser5 phosphorylation (Ser5PO4) near the 5′ ends of genes depends principally on the kinase activity of TFIIH (Kin28 in yeast; CDK7 in metazoans). The transcription factor TFIIH is a kinase and will hyperphosphorylate the CTD of RNAP, and in doing so, causes the RNAP complex to move away from the initiation site. Subsequent to the action of TFIIH kinase, Ser2 residues are phosphorylated by CTDK-I in yeast (CDK9 kinase in metazoans). Ctk1 (CDK9) acts in complement to phosphorylation of serine 5 and is, thus, seen in middle to late elongation.

CDK8 and cyclin C (CCNC) are components of the RNA polymerase II holoenzyme that phosphorylate the carboxy-terminal domain (CTD). CDK8 regulates transcription by targeting the CDK7/cyclin H subunits of the general transcription initiation factor IIH (TFIIH), thereby providing a link between the mediator and the basal transcription machinery.

The gene CTDP1 encodes a phosphatase that interacts with the carboxy-terminus of transcription initiation factor TFIIF, a transcription factor that regulates elongation as well as initiation by RNA polymerase II.

Also involved in the phosphorylation and regulation of the RPB1 CTD is cyclin T1 (CCNT1). Cyclin T1 tightly associates and forms a complex with CDK9 kinase, both of which are involved in the phosphorylation and regulation. The reaction

: ATP + [DNA-directed RNA polymerase II] <=> ADP + [DNA-directed RNA polymerase II] phosphate

is catalyzed by CDK9 (EC 2.7.11.23).

TFIIF and FCP1 cooperate for RNAPII recycling. FCP1, the CTD phosphatase, interacts with RNA polymerase II. Transcription is regulated by the state of phosphorylation of a heptapeptide repeat. The nonphosphorylated form, RNAPIIA, is recruited to the initiation complex, whereas the elongating polymerase is the RNAPII0 form. RNAPII cycles during transcription. CTD phosphatase activity is regulated by two GTFs (TFIIF and TFIIB). The large subunit of TFIIF (RAP74) stimulates the CTD phosphatase activity, whereas TFIIB inhibits TFIIF-mediated stimulation. Dephosphorylation of the CTD alters the migration of the largest subunit of RNAPII (RPB1).
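The cycle between the IIA and II0 forms can be summarized as a toy state model in Python. It is a sketch under strong simplifying assumptions: the whole CTD is collapsed into two flags (Ser2 and Ser5 phosphorylation) rather than tracking individual repeats, any phosphorylation is labeled form II0, and the names `CtdState`, `tfiih_kinase`, `cdk9_kinase`, and `fcp1_phosphatase` are hypothetical labels for the activities named in this section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CtdState:
    """Coarse CTD phosphorylation state (one flag per serine position)."""
    ser2_p: bool = False
    ser5_p: bool = False

    @property
    def form(self):
        # Simplification: any phosphorylation is reported as form II0.
        return "II0" if (self.ser2_p or self.ser5_p) else "IIA"


def tfiih_kinase(state):
    """Ser5 phosphorylation by the TFIIH kinase (Kin28 in yeast, CDK7 in metazoans)."""
    return CtdState(ser2_p=state.ser2_p, ser5_p=True)


def cdk9_kinase(state):
    """Ser2 phosphorylation by CDK9 (CTDK-I in yeast), after TFIIH has acted."""
    return CtdState(ser2_p=True, ser5_p=state.ser5_p)


def fcp1_phosphatase(state):
    """Dephosphorylation by FCP1, returning the polymerase to form IIA."""
    return CtdState()
```

Starting from `CtdState()` (form IIA), applying `tfiih_kinase` and then `cdk9_kinase` gives a state with both serines phosphorylated (form II0), and `fcp1_phosphatase` returns it to form IIA, which corresponds to the recycling step described above.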


5' capping

The carboxy-terminal domain is also the binding site of the cap-synthesizing and cap-binding complexes. In eukaryotes, after transcription of the 5' end of an RNA transcript, the cap-synthesizing complex on the CTD will remove the gamma-phosphate from the 5'-phosphate and attach a GMP, forming a 5',5'-triphosphate linkage. The synthesizing complex falls off and the cap then binds to the cap-binding complex (CBC), which is bound to the CTD. The 5' cap of eukaryotic RNA transcripts is important for binding of the mRNA transcript to the ribosome during translation and to the CTD of RNAP, and it prevents RNA degradation.


Spliceosome

The carboxy-terminal domain is also the binding site for spliceosome factors that are part of RNA splicing. These allow for the splicing and removal of introns (in the form of a lariat structure) during RNA transcription.


Mutation in the CTD

Major studies have been carried out in which particular amino acids in the CTD were knocked out. The results indicate that RNA polymerase II CTD truncation mutations affect the ability to induce transcription of a subset of genes ''in vivo'', and the lack of response to induction maps to the upstream activating sequences of these genes.


Genome surveillance complex

Several protein members of the BRCA1-associated genome surveillance complex (BASC) associate with RNA polymerase II and play a role in transcription. The transcription factor TFIIH is involved in transcription initiation and DNA repair. MAT1 (for 'ménage à trois-1') is involved in the assembly of the CAK complex. CAK is a multisubunit protein that includes CDK7, cyclin H (CCNH), and MAT1. CAK is an essential component of the transcription factor TFIIH that is involved in transcription initiation and DNA repair. The nucleotide excision repair (NER) pathway is a mechanism to repair damage to DNA. ERCC2 is involved in transcription-coupled NER and is an integral member of the basal transcription factor BTF2/TFIIH complex. ERCC3 is an ATP-dependent DNA helicase that functions in NER. It also is a subunit of basal transcription factor 2 (TFIIH) and, thus, functions in class II transcription. XPG (ERCC5) forms a stable complex with TFIIH, which is active in transcription and NER. ERCC6 encodes a DNA-binding protein that is important in transcription-coupled excision repair. ERCC8 interacts with Cockayne syndrome type B (CSB) protein, with p44 (GTF2H2), a subunit of the RNA polymerase II transcription factor IIH, and ERCC6. It is involved in transcription-coupled excision repair. Higher error ratios in transcription by RNA polymerase II are observed in the presence of Mn2+ compared to Mg2+.


Transcription coactivators

The EDF1 gene encodes a protein that acts as a transcriptional coactivator by interconnecting the general transcription factor TATA element-binding protein (TBP) and gene-specific activators. TFIID and human mediator coactivator (THRAP3) complexes (mediator complex, plus THRAP3 protein) assemble cooperatively on promoter DNA, from which they become part of the RNAPII holoenzyme.


Transcription initiation

The completed assembly of the holoenzyme with transcription factors and RNA polymerase II bound to the promoter forms the eukaryotic transcription initiation complex. Transcription in the archaea domain is similar to transcription in eukaryotes. Transcription begins with matching of NTPs to the first and second nucleotides in the DNA sequence. This, like most of the remainder of transcription, is an energy-dependent process, consuming adenosine triphosphate (ATP) or other NTPs.


Promoter clearance

After the first bond is synthesized, the RNA polymerase must clear the promoter. During this time, there is a tendency to release the RNA transcript and produce truncated transcripts. This is called ''abortive initiation'' and is common to both eukaryotes and prokaryotes. In bacteria, abortive initiation continues to occur until the σ factor rearranges, resulting in the transcription elongation complex (which gives a 35 bp moving footprint). The σ factor is released before 80 nucleotides of mRNA are synthesized. Once the transcript reaches approximately 23 nucleotides, it no longer slips and elongation can occur.


Initiation regulation

Because of the range of genes that Pol II transcribes, it is the polymerase subject to the most regulation, by a range of factors, at each stage of transcription. It is also one of the most complex in terms of the polymerase cofactors involved. Initiation is regulated by many mechanisms. These can be separated into two main categories:

1. Protein interference.
2. Regulation by phosphorylation.


Regulation by protein interference

Protein interference is the process wherein some signaling protein interacts, either with the promoter or with some stage of the partially constructed complex, to prevent further construction of the polymerase complex, so preventing initiation. In general, this is a very rapid response and is used for fine-level, individual gene control and for 'cascade' processes for a group of genes useful under specific conditions (for example, DNA repair genes or heat shock genes).

Chromatin structure inhibition is the process wherein the promoter is hidden by chromatin structure. Chromatin structure is controlled by post-translational modification of the histones involved and leads to grossly high or low levels of transcription. See: chromatin, histone, and nucleosome.

These methods of control can be combined in a modular fashion, allowing very high specificity in transcription initiation control.


Regulation by phosphorylation

The largest subunit of Pol II (Rpb1) has a domain at its C-terminus called the CTD (C-terminal domain). This is the target of kinases and phosphatases. The phosphorylation of the CTD is an important regulation mechanism, as it allows attraction and rejection of factors that have a function in the transcription process. The CTD can be considered as a platform for transcription factors.

The CTD consists of repetitions of an amino acid motif, YSPTSPS, of which the serines and threonines can be phosphorylated. The number of these repeats varies; the mammalian protein contains 52, while the yeast protein contains 26. Site-directed mutagenesis of the yeast protein has found that at least 10 repeats are needed for viability. There are many different combinations of phosphorylations possible on these repeats, and these can change rapidly during transcription. The regulation of these phosphorylations and the consequences for the association of transcription factors play a major role in the regulation of transcription.

During the transcription cycle, the CTD of the large subunit of RNAP II is reversibly phosphorylated. RNAP II containing unphosphorylated CTD is recruited to the promoter, whereas the hyperphosphorylated CTD form is involved in active transcription. Phosphorylation occurs mainly at two sites within the heptapeptide repeat, at Serine 5 and Serine 2. Serine 5 phosphorylation is confined to promoter regions and is necessary for the initiation of transcription, whereas Serine 2 phosphorylation is important for mRNA elongation and 3'-end processing.
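The repeat arithmetic described above can be illustrated with a short Python sketch. It assumes an idealised CTD built only from perfect copies of the consensus heptad YSPTSPS; real CTDs contain degenerate repeats, so the counts are illustrative rather than exact. The function names `ctd_sites` and `count_phosphorylatable` are hypothetical helpers introduced here for illustration.

```python
# Consensus heptad repeat of the Pol II CTD, as given in the text.
# Positions within the repeat are numbered 1-7.
HEPTAD = "YSPTSPS"


def ctd_sites(n_repeats, positions=(2, 5)):
    """List (repeat_number, heptad_position, residue) for the requested
    heptad positions in an idealised CTD of n_repeats consensus repeats.

    Assumption: every repeat matches the consensus exactly.
    """
    sites = []
    for repeat in range(1, n_repeats + 1):
        for pos in positions:
            sites.append((repeat, pos, HEPTAD[pos - 1]))
    return sites


def count_phosphorylatable(n_repeats):
    """Count serine (S) and threonine (T) residues in the idealised CTD."""
    return sum(1 for aa in HEPTAD * n_repeats if aa in "ST")


if __name__ == "__main__":
    for name, repeats in (("yeast", 26), ("mammalian", 52)):
        ser2_ser5 = len(ctd_sites(repeats))
        ser_thr = count_phosphorylatable(repeats)
        print(f"{name}: {repeats} repeats, {ser2_ser5} Ser2/Ser5 sites, "
              f"{ser_thr} Ser/Thr residues in total")
```

Even under this simplification, the number of possible phosphorylation combinations grows very quickly with repeat count, which is consistent with the CTD acting as a flexible platform for different factors at different stages.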


Elongation

Elongation is the synthesis of a messenger RNA copy of the DNA template. RNA Pol II matches complementary RNA nucleotides to the template DNA by Watson–Crick base pairing. These RNA nucleotides are joined together, resulting in a strand of messenger RNA. Unlike DNA replication, mRNA transcription can involve multiple RNA polymerases on a single DNA template and multiple rounds of transcription (amplification of particular mRNA), so many mRNA molecules can be rapidly produced from a single copy of a gene. Elongation also involves a proofreading mechanism that can replace incorrectly incorporated bases. In eukaryotes, this may correspond with short pauses during transcription that allow appropriate RNA editing factors to bind. These pauses may be intrinsic to the RNA polymerase or due to chromatin structure.
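The base-pairing rule used during elongation can be sketched in a few lines of Python. This is a minimal illustration only: it models the pairing of RNA nucleotides with a template strand and ignores proofreading, pausing, and every other aspect of the real enzyme. The function name `transcribe` and the example sequence are hypothetical.

```python
# Watson-Crick pairing between a DNA template base and the RNA base
# that is incorporated opposite it (uracil pairs with adenine in RNA).
PAIRING = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_3_to_5):
    """Return the RNA (written 5'->3') complementary to a DNA template
    strand supplied in the 3'->5' direction.

    Raises ValueError if the template contains a character other than
    A, C, G or T.
    """
    rna = []
    for base in template_3_to_5.upper():
        if base not in PAIRING:
            raise ValueError(f"unexpected base in template: {base!r}")
        rna.append(PAIRING[base])
    return "".join(rna)


if __name__ == "__main__":
    # Hypothetical six-base template, read 3'->5'.
    print(transcribe("TACGGT"))
```

Because the template is read 3'→5' while the RNA grows 5'→3', no reversal step is needed when the template is supplied in the reading direction.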


Elongation regulation

RNA Pol II elongation promoters can be summarised in three classes:

1. Drug/sequence-dependent arrest affected factors, e.g., SII (TFIIS) and P-TEFb protein families.
2. Chromatin structure oriented factors, based on histone post-translational modifications: phosphorylation, acetylation, methylation and ubiquitination. ''See: chromatin, histone, and nucleosome''
3. RNA Pol II catalysis improving factors, which improve the Vmax or Km of RNA Pol II, so improving the catalytic quality of the polymerase enzyme, e.g., TFIIF, Elongin and ELL families. ''See: enzyme kinetics, Henri–Michaelis–Menten kinetics, Michaelis constant, and Lineweaver–Burk plot''

As for initiation, protein interference, seen here as the "drug/sequence-dependent arrest affected factors" and "RNA Pol II catalysis improving factors", provides a very rapid response and is used for fine-level individual gene control. Elongation downregulation is also possible, in this case usually by blocking polymerase progress or by deactivating the polymerase. Chromatin structure-oriented factors are more complex than for initiation control. Often the chromatin-altering factor becomes bound to the polymerase complex, altering the histones as they are encountered and providing a semi-permanent 'memory' of previous promotion and transcription.
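To make the reference to Vmax and Km concrete, the following Python sketch evaluates the standard Michaelis–Menten rate law, v = Vmax·[S] / (Km + [S]). Treating Pol II elongation as a simple Michaelis–Menten enzyme is a simplifying assumption, and all numerical values below are hypothetical, chosen only to show the direction of the effect.

```python
def mm_rate(substrate, vmax, km):
    """Michaelis-Menten rate: v = Vmax * [S] / (Km + [S]).

    substrate and km share the same concentration unit;
    the result has the unit of vmax.
    """
    if substrate < 0 or km <= 0 or vmax < 0:
        raise ValueError("substrate and vmax must be >= 0 and km must be > 0")
    return vmax * substrate / (km + substrate)


if __name__ == "__main__":
    # Hypothetical values: NTP concentration and kinetic constants in
    # arbitrary units, not measurements for any real polymerase.
    ntp = 50.0
    baseline = mm_rate(ntp, vmax=20.0, km=100.0)
    raised_vmax = mm_rate(ntp, vmax=40.0, km=100.0)  # factor raises Vmax
    lowered_km = mm_rate(ntp, vmax=20.0, km=25.0)    # factor lowers Km

    print(f"baseline rate:    {baseline:.2f}")
    print(f"with higher Vmax: {raised_vmax:.2f}")
    print(f"with lower Km:    {lowered_km:.2f}")
```

In this model, a factor can speed up elongation at a fixed nucleotide concentration either by raising Vmax or by lowering Km; the latter matters most when substrate is well below saturation.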


Termination

Termination is the process of breaking up the polymerase complex and ending the RNA strand. In eukaryotes using RNA Pol II, this termination is very variable (up to 2000 bases), relying on post-transcriptional modification. Little regulation occurs at termination, although it has been proposed that newly transcribed RNA is held in place if proper termination is inhibited, allowing very fast expression of genes given a stimulus. This has not yet been demonstrated in eukaryotes.


Transcription factory

Active RNA Pol II transcription holoenzymes can be clustered in the nucleus, in discrete sites called ''transcription factories''. There are ~8,000 such factories in the nucleoplasm of a HeLa cell, but only 100–300 RNAP II foci per nucleus in erythroid cells, as in many other tissue types. The number of transcription factories in tissues is thus far more restricted than previous estimates from cultured cells indicated. As an active transcription unit is usually associated with only one Pol II holoenzyme, a polymerase II factory may contain on average ~8 holoenzymes. Colocalization of transcribed genes has not been observed in cultured fibroblast-like cells. Differentiated or committed tissue types have a limited number of available transcription sites. Estimates show that erythroid cells express at least 4,000 genes, so many genes are obliged to seek out and share the same factory.

The intranuclear position of many genes is correlated with their activity state. During transcription ''in vivo'', distal active genes are dynamically organized into shared nuclear subcompartments and colocalize to the same transcription factory at high frequencies. Movement into or out of these factories, rather than the recruitment and assembly of a transcription complex, results in activation (On) or abatement (Off) of transcription. Usually, genes migrate to preassembled factories for transcription. An expressed gene is preferentially located outside of its chromosome territory, but a closely linked, inactive gene is located inside.
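The figures quoted in this section can be combined in a rough back-of-the-envelope calculation, sketched here in Python. It assumes, purely for illustration, that every expressed gene is assigned to exactly one factory and that the averages quoted above apply uniformly; the function and variable names are hypothetical.

```python
def genes_per_factory(expressed_genes, factories):
    """Average number of expressed genes that must share one factory."""
    if factories <= 0:
        raise ValueError("factories must be positive")
    return expressed_genes / factories


if __name__ == "__main__":
    # Erythroid cells: at least 4,000 expressed genes, 100-300 RNAP II foci.
    expressed = 4000
    fewest_foci, most_foci = 100, 300
    print("genes per factory (erythroid): "
          f"{genes_per_factory(expressed, most_foci):.0f} to "
          f"{genes_per_factory(expressed, fewest_foci):.0f}")

    # HeLa cells: ~8,000 factories with ~8 holoenzymes each implies this
    # many active holoenzymes (a product of the two quoted averages,
    # not an independent measurement).
    hela_factories = 8000
    holoenzymes_per_factory = 8
    print("implied active holoenzymes (HeLa):",
          hela_factories * holoenzymes_per_factory)
```

Under these assumptions, each erythroid factory would serve on the order of tens of genes, which is consistent with the statement that many genes must share a factory.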


Holoenzyme stability

RNA polymerase II holoenzyme stability determines the number of base pairs that can be transcribed before the holoenzyme loses its ability to transcribe. The length of the CTD is essential for RNA polymerase II stability. RNA polymerase II stability has been shown to be regulated by post-translational proline hydroxylation. The von Hippel–Lindau tumor suppressor protein (pVHL, human GeneID: 7428) complex binds the hyperphosphorylated large subunit of the RNA polymerase II complex in a proline hydroxylation- and CTD phosphorylation-dependent manner, targeting it for ubiquitination.


See also

* RNA polymerase I
* RNA polymerase III
* Post-transcriptional modification
* Transcription (genetics)
* Eukaryotic transcription



