RNA polymerase II

RNA polymerase II (also called RNAP II and Pol II) is an enzyme found in eukaryotic cells. It catalyzes the transcription of DNA to synthesize precursors of mRNA and most snRNA and microRNA.^[2]^[3] A 550 kDa complex of 12 subunits, RNAP II is the most studied type of RNA polymerase. A wide range of transcription factors are required for it to bind to its promoters and begin transcription.


Subunits

The eukaryotic core RNA polymerase II was first purified using transcription assays.^[4] The purified enzyme typically has 10-12 subunits (12 in humans and yeast) and is incapable of specific promoter recognition.^[5] Many subunit-subunit interactions are known.^[6]

Computer-generated image of POLR2A gene with colorized subunits: green - RPB1 domain 1, blue - RPB1 domain 2, sand - RPB1 domain 3, light blue - RPB1 domain 4, brown - RPB1 domain 6, and magenta - RPB1 CTD.

DNA-directed RNA polymerase II subunit RPB1 - an enzyme that in humans is encoded by the POLR2A gene. RPB1 is the largest subunit of RNA polymerase II. It contains a carboxy terminal domain (CTD) composed of up to 52 heptapeptide repeats (YSPTSPS) that are essential for polymerase activity.^[7] In combination with several other polymerase subunits, it forms the DNA binding domain of the polymerase, a groove in which the DNA template is transcribed into RNA.^[8] It strongly interacts with RPB8.^[6]

RPB2 (POLR2B) - the second-largest subunit that in combination with at least two other polymerase subunits forms a structure within the polymerase that maintains contact in the active site of the enzyme between the DNA template and the newly synthesized RNA.^[9]

RPB3 (POLR2C) - the third-largest subunit. It exists as a heterodimer with another polymerase subunit, POLR2J, forming a core subassembly. RPB3 strongly interacts with RPB1-5, 7, 10-12.^[6]

RNA polymerase II subunit B4 (RPB4) - encoded by the POLR2D gene^[10] is the fourth-largest subunit and may have a stress protective role.

RPB5 - In humans is encoded by the POLR2E gene. Two molecules of this subunit are present in each RNA polymerase II.^[11] RPB5 strongly interacts with RPB1, RPB3, and RPB6.^[6]

RPB6 (POLR2F) - forms a structure with at least two other subunits that stabilizes the transcribing polymerase on the DNA template.^[12]

RPB7 - encoded by POLR2G and may play a role in regulating polymerase function.^[13] RPB7 interacts strongly with RPB1 and RPB5.^[6]

RPB8 (POLR2H) - interacts with subunits RPB1-3, 5, and 7.^[6]

RPB9 (POLR2I) - together with RPB1, forms the groove in which the DNA template is transcribed into RNA.

RPB10 - the product of gene POLR2L. It interacts with RPB1-3 and 5, and strongly with RPB3.^[6]

RPB11 - the RPB11 subunit is itself composed of three subunits in humans: POLR2J (RPB11-a), POLR2J2 (RPB11-b), and POLR2J3^[14] (RPB11-c).

RPB12 (POLR2K) - interacts with RPB3.^[6]

Assembly

RPB3 is involved in RNA polymerase II assembly.^[15] A subcomplex of RPB2 and RPB3 appears soon after subunit synthesis.^[15] This complex subsequently interacts with RPB1.^[15] RPB3, RPB5, and RPB7 interact with themselves to form homodimers, and RPB3 and RPB5 together are able to contact all of the other RPB subunits except RPB9.^[6] Only RPB1 strongly binds to RPB5.^[6] The RPB1 subunit also contacts RPB7 and RPB10, and, more weakly but most efficiently, RPB8.^[6] Once RPB1 enters the complex, other subunits such as RPB5 and RPB7 can enter: RPB5 binds to RPB6 and RPB8, and RPB3 brings in RPB10, RPB11, and RPB12.^[6] RPB4 and RPB9 may enter once most of the complex is assembled. RPB4 forms a complex with RPB7.^[6]

Kinetics

Enzymes can catalyze up to several million reactions per second. Enzyme rates depend on solution conditions and substrate concentration. Like other enzymes, Pol II has a saturation curve and a maximum velocity (V_max). It has a K_m (the substrate concentration required for one-half V_max) and a k_cat (the number of substrate molecules handled by one active site per second). The specificity constant is given by k_cat/K_m. The theoretical maximum for the specificity constant is the diffusion limit of about 10⁸ to 10⁹ M⁻¹ s⁻¹, at which every collision of the enzyme with its substrate results in catalysis.
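The relationship between V_max, K_m, k_cat and the specificity constant can be illustrated with a minimal Python sketch. It assumes simple Michaelis-Menten kinetics, which the text above does not name explicitly, and the numerical values are hypothetical placeholders chosen for illustration, not measured parameters for Pol II.

```python
def michaelis_menten_rate(substrate_conc, v_max, k_m):
    """Reaction velocity v = V_max * [S] / (K_m + [S]) (assumed rate law)."""
    return v_max * substrate_conc / (k_m + substrate_conc)


def specificity_constant(k_cat, k_m):
    """Specificity constant k_cat / K_m, in M^-1 s^-1 when K_m is in M."""
    return k_cat / k_m


# Hypothetical illustrative values; NOT measured parameters for Pol II.
K_CAT = 10.0     # s^-1, substrate molecules handled per active site per second
K_M = 1e-5       # M, substrate concentration giving one-half V_max
ENZYME = 1e-9    # M, total enzyme concentration
V_MAX = K_CAT * ENZYME  # M s^-1, velocity at saturating substrate

# At [S] = K_m the velocity is one-half of V_max, by definition of K_m.
half_max = michaelis_menten_rate(K_M, V_MAX, K_M)

# The specificity constant can be compared against the diffusion limit
# of about 1e8 to 1e9 M^-1 s^-1.
efficiency = specificity_constant(K_CAT, K_M)

print(f"v at [S] = K_m: {half_max:.3e} M/s (V_max = {V_MAX:.3e} M/s)")
print(f"k_cat/K_m: {efficiency:.3e} M^-1 s^-1")
```

With these placeholder values the specificity constant falls well below the diffusion limit, which would correspond to an enzyme for which only a fraction of collisions with substrate lead to catalysis.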

The turnover number for RNA polymerase II is 0.16 s⁻¹, subject to concentration.^[16] Bacterial RNA polymerase, a relative of RNA polymerase II, switches between inactivated and activated states by translocating back and forth along the DNA.^[17] At concentrations of [NTP]_eq = 10 μM GTP, 10 μM UTP, 5 μM ATP and 2.5 μM CTP, bacterial RNAP shows a mean elongation rate (turnover number) of ~1 bp (NTP)⁻¹.^[17]

RNA polymerase II is inhibited by α-amanitin.

Holoenzyme

Main article: RNA polymerase II holoenzyme

RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells.^[5] It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.

Part of the assembly of the holoenzyme is referred to as the preinitiation complex, because its assembly takes place on the gene promoter before the initiation of transcription. The mediator complex acts as a bridge between RNA polymerase II and the transcription factors.

Control by chromatin structure

The following outlines an example mechanism in yeast cells by which chromatin structure and histone posttranslational modification help regulate and record the transcription of genes by RNA polymerase II.

This pathway gives examples of regulation at these points of transcription:

Pre-initiation (promotion by Bre1, histone modification)
Initiation (promotion by TFIIH, Pol II modification; and promotion by COMPASS, histone modification)
Elongation (promotion by Set2, histone modification)

These stages of the process are described here as regulatory steps. It has not been proven that they are used for regulation, but it is very likely that they are.

RNA Pol II elongation promoters can be summarised in three classes:

Drug/sequence-dependent arrest-affected factors (various interfering proteins)
Chromatin structure-oriented factors (histone posttranslational modifiers, e.g., HMTs)
RNA Pol II catalysis-improving factors (various interfering proteins and Pol II cofactors)

Protein Complexes Involved

Chromatin structure-oriented factors:

Histone methyltransferases (HMTs):
COMPASS (COMplex of Proteins ASsociated with Set1) - methylates lysine 4 of histone H3.
Set2 - methylates lysine 36 of histone H3.
Dot1 - methylates lysine 79 of histone H3 (a further example, not relevant to this pathway).

Other:
Bre1 - ubiquitinates (adds ubiquitin to) lysine 123 of histone H2B. Associated with pre-initiation and allowing RNA Pol II binding.

N-terminus

The N-terminus (also known as the amino-terminus, NH₂-terminus, N-terminal end or amine-terminus) refers to the start of a protein or polypeptide terminated by an amino acid with a free amine group (-NH₂). The convention for writing peptide sequences is to put the N-terminus on the left and write the sequence from N- to C-terminus. When the protein is translated from messenger RNA, it is created from N-terminus to C-terminus.

The N-terminus is the first part of the protein that exits the ribosome during protein biosynthesis. It often contains sequences that act as targeting signals, in effect intracellular zip codes, that allow the protein to be delivered to its designated location within the cell. The targeting signal is usually cleaved off by a processing peptidase after successful targeting. Some proteins are also modified posttranslationally.

C-terminus

The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal end, or COOH-terminus) of a protein or polypeptide is the end of the amino acid chain terminated by a free carboxyl group (-COOH). The convention for writing peptide sequences is to put the C-terminal end on the right and write the sequence from N- to C-terminus.

Each amino acid has a carboxyl group and an amine group, and amino acids link to one another to form a chain by a dehydration reaction by joining the amine group of one amino acid to the carboxyl group of the next. Thus polypeptide chains have an end with an unbound carboxyl group, the C-terminus, and an end with an amine group, the N-terminus. Proteins are naturally synthesized starting from the N-terminus and ending at the C-terminus.

The C-terminus can contain retention signals for protein sorting. The most common ER retention signal is the amino acid sequence -KDEL (or -HDEL) at the C-terminus, which keeps the protein in the endoplasmic reticulum and prevents it from entering the secretory pathway.
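Because the retention signal is defined by its position at the very end of the chain, checking for it amounts to a suffix test on a sequence written N- to C-terminus. The following is a minimal sketch; the function name and the example sequences are hypothetical and are not taken from any real protein.

```python
# ER retention motifs named in the text, in one-letter amino acid code.
ER_RETENTION_SIGNALS = ("KDEL", "HDEL")


def has_er_retention_signal(protein_sequence):
    """Return True if the sequence, written N- to C-terminus, ends in KDEL or HDEL."""
    # str.endswith accepts a tuple and matches if any suffix matches.
    return protein_sequence.upper().endswith(ER_RETENTION_SIGNALS)


# Hypothetical sequences for illustration only.
print(has_er_retention_signal("MKTAYIAKQRKDEL"))  # motif at the C-terminus
print(has_er_retention_signal("MKDELTAYIAKQR"))   # motif internal, not at the C-terminus
```

The second example contains KDEL, but not at the C-terminus, so it is not reported as a retention signal.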

The C-terminus of proteins can be modified posttranslationally, most commonly by the addition of a lipid anchor that allows the protein to be inserted into a membrane without having a transmembrane domain. In Pol II, the C-terminus of RPB1 is extended to form the C-terminal domain (CTD).

CTD of RNA polymerase

The carboxy-terminal domain of RNA polymerase II typically consists of up to 52 repeats of the sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser.^[18] Other proteins often bind the C-terminal domain of RNA polymerase in order to activate polymerase activity. It is the protein domain that is involved in the initiation of DNA transcription, the capping of the RNA transcript, and attachment to the spliceosome for RNA splicing.^[19]
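The repeat structure of the CTD can be explored with a short Python sketch that counts exact copies of the consensus heptapeptide (YSPTSPS in one-letter code) in a protein sequence. The function names and the test sequence are hypothetical, and the sketch only recognises exact matches, so any repeat that deviates from the consensus would not be counted.

```python
import re

# Consensus heptapeptide Tyr-Ser-Pro-Thr-Ser-Pro-Ser in one-letter code.
CTD_CONSENSUS = "YSPTSPS"


def count_consensus_repeats(sequence, motif=CTD_CONSENSUS):
    """Count non-overlapping exact copies of the motif anywhere in the sequence."""
    return sequence.upper().count(motif)


def longest_tandem_run(sequence, motif=CTD_CONSENSUS):
    """Return the largest number of exact motif copies found back to back."""
    runs = re.findall(f"(?:{re.escape(motif)})+", sequence.upper())
    return max((len(run) // len(motif) for run in runs), default=0)


# Hypothetical sequence: three exact repeats, one variant repeat (YSPTSPA),
# then two more exact repeats. Not a real RPB1 sequence.
synthetic_tail = "MAAA" + "YSPTSPS" * 3 + "YSPTSPA" + "YSPTSPS" * 2 + "GG"

print(count_consensus_repeats(synthetic_tail))  # exact copies in total
print(longest_tandem_run(synthetic_tail))       # longest uninterrupted run
```

Separating the total count from the longest tandem run makes it visible when a variant repeat interrupts an otherwise continuous array.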

References

^ Meyer PA, Ye P, Zhang M, Suh MH, Fu J (Jun 2006). "Phasing RNA polymerase II using intrinsically bound Zn atoms: an updated structural model". Structure. 14 (6): 973–82. doi:10.1016/j.str.2006.04.003. PMID 16765890. http://linkinghub.elsevier.com/retrieve/pii/S0969212606002152.
^ Kornberg R (1999). "Eukaryotic transcriptional control". Trends in Cell Biology 9 (12): M46. doi:10.1016/S0962-8924(99)01679-7. PMID 10611681.
^ Sims RJ 3rd, Mandal SS, Reinberg D (Jun 2004). "Recent highlights of RNA-polymerase-II-mediated transcription". Current opinion in cell biology 16 (3): 263–271. doi:10.1016/j.ceb.2004.04.004. ISSN 0955-0674. PMID 15145350.
^ Sawadogo M, Sentenac A (1990). "RNA polymerase B (II) and general transcription factors.". Annu Rev Biochem. 59: 711–54. doi:10.1146/annurev.bi.59.070190.003431. PMID 2197989.
^ Myer VE, Young RA (October 1998). "RNA polymerase II holoenzymes and subcomplexes". J. Biol. Chem. 273 (43): 27757–60. doi:10.1074/jbc.273.43.27757. PMID 9774381. http://www.jbc.org/cgi/reprint/273/43/27757.pdf.
^ Acker J, de Graaff M, Cheynel I, Khazak V, Kedinger C, Vigneron M (Jul 1997). "Interactions between the human RNA polymerase II subunits". J Biol Chem. 272 (27): 16815–21. doi:10.1074/jbc.272.27.16815. PMID 9201987.
^ Brickey WJ, Greenleaf AL (June 1995). "Functional studies of the carboxy-terminal repeat domain of Drosophila RNA polymerase II in vivo". Genetics 140 (2): 599–613. PMC 1206638. PMID 7498740. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1206638.
^ "Entrez Gene: POLR2A polymerase (RNA) II (DNA directed) polypeptide A, 220kDa". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=5430.
^ "Entrez Gene: POLR2B polymerase (RNA) II (DNA directed) polypeptide B, 140kDa". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=5431.
^ Khazak V, Estojak J, Cho H, Majors J, Sonoda G, Testa JR, Golemis EA (May 1998). "Analysis of the interaction of the novel RNA polymerase II (pol II) subunit hsRPB4 with its partner hsRPB7 and with pol II". Mol Cell Biol. 18 (4): 1935–45. PMC 121423. PMID 9528765. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=121423.
^ "Entrez Gene: POLR2E polymerase (RNA) II (DNA directed) polypeptide E, 25kDa". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=5434.
^ "Entrez Gene: POLR2F polymerase (RNA) II (DNA directed) polypeptide F". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=5435.
^ "Entrez Gene: POLR2G polymerase (RNA) II (DNA directed) polypeptide G". http://www.ncbi.nlm.nih.gov/sites/entrez?Db=gene&Cmd=ShowDetailView&TermToSearch=5436.
^ "POLR2J3 polymerase (RNA) II (DNA directed) polypeptide J3". http://www.ncbi.nlm.nih.gov/gene/548644?ordinalpos=1&itool=EntrezSystem2.PEntrez.Gene.Gene_ResultsPanel.Gene_RVDocSum.
^ Kolodziej PA, Young RA (Sep 1991). "Mutations in the three largest subunits of yeast RNA polymerase II that affect enzyme assembly". Mol Cell Biol. 11 (9): 4669–78. PMC 361357. PMID 1715023. http://mcb.asm.org/cgi/reprint/11/9/4669?ijkey=9d60d05ed32981de57ecc990796689311e8f86a0.
^ Jin J, Dong W, Guarino LA (Dec 1998). "The LEF-4 subunit of Baculovirus RNA polymerase has RNA 5'-triphosphatase and ATPase activities". J Virol. 72 (12): 10011–9. PMC 110520. PMID 9811739. http://jvi.highwire.org/cgi/reprint/72/12/10011.
^ Abbondanzieri EA, Greenleaf WJ, Shaevitz JW, Landick R, Block SM (Nov 2005). "Direct observation of base-pair stepping by RNA polymerase". Nature. 438 (7067): 460–5. doi:10.1038/nature04268. PMC 1356566. PMID 16284617. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1356566.
^ Meinhart A, Cramer P (July 2004). "Recognition of RNA polymerase II carboxy-terminal domain by 3'-RNA-processing factors". Nature 430 (6996): 223–6. doi:10.1038/nature02679. PMID 15241417. http://www.nature.com/nature/journal/v430/n6996/abs/nature02679.html.
^ Brickey WJ, Greenleaf AL (June 1995). "Functional studies of the carboxy-terminal repeat domain of Drosophila RNA polymerase II in vivo". Genetics 140 (2): 599–613. PMC 1206638. PMID 7498740. http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pmcentrez&artid=1206638.
