Kimberly J. Durniak 1, Scott Bailey 1, 2, *, Thomas A. Steitz 1, 2, 3, @
1 Department of Molecular Biophysics and Biochemistry,
Yale University, 266 Whitney Avenue, New Haven, CT 06520–8114, USA.
2 Howard Hughes Medical Institute, Yale University, 266
Whitney Avenue, New Haven, CT 06520–8114, USA.
3 Department of Chemistry, Yale University, 266 Whitney
Avenue, New Haven, CT 06520–8114, USA.
* Present address: Department of Biochemistry and Molecular Biology, Bloomberg School of Public Health, Johns Hopkins University, 615 North Wolfe Street, Baltimore, MD 21205, USA.
@ To whom correspondence should be addressed. E-mail:
thomas.steitz@yale.edu
Structural studies of the T7 bacteriophage DNA-dependent RNA polymerase (T7 RNAP) have shown that the conformation of the amino-terminal domain changes substantially between the initiation and elongation phases of transcription, but how this transition is achieved remains unclear. We report crystal structures of T7 RNAP bound to promoter DNA containing either a 7- or an 8-nucleotide (nt) RNA transcript that illuminate intermediate states along the transition pathway. The amino-terminal domain comprises the C-helix subdomain and the promoter binding domain (PBD), which consists of two segments separated by subdomain H. The structures of the intermediate complex reveal that the PBD and the bound promoter rotate by ~45° upon synthesis of an 8-nt RNA transcript. This allows the promoter contacts to be maintained while the active site is expanded to accommodate a growing heteroduplex. The C-helix subdomain moves modestly toward its elongation conformation, whereas subdomain H remains in its initiation- rather than its elongation-phase location, more than 70 angstroms away.
RNA polymerases (RNAPs) exhibit three phases of transcription—initiation,
elongation,
and termination. The initiation and elongation phases have been
studied extensively in the T7 RNAP system by biochemical and structural
approaches [reviewed in (1, 2)]. During the initiation
phase, the RNAP binds to a specific promoter DNA sequence, opens the DNA
duplex, and feeds the template strand into the active site (3,
4). The structure of a T7 RNAP initiation complex identified a six-helix-bundle
subdomain (residues 72 to 150, and 191 to 267)— the promoter binding domain
(PBD)—that is responsible for many of the interactions with the
17–base pair (bp) promoter and, in part, for melting the DNA duplex
(3, 4) (Fig. 1A). While remaining bound
to the promoter, the polymerase undergoes abortive synthesis, producing
many short transcripts from 2 to 12 nucleotides (nt) in length (5,
6). After the transition to the elongation phase and release of the
promoter, the polymerase proceeds processively down the DNA template, producing
a full-length RNA transcript. Comparison of the structures of the T7 RNAP
initiation and elongation complexes revealed extensive conformational
changes within the N-terminal 267 residues (N-terminal domain) and
little change in the rest of the RNAP (Fig. 1B) (7,
8). A rigid body rotation of the PBD as well as the refolding of the
C helix (residues 28 to 71) and H (residues 151 to 190) subdomains abolishes
the promoter binding site, enlarges the active site, and creates an exit
tunnel for the RNA transcript.
Fig. 1. A Comparison of the structures of T7 RNAP initiation,
intermediate, and elongation complexes.
Fig. 1. A Comparison of the structures of T7 RNAP initiation, intermediate, and elongation complexes. The molecules have been similarly oriented by superposition of their palm domains. The C-terminal domain is shown as a surface with the thumb domain removed to allow views of the DNA and RNA. The nontemplate strand is shown in light green, the template in blue, and the RNA transcript in red. Subdomains of the N-terminal domain are colored yellow (C-helix), green (subdomain H), and purple (PBD).
(A) The initiation complex bound to the promoter has a 3-nt transcript with its 5' end in contact with the PBD.
(B) The elongation complex exhibits a 220° right-hand rotation of the PBD, a refolding of subdomain H onto the top of the polymerase, and the formation of an elongated C-helix subdomain, when compared to the initiation complex.
(C) (Top) The T7 RNAP in the 7-nt RNA intermediate complex is bound to both promoter and downstream DNA. The PBD has rotated by 40° away from the C-terminal domain, avoiding a steric clash with the transcript and allowing for 7 bp of heteroduplex to form in the active site. (Bottom) A schematic drawing of the sequences constituting the subdomains of the N-terminal domain.
The structural changes within the N-terminal domain account for the increased stability and the processivity of the elongation complex, yet provide little insight into how the polymerase accommodates a growing transcript while maintaining its promoter contacts during abortive synthesis. Covalent cross-linking of the template strand to the RNA transcript established that the heteroduplex can be as long as 8 bp (9, 10). Yet, modeling the elongation of the 3-nt RNA transcript observed in the structure of the initiation complex to a 4-nt transcript produced a steric clash with the PBD, and incorporation of an additional nucleoside triphosphate (NTP) destroyed the crystal (4). Whereas the enlarged active site observed in the structure of the elongation complex (Fig. 1B) can accommodate a 7- to 8-bp heteroduplex, the new orientation of the PBD abolishes the promoter binding site.
Cross-linking, mutagenesis, and proteolytic digestion experiments suggest the existence of at least one intermediate structure and a transition mechanism that consists of two stages. The first stage allows synthesis of up to 8 nt of RNA with minimal changes to the N-terminal subdomains (11–15), whereas the second stage is presumed to include the major refolding events that occur during the synthesis of 9 to 14 nt and allow the stable elongation complex to form (16). A transition mechanism consisting of two stages has also been proposed for the larger, multisubunit eukaryotic RNAPs (17).
Models of the transition from initiation to elongation proposed previously
have suggested that the N-terminal domain undergoes a gradual structural
rearrangement to accommodate an 8-bp heteroduplex (7,
8). One model proposes a 10 Å translation of the PBD and subdomain
H away from the active site (18), while another suggests
that the PBD can maintain its promoter contacts after rotating into the
position observed in the elongation complex (7). None
of these models are entirely consistent with the biochemical data (14,
19).
However, Tang et al. recently posited that a rotation of the N-terminal
domain could accommodate their fluorescence resonance energy transfer (FRET)
data, as well as previous biochemical data (2). Although
these FRET data predict a 20° rotation, they do not provide the direction
of the rotation or which components are rotating.
Results:
Here, we have determined the structures of T7 RNAP bound to promoter DNA containing either a 7- or an 8-nt RNA transcript representing complexes that are between the early initiation and the elongation phases.
Structures of intermediate transcription complexes.
The structure of a mutant T7 RNAP with promoter DNA and a 7-nt RNA transcript was solved at 3.0 Å resolution (Fig. 1C). Our initial attempts to capture this structure by using wild-type T7 RNAP resulted in structures of complexes with DNA that lacked the added RNA transcript, presumably due to the instability of an intermediate complex relative to an aborted complex. To address this problem, we used a mutant T7 RNAP that produces fewer abortive products and dissociates more slowly than the wild type when stalled at 6 nt of RNA (13). The polymerase with a proline-to-leucine point mutation at residue 266 (P266L) was cocrystallized with a 33-bp duplex DNA containing a 17-bp conserved promoter region, a transcription bubble of six non-complementary bases followed by a 10-bp downstream complementary region, and a 7-nt-long synthetic RNA that was complementary to the template in the bubble region (Fig. 2B). Complexes with smaller RNA transcripts either failed to crystallize or resulted in aborted complexes. The substrate template DNA, nontemplate DNA, and nascent RNA were annealed and then mixed with the P266L mutant polymerase to form the intermediate complex (20). The structure was solved by molecular replacement with the protein component of the initiation complex (PDB accession no. 1QLN) as the search model (4). An electron density map calculated with 2Fo – Fc as coefficients and phases derived from coordinates that did not ever include the nucleic acid is shown in Fig. 2A. The refined model has an Rfactor of 25.0% (Rfree of 29.2%). Data collection and refinement statistics are shown in Table 1. We also determined the structure of an initiation complex bound to promoter DNA and an 8-nt RNA transcript at 6.7 Å resolution (supporting online material).
Fig. 2. Nucleic acid substrate.
(A) A 3 Å resolution electron density omit map of the nucleic acid component of the active site calculated with 2Fo – Fc as coefficients and contoured at 1.0 s. A model of the template (blue) and the RNA (red) is superimposed.
(B) The sequence of the substrate observed in the 7-nt intermediate
complex structure. The polymerase has opened an 11-nt transcription bubble,
and "+1" marks the transcription start site. Shaded nucleotides are not
seen in the electron density map.
The structure of the polymerase containing the 7-nt RNA exhibits interactions with both the upstream and downstream DNA and has an 11-nt transcription bubble (Fig. 2, A and B). The single-stranded nontemplate strand in the transcription bubble makes interactions with the finger and thumb domains similar to those seen in the elongation complex (fig S2, A and B). The RNA backbone of nucleotides 4 and 5 contacts the thumb domain. The 5' end of the RNA is located toward the specificity loop, and its 3' end is situated in the active site in a pretranslocated position with the closed fingers domain conformation. The Ca backbones of the palm domains (residues 411 to 552) of the initiation and the intermediate complexes superimpose with a root mean square deviation (RMSD) of 0.40 Å.
Movements of the N-terminal subdomains, specificity loop, and the promoter DNA.
The structure of the 7-nt intermediate complex reveals a left-handed
40° rotation of the PBD, the specificity loop, and the bound promoter
about an axis passing through residues 198 to 204, consistent with biochemical
data indicating that T7 RNAP maintains promoter contacts during abortive
synthesis (Fig. 3, A and B). The specificity loop (residues
739 to 769), which is an insertion into the fingers domain, is responsible
for making sequence-specific contacts with base pairs in the promoter region
(4, 9, 21). Superimposing
the C a backbone of the PBD and the specificity
loop of the initiation complex on that of the intermediate complex gives
an RMSD of 2.0 Å, in agreement with a largely rigid body motion of
the PBD and the specificity loop. The contacts between the PBD, the specificity
loop, and the promoter DNA in the structure of the intermediate complex
remain the same as those in the initiation complex, but their rotational
movement enlarges the active site to accommodate the 7 nt of elongated
RNA. Previous footprinting analyses of the promoter DNA show protection
by T7 RNAP during synthesis of up to 9 nt of RNA (22).
Experiments using ultraviolet-laser cross-linking, mutagenesis of the promoter
sequence, or transcription assays in conjunction with a promoter sink challenge
indicate that the specificity loop loses its promoter contacts after the
synthesis of 8 to 9 nt (16, 23).
Furthermore, the PBD rotates by 45° during the transcription of 8 nt
of RNA (movie S1), which is consistent with these
biochemical data. In the initiation-to-elongation structural change, a
loop between the C1 helix (residues 28 to 41) and the C2 helix (residues
46 to 55) refolds to create an elongated C-helix subdomain that is important
for elongation complex stability (14). Superposition
of the polymerase palm domains of the initiation and elongation complexes
reveals that the C1 helix does not change its position; rather, the C2
helix stacks onto the C1 helix, thereby lengthening it from 20 to 50 Å
(8) (Fig. 4A). The C2 helix in the
intermediate complex has a structure in between the conformations observed
in the initiation and elongation complexes. As the PBD rotates away from
the active site during the transition, the loop between the C1 and C2 helices
starts to refold as the C2 helix rotates by 40° toward the C1 helix.
The C2 helix must rotate by an additional 50° to form the continuous
elongated C-helix subdomain observed in the elongation complex. A comparison
of all three structures reveals that the refolding of this loop is important
for formation of the C-helix subdomain (Fig. 4A). This
is consistent with the mutagenesis of residues within this loop to proline,
which disrupts the refolding event, increases the accumulation of transcripts
6 to 7 nt long, and prevents the formation of stable elongation complexes
(14).
Fig. 3. Promoter and PBD movements during the transition.
Fig. 3. Promoter and PBD movements during the transition.
(A) A view looking down onto the promoter bound to the PBD. A 40° rotation of the PBD away from the active site occurs around an axis that passes through a flexible loop of residues 198 to 204. The catalytic aspartic acid residues (D812 and D537) represent the active site.
(B) The same view as in (A), without the PBD but showing the specificity loop, which also rotates.
Fig. 4. Other conformational changes associated with the transition.
(A) A comparison of the C-helix subdomain conformations between the initiation (pink), intermediate (yellow), and elongation (blue) complexes. During the transition, the C2 helix rotates by 90°, resulting in its stacking on the C1 helix in the elongation phase, increasing the length of the C-helix by an additional 30 Å.
(B) A 180° rotation around the peptide bond between M267 and F268 allows for the conformational changes in the N-terminal domain with little effect on the C-terminal domain. The elongation complex is shown in light blue, the initiation complex is in pink, and residue 266 is highlighted in red.
(C) The same region shown in (B) but with the intermediate complex (dark blue) and the initiation complex (pink) superimposed.
(D) A superposition of the thumb domain from the initiation complex (pink), the intermediate complex (pale orange), and elongation complex (blue). The thumb backs away from the active site by 4 Å during the transition and by another 5 Å in the elongation complex, creating a binding cleft for the upstream DNA in the elongation complex. Abbreviations for the amino acid residues are as follows: D, Asp; E, Glu; F, Phe; M, Met; P, Pro; Q, Gln; and S, Ser.
Subdomain H is an insertion into the six-helix bundle of the PBD, and the flexible loop connecting these two domains allows the PBD to rotate with minimal changes in the conformation and location of subdomain H. During the initiation phase, subdomain H consists of a loop (residues 165 to 190) and a helix of 14 residues (151 to 164). Although 10 residues within this loop (residues 167 to 177) are disordered in the structure of the intermediate complex, superposition of the initiation and intermediate complexes along the palm domain reveals little movement in the helix of subdomain H during the first part of the transition from initiation to elongation (fig. S3). This is consistent with experiments that show that subdomain H remains sensitive to proteolytic cleavage until a 12-nt transcript is formed and subdomain H takes on the conformation of antiparallel helices in the elongation complex (8, 16).
Role of the P266L mutation.
Comparison of the conformations of residue 266 in the structures of the initiation, the intermediate, and the elongation complexes reveals that this residue plays a key role during the transition. The P266L mutation is located at the C terminus of the PBD in a loop connecting the N-terminal domain with the C-terminal thumb, palm, and fingers domains (Fig. 1C). A rotation of 180°around the peptide bond between residues 267 and 268 allows the conformational changes in the N-terminal domain to occur without any change in the C-terminal domain (Fig. 4, B and C). A proline at residue 266, though not at the center of rotation, presumably limits the flexibility of the protein near this point because the covalent bond between the CD and the backbone nitrogen restricts the peptide backbone f angle of proline to about –70°. However, the weak electron density in this region (residues 258 to 264) precludes a more detailed analysis of the backbone angles of the P266L mutant. Perhaps the mutation allows formation of an intermediate structure that is more energetically favorable relative to the initiation structure than is allowed by the proline. Consistent with this hypothesis, we were unable to produce crystals of this intermediate complex with wild-type T7 RNAP.
Movements within the thumb domain and the downstream DNA.
The position of the thumb domain in the intermediate complex is between its position in the initiation complex and in the elongation complexes (Fig. 4D), as is observed with the C2 helix. In the structure of the elongation complex, the thumb domain has moved by 8 Å from its position in the initiation complex, creating a binding cleft for the upstream DNA (8). Although the binding cleft has not yet formed in the structure of the intermediate complex, the thumb domain has moved by 4 Å from its position in the initiation complex and makes interactions with the backbone of the nontemplate strand as well as with the RNA chain at residues 4 and 5. These interactions are similar to those seen in the structure of the elongation complex. This structure is consistent with mutagenesis experiments indicating the role that the thumb domain plays in stabilizing the transcription complex (24, 25).
The structure of the T7 RNAP intermediate complex shows the polymerase
bound to the promoter as well as to downstream DNA. Comparison of the downstream
DNA in the structures of the intermediate complex with those of the elongation
complexes reveals that the downstream DNA is rotated by 30° toward
the N-terminal domain in the intermediate complex (Fig. 5A).
The angle between the promoter and the downstream DNA is about 40°,
bringing the upstream and downstream duplex DNAs to within 6 Å of
each other (Fig. 5B). The differences in the positions
of the downstream DNA may be correlated to the positions of subdomain H.
A superposition of the C a backbone of the palm
domains of the intermediate and the elongation complexes (PDB accession
no. 1MSW) (8) reveals that the position of subdomain
H in the elongation complex would clash with the position of the downstream
DNA in the intermediate complex (Fig. 5C). Perhaps upon
refolding into its position on top of the polymerase in the elongation
complex, subdomain H is responsible for not only interacting with the nascent
RNA chain and the specificity loop (8, 26),
but also for moving the downstream DNA into the position observed in the
structure of the elongation complex. In our low-resolution structure of
the 8-nt intermediate complex, the position of the downstream DNA is different
in each of the two copies in the asymmetric unit. The unbiased electron
density for the downstream DNA agrees with the position of the downstream
DNA seen in the 7-nt intermediate complex in one copy, while in the other
copy, the density agrees better with the position seen in the elongation
complex. This is consistent with movement of the downstream DNA during
the transition from initiation to elongation. The results of FRET (27)
and chemically tethered nuclease (28) experiments that
were conducted to determine the relation of the downstream DNA to the fingers
domain are consistent with either of the positions observed in the crystal
structures.
Fig. 5. Arrangement of the downstream DNA.
Fig. 5. Arrangement of the downstream DNA.
(A) The downstream duplex is rotated by 30° toward the N-terminal domain compared to its position in the elongation complex (shown in gray).
(B) The angle between the upstream and downstream duplex DNA is about 40°, bringing the phosphate backbones within 6 Å of each other.
(C) The refolded subdomain H from the structure of the elongation complex (gray) creates a clash with the position of the downstream DNA as observed in the intermediate complex. Residues involved in the clash are shown as spheres.
(D) A close-up view of the 5' end of the 7-nt RNA reveals that a modeled extension of the RNA by three additional nucleotides would clash with the specificity loop in the position observed in the intermediate complex.
These structures of intermediate complexes resolve the ambiguity in the current models of the transition mechanism (7, 8, 18). The model proposed by Theis et al., and later supported by FRET data, suggested that the initial step of the transition from initiation to elongation involves a 10 Å translation of the PBD and subdomain H away from the C-terminal active site (18, 19). However, the FRET data also correlate well with these crystal structures of the intermediate complex. The left-handed rotation of the PBD moves the FRET probe on the nontemplate strand ~10 Å from its original position, in agreement with the 10 Å change observed from the FRET data, but achieved through a different mechanism. Our structures also reveal minimal changes in subdomain H during the initial stage of the transition. The rotation of the C2 helix toward the C1 helix observed in the structure of the 7-nt RNA intermediate complex indicates that formation of the C-helix subdomain is underway before the loss of promoter contacts and is consistent with mutagenesis experiments that disrupt the formation of the C-helix subdomain and result in the accumulation of RNA transcripts of 6 to 7nt in length (14). Experimental FRET data have disproved another model that proposed that the PBD would be able to maintain its contacts with the promoter DNA after the conformational changes and refolding of the N-terminal subdomains that occur during the transition from initiation to elongation (7, 19).
Recently, Tang et al. (2) proposed on the basis of FRET data that the promoter and the N-terminal domain rotate by 20° about an axis passing through the –4 position on the nontemplate strand, during the synthesis of a 3- to 7-nt RNA transcript. However, the direction of the rotation, as well as the specific changes of the individual subdomains of the N-terminal domain during the rotation, was not determined. Our structures show that the synthesis of an 8-nt RNA transcript results in a left-handed rotation of 45° about an axis that passes through the end of a central helix within the middle of the PBD (movie S1) rather than at one end of the PBD (–4NT strand) as proposed from the FRET measurements (Fig. 3).
It is possible that during the initial stages of the transition, additional stable intermediate conformations occur. For example, as the RNA elongates, the PBD may undergo a stepwise rotation to reach the position observed in the structure of the 7-nt intermediate complex (movie S1). Attempts to capture additional intermediates by using shorter RNAs have thus far failed to yield crystals or have resulted in structures without an RNA transcript in the active site. The difference between the 7-nt RNA and a shorter 6-nt RNA could be the lower stability of the shorter heteroduplex, which may shift the equilibrium toward the more stable T7 RNAP initiation state conformation. In addition, the buried surface area (~100 Å2) between part of the PBD (residues 246 to 266) and part of the C-terminal thumb domain (residues 398 to 403) in the initiation complex (18) becomes partially solvent accessible. The energy cost of exposing the hydrophobic surfaces may also contribute to the instability of the intermediate states. The PBD rotates by an additional 5° in the 8-nt intermediate complex compared to the 7-nt intermediate complex, consistent with multiple rotation states and the recent FRET data (2) that show an increase in probe distances as the transcript increases in length.
The second stage of the transition, after the RNA is extended beyond 8 nt, involves the loss of promoter contacts and larger structural changes in the specificity loop, the PBD, and subdomain H (12, 14, 15, 29). Modeling the longer RNA transcript observed in the structure of the elongation complex onto the intermediate structure shows that an RNA chain extended beyond 8 nt will clash with the specificity loop (Fig. 5D). This is consistent both with mutagenesis (16) and cross-linking studies (9) that indicate that the specificity loop loses its contacts with the promoter and begins to interact with the growing RNA chain upon synthesis of 8 to 9 nt. The PBD must also be released from its contacts with the promoter and rotate into the position it occupies in the elongation phase. During the first stage of the transition that we observe here, the PBD undergoes a left-handed 45° rotation. However, further examination suggests that the PBD can only achieve its final position in the elongation complex through a 260° right-handed rotation, as proposed by Theis et al. (18). This larger rotation of the PBD is likely to be associated with changes in subdomain H, consistent with mutant T7 RNAPs that suggest that the hinge region between subdomain H and the PBD is important for the transition to the elongation phase (26).
Relation to the multisubunit RNA polymerases.
The abortive synthesis phase of transcription initiation raises a
common problem among the structurally unrelated RNAPs of bacteriophage
T7, bacteria, and eukaryotes: How does the polymerase maintain contact
with the promoter while accommodating an increase in the size of the elongating
heteroduplex and progress down the template DNA? The sequence-specific
recognition of the promoter DNA is carried out by the s
factor in Escherichia coli RNA polymerase and transcription factor
IIB (TFIIB) (with other cofactors) in the eukaryotic RNA polymerase
II (30–32). Structural studies of the three RNAP families
have revealed that extension of the RNA transcript requires displacement
of a steric block during the transition from initiation to elongation (4,
33–35).
The steric block is caused by a region of the polymerase or associated
protein that is important for promoter recognition—the PBD in T7 RNAP,
the s factor in bacterial RNAP, or TFIIB in
eukaryotic RNAP. The intermediate structure of T7 RNAP reveals that as
the RNA chain lengthens during abortive synthesis, the obstacle is rotated
out of the way, which allows the polymerase to maintain promoter contacts
while enlarging the product site. Likewise, the extension of the transcript
is proposed to push a domain of the s protein
from the exit tunnel of bacterial RNAP (35, 36), leading
to promoter release, and the extension of the heteroduplex by yeast polymerase
II must displace TFIIB from the product binding site (34).
References and Notes
* 1. T. A. Steitz, Curr. Opin. Struct. Biol.
14, 4 (2004).
* 2. G. Q. Tang, R. Roy, T. Ha, S. S. Patel,
Mol.
Cell 30, 567 (2008).
* 3. R. A. Ikeda, C. C. Richardson, J. Biol.
Chem. 262, 3790 (1987).
* 4. G. M. T. Cheetham, T. A. Steitz, Science
286, 2305 (1999).
* 5. C. T. Martin, D. K. Muller, J. E. Coleman,
Biochemistry 27, 3966 (1988).
* 6. L. G. Brieba, R. Sousa, EMBO J. 20, 6826
(2001).
* 7. T. H. Tahirov et al., Nature 420, 43 (2002).
* 8. Y. W. Yin, T. A. Steitz, Science 298, 1387
(2002).
* 9. D. Temiakov et al., Proc. Natl. Acad. Sci.
U.S.A. 97, 14109 (2000).
* 10. J. Huang, R. Sousa, J. Mol. Biol. 303,
347 (2000).
* 11. K. Ma, D. Temiakov, M. Anikin, W. T. McAllister,
Proc. Natl. Acad. Sci. U.S.A. 102, 17612 (2005).
* 12. Q. Guo, D. Nayak, L. G. Brieba, R. Sousa,
J. Mol. Biol. 353, 256 (2005).
* 13. J. Guillerez, P. J. Lopez, F. Proux, H.
Launay, M. Dreyfus, Proc. Natl. Acad. Sci. U.S.A. 102, 5958 (2005).
* 14. R. P. Bandwar et al., J. Biol. Chem. 282,
22879 (2007).
* 15. S. Mukherjee, L. G. Brieba, R. Sousa, Cell
110, 81 (2002).
* 16. R. P. Bandwar, G. Q. Tang, S. S. Patel,
J. Mol. Biol. 360, 466 (2006).
* 17. M. Pal, D. S. Luse, Proc. Natl. Acad. Sci.
U.S.A. 100, 5700 (2003).
* 18. K. Theis, P. Gong, C. T. Martin, Biochemistry
43, 12709 (2004).
* 19. R. S. Turingan, K. Theis, C. T. Martin,
Biochemistry 46, 6165 (2007).
* 20. Materials and methods are available as
supporting material on Science Online.
* 21. G. M. T. Cheetham, D. Jeruzalmi, T. A.
Steitz, Nature 399, 80 (1999).
* 22. R. A. Ikeda, C. C. Richardson, Proc. Natl.
Acad. Sci. U.S.A. 83, 3614 (1986).
* 23. C. Place, J. Oddos, H. Buc, W. T. McAllister,
M. Buckle, Biochemistry 38, 4948 (1999).
* 24. L. G. Brieba, V. Gopal, R. Sousa, J. Biol.
Chem. 276, 10306 (2001).
* 25. P. E. Mentesana, S. T. Chin-Bow, R. Sousa,
W. T. McAllister, J. Mol. Biol. 302, 1049 (2000).
* 26. B. He, M. Rong, R. K. Durbin, W. T. McAllister,
J. Mol. Biol. 265, 275 (1997).
* 27. R. S. Turingan, C. Liu, M. E. Hawkins,
C. T. Martin, Biochemistry 46, 1714 (2007).
* 28. D. Nayak, Q. Guo, R. Sousa, J. Mol. Biol.
371, 490 (2007).
* 29. K. Ma, D. Temiakov, M. Jianh, M. Anikin,
W. T. McAllister, J. Biol. Chem. 277, 43206 (2002).
* 30. D. K. Muller, C. T. Martin, J. E. Coleman,
Biochemistry 28, 3306 (1989).
* 31. K. S. Murakami, S. A. Darst, Curr. Opin.
Struct. Biol. 13, 31 (2003).
* 32. S. Hahn, Nat. Struct. Mol. Biol. 11, 394
(2004).
* 33. H. T. Chen, S. Hahn, Cell 119, 169 (2004).
* 34. D. A. Bushnell, K. D. Westover, R. E. Davis,
R. D. Kornberg, Science 303, 983 (2004).
* 35. K. S. Murakami, S. Masuda, S. A. Darst,
Science 296, 1280 (2002).
* 36. D. G. Vassylyev et al., Nature 417, 712
(2002).
* 37. We thank the staff at the Advanced Photon
Source beamline 24-ID and at the National Synchrotron Light Source beamline
X25. We also thank T. Jorgensen for help with the supplemental movie and
the staff of the Center for Structural Biology core facility at Yale University.
This work was supported by NIH grant GM57510 (to T.A.S.). The coordinates
have been deposited in the Research Collaboratory for Structural Bioinformatics
Protein Data Bank with accession numbers 3E2E and 3E3J for the 7- and
8-nt RNA intermediate complex structures, respectively.
Supporting Online Material:
http://www.sciencemag.org/cgi/content/full/322/5901/553/DC1
Figure S1: Nucleic Acid Substrate.
Figure S2: Interactions between the Thumb Domain and Nucleic Acid.
Figure S3: Overlay of the H Subdomains.
Movie S1: A movie of T7 RNAP bound to DNA.
The 8nt RNA Transcript Intermediate Complex Structure
The structure of an initiation complex bound to promoter DNA and
an 8nt RNA transcript was solved at 6.7Å resolution. The experimental
design was similar to that of the 7nt RNA transcript complex. The 32bp
DNA substrate contained the 17bp promoter, followed by an 8nt transcription
bubble and 7bp of downstream duplex DNA (20) (Fig
S1A). This structure was solved using molecular replacement with the
C-terminal residues of the polymerase, followed by an additional search
for the PBD and 17bp promoter DNA which was necessary to include in the
search for the PBD due to its large scattering mass. Phases calculated
from a model excluding the active site nucleic acid contained electron
density for the active site DNA and RNA in a 2Fo-Fc electron density
map calculated to 6.7Å resolution (Fig S1B).
When a central helix in the PBD (residues 204-223) was left out during
calculation of another electron density map, electron density appears for
this helix in a different position from the 7nt RNA model. Comparison
of the 8nt intermediate complex structure with that of the 7nt intermediate
complex shows the PBD rotates about the same axis by an additional 6o
and moves 3-5Å further from the catalytic site.
Expression and Purification
The plasmid containing the His6-tagged P266L mutant polymerase was
a kind gift from the Dreyfus laboratory (1). The plasmid
was transformed into E. coli BL21(DE3) cells and grown in LB media
at 37ºC until an OD600 of 0.5. Expression was induced by
the addition of 1mM Isopropyl-beta-D-thiogalactopyranoside (IPTG)
4 hours
before harvesting. Protein purification proceeded immediately after
harvesting the cells, as freezing the cell pellet reduced the yield of
purified T7 RNAP. The cells were lysed in buffer containing 20mM Tris pH
8.0, 500mM NaCl, 5mM Imidazole and 0.2M Phenylmethylsulphonylfluroide (PMSF
- a protease inhibitor). The lysate was centrifuged at 18000rpm in an SS-34
rotor, for 1hr at 4ºC, before the clarified lysate was injected onto
a Ni-NTA column equilibrated in Binding Buffer (lysis buffer without PMSF).
Wash Buffer (Binding Buffer with 20mM Imidazole) was used to wash away
unspecific binding contaminants, and the RNAP was eluted from the column
with 100mM Imidazole. Although the His-tag allowed for the removal of the
majority of the contaminants, the Ni-NTA column was followed by a Mono-S
column. The sample was diluted into S1 Buffer without salt (5% Glycerol,
50mM HEPES pH 7.0, 1mM EDTA and 2mM DTT). The RNAP was eluted from the
Mono-S column using a gradient from 75mM NaCl to 1M NaCl over 10 column
volumes. The eluate was further purified on a Superdex G200 gel filtration
column equilibrated in 200mM NaCl and 20mM Tris pH 7.8. This protocol yields
about 15-20mg of purified protein from 8L of cell culture.
Crystallization and Data Collection
The DNA and RNA substrates for the 7nt intermediate complex were
prepared as follows.
The template strand (3’ATTATGCTGAGTGATATCCCTCACGCGTGCCGA)
and non-template strand (5’ TAATACGACTCACTATATTTCTGGCGCACGGCT)
containing a forced non-complementary transcription bubble
6nt in length (underlined), were mixed with synthetic RNA (Dharmacon)
(5’GGGAGUG) at concentrations of 0.7mM and 1.625mM respectively, and annealed
in annealing buffer (50mM Tris pH7.0, 20mM NaCl2, 10mM MgAcetate) by melting
the DNA at 75ºC for 5min and cooling to 20ºC over a 4 hour period.
The same protocol was used in preparing the substrates for the 8nt intermediate
complex. The substrates for the 8nt intermediate complex are as follows:
The template strand (3’ATTATGCTGAGTGATATCCCTCATTTTTGCCG),
the nontemplate strand (5’ TAATACGACTCACTATATTTCTGCCAAACGGC)
with the noncomplementary 8nt transcription bubble underlined,
and the synthetic RNA (Dharmacon)
(5’GGGAGUAA). The DNA/RNA mixture was then incubated with T7 RNAP
at room temperature for 10 minutes before crystallization experiments were
set up. Using the sitting drop method of vapor diffusion, the protein:nucleic
acid (125mM:162.5mM)
reaction was mixed at a 2:1 ratio with well solution (reaction:well solution)
and incubated against the well solution containing 16-18% PEG 6K, 0.1-0.2M
MgCl2, Tris pH 8.0-8.5.
Crystals were stabilized and cryoprotected by adding the cyroprotectant
to the crystal drop. The cryoprotectant solution consisted of 0.1-0.2M
MgCl2, Tris pH 8.0-8.5, 24% PEG 6K and 20% ethlyene glycol.
The crystals were frozen in liquid propane. Data were collected at the
Advanced Photon Source (APS) on the 24-ID beamline at Argonne National
Laboratory, and at the National Synchrotron Light Source (NSLS)
on the X25 beamline at the Brookhaven National Laboratory. The data were
indexed, integrated and scaled using the HKL suite of programs (2).
Structure Determination and Refinement
The structures were solved by molecular replacement using the program
PHASER (3) and search models that were created by removing
the nucleic acid from the existing T7 RNAP structures. The best PHASER
Z-score for the 7nt intermediate complex was from an initiation complex
search model, TFZ=14.8 with a LLG=148. The 7nt
intermediate complex was used as a search model for the 8nt intermediate
complex, followed by an additional search for the PBD + promoter
DNA. Model building was performed in the program COOT (4).
Refinement was performed using REFMAC (5). The initial
Rfree of the 7nt intermediate structure was 48.9%, which decreased to 37.6%
after further refinement of the protein model. Addition of nucleic acid
decreased the Rfree to 34.5% and the model was further refined. For the
8nt intermediate complex structure, rigid body refinement was performed
in REFMAC and NCS averaging was performed using DMMULTI (6).
Movie S1: A movie of T7 RNAP bound to DNA shows the
changes in the N-terminal subdomains and promoter DNA as an 8nt transcript
is synthesized. Solved crystal structures are indicated by referencing
the refinement R-factors and the PBD code. The downstream DNA from the
intermediate structures is modeled onto the initiation crystal structures
1CEZ and 1QLN during the beginning of the movie (shown in beige).
Coordinates for the other frames of the movie were made by interpolation
between the known crystal structures. The template strand is shown in dark
blue, the non-template strand is light blue,
and the RNA is red. The C-terminal domain
is shown as a white ribbon, and N-terminal subdomains are colored
as in Fig 1.
Supplemental Online Material References:
1. J. Guillerez, P. J. Lopez, F. Proux, H. Launay, M. Dreyfus, Proc.
Natl. Acad. Sci. U.S.A. 102, 5958 (2005).
2. Z. Otwinowski, W. Minor, Methods in Enzymology 276, 307 (1997).
3. A. J. G.-K. McCoy, R. W. Storoni, L. C. Read, R. J., Acta Crystallogr
D Biol Crystallogr 61, 458 (2005).
4. P. Emsley, K. Cowtan, Acta Crystallogr D Biol Crystallogr 60,
2126 (Dec, 2004).
5. G. N. Murshudov, Vagin, A. A., Dodson, E. J., Acta Crystallogr
D Biol Crystallogr 53, 240 (May 1, 1997).
6. CCP4, Acta Crystallogr D Biol Crystallogr 50, 760 (1994).
Figure S1: Nucleic Acid Substrate.
A. The sequence of the substrate observed in the 8nt intermediate complex structure. The polymerase has melted a 12nt transcription bubble. Shaded nucleotides are not seen in the electron density map.
B. A 6.7Å resolution electron density omit map of the nucleic acid component of the active site calculated using 2Fo-Fc as coefficients and contoured at 1.0s. A model of the template (blue) and the RNA (red) base-pairs outlined in (A) is superimposed on the map.
Figure S2: Interactions between the Thumb Domain and Nucleic Acid.
A. The thumb domain from the intermediate complex (pale orange) and elongation complex (white) are shown after alignment by superposition of their respective palm domains (residues 411-552). Arginine 394 from both complexes, (only the elongation complex residue is shown for simplicity), interacts with the phosphate backbone of the RNA chain (red) at residues 4 and 5 (counting the 5’ end of the RNA transcript as residue 1).
B. A view of the other side of the thumb helix after rotation by
180o shows tyrosine 385 making a stacking
interaction with a base on the non-template strand (green).
Figure S3: Overlay of the H Subdomains.
A. Comparison of the Subdomain H structures the initiation complex
(pale green) with the intermediate
complex (green) after orienting them
by superimposing the palm domains (residues 411-552), reveals little change
in its
structure. The C-terminal domain is shown in white.
In this detailed study of the 3D geometry of T3 transcription of
RNA, Kimberly Durniak, Scott Bailey, and Thomas Steitz demonstrate
a seven nt RNA species as necessary and sufficient for the transition
of gene transcription from the initiation stage to the elongation stage.
Geometric studies suggest an interaction of the RNA element with upstream
promoter elements, and a possible interaction with RNA enhancers
from cis
sites, including sigma-like sites.
1. Gingeras TR, "Mapping the strand-specific transcriptome of fission yeast".
2. Dutrow N, Nix DA, Holt D, Milash B, Dalley B, Westbroek E, Parnell
TJ, and Cairns BR,
"Dynamic transcriptome
of Schizosaccharomyces pombe shown by RNA-DNA hybrid mapping".
3. Veyrieras J-B, Kudaravalli S, Kim SY, Dermitzakis ET, Gilad Y,
Stephens M, and Pritchard JK,
"High-Resolution
Mapping of Expression-QTLs Yields Insight into Human Gene Regulation".
4. Wen B, Wu H, Bjornsson H, Green RD, Irizarry R, and Feinberg AP,
"Overlapping
euchromatin/heterochromatin- associated marks are enriched in imprinted
gene regions and predict allele-specific modification".
5. Ebisuya M, Yamamoto T, Nakajima M, and Nishida E,
"Ripples
from neighbouring transcription".
6. Hirota K, Miyoshi T, Kugou K, Hoffman CS, Shibata T, and
Ohta K,
"Stepwise
chromatin remodelling by a cascade of transcription initiation of non-coding
RNAs".
7. Cowell IG, Sunter NJ, Singh PB, Austin CA, Durkacz BW, Tilby MJ,
"gH2AX
Foci Form Preferentially in Euchromatin after Ionising-Radiation".
8. Tkacik G, Callan, CGJr , and Bialek W,
"Information flow
and optimization in transcriptional regulation".
9. Schwartz JC, Younger ST, Nguyen N-B, Hardy DB, Monia BP, Corey
DR, and Janowski BA,
"Antisense transcripts
are targets for activating small RNAs".
10. Place RF, Li L-C, Pookot D, Noonan EJ, and Dahiya R, "MicroRNA-373 induces expression of genes with complementary promoter sequences".
11. Zhu X, Ling J, Zhang L, Pi W, Wu M, and Tuan D, "A facilitated tracking and transcription mechanism of long-range enhancer function".
12. Frenster JH, and Hovsepian JA, "DNase-I Ultrastructural Probe Sites and Kissing Chromosomes".
Links to RNA and Biological Causality:
Links to
Euchromatin Activator RNA Reviews:
Links to
Euchromatin Activator RNA Research:
Links to Ultrastructural
Probes of DNase I-Sensitive Sites:
Links to
RNA as a Therapeutic Agent:
Links to Hodgkin Lymphoma
Immuno-Pathology:
Links to Activated
T-Lymphocyte Immunotherapy:
Links to Medical
Systems Biology:
Links to Selective
Gene Transcription:
Links to RNA-Induced
Epigenetics:
Links to RNA-Induced
Embryogenesis:
Links to RNA and
Biological Causality:
Links to Reprogramming
and Neoplasia:
A Brief History of Activator RNA:
"Ultrastructural Probes of Active DNA Sites, and the RNA Activators of DNA". (PowerPoint Presentation).