Cryo-EM Structure of the Replisome Reveals Multiple Interactions Coordinating DNA Synthesis
Total Page:16
File Type:pdf, Size:1020Kb
Cryo-EM structure of the replisome reveals multiple interactions coordinating DNA synthesis Arkadiusz W. Kulczyka,1, Arne Moellerb,2, Peter Meyera, Piotr Sliza, and Charles C. Richardsona,1 aDepartment of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115; and bDepartment of Molecular Biology and Genetics, The Danish Research Institute of Translational Neuroscience, Aarhus University, 8000 Aarhus, Denmark Contributed by Charles C. Richardson, January 27, 2017 (sent for review December 7, 2016; reviewed by James M. Berger and Nicholas E. Dixon) We present a structure of the ∼650-kDa functional replisome of with genetically altered gp4 and gp5, trx, and DNA resembling a bacteriophage T7 assembled on DNA resembling a replication fork. replication fork in buffer containing dTTP (Fig. 1A). Gp4 contains A structure of the complex consisting of six domains of DNA heli- an amino acid substitution, E343Q. Glutamate 343 functions as a case, five domains of RNA primase, two DNA polymerases, and catalytic base for hydrolysis of dTTP, an event that leads to trans- two thioredoxin (processivity factor) molecules was determined location on DNA. A crystal structure of the hexameric gp4 reveals by single-particle cryo-electron microscopy. The two molecules of that E343 is in position to activate the water molecule for a nu- DNA polymerase adopt a different spatial arrangement at the repli- cleophilic attack on the nucleoside γ-phosphate (16). Consequently, cation fork, reflecting their roles in leading- and lagging-strand syn- this altered gp4 (gp4-E343Q) binds dTTP but does not catalyze thesis. The structure, in combination with biochemical data, reveals hydrolysis on DNA to which it is tightly bound. The genetically molecular mechanisms for coordination of leading- and lagging- altered gp5 contains two amino acid substitutions, D5A and E7A, in strand synthesis. Because mechanisms of DNA replication are highly the active site of the 3′-5′ exonuclease domain. The altered protein conserved, the observations are relevant to other replication systems. exhibits a polymerization activity similar to the activity of wild-type DNA polymerase, but its exonuclease activity is reduced by a factor cryo-EM structure | replisome | coordination of leading- and of 106 (10). lagging-strands synthesis | DNA replication | DNA polymerase The DNA consists of a 77-nt oligonucleotide containing two self-complementary regions that anneal to form a tetra-loop and he reverse polarity and antiparallel nature of the two strands a 5-bp double-helical region in the middle of the sequence with a Tin duplex DNA pose a topological problem for their simul- 32-nt 5′-overhang and a 31-nt 3′-overhang (Fig. 1A). These two taneous synthesis. The “trombone” model of DNA replication overhangs mimic the lagging and the leading strand, respectively. postulates that the lagging strand forms a loop such that the Although the sequence is capable of forming intermolecular leading- and lagging-strand replication proteins contact one an- polymers as well as the desired intramolecular monomeric other (1). This replication loop contains a nascent Okazaki species, the monomeric form is favored by inclusion of the tetra- fragment and allows for coordination of leading- and lagging- loop sequence. The tetra-loop forms a highly stable turn confor- strand synthesis. The bacteriophage T7 replisome has been mation only in the monomeric form that we previously confirmed studied extensively (2). Thus, the macromolecular complex of T7 by 1D and 2D 1H-NMR (17). We assessed the folding prop- replication proteins provides an excellent model for structural erties of an oligonucleotide resembling a replication fork using analysis of a replisome. Notably, the human mitochondrial and the T7 replication systems are similar (3). Significance The phage T7 replisome is relatively simple. Four proteins are sufficient for reconstitution of the functional replisome, yet the The antiparallel nature of the two strands in duplex DNA poses assembled replisome recapitulates all of the key features of more a topological problem for their simultaneous synthesis. The complex prokaryotic and eukaryotic systems (2, 4). These pro- “trombone” model of the replication fork postulates that the teins are the following: DNA polymerase (gp5) and its proc- Escherichia coli lagging-strand forms a loop such that the leading- and lagging- essivity factor, thioredoxin (trx), single-stranded strand replication proteins contact one another. The replisome DNA (ssDNA)-binding protein (gp2.5), and the bifunctional then can move in one direction along the DNA while synthe- DNA primase-helicase (gp4). sizing both strands. Physical interactions between the replica- Gp4 forms multiple oligomeric forms (5). The negative stain tion proteins and DNA coordinate processive synthesis of the EM revealed that hexamers and heptamers are the most abun- leading and lagging strands. Here, we present the structure ∼ ∼ dant ( 22% and 78%, respectively). Upon addition of ssDNA, of a functional replisome from bacteriophage T7. Our struc- the equilibrium between the two oligomeric forms changes tural and biochemical analyses provide an explanation of the ∼ ∼ ( 77% protein rings are now hexameric, and 18% are hepta- mechanisms governing coordination of leading- and lagging- meric), indicative of DNA binding (6). The hexamer is an en- strand synthesis. zymatically active form of gp4. It binds to ssDNA, hydrolyzes nucleotides, translocates on ssDNA, and unwinds dsDNA (7–9). Author contributions: A.W.K. and C.C.R. designed research; A.W.K. performed research; In contrast, the heptamer does not bind to ssDNA (6). A.W.K., A.M., P.M., and P.S. contributed new reagents/analytic tools; A.W.K. and C.C.R. Crystal structures of all of the individual T7 replication pro- analyzed data; and A.W.K. and C.C.R. wrote the paper. teins have been determined (10–15). However, to date there is Reviewers: J.M.B., Johns Hopkins University School of Medicine; and N.E.D., University of Wollongong. no structural information on the T7 replisome or its subassemblies. Consequently, intermolecular interactions involved in coordination The authors declare no conflict of interest. of DNA synthesis have not been identified. We have therefore used Data deposition: The cryo-EM map of the replisome was deposited in the EM Data Bank, www.emdatabank.org (accession no. EMD-8565). cryo-electron microscopy (cryo-EM) and single-particle recon- 1To whom correspondence may be addressed. Email: [email protected] or ccr@hms. struction methods to determine a structure of the T7 replisome. harvard.edu. 2Present address: Department of Structural Biology, Max Planck Institute of Biophysics, Results 60538 Frankfurt, Germany. Replisome Assembly. We have identified conditions necessary to as- This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. semble the T7 replication complex. A stable complex was obtained 1073/pnas.1701252114/-/DCSupplemental. E1848–E1856 | PNAS | Published online February 21, 2017 www.pnas.org/cgi/doi/10.1073/pnas.1701252114 Downloaded by guest on September 25, 2021 PNAS PLUS BIOPHYSICS AND COMPUTATIONAL BIOLOGY Fig. 1. cryo-EM analysis and assembly of the replisome. (A) A fork-shaped DNA is formed by folding of ssDNA such that a stable tetra-loop is formed in the middle of the sequence flanked by a short double-helical region; two oligonucleotides are annealed to serve as primers. (B and C) Melting profiles of the tetra-loop DNA. Samples containing different concentrations of the fork-shaped DNA are represented by colored curved lines. Melting curves (B) and their first derivatives (C) show a biphasic transition with the melting temperatures (Tm)of53± 2°C.(D) Fluorescence anisotropy measurements of replisome formation. The replication complex assembled in the presence of gp4-E343Q is four times more stable than the complex assembled with wild-type gp4. Apparent binding constants are 0.2 ± 0.01 μM and 0.8 ± 0.05 μM, respectively. (E) A representative cryo-EM image. (Scale bar, 10 nm.) (F) The 2D class av- erages. (Scale bar, 10 nm.) (G) A calculated 3D map of the replisome. The initial model for the 3D calculation was generated using the atomic coordinates of the helicase domain of gp4 filtered to a resolution of 60 Å. (H) Fourier Schell Correlation (FSC). The resolution of 13.8 Å is based on the gold-standard FSC0.5 criterion. (I) Comparison of reprojections from the 3D structure with the 2D class averages. (Scale bar, 10 nm.) Kulczyk et al. PNAS | Published online February 21, 2017 | E1849 Downloaded by guest on September 25, 2021 fluorescence melting spectroscopy in the presence of a fluores- The initial cryo-EM analysis revealed that ∼95% of helicase cent dye that intercalates into double-stranded DNA. The de- rings are hexameric and ∼5% are heptameric. Thus, the initial crease in fluorescence as a function of increasing temperature model for the 3D calculation was generated using the atomic suggests folding of the template and formation of the duplex coordinates of a hexameric helicase domain of gp4 [Protein (Fig. 1B). The melting profiles show a biphasic transition in- Data Bank (PDB) ID code 1CR0] filtered to a resolution of 60 Å dicative of formation of a single monomeric species in solution (Fig. 1G). Fig. 1F displays representative 2D class averages. We with a melting temperature of 53 ± 2°C(Fig.1C). In the control clearly see additional densities associated with the hexameric experiment, a similar DNA lacking a stable tetra-loop sequence ring of DNA helicase likely representing DNA polymerases does not exhibit a biphasic melting transition (Fig. 1B). The bound to the replisome. To confirm that these densities are gp5/trx 5′-overhang of the folded template contains the primase recog- complexes, we labeled thioredoxin with 5 nm Ni-NTA Nanogold nition sequence 5′-GGGTC-3′. The lagging-strand primer is a (SI Appendix, Fig. S3). Thioredoxin was altered by addition of a tetraribonucleotide complementary to the primase recognition poly-His tag at the N terminus.