299385677-Oa

299385677-Oa

RESEARCH ◥ volves nucleophilic attack by the 3′ ends of the RESEARCH ARTICLE viral DNA fragment, called a protospacer, at each end of the repeat (Fig. 1A) (7). Half-site inter- mediates form when one of the two protospacer CRISPR BIOLOGY DNA ends attacks the CRISPR locus integration site, and these can either progress to full-site integration products or be disintegrated, leaving Structures of the CRISPR genome thetargetsequenceintact(7, 8). To ensure effective acquisition of new immu- nity and avoid deleterious insertions into the integration complex genome, integration by Cas1-Cas2 must be highly 1 1,2 1 3 specific for the CRISPR locus. In the type I CRISPR Addison V. Wright, * Jun-Jie Liu, * Gavin J. Knott, Kevin W. Doxzen, system from Escherichia coli,acquisitionrequires 1,2,4 1,2,3,4,5,6,7† Eva Nogales, Jennifer A. Doudna sequences spanning the leader-repeat junction, as well as an inverted repeat motif in the repeat CRISPR-Cas systems depend on the Cas1-Cas2 integrase to capture and integrate short foreign (8–11). IHF (integration host factor), a histone- DNA fragments into the CRISPR locus, enabling adaptation to new viruses. We present crystal like protein, binds in the leader and assists in structures of Cas1-Cas2 bound to both donor and target DNA in intermediate and product recruiting Cas1-Cas2 to the leader-proximal re- – integration complexes, as well as a cryo electron microscopy structure of the full CRISPR locus peat, possibly involving a secondary upstream integration complex, including the accessory protein IHF (integration host factor). The binding site (10, 12, 13). The mechanism by which structures show unexpectedly that indirect sequence recognition dictates integration site Cas1-Cas2 recognizes these sequences has thus selection by favoring deformation of the repeat and the flanking sequences. IHF binding bends far been unknown. Downloaded from the DNA sharply, bringing an upstream recognition motif into contact with Cas1 to increase both Here we present structures of the Cas1-Cas2 the specificity and efficiency of integration. These results explain how the Cas1-Cas2 CRISPR CRISPR integrase bound to both substrate and integrase recognizes a sequence-dependent DNA structure to ensure site-selective CRISPR target DNA in intermediate and product inte- array expansion during the initial step of bacterial adaptive immunity. gration states. We also present a structure of the entire natural integration complex, including Cas1- RISPR-Cas (clustered regularly interspaced sequence preceding the first CRISPR repeat gives Cas2, the DNA substrate, and a 130–base pair DNA http://science.sciencemag.org/ short palindromic repeats–CRISPR associ- rise to precursor CRISPR transcripts that are target sequence in complex with IHF. These struc- ated) bacterial adaptive immune systems processed and used to recognize viral nucleic tures show how specificity for the CRISPR repeat C store fragments of viral DNA in the CRISPR acids by base-pairing with complementary se- relies on target DNA deformation to allow access array, a genomic locus comprising direct quences. Bacteria acquire immunity to new viruses to both Cas1 integrase active sites. In addition sequence repeats of ~20 to 50 base pairs, sep- when the CRISPR integrase, a heterohexameric to recruiting a secondary recognition site, IHF arated by virally derived spacer sequences of complex of four Cas1 and two Cas2 proteins, sharply bends the target DNA adjacent to the similar length (1–4). In most systems, a tran- inserts new viral DNA at the first CRISPR repeat integration site, favoring integrase binding to this scriptional promoter located in an AT-rich leader after the leader sequence (5–7). Integration in- locus and thereby suppressing off-target integration. Fig. 1. Half-site binding by Cas1-Cas2. protospacer Cas2 (A) Cartoon of integration by Cas1- Cas1 LR S on March 1, 2020 Cas2. Crystallography substrates are target-bound [Cas1-2] shown next to the corresponding reac- (nM): input0 10 1001000 Leader Repeat Spacer tion intermediate, with nucleotide nt spacer lengths indicated. Red asterisks repre- K259 60 33 50 sent integration events. (B)Cartoon K259 half-site 40 repeat and surface representations of the half- 5528 K12 site substrate bound by Cas1-Cas2. K12 N63 K38 30 DNA is colored as in (A). A substrate N63 16 16 R40 K38 20 schematic is shown above, with full-site disordered regions shown as dashed R40 pspacer lines. (C) Close-up of backbone 10 interactions between Cas1-Cas2 and half-site repeat DNA. Polar contacts are protospacer shown as dotted lines. (D) Hydroxyl- Cas1 radical footprinting of radiolabeled Cas1 LR S half-site DNA. The input is untreated [Cas1-2] DNA.The substrates are shown above the Cas2 (nM): input0 10 1001000 gel, with the radiolabel indicated with a 90˚ nt 60 leader red circle (L, leader; R, repeat, S, spacer). leader 50 Cas1 Cas1 Regions of the gel corresponding to repeat spacer 40 the leader, repeat, spacer, and proto- repeat 30 spacer (pspacer) are indicated alongside the gel. The inverted repeat regions of 20 the repeat are boxed. nt, nucleotides. Single-letter abbreviations for the amino 90˚ spacer acid residues are as follows: A, Ala; D, 10 Asp; E, Glu; F, Phe; H, His; K, Lys; N, Asn; Q, Gln; R, Arg; S, Ser. Wright et al., Science 357, 1113–1118 (2017) 15 September 2017 1of6 RESEARCH | RESEARCH ARTICLE These results suggest an unexpected mechanism of target recognition with implications for the engineering of the CRISPR integrase as a genome- tagging tool. Target binding in the half-site intermediate To determine the mechanism by which Cas1- Cas2 recognizes its target sequence, we crys- tallized the integrase bound to DNA substrates representing a half-site integration intermediate and the full-site integration product (Fig. 1A). The full-site product mimic, which we term the pseudo–full-site substrate, was designed with a break in the middle of the protospacer to allow Cas1-Cas2 to access the repeat (Fig. 1A). Both substrates bound to Cas1-Cas2 with high affinity (fig. S1). The half-site–bound structure, refined at 3.9-Å resolution, revealed an overall complex architecture similar to that of the previously solved protospacer-bound structures (Fig. 1B, fig. S2, and table S1) (14, 15). A Cas2 dimer sits Downloaded from at the center of two Cas1 dimers, with the proto- spacer DNA stretching across the flat back of the complex. The first 18 base pairs of the repeat sequence bind across a central channel formed by Cas2 and the noncatalytic Cas1 monomers, with the leader-repeat junction positioned across http://science.sciencemag.org/ a Cas1 active site (Fig. 1B and fig. S3, A and B). Seven nucleotides of the spacer-proximal repeat are unresolved, whereas the repeat-spacer junc- Fig. 2. Pseudo–full-site binding by Cas1-Cas2. (A)Overviewofpseudo–full-site substrate binding tion binds at the distal Cas1 active site. Basic by Cas1-Cas2. In the second view, the expected path of the disordered DNA is shown as dashed lines. A residues on both Cas2 (K38 and R40) and the schematic of the substrate is shown above, with the disordered region as dashed lines. (B)Aviewof noncatalytic Cas1 monomers (K12 and K259) are minor groove insertion by a-helix 7. Dotted lines in the close-up show polar contacts. The sequence positioned to contact the phosphate backbone of of the leader-repeat junction and residue numbering are shown above. Residues are numbered such 15 the midrepeat DNA (Fig. 1, B and C) ( ). Charge- that the final residue of the leader is –1 and the first residue of the repeat is 1. (C)Agarosegelofa swap mutations of these residues reduce or representative in vivo acquisition assay with indicated Cas1 mutants and wild-type Cas2. Acquisition eliminate acquisition of new spacers in vivo, con- results in expansion of the CRISPR array, which is visible as larger bands above the parental locus. The firming their importance for the CRISPR inte- on March 1, 2020 H208A active-site mutant is a negative control. bp, base pairs; WT, wild type. gration reaction (fig. S4A). Although earlier work suggested that inverted sequence motifs in the repeat might form a cru- groove. To test for contacts in solution, we per- and spacer-adjacent integration sites are clearly ciform structure during target recognition, our formed hydroxyl-radical footprinting of the half- resolved, whereas the middle of the repeat is dis- structure shows that the center of the repeat re- site substrate bound by the complex (Fig. 1D). ordered, suggesting that the repeat disengages mains a canonical duplex at this intermediate Protection of the backbone is evident in the proto- from Cas2 after full integration (Fig. 2A and fig. stage of integration (7, 16, 17). Although the spacer, including in the single-stranded end where S3, C and D). Previous crystal structures have sug- inverted repeat sequences are critical for spacer the DNA binds in a channel of Cas1. Only weak gested that the Cas1 a-helix 7 might interact with acquisition, we found no evidence of sequence- protection occurs near the ends of the repeat on targetDNA,andweindeedobservedinsertionof specific contacts in these motifs (Fig. 1C) (9, 11, 18). the nonintegrated target strand and largely does this helix into the minor groove of both the leader Contacts between the midrepeat DNA and the not overlap with the inverted repeats. Several hy- and spacer regions of the target DNA (Fig. 2B) (14). integrase proteins are limited to nonspecific back- persensitive nucleotides are apparent at the begin- The terminal residues of the leader sequence bone interactions, with no regions of Cas1 or Cas2 ning of the second inverted repeat even in the contribute to integration efficiency, and our struc- positioned to insert into either the major or minor absence of protein, suggesting that these nucle- ture reveals that several residues make hydrogen otides exhibit increased flexibility or a distorted bonds with the minor-groove face of leader bases conformation in solution.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    6 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us