Supporting Information
Total Page:16
File Type:pdf, Size:1020Kb
Supporting Information She et al. 10.1073/pnas.1618370114 SI Materials and Methods data, and then the data within a progressively finer grid of subtiles Strains Used in This Study. The strains used in this study are as are each individually registered to the subimage in their local follows: query strain: S. cerevisiae BY4742 MATα vts1Δ::NatMX neighborhood. In this study, we did two levels of hierarchical can1Δ::STE2pr-Sp_his5 lyp1Δ his3Δ1 leu2Δ0 ura3Δ0 met15Δ0 registration, a coarse registration with a 4 × 4 grid followed by a (40); single deletion strains: BY4741 MATa vts1::KanMX his3Δ1 fine registration with a 16 × 16 grid. After this progressive dis- leu2Δ0 ura3Δ0met15Δ0; BY4741 MATa tor1::KanMX his3Δ1 crete registration is complete, a continuous function describing leu2Δ0ura3Δ0met15Δ0; BY4741 MATa rev3::KanMX the aberrations between platforms in both x and y is fit to a his3Δ1leu2Δ0ura3Δ0met15Δ0; DaMP strain: BY4741 MATa quadratic surface: tor2-DaMP his3Δ1leu2Δ0ura3Δ0met15Δ0 (34); double deletion ð Þ = 2 + 2 + + + + strains: BY4741 MATa vts1Δ::NatMX tor1::KanMX can1Δ:: fΔx x, y Axy Bxx Cxyx Dxy Exx Fx, STE2pr-Sp_his5 lyp1Δ his3Δ1leu2Δ0ura3Δ0met15Δ0; BY4741 MATa vts1Δ::NatMX tor2-DaMP can1Δ::STE2pr-Sp_his5 lyp1Δ 2 2 fΔyðx, yÞ = Ayy + Byx + Cyyx + Dyy + Eyx + Fy. his3Δ1leu2Δ0ura3Δ0met15Δ0; BY4741 MATa vts1Δ::NatMX rev3::KanMX can1Δ::STE2pr-Sp_his5 lyp1Δ his3Δ1leu2Δ0 Δ Δ Together, these functions constitute a continuous offset map, ura3 0met15 0. which is then applied to the data to achieve precise alignment of sequence data and experimental images. Construction of a Custom Microfluidic Imaging Platform. Internally, the MiSeq determines the sequence of each cluster by fluores- ′ Library Design and Construction. We used a standard Nextera li- cence imaging of reversibly labeled and 3 -terminated nucleotides brary preparation kit (Illumina) to enzymatically fragment the that are serially incorporated by a DNA polymerase as templated S. cerevisiae genome into 80- to 300-nt fragments. We used PCR by library constructs. Randomly arrayed clonal clusters on the “ ” to attach an 8-nt i5 barcode, an E. coli RNA polymerase (RNAP) flow cell are imaged in tiles, where each tile corresponds to a promoter, an RNAP stall sequence, and Illumina sequencing field of view along the length of the flow cell lane (41). Because adapters (Fig. S3A). This initial PCR was run with 0.5 μMof the MiSeq is not amenable to custom protocols, we use the fact each primer (IDT) and stopped before completion. The con- that tile number and cluster position are reported along with centration of DNA was quantified via qPCR and the PCR was each sequence to enable downstream assays that are read out further amplified to a final concentration of 4 nM using short P1/ on a separate, custom-built instrument, using the previously se- P2 primers that anneal to the 5′ ends. The final PCR was puri- quenced MiSeq flow cell as an ultrahigh-throughput array and a fied using AMPure XP beads (Beckman Coulter). A conceptu- substrate for subsequent in situ RNA generation. ally similar PCR protocol was used to amplify each of the 3 SRE The custom imaging station was built from repurposed com- variants from a parent vector kindly provided by C. Smibert, ponents harvested from an Illumina Genome Analyzer II (GAII). University of Toronto, Toronto (https://benchling.com/s/EHytJEzc/ The GAII is a previous-generation sequencer that, due to limi- edit). Primers were chosen such that an additional Nextera adapter tations in read length, speed, reagent availability, as well as the sequence was added on each end to match the fragmented genomic large amount of hands-on time required for operation, is all but library. The SRE library preparation was added to the genomic defunct and is rapidly being phased out in favor of newer se- library prep at a molar ratio of 1:1,000. quencers that are more convenient and economical. Although pioneering sequencing-flow cell-array experiments done by our Sequencing Amplified Libraries. The library was sequenced using a group and others have used the GAII as both a platform for paired-end 2 × 75 MiSeq Reagent Kit, version 3, at a cluster sequencing and subsequent assays (28, 29, 42), the Genome density of 946,000 per mm2 with 95% clusters PF and 97% ≥ Analyzer is likely not a practical option for further development Q30. The flow cell was removed from the sequencer before the of these high-throughput experiments. However, as outdated standard postrun wash step and stored in its original storage GAII sequencers are decommissioned and often available for buffer for up to 1 mo. All FASTQ files from individual barcodes repurposing, many high-quality components still within their were provided as input for the software analysis pipeline. usable service life are readily available. We built an instrument that accepts previously sequenced Generation of a Transcribed Genome Array. Double-stranded DNA MiSeq flow cells, interfacing with the fluidics, allowing thermal (dsDNA) clusters on the MiSeq flow cell were denatured with control, and enabling fluorescence imaging. Using separate in- formamide at 55 °C to remove all fluorescent nucleotide analogs. struments for sequencing and downstream assays allows maximum Following denaturation, we confirmed no residual fluorescence flexibility for custom protocols, for example, RNA generation and from the sequencing. To regenerate dsDNA with standard nu- binding. This instrument was built combining many components cleotides, we annealed a 5′ biotinylated primer to the 3′ se- from used GAII instruments with custom-engineered retrofit quencing adaptor and resynthesized dsDNA using Klenow DNA components, electronics, and software (Table S1). polymerase (1× NEB buffer 2, 250 μM dNTP mix, 0.1 units/μL Just as in the MiSeq, our custom imaging station images the NEB Klenow, 0.01% Tween 20) incubated for 30 min at 37 °C. flow cell in tiles along the length of the lane. To associate the We then flowed in 1 μM RNase-free streptavidin to bind to the signal from each cluster in the image with its corresponding se- 5′ biotinylated primer and passivized with a 5 μM biotin wash. To quence, cross-correlation methods are used to register tile images block and quantify all potential single-stranded DNA, we from the custom imaging station to the sequence data positions annealed an Alexa 647-labeled oligo complementary to the for each tile from the MiSeq. However, nonaffine optical aber- constant stall sequence. As a control, we flowed in 10 nM Vts1 at rations between the two platforms make simple translation (by the this stage and observed no binding to DNA. We then incubated offset indicated by the cross-correlation peak) insufficient for the dsDNA with a transcription initiation mix containing sigma- precise registration. To this end, we use a hierarchical registration saturated RNAP and 3 nt at 2.5 μM(1× T7A1 reaction buffer method where the entire image is first globally registered to the [20 mM Tris, 20 mM NaCl, 7 mM MgCl2, 0.1 mM EDTA, 0.1% She et al. www.pnas.org/cgi/content/short/1618370114 1of13 BME, 0.03 mg/mL BSA, 2.14% (vol/vol) glycerol], 25 μM of each DTT) (Fig. S7A). Concentration of labeled protein was measured NTP [ATP, GTP, and UTP], 125 U/mL RNAP [sigma-saturated using A280 absorbance and was corrected for dye absorbance. holoenzyme from NEB], and 0.01% Tween-20) for 10 min at 37 °C. In this buffer, RNAP initiated onto dsDNA clusters and Vts1 Binding Experiments on the TGA. SNAP-Surface 549 labeled progressed to the first cytosine where it stalled. These initial Vts1 was diluted in binding buffer (20 mM Tris·HCl, pH 7.4, 26 bases of transcription were sufficient to keep the RNAP 150 mM NaCl, 0.01% Tween 20, 5 mM MgCl2, 0.1 mg/mL BSA) bound to the dsDNA, but short enough such that the footprint to obtain a twofold protein dilution series from 1 to 128 nM. The from the stalled RNAP occluded the initiation site from addi- MiSeq flow cell was first imaged in the green channel under tional RNAP. Excess RNAP was washed from solution with the buffer-only conditions (zero concentration point) and subse- original transcription initiation mix minus RNAP. Finally, ex- quently after flowing protein at increasing concentrations. At ∼ – tension buffer (1 mM all four NTPs in 1× T7A1 reaction buffer) each concentration, we took nine images over a period of 40 was added for 5 min at 37 °C to allow transcription to proceed. 45 min to empirically verify that equilibrium had been reached. After transcription, RNAP was terminally stalled at the biotin– Following binding at the highest concentration (128 nM of la- streptavidin roadblock, generating a stable RNAP-mediated beled protein), a washout experiment was conducted by flushing DNA–RNA tether. binding buffer containing no protein into the flow cell and re- peated imaging at 12 time points spanning 800 min in a geo- Vts1 Protein Purification, Labeling, and Quantification. Gibson metric spread over time. cloning was used to insert the RNA binding domain of Vts1 (442– A fraction of the fluorescence signal was often present at the 523) into a pET vector containing C-terminal SNAP and 6×His end of the experiments on clusters positive for binding, suggesting tags (https://benchling.com/s/yBCqWM/edit). The resultant plas- the possibility of incubation time-dependent off-rates or a per- manently bound fraction, which is an area of future character- mid [Vts1(442–523)-SNAP-His6] was transformed into E. coli BL21(DE3) cells. Following induction trials, a 2-L culture derived ization. Before fitting of Kd, therefore, we normalized this from a single transformant was grown at 37 °C in Luria–Bertani residual signal such that the fraction expected for each in- cubation time was proportional to the incubated protein con- medium containing 50 μg/mL kanamycin until the OD600 reached 0.6.