By DNA Microarrays
Total Page:16
File Type:pdf, Size:1020Kb
BBRC Biochemical and Biophysical Research Communications 322 (2004) 347–354 www.elsevier.com/locate/ybbrc Genome-scale identification of conditionally essential genes in E. coli by DNA microarrays Xin Tonga, John W. Campbellb,Ga´bor Bala´zsia, Krin A. Kaya, Barry L. Wannerc, Svetlana Y. Gerdesb, Zolta´n N. Oltvaia,* a Department of Pathology, Northwestern University, Chicago, IL 60611, USA b Integrated Genomics Inc., Chicago, IL 60612, USA c Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA Received 9 July 2004 Available online 6 August 2004 Abstract Identifying the genes required for the growth or viability of an organism under a given condition is an important step toward understanding the roles these genes play in the physiology of the organism. Currently, the combination of global transposon muta- genesis with PCR-based mapping of transposon insertion sites is the most common method for determining conditional gene essen- tiality. In order to accelerate the detection of essential gene products, here we test the utility and reliability of a DNA microarray technology-based method for the identification of conditionally essential genes of the bacterium, Escherichia coli, grown in rich med- ium under aerobic or anaerobic growth conditions using two different DNA microarray platforms. Identification and experimental verification of five hypothetical E. coli genes essential for anaerobic growth directly demonstrated the utility of the method. How- ever, the two different DNA microarray platforms yielded largely non-overlapping results after a two standard deviations cutoff and were subjected to high false positive background levels. Thus, further methodological improvements are needed prior to the use of DNA microarrays to reliably identify conditionally essential genes on genome-scale. Ó 2004 Elsevier Inc. All rights reserved. Keywords: Essential gene; Transposon mutagenesis; DNA microarray; E. coli; Aerobic and anaerobic conditions With the rapidly increasing numbers of fully se- In recent years, a number of new experimen- quenced genomes, there is an increasing need for a com- tal approaches have been developed to determine gen- prehensive understanding of the roles of the thousands ome-wide gene essentiality. These include systematic of genes that make each organism unique. However, knockouts in Saccharomyces cerevisiae [2,3] and Caenor- the function of more than 35% of the genes in even habditis elegans [4], RNA interference in C. elegans [5], the most closely studied model organisms, such as Esch- and genetic footprinting in several microorganisms erichia coli, remains poorly understood, while their indi- [1,6,7]. Standard genetic footprinting consists of three vidual analysis represents a substantial challenge. An main steps that include random transposon mutagenesis important step toward understanding the function of of a large number of cells, selective outgrowth of the uncharacterized genes is to identify which gene plays mutagenized population, and determination of transpo- an essential role in cell growth and survival, and under son insertion sites in the genome using PCR and gel elec- what conditions such genes are essential [1]. trophoresis [8,9]. Although this is a rather precise, semi- automatic analysis, transposon insertions are detected individually using position-specific PCR primers fol- * Corresponding author. Fax: +1 312 503 8240. lowed by electrophoretic resolution of the amplification E-mail address: [email protected] (Z.N. Oltvai). products on agarose gels. Thus, the readout phase of 0006-291X/$ - see front matter Ó 2004 Elsevier Inc. All rights reserved. doi:10.1016/j.bbrc.2004.07.110 348 X. Tong et al. / Biochemical and Biophysical Research Communications 322 (2004) 347–354 this process is time- and labor-intensive, which substan- LB agar containing 3.2mM sodium sulfide, using Resazurin (0.2mg/ tially limits the utility of the approach. 100ml) as a redox indicator [19]. DNA microarrays have been used for genome-wide Construction of E. coli MG1655 microarrays. We used two different microarrays for the experiments. For the construction of the first set of monitoring of gene expression [10], DNA copy-number microarrays, unique cDNA fragments corresponding to each predicted changes [11], and the detection of DNA–protein interac- ORF of E. coli MG1655 were produced by PCR amplification from tion [12]. Recently, several attempts to accelerate the genomic DNA. To ensure that each primer pair would amplify the transposon insert detection phase of genetic footprinting most unique 200–350 base pair region within each ORF, the PCR by utilizing DNA microarrays for detecting transposon amplification product sequences were compared with all others in the genome by BLAST analysis [20]. Sequences producing the minimum E insertions have been reported [13–15]. The general strat- values, relative to all other ORFs in E. coli, were selected. Of the 4485 egy involves digesting transposon-mutagenized genomic E. coli ORFs listed in the ERGO database [21], unique primers were DNA and adding a linker to the end of the DNA frag- found for 4442 ORFs (sequences available upon request). Most of the ments to serve as a priming site for subsequent PCR remaining ORFs represent genes that are shorter than 240bp. Thus, amplification. DNA sequences within the transposon each predicted ORF of E. coli MG1655 was represented by a fragment predicted to minimally cross-hybridize with other E. coli MG1655 serve as the corresponding matching primer sites. sequences. These fragments varied in length from 200 to 350bp, with a Mutagenized genomic DNAs from different growth con- median length of 300bp. ditions are fluorescently labeled and compared by After two rounds of PCR amplification the final products were hybridization to DNA microarrays. purified on 384-well format ArrayIt PCR purification Kits (TeleChem In this study, we report the results of a genetic foot- International). PCR products were resuspended in 15ll spotting buffer (3· SSC (1· SSC is 0.15M NaCl plus 0.015M sodium citrate), 1.5M printing approach to identify E. coli MG1655 genes that betaine) and printed in triplicate onto amino-alkylsilane coated slides are selectively essential under aerobic vs. anaerobic (Sigma) with an OmniGrid arrayer (Gene Machines) equipped with growth conditions by using two different custom-built Telechem SMP3 split pins. The DNA was cross-linked to the slides E. coli DNA microarrays. Our results demonstrate that with UV light and baked in a vacuum oven at 60°C for at least two although this approach provides high throughput read- hours. Residual salt and unbound DNA were removed by rinsing the slides with 0.5% sodium dodecyl sulfate and water. The slides were out of putative essential and non-essential genes, the stored desiccated at RT until use. assortment of essentiality is affected by the large number The second set of microarrays containing full-length E. coli ORFs of false positive data points. However, the procedure did was printed as previously described [22] and was augmented to contain allow us to hypothesize that the genes for a number of stable RNAs as well as various other controls, e.g., E. coli genomic hypothetical proteins were essential for anaerobic DNA. Generation of sample RNA. Five micrograms of mutagenized growth, some of which were confirmed experimentally. genomic DNA was partially digested with HinP1 I or HpyCH4 IV Thus, although the method lacks precision it can be use- (New England Biolabs), 0.5llof10U/ll for 15min at 37°C. Frag- ful for discovering conditionally essential genes. ments from both enzymesÕ digestion (larger than 500bp) were purified from a 1.2% agarose gel with a QIAquick Gel Extraction Kit (Qiagen). The resulting size-selected, purified DNA fragments were ligated to a Y-shaped adaptor modified from Badarinarayana et al. [13] (CGG Materials and methods ACGCTACGTCCGTGTTGTCGGTCCTG and ACTACGCACCG GACGAGACGTAGCGTC) using T4 DNA ligase (Invitrogen) at Bacterial strain, growth conditions, and transposon mutagenesis. 16°C overnight. Escherichia coli strain MG1655 (FÀ, kÀ, ilvG, rfb50, rph1) [16] was used After the Y-shaped adaptor was ligated to the transposon-mutag- throughout this work. Generation of the transposon mutant library enized DNA fragments, the DNA was re-digested with PflM I (New and outgrowth of the mutagenized population were performed, as England Biolabs), 1llof8U/ll for 1h at 37°C. A 1ll aliquot of the described [17]. Briefly, a transposon library was constructed by incu- digest was used as template in two separate PCRs containing 0.5lM bating transposon DNA EZ::TN<KAN-2> with the hyperactive Tn5 the adaptor primer (ACTACGCACCGGACGA) and 0.5lM of one of EZ::TN transposase [18] (Epicentre Technologies) to form a transpo- the transposon-specific primers (primer 1: GTTCCGTGGCAAAG some complex, which was subsequently transformed to electrocompe- CAAAAGTTCAA or primer 2: CCGACATTATCGCGAGCCC tent E. coli cells by electroporation. Cultures were immediately diluted ATTTAT), 0.2mM dNTP mix, PCR buffer (Invitrogen), and 0.5llof with LB-based rich medium with supplements, incubated at 37°C for 5U/ll Taq polymerase (Invitrogen). Cycling conditions were 95°C for 40min, and then used to inoculate a BIOFLO 2000 fermentor (New 1min; 25 cycles of 94°C for 30s, 55°C for 30s, and 72°C for 1min; and Brunswick Scientific) containing the same medium supplemented with 72°C for 5min. Two microliters of the undiluted PCR was used as kanamycin (10lg/ml). For aerobic growth, dissolved oxygen was held template