Published online 26 October 2016 Nucleic Acids Research, 2017, Vol. 45, Database issue D679–D686 doi: 10.1093/nar/gkw997 GenomeCRISPR - a database for high-throughput CRISPR/Cas9 screens Benedikt Rauscher1,†, Florian Heigwer1,†, Marco Breinig1, Jan Winter1 and Michael Boutros1,2,*
1German Cancer Research Center (DKFZ), Division Signaling and Functional Genomics and Heidelberg University, Department of Cell and Molecular Biology, Medical Faculty Mannheim, 69120 Heidelberg, Germany and 2German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
Received August 06, 2016; Revised October 12, 2016; Editorial Decision October 13, 2016; Accepted October 14, 2016
ABSTRACT biological systems – an essential requirement in understand- / ing how genes function in the context of a cell or organism Over the past years, CRISPR Cas9 mediated genome in health and disease (1). In the past, RNA interference has editing has developed into a powerful tool for been widely used to perform such screens (2–5). Discoveries modifying genomes in various organisms. In high- from many of these experiments have been made accessible throughput screens, CRISPR/Cas9 mediated gene through centralized data resources such as GenomeRNAi perturbations can be used for the systematic func- (6,7) and have fueled further developments in the area of tional analysis of whole genomes. Discoveries from systems genetics. such screens provide a wealth of knowledge about Recently, CRISPR/Cas9 (clustered regularly interspaced gene to phenotype relationships in various bio- short palindromic repeats with the CRISPR associated pro- logical model systems. However, a database re- tein Cas9) has gained substantial use in genetic screens in source to query results efficiently has been lack- various organisms including human, mouse and zebrafish (8–10). In order to support the rapid development of this ing. To this end, we developed GenomeCRISPR (http: new field, accessibility of CRISPR/Cas9-derived functional //genomecrispr.org), a database for genome-scale data is crucial (11). Several resources have been estab- / CRISPR Cas9 screens. Currently, GenomeCRISPR lished to address this need. These include CRISPRz (12), contains data on more than 550 000 single guide a database of sgRNAs validated in zebrafish, WGE13 ( ), RNAs (sgRNA) derived from 84 different experiments a data resource that contains information about sgRNAs performed in 48 different human cell lines, compris- that can be used to target genes of interest and CrisprGE ing all screens in human cells using CRISPR/Cas (14), a platform that provides knowledge about sequence published to date. GenomeCRISPR provides data mutations caused by sgRNAs previously used in various mining options and tools, such as gene or genomic experimental settings. Nevertheless, a database that allows region search. Phenotypic and genome track views to compare screening results, such as perturbation pheno- allow users to investigate and compare the results of types or sgRNA efficiency, of many different experiments on a genome-wide scale has so far not been available. To different screens, or the impact of different sgRNAs fill this gap, we here report GenomeCRISPR (Supplemen- on the gene of interest. An Application Programming tary Figure S1), a database for high-throughput screening Interface (API) allows for automated data access and experiments using CRISPR/Cas9. At the time of submis- batch download. As more screening data will become sion, GenomeCRISPR comprises data from a total of 84 available, we also aim at extending the database to in- different high-throughput screening experiments performed clude functional genomic data from other organisms across 48 human cell lines. In such pooled screening format, and enable cross-species comparisons. similar to shRNA screens, high-complexity sgRNA collec- tions are transduced into culture cells using lentiviruses ap- / INTRODUCTION proaches (4,15). Following CRISPR Cas9 induced mutage- nesis, a particular condition (e.g. drug selection) is applied High-throughput screening experiments have been an indis- to the pooled mutant cell library, and enrichment or deple- pensable tool in functional genome research for many years. tion of sgRNAs are measured as phenotypes using next gen- Functional screens can systematically interrogate genotype eration sequencing. GenomeCRISPR is rapidly updateable to phenotype relationships and identify key dependencies of
*To whom correspondence should be addressed. Tel: +49 6221 42 1951; Email: [email protected] †These authors contributed equally to this work as the first authors.