Efficient Mapping of Transgene Integration Sites and Local Structural
Total Page:16
File Type:pdf, Size:1020Kb
Published online 4 January 2017 Nucleic Acids Research, 2017, Vol. 45, No. 8 e62 doi: 10.1093/nar/gkw1329 Efficient mapping of transgene integration sites and local structural changes in Cre transgenic mice using targeted locus amplification Carol Cain-Hom1, Erik Splinter2, Max van Min2, Marieke Simonis2, Monique van de Heijning2, Maria Martinez1, Vida Asghari1, J. Colin Cox1,* and Søren Warming3,* 1Department of Transgenic Technology, Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080, USA, 2Cergentis BV, Yalelaan 62, 3584 CM Utrecht, the Netherlands and 3Department of Molecular Biology, Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080, USA Received October 7, 2016; Revised December 13, 2016; Editorial Decision December 19, 2016; Accepted December 20, 2016 ABSTRACT function (1–3) and hundreds of Cre ‘deleter’ lines are avail- / able to the mouse community. Cell-type specific expression Cre LoxP technology is widely used in the field of of Cre allows for specific deletion of a gene of interest by / mouse genetics for spatial and or temporal regula- the use of a ‘conditional knock-out’ (CKO) allele of that tion of gene function. For Cre lines generated via gene (2). Typically, for a conditional allele a critical exon(s) pronuclear microinjection of a Cre transgene con- is flanked by two loxP sites (‘floxed’) and in the cells where struct, the integration site is random and in most Cre is expressed, the floxed exon(s) is removed, resulting in cases not known. Integration of a transgene can dis- a deletion, or knock-out, allele. Cre deleter lines are gener- rupt an endogenous gene, potentially interfering with ated either by targeted knock-in of the Cre cDNA into an interpretation of the phenotype. In addition, knowl- endogenous locus or by pronuclear microinjection of a Cre edge of where the transgene is integrated is impor- transgene driven by a cell-type specific promoter. For the tant for planning of crosses between animals carry- latter, the integration site is random and in most cases not known. Knowledge of where the transgene is integrated is ing a conditional allele and a given Cre allele in case important for planning of crosses between animals carrying the alleles are on the same chromosome. We have a conditional allele and a given Cre allele in case the alle- used targeted locus amplification (TLA) to efficiently les are on the same chromosome. This becomes increasingly map the transgene location in seven previously pub- important in complex crosses with multiple conditional al- lished Cre and CreERT2 transgenic lines. In all lines, leles, as some combinations of alleles might not be possible. transgene insertion was associated with structural Importantly, integration of a transgene can disrupt an en- changes of variable complexity, illustrating the im- dogenous gene, potentially interfering with interpretation portance of testing for rearrangements around the of the transgenic phenotype (4–6) or preventing the gen- integration site. In all seven lines the exact integra- eration of homozygous transgenic animals due to embry- tion site and breakpoint sequences were identified. onic lethality when the transgene is bred to homozygosity. Our methods, data and genotyping assays can be Transgenes often integrate as a multicopy concatemer (7) and in the absence of integration site data, hemi- and ho- used as a resource for the mouse community and mozygous animals have to be distinguished by copy number our results illustrate the power of the TLA method variation (CNV) analysis which involves quantitative poly- to not only efficiently map the integration site of any merase chain reaction (PCR) and reference DNA with a transgene, but also provide additional information re- known copy number. The resolution and reliability of CNV garding the transgene integration events. analysis for accurate genotyping decreases as copy num- ber increases. Therefore, another important and practical use of knowing the exact transgene insertion site is that ef- INTRODUCTION ficient, copy-number independent and locus-specific geno- Cre/LoxP technology is widely used in the field of mouse typing assays can be developed. For large transgenic ani- genetics for spatial and/or temporal regulation of gene mal facilities, where a large number of genotyping assays are *To whom correspondence should be addressed. Tel: +1 650 467 8027; Email: [email protected] Correspondence may also be addressed to J. Colin Cox. Tel: +1 530 753 5883; Email: [email protected] C The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] e62 Nucleic Acids Research, 2017, Vol. 45, No. 8 PAGE 2 OF 9 performed, automated genotyping platforms and robust as- selective amplification of DNA and results in the amplifica- says are necessary (8). CNV analysis is more labor-intensive tion of >100 kb of sequence information at either end of a than real-time PCR assays due to the need for an internal primer pair complementary to a short (trans-) gene specific copy number standard and the additional requirement for sequence. As such, TLA-based targeted sequencing enables replicates to ensure accurate copy-number calling (9). Al- targeted complete sequencing of loci of interest and detec- ternatively, if the insertion site is known, an assay can be tion of all single nucleotide variations (SNVs) and struc- designed for the wild-type DNA and for the insertion, al- tural variants. Here we describe the use of TLA to efficiently lowing for analysis without having to calculate copy number map the transgene location in seven previously published in order to determine zygosity. Thus the characterization of Cre and CreERT2 transgenic lines. These seven mouse lines transgene integration sites is important both to know if any are all widely used by the mouse community. For exam- rearrangements have occurred in the endogenous sequence ple, see http://www.informatics.jax.org/home/recombinase of the integration site and to enable automated genotyping for an up-to-date list of references for each transgenic line. of as many of the Cre transgenic lines as possible. With very limited sequence data available for the transgenes The most commonly used method for identification of used to generate these lines, Cre transgenic lines provided transgene insertion sites is based on inverse PCR (iPCR) an ideal test-case for the power of the TLA approach to (10,11). This method relies on knowledge of appropriate transgene mapping. Because the mouse lines all have the restriction sites located in the transgene and on sequence Cre sequence in common, we designed primers on the Cre information to design appropriate primers for the iPCR. sequence and these primers could then be used in the anal- Furthermore, iPCR works best if the number of transgene ysis for all Cre lines. After mapping the exact integration copies is low as selective amplification of the transgene con- sites and identifying structural changes, we used the infor- catemer reduces the chance of identifying a PCR prod- mation to design quantitative PCR genotyping assays that uct containing flanking genomic sequence12 ( ). An alter- distinguish wild-type, hemizygous and homozygous ani- native method for identification of insertion sites, ‘Splink- mals for four of the lines. Our data illustrates the power erette’, was first developed for the cloning of retroviral in- of the TLA method to sequence transgenes and efficiently tegration sites (13). Just as for iPCR, Splinkerette is best map their integration sites with very limited requirement for suited for single or low copy integrations. While iPCR and prior knowledge of the transgene sequence. Furthermore, Splinkerette can be used to identify the genomic/transgene our data illustrates the power of TLA to identify structural junction, both methods rely on prior knowledge of the changes occurring at the site of transgene integration. end(s) of integrated transgene sequences and on whether the breakpoint sequence is suited for PCR amplification. In MATERIALS AND METHODS addition, neither method provides any information about Mouse strains structural changes at the integration site. Although both iPCR and Splinkerette methods now take advantage of The Cre and CreERT2 transgenic strains used in next-generation sequencing (NGS) (for example, see (14)), this study were licensed and obtained from different NGS technology alternatives to iPCR and Splinkerette for sources and then maintained at Genentech. ACTB-Cre: transgene mapping now include whole genome sequencing FVB/N-Tg(ACTB-cre)2Mrt/J(19). Obtained from (15,16) and sequence capture followed by NGS (17). Whole the Jackson laboratory (stock no. 003376) and back- genome sequencing analyses are not suited for the detailed crossed until congenic on C57BL/6N at Genentech. analysis of the transgene sequence since most of the gener- BEST1-Cre: C57BL/6-Tg(BEST1-cre)1Jdun/J(20). Ob- ated sequencing data is uninformative. More importantly, in tained from the Jackson laboratory (stock no. 017557). hemizygous mice, the generated sequencing data is not only Generated on C57BL/6NCrl, then crossed to and derived from the locus of integration but also from the wild maintained on C57BL/6J. Cdh5-CreERT2: Tg(Cdh5- type locus, obscuring interpretation of structural changes cre/ERT2)CIVE23Mlia (21). Obtained from University specific to the integration site. Moreover, many transgenes of California, LA (UCLA). Generated on C57BL/6J and contain sequences that are homologous to endogenous se- kept on a C57BL/6J background at Genentech. ‘CIVE’: quences and the presence of the endogenous homologs in C57BL/6J, Inducible, VE-cadherin Cre-recombinase whole genome sequencing data precludes sensitive and spe- mouse.