Published online 26 May 2017 Nucleic Acids Research, 2017, Vol. 45, No. 15 8773–8784 doi: 10.1093/nar/gkx482 Regulatory dynamics of 11p13 suggest a role for EHF in modifying CF lung disease severity Lindsay R. Stolzenburg1,2, Rui Yang1,2, Jenny L. Kerschner1,2,3, Sara Fossum1,2, Matthew Xu1,2, Andrew Hoffmann1,2, Kay-Marie Lamar1,2,3, Sujana Ghosh1,2, Sarah Wachtel1,2, Shih-Hsing Leir1,2,3 and Ann Harris1,2,3,*
1Human Molecular Genetics Program, Lurie Children’s Research Center, Chicago, IL 60614, USA, 2Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA and 3Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH 44016, USA
Received March 25, 2017; Revised May 01, 2017; Editorial Decision May 16, 2017; Accepted May 17, 2017
ABSTRACT states is immense (reviewed in (1,2)).However,thechal- lenges of moving from single nucleotide polymorphisms Mutations in the cystic fibrosis transmembrane con- (SNPs) exhibiting high associations with a trait to causative ductance regulator (CFTR) gene cause cystic fibro- variants are even greater (reviewed in (3)). In the simplest sis (CF), but are not good predictors of lung phe- case, the variant lies in the coding region of a gene and com- notype. Genome-wide association studies (GWAS) promises protein function by altering amino acids. More of- previously identified additional genomic sites asso- ten, the SNPs map to non-coding regions of the genome ciated with CF lung disease severity. One of these, and are hence assumed to impact regulatory elements for at chromosome 11p13, is an intergenic region be- nearby genes (4). In rare cases, the most significant SNP tween Ets homologous factor (EHF)andApaf-1in- alters the binding site of a critical transcription factor in teracting protein (APIP). Our goal was to determine a known cis-regulatory element (5). More commonly, the the functional significance of this region, which be- highest P-value SNPs merely mark an intergenic region and the associated haplotype then provides a starting point for ing intergenic is probably regulatory. To identify cis- exhaustive molecular studies aimed at deciphering the regu- acting elements, we used DNase-seq and H3K4me1 latory mechanisms and key gene targets. An example of this and H3K27Ac ChIP-seq to map open and active chro- is the cystic fibrosis modifier locus on chromosome 11p13. matin respectively, in lung epithelial cells. Two ele- Cystic fibrosis (CF), the most common autosomal reces- ments showed strong enhancer activity for the pro- sive disease among Caucasians is caused by mutations in the moters of EHF and the 5 adjacent gene E47 like cystic fibrosis transmembrane conductance regulator gene ETS transcription factor 5 (ELF5) in reporter gene (CFTR)(6–8). Lung and digestive system pathology are the assays. No enhancers of the APIP promoter were major features of the disease, which is associated with more found. Circular chromosome conformation capture than 2000 different mutations within CFTR. One mutation, (4C-seq) identified direct physical interactions of el- F508del (the deletion of a phenylalanine residue at amino ements within 11p13. This confirmed the enhancer- acid 508) occurs on about 70% of CF chromosomes (9). However, even among CF patients homozygous for this mu- promoter associations, identified additional inter- tation, there is no correlation with lung disease severity, the acting elements and defined topologically associat- major predictor of lifespan (10). Despite this lack of con- ing domain (TAD) boundaries, enriched for CCCTC- cordance with CFTR mutation, CF lung function exhibits binding factor (CTCF). No strong interactions were a strong genetic component, implicating the existence of observed with the APIP promoter, which lies outside other loci that modify the phenotype (11,12). the main TAD encompassing the GWAS signal. These Extensive GWAS for lung disease severity in CF patients results focus attention on the role of EHF in modify- identified multiple regions of the genome that strongly asso- ing CF lung disease severity. ciated with this trait (13,14). These sites include both known loci (the MUC4/MUC20 mucin genes at chr3q29 and the SLC9A3 solute carrier gene at chr5p15.3, among others) INTRODUCTION and an intergenic region at chr11p13, located between the The power of genome-wide association studies (GWAS) to Ets-homologous factor (EHF) and Apaf-1 interacting pro- link regions of the genome with physiological or disease tein (APIP) genes. Here we investigate the chr11p13 region
*To whom correspondence should be addressed. Email: [email protected]