A Deep Learning Framework Identifies Pathogenic Noncoding Somatic Mutations from Personal Prostate Cancer Genomes

Total Page:16

File Type:pdf, Size:1020Kb

A Deep Learning Framework Identifies Pathogenic Noncoding Somatic Mutations from Personal Prostate Cancer Genomes Author Manuscript Published OnlineFirst on September 9, 2020; DOI: 10.1158/0008-5472.CAN-20-1791 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. A deep learning framework identifies pathogenic noncoding somatic mutations from personal prostate cancer genomes Cheng Wang and Jingjing Li* the Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, the Parker Institute for Cancer Immunotherapy, the Bakar Computational Health Sciences Institute, the Department of Neurology, School of Medicine, University of California, San Francisco, 35 Medical Center Way, San Francisco, CA 94143 *correspondence should be addressed to: [email protected] Mailing address: 35 Medical Center Way, San Francisco, CA 94143 Tel: 415-502-2572 Declaration of Interests. C.W. and J.L. are listed as inventors on a pending patent application filed by UCSF related to part of the methods presented here for prostate cancer analysis. J.L. is a co-founder and a member of the scientific advisory board of SensOmics, Inc. Running Title: deep learning for prostate cancer Abstract Our understanding of noncoding mutations in cancer genomes has been derived primarily from mutational recurrence analysis by aggregating clinical samples on a large scale. These cohort- based approaches cannot directly identify individual pathogenic noncoding mutations from personal cancer genomes. Therefore, although most somatic mutations are localized in the noncoding cancer genome, their effects on driving tumorigenesis and progression have not been systematically explored and noncoding somatic alleles have not been leveraged in current clinical practice to guide personalized screening, diagnosis, and treatment. Here we present a deep learning framework to capture pathogenic noncoding mutations in personal cancer genomes, which perturb gene regulation by altering chromatin architecture. We deployed the system specifically for localized prostate cancer by integrating large-scale prostate cancer genomes and the prostate-specific epigenome. We exhaustively evaluated somatic mutations in each patient's genome and agnostically identified thousands of somatic alleles altering the prostate epigenome. Functional genomic analyses subsequently demonstrated that affected genes displayed differential expression in prostate tumor samples, were vulnerable to expression alterations, and were convergent onto androgen-receptor- mediated signaling pathways. Accumulation of pathogenic regulatory mutations in these affected genes was predictive of clinical observations, suggesting potential clinical utility of this approach. Overall, the deep learning framework has significantly expanded our view of somatic mutations in the vast noncoding genome, uncovered novel genes in localized prostate cancer, and will foster the development of personalized screening and therapeutic strategies for prostate cancer. Statement of Significance. This study's characterization of the noncoding genome in prostate cancer reveals mutational signatures predictive of clinical observations, which may serve as a powerful prognostic tool in this disease. 1 Downloaded from cancerres.aacrjournals.org on October 6, 2021. © 2020 American Association for Cancer Research. Author Manuscript Published OnlineFirst on September 9, 2020; DOI: 10.1158/0008-5472.CAN-20-1791 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. Introduction To date, more than 81 million simple somatic mutations (base substitutions and short indels) from cancer genomes have been catalogued by International Cancer Genome Consortium (ICGC)(1), among which >90% fall in non-coding regions. However, our current cancer genome analysis has been largely confined to somatic mutations in coding sequences, which only account for ~2% of the human genome, leaving the vast majority of the human genome unexplored. Multiple lines of evidence have suggested a significant involvement of non-coding somatic mutations in cancer genomes: (i) it has been established that more than 90% of functional loci in complex diseases reside in the non-coding genome(2,3), and therefore significant contribution to cancer etiologies by perturbing non-coding regulatory elements would also be expected(4); (ii) specifically for cancer, individual cases studies have identified several non-coding mutations driving tumorigenesis and progression, including the well-known somatic mutations in the TERT promoter(5) as well as the clustered somatic mutations affecting the cis- regulatory elements of FOXA1, which promotes prostate cancer cell proliferation(6); (iii) the landmark cancer epigenome study has revealed distinct chromatin architecture in tumors relative to normal tissues(7), prompting further investigation on somatic mutations that alter tissue-specific chromatin structure leading to tumor formation and progression. Despite these considerations, the recent pan-cancer genome analysis unexpectedly reported a paucity of somatic driver mutations in the non-coding genome by identifying only a handful of non-coding somatic mutations displaying significant recurrence across tumor samples(8). Mutational recurrence has been used as the primary approach to infer mutational pathogenicity, which indirectly infers mutational pathogenicity based on statistical enrichment, but cannot directly assess the molecular effects of individual non-coding somatic mutations. As such, mutational recurrence analysis effectively identifies mutations forming “hotspots” in a patient cohort, but cannot capture individual pathogenic mutations from personal genomes. Given the sporadic nature of somatic mutations, it is reasonable to expect that many pathogenic mutations do not form clusters, but individually exert their effects on tumor initiation, formation or progression. Identifying these individual mutations will not only significantly expand our view of the non-coding cancer genome, but will also foster the development of clinical tools to screen pathogenic regulatory mutations. In addition to mutational recurrence analysis (or identifying mutation hotspots), several approaches have been proposed to individually annotate non-coding somatic mutations in 2 Downloaded from cancerres.aacrjournals.org on October 6, 2021. © 2020 American Association for Cancer Research. Author Manuscript Published OnlineFirst on September 9, 2020; DOI: 10.1158/0008-5472.CAN-20-1791 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. cancer genomes (as reviewed in these excellent articles(9,10)). Some of the studies examined mutational localization in known regulatory regions, followed by motif analysis. However, many transcription factors bind to degenerate DNA sequences(11), and a single base change is more likely to be neutral than consequential. Therefore, merely examining mutational localization in regulatory elements is not sufficient to determine specific allelic effects on perturbing gene regulation. Evolutionary conservation was also integrated in the analysis; however, even if one somatic mutation affects a conserved site, the observed conservation could merely result from background selection. Some studies identified somatic mutations displaying allele-specific expression (ASE, or somatic eQTLs) in tumor samples(12,13). However, this association framework to identify allelic imbalance at the RNA level cannot confirm causative roles of somatic alleles. Importantly, this practice also requires a large cohort of patient tumor tissues for genome and transcriptome sequencing, which is often impractical for regular clinical analyses. Taken together, like the mutational recurrence analysis, our current understanding of the non- coding genome in cancer has been derived largely from indirect inference from large-scale patient cohorts, which cannot directly determine the pathogenic effects of individual non-coding mutations from personal genomes. As such, non-coding mutations are often excluded from cancer genome analysis and have not been leveraged to guide personalized screening, diagnosis and treatment. Because sequence composition of non-coding regulatory elements is not random, tissue- specific regulatory capacity of a given sequence could be predicted(14-16). Therefore, an allelic change that significantly alters regulatory capacity of a sequence element would be considered consequential leading to gene dysregulation. Like deleterious missense mutations altering protein structure, the deleteriousness of non-coding mutations is thus defined by their mutational impacts on altering epigenomic architecture in specific cell and tissue types. Given the sporadic nature of cancer somatic mutations that are sparsely localized across the genome, it is reasonable to assume that these mutations individually exert their effects; in this way, one can evaluate the effect of each somatic mutation one at a time across the genome and identify those disrupting epigenomic architecture in disease-related tissue and cell types. We herein leveraged a deep-learning based framework to achieve this goal, and applied this strategy to analyzing somatic mutations in localized prostate cancer genomes, which serve as an excellent model based on the following considerations: (1) compared with metastatic cancers, localized prostate cancers are more likely to be affected by simple somatic mutations (base substitutions and short insertion and deletions, indels) than somatic copy number aberrations(17). However,
Recommended publications
  • Environmental Influences on Endothelial Gene Expression
    ENDOTHELIAL CELL GENE EXPRESSION John Matthew Jeff Herbert Supervisors: Prof. Roy Bicknell and Dr. Victoria Heath PhD thesis University of Birmingham August 2012 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. ABSTRACT Tumour angiogenesis is a vital process in the pathology of tumour development and metastasis. Targeting markers of tumour endothelium provide a means of targeted destruction of a tumours oxygen and nutrient supply via destruction of tumour vasculature, which in turn ultimately leads to beneficial consequences to patients. Although current anti -angiogenic and vascular targeting strategies help patients, more potently in combination with chemo therapy, there is still a need for more tumour endothelial marker discoveries as current treatments have cardiovascular and other side effects. For the first time, the analyses of in-vivo biotinylation of an embryonic system is performed to obtain putative vascular targets. Also for the first time, deep sequencing is applied to freshly isolated tumour and normal endothelial cells from lung, colon and bladder tissues for the identification of pan-vascular-targets. Integration of the proteomic, deep sequencing, public cDNA libraries and microarrays, delivers 5,892 putative vascular targets to the science community.
    [Show full text]
  • 1 Supporting Information for a Microrna Network Regulates
    Supporting Information for A microRNA Network Regulates Expression and Biosynthesis of CFTR and CFTR-ΔF508 Shyam Ramachandrana,b, Philip H. Karpc, Peng Jiangc, Lynda S. Ostedgaardc, Amy E. Walza, John T. Fishere, Shaf Keshavjeeh, Kim A. Lennoxi, Ashley M. Jacobii, Scott D. Rosei, Mark A. Behlkei, Michael J. Welshb,c,d,g, Yi Xingb,c,f, Paul B. McCray Jr.a,b,c Author Affiliations: Department of Pediatricsa, Interdisciplinary Program in Geneticsb, Departments of Internal Medicinec, Molecular Physiology and Biophysicsd, Anatomy and Cell Biologye, Biomedical Engineeringf, Howard Hughes Medical Instituteg, Carver College of Medicine, University of Iowa, Iowa City, IA-52242 Division of Thoracic Surgeryh, Toronto General Hospital, University Health Network, University of Toronto, Toronto, Canada-M5G 2C4 Integrated DNA Technologiesi, Coralville, IA-52241 To whom correspondence should be addressed: Email: [email protected] (M.J.W.); yi- [email protected] (Y.X.); Email: [email protected] (P.B.M.) This PDF file includes: Materials and Methods References Fig. S1. miR-138 regulates SIN3A in a dose-dependent and site-specific manner. Fig. S2. miR-138 regulates endogenous SIN3A protein expression. Fig. S3. miR-138 regulates endogenous CFTR protein expression in Calu-3 cells. Fig. S4. miR-138 regulates endogenous CFTR protein expression in primary human airway epithelia. Fig. S5. miR-138 regulates CFTR expression in HeLa cells. Fig. S6. miR-138 regulates CFTR expression in HEK293T cells. Fig. S7. HeLa cells exhibit CFTR channel activity. Fig. S8. miR-138 improves CFTR processing. Fig. S9. miR-138 improves CFTR-ΔF508 processing. Fig. S10. SIN3A inhibition yields partial rescue of Cl- transport in CF epithelia.
    [Show full text]
  • Supplemental Figure and Table Legends
    Supplemental figure and table legends Supplementary Figure 1: KIAA1841 is well conserved among vertebrates. NCBI HomoloGene pairwise alignment scores of human KIAA1841 sequence compared to other vertebrate orthologs. Supplementary Figure 2: µ-germline transcripts (GLT) and AID mRNA expression are not affected by overexpression of KIAA1841. Splenic B cells were isolated from wild-type mice, and transduced with retroviral vector control (pMIG) or a vector expressing KIAA1841. Levels of µ-GLT and AID mRNA were determined at 72h post-infection by RT-qPCR, and normalized to -actin mRNA and the pMIG control. The mean of three independent experiments +/- SD is shown. NS, p = not significant, p 0.05, two-tailed paired student’s t-test. Supplementary Figure 3: Overexpression of untagged and Xpress-tagged KIAA1841 does not affect cell proliferation. Splenic B cells were isolated from wild-type mice, stimulated with LPS+IL4, and transduced with retroviral vector control (pMIG) or a vector expressing KIAA1841 or Xpress (Xp)-tagged KIAA1841. Cells are labeled with seminaphthorhodafluor (SNARF) cell tracking dye and SNARF intensity was measured at 0h, 24h, and 48h after retroviral infection. Histograms of transduced cells (GFP+) for pMIG control, KIAA1841 and Xp-KIAA1841 were superimposed at each time point. Three independent retroviral infection experiments are shown. Supplementary Figure 4: Sequence alignment of the putative SANT domain of KIAA1841 with the SANT domain of SWI3. Alignment was performed using ClustalOmega; *, conserved residue, :, strongly similar residues, ., weakly similar residues. Numbers indicate amino acid residues in each sequence. Helix 3, which has been reported to be important for the chromatin remodeling function of SWI3 (Boyer et.
    [Show full text]
  • Chapter 2 Gene Regulation and Speciation in House Mice
    UC Berkeley UC Berkeley Electronic Theses and Dissertations Title Gene regulation and the genomic basis of speciation and adaptation in house mice (Mus musculus) Permalink https://escholarship.org/uc/item/8ck133qd Author Mack, Katya L Publication Date 2018 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California Gene regulation and the genomic basis of speciation and adaptation in house mice (Mus musculus) By Katya L. Mack A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Integrative Biology in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Michael W. Nachman, Chair Professor Rasmus Nielsen Professor Craig T. Miller Fall 2018 Abstract Gene regulation and the genomic basis of speciation and adaptation in house mice (Mus musculus) by Katya Mack Doctor of Philosophy in Integrative Biology University of California, Berkeley Professor Michael W. Nachman, Chair Gene expression is a molecular phenotype that is essential to organismal form and fitness. However, how gene regulation evolves over evolutionary time and contributes to phenotypic differences within and between species is still not well understood. In my dissertation, I examined the role of gene regulation in adaptation and speciation in house mice (Mus musculus). In chapter 1, I reviewed theoretical models and empirical data on the role of gene regulation in the origin of new species. I discuss how regulatory divergence between species can result in hybrid dysfunction and point to areas that could benefit from future research. In chapter 2, I characterized regulatory divergence between M.
    [Show full text]
  • Gene Regulation and the Genomic Basis of Speciation and Adaptation in House Mice (Mus Musculus)
    Gene regulation and the genomic basis of speciation and adaptation in house mice (Mus musculus) By Katya L. Mack A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Integrative Biology in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Michael W. Nachman, Chair Professor Rasmus Nielsen Professor Craig T. Miller Fall 2018 Abstract Gene regulation and the genomic basis of speciation and adaptation in house mice (Mus musculus) by Katya Mack Doctor of Philosophy in Integrative Biology University of California, Berkeley Professor Michael W. Nachman, Chair Gene expression is a molecular phenotype that is essential to organismal form and fitness. However, how gene regulation evolves over evolutionary time and contributes to phenotypic differences within and between species is still not well understood. In my dissertation, I examined the role of gene regulation in adaptation and speciation in house mice (Mus musculus). In chapter 1, I reviewed theoretical models and empirical data on the role of gene regulation in the origin of new species. I discuss how regulatory divergence between species can result in hybrid dysfunction and point to areas that could benefit from future research. In chapter 2, I characterized regulatory divergence between M. m. domesticus and M. m. musculus associated with male hybrid sterility. The major model for the evolution of post-zygotic isolation proposes that hybrid sterility or inviability will evolve as a product of deleterious interactions (i.e., negative epistasis) between alleles at different loci when joined together in hybrids. As the regulation of gene expression is inherently based on interactions between loci, disruption of gene regulation in hybrids may be a common mechanism for post-zygotic isolation.
    [Show full text]
  • Variation in Protein Coding Genes Identifies Information Flow
    bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 1 1 2 3 4 5 Variation in protein coding genes identifies information flow as a contributor to 6 animal complexity 7 8 Jack Dean, Daniela Lopes Cardoso and Colin Sharpe* 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Institute of Biological and Biomedical Sciences 25 School of Biological Science 26 University of Portsmouth, 27 Portsmouth, UK 28 PO16 7YH 29 30 * Author for correspondence 31 [email protected] 32 33 Orcid numbers: 34 DLC: 0000-0003-2683-1745 35 CS: 0000-0002-5022-0840 36 37 38 39 40 41 42 43 44 45 46 47 48 49 Abstract bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 2 1 Across the metazoans there is a trend towards greater organismal complexity. How 2 complexity is generated, however, is uncertain. Since C.elegans and humans have 3 approximately the same number of genes, the explanation will depend on how genes are 4 used, rather than their absolute number.
    [Show full text]
  • Content Based Search in Gene Expression Databases and a Meta-Analysis of Host Responses to Infection
    Content Based Search in Gene Expression Databases and a Meta-analysis of Host Responses to Infection A Thesis Submitted to the Faculty of Drexel University by Francis X. Bell in partial fulfillment of the requirements for the degree of Doctor of Philosophy November 2015 c Copyright 2015 Francis X. Bell. All Rights Reserved. ii Acknowledgments I would like to acknowledge and thank my advisor, Dr. Ahmet Sacan. Without his advice, support, and patience I would not have been able to accomplish all that I have. I would also like to thank my committee members and the Biomed Faculty that have guided me. I would like to give a special thanks for the members of the bioinformatics lab, in particular the members of the Sacan lab: Rehman Qureshi, Daisy Heng Yang, April Chunyu Zhao, and Yiqian Zhou. Thank you for creating a pleasant and friendly environment in the lab. I give the members of my family my sincerest gratitude for all that they have done for me. I cannot begin to repay my parents for their sacrifices. I am eternally grateful for everything they have done. The support of my sisters and their encouragement gave me the strength to persevere to the end. iii Table of Contents LIST OF TABLES.......................................................................... vii LIST OF FIGURES ........................................................................ xiv ABSTRACT ................................................................................ xvii 1. A BRIEF INTRODUCTION TO GENE EXPRESSION............................. 1 1.1 Central Dogma of Molecular Biology........................................... 1 1.1.1 Basic Transfers .......................................................... 1 1.1.2 Uncommon Transfers ................................................... 3 1.2 Gene Expression ................................................................. 4 1.2.1 Estimating Gene Expression ............................................ 4 1.2.2 DNA Microarrays ......................................................
    [Show full text]
  • Proteomics of Nucleocytoplasmic Partitioning
    Available online at www.sciencedirect.com ScienceDirect Proteomics of nucleocytoplasmic partitioning Thao Nguyen, Nishant Pappireddi and Martin Wu¨ hr The partitioning of the proteome between nucleus and activation. Kinase activity can also be regulated via sub- cytoplasm affects nearly every aspect of eukaryotic biology. cellular localization: cyclin B needs to relocalize into the Despite this central role, we still have a poor understanding of nucleus to induce NE breakdown, which is required for which proteins localize in the nucleus and how this varies in the transition from G2 to mitosis [3]. Considering the different cell types and conditions. Recent advances in importance of subcellular localization in encoding impor- quantitative proteomics and high-throughput imaging are tant cellular information, it is not surprising that mis- starting to close this knowledge gap. Studies on protein regulation of nuclear transport has been associated with interaction are beginning to reveal the spectrum of cargos of multiple diseases, including developmental defects and nuclear import and export receptors.We anticipate that it will cancer [4–6]. Targeting mis-regulation of NC partitioning soon be possible to predict each protein’s nucleocytoplasmic has emerged as a promising therapeutic approach, partic- localization based on its importin/exportin interactions and its ularly for cancer treatment [7–11]. estimated diffusion rate through the nuclear pore. This insight is likely to provide us with a fundamental understanding of Many previous studies have reported this subcellular how cells use nucleocytoplasmic partitioning to encode and localization of individual proteins. Recent technological relay information. advances in methods such as mass spectrometry (MS) now allow us to look at the entire proteome at once.
    [Show full text]
  • IPO11 Mediates Βcatenin Nuclear Import in a Subset of Colorectal Cancers
    REPORT IPO11 mediates βcatenin nuclear import in a subset of colorectal cancers Monika Mis1, Siobhan O’Brien2, Zachary Steinhart1, Sichun Lin1,TraverHart3,4,JasonMoffat4,5,6, and Stephane Angers1,2 Activation of Wnt signaling entails βcatenin protein stabilization and translocation to the nucleus to regulate context-specific transcriptional programs. The majority of colorectal cancers (CRCs) initiate following APC mutations, resulting in Wnt ligand—independent stabilization and nuclear accumulation of βcatenin. The mechanisms underlying βcatenin nucleocytoplasmic shuttling remain incompletely defined. Using a novel, positive selection, functional genomic strategy, DEADPOOL, we performed a genome-wide CRISPR screen and identified IPO11 as a required factor for βcatenin-mediated transcription in APC mutant CRC cells. IPO11 (Importin-11) is a nuclear import protein that shuttles cargo from the cytoplasm to the nucleus. IPO11−/− cells exhibit reduced nuclear βcatenin protein levels and decreased βcatenin target gene activation, suggesting IPO11 facilitates βcatenin nuclear import. IPO11 knockout decreased colony formation of CRC cell lines and decreased proliferation of patient-derived CRC organoids. Our findings uncover a novel nuclear import mechanism for βcatenin in cells with high Wnt activity. Introduction During embryonic development and adult tissue homeostasis, no clinically approved therapeutics to target the Wnt-βcatenin the Wnt-βcatenin signaling pathway governs context-dependent pathway. transcriptional programs regulating stem cell renewal, cell pro- The nuclear localization of βcatenin is controlled by its rate of liferation, and differentiation (Wodarz and Nusse, 1998; Steinhart nuclear entry versus exit, along with retention factors anchoring and Angers, 2018). In the absence of Wnt ligand, the destruction βcatenin in the cytoplasm or nucleus.
    [Show full text]
  • Predict AID Targeting in Non-Ig Genes Multiple Transcription Factor
    Downloaded from http://www.jimmunol.org/ by guest on September 26, 2021 is online at: average * The Journal of Immunology published online 20 March 2013 from submission to initial decision 4 weeks from acceptance to publication Multiple Transcription Factor Binding Sites Predict AID Targeting in Non-Ig Genes Jamie L. Duke, Man Liu, Gur Yaari, Ashraf M. Khalil, Mary M. Tomayko, Mark J. Shlomchik, David G. Schatz and Steven H. Kleinstein J Immunol http://www.jimmunol.org/content/early/2013/03/20/jimmun ol.1202547 Submit online. Every submission reviewed by practicing scientists ? is published twice each month by http://jimmunol.org/subscription Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts http://www.jimmunol.org/content/suppl/2013/03/20/jimmunol.120254 7.DC1 Information about subscribing to The JI No Triage! Fast Publication! Rapid Reviews! 30 days* Why • • • Material Permissions Email Alerts Subscription Supplementary The Journal of Immunology The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2013 by The American Association of Immunologists, Inc. All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. This information is current as of September 26, 2021. Published March 20, 2013, doi:10.4049/jimmunol.1202547 The Journal of Immunology Multiple Transcription Factor Binding Sites Predict AID Targeting in Non-Ig Genes Jamie L. Duke,* Man Liu,†,1 Gur Yaari,‡ Ashraf M. Khalil,x Mary M. Tomayko,{ Mark J. Shlomchik,†,x David G.
    [Show full text]
  • A New Genomewide Association Meta-Analysis of Alcohol Dependence
    ALCOHOLISM:CLINICAL AND EXPERIMENTAL RESEARCH Vol. 39, No. 8 August 2015 A New Genomewide Association Meta-Analysis of Alcohol Dependence Lingjun Zuo, Yunlong Tan, Xiangyang Zhang, Xiaoping Wang, John Krystal, Boris Tabakoff, Chunlong Zhong, and Xingguang Luo Background: Conventional meta-analysis based on genetic markers may be less powerful for hetero- geneous samples. In this study, we introduced a new meta-analysis for 4 genomewide association studies on alcohol dependence that integrated the information of putative causal variants. Methods: A total of 12,481 subjects in 4 independent cohorts were analyzed, including 1 European American cohort (1,409 cases with alcohol dependence and 1,518 controls), 1 European Australian cohort (a total of 6,438 family subjects with 1,645 probands), 1 African American cohort from SAGE + COGA (681 cases and 508 controls), and 1 African American cohort from Yale (1,429 cases and 498 controls). The genomewide association analysis was conducted for each cohort, and then, a new meta-analysis was performed to derive the combined p-values. cis-Acting expression of quantitative locus (cis-eQTL) analysis of each risk variant in human tissues and RNA expression analysis of each risk gene in rat brain served as functional validation. Results: In meta-analysis of European American and European Australian cohorts, we found 10 top-ranked single nucleotide polymorphisms (SNPs) (p < 10À6) that were associated with alcohol depen- dence. They included 6 at SERINC2 (3.1 9 10À8 ≤ p ≤ 9.6 9 10À8), 1 at STK40 (p = 1.3 9 10À7), 2 at KIAA0040 (3.3 9 10À7 ≤ p ≤ 5.2 9 10À7), and 1 at IPO11 (p = 6.9 9 10À7).
    [Show full text]
  • Significant Association Between Rare IPO11-HTR1A Variants and Attention Deficit Hyperactivity Disorder in Caucasians
    HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Am J Med Manuscript Author Genet B Neuropsychiatr Manuscript Author Genet. Author manuscript; available in PMC 2016 April 30. Published in final edited form as: Am J Med Genet B Neuropsychiatr Genet. 2015 October ; 168(7): 544–556. doi:10.1002/ajmg.b.32329. Significant association between rare IPO11-HTR1A variants and attention deficit hyperactivity disorder in Caucasians Lingjun Zuo1,*, Laura Saba2, Xiandong Lin3, Yunlong Tan4, Kesheng Wang5, John H. Krystal1,6, Boris Tabakoff2, and Xingguang Luo1,4,* 1Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA 2Department of Pharmacology, University of Colorado Denver School of Medicine, Aurora, CO, USA 3Provincial Key Laboratory of Translational Cancer Medicine, Fujian Provincial Cancer Hospital, Fuzhou, Fujian, China 4Biological Psychiatry Research Center, Beijing Huilongguan Hospital, Beijing, China 5Department of Biostatistics and Epidemiology, College of Public Health, East Tennessee State University, Johnson City, TN, USA 6VA Alcohol Research Center, VA Connecticut Healthcare System, West Haven, CT, USA Abstract Objective—We comprehensively examined the rare variants in the IPO11-HTR1A region to explore their roles in neuropsychiatric disorders. Method—Five hundred seventy-three to 1,181 rare SNPs in subjects of European descent and 1,234-2,529 SNPs in subjects of African descent (0 < minor allele frequency (MAF) < 0.05) were analyzed in a total of 49,268 subjects in 21 independent cohorts with 11 different neuropsychiatric disorders. Associations between rare variant constellations and diseases and associations between individual rare variants and diseases were tested. RNA expression changes of this region were also explored.
    [Show full text]