Integrative Methods for the Analysis of Genome Wide Association Studies

Total Page:16

File Type:pdf, Size:1020Kb

Integrative Methods for the Analysis of Genome Wide Association Studies INTEGRATIVE METHODS FOR THE ANALYSIS OF GENOME WIDE ASSOCIATION STUDIES A DISSERTATION SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Marc A. Schaub June 2012 © 2012 by Marc Andreas Schaub. All Rights Reserved. Re-distributed by Stanford University under license with the author. This work is licensed under a Creative Commons Attribution- Noncommercial 3.0 United States License. http://creativecommons.org/licenses/by-nc/3.0/us/ This dissertation is online at: http://purl.stanford.edu/qt820xd3631 ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Serafim Batzoglou, Primary Adviser I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Atul Butte I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. David Dill Approved for the Stanford University Committee on Graduate Studies. Patricia J. Gumport, Vice Provost Graduate Education This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file in University Archives. iii Abstract Genome Wide Association Studies (GWAS) have identified over 4,500 common vari- ants in the human genome that are statistically associated with diseases and other phenotypical traits. Most identified associations, however, only have a small effect on disease risk, and their relevance in a clinical setting remains the subject of extensive debate. In this thesis I present three integrative analysis directions that extend on GWAS by developing new methods, by using genotyping data to ask new questions, and by integrating additional types of data to generate functional hypotheses about the biological processes underlying associations. First, I introduce a new classifier-based methodology that identifies similarities in the genetic architecture of diseases. This method can successfully identify both known and novel relationships between common diseases such as type 1 diabetes, rheumatoid arthritis, hypertension and bipolar disease. Second, I show how control individuals from a GWAS can be used to detect genetic differences between the pseudoautosomal regions of chromosomes X and Y in the general population, which can be attributed to differences in allele frequency between the two sex chromosomes likely caused by selective pressure. Finally, I present an approach that integrates experimental data generated by the ENCODE consortium in order to identify functional Single Nucleotide Polymorphisms (SNPs). These functional SNPs are associated with a phenotype, either directly or through linkage disequilibrium, and overlap a functional part of the genome such as a transcribed region or a transcription factor binding site. GWAS associations are significantly enriched for functional annotations, and up to 80% of all associations iv previously reported in a GWAS can be mapped to a functional SNP. For most asso- ciations the functional SNP most strongly supported by experimental evidence is a SNP in linkage disequilibrium with the reported association rather then the reported SNP itself. v Acknowledgment I would like to thank my advisor Serafim Batzoglou for his advice, feedback, encour- agements and support throughout my graduate career, for giving me the freedom to explore a broad range of research directions, and for bringing together such a truly outstanding group of researchers. In particular, I would like to thank George Asi- menos and Chuong Do for all their highly valuable advice early in my graduate career, Anshul Kundaje for his advice and support during the second half of my thesis work, and Irene Kaplow for having been such a fantastic summer student. I would like to thank Atul Butte for all the feedback and encouragement through- out my thesis work, for inviting me to his group meetings, for serving on my qualifying examination, defense and reading committees, and for giving me the privilege of col- laborating closely with two amazing Ph.D. students in his group, Marina Sirota and Linda Liu. Working with Marina and Linda has certainly been the favorite part of my research work at Stanford, and I'm very grateful for everything they have done to make these joint projects such a successful and tremendously enriching experience. I would like to thank Michael Snyder for giving me the opportunity to collaborate with his group on the analysis of the ENCODE data, for his advice and support, and for serving on my defense committee, Ross Hardison for his support of my work on linking ENCODE and GWAS data, David Dill for serving on my qualifying exam, defense and reading committees and Arend Sidow for chairing my thesis defense. I would like to thank my friends and colleagues in the Batzoglou, Butte and Snyder labs and in the Stanford biomedical research community for their support, encour- agements, advice, feedback, and the many fruitful discussions we had about research and life in general: Sarah Aerni, Andy Beck, Sivan Bercovici, Alan Boyle, Leticia vi Britos, David Chen, Rong Chen, Tiffany Chen, Annie Chiang, Erik Corona, Michelle Davison, Eugene Davydov, Omkar Deshpande, Joel Dudley, Robert Edgar, Megan El- more, Sangeeta English, Patrick Flaherty, Jason Flannick, Chuan Sheng Foo, Eugene Fratkin, Yael Garten, Andrew Gentles, Sam Gross, Adam Grossman, Philip Guo, Manoj Hariharan, Lin Huang, Nadine Hussami, Robert Ikeda, Konrad Karczewski, Dorna Kashef-Haghighi, Peter Kang, Purvesh Khatri, Keiichi Kodama, Andy Kogel- nik, Sofia Kyriazopoulou-Panagiotopoulou, Wei-Nchih Lee, Daniel Li, Li Li, Max Libbrecht, Irene Liu, Yuling Liu, Alex Morgan, Daniel Newburger, Tony Novak, Jon Palma, Chirag Patel, Victoria Popic, Yannick Pouliot, Dmitry Pushkarev, Jesse Rodriguez, Jon Rodriguez, David Ruau, Olga Russakovsky, Karen Sachs, Raheleh Salari, Nicelio Sanchez-Luege, Shai Shen-Orr, Andreas Sundquist, Silpa Suthram, Nick Tatonetti, Rob Tirrell, Shivkumar Venkatasubrahmanyam, Dan Webster and Noah Zimmerman. My research work would not have been possible without the outstanding techni- cal and administrative support of Miles Davis, Kathi DiTommaso, Sebastian Gutier- rez, Alex Sandra Pinedo, Alex Skrenchuk, Tanya Raschke, Liliana Rivera and Verna Wong. During my time at Stanford, I had the privilege of being involved in a broad range of extracurricular activities. I would like to thank all my friends in Stan- ford EMS, and in particular Florian Schmitzberger, Chris Cheung, Brian Cheung, Glenn Ulansey, Lauren Mamer, Mark Liao and James Liao, the teaching staff of the Stanford EMT program, the Stanford Wilderness Medicine instructor team, and the Escondido Village Community Associates for their friendship, encouragements and support throughout my graduate career, and for just being an amazing group of peo- ple! These programs tremendously enriched my experience at Stanford, and would not have been possible without the support of the Department of Public Safety, the Division of Emergency Medicine and Stanford Outdoor Education. I would like to thank the Graduate Life Office, and in particular Ken Hsu, Laurette Beeson and Anne Boswell for their support of my work as a Community Associate in Studio 2, and all the great work they do to assist the Stanford graduate student community in general. vii While many miles away, my friends and family in Switzerland, and in particular Fr´ed´eric Ev´equoz,Gr´egoryMermoud´ and Gr´egoryTh´eoduloz as well as my brother Alain have always been very supportive of my work. Finally, none of this would have been possible without the unwavering support of my family throughout my entire career. I am deeply grateful to my father Andreas and my mother Margrith for everything they have done in order to give me the opportunity to follow my interests, and for always encouraging me to do so, even when it meant living nine timezones away from home. Danke viel, vielmals f¨uralles! Joint Work Chapter 3 and Sections 2.1 and 2.2 of Chapter 2 are a reproduction, in part, of a previously published article: M.A. Schaub, I.M. Kaplow, M. Sirota, C.B. Do, A.J. Butte, S. Batzoglou. A Classifier-based Approach to Identify Genetic Similarities Between Diseases. Bioin- formatics 25: i21-29. 2009. I would like to thank my co-authors Irene M. Kaplow, Marina Sirota, Chuong B. Do, Atul J. Butte and Serafim Batzoglou for their contributions to this project. I conceived and designed the study, performed all data preprocessing, implemented the version of the decision tree classifier used to obtain the reported results, analyzed the data and wrote the manuscript. Irene M. Kaplow performed exploratory research comparing various classifiers, which lead to the choice of the Decision Tree classifier we used. Marina Sirota revised Figure 3.1, and designed the version shown herein. Chuong B. Do and Marina Sirota provided input and feedback on the study design and data analysis. Atul J. Butte and Serafim Batzoglou helped conceive the study and supervised the study. All authors revised the manuscript. Chapter 4 and Section 2.3 of Chapter 2 represent joint work that will become part of a manuscript to be submitted after the time of submission of this thesis. I would like to thank my co-authors on this upcoming manuscript Linda Y. Liu, Marina Sirota, Serafim Batzoglou and Atul J. Butte for their contributions to this viii project. Linda Y. Liu and I jointly conceived and designed the study. Linda Y. Liu performed the analysis on the WTCCC data set. I performed the analysis on the HapMap 3 dataset, developed the modified Hardy-Weinberg model, identified the sequence homology issue leading to false positives in autosomes, and wrote the chapter.
Recommended publications
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Evaluation of the NOD/SCID Xenograft Model for Glucocorticoid-Regulated
    Bhadri et al. BMC Genomics 2011, 12:565 http://www.biomedcentral.com/1471-2164/12/565 RESEARCHARTICLE Open Access Evaluation of the NOD/SCID xenograft model for glucocorticoid-regulated gene expression in childhood B-cell precursor acute lymphoblastic leukemia Vivek A Bhadri1,3, Mark J Cowley2, Warren Kaplan2, Toby N Trahair1,3 and Richard B Lock1* Abstract Background: Glucocorticoids such as prednisolone and dexamethasone are critical drugs used in multi-agent chemotherapy protocols used to treat acute lymphoblastic leukemia (ALL), and response to glucocorticoids is highly predictive of outcome. The NOD/SCID xenograft mouse model of ALL is a clinically relevant model in which the mice develop a systemic leukemia which retains the fundamental biological characteristics of the original disease. Here we report a study evaluating the NOD/SCID xenograft mouse model to investigate glucocorticoid- induced gene expression. Cells from a glucocorticoid-sensitive xenograft derived from a child with B-cell precursor ALL were inoculated into NOD/SCID mice. When highly engrafted the mice were randomized into groups of 4 to receive dexamethasone 15 mg/kg by intraperitoneal injection or vehicle control. Leukemia cells were harvested from mice spleens at 0, 8, 24 or 48 hours thereafter, and gene expression analyzed on Illumina WG-6_V3 chips, comparing all groups to time 0 hours. Results: The 8 hour dexamethasone-treated timepoint had the highest number of significantly differentially expressed genes, with fewer observed at the 24 and 48 hour timepoints, and with minimal changes seen across the time-matched controls. When compared to publicly available datasets of glucocorticoid-induced gene expression from an in vitro cell line study and from an in vivo study of patients with ALL, at the level of pathways, expression changes in the 8 hour xenograft samples showed a similar response to patients treated with glucocorticoids.
    [Show full text]
  • (COPD) and Lung Cancer by Means of Cell Specific
    UNDERSTANDING SHARED PATHOGENESIS BETWEEN CHRONIC OBSTRUCTIVE PULMONARY DISEASE (COPD) AND LUNG CANCER BY MEANS OF CELL SPECIFIC GENOMICS CLARA EMILY GREEN A thesis submitted to the University of Birmingham for the degree of DOCTOR OF PHILOSOPHY The Institute of Inflammation and Ageing College of Medical and Dental Sciences University of Birmingham February 2018 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. Abstract Introduction COPD (Chronic Obstructive Pulmonary Disease) and lung cancer are related conditions associated with inflammation. Relatively little focus has been given to the endothelium, through which inflammatory cells transmigrate to reach the lung. We sought to determine if coding and non-coding alterations in pulmonary endothelium exist in COPD and lung cancer. Methods Patients with and without COPD undergoing thoracic surgery were recruited. Pulmonary Endothelial Cells were isolated from lung and tumour and extracted RNA (ribonucleic acid) used for miRNA (micro-RNA) and mRNA (messenger RNA) microarrays. Ingenuity pathway analysis (IPA) was also carried out. Results 2071 genes and 43 miRNAs were significantly upregulated in COPD. 4 targets were validated by quantitative polymerase chain reaction, of which miR-181b-3p was chosen for functional validation.
    [Show full text]
  • A Dissertation Entitled the Androgen Receptor
    A Dissertation entitled The Androgen Receptor as a Transcriptional Co-activator: Implications in the Growth and Progression of Prostate Cancer By Mesfin Gonit Submitted to the Graduate Faculty as partial fulfillment of the requirements for the PhD Degree in Biomedical science Dr. Manohar Ratnam, Committee Chair Dr. Lirim Shemshedini, Committee Member Dr. Robert Trumbly, Committee Member Dr. Edwin Sanchez, Committee Member Dr. Beata Lecka -Czernik, Committee Member Dr. Patricia R. Komuniecki, Dean College of Graduate Studies The University of Toledo August 2011 Copyright 2011, Mesfin Gonit This document is copyrighted material. Under copyright law, no parts of this document may be reproduced without the expressed permission of the author. An Abstract of The Androgen Receptor as a Transcriptional Co-activator: Implications in the Growth and Progression of Prostate Cancer By Mesfin Gonit As partial fulfillment of the requirements for the PhD Degree in Biomedical science The University of Toledo August 2011 Prostate cancer depends on the androgen receptor (AR) for growth and survival even in the absence of androgen. In the classical models of gene activation by AR, ligand activated AR signals through binding to the androgen response elements (AREs) in the target gene promoter/enhancer. In the present study the role of AREs in the androgen- independent transcriptional signaling was investigated using LP50 cells, derived from parental LNCaP cells through extended passage in vitro. LP50 cells reflected the signature gene overexpression profile of advanced clinical prostate tumors. The growth of LP50 cells was profoundly dependent on nuclear localized AR but was independent of androgen. Nevertheless, in these cells AR was unable to bind to AREs in the absence of androgen.
    [Show full text]
  • Identification of Potential Key Genes in Gastric Cancer Using Bioinformatics Analysis
    178 BIOMEDICAL REPORTS 12: 178-192, 2020 Identification of potential key genes in gastric cancer using bioinformatics analysis WEI WANG1, YING HE2, QI ZHAO3, XIAODONG ZHAO3 and ZHIHONG LI1 1Department of Gastroenterology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700; 2Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan 610075; 3Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, P.R. China Received October 5, 2019; Accepted January 27, 2020 DOI: 10.3892/br.2020.1281 Abstract. Gastric cancer (GC) is one of the most common types results of the present study suggest that FN1, COL1A1, INHBA of cancer worldwide. Patients must be identified at an early and CST1 may be potential biomarkers and therapeutic targets stage of tumor progression for treatment to be effective. The for GC. Additional studies are required to explore the potential aim of the present study was to identify potential biomarkers value of ATP4A and ATP4B in the treatment of GC. with diagnostic value in patients with GC. To examine potential therapeutic targets for GC, four Gene Expression Introduction Omnibus (GEO) datasets were downloaded and screened for differentially expressed genes (DEGs). Gene Ontology Gastric cancer (GC) is a malignant tumor that originates and Kyoto Encyclopedia of Genes and Genomes (KEGG) in the epithelium of the gastric mucosa and is one of the analyses were subsequently performed to study the func- most common types of malignant tumors in the world (1). tion and pathway enrichment of the identified DEGs. A According to GLOBOCAN 2018, there were >1,000,000 new protein-protein interaction (PPI) network was constructed.
    [Show full text]
  • Research Article Identification of Key Genes and Pathways in Triple-Negative Breast Cancer by Integrated Bioinformatics Analysis
    Hindawi BioMed Research International Volume 2018, Article ID 2760918, 10 pages https://doi.org/10.1155/2018/2760918 Research Article Identification of Key Genes and Pathways in Triple-Negative Breast Cancer by Integrated Bioinformatics Analysis Pengzhi Dong ,1 Bing Yu,2 Lanlan Pan,1 Xiaoxuan Tian ,1 and Fangfang Liu 3 1 Tianjin State Key Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin 300193, China 2Tianjin Central Hospital of Gynecology Obstetrics, Tianjin 300100, China 3Department of Breast Pathology and Research Laboratory, Key Laboratory of Breast Cancer Prevention and Terapy (Ministry of Education), National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China Correspondence should be addressed to Fangfang Liu; [email protected] Received 14 March 2018; Revised 15 June 2018; Accepted 4 July 2018; Published 2 August 2018 Academic Editor: Robert A. Vierkant Copyright © 2018 Pengzhi Dong et al. Tis is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Purpose. Triple-negative breast cancer refers to breast cancer that does not express estrogen receptor (ER), progesterone receptor (PR), or human epidermal growth factor receptor 2 (Her2). Tis study aimed to identify the key pathways and genes and fnd the potential initiation and progression mechanism of triple-negative breast cancer (TNBC). Methods. We downloaded the gene expression profles of GSE76275 from Gene Expression Omnibus (GEO) datasets. Tis microarray Super-Series sets are composed of gene expression data from 265 samples which included 67 non-TNBC and 198 TNBC.
    [Show full text]
  • Regulation of the Glucocorticoid Receptor Via a BET
    RESEARCH ARTICLE Regulation of the glucocorticoid receptor via a BET-dependent enhancer drives antiandrogen resistance in prostate cancer Neel Shah1,2, Ping Wang3, John Wongvipat1, Wouter R Karthaus1, Wassim Abida4, Joshua Armenia1, Shira Rockowitz3, Yotam Drier5, Bradley E Bernstein5, Henry W Long6, Matthew L Freedman6, Vivek K Arora7, Deyou Zheng3, Charles L Sawyers1,8* 1Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, United States; 2The Louis V. Gerstner Graduate School of Biomedical Sciences, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States; 3Department of Neurology, Genetics and Neuroscience, Albert Einstein College of Medicine, Bronx, United States; 4Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, United States; 5Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, United States; 6Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, United States; 7Division of Medical Oncology, Washington University School of Medicine, St Louis, United States; 8Howard Hughes Medical Institute, Memorial Sloan Kettering Cancer Center, New York, United States Abstract In prostate cancer, resistance to the antiandrogen enzalutamide (Enz) can occur through bypass of androgen receptor (AR) blockade by the glucocorticoid receptor (GR). In contrast to fixed genomic alterations, here we show that GR-mediated antiandrogen resistance is *For correspondence: [email protected] adaptive and reversible due to regulation of GR expression by a tissue-specific enhancer. GR expression is silenced in prostate cancer by a combination of AR binding and EZH2-mediated Competing interest: See repression at the GR locus, but is restored in advanced prostate cancers upon reversion of both page 15 repressive signals.
    [Show full text]
  • Supplementary Table 1
    Supplementary Table 1. 492 genes are unique to 0 h post-heat timepoint. The name, p-value, fold change, location and family of each gene are indicated. Genes were filtered for an absolute value log2 ration 1.5 and a significance value of p ≤ 0.05. Symbol p-value Log Gene Name Location Family Ratio ABCA13 1.87E-02 3.292 ATP-binding cassette, sub-family unknown transporter A (ABC1), member 13 ABCB1 1.93E-02 −1.819 ATP-binding cassette, sub-family Plasma transporter B (MDR/TAP), member 1 Membrane ABCC3 2.83E-02 2.016 ATP-binding cassette, sub-family Plasma transporter C (CFTR/MRP), member 3 Membrane ABHD6 7.79E-03 −2.717 abhydrolase domain containing 6 Cytoplasm enzyme ACAT1 4.10E-02 3.009 acetyl-CoA acetyltransferase 1 Cytoplasm enzyme ACBD4 2.66E-03 1.722 acyl-CoA binding domain unknown other containing 4 ACSL5 1.86E-02 −2.876 acyl-CoA synthetase long-chain Cytoplasm enzyme family member 5 ADAM23 3.33E-02 −3.008 ADAM metallopeptidase domain Plasma peptidase 23 Membrane ADAM29 5.58E-03 3.463 ADAM metallopeptidase domain Plasma peptidase 29 Membrane ADAMTS17 2.67E-04 3.051 ADAM metallopeptidase with Extracellular other thrombospondin type 1 motif, 17 Space ADCYAP1R1 1.20E-02 1.848 adenylate cyclase activating Plasma G-protein polypeptide 1 (pituitary) receptor Membrane coupled type I receptor ADH6 (includes 4.02E-02 −1.845 alcohol dehydrogenase 6 (class Cytoplasm enzyme EG:130) V) AHSA2 1.54E-04 −1.6 AHA1, activator of heat shock unknown other 90kDa protein ATPase homolog 2 (yeast) AK5 3.32E-02 1.658 adenylate kinase 5 Cytoplasm kinase AK7
    [Show full text]
  • Insights Into MYC Biology Through Investigation of Synthetic Lethal Interactions with MYC Deregulation
    Insights into MYC biology through investigation of synthetic lethal interactions with MYC deregulation Mai Sato Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2014 © 2014 Mai Sato All Rights Reserved ABSTRACT Insights into MYC biology through investigation of synthetic lethal interactions with MYC deregulation Mai Sato MYC (or c-myc) is a bona fide “cancer driver” oncogene that is deregulated in up to 70% of human tumors. In addition to its well-characterized role as a transcription factor that can directly promote tumorigenic growth and proliferation, MYC has transcription-independent functions in vital cellular processes including DNA replication and protein synthesis, contributing to its complex biology. MYC expression, activity, and stability are highly regulated through multiple mechanisms. MYC deregulation triggers genome instability and oncogene-induced DNA replication stress, which are thought to be critical in promoting cancer via mechanisms that are still unclear. Because regulated MYC activity is essential for normal cell viability and MYC is a difficult protein to target pharmacologically, targeting genes or pathways that are essential to survive MYC deregulation offer an attractive alternative as a means to combat tumor cells with MYC deregulation. To this end, we conducted a genome-wide synthetic lethal shRNA screen in MCF10A breast epithelial cells stably expressing an inducible MYCER transgene. We identified and validated FBXW7 as a high-confidence synthetic lethal (MYC-SL) candidate gene. FBXW7 is a component of an E3 ubiquitin ligase complex that degrades MYC. FBXW7 knockdown in MCF10A cells selectively induced cell death in MYC-deregulated cells compared to control.
    [Show full text]
  • Genomic Signature of Parity in the Breast of Premenopausal Women
    Santucci-Pereira et al. Breast Cancer Research (2019) 21:46 https://doi.org/10.1186/s13058-019-1128-x RESEARCH ARTICLE Open Access Genomic signature of parity in the breast of premenopausal women Julia Santucci-Pereira1*† , Anne Zeleniuch-Jacquotte2,3†, Yelena Afanasyeva2†, Hua Zhong2†, Michael Slifker4, Suraj Peri4, Eric A. Ross4, Ricardo López de Cicco1, Yubo Zhai1, Theresa Nguyen1, Fathima Sheriff1, Irma H. Russo1, Yanrong Su1, Alan A. Arslan2,5, Pal Bordas6,7, Per Lenner7, Janet Åhman6, Anna Stina Landström Eriksson6, Robert Johansson8, Göran Hallmans9, Paolo Toniolo5 and Jose Russo1 Abstract Background: Full-term pregnancy (FTP) at an early age confers long-term protection against breast cancer. Previously, we reported that a FTP imprints a specific gene expression profile in the breast of postmenopausal women. Herein, we evaluated gene expression changes induced by parity in the breast of premenopausal women. Methods: Gene expression profiling of normal breast tissue from 30 nulliparous (NP) and 79 parous (P) premenopausal volunteers was performed using Affymetrix microarrays. In addition to a discovery/validation analysis, we conducted an analysis of gene expression differences in P vs. NP women as a function of time since last FTP. Finally, a laser capture microdissection substudy was performed to compare the gene expression profile in the whole breast biopsy with that in the epithelial and stromal tissues. Results: Discovery/validation analysis identified 43 differentially expressed genes in P vs. NP breast. Analysis of expression as a function of time since FTP revealed 286 differentially expressed genes (238 up- and 48 downregulated) comparing all P vs. all NP, and/or P women whose last FTP was less than 5 years before biopsy vs.
    [Show full text]
  • Molecular Characterization of Breast and Lung Tumors by Integration of Multiple Data Types with Sparse-Factor Analysis
    bioRxiv preprint doi: https://doi.org/10.1101/183582; this version posted September 14, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Molecular characterization of breast and lung tumors by integration of multiple data types with sparse-factor analysis Tycho Bismeijer1, Sander Canisius1,2, Lodewyk Wessels1,3 1 Division of Molecular Carcinogenesis, The Netherlands Cancer Institute, Amsterdam, The Netherlands 2 Division of Molecular Pathology, The Netherlands Cancer Institute, Amsterdam, The Netherlands 3 Faculty of EEMCS, Delft University of Technology, Delft, The Netherlands Abstract Effective cancer treatment is crucially dependent on the identification of the biological processes that drive a tumor. This process is complicated by the fact that multiple processes may be active simultaneously and that each tumor has a unique spectrum of active processes. While clustering has been applied extensively to subtype tumors, its discrete nature makes it inherently unsuitable to this task. In addition, the availability of multiple data types per tumor has become commonplace and provides the opportunity to comprehensively profile the processes driving a tumor. Here we introduce Functional Sparse Factor Analysis (FuncSFA) to address these challenges. FuncSFA integrates multiple data types to define a lower dimensional space capturing the variation in a set of tumors across those data types. A tailor-made module allows the association of the identified factors to biological processes. FuncSFA is inspired by iCluster, which we improve in several key aspects.
    [Show full text]
  • PRODUCT SPECIFICATION Prest Antigen C9orf152 Product
    PrEST Antigen C9orf152 Product Datasheet PrEST Antigen PRODUCT SPECIFICATION Product Name PrEST Antigen C9orf152 Product Number APrEST75420 Gene Description chromosome 9 open reading frame 152 Alternative Gene bA470J20.2 Names Corresponding Anti-C9orf152 (HPA050769) Antibodies Description Recombinant protein fragment of Human C9orf152 Amino Acid Sequence Recombinant Protein Epitope Signature Tag (PrEST) antigen sequence: PQDTSHQVHHRGKLVGSDQRLPPEGDTHLFETNQMTQQGTGIPEAAQLPC QVGNTQTKAVESGLKFSTQCPLSIKNPHRSGKPAYYPFPQRKTPRISQAA RNL Fusion Tag N-terminal His6ABP (ABP = Albumin Binding Protein derived from Streptococcal Protein G) Expression Host E. coli Purification IMAC purification Predicted MW 29 kDa including tags Usage Suitable as control in WB and preadsorption assays using indicated corresponding antibodies. Purity >80% by SDS-PAGE and Coomassie blue staining Buffer PBS and 1M Urea, pH 7.4. Unit Size 100 µl Concentration Lot dependent Storage Upon delivery store at -20°C. Avoid repeated freeze/thaw cycles. Notes Gently mix before use. Optimal concentrations and conditions for each application should be determined by the user. Product of Sweden. For research use only. Not intended for pharmaceutical development, diagnostic, therapeutic or any in vivo use. No products from Atlas Antibodies may be resold, modified for resale or used to manufacture commercial products without prior written approval from Atlas Antibodies AB. Warranty: The products supplied by Atlas Antibodies are warranted to meet stated product specifications and to conform to label descriptions when used and stored properly. Unless otherwise stated, this warranty is limited to one year from date of sales for products used, handled and stored according to Atlas Antibodies AB's instructions. Atlas Antibodies AB's sole liability is limited to replacement of the product or refund of the purchase price.
    [Show full text]