Downloaded from Rnaseq CEU60/ and All the Source Codes for the Current Study Are Available from the Corresponding Author

Total Page:16

File Type:pdf, Size:1020Kb

Downloaded from Rnaseq CEU60/ and All the Source Codes for the Current Study Are Available from the Corresponding Author Choi et al. BMC Genetics (2017) 18:93 DOI 10.1186/s12863-017-0561-z METHODOLOGY ARTICLE Open Access Network analysis for count data with excess zeros Hosik Choi1, Jungsoo Gim2, Sungho Won3,YouJinKim4, Sunghoon Kwon5 and Changyi Park6* Abstract Background: Undirected graphical models or Markov random fields have been a popular class of models for representing conditional dependence relationships between nodes. In particular, Markov networks help us to understand complex interactions between genes in biological processes of a cell. Local Poisson models seem to be promising in modeling positive as well as negative dependencies for count data. Furthermore, when zero counts are more frequent than are expected, excess zeros should be considered in the model. Methods: We present a penalized Poisson graphical model for zero inflated count data and derive an expectation- maximization (EM) algorithm built on coordinate descent. Our method is shown to be effective through simulated and real data analysis. Results: Results from the simulated data indicate that our method outperforms the local Poisson graphical model in the presence of excess zeros. In an application to a RNA sequencing data, we also investigate the gender effect by comparing the estimated networks according to different genders. Our method may help us in identifying biological pathways linked to sex hormone regulation and thus understanding underlying mechanisms of the gender differences. Conclusions: We have presented a penalized version of zero inflated spatial Poisson regression and derive an efficient EM algorithm built on coordinate descent. We discuss possible improvements of our method as well as potential research directions associated with our findings from the RNA sequencing data. Keywords: Count data, EM algorithm, Network, Zero inflation Background The main focus of this study is to infer the network Graphical models help us to explore relationships between structure for a count data. The auto-Poisson model in [5] nodes in graphs. Undirected graphical models or Markov is a natural extension of univariate Poisson distribution. random fields have been a popular class of models for rep- However it can model only negative dependencies, so that resenting conditional dependence relationships between the conditional distributions define a unique joint distri- nodes. Examples include Gaussian graphical models for bution consistently. Yang et al. [6] propose variants of the continuous data, Ising model for binary data, and multi- auto-Poisson model such as truncated, quadratic, and sub- nomial graphical models. These Markov networks help us linear Poisson graphical models(PGM). However none of to understand complex interactions between genes in bio- them provide a satisfactory answer to the question of how logical processes of a cell and have been well studied in to specify a consistent joint graphical model for count data bioinformatics. Examples of Markov networks in learning capturing both positive and negative dependencies. Allen the network structure from microarray and next genera- and Liu [4] consider a local PGM (LPGM). The LPGM tion sequencing data include [1–4]. For more details on does not have a consistent joint graphical model, but it Markov network inference, see those and the references has the local Markov property and thus the zero coeffi- therein. cient of an edge weight between two nodes implies the conditional independence of the two nodes given the oth- ers. Žitnik and Zupan [7] consider a latent factor Poisson *Correspondence: [email protected] 6Department of Statistics, University of Seoul, 02504, Seoul, Korea model and [8] propose to learn conditional dependence Full list of author information is available at the end of the article © The Author(s). 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Choi et al. BMC Genetics (2017) 18:93 Page 2 of 10 structures for binary and Poisson data via marginal loss Methods functions. Also a semiparametric Guassian copula, called In this section, we present our graph learning method the nonparanormal graphical model (NPGM), has been based on a penalization of the ZISP in [14] and derive an proposed [9]. efficient EM algorithm for its computation. In practice, zero counts are sometimes more frequent than are expected under a univariate Poisson distribution. Zero inflated local Poisson graphical model In such cases, a zero-inflated Poisson (ZIP) distribution is Let N denote the number of observations and p denote often adopted. Applications of ZIP models include mod- thenumberofvariablesornodes.DenoteG = (V, E), eling of defects in quality control [10] and alcoholism where V ={1, ..., p} is the set of vertices or nodes and and substance abuse in medicine [11]. Extensions of a E is the set of edges. We use uppercase letters such as ZIP model in different frameworks are well-studied in X and Z when we refer to random variables. Observa- the literature. Dobbie and Welsh [12] extend the two tions are written in lowercase. For example, xi denote ith component approach in [13] for serially correlated count observation of X. Vectors and matrices are represented data exhibiting extra zeros. Monod [14] develops a zero- by boldface and blackboard boldface letters, respectively. inflated spatial Poisson (ZISP) model. Buu et al. [11] Define X = (xij)N×p,wherexij is generated from two studyvariableselectionmethodssuchasLASSOand latent components with zero and Poisson states. Let zij one-step SCAD for ZIP regression models. For compu- be a latent variable such that zij = 1ifxij is from zero tation, a local linear approximation (LLA) is adopted. state and zij = 0ifxij is from Poisson state. zij follows a The LLA algorithm fails to converge particularly with Bernoulli distribution with πj.LetI(·) denotes an indica- small sample sizes because it requires fitting unpenalized tor function. Then the ZISP model in [14] is defined by ZIP regression models. Wang et al. [15] propose an expec- tation maximization (EM) algorithm [16] for a penalized − x ZIP regression model built on coordinate descent algo- μj j e μj rithms. The EM algorithm seems to have some advantages P Xj = xj|Xk =xk, k = j = πjI(xj = 0)+ 1 − πj , x ! over the LLA algorithm in numerical convergence and j tuning. (1) In this paper, we are interested in the construction of = + graphical models for count data, particularly, with exces- where μj exp βj k=j βjkxk , βj is an intercept sive zeros. To this end, we propose a penalized version adjusting for Xj,andβjk is the parameter accounting for the ZISP model in [14] called zero inflated local Poisson the conditional relation between Xj and Xk. graphical model (ZILPGM) and derive an EM algorithm Due to the zero inflation term in the conditional prob- built on coordinate descent as in [15]. We show the effec- ability, the situation becomes more complicated in our tiveness of our method on simulated and real data. In case than in LPGM. Because the important part is the an application to a RNA sequencing data, we investigate pairwise interaction term in the pairwise-only depen- the gender effect by comparing the estimated networks dency models, the situation is basically similar. In order according to different genders. It has been well noted that to have a valid joint distribution, the coefficient for the gender is one of the major contributors in the differen- interaction term βjk should be non-positive. As in the tiation of gene expression profiles [17, 18] and various LPGM,wedonotsolvetheissueofnegativeparameters sexually dimorphic phenotypes, most of which result from in the Poisson graphical model. Note that any existing hormonal differences [19]. It was reported that transcrip- approaches (e.g. in [6]) do not succeed in giving a satis- tome study could be predicted to represent a different factory answer to the consistency issue. Rather, we focus promising approach for the identification of biological not on the consistency issue but on the practical issue pathways linked to sex hormone regulation and the analy- of estimating positive as well as negative dependencies as sis of associated gene regulatory networks [20]. However, in LPGM. the elucidation of underlying mechanisms of the gen- In order to learn graph structures, we consider the min- der differences is still an area of interest and intense imization of the penalized pseudo log-likelihood of (1) in investigation. the general weighted LASSO form: The paper is organized as follows. In “Methods” section, − x we propose a new graph learning method based on ZISP N p μij ij 1 e μij and provide an efficient EM type numerical algorithm. − log πjI(xij = 0) + 1 − πj N x ! In “Results” section, we compare performances of our i=1 j=1 ij method with LPGM on simulated and real data sets. p Some discussions and concluding remarks are given in + λ wjk|βjk|,(2) “Conclusions” section. j=1 k=j Choi et al. BMC Genetics (2017) 18:93 Page 3 of 10 = + ≥ where μij exp βj k=j βjkxik , λ 0isthepenalty distribution by introducing a latent variable and derive an parameter, and w ≥ 0 is an appropriate weight. As in EM algorithm. jk T = = [4], we can select the tuning parameter using the sta- Define β−j β0, (βk)k=j and xi,−j bility selection criterion in [21]. More specifically, we (1, (x ) = ))T . The log-likelihood function with respect select the optimal λ is selected from 30 equal-spaced ik k j max min max to complete data can be written as grid points in log scale on [λ , λ ], where λ = 1 N min = max × N N maxj∈{1,··· ,p} maxk=j i=1 xikxij and λ λ − N c =− − − 10 4.Foreachj, we fit poisson regression using glmnet.
Recommended publications
  • Haploid Genetic Screens Identify an Essential Role for PLP2 in the Downregulation of Novel Plasma Membrane Targets by Viral E3 Ubiquitin Ligases
    Haploid Genetic Screens Identify an Essential Role for PLP2 in the Downregulation of Novel Plasma Membrane Targets by Viral E3 Ubiquitin Ligases Richard T. Timms1, Lidia M. Duncan1, Iva A. Tchasovnikarova1, Robin Antrobus1, Duncan L. Smith2, Gordon Dougan3, Michael P. Weekes1, Paul J. Lehner1* 1 Cambridge Institute for Medical Research, Addenbrooke’s Hospital, Cambridge, United Kingdom, 2 Paterson Institute for Cancer Research, University of Manchester, Withington, Manchester, United Kingdom, 3 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom Abstract The Kaposi’s sarcoma-associated herpesvirus gene products K3 and K5 are viral ubiquitin E3 ligases which downregulate MHC-I and additional cell surface immunoreceptors. To identify novel cellular genes required for K5 function we performed a forward genetic screen in near-haploid human KBM7 cells. The screen identified proteolipid protein 2 (PLP2), a MARVEL domain protein of unknown function, as essential for K5 activity. Genetic loss of PLP2 traps the viral ligase in the endoplasmic reticulum, where it is unable to ubiquitinate and degrade its substrates. Subsequent analysis of the plasma membrane proteome of K5-expressing KBM7 cells in the presence and absence of PLP2 revealed a wide range of novel K5 targets, all of which required PLP2 for their K5-mediated downregulation. This work ascribes a critical function to PLP2 for viral ligase activity and underlines the power of non-lethal haploid genetic screens in human cells to identify the genes involved in pathogen manipulation of the host immune system. Citation: Timms RT, Duncan LM, Tchasovnikarova IA, Antrobus R, Smith DL, et al. (2013) Haploid Genetic Screens Identify an Essential Role for PLP2 in the Downregulation of Novel Plasma Membrane Targets by Viral E3 Ubiquitin Ligases.
    [Show full text]
  • Exceptional Conservation of Horse–Human Gene Order on X Chromosome Revealed by High-Resolution Radiation Hybrid Mapping
    Exceptional conservation of horse–human gene order on X chromosome revealed by high-resolution radiation hybrid mapping Terje Raudsepp*†, Eun-Joon Lee*†, Srinivas R. Kata‡, Candice Brinkmeyer*, James R. Mickelson§, Loren C. Skow*, James E. Womack‡, and Bhanu P. Chowdhary*¶ʈ *Department of Veterinary Anatomy and Public Health, ‡Department of Veterinary Pathobiology, College of Veterinary Medicine, and ¶Department of Animal Science, College of Agriculture and Life Science, Texas A&M University, College Station, TX 77843; and §Department of Veterinary Pathobiology, University of Minnesota, 295f AS͞VM, St. Paul, MN 55108 Contributed by James E. Womack, December 30, 2003 Development of a dense map of the horse genome is key to efforts ciated with the traits, once they are mapped by genetic linkage aimed at identifying genes controlling health, reproduction, and analyses with highly polymorphic markers. performance. We herein report a high-resolution gene map of the The X chromosome is the most conserved mammalian chro- horse (Equus caballus) X chromosome (ECAX) generated by devel- mosome (18, 19). Extensive comparisons of structure, organi- oping and typing 116 gene-specific and 12 short tandem repeat zation, and gene content of this chromosome in evolutionarily -markers on the 5,000-rad horse ؋ hamster whole-genome radia- diverse mammals have revealed a remarkable degree of conser tion hybrid panel and mapping 29 gene loci by fluorescence in situ vation (20–22). Until now, the chromosome has been best hybridization. The human X chromosome sequence was used as a studied in humans and mice, where the focus of research has template to select genes at 1-Mb intervals to develop equine been the intriguing patterns of X inactivation and the involve- orthologs.
    [Show full text]
  • Analysis of Gene Expression Data for Gene Ontology
    ANALYSIS OF GENE EXPRESSION DATA FOR GENE ONTOLOGY BASED PROTEIN FUNCTION PREDICTION A Thesis Presented to The Graduate Faculty of The University of Akron In Partial Fulfillment of the Requirements for the Degree Master of Science Robert Daniel Macholan May 2011 ANALYSIS OF GENE EXPRESSION DATA FOR GENE ONTOLOGY BASED PROTEIN FUNCTION PREDICTION Robert Daniel Macholan Thesis Approved: Accepted: _______________________________ _______________________________ Advisor Department Chair Dr. Zhong-Hui Duan Dr. Chien-Chung Chan _______________________________ _______________________________ Committee Member Dean of the College Dr. Chien-Chung Chan Dr. Chand K. Midha _______________________________ _______________________________ Committee Member Dean of the Graduate School Dr. Yingcai Xiao Dr. George R. Newkome _______________________________ Date ii ABSTRACT A tremendous increase in genomic data has encouraged biologists to turn to bioinformatics in order to assist in its interpretation and processing. One of the present challenges that need to be overcome in order to understand this data more completely is the development of a reliable method to accurately predict the function of a protein from its genomic information. This study focuses on developing an effective algorithm for protein function prediction. The algorithm is based on proteins that have similar expression patterns. The similarity of the expression data is determined using a novel measure, the slope matrix. The slope matrix introduces a normalized method for the comparison of expression levels throughout a proteome. The algorithm is tested using real microarray gene expression data. Their functions are characterized using gene ontology annotations. The results of the case study indicate the protein function prediction algorithm developed is comparable to the prediction algorithms that are based on the annotations of homologous proteins.
    [Show full text]
  • Identification of the Binding Partners for Hspb2 and Cryab Reveals
    Brigham Young University BYU ScholarsArchive Theses and Dissertations 2013-12-12 Identification of the Binding arP tners for HspB2 and CryAB Reveals Myofibril and Mitochondrial Protein Interactions and Non- Redundant Roles for Small Heat Shock Proteins Kelsey Murphey Langston Brigham Young University - Provo Follow this and additional works at: https://scholarsarchive.byu.edu/etd Part of the Microbiology Commons BYU ScholarsArchive Citation Langston, Kelsey Murphey, "Identification of the Binding Partners for HspB2 and CryAB Reveals Myofibril and Mitochondrial Protein Interactions and Non-Redundant Roles for Small Heat Shock Proteins" (2013). Theses and Dissertations. 3822. https://scholarsarchive.byu.edu/etd/3822 This Thesis is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. Identification of the Binding Partners for HspB2 and CryAB Reveals Myofibril and Mitochondrial Protein Interactions and Non-Redundant Roles for Small Heat Shock Proteins Kelsey Langston A thesis submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Science Julianne H. Grose, Chair William R. McCleary Brian Poole Department of Microbiology and Molecular Biology Brigham Young University December 2013 Copyright © 2013 Kelsey Langston All Rights Reserved ABSTRACT Identification of the Binding Partners for HspB2 and CryAB Reveals Myofibril and Mitochondrial Protein Interactors and Non-Redundant Roles for Small Heat Shock Proteins Kelsey Langston Department of Microbiology and Molecular Biology, BYU Master of Science Small Heat Shock Proteins (sHSP) are molecular chaperones that play protective roles in cell survival and have been shown to possess chaperone activity.
    [Show full text]
  • Identification of Key Pathways and Genes in Endometrial Cancer Using Bioinformatics Analyses
    ONCOLOGY LETTERS 17: 897-906, 2019 Identification of key pathways and genes in endometrial cancer using bioinformatics analyses YAN LIU, TENG HUA, SHUQI CHI and HONGBO WANG Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430022, P.R. China Received March 16, 2018; Accepted October 12, 2018 DOI: 10.3892/ol.2018.9667 Abstract. Endometrial cancer (EC) is one of the most Introduction common gynecological cancer types worldwide. However, to the best of our knowledge, its underlying mechanisms Endometrial carcinoma (EC) is one of the most common remain unknown. The current study downloaded three mRNA gynecological cancer types, with increasing global incidence and microRNA (miRNA) datasets of EC and normal tissue in recent years (1). A total of 60,050 cases of EC and 10,470 samples, GSE17025, GSE63678 and GSE35794, from the EC-associated cases of mortality were reported in the USA in Gene Expression Omnibus to identify differentially expressed 2016 (1), which was markedly higher than the 2012 statistics genes (DEGs) and miRNAs (DEMs) in EC tumor tissues. of 47,130 cases and 8,010 mortalities (2). Although numerous The DEGs and DEMs were then validated using data from studies have been conducted to investigate the mechanisms of The Cancer Genome Atlas and subjected to gene ontology endometrial tumorigenesis and development, to the best of our and Kyoto Encyclopedia of Genes and Genomes pathway knowledge, the exact etiology remains unknown. Understanding analysis. STRING and Cytoscape were used to construct a the potential molecular mechanisms underlying EC initiation protein-protein interaction network and the prognostic effects and progression is of great clinical significance.
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • The Inactive X Chromosome Is Epigenetically Unstable and Transcriptionally Labile in Breast Cancer
    Supplemental Information The inactive X chromosome is epigenetically unstable and transcriptionally labile in breast cancer Ronan Chaligné1,2,3,8, Tatiana Popova1,4, Marco-Antonio Mendoza-Parra5, Mohamed-Ashick M. Saleem5 , David Gentien1,6, Kristen Ban1,2,3,8, Tristan Piolot1,7, Olivier Leroy1,7, Odette Mariani6, Hinrich Gronemeyer*5, Anne Vincent-Salomon*1,4,6,8, Marc-Henri Stern*1,4,6 and Edith Heard*1,2,3,8 Extended Experimental Procedures Cell Culture Human Mammary Epithelial Cells (HMEC, Invitrogen) were grown in serum-free medium (HuMEC, Invitrogen). WI- 38, ZR-75-1, SK-BR-3 and MDA-MB-436 cells were grown in Dulbecco’s modified Eagle’s medium (DMEM; Invitrogen) containing 10% fetal bovine serum (FBS). DNA Methylation analysis. We bisulfite-treated 2 µg of genomic DNA using Epitect bisulfite kit (Qiagen). Bisulfite converted DNA was amplified with bisulfite primers listed in Table S3. All primers incorporated a T7 promoter tag, and PCR conditions are available upon request. We analyzed PCR products by MALDI-TOF mass spectrometry after in vitro transcription and specific cleavage (EpiTYPER by Sequenom®). For each amplicon, we analyzed two independent DNA samples and several CG sites in the CpG Island. Design of primers and selection of best promoter region to assess (approx. 500 bp) were done by a combination of UCSC Genome Browser (http://genome.ucsc.edu) and MethPrimer (http://www.urogene.org). All the primers used are listed (Table S3). NB: MAGEC2 CpG analysis have been done with a combination of two CpG island identified in the gene core. Analysis of RNA allelic expression profiles (based on Human SNP Array 6.0) DNA and RNA hybridizations were normalized by Genotyping console.
    [Show full text]
  • A Multi-Omics Interpretable Machine Learning Model Reveals Modes of Action of Small Molecules Natasha L
    www.nature.com/scientificreports OPEN A Multi-Omics Interpretable Machine Learning Model Reveals Modes of Action of Small Molecules Natasha L. Patel-Murray1, Miriam Adam2, Nhan Huynh2, Brook T. Wassie2, Pamela Milani2 & Ernest Fraenkel 2,3* High-throughput screening and gene signature analyses frequently identify lead therapeutic compounds with unknown modes of action (MoAs), and the resulting uncertainties can lead to the failure of clinical trials. We developed an approach for uncovering MoAs through an interpretable machine learning model of transcriptomics, epigenomics, metabolomics, and proteomics. Examining compounds with benefcial efects in models of Huntington’s Disease, we found common MoAs for compounds with unrelated structures, connectivity scores, and binding targets. The approach also predicted highly divergent MoAs for two FDA-approved antihistamines. We experimentally validated these efects, demonstrating that one antihistamine activates autophagy, while the other targets bioenergetics. The use of multiple omics was essential, as some MoAs were virtually undetectable in specifc assays. Our approach does not require reference compounds or large databases of experimental data in related systems and thus can be applied to the study of agents with uncharacterized MoAs and to rare or understudied diseases. Unknown modes of action of drug candidates can lead to unpredicted consequences on efectiveness and safety. Computational methods, such as the analysis of gene signatures, and high-throughput experimental methods have accelerated the discovery of lead compounds that afect a specifc target or phenotype1–3. However, these advances have not dramatically changed the rate of drug approvals. Between 2000 and 2015, 86% of drug can- didates failed to earn FDA approval, with toxicity or a lack of efcacy being common reasons for their clinical trial termination4,5.
    [Show full text]
  • Downloaded from [266]
    Patterns of DNA methylation on the human X chromosome and use in analyzing X-chromosome inactivation by Allison Marie Cotton B.Sc., The University of Guelph, 2005 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY in The Faculty of Graduate Studies (Medical Genetics) THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver) January 2012 © Allison Marie Cotton, 2012 Abstract The process of X-chromosome inactivation achieves dosage compensation between mammalian males and females. In females one X chromosome is transcriptionally silenced through a variety of epigenetic modifications including DNA methylation. Most X-linked genes are subject to X-chromosome inactivation and only expressed from the active X chromosome. On the inactive X chromosome, the CpG island promoters of genes subject to X-chromosome inactivation are methylated in their promoter regions, while genes which escape from X- chromosome inactivation have unmethylated CpG island promoters on both the active and inactive X chromosomes. The first objective of this thesis was to determine if the DNA methylation of CpG island promoters could be used to accurately predict X chromosome inactivation status. The second objective was to use DNA methylation to predict X-chromosome inactivation status in a variety of tissues. A comparison of blood, muscle, kidney and neural tissues revealed tissue-specific X-chromosome inactivation, in which 12% of genes escaped from X-chromosome inactivation in some, but not all, tissues. X-linked DNA methylation analysis of placental tissues predicted four times higher escape from X-chromosome inactivation than in any other tissue. Despite the hypomethylation of repetitive elements on both the X chromosome and the autosomes, no changes were detected in the frequency or intensity of placental Cot-1 holes.
    [Show full text]
  • 1 Supporting Information for a Microrna Network Regulates
    Supporting Information for A microRNA Network Regulates Expression and Biosynthesis of CFTR and CFTR-ΔF508 Shyam Ramachandrana,b, Philip H. Karpc, Peng Jiangc, Lynda S. Ostedgaardc, Amy E. Walza, John T. Fishere, Shaf Keshavjeeh, Kim A. Lennoxi, Ashley M. Jacobii, Scott D. Rosei, Mark A. Behlkei, Michael J. Welshb,c,d,g, Yi Xingb,c,f, Paul B. McCray Jr.a,b,c Author Affiliations: Department of Pediatricsa, Interdisciplinary Program in Geneticsb, Departments of Internal Medicinec, Molecular Physiology and Biophysicsd, Anatomy and Cell Biologye, Biomedical Engineeringf, Howard Hughes Medical Instituteg, Carver College of Medicine, University of Iowa, Iowa City, IA-52242 Division of Thoracic Surgeryh, Toronto General Hospital, University Health Network, University of Toronto, Toronto, Canada-M5G 2C4 Integrated DNA Technologiesi, Coralville, IA-52241 To whom correspondence should be addressed: Email: [email protected] (M.J.W.); yi- [email protected] (Y.X.); Email: [email protected] (P.B.M.) This PDF file includes: Materials and Methods References Fig. S1. miR-138 regulates SIN3A in a dose-dependent and site-specific manner. Fig. S2. miR-138 regulates endogenous SIN3A protein expression. Fig. S3. miR-138 regulates endogenous CFTR protein expression in Calu-3 cells. Fig. S4. miR-138 regulates endogenous CFTR protein expression in primary human airway epithelia. Fig. S5. miR-138 regulates CFTR expression in HeLa cells. Fig. S6. miR-138 regulates CFTR expression in HEK293T cells. Fig. S7. HeLa cells exhibit CFTR channel activity. Fig. S8. miR-138 improves CFTR processing. Fig. S9. miR-138 improves CFTR-ΔF508 processing. Fig. S10. SIN3A inhibition yields partial rescue of Cl- transport in CF epithelia.
    [Show full text]
  • Aberrant Promoter Methylation and Tumor Suppressive Activity of the DFNA5 Gene in Colorectal Carcinoma
    Oncogene (2008) 27, 3624–3634 & 2008 Nature Publishing Group All rights reserved 0950-9232/08 $30.00 www.nature.com/onc ORIGINAL ARTICLE Aberrant promoter methylation and tumor suppressive activity of the DFNA5 gene in colorectal carcinoma MS Kim1, X Chang1, K Yamashita1, JK Nagpal1, JH Baek2,GWu3, B Trink1, EA Ratovitski1, M Mori4 and D Sidransky1 1Department of Otolaryngology, Head and Neck Cancer Research Division, Johns Hopkins University, Baltimore, MD, USA; 2Department of Genetic Medicine, Institute of Cell Engineering, Johns Hopkins University, Baltimore, MD, USA; 3Karmanos Cancer Institute, Department of Pathology, Wayne State University, Detroit, MI, USA and 4Department of Surgical Oncology, Medical Institute of Bioregulation, Kyushu University, Tsurumibaru, Beppu, Japan To identify novel methylated gene promoters, we com- Introduction pared differential RNA expression profiles of colorectal cancer (CRC) cell lines with or without treatment of Aberrant gene expression is a characteristic of human 5-aza-20-deoxycytidine (5-aza-dC). Out of 1776 genes cancers, and changes in DNA methylation status can that were initially ‘absent (that is, silenced)’ by gene have profound effects on the expression of genes. Tumor expression array analysis, we selected 163 genes that were suppressor genes (TSGs) display both genetic and increased after 5-aza-dC treatment in at least two of three epigenetic inactivation in human tumors, and the CRC cell lines. The microarray results were confirmed by transcriptional silencing of TSGs has established hy- Reverse Transcription–PCR, and CpG island of the gene permethylation as a common mechanism for loss of promoters were amplified and sequenced for examination TSG function in human cancer (Herman, 1999).
    [Show full text]