Combining Heterogeneous Sources of Data for the Reverse-Engineering of Gene Regulatory Networks

Total Page:16

File Type:pdf, Size:1020Kb

Combining Heterogeneous Sources of Data for the Reverse-Engineering of Gene Regulatory Networks Combining heterogeneous sources of data for the reverse-engineering of gene regulatory networks A thesis submitted for the degree of Doctor of Philosophy Emma Steele School of Information Systems, Computing and Mathematics Brunel University January 2010 Abstract Gene Regulatory Networks (GRNs) represent how genes interact in various cellular processes by describing how the expression level, or activity, of genes can a®ect the expression of the other genes. Reverse- engineering GRN models can help biologists understand and gain in- sight into genetic conditions and diseases. Recently, the increasingly widespread use of DNA microarrays, a high-throughput technology that allows the expression of thousands of genes to be measured si- multaneously in biological experiments, has led to many datasets of gene expression measurements becoming publicly available and a sub- sequent explosion of research in the reverse-engineering of GRN mod- els. However, microarray technology has a number of limitations as a data source for the modelling of GRNs, due to concerns over its reliability and the reproducibility of experimental results. The underlying theme of the research presented in this thesis is the incorporation of multiple sources and di®erent types of data into tech- niques for reverse-engineering or learning GRNs from data. By draw- ing on many data sources, the resulting network models should be more robust, accurate and reliable than models that have been learnt using a single data source. This is achieved by focusing on two main strands of research. First, the thesis presents some of the earliest work in the incorporation of prior knowledge that has been generated from a large body of scienti¯c papers, for Bayesian network based GRN models. Second, novel methods for the use of multiple microarray datasets to produce Bayesian network based GRN models are intro- duced. Empirical evaluations are used to show that the incorporation of literature-based prior knowledge and combining multiple microar- ray datasets can provide an improvement, when compared to the use of a single microarray dataset, for the reverse-engineering of Bayesian network based GRN models. Acknowledgements Firstly I would like to thank my supervisor Allan Tucker. Without his support, encouragement, guidance and understanding throughout my PhD, this thesis would have never been completed, and certainly not within the magic three years! I would also thank my second supervisor Xiaohui Liu for all of his invaluable advice and wise words. I would like to acknowledge my colleagues (past and present) in the Centre for Intelligent Data Analysis at Brunel for interesting discus- sions that have contributed towards my research, in particular Michael Hirsch, Veronica Vinciotti and Stephen Swift. I would also like to thank members of the Biosemantics Association in the Netherlands for all their help and support during my PhD. They have been the perfect partners in collaboration! In particular Peter- Bram 't Hoen, Matthew Hestand (Leiden University) and Martijn Schuemie (Erasmus University Rotterdam), who have spent a lot of their time assisting me with this research. To my o±ce buddies, past and present, for keeping the IDA lab fun and a pleasure to work in: Kyri, Karl, Dan, Ana, Natalie, Jose, Jian, Yahya, Christine, Nota and others. A big thankyou to my family for their encouragement throughout my PhD but also for not asking how the thesis was going! Last but de¯nitely not least, thankyou to my husband Andy for all his love and support, and especially for reading this thesis and al- lowing me to monopolise his brand-new computer without too much complaint! And a last minute addition to these acknowledgements: my daughter Isla who arrived after this thesis was submitted, but before the viva. Thankyou for allowing me the time to ¯nish this PhD! Publications The following publications have resulted from the research presented in this thesis: 1. Peeling, E., Tucker, A. & 't Hoen, P. (2007). Discovery of local regulatory structure from microarray gene expression data using Bayesian networks. In Proceedings of the annual workshop on Intelligent Data Analysis in bioMedicine And Pharmacology (IDAMAP). 2. Peeling, E. & Tucker, A. (2007). Consensus gene regula- tory networks: combining multiple microarray gene expression datasets. In AIP Conference Proceedings vol. 940, The 3rd In- ternational Symposium on Computational Life Sciences (COM- PLIFE 2007). 3. Steele, E. & Tucker, A. (2008). Consensus and meta- analysis regulatory networks for combining multiple microarray gene expression datasets. Journal of Biomedical Informatics, 41, 914{926. 4. Steele, E., Tucker, A., 't Hoen, P., & Schuemie, M.J. (2009) Literature-based priors for gene regulatory networks. Bioin- formatics, 25, 1768-1774. 5. Steele, E. & Tucker, A. (2009) Selecting and weighting data for building Consensus Gene Regulatory Networks. In LNCS 5772, Proceedings of the 8th International Symposium on Intelli- gent Data Analysis: Advances in Intelligent Data Analysis VIII , 190-201. Please note that Peeling is the author's maiden name. Item 1 results from preliminary work for this thesis. Items 2 and 3 result from the research described in Chapter 5. Item 4 results from the research presented in Chapter 4. Finally, item 5 is based on Chapter 6. Contents 1 Introduction 1 1.1 Reverse-engineering gene regulatory network models . 2 1.2 Thesis aims . 5 1.3 Thesis contributions . 6 1.4 Thesis outline . 7 2 Gene Regulatory Networks 9 2.1 Regulation of gene expression . 9 2.2 Microarray technology . 12 2.2.1 The basics: data collection . 13 2.2.2 Data processing . 16 2.2.2.1 Image processing . 16 2.2.2.2 Expression ratios . 17 2.2.2.3 Normalisation . 18 2.2.3 Analysis of microarray expression data . 22 2.2.3.1 The gene expression matrix . 22 2.2.3.2 Basic analysis techniques . 23 2.2.3.3 Inferring regulatory relationships . 26 2.2.4 Using multiple microarray gene expression datasets . 30 2.3 Other types of data . 34 2.3.1 Protein-protein interaction data . 34 2.3.2 Transcription factor binding site data . 35 2.3.3 Literature-based knowledge . 35 2.3.4 Use of prior knowledge in inferring regulatory relationships 38 2.4 Summary . 41 v CONTENTS 3 Modelling GRNs using Bayesian networks 43 3.1 Bayesian networks . 43 3.1.1 Conditional independence . 45 3.1.2 Inference . 45 3.1.3 Explaining away . 48 3.1.4 Parameter learning . 49 3.1.5 Equivalence classes . 50 3.1.6 Causality . 50 3.2 Bayesian networks to model gene regulation . 51 3.2.1 Simple regulatory structures . 52 3.2.2 Modelling using microarray expression data . 52 3.2.3 More complex regulatory interactions . 54 3.3 Learning Bayesian network structures . 56 3.3.1 Constraint-based learning . 56 3.3.2 Score-based learning . 57 3.3.3 Comparison of both approaches . 58 3.3.4 Con¯dence levels for network edges . 59 3.4 Evaluation of model performance . 61 3.4.1 Network structure comparison to current knowledge . 62 3.4.2 Prediction of gene expression values . 65 3.5 Recent work in using BNs to model GRNs . 65 4 Prior knowledge 70 4.1 Introduction . 70 4.2 Related work . 71 4.3 Informative Bayesian network priors . 73 4.3.1 Calculating the prior probability of a network structure . 73 4.3.2 Weighting the prior . 76 4.4 Literature-based Bayesian network priors . 77 4.4.1 Literature-based gene concept pro¯ling . 77 4.4.2 Edge prior probabilities from literature-based knowledge . 79 4.5 Evaluating the use of literature-based priors . 81 4.5.1 Evaluation procedure . 81 vi CONTENTS 4.5.2 Application: yeast . 83 4.5.2.1 Comparison to documented interactions . 84 4.5.2.2 Expression value prediction . 84 4.5.3 Application: E. coli ...................... 88 4.5.3.1 Comparison to documented interactions . 88 4.5.3.2 Expression value prediction . 89 4.5.4 Application: human . 92 4.6 Discussion . 96 5 Combining multiple microarray gene expression datasets 99 5.1 Introduction . 99 5.2 Pre-learning aggregation . 100 5.3 Post-learning aggregation . 101 5.3.1 Consensus Bayesian networks . 101 5.3.2 Bayesian network meta-analysis . 103 5.3.3 Related work . 105 5.4 Comparison of pre- and post-learning aggregation methods . 106 5.4.1 Comparison procedure . 107 5.4.2 Application: synthetic network . 108 5.4.3 Application: E. coli SOS response network . 112 5.4.4 Application: yeast heat stress network . 116 5.5 Discussion . 120 6 Incorporating network reliability 123 6.1 Introduction . 123 6.2 Consensus Bootstrapped Bayesian Networks . 125 6.2.1 Algorithm . 125 6.2.2 Incorporating weights . 128 6.2.3 Comparison to the original Consensus algorithm . 133 6.3 Measuring network reliability . 133 6.3.1 A prediction-based reliability measure for networks . 134 6.3.2 Prediction-based network weighting . 136 6.3.3 Prediction-based network selection . 136 6.3.4 Related work . 137 vii CONTENTS 6.4 Network selection or network weighting: a comparison . 138 6.4.1 Datasets and experiment design . 138 6.4.2 Application: synthetic networks . 141 6.4.2.1 Performance comparison of approaches . 141 6.4.2.2 Selection of the consensus threshold . 144 6.4.2.3 Method comparison for the network selection ap- proach . 147 6.4.3 Application: yeast heat-stress network . 148 6.5 Discussion . 152 7 Inducing heuristics for the Consensus approach 156 7.1 Introduction . 156 7.2 Motivation . 157 7.3 Rule learning . 159 7.3.1 Inputs and Outputs . 160 7.3.2 Algorithm basics . 161 7.3.3 Relational rule learning . 162 7.3.4 ACE . 166 7.4 Experimental results . 167 7.4.1 Rule learning inputs .
Recommended publications
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • 1 UST College of Science Department of Biological Sciences
    UST College of Science Department of Biological Sciences 1 Pharmacogenomics of Myofascial Pain Syndrome An Undergraduate Thesis Submitted to the Department of Biological Sciences College of Science University of Santo Tomas In Partial Fulfillment of the Requirements for the Degree of Bachelor of Science in Biology Jose Marie V. Lazaga Marc Llandro C. Fernandez May 2021 UST College of Science Department of Biological Sciences 2 PANEL APPROVAL SHEET This undergraduate research manuscript entitled: Pharmacogenomics of Myofascial Pain Syndrome prepared and submitted by Jose Marie V. Lazaga and Marc Llandro C. Fernandez, was checked and has complied with the revisions and suggestions requested by panel members after thorough evaluation. This final version of the manuscript is hereby approved and accepted for submission in partial fulfillment of the requirements for the degree of Bachelor of Science in Biology. Noted by: Asst. Prof. Marilyn G. Rimando, PhD Research adviser, Bio/MicroSem 602-603 Approved by: Bio/MicroSem 603 panel member Bio/MicroSem 603 panel member Date: Date: UST College of Science Department of Biological Sciences 3 DECLARATION OF ORIGINALITY We hereby affirm that this submission is our own work and that, to the best of our knowledge and belief, it contains no material previously published or written by another person nor material to which a substantial extent has been accepted for award of any other degree or diploma of a university or other institute of higher learning, except where due acknowledgement is made in the text. We also declare that the intellectual content of this undergraduate research is the product of our work, even though we may have received assistance from others on style, presentation, and language expression.
    [Show full text]
  • A Multi-Stage Genome-Wide Association Study of Uterine Fibroids in African Americans
    UCLA UCLA Previously Published Works Title A multi-stage genome-wide association study of uterine fibroids in African Americans. Permalink https://escholarship.org/uc/item/0mc5r0xh Journal Human genetics, 136(10) ISSN 0340-6717 Authors Hellwege, Jacklyn N Jeff, Janina M Wise, Lauren A et al. Publication Date 2017-10-01 DOI 10.1007/s00439-017-1836-1 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Hum Genet (2017) 136:1363–1373 DOI 10.1007/s00439-017-1836-1 ORIGINAL INVESTIGATION A multi‑stage genome‑wide association study of uterine fbroids in African Americans Jacklyn N. Hellwege1,2,3 · Janina M. Jef4 · Lauren A. Wise5,6 · C. Scott Gallagher7 · Melissa Wellons8,9 · Katherine E. Hartmann3,9 · Sarah F. Jones1,3 · Eric S. Torstenson1,2 · Scott Dickinson10 · Edward A. Ruiz‑Narváez6 · Nadin Rohland7 · Alexander Allen7 · David Reich7,11,12 · Arti Tandon7 · Bogdan Pasaniuc13,14 · Nicholas Mancuso13 · Hae Kyung Im10 · David A. Hinds15 · Julie R. Palmer6 · Lynn Rosenberg6 · Joshua C. Denny16,17 · Dan M. Roden2,16,17,18 · Elizabeth A. Stewart19 · Cynthia C. Morton12,20,21,22 · Eimear E. Kenny4 · Todd L. Edwards1,2,3 · Digna R. Velez Edwards2,3,9 Received: 12 April 2017 / Accepted: 16 August 2017 / Published online: 23 August 2017 © Springer-Verlag GmbH Germany 2017 Abstract Uterine fbroids are benign tumors of the uterus imaging, genotyped and imputed to 1000 Genomes. Stage 2 afecting up to 77% of women by menopause. They are the used self-reported fbroid and GWAS data from 23andMe, leading indication for hysterectomy, and account for $34 bil- Inc.
    [Show full text]
  • Stx5-Mediated ER-Golgi Transport in Mammals and Yeast
    cells Review Stx5-Mediated ER-Golgi Transport in Mammals and Yeast Peter TA Linders 1 , Chiel van der Horst 1, Martin ter Beest 1 and Geert van den Bogaart 1,2,* 1 Tumor Immunology Lab, Radboud University Medical Center, Radboud Institute for Molecular Life Sciences, Geert Grooteplein 28, 6525 GA Nijmegen, The Netherlands 2 Department of Molecular Immunology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands * Correspondence: [email protected]; Tel.: +31-50-36-35230 Received: 8 July 2019; Accepted: 25 July 2019; Published: 26 July 2019 Abstract: The soluble N-ethylmaleimide-sensitive factor attachment protein receptor (SNARE) syntaxin 5 (Stx5) in mammals and its ortholog Sed5p in Saccharomyces cerevisiae mediate anterograde and retrograde endoplasmic reticulum (ER)-Golgi trafficking. Stx5 and Sed5p are structurally highly conserved and are both regulated by interactions with other ER-Golgi SNARE proteins, the Sec1/Munc18-like protein Scfd1/Sly1p and the membrane tethering complexes COG, p115, and GM130. Despite these similarities, yeast Sed5p and mammalian Stx5 are differently recruited to COPII-coated vesicles, and Stx5 interacts with the microtubular cytoskeleton, whereas Sed5p does not. In this review, we argue that these different Stx5 interactions contribute to structural differences in ER-Golgi transport between mammalian and yeast cells. Insight into the function of Stx5 is important given its essential role in the secretory pathway of eukaryotic cells and its involvement in infections and neurodegenerative diseases. Keywords: syntaxin 5; Golgi apparatus; endoplasmic reticulum; membrane trafficking; secretory pathway 1. Introduction The secretory pathway is essential for secretion of cytokines, hormones, growth factors, and extracellular matrix proteins, as well as for the delivery of receptors and transporters to the cell membrane and lytic proteins to endo-lysosomal compartments.
    [Show full text]
  • The Genetics of Bipolar Disorder
    Molecular Psychiatry (2008) 13, 742–771 & 2008 Nature Publishing Group All rights reserved 1359-4184/08 $30.00 www.nature.com/mp FEATURE REVIEW The genetics of bipolar disorder: genome ‘hot regions,’ genes, new potential candidates and future directions A Serretti and L Mandelli Institute of Psychiatry, University of Bologna, Bologna, Italy Bipolar disorder (BP) is a complex disorder caused by a number of liability genes interacting with the environment. In recent years, a large number of linkage and association studies have been conducted producing an extremely large number of findings often not replicated or partially replicated. Further, results from linkage and association studies are not always easily comparable. Unfortunately, at present a comprehensive coverage of available evidence is still lacking. In the present paper, we summarized results obtained from both linkage and association studies in BP. Further, we indicated new potential interesting genes, located in genome ‘hot regions’ for BP and being expressed in the brain. We reviewed published studies on the subject till December 2007. We precisely localized regions where positive linkage has been found, by the NCBI Map viewer (http://www.ncbi.nlm.nih.gov/mapview/); further, we identified genes located in interesting areas and expressed in the brain, by the Entrez gene, Unigene databases (http://www.ncbi.nlm.nih.gov/entrez/) and Human Protein Reference Database (http://www.hprd.org); these genes could be of interest in future investigations. The review of association studies gave interesting results, as a number of genes seem to be definitively involved in BP, such as SLC6A4, TPH2, DRD4, SLC6A3, DAOA, DTNBP1, NRG1, DISC1 and BDNF.
    [Show full text]
  • A Trafficome-Wide Rnai Screen Reveals Deployment of Early and Late Secretory Host Proteins and the Entire Late Endo-/Lysosomal V
    bioRxiv preprint doi: https://doi.org/10.1101/848549; this version posted November 19, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 A trafficome-wide RNAi screen reveals deployment of early and late 2 secretory host proteins and the entire late endo-/lysosomal vesicle fusion 3 machinery by intracellular Salmonella 4 5 Alexander Kehl1,4, Vera Göser1, Tatjana Reuter1, Viktoria Liss1, Maximilian Franke1, 6 Christopher John1, Christian P. Richter2, Jörg Deiwick1 and Michael Hensel1, 7 8 1Division of Microbiology, University of Osnabrück, Osnabrück, Germany; 2Division of Biophysics, University 9 of Osnabrück, Osnabrück, Germany, 3CellNanOs – Center for Cellular Nanoanalytics, Fachbereich 10 Biologie/Chemie, Universität Osnabrück, Osnabrück, Germany; 4current address: Institute for Hygiene, 11 University of Münster, Münster, Germany 12 13 Running title: Host factors for SIF formation 14 Keywords: siRNA knockdown, live cell imaging, Salmonella-containing vacuole, Salmonella- 15 induced filaments 16 17 Address for correspondence: 18 Alexander Kehl 19 Institute for Hygiene 20 University of Münster 21 Robert-Koch-Str. 4148149 Münster, Germany 22 Tel.: +49(0)251/83-55233 23 E-mail: [email protected] 24 25 or bioRxiv preprint doi: https://doi.org/10.1101/848549; this version posted November 19, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license.
    [Show full text]
  • Transcriptome-Wide Association Study of Attention Deficit Hyperactivity Disorder Identifies Associated Genes and Phenotypes
    bioRxiv preprint doi: https://doi.org/10.1101/642231; this version posted May 24, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Transcriptome-wide association study of attention deficit hyperactivity disorder identifies associated genes and phenotypes Calwing Liao1,2, Alexandre D. Laporte2, Dan Spiegelman2, Fulya Akçimen1,2, Ridha Joober3, Patrick A. Dion2,4, Guy A. Rouleau1,2,4 1Department of Human Genetics, McGill University, Montréal, Quebec, Canada 2Montreal Neurological Institute, McGill University, Montréal, Quebec, Canada 3Department of Psychiatry, McGill University, Montréal, Quebec, Canada 4Department of Neurology and Neurosurgery, McGill University, Montréal, Quebec, Canada Word count: 2,619 excluding abstract and references Figures: 2 Tables: 3 Keywords: ADHD, transcriptome-wide association study, KAT2B, TMEM161B, amygdala, prefrontal cortex *Correspondence: Dr. Guy A. Rouleau Montreal Neurological Institute and Hospital Department of Neurology and Neurosurgery 3801 University Street, Montreal, QC Canada H3A 2B4. Tel: +1 514 398 2690 Fax: +1 514 398 8248 E-mail: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/642231; this version posted May 24, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Abstract Attention deficit/hyperactivity disorder (ADHD) is one of the most common neurodevelopmental psychiatric disorders.
    [Show full text]
  • Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity
    Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham A thesis submitted in conformity with the requirements for the degree of Master of Science Graduate Department of Molecular Genetics University of Toronto © Copyright by Edward James Higginbotham 2020 i Abstract Characterizing Genomic Duplication in Autism Spectrum Disorder Edward James Higginbotham Master of Science Graduate Department of Molecular Genetics University of Toronto 2020 Duplication, the gain of additional copies of genomic material relative to its ancestral diploid state is yet to achieve full appreciation for its role in human traits and disease. Challenges include accurately genotyping, annotating, and characterizing the properties of duplications, and resolving duplication mechanisms. Whole genome sequencing, in principle, should enable accurate detection of duplications in a single experiment. This thesis makes use of the technology to catalogue disease relevant duplications in the genomes of 2,739 individuals with Autism Spectrum Disorder (ASD) who enrolled in the Autism Speaks MSSNG Project. Fine-mapping the breakpoint junctions of 259 ASD-relevant duplications identified 34 (13.1%) variants with complex genomic structures as well as tandem (193/259, 74.5%) and NAHR- mediated (6/259, 2.3%) duplications. As whole genome sequencing-based studies expand in scale and reach, a continued focus on generating high-quality, standardized duplication data will be prerequisite to addressing their associated biological mechanisms. ii Acknowledgements I thank Dr. Stephen Scherer for his leadership par excellence, his generosity, and for giving me a chance. I am grateful for his investment and the opportunities afforded me, from which I have learned and benefited. I would next thank Drs.
    [Show full text]
  • Machine Learning-Based Identification and Cellular Validation of Tropomyosin 1 As a Genetic Inhibitor of Hematopoiesis
    bioRxiv preprint doi: https://doi.org/10.1101/631895; this version posted May 8, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Machine learning-based identification and cellular validation of Tropomyosin 1 as a genetic inhibitor of hematopoiesis. Thom CS1,2,3*, Jobaliya CD4,5, Lorenz K2,3, Maguire JA4,5, Gagne A4,5, Gadue P4,5, French DL4,5, Voight BF2,3,6* 1Division of Neonatology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA 2Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA 3Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA 4Center for Cellular and Molecular Therapeutics, The Children's Hospital of Philadelphia, PA, USA 5Department of Pathology and Laboratory Medicine, The Children's Hospital of Philadelphia, University of Pennsylvania, Philadelphia, PA, USA 6Institute of Translational Medicine and Therapeutics, University of Pennsylvania, PA, USA *Corresponding authors: [email protected], [email protected] bioRxiv preprint doi: https://doi.org/10.1101/631895; this version posted May 8, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Introductory paragraph A better understanding of the genetic mechanisms regulating hematopoiesis are necessary, and could augment translational efforts to generate red blood cells (RBCs) and/or platelets in vitro.
    [Show full text]
  • Transdifferentiation of Human Mesenchymal Stem Cells
    Transdifferentiation of Human Mesenchymal Stem Cells Dissertation zur Erlangung des naturwissenschaftlichen Doktorgrades der Julius-Maximilians-Universität Würzburg vorgelegt von Tatjana Schilling aus San Miguel de Tucuman, Argentinien Würzburg, 2007 Eingereicht am: Mitglieder der Promotionskommission: Vorsitzender: Prof. Dr. Martin J. Müller Gutachter: PD Dr. Norbert Schütze Gutachter: Prof. Dr. Georg Krohne Tag des Promotionskolloquiums: Doktorurkunde ausgehändigt am: Hiermit erkläre ich ehrenwörtlich, dass ich die vorliegende Dissertation selbstständig angefertigt und keine anderen als die von mir angegebenen Hilfsmittel und Quellen verwendet habe. Des Weiteren erkläre ich, dass diese Arbeit weder in gleicher noch in ähnlicher Form in einem Prüfungsverfahren vorgelegen hat und ich noch keinen Promotionsversuch unternommen habe. Gerbrunn, 4. Mai 2007 Tatjana Schilling Table of contents i Table of contents 1 Summary ........................................................................................................................ 1 1.1 Summary.................................................................................................................... 1 1.2 Zusammenfassung..................................................................................................... 2 2 Introduction.................................................................................................................... 4 2.1 Osteoporosis and the fatty degeneration of the bone marrow..................................... 4 2.2 Adipose and bone
    [Show full text]
  • Imputed Gene Associations Identify Replicable Trans-Acting Genes Enriched in Transcription Pathways and Complex Traits
    bioRxiv preprint doi: https://doi.org/10.1101/471748; this version posted February 26, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-ND 4.0 International license. Running head: trans-PrediXcan 1 Imputed gene associations identify replicable trans-acting genes enriched in transcription pathways and complex traits Heather E. Wheeler1,2,3,∗, Sally Ploch1, Alvaro N. Barbeira4, Rodrigo Bonazzola4, Angela Andaleon1, Alireza Fotuhi Sishpirani5, Ashis Saha6, Alexis Battle6,7, Sushmita Roy8, Hae Kyung Im4∗ 1Department of Biology, Loyola University Chicago, Chicago, IL 2Department of Computer Science, Loyola University Chicago, Chicago, IL 3Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, IL 4Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 5Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 6Department of Computer Science, Johns Hopkins University, Baltimore, MD 7 Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 8Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI Correspondence ∗[email protected] or [email protected] Grants This work is supported by the NIH/NHGRI Academic Research Enhancement Award R15 HG009569 (Wheeler), NIH/NIMH R01 MH107666 and NIDDK P30 DK20595 (Im), and NIH/NIMH R01 MH109905 (Battle). bioRxiv preprint doi: https://doi.org/10.1101/471748; this version posted February 26, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
    [Show full text]
  • Network-Based Analysis of Key Regulatory Genes Implicated in Type
    www.nature.com/scientificreports OPEN Network‑based analysis of key regulatory genes implicated in Type 2 Diabetes Mellitus and Recurrent Miscarriages in Turner Syndrome Anam Farooqui1, Alaa Alhazmi2, Shaful Haque3, Naaila Tamkeen4, Mahboubeh Mehmankhah1, Safa Tazyeen1, Sher Ali5 & Romana Ishrat1* The information on the genotype–phenotype relationship in Turner Syndrome (TS) is inadequate because very few specifc candidate genes are linked to its clinical features. We used the microarray data of TS to identify the key regulatory genes implicated with TS through a network approach. The causative factors of two common co‑morbidities, Type 2 Diabetes Mellitus (T2DM) and Recurrent Miscarriages (RM), in the Turner population, are expected to be diferent from that of the general population. Through microarray analysis, we identifed nine signature genes of T2DM and three signature genes of RM in TS. The power‑law distribution analysis showed that the TS network carries scale‑free hierarchical fractal attributes. Through local‑community‑paradigm (LCP) estimation we fnd that a strong LCP is also maintained which means that networks are dynamic and heterogeneous. We identifed nine key regulators which serve as the backbone of the TS network. Furthermore, we recognized eight interologs functional in seven diferent organisms from lower to higher levels. Overall, these results ofer few key regulators and essential genes that we envisage have potential as therapeutic targets for the TS in the future and the animal models studied here may prove useful in the validation of such targets. Te medical systems and scientists throughout the world are under an unprecedented challenge to meet the medical needs of much of the world’s population that are sufering from chromosomal anomalies.
    [Show full text]