Gene Expression Signatures for Cancer Cell Line Drug Sensitivity and Patient Outcome

Total Page:16

File Type:pdf, Size:1020Kb

Gene Expression Signatures for Cancer Cell Line Drug Sensitivity and Patient Outcome This dissertation is submitted for the degree of Doctor of Philosophy Gene expression signatures for cancer cell line drug sensitivity and patient outcome Michael Schubert Submitted August 2016 Corpus Christi College, University of Cambridge EMBL-EBI DECLARATION This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except as declared in the Preface and specified in the text. It is not substantially the same as any that I have submitted, or, is being concurrently submitted for a degree or diploma or other qualific- ation at the University of Cambridge or any other University or similar institution except as declared in the Preface and specified in the text. I further state that no substantial part of my dissertation has already been submitted, or, is being concurrently submitted for any such de- gree, diploma or other qualification at the University of Cambridge or any other University of similar institution except as declared in the Preface and specified in the text. It does not exceed the prescribed word limit for the relevant Degree Committee. Michael Schubert August, 2016 3 SUMMARY The explanation of phenotypes in cancer, such as cell line drug response or patient survival, has largely been focussed on genomic alterations. While this approach has generated many profound insights into cancer biology, it does not directly make statements about the signaling im- pact those cellular aberrations create. With direct measurements much less widely available than gene expression, pathway methods (mostly mapping gene expression onto signalling proteins) have so far fallen short on delivering actionable evidence. This may in part be due to lack of robustness, but these approaches are fundamentally at odds with the notion of tight post-translational control of signal transduc- tion. A way to solve this may be to derive consensus signatures of pathway activity to make inferences about signalling, or signatures of specific drugs to investigate their interactions. As a baseline (chapter 2), I investigated how well pathway meth- ods compare to driver mutations in terms of explaining cell line drug response. I went on to analyse the value of gene expression as a down- stream signature of signaling activity instead of mapping it to the path- way components by means of a previously published platform. This improved over mapping pathway members using Gene Ontology or Re- actome, but it considered only sets of up-regulated genes defined by multiple arbitrary cutoffs. Hence, I extended the data set and created a linear model (chapter 3) that compares favourably to gene set as well as state of the art pathway methods in terms of recovering driver mutations and providing biomarkers for cell line drug response and patient survival (chapter 4). To complement this, I investigate how gene expression signatures of drugs can be used in conjunction with vi- ability data to suggest effective drug combinations where no pathway information is available (chapter 5). To the best of my knowledge, this thesis represents the first compre- hensive analysis of different pathway methods across primary cohorts and cancer cell lines, as well as the first large-scale systematic analysis of drug sensitisation that could lead to new drug combinations. 5 ACKNOWLEDGEMENTS This thesis would not have been possible without the generous support and feedback of many people, for which I am deeply grateful. First and foremost, I would like to thank my supervisor Julio Saez- Rodriguez for providing me with the opportunity to undertake my PhD in his group at EMBL-EBI as well as his support during my PhD. I would also like to extend my gratitude to all members of the Saez Rodriguez group, especially Francesco Iorio, Emanuel Goncalves, Luz Garcia Alonso, Michael Menden and Camille Terfve for fruitful dicus- sions and their friendship. I would like to thank my academic collaborators, among which are my industry supervisor Joanna Betts, the possibility to visit the US site of GSK with Pankaj Agarwal, the Sanger GDSC team lead by Mathew Garnett and Ultan McDermott, as well as people involved in smaller side projects too numerous to name. Many thanks to my TAC members Florian Markowetz, Sarah Teich- mann, and Christoph Merten who I was always looking forward to meet once a year for a useful critique of my project and much needed direction. Finally, I would like to thank the EMBL community for an amazing four years that I spent at the institute. You know who you are. 7 CONTENTS List of Abbreviations 14 List of Figures 15 List of Tables 19 1 introduction 21 1.1 Cancer Biology 21 1.1.1 Significance and epidemiology 21 1.1.2 Causes of genetic variation 21 1.1.3 Functional impact of mutations: the central dogma 23 1.1.4 Malignant transformation 24 1.1.5 Tumour heterogeneity 25 1.2 Therapeutic interventions 25 1.2.1 The therapeutic window 25 1.2.2 Cytotoxic drugs 26 1.2.3 Targeted therapies 26 1.2.4 Development of resistance 27 1.3 Disease models 28 1.3.1 Rationale 28 1.3.2 Cell lines 29 1.3.3 Animal models 30 1.4 Molecular data types and assays 30 1.4.1 Role of molecular data 30 1.4.2 DNA 31 1.4.3 RNA 31 1.4.4 Proteins and phosphorylation 33 1.5 Data sets 34 1.5.1 Public gene expression repositories 34 1.5.2 Gene sets and pathways 35 1.5.3 Primary cancer cohorts 36 1.5.4 Cancer cell line drug sensitivity resources 36 1.5.5 Drug-induced transcriptional changes 37 1.6 Computational methods 38 1.6.1 Patterns of mutations 39 1.6.2 Gene expression clustering 40 1.6.3 Gene expression signatures 42 1.6.4 Networks 43 1.7 Reproducibility of results 46 1.7.1 Experimental and computational 46 9 10 Contents 1.7.2 Scientific software ecosystem 47 1.7.3 Reproducible workflows 48 1.8 Motivation and outlook 49 2 gene set methods for drug response 51 2.1 Methods used throughout this thesis 52 2.1.1 Gene sets 52 2.1.2 Gene Set Variation Analysis (GSVA) 53 2.1.3 Drug associations using the half-maximum inhib- itory concentration (IC50) 53 2.2 Cell Line Drug Response 56 2.2.1 Associations with Mutations 56 2.2.2 Associations with Gene Ontology categories 58 2.2.3 Associations with Reactome pathways 59 2.3 Pathway-responsive genes: The SPEED Platform 59 2.3.1 Separability-optimised Gene Sets 60 2.3.2 Associations between Pathway Scores and Tis- sues 62 2.3.3 Pan-Cancer Drug Associations 64 2.4 Discussion 66 2.4.1 Cell line drug response 66 2.4.2 Pathway-responsive genes 67 3 building an improved model of perturbation-re- sponse genes 69 3.1 Curating and assembling a database of publicly available perturbation experiments 70 3.1.1 Defining scope and format 70 3.1.2 Finding suitable Experiments 72 3.1.3 From Experiments to Expression Data 76 3.2 A model for perturbation-response experiments 77 3.2.1 Building a linear model from z-scores 77 3.2.2 Computing pathway scores 81 3.2.3 Consensus gene signatures reveal pathway-response transcriptional modules 82 3.3 Comparison to pathway mapping methods 85 3.3.1 Transcriptional footprints are fundamentally dif- ferent to pathway expression 85 3.3.2 Computing pathway scores for other methods 87 3.3.3 Comparison within perturbation experiments 89 3.3.4 Comparison across perturbation experiments 92 3.4 Discussion 94 3.4.1 Need for an improved database and model 94 3.4.2 Value of curated perturbation experiments 94 Contents 11 3.4.3 Importance of distinguishing between pathway expression and activity 95 4 functional evaluation of pathway methods in basal gene expression 97 4.1 Perturbation-response signatures for basal gene expres- sion 97 4.1.1 Correlation in basal expression 97 4.1.2 A linear model is easily transferable 97 4.1.3 Dependence on input experiments 99 4.1.4 Pathway scores for other methods 100 4.2 Recall of known pathway modifiers 102 4.2.1 Pathway scores and mutations/CNAs 102 4.2.2 Associations using driver mutations and CNAs 102 4.2.3 Comparison of methods 106 4.3 Cell line drug response using the GDSC 106 4.3.1 Drug associations using GDSC cell lines 106 4.3.2 Pan-cancer Associations with Drug Response 108 4.3.3 Tissue-specific Associations with Drug Response 113 4.4 Patient survival using the TCGA 115 4.4.1 Clinical data and methods 115 4.4.2 Pan-cancer associations with survival 116 4.4.3 Tissue-specific associations with survival 116 4.4.4 Kaplan-Meier survival curves for specific hits 117 4.5 Discussion 118 5 modelling drug interactions 121 5.1 MANTRA and the original Connectivity Map 122 5.1.1 Two-Tailed GSEA using MANTRA 122 5.1.2 Pan-cancer associations 124 5.1.3 Tissue-specific associations 125 5.2 A pan-cancer view of drug sensitisation using LINCS 126 5.2.1 Data quality of the LINCS 126 5.2.2 Creating drug signatures and scoring GDSC cell lines 126 5.2.3 Naive associations with drug response 127 5.2.4 Pathway correlation as a major source for false positives 129 5.2.5 Building an improved model 131 5.3 Tissue-specific synergistic compounds 134 5.3.1 Consensus models for breast cancer cell lines 134 5.3.2 Defining synergy for drug combinations 136 5.3.3 Single-agent drug response curves 138 5.3.4 Synergy of drug combinations 141 5.4 Discussion 143 12 Contents 5.4.1 Original Connectivity Map 143 5.4.2 Pan-cancer view of the LINCS 144 5.4.3 Tissue-specific models and validation 144 6 conclusions 147 bibliography 149 appendix 165 a Associations for baseline methods (chapter 2) 165 a.1 Drug response with unbiased gene sets 165 a.2 SPEED platform 170 b Associations for evaluating signatures (chapter 4) 172 b.1 Pathway scores and mutations 172 b.2 Pathway scores and CNAs 182 b.3 Pathway
Recommended publications
  • The Proximal Signaling Network of the BCR-ABL1 Oncogene Shows a Modular Organization
    Oncogene (2010) 29, 5895–5910 & 2010 Macmillan Publishers Limited All rights reserved 0950-9232/10 www.nature.com/onc ORIGINAL ARTICLE The proximal signaling network of the BCR-ABL1 oncogene shows a modular organization B Titz, T Low, E Komisopoulou, SS Chen, L Rubbi and TG Graeber Crump Institute for Molecular Imaging, Institute for Molecular Medicine, Jonsson Comprehensive Cancer Center, California NanoSystems Institute, Department of Molecular and Medical Pharmacology, University of California, Los Angeles, CA, USA BCR-ABL1 is a fusion tyrosine kinase, which causes signaling effects of BCR-ABL1 toward leukemic multiple types of leukemia. We used an integrated transformation. proteomic approach that includes label-free quantitative Oncogene (2010) 29, 5895–5910; doi:10.1038/onc.2010.331; protein complex and phosphorylation profiling by mass published online 9 August 2010 spectrometry to systematically characterize the proximal signaling network of this oncogenic kinase. The proximal Keywords: adaptor protein; BCR-ABL1; phospho- BCR-ABL1 signaling network shows a modular and complex; quantitative mass spectrometry; signaling layered organization with an inner core of three leukemia network; systems biology transformation-relevant adaptor protein complexes (Grb2/Gab2/Shc1 complex, CrkI complex and Dok1/ Dok2 complex). We introduced an ‘interaction direction- ality’ analysis, which annotates static protein networks Introduction with information on the directionality of phosphorylation- dependent interactions. In this analysis, the observed BCR-ABL1 is a constitutively active oncogenic fusion network structure was consistent with a step-wise kinase that arises through a chromosomal translocation phosphorylation-dependent assembly of the Grb2/Gab2/ and causes multiple types of leukemia. It is found in Shc1 and the Dok1/Dok2 complexes on the BCR-ABL1 many cases (B25%) of adult acute lymphoblastic core.
    [Show full text]
  • Defining Functional Interactions During Biogenesis of Epithelial Junctions
    ARTICLE Received 11 Dec 2015 | Accepted 13 Oct 2016 | Published 6 Dec 2016 | Updated 5 Jan 2017 DOI: 10.1038/ncomms13542 OPEN Defining functional interactions during biogenesis of epithelial junctions J.C. Erasmus1,*, S. Bruche1,*,w, L. Pizarro1,2,*, N. Maimari1,3,*, T. Poggioli1,w, C. Tomlinson4,J.Lees5, I. Zalivina1,w, A. Wheeler1,w, A. Alberts6, A. Russo2 & V.M.M. Braga1 In spite of extensive recent progress, a comprehensive understanding of how actin cytoskeleton remodelling supports stable junctions remains to be established. Here we design a platform that integrates actin functions with optimized phenotypic clustering and identify new cytoskeletal proteins, their functional hierarchy and pathways that modulate E-cadherin adhesion. Depletion of EEF1A, an actin bundling protein, increases E-cadherin levels at junctions without a corresponding reinforcement of cell–cell contacts. This unexpected result reflects a more dynamic and mobile junctional actin in EEF1A-depleted cells. A partner for EEF1A in cadherin contact maintenance is the formin DIAPH2, which interacts with EEF1A. In contrast, depletion of either the endocytic regulator TRIP10 or the Rho GTPase activator VAV2 reduces E-cadherin levels at junctions. TRIP10 binds to and requires VAV2 function for its junctional localization. Overall, we present new conceptual insights on junction stabilization, which integrate known and novel pathways with impact for epithelial morphogenesis, homeostasis and diseases. 1 National Heart and Lung Institute, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK. 2 Computing Department, Imperial College London, London SW7 2AZ, UK. 3 Bioengineering Department, Faculty of Engineering, Imperial College London, London SW7 2AZ, UK. 4 Department of Surgery & Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK.
    [Show full text]
  • Mining Hub Genes Correlated with Immune Infiltrating Level Across
    Mining hub genes correlated with immune inltrating level across multiple tumors microenvironment Ning Zhao Wuhan Union Hospital Liang Wu Wuhan Union Hospital Zili Zhou Wuhan Union Hospital Xudan Zhang Wuhan Union Hospital Shengbo Han Wuhan Union Hospital Haijun Bao Wuhan Union Hospital Xiaogang Shu ( [email protected] ) Wuhan Union Hospital https://orcid.org/0000-0003-1336-1772 Research article Keywords: Immune inltrating level; PTPN6; Tumors microenvironment; Multiple cancers Posted Date: March 30th, 2020 DOI: https://doi.org/10.21203/rs.2.20604/v2 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 1/18 Abstract Background: Previous studies revealed that cancer-associated differentially expressed genes (DEGs) in an independent cancer type are rarely related to the tumorigenesis and metastasis, while the common DEGs across multiple types of cancer may be proved as potential oncogenes or tumor suppressors. Although tumor-inltrating immune cells have been reported to be associated with prognosis in multiple types of cancer, the hub genes regulating immune cells function in different cancer types remain unclear. Methods: To screen for the hub genes regulating immune inltrating level across multiple tumors microenvironment, the raw data containing RNA sequencing and clinical information from TCGA database and immune scores from ESTIMATE website across 25 cancer types were obtained. Results: Based on the immune scores, all cases were categorized into high-score and low-score groups. Kaplan–Meier survival analysis demonstrated that a strong correlation between immune inltrating level and survival prognosis was found in six cancer types. The functional enrichment analysis of common DEGs revealed that infection and immune response are the most prominent biological characteristics.
    [Show full text]
  • Quantitative Interactomics in Primary T Cells Unveils TCR Signal
    Quantitative interactomics in primary T cells unveils TCR signal diversification extent and dynamics Guillaume Voisinne, Kristof Kersse, Karima Chaoui, Liaoxun Lu, Julie Chaix, Lichen Zhang, Marisa Goncalves Menoita, Laura Girard, Youcef Ounoughene, Hui Wang, et al. To cite this version: Guillaume Voisinne, Kristof Kersse, Karima Chaoui, Liaoxun Lu, Julie Chaix, et al.. Quantitative interactomics in primary T cells unveils TCR signal diversification extent and dynamics. Nature Immunology, Nature Publishing Group, 2019, 20 (11), pp.1530 - 1541. 10.1038/s41590-019-0489-8. hal-03013469 HAL Id: hal-03013469 https://hal.archives-ouvertes.fr/hal-03013469 Submitted on 23 Nov 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. RESOURCE https://doi.org/10.1038/s41590-019-0489-8 Quantitative interactomics in primary T cells unveils TCR signal diversification extent and dynamics Guillaume Voisinne 1, Kristof Kersse1, Karima Chaoui2, Liaoxun Lu3,4, Julie Chaix1, Lichen Zhang3, Marisa Goncalves Menoita1, Laura Girard1,5, Youcef Ounoughene1, Hui Wang3, Odile Burlet-Schiltz2, Hervé Luche5,6, Frédéric Fiore5, Marie Malissen1,5,6, Anne Gonzalez de Peredo 2, Yinming Liang 3,6*, Romain Roncagalli 1* and Bernard Malissen 1,5,6* The activation of T cells by the T cell antigen receptor (TCR) results in the formation of signaling protein complexes (signalo­ somes), the composition of which has not been analyzed at a systems level.
    [Show full text]
  • A Graph-Theoretic Approach to Model Genomic Data and Identify Biological Modules Asscociated with Cancer Outcomes
    A Graph-Theoretic Approach to Model Genomic Data and Identify Biological Modules Asscociated with Cancer Outcomes Deanna Petrochilos A dissertation presented in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2013 Reading Committee: Neil Abernethy, Chair John Gennari, Ali Shojaie Program Authorized to Offer Degree: Biomedical Informatics and Health Education UMI Number: 3588836 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. UMI 3588836 Published by ProQuest LLC (2013). Copyright in the Dissertation held by the Author. Microform Edition © ProQuest LLC. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, MI 48106 - 1346 ©Copyright 2013 Deanna Petrochilos University of Washington Abstract Using Graph-Based Methods to Integrate and Analyze Cancer Genomic Data Deanna Petrochilos Chair of the Supervisory Committee: Assistant Professor Neil Abernethy Biomedical Informatics and Health Education Studies of the genetic basis of complex disease present statistical and methodological challenges in the discovery of reliable and high-confidence genes that reveal biological phenomena underlying the etiology of disease or gene signatures prognostic of disease outcomes. This dissertation examines the capacity of graph-theoretical methods to model and analyze genomic information and thus facilitate using prior knowledge to create a more discrete and functionally relevant feature space.
    [Show full text]
  • Construction and Analysis of Protein-Protein Interaction Network of Non-Alcoholic Fatty Liver Disease
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.01.406215; this version posted December 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Construction and Analysis of Protein-Protein Interaction Network of Non-Alcoholic Fatty Liver Disease Athina I. Amanatidou and George V. Dedoussis Department of Nutrition and Dietetics, School of Health Science and Education, Harokopio University, El. Venizelou 70, 17671, Athens, Greece Correspondence to: G. V. Dedoussis, A. I. Amanatidou, Department of Nutrition and Dietetics, School of Health Science and Education, Harokopio University, El. Venizelou 70, 17671, Athens, Greece E-mail addresses: [email protected] (G.V. Dedoussis), [email protected] (A. I. Amanatidou) Telephone: +302109549179 (G.V. Dedoussis), +306949293472 (A. I. Amanatidou) 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.01.406215; this version posted December 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract Non-alcoholic fatty liver disease (NAFLD) is a disease with multidimensional complexities. Many attempts have been made over the years to treat this disease but its incidence is rising. For this reason, the need to identify and study new candidate proteins that may be associated with NAFLD is of utmost importance. Systems-based approaches such as the analysis of protein-protein interaction (PPI) network could lead to the discovery of new proteins associated with a disease that can then be translated into clinical practice.
    [Show full text]
  • Platform Abstracts
    American Society of Human Genetics 65th Annual Meeting October 6–10, 2015 Baltimore, MD PLATFORM ABSTRACTS Wednesday, October 7, 9:50-10:30am Abstract #’s Friday, October 9, 2:15-4:15 pm: Concurrent Platform Session D: 4. Featured Plenary Abstract Session I Hall F #1-#2 46. Hen’s Teeth? Rare Variants and Common Disease Ballroom I #195-#202 Wednesday, October 7, 2:30-4:30pm Concurrent Platform Session A: 47. The Zen of Gene and Variant 15. Update on Breast and Prostate Assessment Ballroom III #203-#210 Cancer Genetics Ballroom I #3-#10 48. New Genes and Mechanisms in 16. Switching on to Regulatory Variation Ballroom III #11-#18 Developmental Disorders and 17. Shedding Light into the Dark: From Intellectual Disabilities Room 307 #211-#218 Lung Disease to Autoimmune Disease Room 307 #19-#26 49. Statistical Genetics: Networks, 18. Addressing the Difficult Regions of Pathways, and Expression Room 309 #219-#226 the Genome Room 309 #27-#34 50. Going Platinum: Building a Better 19. Statistical Genetics: Complex Genome Room 316 #227-#234 Phenotypes, Complex Solutions Room 316 #35-#42 51. Cancer Genetic Mechanisms Room 318/321 #235-#242 20. Think Globally, Act Locally: Copy 52. Target Practice: Therapy for Genetic Hilton Hotel Number Variation Room 318/321 #43-#50 Diseases Ballroom 1 #243-#250 21. Recent Advances in the Genetic Basis 53. The Real World: Translating Hilton Hotel of Neuromuscular and Other Hilton Hotel Sequencing into the Clinic Ballroom 4 #251-#258 Neurodegenerative Phenotypes Ballroom 1 #51-#58 22. Neuropsychiatric Diseases of Hilton Hotel Friday, October 9, 4:30-6:30pm Concurrent Platform Session E: Childhood Ballroom 4 #59-#66 54.
    [Show full text]
  • Aedes Aegypti Mosquito Haemocytes and Its Involvement in Immunity-Like Functions in Mosquito: a Molecular Study
    Parasitology Research (2019) 118:2509–2521 https://doi.org/10.1007/s00436-019-06408-5 ARTHROPODS AND MEDICAL ENTOMOLOGY - ORIGINAL PAPER Identification of a new type of haematopoietic progenitor kinase-interacting protein (HIP-55) in Aedes aegypti mosquito haemocytes and its involvement in immunity-like functions in mosquito: a molecular study M. Mohiadeen Batcha1,2 & A. Sajith Ahamed3 & Chiung Fang Peng4 Received: 13 July 2018 /Accepted: 25 July 2019 /Published online: 3 August 2019 # Springer-Verlag GmbH Germany, part of Springer Nature 2019 Abstract In this study, we characterize the HIP-55 protein in the mosquito Aedes aegypti for the first time. HIP-55 is a 55-kDa HPK1- interacting protein that is also called SH3P7. HIP-55 constitutively binds HPK1 ‘via’ an HPK1 proline-rich motif 2(PR2) through its C-terminal SH3 domain. HIP-55 critically interacts with ZAP-70, and this interaction was induced by TCR signalling. ZAP-70 phosphorylated HIP-55 at Tyr-334 and Tyr-344 in vitro and in vivo. In our previous findings, AaZAP gene expression strongly proved that AaZAP-70 was involved in immunity-like functions in mosquito. Northern blot analysis of HIP-55 mRNA expres- sion confirmed that it is only expressed in the abdomen and haemocyte tissues; this prediction correlates 100% and a polyclonal antibody also confirmed its localization in haemocytes and the abdomen. We prepared extracts to show the cytoplasmic expres- sion (CE) of this protein. Previous results had proven that this protein is secreted from the cytoplasm; thus, we confirmed here that the protein is a cytoplasmic adaptor protein in mosquitoes and mammalian systems.
    [Show full text]
  • Phenotype Informatics
    Freie Universit¨atBerlin Department of Mathematics and Computer Science Phenotype informatics: Network approaches towards understanding the diseasome Sebastian Kohler¨ Submitted on: 12th September 2012 Dissertation zur Erlangung des Grades eines Doktors der Naturwissenschaften (Dr. rer. nat.) am Fachbereich Mathematik und Informatik der Freien Universitat¨ Berlin ii 1. Gutachter Prof. Dr. Martin Vingron 2. Gutachter: Prof. Dr. Peter N. Robinson 3. Gutachter: Christopher J. Mungall, Ph.D. Tag der Disputation: 16.05.2013 Preface This thesis presents research work on novel computational approaches to investigate and characterise the association between genes and pheno- typic abnormalities. It demonstrates methods for organisation, integra- tion, and mining of phenotype data in the field of genetics, with special application to human genetics. Here I will describe the parts of this the- sis that have been published in peer-reviewed journals. Often in modern science different people from different institutions contribute to research projects. The same is true for this thesis, and thus I will itemise who was responsible for specific sub-projects. In chapter 2, a new method for associating genes to phenotypes by means of protein-protein-interaction networks is described. I present a strategy to organise disease data and show how this can be used to link diseases to the corresponding genes. I show that global network distance measure in interaction networks of proteins is well suited for investigat- ing genotype-phenotype associations. This work has been published in 2008 in the American Journal of Human Genetics. My contribution here was to plan the project, implement the software, and finally test and evaluate the method on human genetics data; the implementation part was done in close collaboration with Sebastian Bauer.
    [Show full text]
  • MAP4K Family Kinases and DUSP Family Phosphatases in T-Cell Signaling and Systemic Lupus Erythematosus
    cells Review MAP4K Family Kinases and DUSP Family Phosphatases in T-Cell Signaling and Systemic Lupus Erythematosus Huai-Chia Chuang and Tse-Hua Tan * Immunology Research Center, National Health Research Institutes, Zhunan 35053, Taiwan; [email protected] * Correspondence: [email protected] Received: 3 October 2019; Accepted: 11 November 2019; Published: 13 November 2019 Abstract: T cells play a critical role in the pathogenesis of systemic lupus erythematosus (SLE), which is a severe autoimmune disease. In the past 60 years, only one new therapeutic agent with limited efficacy has been approved for SLE treatment; therefore, the development of early diagnostic biomarkers and therapeutic targets for SLE is desirable. Mitogen-activated protein kinase kinase kinase kinases (MAP4Ks) and dual-specificity phosphatases (DUSPs) are regulators of MAP kinases. Several MAP4Ks and DUSPs are involved in T-cell signaling and autoimmune responses. HPK1 (MAP4K1), DUSP22 (JKAP), and DUSP14 are negative regulators of T-cell activation. Consistently, HPK1 and DUSP22 are downregulated in the T cells of human SLE patients. In contrast, MAP4K3 (GLK) is a positive regulator of T-cell signaling and T-cell-mediated immune responses. MAP4K3 overexpression-induced RORγt–AhR complex specifically controls interleukin 17A (IL-17A) production in T cells, leading to autoimmune responses. Consistently, MAP4K3 and the RORγt–AhR complex are overexpressed in the T cells of human SLE patients, as are DUSP4 and DUSP23. In addition, DUSPs are also involved in either human autoimmune diseases (DUSP2, DUSP7, DUSP10, and DUSP12) or T-cell activation (DUSP1, DUSP5, and DUSP14). In this review, we summarize the MAP4Ks and DUSPs that are potential biomarkers and/or therapeutic targets for SLE.
    [Show full text]
  • Mechanisms of Hippo Pathway Regulation
    Downloaded from genesdev.cshlp.org on September 30, 2021 - Published by Cold Spring Harbor Laboratory Press REVIEW Mechanisms of Hippo pathway regulation Zhipeng Meng, Toshiro Moroishi, and Kun-Liang Guan Department of Pharmacology, Moores Cancer Center, University of California at San Diego, La Jolla, California 92093, USA The Hippo pathway was initially identified in Drosophila 1/2 (LATS1/2; homologs of Drosophila Warts [Wts]) (Fig. melanogaster screens for tissue growth two decades ago 1A). The physiological output of this kinase cascade is and has been a subject extensively studied in both Droso- to restrict the activities of two transcriptional coactiva- phila and mammals in the last several years. The core of tors, Yes-associated protein (YAP) and transcriptional co- the Hippo pathway consists of a kinase cascade, transcrip- activator with PDZ-binding motif (TAZ; two homologs tion coactivators, and DNA-binding partners. Recent of Drosophila Yorkie [Yki]). When YAP and TAZ are ac- studies have expanded the Hippo pathway as a complex tive, they translocate into the nucleus to bind the TEAD signaling network with >30 components. This pathway transcription factor family (homologs of Drosophila Scal- is regulated by intrinsic cell machineries, such as cell– loped [Sd]) and induce expression of a wide range of genes cell contact, cell polarity, and actin cytoskeleton, as that are involved in cell proliferation, survival, and well as a wide range of signals, including cellular energy migration. status, mechanical cues, and hormonal signals that act Mechanistically, the Hippo kinase cascade can be initi- through G-protein-coupled receptors. The major func- ated by TAO kinases (TAOK1/2/3), which phosphorylate tions of the Hippo pathway have been defined to restrict the activation loop of MST1/2 (Thr183 for MST1 and tissue growth in adults and modulate cell proliferation, Thr180 for MST2; hereafter, all residues refer to human differentiation, and migration in developing organs.
    [Show full text]
  • MOL #82305 TITLE PAGE Title: Induced CYP3A4 Expression In
    Downloaded from molpharm.aspetjournals.org at ASPET Journals on September 28, 2021 1 This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted. The final version may differ from this version. This article has not been copyedited and formatted.
    [Show full text]