Novel Computational Methods for Cancer Genomics Data Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Novel Computational Methods for Cancer Genomics Data Analysis University of Kentucky UKnowledge Theses and Dissertations--Computer Science Computer Science 2021 NOVEL COMPUTATIONAL METHODS FOR CANCER GENOMICS DATA ANALYSIS Jinpeng Liu University of Kentucky, [email protected] Digital Object Identifier: https://doi.org/10.13023/etd.2021.215 Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Liu, Jinpeng, "NOVEL COMPUTATIONAL METHODS FOR CANCER GENOMICS DATA ANALYSIS" (2021). Theses and Dissertations--Computer Science. 108. https://uknowledge.uky.edu/cs_etds/108 This Doctoral Dissertation is brought to you for free and open access by the Computer Science at UKnowledge. It has been accepted for inclusion in Theses and Dissertations--Computer Science by an authorized administrator of UKnowledge. For more information, please contact [email protected]. STUDENT AGREEMENT: I represent that my thesis or dissertation and abstract are my original work. Proper attribution has been given to all outside sources. I understand that I am solely responsible for obtaining any needed copyright permissions. I have obtained needed written permission statement(s) from the owner(s) of each third-party copyrighted matter to be included in my work, allowing electronic distribution (if such use is not permitted by the fair use doctrine) which will be submitted to UKnowledge as Additional File. I hereby grant to The University of Kentucky and its agents the irrevocable, non-exclusive, and royalty-free license to archive and make accessible my work in whole or in part in all forms of media, now or hereafter known. I agree that the document mentioned above may be made available immediately for worldwide access unless an embargo applies. I retain all other ownership rights to the copyright of my work. I also retain the right to use in future works (such as articles or books) all or part of my work. I understand that I am free to register the copyright to my work. REVIEW, APPROVAL AND ACCEPTANCE The document mentioned above has been reviewed and accepted by the student’s advisor, on behalf of the advisory committee, and by the Director of Graduate Studies (DGS), on behalf of the program; we verify that this is the final, approved version of the student’s thesis including all changes required by the advisory committee. The undersigned agree to abide by the statements above. Jinpeng Liu, Student Dr. Zongming Fei, Major Professor Dr. Zongming Fei, Director of Graduate Studies NOVEL COMPUTATIONAL METHODS FOR CANCER GENOMICS DATA ANALYSIS ________________________________________ DISSERTATION ________________________________________ A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the College of Engineering at the University of Kentucky By Jinpeng Liu Lexington, Kentucky Co- Directors: Dr. Zongming Fei, Professor of Computer Science and Dr. Licong Cui, Assistant Professor of Computer Science Lexington, Kentucky 2021 Copyright © Jinpeng Liu 2021 ABSTRACT OF DISSERTATION NOVEL COMPUTATIONAL METHODS FOR CANCER GENOMICS DATA ANALYSIS Cancer is a genetic disease responsible for one in eight deaths worldwide. The advancement of next-generation sequencing (NGS) technology has revolutionized the cancer research, allowing comprehensively profiling the cancer genome at great resolution. Large-scale cancer genomics research has sparked the needs for efficient and accurate Bioinformatics methods to analyze the data. The research presented in this dissertation focuses on three areas in cancer genomics: cancer somatic mutation detection; cancer driver genes identification and transcriptome profiling on single-cell level. NGS data analysis involves a series of complicated data transformation that convert raw sequencing data to the information that is interpretable by cancer researchers. The first project in the dissertation established a robust, reproducible and scalable cancer genomics data analysis workflow management system that automates the best practice mutation calling pipelines to detect somatic single nucleotide polymorphisms, insertion, deletion and copy number variation from NGS data. It integrates mutation annotation, clinically actionable therapy prediction and data visualization that streamlines the sequence-to-report data transformation. In order to differentiate the driver mutations buried among a vast pool of passenger mutations from a somatic mutation calling project, we developed MEScan in the second project, a novel method that enables genome-scale driver mutations identification based on mutual exclusivity test using cancer somatic mutation data. MEScan implements an efficient statistical framework to de novo screen mutual exclusive patterns and in the meantime taking into account the patient-specific and gene-specific background mutation rate and adjusting the heterogenous mutation frequency. It outperforms several existing methods based on simulation studies and real-world datasets. Genome-wide screening using existing TCGA somatic mutation data discovers novel cancer-specific and pan- cancer mutually exclusive patterns. Bulk RNA sequencing (RNA-Seq) has become one of the most commonly used techniques for transcriptome profiling in a wide spectrum of biomedical and biological research. Analyzing bulk RNA-Seq reads to quantify expression at each gene locus is the first step towards the identification of differentially expressed genes for downstream biological interpretation. Recent advances in single-cell RNA-seq (scRNA-seq) technology allows cancer biologists to profile gene expression on higher resolution cellular level. Preprocessing scRNA-seq data to quantify UMI-based gene count is the key to characterize intra-tumor cellular heterogeneity and identify rare cells that governs tumor progression, metastasis and treatment resistance. Despite its popularity, summarizing gene count from raw sequencing reads remains the one of the most time-consuming steps with existing tools. Current pipelines do not balance the efficiency and accuracy in large-scale gene count summarization in both bulk and scRNA-seq experiments. In the third project, we developed a light-weight k-mer based gene counting algorithm, FastCount, to accurately and efficiently quantify gene-level abundance using bulk RNA-seq or UMI- based scRNA-seq data. It achieves at least an order-of-magnitude speed improvement over the current gold standard pipelines while providing competitive accuracy. KEYWORDS: Cancer Genomics, Pipeline Framework; Somatic Mutation, Driver Mutations, Single Cell RNA-seq Quantification Jinpeng Liu (Name of Student) 06/06/2021 Date NOVEL COMPUTATIONAL METHODS FOR CANCER GENOMICS DATA ANALYSIS By Jinpeng Liu Zongming Fei Co-Director of Dissertation Licong Cui Co-Director of Dissertation Zongming Fei Director of Graduate Studies 06/06/2021 Date ACKNOWLEDGMENTS First, I would like to express my deepest gratitude to Dr. Zongming Fei and Dr. Licong Cui for their continuous support and encouragement throughout my Ph.D. study. They are not only great researchers with immense knowledge, but also excellent advisors that drive students to the right direction. Without their patient guidance, I would not have been able to finish this dissertation. I would also like to express my deepest appreciation to Dr. Chi Wang. He not only taught me a lot of knowledge in statistics, but also helped me learn to collaborate with researchers of different backgrounds. I am also extremely grateful to Dr. Dakshnamoorthy Manivannan for his invaluable advice and directions on my dissertation. I am also extremely grateful to have Dr. Daniela Moga as my Ph.D. Outside Examiner. I wish to thank her for the time and considerations on my Ph.D. defense. I would like to thank my collaborators from Markey Cancer Center for the valuable discussion we had during the past years. Especially, I would like to thank Dr. Susanne Arnold, Dr. Kathleen O’ Connor, Dr. Jill Kolesar, Dr. Chunming Liu and Dr. Hunter Moseley for their suggestion and input in my research. I wish to thank Dr. Heidi Weiss and all the team members in Biostatistcs and Bioinformatics Shared Resource Facility in Markey Cancer Center. Thank you for their support and constructive suggestions on my Ph.D. study. I would also like to thank Dr. Jinze Liu and the members of her lab. I cherish the opportunity to work with my fellows and establish a friendship with them: Dr. Xinan Liu, Dr. Ye Yu, Dr. Yi Zhang, Eamonn Manager and Xiaofei Zhang. My Ph.D. would not have been possible without my family and friends’ support and encouragement. I am very grateful to my parents, my wife, and my kids for their love and for providing me a relaxed and supportive environment. iii TABLE OF CONTENTS ACKNOWLEDGMENTS ................................................................................................. iii LIST OF TABLES ............................................................................................................. vi LIST OF FIGURES ........................................................................................................... vii CHAPTER 1.INTRODUCTION ........................................................................................ 1 1.1 Cancer Biology ....................................................................................................... 1 1.1.1 Cancer Genome ............................................................................................... 1 1.1.2 Tumor heterogeneity ......................................................................................
Recommended publications
  • Transcriptome Analyses of Rhesus Monkey Pre-Implantation Embryos Reveal A
    Downloaded from genome.cshlp.org on September 23, 2021 - Published by Cold Spring Harbor Laboratory Press Transcriptome analyses of rhesus monkey pre-implantation embryos reveal a reduced capacity for DNA double strand break (DSB) repair in primate oocytes and early embryos Xinyi Wang 1,3,4,5*, Denghui Liu 2,4*, Dajian He 1,3,4,5, Shengbao Suo 2,4, Xian Xia 2,4, Xiechao He1,3,6, Jing-Dong J. Han2#, Ping Zheng1,3,6# Running title: reduced DNA DSB repair in monkey early embryos Affiliations: 1 State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China 2 Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Chinese Academy of Sciences-Max Planck Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China 3 Yunnan Key Laboratory of Animal Reproduction, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China 4 University of Chinese Academy of Sciences, Beijing, China 5 Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan 650204, China 6 Primate Research Center, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China * Xinyi Wang and Denghui Liu contributed equally to this work 1 Downloaded from genome.cshlp.org on September 23, 2021 - Published by Cold Spring Harbor Laboratory Press # Correspondence: Jing-Dong J. Han, Email: [email protected]; Ping Zheng, Email: [email protected] Key words: rhesus monkey, pre-implantation embryo, DNA damage 2 Downloaded from genome.cshlp.org on September 23, 2021 - Published by Cold Spring Harbor Laboratory Press ABSTRACT Pre-implantation embryogenesis encompasses several critical events including genome reprogramming, zygotic genome activation (ZGA) and cell fate commitment.
    [Show full text]
  • Filamins but Not Janus Kinases Are Substrates of the Asb2a Cullin-Ring E3 Ubiquitin Ligase in Hematopoietic Cells
    Filamins but Not Janus Kinases Are Substrates of the ASB2a Cullin-Ring E3 Ubiquitin Ligase in Hematopoietic Cells Isabelle Lamsoul1,2, Monique Erard1,2, Peter F. M. van der Ven3, Pierre G. Lutz1,2* 1 CNRS, IPBS (Institut de Pharmacologie et de Biologie Structurale), 205 route de Narbonne BP64182, F-31077 Toulouse, France, 2 Universite´ de Toulouse, UPS, IPBS, F- 31077 Toulouse, France, 3 Department of Molecular Cell Biology, Institute of Cell Biology, University of Bonn, Bonn, Germany Abstract The ASB2a protein is the specificity subunit of an E3 ubiquitin ligase complex involved in hematopoietic differentiation and is proposed to exert its effects by regulating the turnover of specific proteins. Three ASB2a substrates have been described so far: the actin-binding protein filamins, the Mixed Lineage Leukemia protein, and the Janus kinases 2 and 3. To determine the degradation of which substrate drives ASB2a biological effects is crucial for the understanding of ASB2a functions in hematopoiesis. Here, we show that neither endogenous nor exogenously expressed ASB2a induces degradation of JAK proteins in hematopoietic cells. Furthermore, we performed molecular modeling to generate the first structural model of an E3 ubiquitin ligase complex of an ASB protein bound to one of its substrates. Citation: Lamsoul I, Erard M, van der Ven PFM, Lutz PG (2012) Filamins but Not Janus Kinases Are Substrates of the ASB2a Cullin-Ring E3 Ubiquitin Ligase in Hematopoietic Cells. PLoS ONE 7(8): e43798. doi:10.1371/journal.pone.0043798 Editor: Zhengqi Wang, Emory University, United States of America Received March 21, 2012; Accepted July 26, 2012; Published August 20, 2012 Copyright: ß 2012 Lamsoul et al.
    [Show full text]
  • Cullin 5 Regulates Cortical Layering by Modulating the Speed and Duration of Dab1-Dependent Neuronal Migration
    5668 • The Journal of Neuroscience, April 21, 2010 • 30(16):5668–5676 Cellular/Molecular Cullin 5 Regulates Cortical Layering by Modulating the Speed and Duration of Dab1-Dependent Neuronal Migration Sergi Simó, Yves Jossin, and Jonathan A. Cooper Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109 Themultilayeredmammalianneocortexdevelopsbythecoordinatedimmigrationanddifferentiationofcellsthatareproducedatdistant sites. Correct layering requires an extracellular protein, Reelin (Reln), an intracellular signaling molecule, Disabled-1 (Dab1), and an E3 ubiquitinligase,Cullin-5(Cul5).RelnactivatesDab1,whichisthendegradedbyCul5.HerewetestwhetherCul5regulatesneuronlayering by affecting Dab1 stability or other mechanisms. We find that a stabilized mutant Dab1, which resists Cul5-dependent degradation, causes a similar phenotype to Cul5 deficiency. Moreover, Cul5 has no effect when Dab1 is absent. The effects of Cul5 and Dab1 are cell autonomous, and Cul5 regulates movement of early as well as late cortical neurons. Removing Cul5 increases the speed at which neurons migrate through the cortical plate by reducing the time spent stationary and increasing the speed of individual steps. These results show thatCul5regulatesneuronlayeringbystimulatingDab1degradationandthatCul5controlsmigrationspeedandstoppingpoint,andthey demonstrate the importance of negative feedback in signaling during cortical development. Introduction (Pinto-Lord et al., 1982; Dulabon et al., 2000; Sanada et al., 2004). The mammalian
    [Show full text]
  • Temporal Proteomic Analysis of HIV Infection Reveals Remodelling of The
    1 1 Temporal proteomic analysis of HIV infection reveals 2 remodelling of the host phosphoproteome 3 by lentiviral Vif variants 4 5 Edward JD Greenwood 1,2,*, Nicholas J Matheson1,2,*, Kim Wals1, Dick JH van den Boomen1, 6 Robin Antrobus1, James C Williamson1, Paul J Lehner1,* 7 1. Cambridge Institute for Medical Research, Department of Medicine, University of 8 Cambridge, Cambridge, CB2 0XY, UK. 9 2. These authors contributed equally to this work. 10 *Correspondence: [email protected]; [email protected]; [email protected] 11 12 Abstract 13 Viruses manipulate host factors to enhance their replication and evade cellular restriction. 14 We used multiplex tandem mass tag (TMT)-based whole cell proteomics to perform a 15 comprehensive time course analysis of >6,500 viral and cellular proteins during HIV 16 infection. To enable specific functional predictions, we categorized cellular proteins regulated 17 by HIV according to their patterns of temporal expression. We focussed on proteins depleted 18 with similar kinetics to APOBEC3C, and found the viral accessory protein Vif to be 19 necessary and sufficient for CUL5-dependent proteasomal degradation of all members of the 20 B56 family of regulatory subunits of the key cellular phosphatase PP2A (PPP2R5A-E). 21 Quantitative phosphoproteomic analysis of HIV-infected cells confirmed Vif-dependent 22 hyperphosphorylation of >200 cellular proteins, particularly substrates of the aurora kinases. 23 The ability of Vif to target PPP2R5 subunits is found in primate and non-primate lentiviral 2 24 lineages, and remodeling of the cellular phosphoproteome is therefore a second ancient and 25 conserved Vif function.
    [Show full text]
  • E3 Ubiquitin Ligase Cullin-5 Modulates Multiple Molecular and Cellular Responses to Heat Shock Protein 90 Inhibition in Human Cancer Cells
    E3 ubiquitin ligase Cullin-5 modulates multiple molecular and cellular responses to heat shock protein 90 inhibition in human cancer cells Rahul S. Samant, Paul A. Clarke, and Paul Workman1 Cancer Research UK Cancer Therapeutics Unit, The Institute of Cancer Research, London SM2 5NG, UK Edited by Melanie H. Cobb, University of Texas Southwestern Medical Center, Dallas, TX, and approved April 3, 2014 (received for review December 24, 2013) The molecular chaperone heat shock protein 90 (HSP90) is required Given the link between CUL5 and the HSP90 inhibitor- for the activity and stability of its client proteins. Pharmacologic induced degradation of ERBB2 (12), we have investigated the inhibition of HSP90 leads to the ubiquitin-mediated degradation of role of Cullin-RING ligases with respect to HSP90’s protein kinase clients, particularly activated or mutant oncogenic protein kinases. clients in human cancer cell lines. Our initial focused siRNA Client ubiquitination occurs via the action of one or more E3 screen of 28 Cullin-RING ligase family members identified five ubiquitin ligases. We sought to identify the role of Cullin-RING fam- genes, including CUL5, that were required for ERBB2 degra- ily E3 ubiquitin ligases in the cellular response to HSP90 inhibition. dation following treatment with 17-AAG—which we use here as Through a focused siRNA screen of 28 Cullin-RING ligase family a representative HSP90 inhibitor and chemical tool to promote members, we found that CUL5 and RBX2 were required for degra- client protein turnover. We go on to show for the first time to our dation of several HSP90 clients upon treatment of human cancer knowledge that RNAi silencing of CUL5 reduces the 17-AAG– cells with the clinical HSP90 inhibitor 17-AAG.
    [Show full text]
  • "The Genecards Suite: from Gene Data Mining to Disease Genome Sequence Analyses". In: Current Protocols in Bioinformat
    The GeneCards Suite: From Gene Data UNIT 1.30 Mining to Disease Genome Sequence Analyses Gil Stelzer,1,5 Naomi Rosen,1,5 Inbar Plaschkes,1,2 Shahar Zimmerman,1 Michal Twik,1 Simon Fishilevich,1 Tsippi Iny Stein,1 Ron Nudel,1 Iris Lieder,2 Yaron Mazor,2 Sergey Kaplan,2 Dvir Dahary,2,4 David Warshawsky,3 Yaron Guan-Golan,3 Asher Kohn,3 Noa Rappaport,1 Marilyn Safran,1 and Doron Lancet1,6 1Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel 2LifeMap Sciences Ltd., Tel Aviv, Israel 3LifeMap Sciences Inc., Marshfield, Massachusetts 4Toldot Genetics Ltd., Hod Hasharon, Israel 5These authors contributed equally to the paper 6Corresponding author GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It au- tomatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. Var- Elect’s capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses.
    [Show full text]
  • CRL5-Dependent Regulation of the Small Gtpases ARL4C and ARF6 Controls Hippocampal Morphogenesis
    CRL5-dependent regulation of the small GTPases ARL4C and ARF6 controls hippocampal morphogenesis Jisoo S. Hana,1, Keiko Hinoa,1, Wenzhe Lia, Raenier V. Reyesa, Cesar P. Canalesa,2, Adam M. Miltnera, Yasmin Haddadia, Junqing Sunb, Chao-Yin Chenb, Anna La Torrea, and Sergi Simóa,3 aDepartment of Cell Biology and Human Anatomy, University of California, Davis, CA 95616; and bDepartment of Pharmacology, University of California, Davis, CA 95616 Edited by Mary E. Hatten, Rockefeller University, New York, NY, and approved August 3, 2020 (received for review February 14, 2020) The small GTPase ARL4C participates in the regulation of cell migra- Importantly, CRL5 substrate adaptors are dynamically regulated tion, cytoskeletal rearrangements, and vesicular trafficking in epithe- during development, directing CRL5 activity to specific substrates lial cells. The ARL4C signaling cascade starts by the recruitment of the at different times and areas in the CNS (19). In the neocortex, ARF–GEF cytohesins to the plasma membrane, which, in turn, bind CRL5 opposes projection neuron migration by down-regulating and activate the small GTPase ARF6. However, the role of ARL4C– the Reelin/DAB1 signaling pathway as migrating projection neu- cytohesin–ARF6 signaling during hippocampal development remains rons reach the top of the cortical plate (18, 20, 21). In the CNS, elusive. Here, we report that the E3 ubiquitin ligase Cullin 5/RBX2 Reelin/DAB1 signaling participates in several developmental (CRL5) controls the stability of ARL4C and its signaling effectors to processes, including neuron migration, dendritogenesis, axo- regulate hippocampal morphogenesis. Both RBX2 knockout and genesis, and synaptogenesis (22–24). Moreover, CRL5 also con- Cullin 5 knockdown cause hippocampal pyramidal neuron mislocali- trols levels of the tyrosine kinase FYN, which exerts important zation and development of multiple apical dendrites.
    [Show full text]
  • Transcriptional Elongation Factor Elongin a Regulates Retinoic Acid-Induced Gene Expression During Neuronal Differentiation
    Cell Reports Report Transcriptional Elongation Factor Elongin A Regulates Retinoic Acid-Induced Gene Expression during Neuronal Differentiation Takashi Yasukawa,1 Shachi Bhatt,3,4 Tamotsu Takeuchi,6 Junya Kawauchi,7 Hidehisa Takahashi,3,8 Aya Tsutsui,1 Takuya Muraoka,1 Makoto Inoue,7 Masayuki Tsuda,2 Shigetaka Kitajima,7 Ronald C. Conaway,3,5 Joan W. Conaway,3,5 Paul A. Trainor,3,4 and Teijiro Aso1,* 1Department of Functional Genomics 2Institute for Laboratory Animal Research Kochi Medical School, Kohasu, Oko-cho, Nankoku, Kochi 783-8505, Japan 3Stowers Institute for Medical Research, Kansas City, MO 64110, USA 4Department of Anatomy and Cell Biology 5Department of Biochemistry and Molecular Biology Kansas University Medical Center, Kansas City, KS 66160, USA 6Department of Immunopathology, Gifu University School of Medicine, 1-1 Yanagido, Gifu, Gifu 501-1194, Japan 7Department of Biochemical Genetics, Medical Research Institute, Tokyo Medical and Dental University, Yushima, Bunkyo-ku, Tokyo 113-8510, Japan 8Department of Biochemistry, Hokkaido University Graduate School of Medicine, Sapporo, Hokkaido 060-8638, Japan *Correspondence: [email protected] http://dx.doi.org/10.1016/j.celrep.2012.09.031 SUMMARY target for gene regulation by a diverse collection of elongation factors that promote efficient elongation of transcripts by pol II Elongin A increases the rate of RNA polymerase II in vitro (Saunders et al., 2006). Elongin is a member of a family (pol II) transcript elongation by suppressing transient of elongation factors that can increase the overall rate of RNA pausing by the enzyme. Elongin A also acts as chain elongation by decreasing the frequency and/or duration a component of a cullin-RING ligase that can target of transient pausing by pol II as it traverses the DNA template stalled pol II for ubiquitylation and proteasome- (Conaway et al., 2000; Shilatifard et al., 2003).
    [Show full text]
  • A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family
    Rockefeller University Digital Commons @ RU Student Theses and Dissertations 2018 A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family Linda Molla Follow this and additional works at: https://digitalcommons.rockefeller.edu/ student_theses_and_dissertations Part of the Life Sciences Commons A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY A Thesis Presented to the Faculty of The Rockefeller University in Partial Fulfillment of the Requirements for the degree of Doctor of Philosophy by Linda Molla June 2018 © Copyright by Linda Molla 2018 A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY Linda Molla, Ph.D. The Rockefeller University 2018 APOBEC2 is a member of the AID/APOBEC cytidine deaminase family of proteins. Unlike most of AID/APOBEC, however, APOBEC2’s function remains elusive. Previous research has implicated APOBEC2 in diverse organisms and cellular processes such as muscle biology (in Mus musculus), regeneration (in Danio rerio), and development (in Xenopus laevis). APOBEC2 has also been implicated in cancer. However the enzymatic activity, substrate or physiological target(s) of APOBEC2 are unknown. For this thesis, I have combined Next Generation Sequencing (NGS) techniques with state-of-the-art molecular biology to determine the physiological targets of APOBEC2. Using a cell culture muscle differentiation system, and RNA sequencing (RNA-Seq) by polyA capture, I demonstrated that unlike the AID/APOBEC family member APOBEC1, APOBEC2 is not an RNA editor. Using the same system combined with enhanced Reduced Representation Bisulfite Sequencing (eRRBS) analyses I showed that, unlike the AID/APOBEC family member AID, APOBEC2 does not act as a 5-methyl-C deaminase.
    [Show full text]
  • Cullin 5 Regulates Dab1 Protein Levels and Neuron Positioning During Cortical Development
    Downloaded from genesdev.cshlp.org on September 28, 2021 - Published by Cold Spring Harbor Laboratory Press Cullin 5 regulates Dab1 protein levels and neuron positioning during cortical development Libing Feng,1,3 Nathaniel S. Allen,1,2,3 Sergi Simo,1 and Jonathan A. Cooper1,2,4 1Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA; 2Molecular and Cellular Biology Program, Fred Hutchinson Cancer Research Center and University of Washington, Seattle, Washington 98109, USA Many laminated regions of the mammalian brain develop by the migration of neuronal precursor cells, whose final positions are coordinated by signals from the secreted molecule Reelin. Early events in Reelin signaling have been identified, but the mechanism of signal down-regulation has been unclear. A possible source of negative feedback is the Reelin-induced degradation of the critical intracellular signaling component, Disabled-1 (Dab1). Here we show that degradation of Dab1 depends on Dab1 phosphorylation at specific tyrosine residues and on the E3 ubiquitin ligase component Cullin 5 (Cul5). Cul5 forms complexes with SOCS (suppressors of cytokine signaling) proteins, which bind to phosphorylated Dab1 and target it for degradation in tissue culture cells. Ablation of Cul5 in migrating neurons causes an accumulation of active Dab1 protein and a unique cortical layering defect, characterized by excess migration and buildup of neurons at the top of the cortical plate. The results implicate Cul5 and SOCS proteins in down-regulation of Dab1 in vivo and show that Cul5 plays an essential role in regulating neuron migrations during cortical development, possibly by opposing a promigratory effect of Dab1.
    [Show full text]
  • Normal Human Cell Proteins That Interact with the Adenovirus Type 5 E1B 55 Kda Protein
    HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Virology Manuscript Author . Author manuscript; Manuscript Author available in PMC 2018 April 01. Published in final edited form as: Virology. 2017 April ; 504: 12–24. doi:10.1016/j.virol.2017.01.013. Normal human cell proteins that interact with the adenovirus type 5 E1B 55 kDa protein George Hung1 and S.J. Flint* Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA Abstract Several of the functions of the human adenovirus type 5 E1B 55 kDa protein are fulfilled via the virus-specific E3 ubiquitin ligase it forms with the viral E4 Orf6 protein and several cellular proteins. Important substrates of this enzyme have not been identified, and other functions, including repression of transcription of interferon-sensitive genes, do not require the ligase. We therefore used immunoaffinity purification and liquid chromatography-mass spectrometry of lysates of normal human cells infected in parallel with HAdV-C5 and E1B 55 kDa protein-null mutant viruses to identify specifically E1B 55 kDa-associated proteins. The resulting set of > 90 E1B-associated proteins contained the great majority identified previously, and was enriched for those associated with the ubiquitin-proteasome system, RNA metabolism and the cell cycle. We also report very severe inhibition of viral genome replication when cells were exposed to both specific or non-specific siRNAs and interferon prior to infection. Keywords Human adenovirus type 5 (HAdV-C5); E1B 55 kDa protein; Normal human cells; Mass spectrometry; Type I interferons; Ubiquitin-proteasome system; RNA metabolism 1. Introduction Successful reproduction of species C human adenoviruses such as HAdV-C5 depends on extensive cooperation among viral and host cell components to achieve efficient expression of viral genes, replication of the viral genome and assembly of progeny virus particles in infected nuclei (Berk, 2013).
    [Show full text]
  • Regulation of Hsp90 Client Proteins by a Cullin5-RING E3 Ubiquitin Ligase
    Regulation of Hsp90 client proteins by a Cullin5-RING E3 ubiquitin ligase Elana S. Ehrlicha, Tao Wanga, Kun Luoa, Zuoxiang Xiaoa, Anna Maria Niewiadomskaa, Tara Martineza, Wanping Xub, Len Neckersb, and Xiao-Fang Yua,1 aDepartment of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205; and bUrologic Oncology Branch, National Cancer Institute, Bethesda, MD 20892 Edited by William J. Muller, McGill University, Montreal, Canada, and accepted by the Editorial Board September 23, 2009 (received for review October 22, 2008) We report a link between Cullin5 (Cul5) E3 ubiquitin ligase and the Here we demonstrate that Cul5 regulates Hsp90 clients. Our heat shock protein 90 (Hsp90) chaperone complex. Hsp90 partici- results indicate that Cul5 interacts with the Hsp90 chaperone pates in the folding of its client proteins into their functional complex and the Hsp90 client ErbB2 and demonstrate that Cul5 conformation. Many Hsp90 clients have been reported to be is recruited to the site of ErbB2 on the plasma membrane, aberrantly expressed in a number of cancers. We demonstrate Cul5 thereby inducing its polyubiquitination and proteasome- interaction with members of the Hsp90 chaperone complex as well mediated degradation. Other Hsp90 client proteins, such as as the Hsp90 client, ErbB2. We observed recruitment of Cul5 to the HIF1-␣, were also regulated by Cul5. We observed Cul5- site of ErbB2 at the plasma membrane and subsequent induction of mediated degradation of ErbB2 to occur in the absence of the polyubiquitination and proteasomal degradation. We also dem- traditional Cul5 adaptors ElonginB and ElonginC, suggesting onstrate Cul5 involvement in regulation of another Hsp90 client, that a component of the Hsp90 chaperone complex may serve Hif-1␣.
    [Show full text]