GUO-DISSERTATION-2017.Pdf (3.383Mb)

Total Page:16

File Type:pdf, Size:1020Kb

GUO-DISSERTATION-2017.Pdf (3.383Mb) APPLICATIONS OF HIGH-THROUGHPUT SEQUENCING DATA ANALYSIS IN TRANSCRIPTIONAL STUDIES A Dissertation by ZHENGYU GUO Submitted to the Office of Graduate and Professional Studies of Texas A&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Chair of Committee, Aniruddha Datta Committee Members, Ulisses Braga-Neto Xiaoning Qian Alan Dabney Head of Department, Miroslav M. Begovic December 2017 Major Subject: Electrical Engineering Copyright 2017 Zhengyu Guo ABSTRACT High-throughput sequencing has become one of the most powerful tools for studies in genomics, transcriptomics, epigenomics, and metagenomics. In recent years, HTS pro- tocols for enhancing the understanding of the diverse cellular roles of RNA have been designed, such as RNA-Seq, CLIP-Seq, and RIP-Seq. In this work, we explore the appli- cations of HTS data analysis in transcriptional studies. First, the differential expression analysis of RNA-Seq data is discussed and applied to a sheep RNA-Seq dataset to exam- ine the biological mechanisms of the sheep resistance to worm infection. We develop an automatic pipeline to analyze the RNA-Seq dataset, and use a negative binomial model for gene expression analysis. Functional analysis is conducted over the differentially ex- pressed genes, and a broad range of mechanisms providing protection against the parasite are identified in the resistant sheep breed. This study provides insights into the underlying biology of sheep host resistance. Then, a deep learning method is proposed to predict the RNA binding protein binding preferences using CLIP-Seq data. The proposed method uses a deep convolutional autoencoder to effectively learn the robust sequence features, and a softmax classifier to predict the RBP binding sites. To demonstrate the efficacy of the proposed method, we evaluate its performance over a dataset containing 31 CLIP-Seq experiments. This benchmarking shows that the proposed method improves the prediction performance in terms of AUC, compared with the existing methods. The analysis also shows that the proposed method is able to provide insights to identify new RBP binding motifs. Therefore, the proposed method will be of great help in understanding the dynamic regulations of RBPs in various biological processes and diseases. Finally, a database is cre- ated to facilitate the reuse of the public available mouse RNA-Seq dataset. The metadata of the publicly available mouse RNA-Seq datasets is manually curated and is served by a ii well-designed website. The database can be scaled up in the future to serve more types of HTS data. iii DEDICATION To my family. iv ACKNOWLEDGMENTS I would like to express my gratitude to Dr. Aniruddha Datta for his kind help on this work and my stay at Texas A&M University. I thank Dr. Ulisses Braga-Neto, Dr. Xiaoning Qian, and Dr. Alan Dabney, for serving on my committee and for their support. I thank my fellow students and friends, Yue Gan, Yukun Tan, and Xingde Jiang for their friendships, memories, and supports. Most importantly, I would like to thank my family for their encouragement, love and help. v CONTRIBUTORS AND FUNDING SOURCES Contributors This work was supervised by a dissertation committee consisting of Dr. Aniruddha Datta, Dr. Ulisses Braga-Neto and Dr. Xiaoning Qian of the Department of Electrical and Computer Engineering and Dr. Alan Dabney of the Department of Statistics. The content of Chapter 2 was completed by the student in collaboration with Dr. Robert Li. Jorge Francisco González and Dr. Robert Li conceived and designed the ex- periment. Biological experiments were performed by Julia Hernandez, Tom McNeilly, Yolanda Corripio-Miyar, David Frew, Jorge Francisco González and Dr. Robert Li. The manuscript partly forms Chapter 2 was written by Dr. Li. In Chapter 4, the curation was conducted in part by Boriana Tzvetkova, Jennifer M. Bassik, Tara Bodziak and Brianna Wojnar from Dr. Bo Hua Hu’s team. The website was built in part by Wei Qiao and Md Obaida. Dr. Peng Yu supervised the work of Chapter 2 and Chapter 4. All other work conducted for the dissertation was completed by the student indepen- dently. Funding Sources The student was supported by TEES-AgriLife Center for Bioinformatics and Genomic Systems Engineering (CBGSE) at Texas A&M University. This work was made possible in part by startup funding from the ECE department and Texas A&M Engineering Experiment Station/Dwight Look College of Engineering at Texas A&M University, NIDCD R01DC010154, AGL2009/09985 and Fondo Social Europeo (FSE). vi NOMENCLATURE AUC Area Under the receiver operating characteristic Curve CHB Canaria Hair Breed CLIP-Seq High-throughput sequencing of RNA isolated by crosslinking immunoprecipitation CAE Convolutional Autoencoder CNN Convolutional Neural Network CS Canaria Sheep DEG Differentially Expressed Gene dpi Days post infection FDR False Discovery Rate GO Gene Ontology HTS High Throughput Sequencing NB Negative Binomial PWM Positional Weight Matrix RBP RNA Binding Protein ReLU Rectified Linear Unit RNA-Seq RNA Sequencing vii TABLE OF CONTENTS Page ABSTRACT ............................................................................... ii DEDICATION ............................................................................. iv ACKNOWLEDGMENTS ................................................................ v CONTRIBUTORS AND FUNDING SOURCES ....................................... vi NOMENCLATURE ....................................................................... vii TABLE OF CONTENTS ................................................................. viii LIST OF FIGURES ....................................................................... x LIST OF TABLES......................................................................... xii 1. INTRODUCTION .................................................................... 1 1.1 DNA, RNA and Protein ......................................................... 1 1.2 The Central Dogma of Molecular Biology..................................... 4 1.2.1 DNA Replication ....................................................... 4 1.2.2 Transcription............................................................ 5 1.2.3 Translation .............................................................. 6 1.3 High-Throughput Sequencing .................................................. 6 1.4 High-Throughput Sequencing for Transcriptomics............................ 9 1.5 Dissertation Outline ............................................................. 11 2. COMPUTATIONAL DIFFERENTIAL EXPRESSION ANALYSIS FOR UN- DERSTANDING THE MOLECULAR BASIS OF SHEEP HOST RESIS- TANCE ................................................................................ 13 2.1 Introduction...................................................................... 13 2.1.1 Sheep Host Resistance ................................................. 13 2.1.2 Comparative Analysis of Indigenous Sheep from Canary Islands ... 14 2.2 Materials and Methods.......................................................... 15 2.2.1 RNA Extraction and Sequencing ...................................... 15 viii 2.2.2 Computational Analysis of Sheep Host Resistance RNA-Seq Data ..................................................................... 17 2.2.3 Data Preprocessing and Alignment .................................... 17 2.2.4 Differential Expression Analysis ...................................... 18 2.2.5 Gene Ontology Analysis ............................................... 21 2.3 Results ........................................................................... 23 2.3.1 Haemonchus Infection Induced Distinctly Different Transcrip- tome Patterns in the Abomasal Mucosa of CHB and CS Breeds..... 23 2.3.2 Gene Ontology Implicated in Host Resistance........................ 32 2.4 Discussion ....................................................................... 36 3. A CONVOLUTIONAL AUTOENCODER BASED METHOD FOR PRE- DICTING RNA PROTEIN INTERACTIONS ...................................... 43 3.1 Introduction...................................................................... 43 3.2 Materials and Methods ......................................................... 46 3.2.1 Dataset .................................................................. 46 3.2.2 RNA Sequence Encoding ............................................. 46 3.2.3 Convolutional Layers................................................... 47 3.2.4 Model Architecture ..................................................... 49 3.2.5 Model Implementation ................................................. 52 3.3 Results .......................................................................... 52 3.3.1 Model Evaluation....................................................... 52 3.3.2 Insight in Motif Discovery ............................................. 54 3.4 Conclusion ...................................................................... 58 4. CREATION OF A DATABASE AND WEB-SERVER FOR METADATA OF PUBLICLY AVAILABLE MOUSE RNA-SEQ DATA SETS....................... 60 4.1 Introduction...................................................................... 60 4.2 Methods.......................................................................... 62 4.3 Results ........................................................................... 64 4.4 Discussion ....................................................................... 66 5. CONCLUSIONS ..................................................................... 67 REFERENCES
Recommended publications
  • Bioinformatics Identi Cation of Prognostic Factors Associated With
    Bioinformatics Identication of Prognostic Factors Associated with Breast Cancer Ying Wei Sichuan University https://orcid.org/0000-0001-8178-4705 Shipeng Zhang College of Pharmacy, North Sichuan Medical College Li Xiao West China School of Basic Medical Sciences and Forensic Medicine Jing Zou West China School of Basic Medical Sciences and Forensic Medicine Yingqing Fu West China School of Basic Medical Sciences and Forensic Medicine Yi Ye West China School of Basic Medical Sciences and Forensic Medicine Linchuan Liao ( [email protected] ) West China School of Basic Medical Sciences and Forensic Medicine https://orcid.org/0000-0003-3700-8471 Research Keywords: Breast cancer, Differentially expressed genes, miRNAs, Transcription factors, Bioinformatic analysis Posted Date: December 2nd, 2020 DOI: https://doi.org/10.21203/rs.3.rs-117477/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 1/23 Abstract Background: Breast cancer (BRCA) remains one of the most common forms of cancer and is the most prominent driver of cancer-related death among women. The mechanistic basis for BRCA, however, remains incompletely understood. In particular, the relationships between driver mutations and signaling pathways in BRCA are poorly characterized, making it dicult to identify reliable clinical biomarkers that can be employed in diagnostic, therapeutic, or prognostic contexts. Methods: First, we downloaded publically available BRCA datasets (GSE45827, GSE42568, and GSE61304) from the Gene Expression Omnibus (GEO) database. We then compared gene expression proles between tumor and control tissues in these datasets using Venn diagrams and the GEO2R analytical tool. We further explore the functional relevance of BRCA-associated differentially expressed genes (DEGs) via functional and pathway enrichment analyses using the DAVID tool, and we then constructed a protein-protein interaction network incorporating DEGs of interest using the Search Tool for the Retrieval of Interacting Genes (STRING) database.
    [Show full text]
  • AURKA, DLGAP5, TPX2, KIF11 and CKAP5: Five Specific Mitosis-Associated Genes Correlate with Poor Prognosis for Non-Small Cell Lung Cancer Patients
    INTERNATIONAL JOURNAL OF ONCOLOGY 50: 365-372, 2017 AURKA, DLGAP5, TPX2, KIF11 and CKAP5: Five specific mitosis-associated genes correlate with poor prognosis for non-small cell lung cancer patients MARC A. SCHNEIDER1,8, PETROS CHristopoulos2,8, THOMAS MULEY1,8, ARNE WartH5,8, URSULA KLINGMUELLER6,8,9, MICHAEL THOMAS2,8, FELIX J.F. HertH3,8, HENDRIK DIENEMANN4,8, NIKOLA S. MUELLER7,9, FABIAN THEIS7,9 and MICHAEL MEISTER1,8,9 1Translational Research Unit, 2Department of Thoracic Oncology, 3Department of Pneumology and Critical Care Medicine, and 4Department of Surgery, Thoraxklinik at University Hospital Heidelberg, Heidelberg; 5Institute of Pathology, University of Heidelberg, Heidelberg; 6Systems Biology of Signal Transduction, German Cancer Research Center, Heidelberg; 7Cellular Dynamics and Cell Patterning, Max Planck Institute of Biochemistry, Martinsried; 8Translational Lung Research Center Heidelberg (TLRC-H), Member of the German Center for Lung Research (DZL) Heidelberg; 9CancerSys network ‘LungSysII’, Heidelberg, Germany Received October 26, 2016; Accepted December 5, 2016 DOI: 10.3892/ijo.2017.3834 Abstract. The growth of a tumor depends to a certain extent genes AURKA, DLGAP5, TPX2, KIF11 and CKAP5 is asso- on an increase in mitotic events. Key steps during mitosis are ciated with the prognosis of NSCLC patients. the regulated assembly of the spindle apparatus and the sepa- ration of the sister chromatids. The microtubule-associated Introduction protein Aurora kinase A phosphorylates DLGAP5 in order to correctly segregate the chromatids. Its activity and recruitment Lung cancer is globally the leading cause of cancer-related to the spindle apparatus is regulated by TPX2. KIF11 and deaths (1). Non-small cell lung cancer (NSCLC), which accounts CKAP5 control the correct arrangement of the microtubules for more than 80% of all cases, is divided in adenocarcinoma and prevent their degradation.
    [Show full text]
  • AURKA Mrna Expression Is an Independent Predictor of Poor Prognosis in Patients with Non-Small Cell Lung Cancer
    ONCOLOGY LETTERS 13: 4463-4468, 2017 AURKA mRNA expression is an independent predictor of poor prognosis in patients with non-small cell lung cancer AHMED S.K. AL‑KHAFAJI1,2, MICHAEL W. MARCUS1, MICHAEL P.A. DAVIES1, JANET M. RISK1, RICHARD J. SHAW1, JOHN K. FIELD1 and TRIANTAFILLOS LILOGLOU1 1Department of Molecular and Clinical Cancer Medicine, Institute of Translational Medicine, University of Liverpool, Liverpool L7 8TX, UK; 2Department of Biology, College of Science, University of Baghdad, Al‑Jadriya, Baghdad 10070, Iraq Received September 12, 2016; Accepted January 17, 2017 DOI: 10.3892/ol.2017.6012 Abstract. Deregulation of mitotic spindle genes has been Introduction reported to contribute to the development and progression of malignant tumours. The aim of the present study was to Lung cancer is the most common cause of cancer-associated explore the association between the expression profiles of mortality in the UK for both males and females (1), and Aurora kinases (AURKA, AURKB and AURKC), cytoskel- >1/5 patients with cancer succumb to this malignancy world- eton-associated protein 5 (CKAP5), discs large-associated wide (2). Non-small cell lung carcinoma (NSCLC) accounts protein 5 (DLGAP5), kinesin-like protein 11 (KIF11), micro- for 80-85% of all cases of lung cancer, and develops through tubule nucleation factor (TPX2), monopolar spindle 1 kinase the accumulation of molecular alterations, which may serve as (TTK), and β-tubulins (TUBB) and (TUBB3) genes and clini- prognostic biomarkers for NSCLC outcome (3). copathological characteristics in human non-small cell lung Mitotic spindle formation and the spindle checkpoint are carcinoma (NSCLC). Reverse transcription-quantitative poly- critical for the maintenance of cell division and chromosome merase chain reaction‑based RNA gene expression profiles of segregation (4).
    [Show full text]
  • Literature Mining Sustains and Enhances Knowledge Discovery from Omic Studies
    LITERATURE MINING SUSTAINS AND ENHANCES KNOWLEDGE DISCOVERY FROM OMIC STUDIES by Rick Matthew Jordan B.S. Biology, University of Pittsburgh, 1996 M.S. Molecular Biology/Biotechnology, East Carolina University, 2001 M.S. Biomedical Informatics, University of Pittsburgh, 2005 Submitted to the Graduate Faculty of School of Medicine in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2016 UNIVERSITY OF PITTSBURGH SCHOOL OF MEDICINE This dissertation was presented by Rick Matthew Jordan It was defended on December 2, 2015 and approved by Shyam Visweswaran, M.D., Ph.D., Associate Professor Rebecca Jacobson, M.D., M.S., Professor Songjian Lu, Ph.D., Assistant Professor Dissertation Advisor: Vanathi Gopalakrishnan, Ph.D., Associate Professor ii Copyright © by Rick Matthew Jordan 2016 iii LITERATURE MINING SUSTAINS AND ENHANCES KNOWLEDGE DISCOVERY FROM OMIC STUDIES Rick Matthew Jordan, M.S. University of Pittsburgh, 2016 Genomic, proteomic and other experimentally generated data from studies of biological systems aiming to discover disease biomarkers are currently analyzed without sufficient supporting evidence from the literature due to complexities associated with automated processing. Extracting prior knowledge about markers associated with biological sample types and disease states from the literature is tedious, and little research has been performed to understand how to use this knowledge to inform the generation of classification models from ‘omic’ data. Using pathway analysis methods to better understand the underlying biology of complex diseases such as breast and lung cancers is state-of-the-art. However, the problem of how to combine literature- mining evidence with pathway analysis evidence is an open problem in biomedical informatics research.
    [Show full text]
  • Unbiased Data Mining Identifies Cell Cycle Transcripts That Predict Non-Indolent Gleason Score 7 Prostate Cancer Wendy L
    Johnston et al. BMC Urology (2019) 19:4 https://doi.org/10.1186/s12894-018-0433-5 RESEARCHARTICLE Open Access Unbiased data mining identifies cell cycle transcripts that predict non-indolent Gleason score 7 prostate cancer Wendy L. Johnston1* , Charles N. Catton1,2 and Carol J. Swallow3,4,5,6 Abstract Background: Patients with newly diagnosed non-metastatic prostate adenocarcinoma are typically classified as at low, intermediate, or high risk of disease progression using blood prostate-specific antigen concentration, tumour T category, and tumour pathological Gleason score. Classification is used to both predict clinical outcome and to inform initial management. However, significant heterogeneity is observed in outcome, particularly within the intermediate risk group, and there is an urgent need for additional markers to more accurately hone risk prediction. Recently developed web-based visualization and analysis tools have facilitated rapid interrogation of large transcriptome datasets, and querying broadly across multiple large datasets should identify predictors that are widely applicable. Methods: We used camcAPP, cBioPortal, CRN, and NIH NCI GDC Data Portal to data mine publicly available large prostate cancer datasets. A test set of biomarkers was developed by identifying transcripts that had: 1) altered abundance in prostate cancer, 2) altered expression in patients with Gleason score 7 tumours and biochemical recurrence, 3) correlation of expression with time until biochemical recurrence across three datasets (Cambridge, Stockholm, MSKCC). Transcripts that met these criteria were then examined in a validation dataset (TCGA-PRAD) using univariate and multivariable models to predict biochemical recurrence in patients with Gleason score 7 tumours. Results: Twenty transcripts met the test criteria, and 12 were validated in TCGA-PRAD Gleason score 7 patients.
    [Show full text]
  • A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family
    Rockefeller University Digital Commons @ RU Student Theses and Dissertations 2018 A High-Throughput Approach to Uncover Novel Roles of APOBEC2, a Functional Orphan of the AID/APOBEC Family Linda Molla Follow this and additional works at: https://digitalcommons.rockefeller.edu/ student_theses_and_dissertations Part of the Life Sciences Commons A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY A Thesis Presented to the Faculty of The Rockefeller University in Partial Fulfillment of the Requirements for the degree of Doctor of Philosophy by Linda Molla June 2018 © Copyright by Linda Molla 2018 A HIGH-THROUGHPUT APPROACH TO UNCOVER NOVEL ROLES OF APOBEC2, A FUNCTIONAL ORPHAN OF THE AID/APOBEC FAMILY Linda Molla, Ph.D. The Rockefeller University 2018 APOBEC2 is a member of the AID/APOBEC cytidine deaminase family of proteins. Unlike most of AID/APOBEC, however, APOBEC2’s function remains elusive. Previous research has implicated APOBEC2 in diverse organisms and cellular processes such as muscle biology (in Mus musculus), regeneration (in Danio rerio), and development (in Xenopus laevis). APOBEC2 has also been implicated in cancer. However the enzymatic activity, substrate or physiological target(s) of APOBEC2 are unknown. For this thesis, I have combined Next Generation Sequencing (NGS) techniques with state-of-the-art molecular biology to determine the physiological targets of APOBEC2. Using a cell culture muscle differentiation system, and RNA sequencing (RNA-Seq) by polyA capture, I demonstrated that unlike the AID/APOBEC family member APOBEC1, APOBEC2 is not an RNA editor. Using the same system combined with enhanced Reduced Representation Bisulfite Sequencing (eRRBS) analyses I showed that, unlike the AID/APOBEC family member AID, APOBEC2 does not act as a 5-methyl-C deaminase.
    [Show full text]
  • Defining NOTCH3 Target Genes in Ovarian Cancer
    Published OnlineFirst March 6, 2012; DOI: 10.1158/0008-5472.CAN-11-2181 Cancer Molecular and Cellular Pathobiology Research Defining NOTCH3 Target Genes in Ovarian Cancer Xu Chen1, Michelle M. Thiaville1, Li Chen1,4, Alexander Stoeck1, Jianhua Xuan4, Min Gao1, Ie-Ming Shih1,2,3, and Tian-Li Wang2,3 Abstract NOTCH3 gene amplification plays an important role in the progression of many ovarian and breast cancers, but the targets of NOTCH3 signaling are unclear. Here, we report the use of an integrated systems biology approach to identify direct target genes for NOTCH3. Transcriptome analysis showed that suppression of NOTCH signaling in ovarian and breast cancer cells led to downregulation of genes in pathways involved in cell-cycle regulation and nucleotide metabolism. Chromatin immunoprecipitation (ChIP)-on-chip analysis defined promoter target sequences, including a new CSL binding motif (N1) in addition to the canonical CSL binding motif, that were occupied by the NOTCH3/CSL transcription complex. Integration of transcriptome and ChIP-on-chip data showed that the ChIP target genes overlapped significantly with the NOTCH-regulated transcriptome in ovarian cancer cells. From the set of genes identified, we showed that the mitotic apparatus organizing protein DLGAP5 (HURP/DLG7) was a critical target. Both the N1 motif and the canonical CSL binding motif were essential to activate DLGAP5 transcription. DLGAP5 silencing in cancer cells suppressed tumorigenicity and inhibited cellular proliferation by arresting the cell cycle at the G2–M phase. In contrast, enforced expression of DLGAP5 partially counteracted the growth inhibitory effects of a pharmacologic or RNA interference–mediated NOTCH inhibition in cancer cells.
    [Show full text]
  • Genomic Analysis of Allele-Specific Expression in the Mouse Liver
    bioRxiv preprint doi: https://doi.org/10.1101/024588; this version posted August 13, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 TITLE: Genomic analysis of allele-specific expression in the mouse liver Ashutosh K. Pandey* and Robert W. Williams* *Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, 38103 bioRxiv preprint doi: https://doi.org/10.1101/024588; this version posted August 13, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 2 RUNNING TITLE: Allele-specific expression in liver KEYWORDS: BXD, DBA/2J, haplotype-aware alignment, purifying selection, cis eQTL CORRESPONDING AUTHOR: Ashutosh K. Pandey 855 Monroe Avenue, Suite 512 Memphis, TN, 38163 Phone: 901-448-1761 Email addresses: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/024588; this version posted August 13, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 3 ABSTRACT Genetic differences in gene expression contribute significantly to phenotypic diversity and differences in disease susceptibility. In fact, the great majority of causal variants highlighted by genome-wide association are in non-coding regions that modulate expression. In order to quantify the extent of allelic differences in expression, we analyzed liver transcriptomes of isogenic F1 hybrid mice. Allele-specific expression (ASE) effects are pervasive and are detected in over 50% of assayed genes.
    [Show full text]
  • The Greatwall Kinase Safeguards the Genome Integrity by Affecting the Kinome Activity in Mitosis
    Oncogene (2020) 39:6816–6840 https://doi.org/10.1038/s41388-020-01470-1 ARTICLE The Greatwall kinase safeguards the genome integrity by affecting the kinome activity in mitosis 1,9 1 2 1,10 1,3 1 Xavier Bisteau ● Joann Lee ● Vinayaka Srinivas ● Joanna H. S. Lee ● Joanna Niska-Blakie ● Gifford Tan ● 1 1 4 5 1,6 7 Shannon Y. X. Yap ● Kevin W. Hom ● Cheng Kit Wong ● Jeongjun Chae ● Loo Chien Wang ● Jinho Kim ● 4 1,6 1,3,11 1,8 Giulia Rancati ● Radoslaw M. Sobota ● Chris S. H. Tan ● Philipp Kaldis Received: 17 June 2020 / Revised: 21 August 2020 / Accepted: 10 September 2020 / Published online: 25 September 2020 © The Author(s) 2020. This article is published with open access Abstract Progression through mitosis is balanced by the timely regulation of phosphorylation and dephosphorylation events ensuring the correct segregation of chromosomes before cytokinesis. This balance is regulated by the opposing actions of CDK1 and PP2A, as well as the Greatwall kinase/MASTL. MASTL is commonly overexpressed in cancer, which makes it a potential therapeutic anticancer target. Loss of Mastl induces multiple chromosomal errors that lead to the accumulation of micronuclei and multilobulated cells in mitosis. Our analyses revealed that loss of Mastl leads to chromosome breaks and abnormalities 1234567890();,: 1234567890();,: impairing correct segregation. Phospho-proteomic data for Mastl knockout cells revealed alterations in proteins implicated in multiple processes during mitosis including double-strand DNA damage repair. In silico prediction of the kinases with affected activity unveiled NEK2 to be regulated in the absence of Mastl. We uncovered that, RAD51AP1, involved in regulation of homologous recombination, is phosphorylated by NEK2 and CDK1 but also efficiently dephosphorylated by PP2A/B55.
    [Show full text]
  • Cell Cycle Arrest Through Indirect Transcriptional Repression by P53: I Have a DREAM
    Cell Death and Differentiation (2018) 25, 114–132 Official journal of the Cell Death Differentiation Association OPEN www.nature.com/cdd Review Cell cycle arrest through indirect transcriptional repression by p53: I have a DREAM Kurt Engeland1 Activation of the p53 tumor suppressor can lead to cell cycle arrest. The key mechanism of p53-mediated arrest is transcriptional downregulation of many cell cycle genes. In recent years it has become evident that p53-dependent repression is controlled by the p53–p21–DREAM–E2F/CHR pathway (p53–DREAM pathway). DREAM is a transcriptional repressor that binds to E2F or CHR promoter sites. Gene regulation and deregulation by DREAM shares many mechanistic characteristics with the retinoblastoma pRB tumor suppressor that acts through E2F elements. However, because of its binding to E2F and CHR elements, DREAM regulates a larger set of target genes leading to regulatory functions distinct from pRB/E2F. The p53–DREAM pathway controls more than 250 mostly cell cycle-associated genes. The functional spectrum of these pathway targets spans from the G1 phase to the end of mitosis. Consequently, through downregulating the expression of gene products which are essential for progression through the cell cycle, the p53–DREAM pathway participates in the control of all checkpoints from DNA synthesis to cytokinesis including G1/S, G2/M and spindle assembly checkpoints. Therefore, defects in the p53–DREAM pathway contribute to a general loss of checkpoint control. Furthermore, deregulation of DREAM target genes promotes chromosomal instability and aneuploidy of cancer cells. Also, DREAM regulation is abrogated by the human papilloma virus HPV E7 protein linking the p53–DREAM pathway to carcinogenesis by HPV.Another feature of the pathway is that it downregulates many genes involved in DNA repair and telomere maintenance as well as Fanconi anemia.
    [Show full text]
  • Promising Therapy in Lung Cancer: Spotlight on Aurora Kinases
    cancers Review Promising Therapy in Lung Cancer: Spotlight on Aurora Kinases Domenico Galetta 1,* and Lourdes Cortes-Dericks 2 1 Division of Thoracic Surgery, European Institute of Oncology, IRCCS, 20141 Milan, Italy 2 Department of Biology, University of Hamburg, 20146 Hamburg, Germany; [email protected] * Correspondence: [email protected] Received: 4 September 2020; Accepted: 12 November 2020; Published: 14 November 2020 Simple Summary: Lung cancer has remained one of the major causes of death worldwide. Thus, a more effective treatment approach is essential, such as the inhibition of specific cancer-promoting molecules. Aurora kinases regulate the process of mitosis—a process of cell division that is necessary for normal cell proliferation. Dysfunction of these kinases can contribute to cancer formation. In this review, we present studies indicating the implication of Aurora kinases in tumor formation, drug resistance, and disease prognosis. The effectivity of using Aurora kinase inhibitors in the pre-clinical and clinical investigations has proven their therapeutic potential in the setting of lung cancer. This work may provide further information to broaden the development of anticancer drugs and, thus, improve the conventional lung cancer management. Abstract: Despite tremendous efforts to improve the treatment of lung cancer, prognosis still remains poor; hence, the search for efficacious therapeutic option remains a prime concern in lung cancer research. Cell cycle regulation including mitosis has emerged as an important target for cancer management. Novel pharmacological agents blocking the activities of regulatory molecules that control the functional aspects of mitosis such as Aurora kinases are now being investigated. The Aurora kinases, Aurora-A (AURKA), and Aurora B (AURKB) are overexpressed in many tumor entities such as lung cancer that correlate with poor survival, whereby their inhibition, in most cases, enhances the efficacy of chemo-and radiotherapies, indicating their implication in cancer therapy.
    [Show full text]
  • Identi Cation of Novel Hub Genes Through Expression Pro Les
    Identication of Novel Hub Genes Through Expression Proles Analysis in Adrenocortical Carcinoma Chenhe Yao Beijing Zhicheng Biomedical Technology Co,Ltd Xiaoling Zhao Beijing Zhicheng Biomedical Technology Co,Ltd Xuemeng Shang Beijing Zhicheng Biomedical Technology Co,Ltd Binghan Jia Beijing Zhicheng Biomedical Technology Co,Ltd Shuaijie Dou Beijing Zhicheng Biomedical Technology Co,Ltd Weiliang Ye Beijing Zhicheng Biomedical Technology Co,Ltd Yuemei Yang ( [email protected] ) Beijing Zhicheng Biomedical Technology,Co.,Ltd. Research Keywords: Adrenocortical carcinoma, biomarkers, gene expression proling, Co-expression, prognosis Posted Date: July 1st, 2020 DOI: https://doi.org/10.21203/rs.3.rs-38768/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 1/21 Abstract Background: Adrenocortical carcinoma (ACC) is a heterogeneous and rare malignant tumor associated with a poor prognosis. The molecular mechanisms of ACC remain elusive and more accurate biomarkers for the prediction of prognosis are needed. Methods: In this study, integrative proling analyses were performed to identify novel hub genes in ACC to provide promising targets for future investigation. Three gene expression proling datasets in the GEO database were used for the identication of overlapped differentially expressed genes (DEGs) following the criteria of adj.P.Value<0.05 and |log2 FC|>0.5 in ACC. Novel hub genes were screened out following a series of processes: the retrieval of DEGs with no known associations with ACC on Pubmed, then the cross-validation of expression values and signicant associations with overall survival in the GEPIA2 and starBase databases, and nally the prediction of gene-tumor association in the GeneCards database.
    [Show full text]