Supplementary Information

Total Page:16

File Type:pdf, Size:1020Kb

Supplementary Information Supplementary Information PathwayMatcher: multi-omics pathway mapping and proteoform network generation Luis Francisco Hernández Sánchez1,2,3, Bram Burger4,5, Carlos Horro4,5, Antonio Fabregat3, Stefan Johansson1,2, Pål Rasmus Njølstad1,6, Harald Barsnes4,5, Henning Hermjakob3,7, and Marc Vaudel1,2,* 1 K.G. Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway 2 Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway 3 European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom 4 Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway 5 Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway 6 Department of Pediatrics, Haukeland University Hospital, Bergen, Norway 7 Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing, China * To whom correspondence should be addressed Abstract Mapping biomedical data to functional knowledge is an essential task in biomedicine and can be achieved by querying gene or protein identifiers in pathway knowledgebases. Here, we demonstrate that including fine-granularity information such as post-translational modifications greatly increases the specificity of the analysis. We present PathwayMatcher (github.com/PathwayAnalysisPlatform/PathwayMatcher), a bioinformatic application for mapping multi-omics data to pathways and show how this enables the building of biological networks at the proteoform level. Hernández Sánchez et al. PathwayMatcher Table of Contents 1. Introduction ......................................................................................................................................................... 3 2. Availability ............................................................................................................................................................ 5 3. Post-translational modifications in the Reactome data model ....................................................... 6 4. Mapping omics data to pathways ................................................................................................................ 7 5. Input ..................................................................................................................................................................... 11 a) Genetic variants .......................................................................................................................................... 11 b) Genes ............................................................................................................................................................... 12 c) Peptides .......................................................................................................................................................... 13 d) Proteins .......................................................................................................................................................... 14 e) Proteoforms .................................................................................................................................................. 15 a) Superset (with and without PTM types) .......................................................................................... 17 b) Subset (with and without PTM types) ............................................................................................... 17 c) One (with and without PTM types) .................................................................................................... 18 d) Strict................................................................................................................................................................. 19 6. Output .................................................................................................................................................................. 21 a) Search .............................................................................................................................................................. 21 b) Analysis .......................................................................................................................................................... 22 c) Graph ............................................................................................................................................................... 25 7. Performance ...................................................................................................................................................... 28 8. Metrics and Figures ........................................................................................................................................ 30 9. References .......................................................................................................................................................... 31 2 Hernández Sánchez et al. PathwayMatcher 1. Introduction Biological pathways are a common way to represent biological processes. A pathway is a sequence of biochemical reactions in a cell that achieves a specific biological goal. Pathways are consolidated in public knowledgebases where they can be accessed, queried, and navigated. One of the main use cases is to map biomedical data to provide functional interpretation, and potentially uncover underlying causes for certain diseases, through so-called pathway analysis. Pathway analysis consist of two steps: (i) mapping of omics data to the knowledgebase, and (ii) statistical analysis evaluating how confidently the pathways relate to a clinical sample. The search for relevant pathways can be done using lists of genes or proteins. Proteins provide a finer level of detail given that multiple protein products can originate from the same gene. After the search has been performed, statistical methods are applied to filter and rank the resulting pathways (García-Campos et al., 2015). Proteins are the main participants of pathways, acting as reactants, catalysts, regulators or products. They take multiple forms and can also be chemically modified, all referred to as proteoforms, giving them the ability to perform highly specific tasks. Knowledgebases such as PhosphoSitePlus (Hornbeck et al., 2014) or Reactome (Fabregat et al., 2018a) gather information on proteoforms. Reactome notably annotates reactions involving proteoforms, which include the proteins’ processed peptide sequences, isoforms and sets of known post- translational modifications (PTMs). This type of annotation reflects the dynamic nature of the proteins and allows identifying the reactions and pathways where proteins need specific sets of PTMs to achieve the reactions. However, so far, no bioinformatic tool allowed the mapping and analysis of the detailed information contained in proteoform pathway networks. Here we present a more fine-grained approach to pathway search, not only using gene names or protein identifiers, but also proteoforms. As demonstrated in the main text, this tailored matching allows for a more specific analysis, and can reduce the prevalence of artefacts in the matching of the results. 3 Hernández Sánchez et al. PathwayMatcher We developed PathwayMatcher, an open-source standalone Java command line tool that maps multiple types of omics data to the pathways in the Reactome graph database, including: (i) lists of genetic variants, (ii) gene or protein identifiers, (iii) lists of peptides including post-translational modifications, and (iv) lists of proteoform identifiers. PathwayMatcher converts the input to either proteins or proteoforms and searches for them as participants in pathway reactions. The output comprises three types of files: (i) a list of the matched pathways, (ii) the result of an over- representation analysis, and (iii) the connection graphs. PathwayMatcher uses Reactome, a free, open source, curated knowledgebase containing human reactions categorized in hierarchical pathways which also includes proteoform-level annotation. Protein post-translational modifications are notably supported through the protein sequence coordinate and the modification type following the PSI-MOD ontology (Montecchi- Palazzi et al., 2008). Proteins have a UniProt (The UniProt, 2017) identifier associated with an additional indication of the isoform participating in the reaction. The detailed annotation of Reactome is therefore instrumental in our new fine-grained pathway search. PathwayMatcher contains all mappings internally and therefore does not rely on web services e.g. from Ensembl, Uniprot, or Reactome. This allows PathwayMatcher to run on high- performance setups without compromising efficiency through dependency on third-party services. Furthermore, it allows PathwayMatcher to run in secure environments without access to the internet. PathwayMatcher is readily available for integration in bioinformatic workflows thanks to implementations in Bioconda and Galaxy, as detailed below. 4 Hernández Sánchez et al. PathwayMatcher 2. Availability PathwayMatcher is freely available at github.com/PathwayAnalysisPlatform/PathwayMatcher under the permissive Apache 2.0 license. It is also possible to use PathwayMatcher as a Docker image: hub.docker.com/r/lfhs/pathwaymatcher. The Docker image allows the creation of isolated, self-contained containers comprising PathwayMatcher, its dependencies and internal data without
Recommended publications
  • Identifying Novel Disease-Associated Variants and Understanding The
    Identifying Novel Disease-variants and Understanding the Role of the STAT1-STAT4 Locus in SLE A dissertation submitted to the Graduate School of University of Cincinnati In partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Immunology Graduate Program of the College of Medicine by Zubin H. Patel B.S., Worcester Polytechnic Institute, 2009 John B. Harley, M.D., Ph.D. Committee Chair Gurjit Khurana Hershey, M.D., Ph.D Leah C. Kottyan, Ph.D. Harinder Singh, Ph.D. Matthew T. Weirauch, Ph.D. Abstract Systemic Lupus Erythematosus (SLE) or lupus is an autoimmune disorder caused by an overactive immune system with dysregulation of both innate and adaptive immune pathways. It can affect all major organ systems and may lead to inflammation of the serosal and mucosal surfaces. The pathogenesis of lupus is driven by genetic factors, environmental factors, and gene-environment interactions. Heredity accounts for a substantial proportion of SLE risk, and the role of specific genetic risk loci has been well established. Identifying the specific causal genetic variants and the underlying molecular mechanisms has been a major area of investigation. This thesis describes efforts to develop an analytical approach to identify candidate rare variants from trio analyses and a fine-mapping analysis at the STAT1-STAT4 locus, a well-replicated SLE-risk locus. For the STAT1-STAT4 locus, subsequent functional biological studies demonstrated genotype dependent gene expression, transcription factor binding, and DNA regulatory activity. Rare variants are classified as variants across the genome with an allele-frequency less than 1% in ancestral populations.
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • Biological and Prognostic Significance of Chromosome 5Q Deletions in Myeloid Malignancies Aristoteles A.N
    Review Biological and Prognostic Significance of Chromosome 5q Deletions in Myeloid Malignancies Aristoteles A.N. Giagounidis,1Ulrich Germing,2 and Carlo Aul1 Abstract The presence of del(5q), either as the sole karyotypic abnormality or as part of a more complex karyotype, has distinct clinical implications for myelodysplastic syndromes (MDS) and acute myeloid leukemia. The 5qÀ syndrome, a subtype of low-riskMDS, is characterized by an isolated 5q deletion and <5% blasts in the bone marrow and can serve as a useful model for studying the role of 5q deletions in the pathogenesis and prognosis of myeloid malignancies. Recent clinical results with lenalidomide, an oral immunomodulatory drug, have shown durable erythroid responses, including transfusion independence and complete cytogenetic remissions in patients with del(5q) MDS with or without additional chromosomal abnormalities. These results indicate that lenalidomide can overcome the pathogenic effect of 5q deletion in MDS and restore bone marrow balance. The data provide important new insights into the pathobiology of 5q chromo- somal deletions in myeloid malignancies. Cytogenetic abnormalities are detected in the bone marrow of preponderance, refractory macrocytic anemia, normal or high over 50% of patients diagnosed with primary myelodysplastic platelet counts, hypolobulated megakaryocytes, and modest syndromes (MDS) or myeloid leukemias, and up to 80% of leukopenia (11, 14, 17). The prognosis is favorable in 5qÀ patients with secondary or therapy-related MDS (1, 2). These syndrome with relatively low risk of transformation to AML abnormalities can be characterized as being balanced or (11, 18). Although the limits of 5q deletions vary among unbalanced (3, 4). Balanced cytogenetic abnormalities include patients with 5qÀ syndrome, the most frequent deletion is reciprocal translocations, inversions, and insertions (3, 5, 6).
    [Show full text]
  • Mutation Analysis of Genes Within the Dynactin Complex in a Cohort of Hereditary Peripheral Neuropathies
    Clin Genet 2016: 90: 127–133 © 2015 John Wiley & Sons A/S. Printed in Singapore. All rights reserved Published by John Wiley & Sons Ltd CLINICAL GENETICS doi: 10.1111/cge.12712 Original Article Mutation analysis of genes within the dynactin complex in a cohort of hereditary peripheral neuropathies a a Tey S., Ahmad-Annuar A., Drew A.P., Shahrizaila N., Nicholson G.A., S. Tey , A. Ahmad-Annuar , Kennerson M.L. Mutation analysis of genes within the dynactin complex in A.P. Drewb, N. Shahrizailac, , a cohort of hereditary peripheral neuropathies. G.A. Nicholsonb d and Clin Genet 2016: 90: 127–133. © John Wiley & Sons A/S. Published by M.L. Kennersonb,d John Wiley & Sons Ltd, 2015 aDepartment of Biomedical Science, The cytoplasmic dynein–dynactin genes are attractive candidates for Faculty of Medicine, University of Malaya, b neurodegenerative disorders given their functional role in retrograde Kuala Lumpur, Malaysia, Northcott transport along neurons. The cytoplasmic dynein heavy chain (DYNC1H1) Neuroscience Laboratory, ANZAC Research Institute, and Sydney Medical gene has been implicated in various neurodegenerative disorders, and School, University of Sydney, Sydney, dynactin 1 (DCTN1) genes have been implicated in a wide spectrum of Australia, cDepartment of Medicine, disorders including motor neuron disease, Parkinson’s disease, spinobulbar Faculty of Medicine, University of Malaya, muscular atrophy and hereditary spastic paraplegia. However, the Kuala Lumpur, Malaysia, and dMolecular involvement of other dynactin genes with inherited peripheral neuropathies Medicine Laboratory, Concord Hospital, (IPN) namely, hereditary sensory neuropathy, hereditary motor neuropathy Sydney, Australia and Charcot–Marie–Tooth disease is under reported. We screened eight genes; DCTN1-6 and ACTR1A and ACTR1B in 136 IPN patients using Key words: Charcot–Marie–Tooth – whole-exome sequencing and high-resolution melt (HRM) analysis.
    [Show full text]
  • Open Data for Differential Network Analysis in Glioma
    International Journal of Molecular Sciences Article Open Data for Differential Network Analysis in Glioma , Claire Jean-Quartier * y , Fleur Jeanquartier y and Andreas Holzinger Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, 8036 Graz, Austria; [email protected] (F.J.); [email protected] (A.H.) * Correspondence: [email protected] These authors contributed equally to this work. y Received: 27 October 2019; Accepted: 3 January 2020; Published: 15 January 2020 Abstract: The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes.
    [Show full text]
  • Epigenetic Mechanisms Are Involved in the Oncogenic Properties of ZNF518B in Colorectal Cancer
    Epigenetic mechanisms are involved in the oncogenic properties of ZNF518B in colorectal cancer Francisco Gimeno-Valiente, Ángela L. Riffo-Campos, Luis Torres, Noelia Tarazona, Valentina Gambardella, Andrés Cervantes, Gerardo López-Rodas, Luis Franco and Josefa Castillo SUPPLEMENTARY METHODS 1. Selection of genomic sequences for ChIP analysis To select the sequences for ChIP analysis in the five putative target genes, namely, PADI3, ZDHHC2, RGS4, EFNA5 and KAT2B, the genomic region corresponding to the gene was downloaded from Ensembl. Then, zoom was applied to see in detail the promoter, enhancers and regulatory sequences. The details for HCT116 cells were then recovered and the target sequences for factor binding examined. Obviously, there are not data for ZNF518B, but special attention was paid to the target sequences of other zinc-finger containing factors. Finally, the regions that may putatively bind ZNF518B were selected and primers defining amplicons spanning such sequences were searched out. Supplementary Figure S3 gives the location of the amplicons used in each gene. 2. Obtaining the raw data and generating the BAM files for in silico analysis of the effects of EHMT2 and EZH2 silencing The data of siEZH2 (SRR6384524), siG9a (SRR6384526) and siNon-target (SRR6384521) in HCT116 cell line, were downloaded from SRA (Bioproject PRJNA422822, https://www.ncbi. nlm.nih.gov/bioproject/), using SRA-tolkit (https://ncbi.github.io/sra-tools/). All data correspond to RNAseq single end. doBasics = TRUE doAll = FALSE $ fastq-dump -I --split-files SRR6384524 Data quality was checked using the software fastqc (https://www.bioinformatics.babraham. ac.uk /projects/fastqc/). The first low quality removing nucleotides were removed using FASTX- Toolkit (http://hannonlab.cshl.edu/fastxtoolkit/).
    [Show full text]
  • Genome-Wide Characterization of Genetic Variants and Putative
    Boschiero et al. BMC Genomics (2018) 19:83 DOI 10.1186/s12864-018-4444-0 RESEARCH ARTICLE Open Access Genome-wide characterization of genetic variants and putative regions under selection in meat and egg-type chicken lines Clarissa Boschiero1,4* , Gabriel Costa Monteiro Moreira1,AlmasAraGheyas2, Thaís Fernanda Godoy1, Gustavo Gasparin1, Pilar Drummond Sampaio Corrêa Mariani1,MarcelaPaduan1, Aline Silva Mello Cesar1, Mônica Corrêa Ledur3 and Luiz Lehmann Coutinho1 Abstract Background: Meat and egg-type chickens have been selected for several generations for different traits. Artificial and natural selection for different phenotypes can change frequency of genetic variants, leaving particular genomic footprints throghtout the genome. Thus, the aims of this study were to sequence 28 chickens from two Brazilian lines (meat and white egg-type) and use this information to characterize genome-wide genetic variations, identify putative regions under selection using Fst method, and find putative pathways under selection. Results: A total of 13.93 million SNPs and 1.36 million INDELs were identified, with more variants detected from the broiler (meat-type) line. Although most were located in non-coding regions, we identified 7255 intolerant non-synonymous SNPs, 512 stopgain/loss SNPs, 1381 frameshift and 1094 non-frameshift INDELs that may alter protein functions. Genes harboring intolerant non-synonymous SNPs affected metabolic pathways related mainly to reproduction and endocrine systems in the white-egg layer line, and lipid metabolism and metabolic diseases in the broiler line. Fst analysis in sliding windows, using SNPs and INDELs separately, identified over 300 putative regions of selection overlapping with more than 250 genes. For the first time in chicken, INDEL variants were considered for selection signature analysis, showing high level of correlation in results between SNP and INDEL data.
    [Show full text]
  • DCTN4 Antibody Purified Mouse Monoclonal Antibody Catalog # Ao2290a
    10320 Camino Santa Fe, Suite G San Diego, CA 92121 Tel: 858.875.1900 Fax: 858.622.0609 DCTN4 Antibody Purified Mouse Monoclonal Antibody Catalog # AO2290a Specification DCTN4 Antibody - Product Information Application E, WB, IF, FC, IHC Primary Accession Q9UJW0 Reactivity Human, Mouse Host Mouse Clonality Monoclonal Isotype IgG1 Calculated MW 52.3kDa KDa Description Dynactin 4 could have a dual role in dynein targeting and in ACTR1A/Arp1 subunit of dynactin pointed-end capping. Could be involved in ACTR1A pointed-end binding and in additional roles in linking dynein and dynactin to the cortical cytoskeleton.The dynactin complex binds cargo, such as vesicles and organelles, to cytoplasmic dynein for retrograde microtubule-mediated trafficking and could feasibly be involved in the copper-regulated trafficking of ATP7B. Immunogen Purified recombinant fragment of human DCTN4 (AA: 57-298) expressed in E. Coli. Formulation Ascitic fluid containing 0.03% sodium azide. DCTN4 Antibody - Additional Information Gene ID 51164 Other Names Dynactin subunit 4, Dyn4, Dynactin subunit p62, DCTN4 Dilution E~~1/10000 WB~~1/500 - 1/2000 IF~~1/200 - 1/1000 FC~~1/200 - 1/400 IHC~~1/200 - 1/1000 Storage Maintain refrigerated at 2-8°C for up to 6 months. For long term storage store at -20°C in small aliquots to prevent freeze-thaw cycles. Precautions DCTN4 Antibody is for research use only and not for use in diagnostic or therapeutic procedures. DCTN4 Antibody - Protein Information Page 1/2 10320 Camino Santa Fe, Suite G San Diego, CA 92121 Tel: 858.875.1900 Fax: 858.622.0609 Name DCTN4 Function Could have a dual role in dynein targeting and in ACTR1A/Arp1 subunit of dynactin pointed-end capping.
    [Show full text]
  • Chemico-Biological Interactions 196 (2012) 89–95
    Chemico-Biological Interactions 196 (2012) 89–95 Contents lists available at ScienceDirect Chemico-Biological Interactions journal homepage: www.elsevier.com/locate/chembioint Exposure to sodium tungstate and Respiratory Syncytial Virus results in hematological/immunological disease in C57BL/6J mice ⇑ Cynthia D. Fastje a, , Kevin Harper a, Chad Terry a, Paul R. Sheppard b, Mark L. Witten a,1 a Steele Children’s Research Center, PO Box 245073, University of Arizona, Tucson, AZ 85724-5073, USA b Laboratory of Tree-Ring Research, PO Box 210058, University of Arizona, Tucson, AZ 85721-0058, USA article info abstract Article history: The etiology of childhood leukemia is not known. Strong evidence indicates that precursor B-cell Acute Available online 1 May 2011 Lymphoblastic Leukemia (Pre-B ALL) is a genetic disease originating in utero. Environmental exposures in two concurrent, childhood leukemia clusters have been profiled and compared with geographically Keywords: similar control communities. The unique exposures, shared in common by the leukemia clusters, have Tungsten been modeled in C57BL/6 mice utilizing prenatal exposures. This previous investigation has suggested Respiratory Syncytial Virus in utero exposure to sodium tungstate (Na2WO4) may result in hematological/immunological disease Childhood leukemia through genes associated with viral defense. The working hypothesis is (1) in addition to spontaneously and/or chemically generated genetic lesions forming pre-leukemic clones, in utero exposure to Na2WO4 increases genetic susceptibility to viral influence(s); (2) postnatal exposure to a virus possessing the 1FXXKXFXXA/V9 peptide motif will cause an unnatural immune response encouraging proliferation in the B-cell precursor compartment. This study reports the results of exposing C57BL/6J mice to Na2WO4 3 in utero via water (15 ppm, ad libetum) and inhalation (mean concentration PM5 3.33 mg/m ) and to Respiratory Syncytial Virus (RSV) within 2 weeks of weaning.
    [Show full text]
  • Binding Between ROCK1 and DCTN2 Triggers Diabetes‑Associated Centrosome Amplification in Colon Cancer Cells
    ONCOLOGY REPORTS 46: 151, 2021 Binding between ROCK1 and DCTN2 triggers diabetes‑associated centrosome amplification in colon cancer cells YUAN FEI LI1, LIN JIE SHI1,2, PU WANG3, JIA WEN WANG4, GUANG YI SHI4 and SHAO CHIN LEE4 1Department of Oncology, The First Hospital, Shanxi Medical University, Taiyuan, Shanxi 030001; 2Intensive Care Unit, Shaanxi Provincial Cancer Hospital, Xian, Shaanxi 710000; 3Changzhi Medical University, Changzhi, Shanxi 030001; 4Institute of Biomedical Sciences of The School of Life Sciences, Jiangsu Normal University, Xuzhou, Jiangsu 221116, P.R. China Received January 17, 2021; Accepted May 5, 2021 DOI: 10.3892/or.2021.8102 Abstract. Type 2 diabetes increases the risk various types of Dynactin subunit 2 (DCTN2) was confirmed to be localized cancer and is associated with a poor prognosis therein. There to the centrosomes. Treating the cells with high glucose, is also evidence that the disease is associated with cancer insulin and palmitic acid increased the protein levels of metastasis. Centrosome amplification can initiate tumori‑ ROCK1 and DCTN2, promoted their binding with each genesis with metastasis in vivo and increase the invasiveness other, and triggered centrosome amplification. Disruption of of cancer cells in vitro. Our previous study reported that the protein complex by knocking down ROCK1 or DCTN2 type 2 diabetes promotes centrosome amplification via the expression partially attenuated centrosome amplification, upregulation and centrosomal translocation of Rho‑associated while simultaneous knockdown of both proteins completely protein kinase 1 (ROCK1), which suggests that centrosome inhibited centrosome amplification. These results suggested amplification is a candidate biological link between type 2 ROCK1‑DCTN2 binding as a signal for the regulation of diabetes and cancer development.
    [Show full text]
  • UNIVERSITY of CALIFORNIA, SAN DIEGO Measuring
    UNIVERSITY OF CALIFORNIA, SAN DIEGO Measuring and Correlating Blood and Brain Gene Expression Levels: Assays, Inbred Mouse Strain Comparisons, and Applications to Human Disease Assessment A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Biomedical Sciences by Mary Elizabeth Winn Committee in charge: Professor Nicholas J Schork, Chair Professor Gene Yeo, Co-Chair Professor Eric Courchesne Professor Ron Kuczenski Professor Sanford Shattil 2011 Copyright Mary Elizabeth Winn, 2011 All rights reserved. 2 The dissertation of Mary Elizabeth Winn is approved, and it is acceptable in quality and form for publication on microfilm and electronically: Co-Chair Chair University of California, San Diego 2011 iii DEDICATION To my parents, Dennis E. Winn II and Ann M. Winn, to my siblings, Jessica A. Winn and Stephen J. Winn, and to all who have supported me throughout this journey. iv TABLE OF CONTENTS Signature Page iii Dedication iv Table of Contents v List of Figures viii List of Tables x Acknowledgements xiii Vita xvi Abstract of Dissertation xix Chapter 1 Introduction and Background 1 INTRODUCTION 2 Translational Genomics, Genome-wide Expression Analysis, and Biomarker Discovery 2 Neuropsychiatric Diseases, Tissue Accessibility and Blood-based Gene Expression 4 Mouse Models of Human Disease 5 Microarray Gene Expression Profiling and Globin Reduction 7 Finding and Accessible Surrogate Tissue for Neural Tissue 9 Genetic Background Effect Analysis 11 SPECIFIC AIMS 12 ENUMERATION OF CHAPTERS
    [Show full text]
  • Functional Genomics of Cohesin Acetyltransferases in Human Cells
    FUNCTIONAL GENOMICS OF COHESIN ACETYLTRANSFERASES IN HUMAN CELLS by Sadia Rahman A Dissertation Presented to the Faculty of the Louis V. Gerstner, Jr. Graduate School of Biomedical Sciences, Memorial Sloan Kettering Cancer Center in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy New York, NY March, 2014 __________________________ _________________ Prasad V. Jallepalli, MD, PhD Date Dissertation Mentor ABSTRACT Accurate chromosome segregation during cell division requires that sister chromatids be physically linked from the time of their replication until their separation at anaphase. The cohesin complex, consisting of SMC1, SMC3, RAD21 and SCC3 arranges to form a ring-shaped structure that holds sister chromatids together. Acetylation of the cohesin SMC3 subunit by acetyltransferases ESCO1 and ESCO2 is essential for cohesion establishment. In addition to cohesion, cohesin also has roles in gene expression through its regulation of chromatin architecture. Acetylation of cohesin by ESCO1/2 is regulated temporally and spatially. In human cells, it begins in G1 phase, rises in S-phase and persists until mitosis. The reaction occurs only on DNA-bound cohesin and SMC3 is quickly deacetylated after cohesin is removed from DNA. In this study, we map genome-wide ESCO1/2 and AcSMC3 sites by ChIP- Seq, study their regulation, and contribution to cohesion and gene expression functions. Genome-wide mapping of ESCO1/2 reveals that they differ in their distribution: ESCO1 has many discrete binding sites that largely overlap with cohesin/CTCF sites, whereas ESCO2 has few sites of enrichment. A monoclonal antibody against the acetylated form of cohesin was also generated in this study to map cohesin acetylation, and this shows that cohesin is already acetylated in G1 at the majority of its sites and that this depends on ESCO1.
    [Show full text]