Biodata Mining Biomed Central

Total Page:16

File Type:pdf, Size:1020Kb

Biodata Mining Biomed Central BioData Mining BioMed Central Software article Open Access Fast Gene Ontology based clustering for microarray experiments Kristian Ovaska, Marko Laakso and Sampsa Hautaniemi* Address: Computational Systems Biology Laboratory, Institute of Biomedicine and Genome-Scale Biology Program, Biomedicum Helsinki, University of Helsinki, PO Box 63 (Haartmaninkatu 8), 00014 UNIVERSITY OF HELSINKI, Finland Email: Kristian Ovaska - [email protected]; Marko Laakso - [email protected]; Sampsa Hautaniemi* - [email protected] * Corresponding author Published: 21 November 2008 Received: 8 April 2008 Accepted: 21 November 2008 BioData Mining 2008, 1:11 doi:10.1186/1756-0381-1-11 This article is available from: http://www.biodatamining.org/content/1/1/11 © 2008 Ovaska et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: Analysis of a microarray experiment often results in a list of hundreds of disease- associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results: We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion: Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis. Background protein product encoded by a gene) in terms of their bio- A microarray experiment may result in hundreds of differ- logical processes, cellular components and molecular entially expressed genes that are subject to interpretation functions [1]. The GO annotations are carried out by cura- and further analysis. As analysing these lists gene-by-gene tors of several bioinformatics databases, so the GO data- is tedious and error prone, the genes in the lists are rou- base is constantly updated. The ontology defines terms tinely annotated using Gene Ontology (GO) with an aim that are linked together to form a directed acyclic graph. to identify statistically significant biological processes or Gene products are annotated with a number of ontology pathways [1]. However, statistical analysis of GO annota- terms. Annotation with a given term also implies annota- tions can produce a very large number of significantly tion with all ancestors of the term. enriched or down-regulated biological processes. Thus, it is often challenging to interpret GO results and identify In this study we present methodology and software to novel testable biological hypotheses. cluster genes based on their biological functionality using GO annotations. Integral part of the methodology is the The GO project provides a species-independent control- ability to rapidly compute pair-wise distances between the led vocabulary for describing gene products (an RNA or gene annotation similarities. Page 1 of 8 (page number not for citation purposes) BioData Mining 2008, 1:11 http://www.biodatamining.org/content/1/1/11 Two approaches to gene similarity computation are graph and were introduced in the context of GO by Lord et al. structure -based (GS) and information content -based [7]. To compute the semantic similarity between terms t1 (IC) measures. GS-based methods use the hierarchical and t2, we first find the most informative common ances- structure of GO in computing gene similarity. IC-based tor A of t1 and t2, i.e., A is a term that is an ancestor of both methods additionally consider the a priori probabilities, t1 and t2 and has the maximum IC among common ances- or information contents, of GO terms in a reference gene tors CommonAnc(t1,t2) of the terms. Now, the Resnik sim- set. IC-based measures have been found to perform better ilarity [8] is defined as than pure graph-based measures [2,3]. SimResnik(t1, t2) = IC(A). Czekanowski-Dice similarity [4] is a GS-based method. Distance of genes G1 and G2 is defined as Several other measures are defined that also take the infor- mation contents of t1 and t2 into account. The Lin measure Δ [9] is defined as = #(GO ( G12 ) GO ( G )) dG(,12 G ) , #(GO ( G12 )∪+ GO ( G )) #( GO ( G 12 ) ∩ GO ( G )) = 2IC() A where Δ is the symmetric set difference, # is the number of Sim Lin(,tt12 ) . (1) IC() t12+ IC ( t ) elements in a set and GO(Gi) is the set of GO annotations for gene Gi. Similarity can be defined as 1 - d(G1, G2). Jiang and Conrath [10] define a semantic distance metric as In Kappa statistics [5], each gene is represented as a binary vector (g1,...,gN), where gi is 1 if the gene is annotated with dJC(t1, t2) = IC(t1) + IC(t2) - 2IC(A). the GO term gi and 0 otherwise. N is the total number of GO terms under consideration. The corresponding similarity measure for dJC(t1, t2) [6] is given by Similarity of genes G1 and G2 is defined as 1 Sim (,tt )= . OGG,,−AGG JC 12 d (,)tt +1 K = 12 12, JC 12 GG12, 1−A GG12, Finally, the Relevance measure [11] that combines Lin's where O represents observed co-occurrence of GO and Resnik's measures is defined as GG12, terms and A represents random co-occurrence. − − GG12, = 21logpt ( )( pt ( )) = 21IC()( A p ()) A Sim Rel(,tt12 ) max . t∈ CommonAnc(,) t t logpt ( ))log(+ pt ) IC() t+ IC ( t ) O is the relative frequency of agreeing locations in 12 1 2 12 GG12, The MICA-based measures can be modified to take into the two binary vectors, i.e., locations that are either both account so called disjunctive ancestor terms [6]. Two 0 or both 1. A is the expected relative frequency of ancestors a1 and a2 of a term t are disjunctive if there are GG12, independent paths from a1 to t and from a2 to t. Such such locations if the binary vectors were random, taking ancestors represent distinct interpretations of the term t. into account the observed probabilities of 0's and 1's. In the GraSM enhancement, all common disjunctive ancestors of terms t1 and t2 are considered when comput- The following discussion considers IC-based similarity ing Sim(t1,t2) [6]. GraSM modifies the computation of measures. The information content of a GO term is com- IC(A) and can be applied to the Resnik, Lin and Jiang- puted by the frequency of the term occurring in annota- Conrath measures. tions; a rarely used term contains a greater amount of information. Probability for observing a term t is defined After computing the pair-wise term similarities, the next Freq() t step in MICA-based measures is to calculate the similarity as pt()= , where MaxFreq is the maximum fre- MaxFreq between genes G1 and G2. This can be done in several ways quency of all terms [6]. The information content for a and our package supports three most commonly used term t is given as IC(t) = -log2p(t). Probabilities can be esti- methods. In the two simplest methods, the maximum or mated from a corpus of annotations, such as the Gene the mean of pair-wise GO term similarities between anno- Ontology database. tation sets of G1 and G2 is used as the similarity value [12]. That is, when G1 is annotated with terms t1,...,tn and G2 Several related similarity metrics are based on the most ′′ informative common ancestor (MICA) of two GO terms with terms tt1,..., m , pair-wise term similarities form an n Page 2 of 8 (page number not for citation purposes) BioData Mining 2008, 1:11 http://www.biodatamining.org/content/1/1/11 Czekanowski-Dice, Kappa, Resnik (with GraSM as an × m matrix S. Now, Simgene(G1, G2) is the maximum or the option), Jiang-Conrath (GraSM), Lin (GraSM), Relevance, mean of the matrix. In the third method, similarity is Cosine and SimGIC. The MICA-based measures (Resnik, defined as Sim (G , G ) = max{rowScore, columnScore} gene 1 2 Lin, Jiang-Conrath, Relevance and GraSM enhancements) [11], where are implemented as a combination of R and C++ code; the four other measures are implemented in R. In addition to n m the regular R package, csbl.go is available as a component ==11 rowScore ∑ maxSSij and columnScore ∑ max ij. for Anduril [15], a framework for high-throughput data n 1≤≤jm m 1≤≤in i=1 j=1 analysis we recently developed. The package is extensively In addition to MICA- and GraSM-based measures, we tested and includes a user guide. have implemented the cosine similarity and SimGIC measures. In cosine similarity [13], each gene G is repre- Similarity computation needs GO term probabilities for the reference gene set. We provide precomputed probabil- sented as a vector (w , w ,...,w ), where each w is IC(t ) if 1 2 N i i ity tables for Homo sapiens, Saccharomyces cerevisiae, G is annotated with the term ti, or 0 otherwise. N is the Caenorhabditis elegans, Drosophila melanogaster, Mus muscu- total number of GO terms under consideration. Similarity lus and Rattus norvegicus. The tables are computed based GG⋅ on all gene and protein annotations for the given organ- of genes G and G is defined as 12, where · is the 1 2 ||||GG12 ism found in the geneontology.org database.
Recommended publications
  • Analysis of Trans Esnps Infers Regulatory Network Architecture
    Analysis of trans eSNPs infers regulatory network architecture Anat Kreimer Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2014 © 2014 Anat Kreimer All rights reserved ABSTRACT Analysis of trans eSNPs infers regulatory network architecture Anat Kreimer eSNPs are genetic variants associated with transcript expression levels. The characteristics of such variants highlight their importance and present a unique opportunity for studying gene regulation. eSNPs affect most genes and their cell type specificity can shed light on different processes that are activated in each cell. They can identify functional variants by connecting SNPs that are implicated in disease to a molecular mechanism. Examining eSNPs that are associated with distal genes can provide insights regarding the inference of regulatory networks but also presents challenges due to the high statistical burden of multiple testing. Such association studies allow: simultaneous investigation of many gene expression phenotypes without assuming any prior knowledge and identification of unknown regulators of gene expression while uncovering directionality. This thesis will focus on such distal eSNPs to map regulatory interactions between different loci and expose the architecture of the regulatory network defined by such interactions. We develop novel computational approaches and apply them to genetics-genomics data in human. We go beyond pairwise interactions to define network motifs, including regulatory modules and bi-fan structures, showing them to be prevalent in real data and exposing distinct attributes of such arrangements. We project eSNP associations onto a protein-protein interaction network to expose topological properties of eSNPs and their targets and highlight different modes of distal regulation.
    [Show full text]
  • Protein Interaction Network of Alternatively Spliced Isoforms from Brain Links Genetic Risk Factors for Autism
    ARTICLE Received 24 Aug 2013 | Accepted 14 Mar 2014 | Published 11 Apr 2014 DOI: 10.1038/ncomms4650 OPEN Protein interaction network of alternatively spliced isoforms from brain links genetic risk factors for autism Roser Corominas1,*, Xinping Yang2,3,*, Guan Ning Lin1,*, Shuli Kang1,*, Yun Shen2,3, Lila Ghamsari2,3,w, Martin Broly2,3, Maria Rodriguez2,3, Stanley Tam2,3, Shelly A. Trigg2,3,w, Changyu Fan2,3, Song Yi2,3, Murat Tasan4, Irma Lemmens5, Xingyan Kuang6, Nan Zhao6, Dheeraj Malhotra7, Jacob J. Michaelson7,w, Vladimir Vacic8, Michael A. Calderwood2,3, Frederick P. Roth2,3,4, Jan Tavernier5, Steve Horvath9, Kourosh Salehi-Ashtiani2,3,w, Dmitry Korkin6, Jonathan Sebat7, David E. Hill2,3, Tong Hao2,3, Marc Vidal2,3 & Lilia M. Iakoucheva1 Increased risk for autism spectrum disorders (ASD) is attributed to hundreds of genetic loci. The convergence of ASD variants have been investigated using various approaches, including protein interactions extracted from the published literature. However, these datasets are frequently incomplete, carry biases and are limited to interactions of a single splicing isoform, which may not be expressed in the disease-relevant tissue. Here we introduce a new interactome mapping approach by experimentally identifying interactions between brain-expressed alternatively spliced variants of ASD risk factors. The Autism Spliceform Interaction Network reveals that almost half of the detected interactions and about 30% of the newly identified interacting partners represent contribution from splicing variants, emphasizing the importance of isoform networks. Isoform interactions greatly contribute to establishing direct physical connections between proteins from the de novo autism CNVs. Our findings demonstrate the critical role of spliceform networks for translating genetic knowledge into a better understanding of human diseases.
    [Show full text]
  • Nuclear Organization and the Epigenetic Landscape of the Mus Musculus X-Chromosome Alicia Liu University of Connecticut - Storrs, [email protected]
    University of Connecticut OpenCommons@UConn Doctoral Dissertations University of Connecticut Graduate School 8-9-2019 Nuclear Organization and the Epigenetic Landscape of the Mus musculus X-Chromosome Alicia Liu University of Connecticut - Storrs, [email protected] Follow this and additional works at: https://opencommons.uconn.edu/dissertations Recommended Citation Liu, Alicia, "Nuclear Organization and the Epigenetic Landscape of the Mus musculus X-Chromosome" (2019). Doctoral Dissertations. 2273. https://opencommons.uconn.edu/dissertations/2273 Nuclear Organization and the Epigenetic Landscape of the Mus musculus X-Chromosome Alicia J. Liu, Ph.D. University of Connecticut, 2019 ABSTRACT X-linked imprinted genes have been hypothesized to contribute parent-of-origin influences on social cognition. A cluster of imprinted genes Xlr3b, Xlr4b, and Xlr4c, implicated in cognitive defects, are maternally expressed and paternally silent in the murine brain. These genes defy classic mechanisms of autosomal imprinting, suggesting a novel method of imprinted gene regulation. Using Xlr3b and Xlr4c as bait, this study uses 4C-Seq on neonatal whole brain of a 39,XO mouse model, to provide the first in-depth analysis of chromatin dynamics surrounding an imprinted locus on the X-chromosome. Significant differences in long-range contacts exist be- tween XM and XP monosomic samples. In addition, XM interaction profiles contact a greater number of genes linked to cognitive impairment, abnormality of the nervous system, and abnormality of higher mental function. This is not a pattern that is unique to the imprinted Xlr3/4 locus. Additional Alicia J. Liu - University of Connecticut - 2019 4C-Seq experiments show that other genes on the X-chromosome, implicated in intellectual disability and/or ASD, also produce more maternal contacts to other X-linked genes linked to cognitive impairment.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Multi-Targeted Mechanisms Underlying the Endothelial Protective Effects of the Diabetic-Safe Sweetener Erythritol
    Multi-Targeted Mechanisms Underlying the Endothelial Protective Effects of the Diabetic-Safe Sweetener Erythritol Danie¨lle M. P. H. J. Boesten1*., Alvin Berger2.¤, Peter de Cock3, Hua Dong4, Bruce D. Hammock4, Gertjan J. M. den Hartog1, Aalt Bast1 1 Department of Toxicology, Maastricht University, Maastricht, The Netherlands, 2 Global Food Research, Cargill, Wayzata, Minnesota, United States of America, 3 Cargill RandD Center Europe, Vilvoorde, Belgium, 4 Department of Entomology and UCD Comprehensive Cancer Center, University of California Davis, Davis, California, United States of America Abstract Diabetes is characterized by hyperglycemia and development of vascular pathology. Endothelial cell dysfunction is a starting point for pathogenesis of vascular complications in diabetes. We previously showed the polyol erythritol to be a hydroxyl radical scavenger preventing endothelial cell dysfunction onset in diabetic rats. To unravel mechanisms, other than scavenging of radicals, by which erythritol mediates this protective effect, we evaluated effects of erythritol in endothelial cells exposed to normal (7 mM) and high glucose (30 mM) or diabetic stressors (e.g. SIN-1) using targeted and transcriptomic approaches. This study demonstrates that erythritol (i.e. under non-diabetic conditions) has minimal effects on endothelial cells. However, under hyperglycemic conditions erythritol protected endothelial cells against cell death induced by diabetic stressors (i.e. high glucose and peroxynitrite). Also a number of harmful effects caused by high glucose, e.g. increased nitric oxide release, are reversed. Additionally, total transcriptome analysis indicated that biological processes which are differentially regulated due to high glucose are corrected by erythritol. We conclude that erythritol protects endothelial cells during high glucose conditions via effects on multiple targets.
    [Show full text]
  • Characterization of MAGED1 As a Component of E3 Ubiquitin Ligase Complexes
    Characterization of MAGED1 as a component of E3 ubiquitin ligase complexes Nora Riems Student number: 01206813 Promoter: Prof. Dr. Mathieu Bertrand Scientific supervisor: Dario Priem Master’s dissertation submitted to Ghent University to obtain the degree of Master of Science in Biochemistry and Biotechnology. Major Biomedical Biotechnology. Academic year: 2016 - 2017 Ghent University – Department of Biomedical Molecular Biology VIB – Center for Inflammation Research Research Group: Molecular Signalling and Cell Death Acknowledgments Many people have contributed to the realization of this master dissertation and I would like to give a well-deserved thank you to everybody. First of all, I would like to thank my promotor Mathieu Bertrand and scientific supervisor Dario Priem for giving me the opportunity to work on this project. I would like to express my sincere appreciation for your constant guidance and the immense amount of feedback. Without it, this project would have remained uncompleted. Thank you. I would also like to thank Ria Roelandt and Inge Bruggeman for the help and guidance throughout my project. You have not only helped met with practical work but you were also the persons I could turn to with all my questions. Next, I would like to warmly thank all the people of the research group for the fun moments in the lab. In particular Wannes, you have supported me in an emotional way and never failed to make me smile. Finally, I would like to thank my parents. Although the last couple of years have not always been the easiest, you never failed to continuously encourage and support me.
    [Show full text]
  • MAGED1 (Human) Recombinant Protein (Q01)
    MAGED1 (Human) Recombinant Protein (Q01) Catalog # : H00009500-Q01 規格 : [ 10 ug ] [ 25 ug ] List All Specification Application Image Product Human MAGED1 partial ORF ( NP_001005333, 117 a.a. - 226 a.a.) Enzyme-linked Immunoabsorbent Assay Description: recombinant protein with GST-tag at N-terminal. Western Blot (Recombinant Sequence: EMADIQVSAAAARPKSAFKVQNATTKGPNGVYDFSQAHNAKDVPNTQPK protein) AAFKSQNATPKGPNAAYDFSQAATTGELAANKSEMAFKAQNATTKVGP NATYNFSQSLNAN Antibody Production Host: Wheat Germ (in vitro) Protein Array Theoretical MW 37.84 (kDa): Preparation in vitro wheat germ expression system Method: Purification: Glutathione Sepharose 4 Fast Flow Quality Control 12.5% SDS-PAGE Stained with Coomassie Blue. Testing: Storage Buffer: 50 mM Tris-HCI, 10 mM reduced Glutathione, pH=8.0 in the elution buffer. Storage Store at -80°C. Aliquot to avoid repeated freezing and thawing. Instruction: Note: Best use within three months from the date of receipt of this protein. MSDS: Download Datasheet: Download Applications Enzyme-linked Immunoabsorbent Assay Western Blot (Recombinant protein) Antibody Production Page 1 of 2 2016/5/22 Protein Array Gene Information Entrez GeneID: 9500 GeneBank NM_001005333 Accession#: Protein NP_001005333 Accession#: Gene Name: MAGED1 Gene Alias: DLXIN-1,NRAGE Gene melanoma antigen family D, 1 Description: Omim ID: 300224 Gene Ontology: Hyperlink Gene Summary: This gene is a member of the melanoma antigen gene (MAGE) family. Most of the genes of this family encode tumor specific antigens that are not expressed in normal adult tissues except testis. Although the protein encoded by this gene shares strong homology with members of the MAGE family, it is expressed in almost all normal adult tissues. This gene has been demonstrated to be involved in the p75 neurotrophin receptor mediated programmed cell death pathway.
    [Show full text]
  • Supplementary Materials
    Supplementary materials Supplementary Table S1: MGNC compound library Ingredien Molecule Caco- Mol ID MW AlogP OB (%) BBB DL FASA- HL t Name Name 2 shengdi MOL012254 campesterol 400.8 7.63 37.58 1.34 0.98 0.7 0.21 20.2 shengdi MOL000519 coniferin 314.4 3.16 31.11 0.42 -0.2 0.3 0.27 74.6 beta- shengdi MOL000359 414.8 8.08 36.91 1.32 0.99 0.8 0.23 20.2 sitosterol pachymic shengdi MOL000289 528.9 6.54 33.63 0.1 -0.6 0.8 0 9.27 acid Poricoic acid shengdi MOL000291 484.7 5.64 30.52 -0.08 -0.9 0.8 0 8.67 B Chrysanthem shengdi MOL004492 585 8.24 38.72 0.51 -1 0.6 0.3 17.5 axanthin 20- shengdi MOL011455 Hexadecano 418.6 1.91 32.7 -0.24 -0.4 0.7 0.29 104 ylingenol huanglian MOL001454 berberine 336.4 3.45 36.86 1.24 0.57 0.8 0.19 6.57 huanglian MOL013352 Obacunone 454.6 2.68 43.29 0.01 -0.4 0.8 0.31 -13 huanglian MOL002894 berberrubine 322.4 3.2 35.74 1.07 0.17 0.7 0.24 6.46 huanglian MOL002897 epiberberine 336.4 3.45 43.09 1.17 0.4 0.8 0.19 6.1 huanglian MOL002903 (R)-Canadine 339.4 3.4 55.37 1.04 0.57 0.8 0.2 6.41 huanglian MOL002904 Berlambine 351.4 2.49 36.68 0.97 0.17 0.8 0.28 7.33 Corchorosid huanglian MOL002907 404.6 1.34 105 -0.91 -1.3 0.8 0.29 6.68 e A_qt Magnogrand huanglian MOL000622 266.4 1.18 63.71 0.02 -0.2 0.2 0.3 3.17 iolide huanglian MOL000762 Palmidin A 510.5 4.52 35.36 -0.38 -1.5 0.7 0.39 33.2 huanglian MOL000785 palmatine 352.4 3.65 64.6 1.33 0.37 0.7 0.13 2.25 huanglian MOL000098 quercetin 302.3 1.5 46.43 0.05 -0.8 0.3 0.38 14.4 huanglian MOL001458 coptisine 320.3 3.25 30.67 1.21 0.32 0.9 0.26 9.33 huanglian MOL002668 Worenine
    [Show full text]
  • Systematic Screening Reveals a Role for BRCA1 in the Response to Transcription-Associated DNA Damage
    Downloaded from genesdev.cshlp.org on October 6, 2021 - Published by Cold Spring Harbor Laboratory Press RESOURCE/METHODOLOGY Systematic screening reveals a role for BRCA1 in the response to transcription-associated DNA damage Sarah J. Hill,1,2 Thomas Rolland,1,2,3 Guillaume Adelmant,1,4,5 Xianfang Xia,1,2,3 Matthew S. Owen,1,2,3 Amelie Dricot,1,2,3 Travis I. Zack,1,6 Nidhi Sahni,1,2,3 Yves Jacob,1,2,3,7,8,9 Tong Hao,1,2,3 Kristine M. McKinney,1,2 Allison P. Clark,1,2 Deepak Reyon,10,11,12 Shengdar Q. Tsai,10,11,12 J. Keith Joung,10,11,12 Rameen Beroukhim,1,6,13 Jarrod A. Marto,1,4,5 Marc Vidal,1,2,3 Suzanne Gaudet,1,2,3 David E. Hill,1,2,3,14 and David M. Livingston1,2,14 1Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA; 2Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA; 3Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA; 4Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA; 5Blais Proteomics Center, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA; 6The Broad Institute, Cambridge, Massachusetts 02142, USA; 7Departement de Virologie, Unite de Gen etique Moleculaire des Virus a ARN, Institut Pasteur, F-75015 Paris, France; 8UMR3569, Centre National de la Recherche Scientifique, F-75015 Paris, France; 9UnitedeG en etique Moleculaire des Virus a ARN, Universite Paris Diderot, F-75015 Paris, France; 10Molecular Pathology Unit, Center for Computational and Integrative Biology, 11Center for Cancer Research, Massachusetts General Hospital, Charlestown, Massachusetts 02129, USA; 12Department of Pathology, Harvard Medical School, Boston, Massachusetts 02115, USA; 13Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA BRCA1 is a breast and ovarian tumor suppressor.
    [Show full text]
  • Role and Regulation of the P53-Homolog P73 in the Transformation of Normal Human Fibroblasts
    Role and regulation of the p53-homolog p73 in the transformation of normal human fibroblasts Dissertation zur Erlangung des naturwissenschaftlichen Doktorgrades der Bayerischen Julius-Maximilians-Universität Würzburg vorgelegt von Lars Hofmann aus Aschaffenburg Würzburg 2007 Eingereicht am Mitglieder der Promotionskommission: Vorsitzender: Prof. Dr. Dr. Martin J. Müller Gutachter: Prof. Dr. Michael P. Schön Gutachter : Prof. Dr. Georg Krohne Tag des Promotionskolloquiums: Doktorurkunde ausgehändigt am Erklärung Hiermit erkläre ich, dass ich die vorliegende Arbeit selbständig angefertigt und keine anderen als die angegebenen Hilfsmittel und Quellen verwendet habe. Diese Arbeit wurde weder in gleicher noch in ähnlicher Form in einem anderen Prüfungsverfahren vorgelegt. Ich habe früher, außer den mit dem Zulassungsgesuch urkundlichen Graden, keine weiteren akademischen Grade erworben und zu erwerben gesucht. Würzburg, Lars Hofmann Content SUMMARY ................................................................................................................ IV ZUSAMMENFASSUNG ............................................................................................. V 1. INTRODUCTION ................................................................................................. 1 1.1. Molecular basics of cancer .......................................................................................... 1 1.2. Early research on tumorigenesis ................................................................................. 3 1.3. Developing
    [Show full text]
  • The Viral Oncoproteins Tax and HBZ Reprogram the Cellular Mrna Splicing Landscape
    bioRxiv preprint doi: https://doi.org/10.1101/2021.01.18.427104; this version posted January 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The viral oncoproteins Tax and HBZ reprogram the cellular mRNA splicing landscape Charlotte Vandermeulen1,2,3, Tina O’Grady3, Bartimee Galvan3, Majid Cherkaoui1, Alice Desbuleux1,2,4,5, Georges Coppin1,2,4,5, Julien Olivet1,2,4,5, Lamya Ben Ameur6, Keisuke Kataoka7, Seishi Ogawa7, Marc Thiry8, Franck Mortreux6, Michael A. Calderwood2,4,5, David E. Hill2,4,5, Johan Van Weyenbergh9, Benoit Charloteaux2,4,5,10, Marc Vidal2,4*, Franck Dequiedt3*, and Jean-Claude Twizere1,2,11* 1Laboratory of Viral Interactomes, GIGA Institute, University of Liege, Liege, Belgium.2Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA.3Laboratory of Gene Expression and Cancer, GIGA Institute, University of Liege, Liege, Belgium.4Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA. 5Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.6Laboratory of Biology and Modeling of the Cell, CNRS UMR 5239, INSERM U1210, University of Lyon, Lyon, France.7Department of Pathology and Tumor Biology, Kyoto University, Japan.8Unit of Cell and Tissue Biology, GIGA Institute, University of Liege, Liege, Belgium.9Laboratory of Clinical and Epidemiological Virology, Rega Institute for Medical Research, Department of Microbiology, Immunology and Transplantation, Catholic University of Leuven, Leuven, Belgium.10Department of Human Genetics, CHU of Liege, University of Liege, Liege, Belgium.11Lead Contact. *Correspondence: [email protected]; [email protected]; [email protected] bioRxiv preprint doi: https://doi.org/10.1101/2021.01.18.427104; this version posted January 18, 2021.
    [Show full text]
  • A Transcriptional Signature of Postmitotic Maintenance in Neural Tissues
    Neurobiology of Aging 74 (2019) 147e160 Contents lists available at ScienceDirect Neurobiology of Aging journal homepage: www.elsevier.com/locate/neuaging Postmitotic cell longevityeassociated genes: a transcriptional signature of postmitotic maintenance in neural tissues Atahualpa Castillo-Morales a,b, Jimena Monzón-Sandoval a,b, Araxi O. Urrutia b,c,*, Humberto Gutiérrez a,** a School of Life Sciences, University of Lincoln, Lincoln, UK b Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK c Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México, Mexico article info abstract Article history: Different cell types have different postmitotic maintenance requirements. Nerve cells, however, are Received 11 April 2018 unique in this respect as they need to survive and preserve their functional complexity for the entire Received in revised form 3 October 2018 lifetime of the organism, and failure at any level of their supporting mechanisms leads to a wide range of Accepted 11 October 2018 neurodegenerative conditions. Whether these differences across tissues arise from the activation of Available online 19 October 2018 distinct cell typeespecific maintenance mechanisms or the differential activation of a common molecular repertoire is not known. To identify the transcriptional signature of postmitotic cellular longevity (PMCL), Keywords: we compared whole-genome transcriptome data from human tissues ranging in longevity from 120 days Neural maintenance Cell longevity to over 70 years and found a set of 81 genes whose expression levels are closely associated with Transcriptional signature increased cell longevity. Using expression data from 10 independent sources, we found that these genes Functional genomics are more highly coexpressed in longer-living tissues and are enriched in specific biological processes and transcription factor targets compared with randomly selected gene samples.
    [Show full text]