Comprehensive Identification and Characterization of Conserved Small

Total Page:16

File Type:pdf, Size:1020Kb

Comprehensive Identification and Characterization of Conserved Small bioRxiv preprint doi: https://doi.org/10.1101/017772; this version posted April 9, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Comprehensive identification and 2 characterization of conserved small 3 ORFs in animals 4 5 6 Sebastian D. Mackowiak1, Henrik Zauber1, Chris Bielow2, Denise Thiel1, Kamila Kutz1, 7 Lorenzo Calviello1, Guido Mastrobuoni1, Nikolaus RaJewsky1, Stefan Kempa1, Matthias 8 Selbach1, Benedikt Obermayer1* 9 10 1Max-Delbrück-Center for Molecular Medicine, Berlin Institute for Medical Systems 11 Biology, 13125 Berlin, Germany 12 2Berlin Institute of Health, Kapelle-Ufer 2, 10117 Berlin, Germany 13 14 *[email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/017772; this version posted April 9, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Abstract 2 There is increasing evidence that non-annotated short open reading frames (sORFs) can 3 encode functional micropeptides, but computational identification remains challenging. 4 We expand our published method and predict conserved sORFs in human, mouse, 5 zebrafish, fruit fly and the nematode C. elegans. Isolating specific conservation 6 signatures indicative of purifying selection on encoded amino acid sequence, we identify 7 about 2000 novel sORFs in the untranslated regions of canonical mRNAs or on 8 transcripts annotated as non-coding. Predicted sORFs show stronger conservation 9 signatures than those identified in previous studies and are sometimes conserved over 10 large evolutionary distances. Encoded peptides have little homology to known proteins 11 and are enriched in disordered regions and short interaction motifs. Published ribosome 12 profiling data indicate translation for more than 100 of novel sORFs, and mass 13 spectrometry data gives peptidomic evidence for more than 70 novel candidates. We 14 thus provide a catalog of conserved micropeptides for functional validation in vivo. 2 bioRxiv preprint doi: https://doi.org/10.1101/017772; this version posted April 9, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Introduction 2 Ongoing efforts to comprehensively annotate the genomes of humans and other species 3 revealed that a much larger fraction of the genome is transcribed than initially 4 appreciated1. Pervasive transcription produces a number of novel classes of non-coding 5 RNAs, in particular long intergenic non-coding RNAs (lincRNAs)2. The defining feature of 6 lincRNAs is the lack of canonical open reading frames (ORFs), classified mainly by 7 length, nucleotide sequence statistics, conservation signatures and similarity to known 8 protein domains2. Although coding-independent RNA-level functions have been 9 established for a growing number of lincRNAs3,4, there is little consensus about their 10 general roles5. Moreover, the distinction between lincRNAs and mRNAs is not always 11 clear-cut6, since many lincRNAs have short ORFs, which easily occur by chance in any 12 stretch of nucleotide sequence. However, recent observations suggest that lincRNAs and 13 other non-coding regions are often associated with ribosomes and sometimes in fact 14 translated7-16. Indeed, some of the encoded peptides have been detected via mass 15 spectrometry10,17-23. Small peptides have been marked as essential cellular components 16 in bacteria24 and yeast25. More detailed functional studies have identified the well- 17 known tarsal-less peptides in insects26-29, characterized a short secreted peptide as an 18 important developmental signal in vertebrates30, and established a fundamental link 19 between different animal micropeptides and cellular calcium uptake31,32. 20 Importantly, some ambiguity between coding and non-coding regions has been 21 observed even on canonical mRNAs15: upstream ORFs (uORFs) in 5' untranslated 22 regions (5'UTRs) are frequent, well-known and mostly linked to the translational 23 regulation of the main CDS33,34. To a lesser extent, mRNA 3'UTRs have also been found 24 associated to ribosomes, which has been attributed to stop-codon read-through35, in 25 other cases to delayed drop-off, translational regulation or ribosome recycling36, and 26 even to the translation of 3'UTR ORFs (dORFs)10. Translational regulation could be the 27 main role of these ORFs, and regulatory effects of translation (e.g., mRNA decay) could 28 be a major function of lincRNA translation12. Alternatively, they could be ORFs in their 29 own right, considering well-known examples of polycistronic transcripts in animals such 30 as the tarsal-less mRNA26-28. Indeed, many non-annotated ORFs have been found to 31 produce detectable peptides10,17, and might therefore encode functional 32 micropeptides37. 33 Typically, lincRNAs are poorly conserved on the nucleotide level, and it is hard to 34 computationally detect functional conservation despite sequence divergence even when 35 it is suggested by synteny2,38. In contrast, many of the sORFs known to produce 3 bioRxiv preprint doi: https://doi.org/10.1101/017772; this version posted April 9, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 functional micropeptides display striking sequence conservation, highlighted by a 2 characteristic depletion of nonsynonymous compared to synonymous mutations. This 3 suggests purifying selection on the level of encoded peptide (rather than DNA or RNA) 4 sequence. Also, the sequence conservation rarely extends far beyond the ORF itself, and 5 an absence of insertions or deletions implies conservation of the reading frame. These 6 features are well-known characteristics of canonical protein-coding genes and have in 7 fact been used for many years in comparative genomics39,40. While many powerful 8 computational methods to identify protein-coding regions are based on sequence 9 statistics and suffer high false-positive rates for very short ORFs41,42, comparative 10 genomics methods have gained statistical power over the last years given the vastly 11 increased number of sequenced animal genomes. 12 Here, we present results of an integrated computational pipeline to identify conserved 13 sORFs using comparative genomics. We greatly extended our previously published 14 approach10 and applied it to the entire transcriptome of five animal species: human (H. 15 sapiens), mouse (M. musculus), zebrafish (D. rerio), fruit fly (D. melanogaster), and the 16 nematode C. elegans. Applying rigorous filtering criteria, we find a total of about 2000 17 novel conserved sORFs in lincRNAs as well as other regions of the transcriptome 18 annotated as non-coding. By means of comparative and population genomics, we detect 19 purifying selection on the encoded peptide sequence, suggesting that the detected 20 sORFs, of which some are conserved over wide evolutionary distances, give rise to 21 functional micropeptides. We compare our results to published catalogs of peptides 22 from non-annotated regions, to sets of sORFs found to be translated using ribosome 23 profiling, and to a number of computational sORF predictions. While there is often little 24 overlap, we find in all cases consistently stronger conservation for our candidates, 25 confirming the high stringency of our approach. Overall, predicted peptides have little 26 homology to known proteins and are rich in disordered regions and peptide binding 27 motifs which could mediate protein-protein interactions. Finally, we use published high- 28 throughput datasets to analyze expression of their host transcripts, confirm translation 29 of more than 100 novel sORFs using published ribosome profiling data, and mine in- 30 house and published mass spectrometry datasets to support protein expression from 31 more than 70 novel sORFs. Altogether, we provide a comprehensive catalog of 32 conserved sORFs in animals to aid functional studies. 4 bioRxiv preprint doi: https://doi.org/10.1101/017772; this version posted April 9, 2015. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Results 2 Identification of conserved coding sORFs from multiple species alignments. 3 Our approach, which is summarized in Fig. 1A, is a significant extension of our 4 previously published method10. Like most other computational studies, we take an 5 annotated transcriptome together with published lincRNA catalogs as a starting point. 6 We chose the Ensembl annotation (v74), which is currently one of the most 7 comprehensive ones, especially for the species considered here. In contrast to de novo 8 genome-wide predictions17,43, we rely on annotated transcript structures including 9 splice sites. We then identified canonical ORFs for each transcript, using the most 10 upstream AUG for each stop codon; although use of non-canonical start codons has been 11 frequently described15-17,44,45, there is currently no clear consensus how alternative 12 translation start sites are selected. Next, ORFs were classified according to their location 13 on lincRNAs or on transcripts from protein-coding loci: annotated ORFs serving as 14 positive control; ORFs in 3'UTRs, 5'UTRs or overlapping
Recommended publications
  • A Flexible Microfluidic System for Single-Cell Transcriptome Profiling
    www.nature.com/scientificreports OPEN A fexible microfuidic system for single‑cell transcriptome profling elucidates phased transcriptional regulators of cell cycle Karen Davey1,7, Daniel Wong2,7, Filip Konopacki2, Eugene Kwa1, Tony Ly3, Heike Fiegler2 & Christopher R. Sibley 1,4,5,6* Single cell transcriptome profling has emerged as a breakthrough technology for the high‑resolution understanding of complex cellular systems. Here we report a fexible, cost‑efective and user‑ friendly droplet‑based microfuidics system, called the Nadia Instrument, that can allow 3′ mRNA capture of ~ 50,000 single cells or individual nuclei in a single run. The precise pressure‑based system demonstrates highly reproducible droplet size, low doublet rates and high mRNA capture efciencies that compare favorably in the feld. Moreover, when combined with the Nadia Innovate, the system can be transformed into an adaptable setup that enables use of diferent bufers and barcoded bead confgurations to facilitate diverse applications. Finally, by 3′ mRNA profling asynchronous human and mouse cells at diferent phases of the cell cycle, we demonstrate the system’s ability to readily distinguish distinct cell populations and infer underlying transcriptional regulatory networks. Notably this provided supportive evidence for multiple transcription factors that had little or no known link to the cell cycle (e.g. DRAP1, ZKSCAN1 and CEBPZ). In summary, the Nadia platform represents a promising and fexible technology for future transcriptomic studies, and other related applications, at cell resolution. Single cell transcriptome profling has recently emerged as a breakthrough technology for understanding how cellular heterogeneity contributes to complex biological systems. Indeed, cultured cells, microorganisms, biopsies, blood and other tissues can be rapidly profled for quantifcation of gene expression at cell resolution.
    [Show full text]
  • WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT (51) International Patent Classification: CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, C12Q 1/68 (2018.01) A61P 31/18 (2006.01) DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, C12Q 1/70 (2006.01) HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, (21) International Application Number: MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, PCT/US2018/056167 OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, (22) International Filing Date: SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, 16 October 2018 (16. 10.2018) TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (25) Filing Language: English (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, (26) Publication Language: English GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, (30) Priority Data: UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, 62/573,025 16 October 2017 (16. 10.2017) US TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, ΓΕ , IS, IT, LT, LU, LV, (71) Applicant: MASSACHUSETTS INSTITUTE OF MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TECHNOLOGY [US/US]; 77 Massachusetts Avenue, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, Cambridge, Massachusetts 02139 (US).
    [Show full text]
  • The Function and Evolution of C2H2 Zinc Finger Proteins and Transposons
    The function and evolution of C2H2 zinc finger proteins and transposons by Laura Francesca Campitelli A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Laura Francesca Campitelli 2020 The function and evolution of C2H2 zinc finger proteins and transposons Laura Francesca Campitelli Doctor of Philosophy Department of Molecular Genetics University of Toronto 2020 Abstract Transcription factors (TFs) confer specificity to transcriptional regulation by binding specific DNA sequences and ultimately affecting the ability of RNA polymerase to transcribe a locus. The C2H2 zinc finger proteins (C2H2 ZFPs) are a TF class with the unique ability to diversify their DNA-binding specificities in a short evolutionary time. C2H2 ZFPs comprise the largest class of TFs in Mammalian genomes, including nearly half of all Human TFs (747/1,639). Positive selection on the DNA-binding specificities of C2H2 ZFPs is explained by an evolutionary arms race with endogenous retroelements (EREs; copy-and-paste transposable elements), where the C2H2 ZFPs containing a KRAB repressor domain (KZFPs; 344/747 Human C2H2 ZFPs) are thought to diversify to bind new EREs and repress deleterious transposition events. However, evidence of the gain and loss of KZFP binding sites on the ERE sequence is sparse due to poor resolution of ERE sequence evolution, despite the recent publication of binding preferences for 242/344 Human KZFPs. The goal of my doctoral work has been to characterize the Human C2H2 ZFPs, with specific interest in their evolutionary history, functional diversity, and coevolution with LINE EREs.
    [Show full text]
  • Whole Genome Sequencing of Familial Non-Medullary Thyroid Cancer Identifies Germline Alterations in MAPK/ERK and PI3K/AKT Signaling Pathways
    biomolecules Article Whole Genome Sequencing of Familial Non-Medullary Thyroid Cancer Identifies Germline Alterations in MAPK/ERK and PI3K/AKT Signaling Pathways Aayushi Srivastava 1,2,3,4 , Abhishek Kumar 1,5,6 , Sara Giangiobbe 1, Elena Bonora 7, Kari Hemminki 1, Asta Försti 1,2,3 and Obul Reddy Bandapalli 1,2,3,* 1 Division of Molecular Genetic Epidemiology, German Cancer Research Center (DKFZ), D-69120 Heidelberg, Germany; [email protected] (A.S.); [email protected] (A.K.); [email protected] (S.G.); [email protected] (K.H.); [email protected] (A.F.) 2 Hopp Children’s Cancer Center (KiTZ), D-69120 Heidelberg, Germany 3 Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), D-69120 Heidelberg, Germany 4 Medical Faculty, Heidelberg University, D-69120 Heidelberg, Germany 5 Institute of Bioinformatics, International Technology Park, Bangalore 560066, India 6 Manipal Academy of Higher Education (MAHE), Manipal, Karnataka 576104, India 7 S.Orsola-Malphigi Hospital, Unit of Medical Genetics, 40138 Bologna, Italy; [email protected] * Correspondence: [email protected]; Tel.: +49-6221-42-1709 Received: 29 August 2019; Accepted: 10 October 2019; Published: 13 October 2019 Abstract: Evidence of familial inheritance in non-medullary thyroid cancer (NMTC) has accumulated over the last few decades. However, known variants account for a very small percentage of the genetic burden. Here, we focused on the identification of common pathways and networks enriched in NMTC families to better understand its pathogenesis with the final aim of identifying one novel high/moderate-penetrance germline predisposition variant segregating with the disease in each studied family.
    [Show full text]
  • Supplementary Table 1 Genes Tested in Qrt-PCR in Nfpas
    Supplementary Table 1 Genes tested in qRT-PCR in NFPAs Gene Bank accession Gene Description number ABI assay ID a disintegrin-like and metalloprotease with thrombospondin type 1 motif 7 ADAMTS7 NM_014272.3 Hs00276223_m1 Rho guanine nucleotide exchange factor (GEF) 3 ARHGEF3 NM_019555.1 Hs00219609_m1 BCL2-associated X protein BAX NM_004324 House design Bcl-2 binding component 3 BBC3 NM_014417.2 Hs00248075_m1 B-cell CLL/lymphoma 2 BCL2 NM_000633 House design Bone morphogenetic protein 7 BMP7 NM_001719.1 Hs00233476_m1 CCAAT/enhancer binding protein (C/EBP), alpha CEBPA NM_004364.2 Hs00269972_s1 coxsackie virus and adenovirus receptor CXADR NM_001338.3 Hs00154661_m1 Homo sapiens Dicer1, Dcr-1 homolog (Drosophila) (DICER1) DICER1 NM_177438.1 Hs00229023_m1 Homo sapiens dystonin DST NM_015548.2 Hs00156137_m1 fms-related tyrosine kinase 3 FLT3 NM_004119.1 Hs00174690_m1 glutamate receptor, ionotropic, N-methyl D-aspartate 1 GRIN1 NM_000832.4 Hs00609557_m1 high-mobility group box 1 HMGB1 NM_002128.3 Hs01923466_g1 heterogeneous nuclear ribonucleoprotein U HNRPU NM_004501.3 Hs00244919_m1 insulin-like growth factor binding protein 5 IGFBP5 NM_000599.2 Hs00181213_m1 latent transforming growth factor beta binding protein 4 LTBP4 NM_001042544.1 Hs00186025_m1 microtubule-associated protein 1 light chain 3 beta MAP1LC3B NM_022818.3 Hs00797944_s1 matrix metallopeptidase 17 MMP17 NM_016155.4 Hs01108847_m1 myosin VA MYO5A NM_000259.1 Hs00165309_m1 Homo sapiens nuclear factor (erythroid-derived 2)-like 1 NFE2L1 NM_003204.1 Hs00231457_m1 oxoglutarate (alpha-ketoglutarate)
    [Show full text]
  • S41467-019-13840-9.Pdf
    ARTICLE https://doi.org/10.1038/s41467-019-13840-9 OPEN Accurate quantification of circular RNAs identifies extensive circular isoform switching events Jinyang Zhang 1,2, Shuai Chen1, Jingwen Yang1 & Fangqing Zhao1,2,3* Detection and quantification of circular RNAs (circRNAs) face several significant challenges, including high false discovery rate, uneven rRNA depletion and RNase R treatment efficiency, and underestimation of back-spliced junction reads. Here, we propose a novel algorithm, fi 1234567890():,; CIRIquant, for accurate circRNA quanti cation and differential expression analysis. By con- structing pseudo-circular reference for re-alignment of RNA-seq reads and employing sophisticated statistical models to correct RNase R treatment biases, CIRIquant can provide more accurate expression values for circRNAs with significantly reduced false discovery rate. We further develop a one-stop differential expression analysis pipeline implementing two independent measures, which helps unveil the regulation of competitive splicing between circRNAs and their linear counterparts. We apply CIRIquant to RNA-seq datasets of hepa- tocellular carcinoma, and characterize two important groups of linear-circular switching and circular transcript usage switching events, which demonstrate the promising ability to explore extensive transcriptomic changes in liver tumorigenesis. 1 Computational Genomics Lab, Beijing Institutes of Life Science, Chinese Academy of Sciences, 100101 Beijing, China. 2 University of Chinese Academy of Sciences, 100049 Beijing, China.
    [Show full text]
  • Supplemental Information
    SUPPLEMENTARY APPENDIX Plerixafor and G-CSF combination mobilizes hematopoietic stem and progenitors cells with a distinct transcriptional profile and a reduced in vivo homing capacity compared to plerixafor alone Maria Rosa Lidonnici, 1,2 * Annamaria Aprile, 1,3* Marta Claudia Frittoli, 4 Giacomo Mandelli, 1 Ylenia Paleari, 1,2 Antonello Spinelli, 5 Bernhard Gentner, 4 Matilde Zambelli, 6 Cristina Parisi, 6 Laura Bellio, 6 Elena Cassinerio, 7 Laura Zanaboni, 7 Maria Domenica Cappellini, 7 Fabio Ciceri, 4 Sarah Marktel 4 and Giuliana Ferrari 1,2 *MRL and AA contributed equally to this work. 1San Raffaele-Telethon Institute for Gene Therapy (SR-TIGET), Division of Regenerative Medicine, Stem Cells and Gene Therapy, IRCCS San Raffaele Sci - entific Institute, Milan; 2Vita-Salute San Raffaele University, Milan; 3Università degli Studi di Roma “Tor Vergata”; 4Hematology and Bone Marrow Transplan - tation Unit, IRCCS San Raffaele Scientific Institute, Milan; 5Experimental Imaging Centre, San Raffaele Scientific Institute, Milan; 6Immunohematology and Transfusion Medicine Unit, IRCCS San Raffaele Scientific Institute, Milan and 7Università di Milano, Ca Granda Foundation IRCCS, Italy Correspondence: [email protected] doi:10.3324/haematol.2016.154740 SUPPLEMENTARY FIGURES Fig. S1. Gene Ontology analysis of differentially expressed genes in Plx PB. Functional annotation by Partek software showed that genes could be grouped into a limited number of biological categories. Most of probesets upregulated in Plerixafor mobilized cells were enriched in cellular organization and metabolic process (p value <0.05) (A-B), while most of the probesets down regulated in Plx PB are involved in cell cycle process (p value <0.001) (C-D). One-way ANOVA statistical analysis was performed.
    [Show full text]
  • Evolutionary Analysis of Viral Sequences in Eukaryotic Genomes
    Evolutionary analysis of viral sequences in eukaryotic genomes Sean Schneider A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2014 Reading Committee: James H. Thomas, Chair Willie Swanson Phil Green Program Authorized to Offer Degree: Genome Sciences ©Copyright 2014 Sean Schneider University of Washington Abstract Evolutionary analysis of viral sequences in eukaryotic genomes Sean Schneider Chair of the supervisory committee: Professor James H. Thomas Genome Sciences The focus of this work is several evolutionary analyses of endogenous viral sequences in eukaryotic genomes. Endogenous viral sequences can provide key insights into the past forms and evolutionary history of viruses, as well as the responses of host organisms they infect. In this work I have examined viral sequences in a diverse assortment of eukaryotic hosts in order to study coevolution between hosts and the organisms that infect them. This research consisted of two major lines of investigation. In the first portion of this work, I outline the hypothesis that the C2H2 zinc finger gene family in vertebrates has evolved by birth-death evolution in response to sporadic retroviral infection. The hypothesis suggests an evolutionary model in which newly duplicated zinc finger genes are retained by selection in response to retroviral infection. This hypothesis is supported by a strong association (R2=0.67) between the number of endogenous retroviruses and the number of zinc fingers in diverse vertebrate genomes. Based on this and other evidence, the zinc finger gene family appears to act as a “genomic immune system” against retroviral infections. The other major line of investigation in this work examines endogenous virus sequences utilized by parasitic wasps to disable hosts that they infect.
    [Show full text]
  • The Interactome of KRAB Zinc Finger Proteins Reveals the Evolutionary History of Their Functional Diversification
    Research Collection Journal Article The interactome of KRAB zinc finger proteins reveals the evolutionary history of their functional diversification Author(s): Helleboid, Pierre-Yves; Heusel, Moritz; Duc, Julien; Piot, Cecile; Thorball, Christian W.; Coluccio, Andrea; Pontis, Julien; Turelli, Priscilla; Aebersold, Ruedi; Trono, Didier Publication Date: 2019-09-16 Permanent Link: https://doi.org/10.3929/ethz-b-000360670 Originally published in: The EMBO Journal 38(18), http://doi.org/10.15252/embj.2018101220 Rights / License: Creative Commons Attribution 4.0 International This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library Resource The interactome of KRAB zinc finger proteins reveals the evolutionary history of their functional diversification Pierre-Yves Helleboid1,†, Moritz Heusel2,†, Julien Duc1, Cécile Piot1, Christian W Thorball1, Andrea Coluccio1, Julien Pontis1, Michaël Imbeault1, Priscilla Turelli1, Ruedi Aebersold2,3,* & Didier Trono1,** Abstract years ago (MYA) (Imbeault et al, 2017). Their products harbor an N-terminal KRAB (Kru¨ppel-associated box) domain related to that of Krüppel-associated box (KRAB)-containing zinc finger proteins Meisetz (a.k.a. PRDM9), a protein that originated prior to the diver- (KZFPs) are encoded in the hundreds by the genomes of higher gence of chordates and echinoderms, and a C-terminal array of zinc vertebrates, and many act with the heterochromatin-inducing fingers (ZNF) with sequence-specific DNA-binding potential (Urru- KAP1 as repressors of transposable elements (TEs) during early tia, 2003; Birtle & Ponting, 2006; Imbeault et al, 2017). KZFP genes embryogenesis. Yet, their widespread expression in adult tissues multiplied by gene and segment duplication to count today more and enrichment at other genetic loci indicate additional roles.
    [Show full text]
  • A Grainyhead-Like 2/Ovo-Like 2 Pathway Regulates Renal Epithelial Barrier Function and Lumen Expansion
    BASIC RESEARCH www.jasn.org A Grainyhead-Like 2/Ovo-Like 2 Pathway Regulates Renal Epithelial Barrier Function and Lumen Expansion † ‡ | Annekatrin Aue,* Christian Hinze,* Katharina Walentin,* Janett Ruffert,* Yesim Yurtdas,*§ | Max Werth,* Wei Chen,* Anja Rabien,§ Ergin Kilic,¶ Jörg-Dieter Schulzke,** †‡ Michael Schumann,** and Kai M. Schmidt-Ott* *Max Delbrueck Center for Molecular Medicine, Berlin, Germany; †Experimental and Clinical Research Center, and Departments of ‡Nephrology, §Urology, ¶Pathology, and **Gastroenterology, Charité Medical University, Berlin, Germany; and |Berlin Institute of Urologic Research, Berlin, Germany ABSTRACT Grainyhead transcription factors control epithelial barriers, tissue morphogenesis, and differentiation, but their role in the kidney is poorly understood. Here, we report that nephric duct, ureteric bud, and collecting duct epithelia express high levels of grainyhead-like homolog 2 (Grhl2) and that nephric duct lumen expansion is defective in Grhl2-deficient mice. In collecting duct epithelial cells, Grhl2 inactivation impaired epithelial barrier formation and inhibited lumen expansion. Molecular analyses showed that GRHL2 acts as a transcrip- tional activator and strongly associates with histone H3 lysine 4 trimethylation. Integrating genome-wide GRHL2 binding as well as H3 lysine 4 trimethylation chromatin immunoprecipitation sequencing and gene expression data allowed us to derive a high-confidence GRHL2 target set. GRHL2 transactivated a group of genes including Ovol2, encoding the ovo-like 2 zinc finger transcription factor, as well as E-cadherin, claudin 4 (Cldn4), and the small GTPase Rab25. Ovol2 induction alone was sufficient to bypass the requirement of Grhl2 for E-cadherin, Cldn4,andRab25 expression. Re-expression of either Ovol2 or a combination of Cldn4 and Rab25 was sufficient to rescue lumen expansion and barrier formation in Grhl2-deficient collecting duct cells.
    [Show full text]
  • Molecular Targeting and Enhancing Anticancer Efficacy of Oncolytic HSV-1 to Midkine Expressing Tumors
    University of Cincinnati Date: 12/20/2010 I, Arturo R Maldonado , hereby submit this original work as part of the requirements for the degree of Doctor of Philosophy in Developmental Biology. It is entitled: Molecular Targeting and Enhancing Anticancer Efficacy of Oncolytic HSV-1 to Midkine Expressing Tumors Student's name: Arturo R Maldonado This work and its defense approved by: Committee chair: Jeffrey Whitsett Committee member: Timothy Crombleholme, MD Committee member: Dan Wiginton, PhD Committee member: Rhonda Cardin, PhD Committee member: Tim Cripe 1297 Last Printed:1/11/2011 Document Of Defense Form Molecular Targeting and Enhancing Anticancer Efficacy of Oncolytic HSV-1 to Midkine Expressing Tumors A dissertation submitted to the Graduate School of the University of Cincinnati College of Medicine in partial fulfillment of the requirements for the degree of DOCTORATE OF PHILOSOPHY (PH.D.) in the Division of Molecular & Developmental Biology 2010 By Arturo Rafael Maldonado B.A., University of Miami, Coral Gables, Florida June 1993 M.D., New Jersey Medical School, Newark, New Jersey June 1999 Committee Chair: Jeffrey A. Whitsett, M.D. Advisor: Timothy M. Crombleholme, M.D. Timothy P. Cripe, M.D. Ph.D. Dan Wiginton, Ph.D. Rhonda D. Cardin, Ph.D. ABSTRACT Since 1999, cancer has surpassed heart disease as the number one cause of death in the US for people under the age of 85. Malignant Peripheral Nerve Sheath Tumor (MPNST), a common malignancy in patients with Neurofibromatosis, and colorectal cancer are midkine- producing tumors with high mortality rates. In vitro and preclinical xenograft models of MPNST were utilized in this dissertation to study the role of midkine (MDK), a tumor-specific gene over- expressed in these tumors and to test the efficacy of a MDK-transcriptionally targeted oncolytic HSV-1 (oHSV).
    [Show full text]
  • 393LN V 393P 344SQ V 393P Probe Set Entrez Gene
    393LN v 393P 344SQ v 393P Entrez fold fold probe set Gene Gene Symbol Gene cluster Gene Title p-value change p-value change chemokine (C-C motif) ligand 21b /// chemokine (C-C motif) ligand 21a /// chemokine (C-C motif) ligand 21c 1419426_s_at 18829 /// Ccl21b /// Ccl2 1 - up 393 LN only (leucine) 0.0047 9.199837 0.45212 6.847887 nuclear factor of activated T-cells, cytoplasmic, calcineurin- 1447085_s_at 18018 Nfatc1 1 - up 393 LN only dependent 1 0.009048 12.065 0.13718 4.81 RIKEN cDNA 1453647_at 78668 9530059J11Rik1 - up 393 LN only 9530059J11 gene 0.002208 5.482897 0.27642 3.45171 transient receptor potential cation channel, subfamily 1457164_at 277328 Trpa1 1 - up 393 LN only A, member 1 0.000111 9.180344 0.01771 3.048114 regulating synaptic membrane 1422809_at 116838 Rims2 1 - up 393 LN only exocytosis 2 0.001891 8.560424 0.13159 2.980501 glial cell line derived neurotrophic factor family receptor alpha 1433716_x_at 14586 Gfra2 1 - up 393 LN only 2 0.006868 30.88736 0.01066 2.811211 1446936_at --- --- 1 - up 393 LN only --- 0.007695 6.373955 0.11733 2.480287 zinc finger protein 1438742_at 320683 Zfp629 1 - up 393 LN only 629 0.002644 5.231855 0.38124 2.377016 phospholipase A2, 1426019_at 18786 Plaa 1 - up 393 LN only activating protein 0.008657 6.2364 0.12336 2.262117 1445314_at 14009 Etv1 1 - up 393 LN only ets variant gene 1 0.007224 3.643646 0.36434 2.01989 ciliary rootlet coiled- 1427338_at 230872 Crocc 1 - up 393 LN only coil, rootletin 0.002482 7.783242 0.49977 1.794171 expressed sequence 1436585_at 99463 BB182297 1 - up 393
    [Show full text]