Human Accelerated Regions and Other Human-Specific Sequence

Total Page:16

File Type:pdf, Size:1020Kb

Human Accelerated Regions and Other Human-Specific Sequence GBE Human Accelerated Regions and Other Human-Specific Sequence Variations in the Context of Evolution and Their Relevance for Brain Development Anastasia Levchenko1,*, Alexander Kanapin1,2, Anastasia Samsonova1,2, and Raul R. Gainetdinov1,3 1Institute of Translational Biomedicine, Saint Petersburg State University, Russia 2Department of Oncology, University of Oxford, United Kingdom 3Skolkovo Institute of Science and Technology, Skolkovo, Moscow, Russia *Corresponding author: E-mail: [email protected]. Accepted: November 14, 2017 Abstract The review discusses, in a format of a timeline, the studies of different types of genetic variants, present in Homo sapiens,but absent in all other primate, mammalian, or vertebrate species, tested so far. The main characteristic of these variants is that they are found in regions of high evolutionary conservation. These sequence variations include single nucleotide substitutions (called human accelerated regions), deletions, and segmental duplications. The rationale for finding such variations in the human genome is that they could be responsible for traits, specific to our species, of which the human brain is the most remarkable. As became obvious, the vast majority of human-specific single nucleotide substitutions are found in noncoding, likely regulatory regions. A number of genes, associated with these human-specific alleles, often through novel enhancer activity, were in fact shown to be implicated in human-specific development of certain brain areas, including the prefrontal cortex. Human-specific deletions may remove regulatory sequences, such as enhancers. Segmental duplications, because of their large size, create new coding sequences, like new functional paralogs. Further functional study of these variants will shed light on evolution of our species, as well as on the etiology of neurodevelopmental disorders. Key words: substitutions, duplications, deletions, neurodevelopmental disorders, psychiatry, genes. Introduction The human brain is indeed much more sophisticated when Human accelerated regions (HARs) and human-specific geno- compared with our closest living relative, the chimpanzee, mic rearrangements, including deletions and segmental dupli- from whom we diverged 8 Ma and with whom we share cations, are among the most interesting sequences in the 99% of our genome (The Chimpanzee Sequencing and human genome to study, because they seem to shed light on Analysis Consortium 2005; Moorjani et al. 2016). As con- the appearance of Homo sapiens as a species, especially in the cluded by the classic study by King and Wilson (King and sense of the exceptionally developed human brain (O’Bleness Wilson 1975), functionally new protein-coding genes cannot et al. 2012a; Sassa 2013; Hubisz and Pollard 2014; Zhang and account for all obvious morphological differences between Long 2014; Bae et al. 2015; Franchini and Pollard 2015; Dennis humans and chimpanzees, because human protein-coding and Eichler 2016; Silver 2016; Namba and Huttner 2017). The genes are very similar to chimpanzee genes and constitute human brain appeared in evolution probably as a result of nat- only 1.5% of the human genome. The genetic basis of ural selection among the intellectually developed members of the morphological differences must therefore primarily the Homo genus. Only Homo sapiens survived, by presenting come from noncoding, regulatory sequences. One of the an exceptionally developed telencephalon, the prefrontal cor- most appealing ways to demonstrate that evolutionarily tex in particular, that gave us the capacity to invent more so- new regulatory regions play an important role in the function phisticated tools and to plan ahead. of the human brain was the discovery of HARs of 2006, An obvious question is: which evolutionarily new sequence followed by a number of similar studies (Pollard et al. variations determined the new pattern of brain development? 2006a; Prabhakar et al. 2006; Bird et al. 2007; Bush and ß The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] 166 Genome Biol. Evol. 10(1):166–188. doi:10.1093/gbe/evx240 Advance Access publication November 14, 2017 Human Accelerated Regions and Other Human-Specific Sequence Variations GBE Lahn 2008; Lindblad-Toh et al. 2011; Gittelman et al. 2015). 2014), abnormal spindle-like microcephaly associated HARs are short, 260 bp on an average, stretches of DNA, to (ASPM) that bears a number of human-specific synonymous 97% noncoding. They are conserved in vertebrates, including and nonsynonymous substitutions (Zhang 2003; Evans et al. Pan troglodytes, but not in Homo sapiens, in whom the con- 2004; Buchman et al. 2011; Jayaraman et al. 2016)and served sequences were subjected to significantly, in many Abelson helper integration site 1 (AHI1) that also underwent cases dramatically, higher rates of single nucleotide substitu- positive selection in humans (Ferland et al. 2004; Cheng et al. tions (Hubisz and Pollard 2014). An example of a HAR is given 2012). The reader is referred to reviews that describe these on figure 1a. Because conserved noncoding sequences are genes in further detail (Vallender et al. 2008; O’Bleness et al. most likely regions regulating gene expression, new substitu- 2012a; Bae et al. 2015). tions in these regions imply new patterns of regulation. A Taken together, currently available scientific data indicate fraction of HARs affect noncoding RNA genes (Pollard et al. that both noncoding regulatory regions, modified by single 2006b),whichareoftenalsoimplicatedinregulationofgene nucleotide substitutions and deletions, and new coding expression. In fact, a meta-analysis (Haygood et al. 2010), that sequences, created by segmental duplications, are important used data from Pollard et al. (2006a), Prabhakar et al. (2006) in human evolution and seem to shape the human brain. Only and a study that investigated positive selection in human pro- genomic regions and genes that were discovered in genome- moters (Haygood et al. 2007), indicated that human genes wide studies of human evolution will be dealt with in this regulating neurodevelopment were subjected, during evolu- review. Table 1 summarizes these regions and genes. tion, to the most prominent positive selection only within their Figure 2 presents the timeline of the studies. noncoding sequences. This tendency is exclusive to neurode- velopmental genes, whereas positive selection in coding sequences is characteristic for evolution of human genes im- Original Studies Describing HARs plicated in other functions, such as immune system and ol- faction (Haygood et al. 2010). Similar results were obtained by The First Discovery of HARs two other scientific teams that showed that substitutions in The two pioneering studies, in which HARs were de- the coding sequence of human brain-expressed genes were scribed, appeared in 2006 (Pollard et al. 2006a, 2006b). not numerous enough to indicate positive selection (Shi et al. The studies used a biostatistical method, which aimed at 2006; Wang et al. 2007). Finally, coding sequences of genes detecting any significant acceleration in the rate of nucle- associated with schizophrenia and autism were not subjected otide substitutions in human. First, Pollard et al. (2006a) to accelerated nonsynonymous substitutions in humans aligned genomes of chimpanzee, mouse, and rat, in order (Ogawa and Vallender 2014). to define regions of at least 96% conservation over As became clear starting in 2003 (Locke et al. 2003), the 100 bp; second, they aligned the orthologous conserved human genome is also subject to an important number of sequences of the total of 17 vertebrates, including hu- complex genomic rearrangements, including segmental dupli- man. Software MultiZ and PhastCons from the PHAST cations, that can also, because of their large size, affect cod- package (Blanchette et al. 2004; Siepel et al. 2005; ing genes (Fortna et al. 2004), in particular, create new Hubisz et al. 2011) and a likelihood ratio test were used functional paralogs (Charrier et al. 2012; Dennis et al. 2012, to this end. This method was different from the approach 2017). An example is given on figure 1b. Human-specific where the rate of substitutions in the genome of a species deletions may also remove regulatory sequences (McLean was compared with theoretical neutral expectations. In et al. 2011). An example is shown on figure 1c. The study other words, Pollard et al. (2006a)reliedonlyonfactual of the patterns of human-specific large structural variants is sequences of a number of different vertebrates, to directly currently in its infancy (Dennis and Eichler 2016), but it has compare them with Homo sapiens. Pollard et al. (2006a) already become clear that the impact of these variants in the discovered a more stringent data set of 49 HARs, at a false context of human evolution is paramount. discovery rate (FDR)-corrected P value <0.05, and a The majority of single nucleotide substitutions relevant to broader data set of 202 HARs in the human genome. As human brain evolution are in noncoding regions and affected an example, the top five HARs had substitution rates 26 coding genes are
Recommended publications
  • Universidade Estadual De Campinas Instituto De Biologia
    UNIVERSIDADE ESTADUAL DE CAMPINAS INSTITUTO DE BIOLOGIA VERÔNICA APARECIDA MONTEIRO SAIA CEREDA O PROTEOMA DO CORPO CALOSO DA ESQUIZOFRENIA THE PROTEOME OF THE CORPUS CALLOSUM IN SCHIZOPHRENIA CAMPINAS 2016 1 VERÔNICA APARECIDA MONTEIRO SAIA CEREDA O PROTEOMA DO CORPO CALOSO DA ESQUIZOFRENIA THE PROTEOME OF THE CORPUS CALLOSUM IN SCHIZOPHRENIA Dissertação apresentada ao Instituto de Biologia da Universidade Estadual de Campinas como parte dos requisitos exigidos para a obtenção do Título de Mestra em Biologia Funcional e Molecular na área de concentração de Bioquímica. Dissertation presented to the Institute of Biology of the University of Campinas in partial fulfillment of the requirements for the degree of Master in Functional and Molecular Biology, in the area of Biochemistry. ESTE ARQUIVO DIGITAL CORRESPONDE À VERSÃO FINAL DA DISSERTAÇÃO DEFENDIDA PELA ALUNA VERÔNICA APARECIDA MONTEIRO SAIA CEREDA E ORIENTADA PELO DANIEL MARTINS-DE-SOUZA. Orientador: Daniel Martins-de-Souza CAMPINAS 2016 2 Agência(s) de fomento e nº(s) de processo(s): CNPq, 151787/2F2014-0 Ficha catalográfica Universidade Estadual de Campinas Biblioteca do Instituto de Biologia Mara Janaina de Oliveira - CRB 8/6972 Saia-Cereda, Verônica Aparecida Monteiro, 1988- Sa21p O proteoma do corpo caloso da esquizofrenia / Verônica Aparecida Monteiro Saia Cereda. – Campinas, SP : [s.n.], 2016. Orientador: Daniel Martins de Souza. Dissertação (mestrado) – Universidade Estadual de Campinas, Instituto de Biologia. 1. Esquizofrenia. 2. Espectrometria de massas. 3. Corpo caloso.
    [Show full text]
  • Differences Between Human and Chimpanzee Genomes and Their Implications in Gene Expression, Protein Functions and Biochemical Properties of the Two Species Maria V
    Suntsova and Buzdin BMC Genomics 2020, 21(Suppl 7):535 https://doi.org/10.1186/s12864-020-06962-8 REVIEW Open Access Differences between human and chimpanzee genomes and their implications in gene expression, protein functions and biochemical properties of the two species Maria V. Suntsova1 and Anton A. Buzdin1,2,3,4* From 11th International Young Scientists School “Systems Biology and Bioinformatics”–SBB-2019 Novosibirsk, Russia. 24-28 June 2019 Abstract Chimpanzees are the closest living relatives of humans. The divergence between human and chimpanzee ancestors dates to approximately 6,5–7,5 million years ago. Genetic features distinguishing us from chimpanzees and making us humans are still of a great interest. After divergence of their ancestor lineages, human and chimpanzee genomes underwent multiple changes including single nucleotide substitutions, deletions and duplications of DNA fragments of different size, insertion of transposable elements and chromosomal rearrangements. Human-specific single nucleotide alterations constituted 1.23% of human DNA, whereas more extended deletions and insertions cover ~ 3% of our genome. Moreover, much higher proportion is made by differential chromosomal inversions and translocations comprising several megabase-long regions or even whole chromosomes. However, despite of extensive knowledge of structural genomic changes accompanying human evolution we still cannot identify with certainty the causative genes of human identity. Most structural gene-influential changes happened at the level of expression regulation, which in turn provoked larger alterations of interactome gene regulation networks. In this review, we summarized the available information about genetic differences between humans and chimpanzees and their potential functional impacts on differential molecular, anatomical, physiological and cognitive peculiarities of these species.
    [Show full text]
  • Molecular Evolutionary Routes That Lead to Innovations
    International Journal of Evolutionary Biology Molecular Evolutionary Routes That Lead to Innovations Guest Editors: Frédéric Brunet, Hideki Innan, Ben-Yang Liao, and Wen Wang Molecular Evolutionary Routes That Lead to Innovations International Journal of Evolutionary Biology Molecular Evolutionary Routes That Lead to Innovations Guest Editors: Fred´ eric´ Brunet, Hideki Innan, Ben-YangLiao, and Wen Wang Copyright © 2012 Hindawi Publishing Corporation. All rights reserved. This is a special issue published in “International Journal of Evolutionary Biology.” All articles are open access articles distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Editorial Board Giacomo Bernardi, USA Kazuho Ikeo, Japan Jeffrey R. Powell, USA Terr y Burke, UK Yoh Iwasa, Japan Hudson Kern Reeve, USA Ignacio Doadrio, Spain Henrik J. Jensen, UK Y. Satta, Japan Simon Easteal, Australia Amitabh Joshi, India Koji Tamura, Japan Santiago F. Elena, Spain Hirohisa Kishino, Japan Yoshio Tateno, Japan Renato Fani, Italy A. Moya, Spain E. N. Trifonov, Israel Dmitry A. Filatov, UK G. Pesole, Italy Eske Willerslev, Denmark F. Gonzalez-Candelas,´ Spain I. Popescu, USA Shozo Yokoyama, Japan D. Graur, USA David Posada, Spain Contents Molecular Evolutionary Routes That Lead to Innovations,Fred´ eric´ Brunet, Hideki Innan, Ben-Yang Liao, and Wen Wang Volume 2012, Article ID 483176, 2 pages Purifying Selection Bias against Microsatellites in Gene
    [Show full text]
  • Differential Gene Expression in Patients with Prostate Cancer and in Patients with Parkinson Disease: an Example of Inverse Comorbidity
    Differential Gene Expression in Patients With Prostate Cancer and in Patients With Parkinson Disease: an Example of Inverse Comorbidity Pietro Pepe Ospedale Cannizzaro Simona Vetrano Ospedale Cannizzaro Rossella Cannarella University of Catania Aldo E Calogero University of Catania Giovanna Marchese University of Salerno Maria Ravo University of Salerno Filippo Fraggetta Ospedale Cannizzaro Ludovica Pepe Ospedale Cannizzaro Michele Pennisi Ospedale Cannizzaro Corrado Romano Oasi Maria SS Raffaele Ferri Oasi Maria SS Michele Salemi ( [email protected] ) Oasi Maria SS Research Article Keywords: Prostate cancer, Parkinson disease, Inverse comorbidity, NGS Posted Date: March 11th, 2021 DOI: https://doi.org/10.21203/rs.3.rs-289371/v1 Page 1/11 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License Page 2/11 Abstract Prostate cancer (PCa) is one of the leading causes of death in Western countries. Environmental and genetic factors play a pivotal role in PCa etiology. Timely identication of the genetic causes is useful for an early diagnosis. Parkinson’s disease (PD) is the most frequent neurodegenerative movement disorder; it is associated with the presence of Lewy bodies (LBs) and genetic factors are involved in its pathogenesis. Several studies have indicated that the expression of target genes in patients with PD is inversely related to cancer development; this phenomenon has been named “inverse comorbidity”. The present study was undertaken to evaluate whether a genetic dysregulation occurs in opposite directions in patients with PD or PCa. In the present study, next-generation sequencing (NGS) transcriptome analysis was used to assess whether a genetic dysregulation in opposite directions occurs in patients with PD or PCa.
    [Show full text]
  • Placenta-Derived Exosomes Continuously Increase in Maternal
    Sarker et al. Journal of Translational Medicine 2014, 12:204 http://www.translational-medicine.com/content/12/1/204 RESEARCH Open Access Placenta-derived exosomes continuously increase in maternal circulation over the first trimester of pregnancy Suchismita Sarker1, Katherin Scholz-Romero1, Alejandra Perez2, Sebastian E Illanes1,2,3, Murray D Mitchell1, Gregory E Rice1,2 and Carlos Salomon1,2* Abstract Background: Human placenta releases specific nanovesicles (i.e. exosomes) into the maternal circulation during pregnancy, however, the presence of placenta-derived exosomes in maternal blood during early pregnancy remains to be established. The aim of this study was to characterise gestational age related changes in the concentration of placenta-derived exosomes during the first trimester of pregnancy (i.e. from 6 to 12 weeks) in plasma from women with normal pregnancies. Methods: A time-series experimental design was used to establish pregnancy-associated changes in maternal plasma exosome concentrations during the first trimester. A series of plasma were collected from normal healthy women (10 patients) at 6, 7, 8, 9, 10, 11 and 12 weeks of gestation (n = 70). We measured the stability of these vesicles by quantifying and observing their protein and miRNA contents after the freeze/thawing processes. Exosomes were isolated by differential and buoyant density centrifugation using a sucrose continuous gradient and characterised by their size distribution and morphology using the nanoparticles tracking analysis (NTA; Nanosight™) and electron microscopy (EM), respectively. The total number of exosomes and placenta-derived exosomes were determined by quantifying the immunoreactive exosomal marker, CD63 and a placenta-specific marker (Placental Alkaline Phosphatase PLAP).
    [Show full text]
  • Precise, Pan-Cancer Discovery of Gene Fusions Reveals a Signature Of
    bioRxiv preprint doi: https://doi.org/10.1101/178061; this version posted August 18, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Precise, pan-cancer discovery of gene fusions reveals a signature of selection in primary 2 tumors 3 4 Donald Eric Freeman1,3,Gillian Lee Hsieh1, Jonathan Michael Howard1, Erik Lehnert2, Julia 5 Salzman1,3,4* 6 7 Author affiliation 8 1Stanford University Department of Biochemistry, 279 Campus Drive, Stanford, CA 94305 9 2Seven Bridges Genomics, 1 Main Street, Suite 500, Cambridge MA 02142 10 3Stanford University Department of Biomedical Data Science, Stanford, CA 94305-5456 11 4Stanford Cancer Institute, Stanford, CA 94305 12 13 *Corresponding author [email protected] 14 15 Short Abstract: 16 The eXtent to which gene fusions function as drivers of cancer remains a critical open question 17 in cancer biology. In principle, transcriptome sequencing provided by The Cancer Genome 18 Atlas (TCGA) enables unbiased discovery of gene fusions and post-analysis that informs the 19 answer to this question. To date, such an analysis has been impossible because of 20 performance limitations in fusion detection algorithms. By engineering a new, more precise, 21 algorithm and statistical approaches to post-analysis of fusions called in TCGA data, we report 22 new recurrent gene fusions, including those that could be druggable; new candidate pan-cancer 23 oncogenes based on their profiles in fusions; and prevalent, previously overlooked, candidate 24 oncogenic gene fusions in ovarian cancer, a disease with minimal treatment advances in recent 25 decades.
    [Show full text]
  • Arnau Soler2019.Pdf
    This thesis has been submitted in fulfilment of the requirements for a postgraduate degree (e.g. PhD, MPhil, DClinPsychol) at the University of Edinburgh. Please note the following terms and conditions of use: This work is protected by copyright and other intellectual property rights, which are retained by the thesis author, unless otherwise stated. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given. Genetic responses to environmental stress underlying major depressive disorder Aleix Arnau Soler Doctor of Philosophy The University of Edinburgh 2019 Declaration I hereby declare that this thesis has been composed by myself and that the work presented within has not been submitted for any other degree or professional qualification. I confirm that the work submitted is my own, except where work which has formed part of jointly-authored publications has been included. My contribution and those of the other authors to this work are indicated below. I confirm that appropriate credit has been given within this thesis where reference has been made to the work of others. I composed this thesis under guidance of Dr. Pippa Thomson. Chapter 2 has been published in PLOS ONE and is attached in the Appendix A, chapter 4 and chapter 5 are published in Translational Psychiatry and are attached in the Appendix C and D, and I expect to submit chapter 6 as a manuscript for publication.
    [Show full text]
  • Shear Stress Modulates Gene Expression in Normal Human Dermal Fibroblasts
    University of Calgary PRISM: University of Calgary's Digital Repository Graduate Studies The Vault: Electronic Theses and Dissertations 2017 Shear Stress Modulates Gene Expression in Normal Human Dermal Fibroblasts Zabinyakov, Nikita Zabinyakov, N. (2017). Shear Stress Modulates Gene Expression in Normal Human Dermal Fibroblasts (Unpublished master's thesis). University of Calgary, Calgary, AB. doi:10.11575/PRISM/27775 http://hdl.handle.net/11023/3639 master thesis University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. Downloaded from PRISM: https://prism.ucalgary.ca UNIVERSITY OF CALGARY Shear Stress Modulates Gene Expression in Normal Human Dermal Fibroblasts by Nikita Zabinyakov A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE GRADUATE PROGRAM IN BIOMEDICAL ENGINEERING CALGARY, ALBERTA JANUARY 2017 © Nikita Zabinyakov 2017 Abstract Applied mechanical forces, such as those resulting from fluid flow, trigger cells to change their functional behavior or phenotype. However, there is little known about how fluid flow affects fibroblasts. The hypothesis of this thesis is that dermal fibroblasts undergo significant changes of expression of differentiation genes after exposure to fluid flow (or shear stress). To test the hypothesis, human dermal fibroblasts were exposed to laminar steady fluid flow for 20 and 40 hours and RNA was collected for microarray analysis.
    [Show full text]
  • Hominin-Specific NOTCH2 Paralogs Expand Human Cortical Neurogenesis
    bioRxiv preprint doi: https://doi.org/10.1101/221358; this version posted November 17, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Hominin-specific NOTCH2 paralogs expand human cortical neurogenesis through regulation of Delta/Notch interactions. Ikuo K. Suzuki1,2, David Gacquer1, Roxane Van Heurck1,2, Devesh Kumar1,2, Marta Wojno1,2, Angéline Bilheu1,2, Adèle Herpoel1,2, Julian Chéron1,2, Franck Polleux6, Vincent Detours1, and Pierre Vanderhaeghen1,2,3,4,5*. 1 Université Libre de Bruxelles (ULB), Institute for Interdisciplinary Research (IRIBHM), B-1070 Brussels, Belgium 2 ULB Institute of Neuroscience (UNI), B-1070 Brussels, Belgium 3 WELBIO, Université Libre de Bruxelles, B-1070 Brussels, Belgium 4 VIB, Center for Brain and Disease Research, B-3000 Leuven, Belgium 5 University of Leuven (KU Leuven), Department of Neurosciences, B-3000 Leuven, Belgium 6 Department of Neuroscience, Columbia University Medical Center, Columbia University, New York, NY 10027, USA *Correspondence and Lead Contact: [email protected] Keywords Human brain development, human brain evolution, neurogenesis, Notch pathway, cerebral cortex bioRxiv preprint doi: https://doi.org/10.1101/221358; this version posted November 17, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Summary The human cerebral cortex has undergone rapid expansion and increased complexity during recent evolution. Hominid-specific gene duplications represent a major driving force of evolution, but their impact on human brain evolution remains unclear.
    [Show full text]
  • Conserved Exchange of Paralog Proteins During Neuronal
    bioRxiv preprint doi: https://doi.org/10.1101/2021.07.22.453347; this version posted July 23, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Conserved exchange of paralog proteins during neuronal 2 differentiation 3 4 Domenico Di Fraia1, Mihaela Anitei1, Marie-Therese Mackmull*2, Luca Parca*3, Laura 5 Behrendt1, Amparo Andres-Pons4, Darren Gilmour5, Manuela Helmer Citterich3, Christoph 6 Kaether1, Martin Beck6 and Alessandro Ori1# 7 8 Affiliations 9 1 - Leibniz Institute on Aging - Fritz Lipmann Institute (FLI) Beutenbergstraße 1107745 Jena, Germany 10 2 - ETH Zurich Institute of Molecular Systems Biology Otto-Stern-Weg 3, 8093 Zürich, Switzerland 11 3 - Department of Biology, University of Tor Vergata, Rome, Italy 12 4 - European Molecular Biology Laboratory - EMBL, Meyerhofstraße 1, 69117, Heidelberg, Germany 13 5 - University of Zurich, Department of Molecular Life Sciences, Rämistrasse 71 CH-8006 Zürich, Switzerland 14 6 - Max Planck Institute of Biophysics, department of Molecular Sociology, Max-von-Laue-Straße 3, 60438 Frankfurt am Main 15 16 * contributed equally # 17 correspondence to [email protected] 18 19 20 21 22 Abstract 23 Gene duplication enables the emergence of new functions by lowering the general 24 evolutionary pressure. Previous studies have highlighted the role of specific paralog genes 25 during cell differentiation, e.g., in chromatin remodeling complexes. It remains unexplored 26 whether similar mechanisms extend to other biological functions and whether the regulation 27 of paralog genes is conserved across species.
    [Show full text]
  • Transcriptional Fates of Human-Specific Segmental Duplications in Brain
    Downloaded from genome.cshlp.org on September 27, 2021 - Published by Cold Spring Harbor Laboratory Press Method Transcriptional fates of human-specific segmental duplications in brain Max L. Dougherty,1,7 Jason G. Underwood,1,2,7 Bradley J. Nelson,1 Elizabeth Tseng,2 Katherine M. Munson,1 Osnat Penn,1 Tomasz J. Nowakowski,3,4 Alex A. Pollen,5 and Evan E. Eichler1,6 1Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA; 2Pacific Biosciences (PacBio) of California, Incorporated, Menlo Park, California 94025, USA; 3Department of Anatomy, 4Department of Psychiatry, 5Department of Neurology, University of California, San Francisco, San Francisco, California 94158, USA; 6Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA Despite the importance of duplicate genes for evolutionary adaptation, accurate gene annotation is often incomplete, in- correct, or lacking in regions of segmental duplication. We developed an approach combining long-read sequencing and hybridization capture to yield full-length transcript information and confidently distinguish between nearly identical genes/paralogs. We used biotinylated probes to enrich for full-length cDNA from duplicated regions, which were then am- plified, size-fractionated, and sequenced using single-molecule, long-read sequencing technology, permitting us to distin- guish between highly identical genes by virtue of multiple paralogous sequence variants. We examined 19 gene families as expressed in developing and adult human brain, selected for their high sequence identity (average >99%) and overlap with human-specific segmental duplications (SDs). We characterized the transcriptional differences between related paralogs to better understand the birth–death process of duplicate genes and particularly how the process leads to gene innovation.
    [Show full text]
  • Interlocus Gene Conversion Events Introduce Deleterious Mutations Into at Least 1% of Human Genes Associated with Inherited Disease
    Supplemental Material for: Interlocus gene conversion events introduce deleterious mutations into at least 1% of human genes associated with inherited disease Claudio Casola1, Ugne Zekonyte1, Andrew D. Phillips2, David N. Cooper2, and Matthew W. Hahn1,3 1Department of Biology and 3School of Informatics and Computing, Indiana University, Bloomington, IN 47405 2 Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom. Corresponding author: Claudio Casola Department of Biology, Indiana University 1001 East 3rd Street, Bloomington, IN 47405 Phone: 812-856-7016 Fax: 812-855-6705 Email: [email protected] 1 Methods Validation of disease alleles originating from interlocus gene conversion All candidate IGC-derived disease alleles identified by our BLAST searches were further examined to identify high-confidence IGC events. Such alleles required at least one disease-associated mutation plus another mutation to be shared by the IGC donor sequence. In addition, we excluded from our data set those cases where hypermutable CpG sites constituted all the diagnostic mutations. After careful examination of the available literature, we also disregarded several putative IGC events that were deemed likely to represent experimental artifacts. In particular, we identified several cases where primer pairs, which were originally considered to have amplified a single genomic locus or cDNA transcript, also appeared to be capable of amplifying sequences paralogous to the target genes. Such instances included disease alleles from the genes CFC1 (Bamford et al. 2000), GPD2 (Novials et al. 1997) and HLA-DRB1 (Velickovic et al. 2004). In the first two of these cases, we noted that the primer pairs that had been originally designed to amplify the genes specifically, also perfectly matched the paralogous gene CFC1B and an unlinked processed pseudogene of GPD2, respectively.
    [Show full text]