From Sequence to Phenotype: the Impact of Deleterious Variation in Livestock

Total Page:16

File Type:pdf, Size:1020Kb

From Sequence to Phenotype: the Impact of Deleterious Variation in Livestock From Sequence to Phenotype: From Sequence to Phenotype: The Impact of Deleterious Variation in Livestock Martijn F.L. Derks The Impact of Deleterious Variation in Livestock From sequence to phenotype: the impact of deleterious variation in livestock Martijn F.L. Derks Thesis committee Promotor Prof. Dr M.A.M. Groenen Professor of Animal Breeding and Genomics Wageningen University & Research Co-promotor Dr H.-J.W.C. Megens Assistant professor, Animal Breeding and Genomics Wageningen University & Research Other members Dr C. Charlier, University of Liège, Belgium Dr P. Knap, Genus-PIC, Rabenkirchen-Fauluck, Germany Prof. Dr B.J. Zwaan, Wageningen University & Research, Netherlands Prof. Dr Cord Drögemüller, University of Bern, Switzerland This research was conducted under the auspices of the Graduate School of Wageningen Institute of Animal Sciences (WIAS). From sequence to phenotype: the impact of deleterious variation in livestock Martijn F.L. Derks Thesis submitted in fulfillment of the requirements for the degree of doctor at Wageningen University by the authority of the Rector Magnificus, Prof. Dr A.P.J. Mol, in the presence of the Thesis Committee appointed by the Academic Board to be defended in public on Wednesday 2 September, 2020 at 4. p.m. in the Aula. Martijn F.L. Derks From sequence to phenotype: the impact of deleterious variation in livestock PhD thesis, Wageningen University, the Netherlands (2020) With references, with summary in English ISBN 978-94-6395-320-7 DOI: https://doi.org/10.18174/515207 Abstract Derks, MFL. (2020). From sequence to phenotype: the impact of deleterious variation in livestock. PhD thesis, Wageningen University, the Netherlands The genome provides a blueprint of life containing the instruction, together with the environment, that determine the phenotype. In animal breeding, we try to understand this relationship between an animals genetic code (genome sequence) and its performance (phenotype) to select the best performing animals for the next generation. Recently, rapid improvements in genome sequencing have opened up new possibilities to explore and use the complete set of genetic variation seen in (livestock) animals that can be exploited by scientists to try and further close the genotype-phenotype link. However, despite the vast increase of molecular data, pinpointing the exact variants underlying a phenotype of interest is still challenging. In this thesis I provide an in-depth analysis of population genomics and transcriptomics data to identify deleterious and functional variation in livestock populations. More specifically, I report several variants that causes lethality in homozygous state within different stages of development. Moreover, I pinpoint the exact causal mutations and describe its functional consequences at the molecular, phenotypic, and population level. Subsequently, I focus on the identification of functional variation underlying important selection traits. I combine various sources of functional (epi)genomic data to predict the impact of variation in livestock. Together I provide a comprehensive overview of high-impact variation and molecular mechanisms affecting important phenotypes in various livestock breeds, and discuss that molecular genomics could benefit genomic prediction in livestock. 5 Contents 9 1 – General introduction 23 2 – A systematic survey to identify lethal recessive variation in highly managed pig populations 45 3 – Early and late feathering in turkey and chicken: same gene but different mutations 57 4 – A survey of functional genomic variation in domesticated chickens 81 5 – Balancing selection on a recessive lethal deletion with pleiotropic effects on two neighboring genes in the porcine genome 105 6 – Loss of function mutations in essential genes cause embryonic lethality in pigs 129 7 – Detection of a frameshift deletion in the SPTBN4 gene leads to prevention of severe myopathy and postnatal mortality in pigs 145 8 – Accelerated discovery of functional variation in pigs 167 9 – General discussion 186 References 209 Summary 213 Curriculum Vitae 221 Acknowledgements 1 General introduction 1 General introduction 1.1 Genome variation The central dogma in molecular biology describes a process by which DNA is transcribed to RNA which is translated to proteins (Crick 1970). The proteins perform most of the work in our cells, which in return are responsible for the regulation, structure, and functioning of the individuals tissues and organs. The DNA provides a blueprint of life containing the instruction to grow, develop, survive, and reproduce. The complete genetic code (i.e. the genome sequence) defines the phenotype of an individual (Lehner 2013). Hence, the differences (i.e. genetic variation) in genome sequences contribute to the observed phenotypic variation in a population. Animal breeders try to utilize variation in genomes to select the best performing animals for the next generation (Georges et al. 2019). However, until recently, the characterization of the entire genome sequence of an individual was a costly and time-consuming procedure (Shendure et al. 2017). Yet, rapid improvements in genome sequencing have opened up new possibilities to explore and use the complete set of genetic variation seen in (livestock) animals. Within the department of Animal Breeding and Genomics at Wageningen University, hundreds of animals from various livestock populations have been sequenced. The sequences can be mapped back to a reference genome, built from a single individual to produce a high quality genome (Hillier et al. 2004; Groenen et al. 2012), to explore the millions of genetic variants present in the population. Most of these variants are substitutions of a single nucleotide that occurs at a specific position in the genome (SNPs), or short deletions or insertions (indels). Until recently, the livestock genomes were of much lower quality compared to the human or mouse genome. With the developments in long read sequencing technology, much improved reference genomes for chicken, cattle, and pig have been created (Warren et al. 2017; Warr et al. 2019). These improved reference genomes have significantly improved variant discovery and further downstream analysis. 1.2 From sequence to phenotype Domestication constitutes a long-term biological experiment, started by early farming societies and currently done by animal breeders, to constantly improve stock. The ultimate goal in livestock genomics is to understand the relationship between the genome of the animal and its phenotype (i.e. the performance). However, this link between an animals genetic code, and the performance is extremely complex (Gjuvsland et al. 2013). 11 1 General introduction A major breakthrough in animal breeding was provided with the discovery of genomic selection, at the start of this century (Meuwissen et al. 2001), which led to enhanced rates of genetic improvement and shortening of generation intervals. In general, genomic selection uses a variant panel on a chip (SNP chip) that is distributed across the genome, allowing to capture within-breed genetic variation used to select the best animals for breeding. By that, a wealth of genomic information has become available for many species, including livestock (Dekkers 2012). However, genomic selection uses the genome as a black box, as the SNPs on the chip are not causal, but genetically linked to the actual causal variants and genes (Habier et al. 2013). Despite the remarkable success of genomic selection, the molecular drivers underlying the selection traits are still largely unknown. One factor that explains the success of genomic selection is that most important traits (e.g. growth, fertility) are actually regulated by a large set of genes (Hayes and Goddard 2001), with often subtle effects (polygenic traits), while only a few important traits are inherited by a variant in a single gene, or a small subset of genes with large effects (monogenic traits). To unravel the genetic factors underlying these mono- and polygenic traits requires both genomic and phenotypic information to associate regions in the DNA with important traits, the so-called quantitative trait loci (QTL). Although thousands of such QTL have been reported by now (Hu et al. 2019), identification of the causal genes and variants underlying these QTL is still a major challenge. Hence, only few functional variants have been discovered that are currently used in animal breeding (Goddard et al. 2016). An additional type of functional (high-impact) variants are those that exhibit harmful effects (i.e. deleterious). Inferring the function of deleterious variants (e.g. underlying recessive defects), and underpinning the genetic basis of the disorder is still a major challenge. However, variants underlying severe phenotypes can likely be identified more easily compared to variation with small effects on important selection traits, thereby providing a starting point to study the genotype-phenotype link in livestock. 12 1 General introduction Glossary SNP: Single nucleotide polymorphism, resulting in another nucleotide at one single base in the genome. Indel: Small deletion or insertion of bases in the genome. Haplotype: A group of alleles in an organism that are inherited together from a single parent. Genetic drift: The random fluctuations of allele frequencies from generation to generation. Mutation load: The total genetic burden in a population resulting from accumulated deleterious mutations Recessive lethal: A variant resulting in a lethal phenotype in homozygous
Recommended publications
  • Near-Atomic Structures of the Bbsome Reveal the Basis for Bbsome
    RESEARCH ARTICLE Near-atomic structures of the BBSome reveal the basis for BBSome activation and binding to GPCR cargoes Shuang Yang1†, Kriti Bahl2†, Hui-Ting Chou1‡, Jonathan Woodsmith3, Ulrich Stelzl3, Thomas Walz1*, Maxence V Nachury2* 1Laboratory of Molecular Electron Microscopy, The Rockefeller University, New York, United States; 2Department of Ophthalmology, University of California San Francisco, San Francisco, United States; 3Department of Pharmaceutical Chemistry, Institute of Pharmaceutical Sciences, University of Graz and BioTechMed-Graz, Graz, Austria Abstract Dynamic trafficking of G protein-coupled receptors (GPCRs) out of cilia is mediated by the BBSome. In concert with its membrane recruitment factor, the small GTPase ARL6/BBS3, the BBSome ferries GPCRs across the transition zone, a diffusion barrier at the base of cilia. Here, we present the near-atomic structures of the BBSome by itself and in complex with ARL6GTP, and we describe the changes in BBSome conformation induced by ARL6GTP binding. Modeling the *For correspondence: interactions of the BBSome with membranes and the GPCR Smoothened (SMO) reveals that SMO, [email protected] (TW); and likely also other GPCR cargoes, must release their amphipathic helix 8 from the membrane to [email protected] (MVN) be recognized by the BBSome. †These authors contributed equally to this work Present address: ‡Department of Therapeutic Discovery, Amgen Introduction Inc, South San Francisco, United Cilia dynamically concentrate signaling receptors to sense and transduce signals as varied as light, States odorant molecules, Hedgehog morphogens and ligands of G protein-coupled receptors (GPCRs) Competing interests: The (Anvarian et al., 2019; Bangs and Anderson, 2017; Nachury and Mick, 2019). Highlighting the authors declare that no functional importance of dynamic ciliary trafficking, the appropriate transduction of Hedgehog signal competing interests exist.
    [Show full text]
  • Regulation and Dysregulation of Chromosome Structure in Cancer
    Regulation and Dysregulation of Chromosome Structure in Cancer The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Hnisz, Denes et al. “Regulation and Dysregulation of Chromosome Structure in Cancer.” Annual Review of Cancer Biology 2, 1 (March 2018): 21–40 © 2018 Annual Reviews As Published https://doi.org/10.1146/annurev-cancerbio-030617-050134 Version Author's final manuscript Citable link http://hdl.handle.net/1721.1/117286 Terms of Use Creative Commons Attribution-Noncommercial-Share Alike Detailed Terms http://creativecommons.org/licenses/by-nc-sa/4.0/ Regulation and dysregulation of chromosome structure in cancer Denes Hnisz1*, Jurian Schuijers1, Charles H. Li1,2, Richard A. Young1,2* 1 Whitehead Institute for Biomedical Research, 455 Main Street, Cambridge, MA 02142, USA 2 Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA * Corresponding authors Corresponding Authors: Denes Hnisz Whitehead Institute for Biomedical Research 455 Main Street Cambridge, MA 02142 Tel: (617) 258-7181 Fax: (617) 258-0376 [email protected] Richard A. Young Whitehead Institute for Biomedical Research 455 Main Street Cambridge, MA 02142 Tel: (617) 258-5218 Fax: (617) 258-0376 [email protected] 1 Summary Cancer arises from genetic alterations that produce dysregulated gene expression programs. Normal gene regulation occurs in the context of chromosome loop structures called insulated neighborhoods, and recent studies have shown that these structures are altered and can contribute to oncogene dysregulation in various cancer cells. We review here the types of genetic and epigenetic alterations that influence neighborhood structures and contribute to gene dysregulation in cancer, present models for insulated neighborhoods associated with the most prominent human oncogenes, and discuss how such models may lead to further advances in cancer diagnosis and therapy.
    [Show full text]
  • Ciliopathiesneuromuscularciliopathies Disorders Disorders Ciliopathiesciliopathies
    NeuromuscularCiliopathiesNeuromuscularCiliopathies Disorders Disorders CiliopathiesCiliopathies AboutAbout EGL EGL Genet Geneticsics EGLEGL Genetics Genetics specializes specializes in ingenetic genetic diagnostic diagnostic testing, testing, with with ne nearlyarly 50 50 years years of of clinical clinical experience experience and and board-certified board-certified labor laboratoryatory directorsdirectors and and genetic genetic counselors counselors reporting reporting out out cases. cases. EGL EGL Genet Geneticsics offers offers a combineda combined 1000 1000 molecular molecular genetics, genetics, biochemical biochemical genetics,genetics, and and cytogenetics cytogenetics tests tests under under one one roof roof and and custom custom test testinging for for all all medically medically relevant relevant genes, genes, for for domestic domestic andand international international clients. clients. EquallyEqually important important to to improving improving patient patient care care through through quality quality genetic genetic testing testing is is the the contribution contribution EGL EGL Genetics Genetics makes makes back back to to thethe scientific scientific and and medical medical communities. communities. EGL EGL Genetics Genetics is is one one of of only only a afew few clinical clinical diagnostic diagnostic laboratories laboratories to to openly openly share share data data withwith the the NCBI NCBI freely freely available available public public database database ClinVar ClinVar (>35,000 (>35,000 variants variants on on >1700 >1700 genes) genes) and and is isalso also the the only only laboratory laboratory with with a a frefree oen olinnlein dea dtabtaabsaes (eE m(EVmCVlaCslas)s,s f)e, afetuatruinrgin ag vaa vraiarniatn ctl acslasisfiscifiactiaotino sne saercahrc ahn adn rde rpeoprot rrte rqeuqeuset sint tinetrefarcfaec, ew, hwichhic fha cfailcitialiteatse rsa praidp id interactiveinteractive curation curation and and reporting reporting of of variants.
    [Show full text]
  • Synergistic Genetic Interactions Between Pkhd1 and Pkd1 Result in an ARPKD-Like Phenotype in Murine Models
    BASIC RESEARCH www.jasn.org Synergistic Genetic Interactions between Pkhd1 and Pkd1 Result in an ARPKD-Like Phenotype in Murine Models Rory J. Olson,1 Katharina Hopp ,2 Harrison Wells,3 Jessica M. Smith,3 Jessica Furtado,1,4 Megan M. Constans,3 Diana L. Escobar,3 Aron M. Geurts,5 Vicente E. Torres,3 and Peter C. Harris 1,3 Due to the number of contributing authors, the affiliations are listed at the end of this article. ABSTRACT Background Autosomal recessive polycystic kidney disease (ARPKD) and autosomal dominant polycystic kidney disease (ADPKD) are genetically distinct, with ADPKD usually caused by the genes PKD1 or PKD2 (encoding polycystin-1 and polycystin-2, respectively) and ARPKD caused by PKHD1 (encoding fibrocys- tin/polyductin [FPC]). Primary cilia have been considered central to PKD pathogenesis due to protein localization and common cystic phenotypes in syndromic ciliopathies, but their relevance is questioned in the simple PKDs. ARPKD’s mild phenotype in murine models versus in humans has hampered investi- gating its pathogenesis. Methods To study the interaction between Pkhd1 and Pkd1, including dosage effects on the phenotype, we generated digenic mouse and rat models and characterized and compared digenic, monogenic, and wild-type phenotypes. Results The genetic interaction was synergistic in both species, with digenic animals exhibiting pheno- types of rapidly progressive PKD and early lethality resembling classic ARPKD. Genetic interaction be- tween Pkhd1 and Pkd1 depended on dosage in the digenic murine models, with no significant enhancement of the monogenic phenotype until a threshold of reduced expression at the second locus was breached.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Supplemental Information Proximity Interactions Among Centrosome
    Current Biology, Volume 24 Supplemental Information Proximity Interactions among Centrosome Components Identify Regulators of Centriole Duplication Elif Nur Firat-Karalar, Navin Rauniyar, John R. Yates III, and Tim Stearns Figure S1 A Myc Streptavidin -tubulin Merge Myc Streptavidin -tubulin Merge BirA*-PLK4 BirA*-CEP63 BirA*- CEP192 BirA*- CEP152 - BirA*-CCDC67 BirA* CEP152 CPAP BirA*- B C Streptavidin PCM1 Merge Myc-BirA* -CEP63 PCM1 -tubulin Merge BirA*- CEP63 DMSO - BirA* CEP63 nocodazole BirA*- CCDC67 Figure S2 A GFP – + – + GFP-CEP152 + – + – Myc-CDK5RAP2 + + + + (225 kDa) Myc-CDK5RAP2 (216 kDa) GFP-CEP152 (27 kDa) GFP Input (5%) IP: GFP B GFP-CEP152 truncation proteins Inputs (5%) IP: GFP kDa 1-7481-10441-1290218-1654749-16541045-16541-7481-10441-1290218-1654749-16541045-1654 250- Myc-CDK5RAP2 150- 150- 100- 75- GFP-CEP152 Figure S3 A B CEP63 – – + – – + GFP CCDC14 KIAA0753 Centrosome + – – + – – GFP-CCDC14 CEP152 binding binding binding targeting – + – – + – GFP-KIAA0753 GFP-KIAA0753 (140 kDa) 1-496 N M C 150- 100- GFP-CCDC14 (115 kDa) 1-424 N M – 136-496 M C – 50- CEP63 (63 kDa) 1-135 N – 37- GFP (27 kDa) 136-424 M – kDa 425-496 C – – Inputs (2%) IP: GFP C GFP-CEP63 truncation proteins D GFP-CEP63 truncation proteins Inputs (5%) IP: GFP Inputs (5%) IP: GFP kDa kDa 1-135136-424425-4961-424136-496FL Ctl 1-135136-424425-4961-424136-496FL Ctl 1-135136-424425-4961-424136-496FL Ctl 1-135136-424425-4961-424136-496FL Ctl Myc- 150- Myc- 100- CCDC14 KIAA0753 100- 100- 75- 75- GFP- GFP- 50- CEP63 50- CEP63 37- 37- Figure S4 A siCtl
    [Show full text]
  • Recent Advances in Drosophila Models of Charcot-Marie-Tooth Disease
    International Journal of Molecular Sciences Review Recent Advances in Drosophila Models of Charcot-Marie-Tooth Disease Fukiko Kitani-Morii 1,2,* and Yu-ichi Noto 2 1 Department of Molecular Pathobiology of Brain Disease, Kyoto Prefectural University of Medicine, Kyoto 6028566, Japan 2 Department of Neurology, Kyoto Prefectural University of Medicine, Kyoto 6028566, Japan; [email protected] * Correspondence: [email protected]; Tel.: +81-75-251-5793 Received: 31 August 2020; Accepted: 6 October 2020; Published: 8 October 2020 Abstract: Charcot-Marie-Tooth disease (CMT) is one of the most common inherited peripheral neuropathies. CMT patients typically show slowly progressive muscle weakness and sensory loss in a distal dominant pattern in childhood. The diagnosis of CMT is based on clinical symptoms, electrophysiological examinations, and genetic testing. Advances in genetic testing technology have revealed the genetic heterogeneity of CMT; more than 100 genes containing the disease causative mutations have been identified. Because a single genetic alteration in CMT leads to progressive neurodegeneration, studies of CMT patients and their respective models revealed the genotype-phenotype relationships of targeted genes. Conventionally, rodents and cell lines have often been used to study the pathogenesis of CMT. Recently, Drosophila has also attracted attention as a CMT model. In this review, we outline the clinical characteristics of CMT, describe the advantages and disadvantages of using Drosophila in CMT studies, and introduce recent advances in CMT research that successfully applied the use of Drosophila, in areas such as molecules associated with mitochondria, endosomes/lysosomes, transfer RNA, axonal transport, and glucose metabolism.
    [Show full text]
  • Centrin2 Regulates CP110 Removal in Primary Cilium Formation
    Published March 9, 2015 JCB: Report Centrin2 regulates CP110 removal in primary cilium formation Suzanna L. Prosser and Ciaran G. Morrison Centre for Chromosome Biology, School of Natural Sciences, National University of Ireland, Galway, Galway, Ireland rimary cilia are antenna-like sensory microtubule ciliogenesis in chicken DT40 B lymphocytes required cen- structures that extend from basal bodies, plasma trin2. We disrupted CETN2 in human retinal pigmented P membrane–docked mother centrioles. Cellular qui- epithelial cells, and despite having intact centrioles, they escence potentiates ciliogenesis, but the regulation of basal were unable to make cilia upon serum starvation, show- body formation is not fully understood. We used reverse ing abnormal localization of distal appendage proteins genetics to test the role of the small calcium-binding pro- and failing to remove the ciliation inhibitor CP110. Knock- tein, centrin2, in ciliogenesis. Primary cilia arise in most down of CP110 rescued ciliation in CETN2-deficient cells. cell types but have not been described in lymphocytes. Thus, centrin2 regulates primary ciliogenesis through con- We show here that serum starvation of transformed, cul- trolling CP110 levels. Downloaded from tured B and T cells caused primary ciliogenesis. Efficient Introduction Primary cilia are crucial for several signal transduction path- the immune synapse, a membrane region at which cells of on April 19, 2017 ways (Goetz and Anderson, 2010). Their assembly is an or- the immune system contact their target cells, helps to estab- dered process that must be closely integrated with the cell cycle lish a specialized structure that has several similarities with cilia because of the dual roles of centrioles in ciliation and in mito- (Stinchcombe and Griffiths, 2014).
    [Show full text]
  • WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT (51) International Patent Classification: CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, C12Q 1/68 (2018.01) A61P 31/18 (2006.01) DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, C12Q 1/70 (2006.01) HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, (21) International Application Number: MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, PCT/US2018/056167 OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, (22) International Filing Date: SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, 16 October 2018 (16. 10.2018) TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (25) Filing Language: English (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, (26) Publication Language: English GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, (30) Priority Data: UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, 62/573,025 16 October 2017 (16. 10.2017) US TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, ΓΕ , IS, IT, LT, LU, LV, (71) Applicant: MASSACHUSETTS INSTITUTE OF MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TECHNOLOGY [US/US]; 77 Massachusetts Avenue, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, Cambridge, Massachusetts 02139 (US).
    [Show full text]
  • Variation in Protein Coding Genes Identifies Information
    bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 1 1 2 3 4 5 Variation in protein coding genes identifies information flow as a contributor to 6 animal complexity 7 8 Jack Dean, Daniela Lopes Cardoso and Colin Sharpe* 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Institute of Biological and Biomedical Sciences 25 School of Biological Science 26 University of Portsmouth, 27 Portsmouth, UK 28 PO16 7YH 29 30 * Author for correspondence 31 [email protected] 32 33 Orcid numbers: 34 DLC: 0000-0003-2683-1745 35 CS: 0000-0002-5022-0840 36 37 38 39 40 41 42 43 44 45 46 47 48 49 Abstract bioRxiv preprint doi: https://doi.org/10.1101/679456; this version posted June 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Animal complexity and information flow 2 1 Across the metazoans there is a trend towards greater organismal complexity. How 2 complexity is generated, however, is uncertain. Since C.elegans and humans have 3 approximately the same number of genes, the explanation will depend on how genes are 4 used, rather than their absolute number.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • Human DNA Glycosylase NEIL1's Interactions with Downstream
    Biomolecules 2012, 2, 564-578; doi:10.3390/biom2040564 OPEN ACCESS biomolecules ISSN 2218-273X www.mdpi.com/journal/biomolecules/ Article Human DNA Glycosylase NEIL1’s Interactions with Downstream Repair Proteins Is Critical for Efficient Repair of Oxidized DNA Base Damage and Enhanced Cell Survival Muralidhar L. Hegde 1,2, Pavana M. Hegde 1, Dutta Arijit 1, Istvan Boldogh 3 and Sankar Mitra 1,* 1 Department of Biochemistry and Molecular Biology, University of Texas Medical Branch (UTMB) at Galveston, Texas 77555-1079, USA; E-Mails: [email protected] (M.L.H.); [email protected] (P.M.H.); [email protected] (D.A.) 2 Department of Neurology, University of Texas Medical Branch (UTMB) at Galveston, Texas 77555, USA 3 Department of Microbiology and Immunology, University of Texas Medical Branch (UTMB) at Galveston, Texas 77555, USA; E-Mail: [email protected] (I.B.) * Author to whom correspondence should be addressed; E-Mail: [email protected] (S.M.); Tel.: +1-409-772-1780; Fax: +1-409-747-8608. Received: 15 October 2012; in revised form: 7 November 2012 / Accepted: 9 November 2012 / Published: 15 November 2012 Abstract: NEIL1 is unique among the oxidatively damaged base repair-initiating DNA glycosylases in the human genome due to its S phase-specific activation and ability to excise substrate base lesions from single-stranded DNA. We recently characterized NEIL1’s specific binding to downstream canonical repair and non-canonical accessory proteins, all of which involve NEIL1’s disordered C-terminal segment as the common interaction domain (CID). This domain is dispensable for NEIL1’s base excision and abasic (AP) lyase activities, but is required for its interactions with other repair proteins.
    [Show full text]