Evolution of Y Chromosome Ampliconic Genes in Great Apes

The Pennsylvania State University
The Graduate School

EVOLUTION OF Y CHROMOSOME AMPLICONIC GENES IN GREAT APES

A Dissertation in Bioinformatics and Genomics
by
Rahulsimham Vegesna

© 2020 Rahulsimham Vegesna

Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

May 2020

The dissertation of Rahulsimham Vegesna was reviewed and approved by the following:

Paul Medvedev
Associate Professor of Computer Science & Engineering
Associate Professor of Biochemistry & Molecular Biology
Dissertation Co-Adviser
Co-Chair of Committee

Kateryna D. Makova
Pentz Professor of Biology
Dissertation Co-Adviser
Co-Chair of Committee

Michael DeGiorgio
Associate Professor of Biology and Statistics

Wansheng Liu
Professor of Animal Genomics

George H. Perry
Chair, Intercollege Graduate Degree Program in Bioinformatics and Genomics
Associate Professor of Anthropology and Biology

ABSTRACT

In addition to the sex-determining gene SRY and several other single-copy genes, the human Y chromosome harbors nine multi-copy gene families which are expressed exclusively in testis. In humans, these gene families are important for spermatogenesis and their loss is observed in patients suffering from infertility. However, only five of the nine ampliconic gene families are found across great apes, while others are missing or pseudogenized in some species. My research goal is to understand the evolution of the Y ampliconic gene families in humans and in non-human great ape species. The specific objectives I addressed in this dissertation are:

1. To test whether Y ampliconic gene expression levels depend on their copy number and whether there is a gene dosage compensation to counteract the ampliconic gene copy number variation observed in humans. For the nine ampliconic gene families found in humans, the copy number and expression levels were estimated in 149 men.
Across Y ampliconic gene families, families with higher copy number have higher expression. Within a given gene family, however, copy number does not influence gene expression; rather, a high tolerance for variation in gene expression was observed in testis of presumably healthy men. We also found that expression of five Y ampliconic gene families is coordinated with that of their non-Y (i.e., X or autosomal) homologs. Indeed, five ampliconic gene families had consistently lower expression levels when compared to their non-Y homologs, suggesting dosage regulation, while the HSFY family had higher expression levels than its X homolog and thus lacked dosage regulation.

2. To test whether the Y ampliconic gene copy number and gene expression levels are conserved across great apes. For the ampliconic gene families found in great apes, the copy number and expression levels were estimated in independent datasets ranging from two to 14 samples per species. Our results indicate high variability in gene family size but conservation in gene expression levels in Y ampliconic gene families. This relationship was similar to what was observed in humans. However, for three gene families, size was positively correlated with gene expression levels across species, suggesting that, given sufficient evolutionary time, copy number influences gene expression on the Y chromosome.

3. To study the dynamics of gene (and gene family) loss and gain in great ape Y chromosomes. Given the assemblies and alignments of great ape Y chromosomes, we determined the gene content on the Y chromosome of bonobo and orangutan. We then reconstructed the evolutionary history of gene content across great apes and observed an increased rate of gene loss in the genus Pan (bonobo and chimpanzee) compared to other great apes. The human palindromes P6 and P7, which are devoid of known ampliconic genes, are conserved across great apes.
The potential reason for their conservation is the presence of possible gene expression regulators, rather than genes, on these palindromes.

The results of this dissertation significantly advance our understanding of Y chromosome evolution in great apes. They provide an overview of variation in gene copy number and expression levels of these highly similar gene families, which have previously been a challenge to study.

Table of Contents

LIST OF TABLES
LIST OF FIGURES
ACKNOWLEDGMENTS
Chapter 1. Introduction
  References
Chapter 2. Dosage regulation, and variation in gene expression and copy number at human Y chromosome ampliconic genes
  Abstract
  Introduction
  Results
    AmpliCoNE: Ampliconic Copy Number Estimator
    Y ampliconic gene copy number estimates
    Y ampliconic gene families with low copy number in humans are frequently deleted in non-human great apes
    Y ampliconic gene expression
    More copious gene families have higher gene expression levels
    Within a family, copy number and gene expression are not correlated
    Y haplogroups and ampliconic gene families
    The role of age in ampliconic gene expression
    Ampliconic gene dosage regulation
  Discussion
    Variability in Y ampliconic gene copy number
    Variability in Y ampliconic gene expression
    Dosage regulation of Y ampliconic genes
  Materials and Methods
    AmpliCoNE: Ampliconic Copy Number Estimator
    Simulation-based validation of AmpliCoNE
    Datasets
    Pipeline for human WGS analysis
    Experimental validation with droplet digital PCR (ddPCR)
    Estimating gene expression levels
    Human Y haplogroup determination
    Code availability
  References
Chapter 3. Ampliconic genes on the great ape Y chromosomes: Rapid evolution of copy number but conservation of expression levels
  Abstract
  Introduction
  Results
    Dynamic evolution of Y ampliconic gene copy number
    Conservation of Y ampliconic gene expression in great apes
    The relationship between copy number and gene expression levels
    Y ampliconic gene copy number variation and phenotypes related to sperm competition
  Discussion
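
The abstract reports that, within a gene family, copy number and expression are not correlated across individuals. One common way to check such a relationship is a rank correlation between per-individual copy number and expression. The sketch below is only an illustration under stated assumptions: the choice of Spearman's coefficient, the function names, and the numbers are all hypothetical and are not taken from the dissertation or from AmpliCoNE.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; assumes neither input is constant."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values for one gene family in six individuals
# (illustrative only; not data from the dissertation).
copy_number = [22, 25, 30, 28, 35, 24]
expression = [5.1, 4.8, 5.0, 5.3, 4.9, 5.2]

rho = spearman(copy_number, expression)
print(f"Spearman rho = {rho:.2f}")
```

A coefficient near zero would be consistent with "no within-family correlation", while a clearly positive value would resemble the across-species pattern the abstract describes for three gene families. A real analysis would also need a significance test and correction for multiple gene families.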
Recommended publications
  • Prenatal Diagnosis of Sex Chromosome Mosaicism with Two Marker Chromosomes in Three Cell Lines and a Review of the Literature
    MOLECULAR MEDICINE REPORTS 19: 1791-1796, 2019 Prenatal diagnosis of sex chromosome mosaicism with two marker chromosomes in three cell lines and a review of the literature JIANLI ZHENG1, XIAOYU YANG2, HAIYAN LU1, YONGJUAN GUAN1, FANGFANG YANG1, MENGJUN XU1, MIN LI1, XIUQING JI3, YAN WANG3, PING HU3 and YUN ZHOU1 1Department of Prenatal Diagnosis, Laboratory of Clinical Genetics, Maternity and Child Health Care Hospital, Yancheng, Jiangsu 224001; 2Department of Clinical Reproductive Medicine, State Key Laboratory of Reproductive Medicine, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu 210029; 3Department of Prenatal Diagnosis, State Key Laboratory of Reproductive Medicine, Obstetrics and Gynecology Hospital Affiliated to Nanjing Medical University, Nanjing, Jiangsu 210004, P.R. China Received March 31, 2018; Accepted November 21, 2018 DOI: 10.3892/mmr.2018.9798 Abstract. The present study described the diagnosis of a fetus with sex chromosome mosaicism in three cell lines and two marker chromosomes. A 24-year-old woman underwent amniocentesis at 21 weeks and 4 days of gestation due to noninvasive prenatal testing identifying that the fetus had sex chromosome abnormalities. Amniotic cell culture revealed a karyotype of 45,X[13]/46,X,+mar1[6]/46,X,+mar2[9], and prenatal ultrasound was unremarkable. The woman underwent repeat amniocentesis at 23 weeks and 4 days of gestation for molecular detection. [...] identifying the karyotype, identifying the origin of the marker chromosome and preparing effective genetic counseling. Introduction. Abnormalities involving sex chromosomes account for approximately 0.5% of live births. Individuals with mosaic structural aberrations of the X and Y chromosomes exhibit complicated and variable phenotypes. The phenotypes of
  • Network Medicine Approach for Analysis of Alzheimer's Disease Gene Expression Data
    International Journal of Molecular Sciences Article Network Medicine Approach for Analysis of Alzheimer’s Disease Gene Expression Data David Cohen †, Alexander Pilozzi † and Xudong Huang * Neurochemistry Laboratory, Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA 02129, USA; [email protected] (D.C.); [email protected] (A.P.) * Correspondence: [email protected]; Tel./Fax: +1-617-724-9778 † These authors contributed equally to this work. Received: 15 November 2019; Accepted: 30 December 2019; Published: 3 January 2020 Abstract: Alzheimer’s disease (AD) is the most widespread diagnosed cause of dementia in the elderly. It is a progressive neurodegenerative disease that causes memory loss as well as other detrimental symptoms that are ultimately fatal. Due to the urgent nature of this disease, and the current lack of success in treatment and prevention, it is vital that different methods and approaches are applied to its study in order to better understand its underlying mechanisms. To this end, we have conducted network-based gene co-expression analysis on data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. By processing and filtering gene expression data taken from the blood samples of subjects with varying disease states and constructing networks based on that data to evaluate gene relationships, we have been able to learn about gene expression correlated with the disease, and we have identified several areas of potential research interest. Keywords: Alzheimer’s disease; network medicine; gene expression; neurodegeneration; neuroinflammation 1. Introduction Alzheimer’s disease (AD) is the most widespread diagnosed cause of dementia in the elderly [1].
  • The Role of Chromosome X in Intraocular Pressure Variation and Sex-Specific Effects
    Genetics The Role of Chromosome X in Intraocular Pressure Variation and Sex-Specific Effects Mark J. Simcoe,1–3 Anthony P. Khawaja,4,5 Omar A. Mahroo,3 Christopher J. Hammond,1,2 and Pirro G. Hysi1,2; for the UK Biobank Eye and Vision Consortium 1Department of Ophthalmology, Kings College London, London, United Kingdom 2KCL Department of Twin Research and Genetic Epidemiology, London, United Kingdom 3Institute of Ophthalmology, University College London, London, United Kingdom 4NIHR Biomedical Research Centre, Moorfield’s Eye Hospital NHS Foundation Trust and UCL Institute of Ophthalmology, London, United Kingdom 5Department of Public Health and Primary Care, Institute of Public Health, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom Correspondence: Pirro G. Hysi, Department of Ophthalmology, Kings College London, St. Thomas Hospital, Westminster Bridge Road, London, SE1 7EH UK; [email protected]. Members of the UK Biobank Eye and Vision Consortium are listed in the Supplementary Material. Received: June 19, 2020 Accepted: August 19, 2020 Published: September 14, 2020 PURPOSE. The purpose of this study was to identify genetic variants on chromosome X associated with intraocular pressure (IOP) and determine if they possess any sex-specific effects. METHODS. Association analyses were performed across chromosome X using 102,407 participants from the UK Biobank. Replication and validation analyses were conducted in an additional 6599 participants from the EPIC-Norfolk cohort, and an independent 331,682 participants from the UK Biobank. RESULTS. We identified three loci associated with IOP at genomewide significance (P < 5 × 10^-8), located within or near the following genes: MXRA5 (rs2107482, P = 7.1 × 10^-11), GPM6B (rs66819623, P = 6.9 × 10^-10), NDP, and EFHC2 (rs12558081, P = 4.9 × 10^-11).
  • Ageing-Associated Changes in DNA Methylation in X and Y Chromosomes
    Kananen and Marttila Epigenetics & Chromatin (2021) 14:33 https://doi.org/10.1186/s13072-021-00407-6 RESEARCH Open Access Ageing-associated changes in DNA methylation in X and Y chromosomes Laura Kananen1,2,3,4* and Saara Marttila4,5* Abstract Background: Ageing displays clear sexual dimorphism, evident in both morbidity and mortality. Ageing is also associated with changes in DNA methylation, but very little focus has been on the sex chromosomes, potential biological contributors to the observed sexual dimorphism. Here, we sought to identify DNA methylation changes associated with ageing in the Y and X chromosomes, by utilizing datasets available in data repositories, comprising in total of 1240 males and 1191 females, aged 14–92 years. Results: In total, we identified 46 age-associated CpG sites in the male Y, 1327 age-associated CpG sites in the male X, and 325 age-associated CpG sites in the female X. The X chromosomal age-associated CpGs showed significant overlap between females and males, with 122 CpGs identified as age-associated in both sexes. Age-associated X chromosomal CpGs in both sexes were enriched in CpG islands and depleted from gene bodies and showed no strong trend towards hypermethylation nor hypomethylation. In contrast, the Y chromosomal age-associated CpGs were enriched in gene bodies, and showed a clear trend towards hypermethylation with age. Conclusions: Significant overlap in X chromosomal age-associated CpGs identified in males and females and their shared features suggest that despite the uneven chromosomal dosage, differences in ageing-associated DNA methylation changes in the X chromosome are unlikely to be a major contributor of sex dimorphism in ageing.
  • The Origin and Evolution of Human Ampliconic Gene Families and Ampliconic Structure
    Published by Cold Spring Harbor Laboratory Press Letter The origin and evolution of human ampliconic gene families and ampliconic structure Bejon Kumar Bhowmick, Yoko Satta, and Naoyuki Takahata1 Department of Biosystems Science, The Graduate University for Advance Studies (Sokendai), Kanagawa 240-0193, Japan Out of the nine male-specific gene families in the human Y chromosome amplicons, we investigate the origin and evolution of seven families for which gametologous and orthologous sequences are available. Proto-X/Y gene pairs in the original mammalian sex chromosomes played major roles in origins and gave rise to five gene families: XKRY, VCY, HSFY, RBMY, and TSPY. The divergence times between gametologous X- and Y-linked copies in these families are well correlated with the former X-chromosomal locations. The CDY and DAZ families originated exceptionally by retroposition and transposition of autosomal copies, respectively, but CDY possesses an X-linked copy of enigmatic origin. We also investigate the evolutionary relatedness among Y-linked copies of a gene family in light of their ampliconic locations (palindromes, inverted repeats, and the TSPY array). Although any pair of copies located at the same arm positions within a palindrome is identical or nearly so by frequent gene conversion, copies located at different arm positions are distinctively different. Since these and other distinct copies in various gene families were amplified almost simultaneously in the stem lineage of Catarrhini, we take these simultaneous amplifications as evidence for the elaborate formation of Y ampliconic structure. Curiously, some copies in a gene family located at different palindromes exhibit high sequence similarity, and in most cases, such similarity greatly extends to repeat units that harbor these copies.
  • Discovery of Candidate Genes for Stallion Fertility from the Horse Y Chromosome
    DISCOVERY OF CANDIDATE GENES FOR STALLION FERTILITY FROM THE HORSE Y CHROMOSOME A Dissertation by NANDINA PARIA Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2009 Major Subject: Biomedical Sciences DISCOVERY OF CANDIDATE GENES FOR STALLION FERTILITY FROM THE HORSE Y CHROMOSOME A Dissertation by NANDINA PARIA Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Approved by: Chair of Committee, Terje Raudsepp Committee Members, Bhanu P. Chowdhary William J. Murphy Paul B. Samollow Dickson D. Varner Head of Department, Evelyn Tiffany-Castiglioni August 2009 Major Subject: Biomedical Sciences ABSTRACT Discovery of Candidate Genes for Stallion Fertility from the Horse Y Chromosome. (August 2009) Nandina Paria, B.S., University of Calcutta; M.S., University of Calcutta Chair of Advisory Committee: Dr. Terje Raudsepp The genetic component of mammalian male fertility is complex and involves thousands of genes. The majority of these genes are distributed on autosomes and the X chromosome, while a small number are located on the Y chromosome. Human and mouse studies demonstrate that the most critical Y-linked male fertility genes are present in multiple copies, show testis-specific expression and are different between species. In the equine industry, where stallions are selected according to pedigrees and athletic abilities but not for reproductive performance, reduced fertility of many breeding stallions is a recognized problem. Therefore, the aim of the present research was to acquire comprehensive information about the organization of the horse Y chromosome (ECAY), identify Y-linked genes and investigate potential candidate genes regulating stallion fertility.
  • Supplementary Table 1: Adhesion Genes Data Set
    Supplementary Table 1: Adhesion genes data set
    PROBE   Entrez Gene ID  Celera Gene ID           Gene_Symbol  Gene_Name
    160832  1               hCG201364.3              A1BG         alpha-1-B glycoprotein
    223658  1               hCG201364.3              A1BG         alpha-1-B glycoprotein
    212988  102             hCG40040.3               ADAM10       ADAM metallopeptidase domain 10
    133411  4185            hCG28232.2               ADAM11       ADAM metallopeptidase domain 11
    110695  8038            hCG40937.4               ADAM12       ADAM metallopeptidase domain 12 (meltrin alpha)
    195222  8038            hCG40937.4               ADAM12       ADAM metallopeptidase domain 12 (meltrin alpha)
    165344  8751            hCG20021.3               ADAM15       ADAM metallopeptidase domain 15 (metargidin)
    189065  6868            null                     ADAM17       ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme)
    108119  8728            hCG15398.4               ADAM19       ADAM metallopeptidase domain 19 (meltrin beta)
    117763  8748            hCG20675.3               ADAM20       ADAM metallopeptidase domain 20
    126448  8747            hCG1785634.2             ADAM21       ADAM metallopeptidase domain 21
    208981  8747            hCG1785634.2|hCG2042897  ADAM21       ADAM metallopeptidase domain 21
    180903  53616           hCG17212.4               ADAM22       ADAM metallopeptidase domain 22
    177272  8745            hCG1811623.1             ADAM23       ADAM metallopeptidase domain 23
    102384  10863           hCG1818505.1             ADAM28       ADAM metallopeptidase domain 28
    119968  11086           hCG1786734.2             ADAM29       ADAM metallopeptidase domain 29
    205542  11085           hCG1997196.1             ADAM30       ADAM metallopeptidase domain 30
    148417  80332           hCG39255.4               ADAM33       ADAM metallopeptidase domain 33
    140492  8756            hCG1789002.2             ADAM7        ADAM metallopeptidase domain 7
    122603  101             hCG1816947.1             ADAM8        ADAM metallopeptidase domain 8
    183965  8754            hCG1996391               ADAM9        ADAM metallopeptidase domain 9 (meltrin gamma)
    129974  27299           hCG15447.3               ADAMDEC1     ADAM-like,
  • The Role of the X Chromosome in Embryonic and Postnatal Growth
    The role of the X chromosome in embryonic and postnatal growth Daniel Mark Snell A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy of University College London. Francis Crick Institute/Medical Research Council National Institute for Medical Research University College London January 28, 2018 I, Daniel Mark Snell, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in the work. Abstract Women born with only a single X chromosome (XO) have Turner syndrome (TS); and they are invariably of short stature. XO female mice are also small: during embryogenesis, female mice with a paternally-inherited X chromosome (XPO) are smaller than XX littermates; whereas during early postnatal life, both XPO and XMO (maternal) mice are smaller than their XX siblings. Here I look to further understand the genetic bases of these phenotypes, and potentially inform areas of future investigation into TS. Mouse pre-implantation embryos preferentially silence the XP via the non-coding RNA Xist. XPO embryos are smaller than XX littermates at embryonic day (E) 10.5, whereas XMO embryos are not. Two possible hypotheses explain this observation. Inappropriate expression of Xist in XPO embryos may cause transcriptional silencing of the single X chromosome and result in embryos nullizygous for X gene products. Alternatively, there could be imprinted genes on the X chromosome that impact on growth and manifest in growth retarded XPO embryos. In contrast, during the first three weeks of postnatal development, both XPO and XMO mice show a growth deficit when compared with XX littermates.
  • Identification of Potential Key Genes and Pathway Linked with Sporadic Creutzfeldt-Jakob Disease Based on Integrated Bioinformatics Analyses
    medRxiv preprint doi: https://doi.org/10.1101/2020.12.21.20248688; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Identification of potential key genes and pathway linked with sporadic Creutzfeldt-Jakob disease based on integrated bioinformatics analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2, Iranna Kotturshetti 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karnataka, India. 3. Department of Ayurveda, Rajiv Gandhi Education Society`s Ayurvedic Medical College, Ron, Karnataka 562209, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karnataka, India NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. Abstract Sporadic Creutzfeldt-Jakob disease (sCJD) is a neurodegenerative disease, also called prion disease, linked with poor prognosis. The aim of the current study was to illuminate the underlying molecular mechanisms of sCJD. The mRNA microarray dataset GSE124571 was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were screened.
  • Whole Exome Sequencing in Families at High Risk for Hodgkin Lymphoma: Identification of a Predisposing Mutation in the KDR Gene
    Hodgkin Lymphoma SUPPLEMENTARY APPENDIX Whole exome sequencing in families at high risk for Hodgkin lymphoma: identification of a predisposing mutation in the KDR gene Melissa Rotunno, 1 Mary L. McMaster, 1 Joseph Boland, 2 Sara Bass, 2 Xijun Zhang, 2 Laurie Burdett, 2 Belynda Hicks, 2 Sarangan Ravichandran, 3 Brian T. Luke, 3 Meredith Yeager, 2 Laura Fontaine, 4 Paula L. Hyland, 1 Alisa M. Goldstein, 1 NCI DCEG Cancer Sequencing Working Group, NCI DCEG Cancer Genomics Research Laboratory, Stephen J. Chanock, 5 Neil E. Caporaso, 1 Margaret A. Tucker, 6 and Lynn R. Goldin 1 1Genetic Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 2Cancer Genomics Research Laboratory, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 3Advanced Biomedical Computing Center, Leidos Biomedical Research Inc.; Frederick National Laboratory for Cancer Research, Frederick, MD; 4Westat, Inc., Rockville MD; 5Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; and 6Human Genetics Program, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA ©2016 Ferrata Storti Foundation. This is an open-access paper. doi:10.3324/haematol.2015.135475 Received: August 19, 2015. Accepted: January 7, 2016. Pre-published: June 13, 2016. Correspondence: [email protected] Supplemental Author Information: NCI DCEG Cancer Sequencing Working Group: Mark H. Greene, Allan Hildesheim, Nan Hu, Maria Theresa Landi, Jennifer Loud, Phuong Mai, Lisa Mirabello, Lindsay Morton, Dilys Parry, Anand Pathak, Douglas R. Stewart, Philip R. Taylor, Geoffrey S. Tobias, Xiaohong R. Yang, Guoqin Yu NCI DCEG Cancer Genomics Research Laboratory: Salma Chowdhury, Michael Cullen, Casey Dagnall, Herbert Higson, Amy A.
  • Sequence Analysis in Bos Taurus Reveals Pervasiveness of X–Y Arms Races in Mammalian Lineages
    Published by Cold Spring Harbor Laboratory Press Research Sequence analysis in Bos taurus reveals pervasiveness of X–Y arms races in mammalian lineages Jennifer F. Hughes,1 Helen Skaletsky,1,2 Tatyana Pyntikova,1 Natalia Koutseva,1 Terje Raudsepp,3 Laura G. Brown,1,2 Daniel W. Bellott,1 Ting-Jan Cho,1 Shannon Dugan-Rocha,4 Ziad Khan,4 Colin Kremitzki,5 Catrina Fronick,5 Tina A. Graves-Lindsay,5 Lucinda Fulton,5 Wesley C. Warren,5,7 Richard K. Wilson,5,8 Elaine Owens,3 James E. Womack,3 William J. Murphy,3 Donna M. Muzny,4 Kim C. Worley,4 Bhanu P. Chowdhary,3,9 Richard A. Gibbs,4 and David C. Page1,2,6 1Whitehead Institute, Cambridge, Massachusetts 02142, USA; 2Howard Hughes Medical Institute, Whitehead Institute, Cambridge, Massachusetts 02142, USA; 3College of Veterinary Medicine and Biomedical Sciences, Texas A&M University, College Station, Texas 77843, USA; 4Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA; 5The McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA; 6Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, USA Studies of Y Chromosome evolution have focused primarily on gene decay, a consequence of suppression of crossing-over with the X Chromosome. Here, we provide evidence that suppression of X–Y crossing-over unleashed a second dynamic: selfish X–Y arms races that reshaped the sex chromosomes in mammals as different as cattle, mice, and men. Using super-resolution sequencing, we explore the Y Chromosome of Bos taurus (bull) and find it to be dominated by massive, lineage-specific amplification of testis-expressed gene families, making it the most gene-dense Y Chromosome sequenced to date.
  • Soft Computing in Bioinformatics Outline
    Soft Computing in Bioinformatics
    James M. Keller and Mihail Popescu
    Electrical and Computer Engineering Department
    Health Management and Informatics Department
    University of Missouri-Columbia
    With a lot of help from our friends at MU, Univ of Utah, Univ. West FL and Indian Statistical Institute
    Outline
      I. Background
        1. Genes and Gene Products
          i. Sequences
          ii. Structure
        2. Microarrays (expression, hypermethylation)
        3. Taxonomies: Gene Ontology and MeSH
      II. Gene Product Similarity Measures
        1. Introduction
        2. Dot-Plot
        3. Smith-Waterman
        4. BLAST
        5. GO-based measures
          i. Jaccard, Cosine, Dice
          ii. Fuzzy measures
          iii. Choquet Integrals
        6. Domain and Motif measures
    05/22/2005
    Outline (Continued)
      III. Visualization and Clustering
        1. Hierarchical clustering
        2. Visual Assessment of cluster Tendency
        3. FCM and NERFCM
        4. Bi-clustering (AKA co-clustering, two-way clustering)
      IV. Knowledge Discovery
        1. Functional annotation of gene products
        2. Functional Clustering of proteins in families
        3. Summarization of a set of gene products
        4. Hot applications:
          i. Methylation microarrays
          ii. Learning biochemical networks from microarray data
    I. Background
    Introduction
      • Principal features of gene products are
        – the sequence and expression values following a microarray experiment
      • Sequence comparisons
        – DNA, Amino Acids, Motifs, Secondary Structure
      • For many gene products, additional functional information comes from
        – the set of Gene Ontology (GO) annotations and
        – the set of journal abstracts related to the gene (MeSH annotations)
      • For