Genetic Nomenclature Zhiliang Hu Iowa State University, [email protected]

Total Page:16

File Type:pdf, Size:1020Kb

Genetic Nomenclature Zhiliang Hu Iowa State University, Zhu@Iastate.Edu Animal Science Publications Animal Science 2012 Genetic Nomenclature Zhiliang Hu Iowa State University, [email protected] James M. Reecy Iowa State University, [email protected] Follow this and additional works at: http://lib.dr.iastate.edu/ans_pubs Part of the Agriculture Commons, Animal Sciences Commons, and the Genetics Commons The ompc lete bibliographic information for this item can be found at http://lib.dr.iastate.edu/ ans_pubs/170. For information on how to cite this item, please visit http://lib.dr.iastate.edu/ howtocite.html. This Book Chapter is brought to you for free and open access by the Animal Science at Iowa State University Digital Repository. It has been accepted for inclusion in Animal Science Publications by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Genetic Nomenclature Abstract Genetics includes the study of genotypes, phenotypes and the mechanisms of genetic control between them. Genetic terms describe the processes, genes, alleles and traits with which genetic phenomena are described and examined. In this chapter we will concentrate on the discussions of genetic term standardizations and, at the end of the chapter, we will list some terms relevant to genetic processes and concepts in a Genetic Glossary. Disciplines Agriculture | Animal Sciences | Genetics Comments This is a chapter from The Genetics of the Dog, 2nd edition, chapter 23 (2012): 496. Posted with permission. This book chapter is available at Iowa State University Digital Repository: http://lib.dr.iastate.edu/ans_pubs/170 2 Genetic om ncla ure Zhi-Liang Hu and James M. Reecy Department of Animal Science, Iowa State University, Ames, Iowa, USA Introduction 496 Locus and Gene Names and Symbols 497 Locus name and symbol 497 Allele name and symbol 497 Genotype terminology 498 Future Prospects 498 References 499 Genetic Glossary 499 Introduction 8 gene (GDF8) (one can also find inappropriate abbreviations such as GDF-8 in the literature) and Genetics includes the study of genotypes, is referred to as the 'bully whippet' locus in dogs. phenotypes and the mechanisms of genetic While all these names are interchangeably used control between them. Genetic terms describe in the literature, it gets more complicated when the processes, genes, alleles and traits with one considers paralogous gene duplications which genetic phenomena are described and across species, which led Rodgers eta/. (2007) examined. In this chapter we will concentrate to propose MSTN-1 and MSTN-2 as paralogue on the discussions of genetic term standardiza­ names. Unfortunately, this naming scheme does tions and, at the end of the chapter, we will list not follow Human Gene Nomenclature some terms relevant to genetic processes and Committee (HGNC) guidelines, which would concepts in a Genetic Glossary. indicate that the relevant genes should be named A standardized genetic nomenclature is MSTNl and MSTN2. (Note that work on the vital for unambiguous concept description, effi­ standardization of human gene nomenclature is cient genetic data management and effective far more advanced than that in other species.) communications among not only scientists, but From the previous example, we see that also among canine veterinarians, breeding there is evidently a need for all researchers to societies and those individuals who are inter­ follow a standardized genetic nomenclature in ested in the subject. This issue becomes even order for them to communicate to each other more evident in the post-genomics era, owing correctly on the terms they use. When a term to the rapid accumulation of large quantities of is used by someone to describe a genetic phe­ genetic data, and the use of computer software nomenon, the correct use of the term will help to manage such data, which imposes a chal­ to quickly and precisely place the subject under lenge for the precise definition and interpreta­ the unambiguously defined scope. More impor­ tion of genetic terms. tantly, a standard nomenclature will help to For example, the Myostatin gene (MSTN) is minimize the time one has to spend in differ­ also known as Growth and Differentiation Factor entiating the instances where two terms may ©CAB International 2012. The Genetics of the Dog, 496 2nd Edn (eds E.A. Ostrander and A. Ruvinsky) actually mean the same or different things, for the gene name. The symbol will consist of which is often a costly process. The need for a upper-case Latin letters and possibly Arabic standardized genetic nomenclature becomes numerals. Gene symbols must be unique. more pressing when ontologies are employed The locus name should be in capitalized in biological research for computers to man­ Latin letters or a combination of Latin letters age the genetic terms. Ontology provides a and Arabic numerals. If the locus name is two new way to effectively use, standardize and or more words, each word should be in capital manage genetics terms. The Gene Ontology Latin characters. The locus symbol should con­ (GO) Consortium has provided a good exam­ sist of as few Latin letters as possible or a com­ ple (The Gene Ontology Consortium, 2000). bination of Latin letters and Arabic numerals. When genomics information must be trans­ The characters of a symbol should always be ferred across species to perpetuate genetic dis­ capital Latin characters, and should begin with coveries, the role of a standardized genetic the initial letter of the name of the locus. If the nomenclature becomes even more important locus name is two or more words, then the ini­ (Bruford, 2010). tial letters should be used in the locus symbol. The goal of this chapter is to help estab­ The locus name and symbol should be printed lish guidelines for nomenclature, with the hope in italics wherever possible; otherwise they that it will facilitate comparison of results should be underlined. between experiments and, most importantly, When assigning gene nomenclature, the prevent confusion. gene name and symbol should be assigned based on existing HGNC nomenclature where possible (i.e. 1:1 for canine: human orthologues). Ensembl has used the new EPO (Enredo, Pecan, Locus and Gene Names Ortheus) pipeline (Paten eta/., 2008) for whole­ and Symbols genome alignment of the dog genome. Initial efforts to provide information about genes pre­ Locus name and symbol dicted during the canine genome sequencing effort assigned standardized nomenclature based These guidelines for gene nomenclature are on human gene nomenclature for 3613 canine adapted and abbreviated from the Human genes (http://uswest.ensembl.org/Canis_famil­ Gene Nomenclature Committee Guidelines iaris/lnfo/StatsTable). (http://www.genenames.org/guidelines.html). There are two categories of novel canine A gene is defined as: 'A functional heredi­ genes: (i) novel genes predicted by bioinfor­ tary unit that occupies a fixed location on a matic gene prediction programs; and (ii) novel chromosome, has a specific influence on phe­ canine genes that were studied before the com­ notype, and is capable of mutation to various pletion of the canine genome. In cases where allelic forms. In the absence of demonstrated no known strict 1: 1 human orthologue exists, function a gene may be characterized by the LOC # or Ensembl ID should be used as a sequence, transcription or homology'. A 'locus', temporary gene symbol. In order to assign a which is not synonymous with a gene, refers to name to a novel gene, it will need to be manu­ a position in the genome that can be identified ally curated and assigned a unique name fol­ by a marker. A 'chromosome region' is defined lowing HGNC guidelines. as a genomic region that has been associated with a particular syndrome or phenotype. Gene names and symbols will follow the human gene when 1: 1 orthology is known. Allele name and symbol Gene names should be short and specific and convey the character or function of the gene. These guidelines for allele nomenclature are They will be written using American spelling adapted from Young (1998) and mouse and contain only Latin letters or a combination genome nomenclature guidelines (http://www. of Latin letters and Arabic numerals. The first informatics.jax.org/mgihome/nomen/gene. letter of a gene symbol should be the same as shtml), in accordance with HGNC guidelines. The allele names should be as brief as pos­ Genotype terminology sible, yet still convey the variation associated with the allele. Alleles do not have to be named, The genotype of an individual should be but should be given symbols. If a new allele is shown by printing the relevant locus, gene similar to one that is already named, it should or allele symbols for the two homologous be named according to the breed, geographical chromosomes concerned, separated by a location or population of origin. If new alleles slash, e.g. MSTNcs1234567-T!rs1234567·C. Unlinked are to be named for a recognized locus, they loci should be separated by semicolons, e.g. should conform to nomenclature established CDll Rsa/·2400/2200; ESRPuull·5700!4200. Linked or for that locus. The first letter of the allele name syntenic loci should be separated by a space should be lower case. However, this does not or dash and listed in linkage order (e.g. apply when the allele is only a symbol. POU1FJNG_STCHCIG-PRSS7Nl), or in alpha­ An allele symbol should be as brief as pos­ betical order if the linkage order is not known. sible and consist of Latin letters or a combina­ For X-linked loci, the hemizygous case should tion of Latin letters and Arabic numerals. Like have a /Y following the locus and allele sym­ a gene symbol, an allele symbol should be an bol, e.g. AREcoSII·l094 /Y. Likewise, Y-linked abbreviation of the allele name, and should loci should be designated by IX following the start with the same letter.
Recommended publications
  • Illegitimate DNA Integration in Mammalian Cells
    Gene Therapy (2003) 10, 1791–1799 & 2003 Nature Publishing Group All rights reserved 0969-7128/03 $25.00 www.nature.com/gt REVIEW Illegitimate DNA integration in mammalian cells HWu¨ rtele1, KCE Little2 and P Chartrand2,3 1Programme de Biologie Mole´culaire, Universite´ de Montre´al, Montre´al, Canada; 2Department of Medicine, Division of Experimental Medicine, McGill University, Montre´al, Que´bec, Canada; and 3Centre Hospitalier de l’Universite´ de Montre´al and De´partement de Pathologie et de Biologie Cellulaire, Universite´ de Montre´al, Montre´al, Que´bec, Canada Foreign DNA integration is one of the most widely exploited therapy procedures can result in illegitimate integration of cellular processes in molecular biology. Its technical use introduced sequences and thus pose a risk of unforeseeable permits us to alter a cellular genome by incorporating a genomic alterations. The choice of insertion site, the degree fragment of foreign DNA into the chromosomal DNA. This to which the foreign DNA and endogenous locus are modified process employs the cell’s own endogenous DNA modifica- before or during integration, and the resulting impact on tion and repair machinery. Two main classes of integration structure, expression, and stability of the genome are all mechanisms exist: those that draw on sequence similarity factors of illegitimate DNA integration that must be con- between the foreign and genomic sequences to carry out sidered, in particular when designing genetic therapies. homology-directed modifications, and the nonhomologous or Gene Therapy (2003) 10, 1791–1799. doi:10.1038/ ‘illegitimate’ insertion of foreign DNA into the genome. Gene sj.gt.3302074 Keywords: illegitimate DNA integration; DNA repair; transgenesis; recombination; mutagenesis Introduction timate integration.
    [Show full text]
  • Gene Nomenclature System for Rice
    Rice (2008) 1:72–84 DOI 10.1007/s12284-008-9004-9 Gene Nomenclature System for Rice Susan R. McCouch & CGSNL (Committee on Gene Symbolization, Nomenclature and Linkage, Rice Genetics Cooperative) Received: 11 April 2008 /Accepted: 3 June 2008 /Published online: 15 August 2008 # The Author(s) 2008 Abstract The Committee on Gene Symbolization, Nomen- protein sequence information that have been annotated and clature and Linkage (CGSNL) of the Rice Genetics Cooper- mapped on the sequenced genome assemblies, as well as ative has revised the gene nomenclature system for rice those determined by biochemical characterization and/or (Oryza) to take advantage of the completion of the rice phenotype characterization by way of forward genetics. With genome sequence and the emergence of new methods for these revisions, we enhance the potential for structural, detecting, characterizing, and describing genes in the functional, and evolutionary comparisons across organisms biological community. This paper outlines a set of standard and seek to harmonize the rice gene nomenclature system procedures for describing genes based on DNA, RNA, and with that of other model organisms. Newly identified rice genes can now be registered on-line at http://shigen.lab.nig. ac.jp/rice/oryzabase_submission/gene_nomenclature/. The current ex-officio member list below is correct as of the date of this galley proof. The current ex-officio members of CGSNL (The Keywords Oryza sativa . Genome sequencing . Committee on Gene Symbolization, Nomenclature and Linkage) are: Gene symbolization Atsushi Yoshimura from the Faculty of Agriculture, Kyushu University, Fukuoka, 812-8581, Japan, email: [email protected] Guozhen Liu from the Beijing Institute of Genomics of the Chinese Introduction Academy of Sciences, Beijing, People’s Republic of China, email: [email protected] The biological community is moving towards a universal Yasuo Nagato from the Graduate School of Agricultural and Life system for the naming of genes.
    [Show full text]
  • Systematic Review of the Genetics of Sudden Unexpected Death in Epilepsy: Potential Overlap with Sudden Cardiac Death and Arrhythmia-Related Genes C
    SYSTEMATIC REVIEW AND META-ANALYSIS Systematic Review of the Genetics of Sudden Unexpected Death in Epilepsy: Potential Overlap With Sudden Cardiac Death and Arrhythmia-Related Genes C. Anwar A. Chahal, MBChB; Mohammad N. Salloum, MD; Fares Alahdab, MD; Joseph A. Gottwald, PharmD; David J. Tester, BS; Lucman A. Anwer, MD; Elson L. So, MD; Mohammad Hassan Murad, MD, MPH; Erik K. St Louis, MD, MS; Michael J. Ackerman, MD, PhD; Virend K. Somers, MD, PhD Background-—Sudden unexpected death in epilepsy (SUDEP) is the leading cause of epilepsy-related death. SUDEP shares many features with sudden cardiac death and sudden unexplained death in the young and may have a similar genetic contribution. We aim to systematically review the literature on the genetics of SUDEP. Methods and Results-—PubMed, MEDLINE Epub Ahead of Print, Ovid Medline In-Process & Other Non-Indexed Citations, MEDLINE, EMBASE, Cochrane Database of Systematic Reviews, and Scopus were searched through April 4, 2017. English language human studies analyzing SUDEP for known sudden death, ion channel and arrhythmia-related pathogenic variants, novel variant discovery, and copy number variant analyses were included. Aggregate descriptive statistics were generated; data were insufficient for meta- analysis. A total of 8 studies with 161 unique individuals were included; mean was age 29.0 (ÆSD 14.2) years; 61% males; ECG data were reported in 7.5% of cases; 50.7% were found prone and 58% of deaths were nocturnal. Cause included all types of epilepsy. Antemortem diagnosis of Dravet syndrome and autism (with duplication of chromosome 15) was associated with 11% and 9% of cases.
    [Show full text]
  • The Human Genome Project
    TO KNOW OURSELVES ❖ THE U.S. DEPARTMENT OF ENERGY AND THE HUMAN GENOME PROJECT JULY 1996 TO KNOW OURSELVES ❖ THE U.S. DEPARTMENT OF ENERGY AND THE HUMAN GENOME PROJECT JULY 1996 Contents FOREWORD . 2 THE GENOME PROJECT—WHY THE DOE? . 4 A bold but logical step INTRODUCING THE HUMAN GENOME . 6 The recipe for life Some definitions . 6 A plan of action . 8 EXPLORING THE GENOMIC LANDSCAPE . 10 Mapping the terrain Two giant steps: Chromosomes 16 and 19 . 12 Getting down to details: Sequencing the genome . 16 Shotguns and transposons . 20 How good is good enough? . 26 Sidebar: Tools of the Trade . 17 Sidebar: The Mighty Mouse . 24 BEYOND BIOLOGY . 27 Instrumentation and informatics Smaller is better—And other developments . 27 Dealing with the data . 30 ETHICAL, LEGAL, AND SOCIAL IMPLICATIONS . 32 An essential dimension of genome research Foreword T THE END OF THE ROAD in Little has been rapid, and it is now generally agreed Cottonwood Canyon, near Salt that this international project will produce Lake City, Alta is a place of the complete sequence of the human genome near-mythic renown among by the year 2005. A skiers. In time it may well And what is more important, the value assume similar status among molecular of the project also appears beyond doubt. geneticists. In December 1984, a conference Genome research is revolutionizing biology there, co-sponsored by the U.S. Department and biotechnology, and providing a vital of Energy, pondered a single question: Does thrust to the increasingly broad scope of the modern DNA research offer a way of detect- biological sciences.
    [Show full text]
  • A Genetic Screen for Genes That Impact Peroxisomes in Drosophila Identifies
    bioRxiv preprint doi: https://doi.org/10.1101/704122; this version posted July 16, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. A genetic screen for genes that impact peroxisomes in Drosophila identifies candidate genes for human disease Hillary K. Graves1, Sharayu Jangam1, Kai Li Tan1, Antonella Pignata1, Elaine S. Seto1, Shinya Yamamoto1,2,3,4# and Michael F. Wangler1,3,4,# 1Department of Molecular and Human Genetics, Baylor College of Medicine (BCM), Houston, TX 77030, USA 2Department of Neuroscience, BCM, Houston, TX 77030, USA 3Program in Developmental Biology, BCM, Houston, TX 77030, USA 4Jan and Dan Duncan Neurological Research Institute, Texas Children Hospital, Houston, TX 77030, USA #Corresponding authors: SY ([email protected]), MFW ([email protected]) Keywords: Drosophila, peroxisomes, BRD4, fs(1)h bioRxiv preprint doi: https://doi.org/10.1101/704122; this version posted July 16, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Abstract Peroxisomes are sub-cellular organelles that are essential for proper function of eukaryotic cells. In addition to being the sites of a variety of oxidative reactions, they are crucial regulators of lipid metabolism. Peroxisome loss or dysfunction leads to multi- system diseases in humans that strongly affects the nervous system.
    [Show full text]
  • Indicators of the Molecular Pathogenesis of Virulent Newcastle
    www.nature.com/scientificreports OPEN Indicators of the molecular pathogenesis of virulent Newcastle disease virus in chickens revealed by transcriptomic profling of spleen Mohammad Rabiei1*, Wai Yee Low2, Yan Ren2, Mohamad Indro Cahyono3, Phuong Thi Kim Doan1,4, Indi Dharmayanti3, Eleonora Dal Grande1 & Farhid Hemmatzadeh1,2 Newcastle disease virus (NDV) has caused signifcant outbreaks in South-East Asia, particularly in Indonesia in recent years. Recently emerged genotype VII NDVs (NDV-GVII) have shifted their tropism from gastrointestinal/respiratory tropism to a lymphotropic virus, invading lymphoid organs including spleen and bursa of Fabricius to cause profound lymphoid depletion. In this study, we aimed to identify candidate genes and biological pathways that contribute to the disease caused by this velogenic NDV-GVII. A transcriptomic analysis based on RNA-Seq of spleen was performed in chickens challenged with NDV-GVII and a control group. In total, 6361 genes were diferentially expressed that included 3506 up-regulated genes and 2855 down-regulated genes. Real-Time PCR of ten selected genes validated the RNA-Seq results as the correlation between them is 0.98. Functional and network analysis of Diferentially Expressed Genes (DEGs) showed altered regulation of ElF2 signalling, mTOR signalling, proliferation of cells of the lymphoid system, signalling by Rho family GTPases and synaptogenesis signalling in spleen. We have also identifed modifed expression of IFIT5, PI3K, AGT and PLP1 genes in NDV-GVII infected chickens. Our fndings in activation of autophagy-mediated cell death, lymphotropic and synaptogenesis signalling pathways provide new insights into the molecular pathogenesis of this newly emerged NDV-GVII. Newcastle disease virus (NDV) has a worldwide distribution.
    [Show full text]
  • Identification of Common Gene Networks Responsive To
    Cancer Gene Therapy (2014) 21, 542–548 © 2014 Nature America, Inc. All rights reserved 0929-1903/14 www.nature.com/cgt ORIGINAL ARTICLE Identification of common gene networks responsive to radiotherapy in human cancer cells D-L Hou1, L Chen2, B Liu1, L-N Song1 and T Fang1 Identification of the genes that are differentially expressed between radiosensitive and radioresistant cancers by global gene analysis may help to elucidate the mechanisms underlying tumor radioresistance and improve the efficacy of radiotherapy. An integrated analysis was conducted using publicly available GEO datasets to detect differentially expressed genes (DEGs) between cancer cells exhibiting radioresistance and cancer cells exhibiting radiosensitivity. Gene Ontology (GO) enrichment analyses, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis and protein–protein interaction (PPI) networks analysis were also performed. Five GEO datasets including 16 samples of radiosensitive cancers and radioresistant cancers were obtained. A total of 688 DEGs across these studies were identified, of which 374 were upregulated and 314 were downregulated in radioresistant cancer cell. The most significantly enriched GO terms were regulation of transcription, DNA-dependent (GO: 0006355, P = 7.00E-09) for biological processes, while those for molecular functions was protein binding (GO: 0005515, P = 1.01E-28), and those for cellular component was cytoplasm (GO: 0005737, P = 2.81E-26). The most significantly enriched pathway in our KEGG analysis was Pathways in cancer (P = 4.20E-07). PPI network analysis showed that IFIH1 (Degree = 33) was selected as the most significant hub protein. This integrated analysis may help to predict responses to radiotherapy and may also provide insights into the development of individualized therapies and novel therapeutic targets.
    [Show full text]
  • The Prepattern Transcription Factor Irx3 Directs Nephron Segment Identity
    Downloaded from genesdev.cshlp.org on October 8, 2021 - Published by Cold Spring Harbor Laboratory Press The prepattern transcription factor Irx3 directs nephron segment identity Luca Reggiani,1 Daniela Raciti,1 Rannar Airik,2 Andreas Kispert,2 and André W. Brändli1,3 1Department of Chemistry and Applied Biosciences, Institute of Pharmaceutical Sciences, ETH Zurich, CH-8093 Zurich, Switzerland; 2Institute of Molecular Biology, Hannover Medical School, D-30625 Hannover, Germany The nephron, the basic structural and functional unit of the vertebrate kidney, is organized into discrete segments, which are composed of distinct renal epithelial cell types. Each cell type carries out highly specific physiological functions to regulate fluid balance, osmolarity, and metabolic waste excretion. To date, the genetic basis of regionalization of the nephron has remained largely unknown. Here we show that Irx3,a member of the Iroquois (Irx) gene family, acts as a master regulator of intermediate tubule fate. Comparative studies in Xenopus and mouse have identified Irx1, Irx2, and Irx3 as an evolutionary conserved subset of Irx genes, whose expression represents the earliest manifestation of intermediate compartment patterning in the developing vertebrate nephron discovered to date. Intermediate tubule progenitors will give rise to epithelia of Henle’s loop in mammals. Loss-of-function studies indicate that irx1 and irx2 are dispensable, whereas irx3 is necessary for intermediate tubule formation in Xenopus. Furthermore, we demonstrate that misexpression of irx3 is sufficient to direct ectopic development of intermediate tubules in the Xenopus mesoderm. Taken together, irx3 is the first gene known to be necessary and sufficient to specify nephron segment fate in vivo.
    [Show full text]
  • Genetics As Explanation: Limits to the Human Genome Project’ (2006, 2009)
    els a0026653.tex V2 - 08/29/2016 4:00 P.M. Page 1 Version 3 a0026653 Genetics as Explanation: Introductory article Article Contents Limits to the Human • Definitions • Metaphors and Programs Genome Project • Metaphors and Expectations • Genome is Not a Simple Program Irun R Cohen, The Weizmann Institute of Science, Rehovot, Israel • Microbes, Symbiosis and the Holobiont Henri Atlan, Ecole des Hautes Etudes en Sciences Sociales, Paris, France and Hadas- • Stem Cells Express Multiple Genes • sah University Hospital, Jerusalem, Israel Meaning: Line or Loop? AU:1 • Self-Organisation and Program Sol Efroni, Bar-Ilan University, Ramat Gan, Israel • Complexity, Reduction and Emergence • Evolving Genomes AU:2 Based in part on the previous versions of this eLS article ‘Genetics as Explanation: Limits to the Human Genome Project’ (2006, 2009). • The Environment and the Genome • Language Metaphor • Tool and Toolbox Metaphors • Conclusion Online posting date: 14th October 2016 Living organisms are composed of cells and all biological evolution, single organisms, populations of organisms, living cells contain a genome, the organism’s stock species, cells and molecules, heredity, embryonic development, of deoxyribonucleic acid (DNA). The role of the health and disease and life management. These are quite diverse genome has been likened to a computer program subjects, and the people who study them would seem to use the term gene in distinctly different ways. But genetics as a whole that encodes the organism’s development and its is organised by a single unifying principle, the deoxyribonucleic subsequent response to the environment. Thus, the acid (DNA) code; all would agree that the information borne by organism and its fate can be explained by genetics, a gene is linked to particular sequences of DNA.
    [Show full text]
  • The Genetic Basis of Malformation of Cortical Development Syndromes: Primary Focus on Aicardi Syndrome
    The Genetic Basis of Malformation of Cortical Development Syndromes: Primary Focus on Aicardi Syndrome Thuong Thi Ha B. Sc, M. Bio Neurogenetics Research Group The University of Adelaide Thesis submitted for the degree of Doctor of Philosophy In Discipline of Genetics and Evolution School of Biological Sciences Faculty of Science The University of Adelaide June 2018 Table of Contents Abstract 6 Thesis Declaration 8 Acknowledgements 9 Publications 11 Abbreviations 12 CHAPTER I: Introduction 15 1.1 Overview of Malformations of Cortical Development (MCD) 15 1.2 Introduction into Aicardi Syndrome 16 1.3 Clinical Features of Aicardi Syndrome 20 1.3.1 Epidemiology 20 1.3.2 Clinical Diagnosis 21 1.3.3 Differential Diagnosis 23 1.3.4 Development & Prognosis 24 1.4 Treatment 28 1.5 Pathogenesis of Aicardi Syndrome 29 1.5.1 Prenatal or Intrauterine Disturbances 29 1.5.2 Genetic Predisposition 30 1.6 Hypothesis & Aims 41 1.6.1 Hypothesis 41 1.6.2 Research Aims 41 1.7 Expected Outcomes 42 CHAPTER II: Materials & Methods 43 2.1 Study Design 43 2.1.1 Cohort of Study 44 2.1.2 Inheritance-based Strategy 45 1 2.1.3 Ethics for human and animals 46 2.2 Computational Methods 46 2.2.1 Pre-Processing Raw Reads 46 2.2.2 Sequencing Coverage 47 2.2.3 Variant Discovery 49 2.2.4 Annotating Variants 53 2.2.5 Evaluating Variants 55 2.3 Biological Methods 58 2.3.1 Cell Culture 58 2.3.2 Genomic DNA Sequencing 61 2.3.3 Plasmid cloning 69 2.3.4 RNA, whole exome & Whole Genome Sequencing 74 2.3.5 TOPFlash Assay 75 2.3.6 Western Blot 76 2.3.7 Morpholino Knockdown in Zebrafish 79 CHAPTER III: A mutation in COL4A2 causes autosomal dominant porencephaly with cataracts.
    [Show full text]
  • Highlights of the 'Gene Nomenclature Across Species' Meeting
    MEETING REPORT Highlights of the ‘Gene Nomenclature Across Species’ Meeting Elspeth A. Bruford* Project Coordinator, HUGO Gene Nomenclature Committee (HGNC), EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK *Correspondence to: E-mail: [email protected] Date received (in revised form): 24th February, 2010 Abstract The first ‘Gene Nomenclature Across Species’ meeting was held on 12th and 13th October 2009, at the Møller Centre in Cambridge, UK. This meeting, organised and hosted by the HUGO Gene Nomenclature Committee (HGNC), brought together invited experts from the fields of gene nomenclature, phylogenetics and genome assembly and annotation. The central aim of the meeting was to discuss the issues of coordinating gene naming across vertebrates, culminating in the publication of recommendations for assigning nomenclature to genes across multiple species. Meeting summary coorganiser of the meeting, kicked off by discussing The meeting began with a welcome and outline of the current work of the HGNC, ‘An Essential the agenda from Elspeth Bruford, one of the Resource for the Human Genome’. Matt outlined meeting organisers and the group coordinator for the roles of the HGNC, including a summary of the HUGO Gene Nomenclature Committee the process of symbol assignment, and its current (HGNC). HGNC has been based at the European efforts in coordinating gene naming across ver- Bioinformatics Institute (EBI) at Hinxton, UK, tebrates. He also highlighted instances where the since 2007. Since its inception in 1979, the lack of approved gene nomenclature for most mam- HGNC has been assigning gene symbols and malian genomes has resulted in valuable published names to all human genes, including pseudogenes data for these species being absent or confused in and non-coding RNAs.
    [Show full text]
  • Connecting Myelin-Related and Synaptic Dysfunction In
    www.nature.com/scientificreports OPEN Connecting myelin-related and synaptic dysfunction in schizophrenia with SNP-rich Received: 24 October 2016 Accepted: 27 February 2017 gene expression hubs Published: 07 April 2017 Hedi Hegyi Combining genome-wide mapping of SNP-rich regions in schizophrenics and gene expression data in all brain compartments across the human life span revealed that genes with promoters most frequently mutated in schizophrenia are expression hubs interacting with far more genes than the rest of the genome. We summed up the differentially methylated “expression neighbors” of genes that fall into one of 108 distinct schizophrenia-associated loci with high number of SNPs. Surprisingly, the number of expression neighbors of the genes in these loci were 35 times higher for the positively correlating genes (32 times higher for the negatively correlating ones) than for the rest of the ~16000 genes. While the genes in the 108 loci have little known impact in schizophrenia, we identified many more known schizophrenia-related important genes with a high degree of connectedness (e.g. MOBP, SYNGR1 and DGCR6), validating our approach. Both the most connected positive and negative hubs affected synapse-related genes the most, supporting the synaptic origin of schizophrenia. At least half of the top genes in both the correlating and anti-correlating categories are cancer-related, including oncogenes (RRAS and ALDOA), providing further insight into the observed inverse relationship between the two diseases. Gene expression correlation, protein-protein interaction and other high-throughput experiments in the post-genomic era have revealed that genes tend to form complex, scale-free networks where most genes have a few connections with others and a few have a high number of interactions, commonly referred to as “hubs”, estab- lishing them as important central genes in these gene networks1.
    [Show full text]