Cedalion Development of a Computational Framework to Discover Pivotal Genomic Features in Newly Sequenced Plant Species

Total Page:16

File Type:pdf, Size:1020Kb

Cedalion Development of a Computational Framework to Discover Pivotal Genomic Features in Newly Sequenced Plant Species Cedalion Development of a computational framework to discover pivotal genomic features in newly sequenced plant species Word count: 33446 Lennart Raman Student number: 01102304 Supervisors: Prof. Dr. Yves Van de Peer, Dr. Oren Tzfadia A dissertation submitted to Ghent University in partial fulfilment of the requirements for the degree of Master of Science in Bioinformatics: Systems Biology Academic Year: 2016 - 2017 Preface The past academic lustrum at Ghent University has led me to writing this final dissertation, aiming at achieving the degree of Master of Science in Bioinformatics. Using this preface, I would like to seize the opportunity to thank numerous friends, family members, supervisors and professors who contributed in shaping me into the person I am today. For this project specifically, I have to show gratitude to my promoter Prof. Dr. Yves Van de Peer, who allowed me to join his team as the second Master’s thesis student amongst the Cedalion members. This truly gave me the possibility to research a fascinating and – to me – very attractive field of study. Moreover, a considerable word of thanks is owed to the project’s supervisor, Dr. Oren Tzfadia. Oren managed to professionally help me with solving difficulties by either answering my questions or by getting me into contact with other experts at VIB- UGent – all whilst keeping a positive and certainly welcoming atmosphere. Regarding recent years, much appreciation goes out to my parents. Their moral and financial support enabled me to focus on my studies in a stable manner. Furthermore, I have to acknowledge their patience as well, as my juvenile Bachelor years were at times less noteworthy compared to the period when following the Master of Science in Bioinformatics. Last but not least, my friends turned the past couple of years into some genuinely unforgettable moments. They offered me some welcoming and necessary distraction from the sometimes difficult study periods. Special attention goes hereby out to my girlfriend Fien Stepman, who positively guided me into the right direction. I’m not waving Ghent University goodbye just yet. Later this year, I’ll start doing a PhD at UZ Ghent. My hopes are to stay in close contact with current fellow students and the great people of UGent and VIB, hopefully by collaborating at joint projects. Ghent, June 2017 i ii Table of Contents LIST OF ABBREVIATIONS AND SYMBOLS ............................................................. 1 Abbreviations .............................................................................................................. 1 Symbols ....................................................................................................................... 2 ABSTRACT .......................................................................................................... 5 INTRODUCTION .................................................................................................. 7 1. The importance of genome sequencing and annotation .................................... 7 1.1. Sequencing across different research domains ...................................................... 7 1.2. Current research challenges in the plant Kingdom ................................................ 7 2. The genome sequencing project .......................................................................... 8 2.1. Collecting DNA ........................................................................................................ 8 2.2. Sequencing .............................................................................................................. 8 2.2.1. Choice of sequencing platform ....................................................................... 8 2.2.2. Read depth ...................................................................................................... 9 2.2.3. Validation ...................................................................................................... 10 2.3. Assembly ............................................................................................................... 10 2.3.1. Principles of assembly ................................................................................... 10 2.3.2. Tools .............................................................................................................. 10 2.3.3. Validation ...................................................................................................... 11 2.4. Structural annotation ............................................................................................ 11 2.4.1. Principles of structural annotation................................................................ 11 2.4.2. Validation ...................................................................................................... 12 2.4.3. Online repositories ........................................................................................ 12 2.5. Functional annotation ........................................................................................... 12 2.5.1. Terms and vocabularies................................................................................. 12 2.5.2. Not all genes can be accurately annotated ................................................... 12 2.5.3. Homology-based functional annotation ....................................................... 14 2.6. Uncovering additional biological knowledge ........................................................ 14 2.6.1. Genome-wide orthology detection ............................................................... 14 2.6.2. Extending gene annotations .......................................................................... 14 2.6.3. Genome-level biological properties .............................................................. 15 3. Cedalion .............................................................................................................. 15 3.1. Automating the whole-genome sequencing project ............................................ 15 3.1.1. The need for automation .............................................................................. 15 3.1.2. The ultimate goal beyond the plant Kingdom ............................................... 16 3.1.3. Existing whole-genome annotation pipelines ............................................... 16 3.2. Introducing Cedalion ............................................................................................. 17 3.2.1. Etymology ...................................................................................................... 18 iii 3.2.2. The workflow ................................................................................................. 19 3.3. Introducing personal contribution attempts ........................................................ 21 3.3.1. The Genome Statistics Module ..................................................................... 21 3.3.2. The GO Set Testing Module ........................................................................... 21 3.3.3. The BadiRate Module .................................................................................... 22 3.4. Benchmarking Cedalion using Zostera marina and Ulva mutabilis ...................... 26 3.4.1. Zostera marina .............................................................................................. 26 3.4.2. Ulva mutabilis ................................................................................................ 26 AIMS................................................................................................................. 27 1. The Genome Statistics Module .......................................................................... 27 1.1. Feature extraction ................................................................................................. 27 1.2. Outputting relevant information .......................................................................... 27 1.3. Example of an additional study ............................................................................. 28 2. The GO Set Testing Module ............................................................................... 28 2.1. Framework based on codon usage ....................................................................... 28 2.2. Framework based on structural features ............................................................. 28 3. The BadiRate Module ......................................................................................... 28 3.1. BadiRate options ................................................................................................... 28 3.2. Input files generation ............................................................................................ 29 3.3. Downstream processing ........................................................................................ 29 RESULTS ........................................................................................................... 31 1. The Genome Statistics Module .......................................................................... 31 1.1. Software implementation ..................................................................................... 31 1.1.1. Feature extraction ......................................................................................... 31 1.1.2. Feature visualization ..................................................................................... 32 1.1.3. Feature ranking ............................................................................................
Recommended publications
  • Naming the Extrasolar Planets
    Naming the extrasolar planets W. Lyra Max Planck Institute for Astronomy, K¨onigstuhl 17, 69177, Heidelberg, Germany [email protected] Abstract and OGLE-TR-182 b, which does not help educators convey the message that these planets are quite similar to Jupiter. Extrasolar planets are not named and are referred to only In stark contrast, the sentence“planet Apollo is a gas giant by their assigned scientific designation. The reason given like Jupiter” is heavily - yet invisibly - coated with Coper- by the IAU to not name the planets is that it is consid- nicanism. ered impractical as planets are expected to be common. I One reason given by the IAU for not considering naming advance some reasons as to why this logic is flawed, and sug- the extrasolar planets is that it is a task deemed impractical. gest names for the 403 extrasolar planet candidates known One source is quoted as having said “if planets are found to as of Oct 2009. The names follow a scheme of association occur very frequently in the Universe, a system of individual with the constellation that the host star pertains to, and names for planets might well rapidly be found equally im- therefore are mostly drawn from Roman-Greek mythology. practicable as it is for stars, as planet discoveries progress.” Other mythologies may also be used given that a suitable 1. This leads to a second argument. It is indeed impractical association is established. to name all stars. But some stars are named nonetheless. In fact, all other classes of astronomical bodies are named.
    [Show full text]
  • Educator's Guide: Orion
    Legends of the Night Sky Orion Educator’s Guide Grades K - 8 Written By: Dr. Phil Wymer, Ph.D. & Art Klinger Legends of the Night Sky: Orion Educator’s Guide Table of Contents Introduction………………………………………………………………....3 Constellations; General Overview……………………………………..4 Orion…………………………………………………………………………..22 Scorpius……………………………………………………………………….36 Canis Major…………………………………………………………………..45 Canis Minor…………………………………………………………………..52 Lesson Plans………………………………………………………………….56 Coloring Book…………………………………………………………………….….57 Hand Angles……………………………………………………………………….…64 Constellation Research..…………………………………………………….……71 When and Where to View Orion…………………………………….……..…77 Angles For Locating Orion..…………………………………………...……….78 Overhead Projector Punch Out of Orion……………………………………82 Where on Earth is: Thrace, Lemnos, and Crete?.............................83 Appendix………………………………………………………………………86 Copyright©2003, Audio Visual Imagineering, Inc. 2 Legends of the Night Sky: Orion Educator’s Guide Introduction It is our belief that “Legends of the Night sky: Orion” is the best multi-grade (K – 8), multi-disciplinary education package on the market today. It consists of a humorous 24-minute show and educator’s package. The Orion Educator’s Guide is designed for Planetarians, Teachers, and parents. The information is researched, organized, and laid out so that the educator need not spend hours coming up with lesson plans or labs. This has already been accomplished by certified educators. The guide is written to alleviate the fear of space and the night sky (that many elementary and middle school teachers have) when it comes to that section of the science lesson plan. It is an excellent tool that allows the parents to be a part of the learning experience. The guide is devised in such a way that there are plenty of visuals to assist the educator and student in finding the Winter constellations.
    [Show full text]
  • On the Months (De Mensibus) (Lewiston, 2013)
    John Lydus On the Months (De mensibus) Translated with introduction and annotations by Mischa Hooker 2nd edition (2017) ii TABLE OF CONTENTS Abbreviations .......................................................................................... iv Introduction .............................................................................................. v On the Months: Book 1 ............................................................................... 1 On the Months: Book 2 ............................................................................ 17 On the Months: Book 3 ............................................................................ 33 On the Months: Book 4 January ......................................................................................... 55 February ....................................................................................... 76 March ............................................................................................. 85 April ............................................................................................ 109 May ............................................................................................. 123 June ............................................................................................ 134 July ............................................................................................. 140 August ........................................................................................ 147 September ................................................................................
    [Show full text]
  • Greek Mythology
    Greek Mythology The Creation Myth “First Chaos came into being, next wide bosomed Gaea(Earth), Tartarus and Eros (Love). From Chaos came forth Erebus and black Night. Of Night were born Aether and Day (whom she brought forth after intercourse with Erebus), and Doom, Fate, Death, sleep, Dreams; also, though she lay with none, the Hesperides and Blame and Woe and the Fates, and Nemesis to afflict mortal men, and Deceit, Friendship, Age and Strife, which also had gloomy offspring.”[11] “And Earth first bore starry Heaven (Uranus), equal to herself to cover her on every side and to be an ever-sure abiding place for the blessed gods. And earth brought forth, without intercourse of love, the Hills, haunts of the Nymphs and the fruitless sea with his raging swell.”[11] Heaven “gazing down fondly at her (Earth) from the mountains he showered fertile rain upon her secret clefts, and she bore grass flowers, and trees, with the beasts and birds proper to each. This same rain made the rivers flow and filled the hollow places with the water, so that lakes and seas came into being.”[12] The Titans and the Giants “Her (Earth) first children (with heaven) of Semi-human form were the hundred-handed giants Briareus, Gyges, and Cottus. Next appeared the three wild, one-eyed Cyclopes, builders of gigantic walls and master-smiths…..Their names were Brontes, Steropes, and Arges.”[12] Next came the “Titans: Oceanus, Hypenon, Iapetus, Themis, Memory (Mnemosyne), Phoebe also Tethys, and Cronus the wily—youngest and most terrible of her children.”[11] “Cronus hated his lusty sire Heaven (Uranus).
    [Show full text]
  • Collection of Hesiod Homer and Homerica
    COLLECTION OF HESIOD HOMER AND HOMERICA Hesiod, The Homeric Hymns, and Homerica This file contains translations of the following works: Hesiod: "Works and Days", "The Theogony", fragments of "The Catalogues of Women and the Eoiae", "The Shield of Heracles" (attributed to Hesiod), and fragments of various works attributed to Hesiod. Homer: "The Homeric Hymns", "The Epigrams of Homer" (both attributed to Homer). Various: Fragments of the Epic Cycle (parts of which are sometimes attributed to Homer), fragments of other epic poems attributed to Homer, "The Battle of Frogs and Mice", and "The Contest of Homer and Hesiod". This file contains only that portion of the book in English; Greek texts are excluded. Where Greek characters appear in the original English text, transcription in CAPITALS is substituted. PREPARER'S NOTE: In order to make this file more accessable to the average computer user, the preparer has found it necessary to re-arrange some of the material. The preparer takes full responsibility for his choice of arrangement. A few endnotes have been added by the preparer, and some additions have been supplied to the original endnotes of Mr. Evelyn-White's. Where this occurs I have noted the addition with my initials "DBK". Some endnotes, particularly those concerning textual variations in the ancient Greek text, are here ommitted. PREFACE This volume contains practically all that remains of the post- Homeric and pre-academic epic poetry. I have for the most part formed my own text. In the case of Hesiod I have been able to use independent collations of several MSS. by Dr.
    [Show full text]
  • The Night Sky This Month
    The Night Sky (January 2020) UT (Universal Time) or GMT is used this month. Northern Horizon Eastern Western Horizon Horizon 2 2:00 at beginning of the month. 21 :00 in middle of month 20 :00 at end of month Southern Horizon The General Weather Pattern January is often a windy month. Rain is usually less than in the previous month, but it is the second wettest month of the year. January is typically the coldest month of the year and normally gets colder as the month ages; it can therefore be very cold at night, often reaching freezing temperatures with clear skies. Snow can be expected in cold years. Wrap up warm with a warm hat, gloves, socks and shoes and wear multiple layers of clothes; wind chill is a factor in January. An energy snack and a flask containing a warm non-alcoholic drink wouldn’t go amiss. Should you be interested in obtaining a detailed weather forecast for observing in the Usk area, log on to https://www.meteoblue.com/en/weather/forecast/seeing/usk_united -kingdom_2635052 From Earth Orion culminates at 22:00 a third of the way through the month and can be observed almost all night. The Earth is at perihelion (closest to the sun) in its elliptical orbit on the morning of the 5th of January. The word perihelion derives from the Greek words, peri meaning ‘near’ and Helios the Greek god of the sun. The winter sky, dominated by Orion, presents itself due south, late in the evenings for most of this month.
    [Show full text]
  • Greek Mythology / Apollodorus; Translated by Robin Hard
    Great Clarendon Street, Oxford 0X2 6DP Oxford University Press is a department of the University of Oxford. It furthers the University’s objective of excellence in research, scholarship, and education by publishing worldwide in Oxford New York Athens Auckland Bangkok Bogotá Buenos Aires Calcutta Cape Town Chennai Dar es Salaam Delhi Florence Hong Kong Istanbul Karachi Kuala Lumpur Madrid Melbourne Mexico City Mumbai Nairobi Paris São Paulo Shanghai Singapore Taipei Tokyo Toronto Warsaw with associated companies in Berlin Ibadan Oxford is a registered trade mark of Oxford University Press in the UK and in certain other countries Published in the United States by Oxford University Press Inc., New York © Robin Hard 1997 The moral rights of the author have been asserted Database right Oxford University Press (maker) First published as a World’s Classics paperback 1997 Reissued as an Oxford World’s Classics paperback 1998 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, or under terms agreed with the appropriate reprographics rights organizations. Enquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above You must not circulate this book in any other binding or cover and you must impose this same condition on any acquirer British Library Cataloguing in Publication Data Data available Library of Congress Cataloging in Publication Data Apollodorus. [Bibliotheca. English] The library of Greek mythology / Apollodorus; translated by Robin Hard.
    [Show full text]
  • O Oceanids (Ὠκεανίδες). the 'Holy Company' of Daughters of *Ocean and *Tethys, and Sisters of the River-Gods. Their N
    O Oceanids (Ὠκεανίδες). The 'holy company' of daughters of *Ocean and *Tethys, and sisters of the river-gods. Their number varies, but Apollodorus names seven (Asia, *Styx, Electra, Doris, Eurynome, *Amphitrite, *Metis) and Hesiod forty-one, including, in addition to those mentioned by Apollodorus, *Calypso *Clymene and Philyra. Most of those named mated with gods (Amphitrite with Poseidon, Doris with Nereus for example) and produced important offspring (Athena was born from Metis and Zeus, Philyra mated with Kronos in the form of a horse and produced the centaur Chiron, Clymene was the mother of Prometheus, and also bore Phaethon to Helius the sun-god.). The sons of Ocean were the fresh-water rivers, but some of the daughters were sea-nymphs, others spirits of streams called after a characteristic of their water such as Ocyrrhoe ('swift-flowing') or Xanthe ('brownish-yellow'); Styx, unusually, was a female river deity, personifying the river of Hades. Calypso ruled over the island kingdom of Ogygia (but her parentage as a true Oceanid is disputed). Metis had the ability, common to sea-gods, of being able to change her shape, although she was little more than the personification of wisdom, swallowed by Zeus in order to absorb that wisdom and to contain any threat from the child with whom she was pregnant. Oceanids feature as the chorus in Prometheus Bound, a tragedy which is set at the north-eastern edge of the known world, and at a time before Zeus had consolidated his power. The Oceanids were part of the older, pre-Olympian race of gods, who had a role as consorts or mothers of other divinities, but were more often viewed anonymously as belonging to the vastness of the sea, to be placated in times of storm and turbulence.
    [Show full text]
  • Beyond the Cloud. Gombrich and the Blindness of Orion
    Beyond the cloud. Gombrich and the blindness of Orion Lucia Corrain At his touch, words start up into images, thoughts become things. William Hazlitt1 I believed that art history must be rational, clear. And that one must not talk any nonsense. Ernst H. Gombrich2 The start of the problem The essay that Ernst Gombrich wrote in 1944 about Nicolas Poussin’s Orion (Fig. 1)3 is a powerful one, built upon a limpid articulation of reasoning, particularly if we consider his controlled adherence to the iconological method. The crystal-like clarity and elegance of the linguistic expression, typical of all of the Austrian scholar’s work,4 in this specific case may also be explained by the context in which he was writing. While describing the work he had carried out during the war, Gombrich’s own words were: ‘I also wrote short texts, to give examples of each of these categories. Later I published a few – in fact, most – of these examples. For mythology, for instance, I had chosen the Orion of Poussin’.5 His brief article begins with a citation from Bellori in which the seventeenth- century critic praised ‘Nicolas for the excellence of his landscapes’ and declared the 1 William Hazlitt, Table Talk, vol. II, London, 1822, 14-24. 2 Ernst H. Gombrich, Didier Eribon, A Lifelong Interest, London: Thames and Hudson, 1991, 24. 3 E. H. Gombrich, ‘The subject of Poussin’s Orion’, Burlington Magazine, 84: 1944, 37-41; now in Ernst Gombrich, Symbolic Images. Studies in the Art of the Renaissance, London: Phaidon, 1972, 119-122.
    [Show full text]
  • TOWARDS a NEW HISTORY of DRAMA AS LABOUR CULTURE DK Roselli the Study of the Ancient Worl
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Humanities Commons THE WORK OF TRAGIC PRODUCTIONS: TOWARDS A NEW HISTORY OF DRAMA AS LABOUR CULTURE D.K. Roselli The study of the ancient world has often come under scrutiny for its ques- tionable ‘relevance’ to modern society, but Greek tragedy has proven rather resilient. From tragedy’s perceived value in articulating an incomplete but ide- alised state of political and ethical being in Hegel to its role in thinking through the modern construction of politics and gender (often through a re-reading of Hegel), tragedy has loomed large in modern critical inquiry into definitions of the political and the formation of the subject.1 This is another way of saying that the richly textured tragic text has in some respects laid the foundation for subsequent theorising of the political subject. Given the importance placed on such figures as Sophocles’ Oedipus and Antigone starting with Schelling and Hegel, it is perhaps not surprising that recent work in critical theory has tended to recast these particular tragic figures in its critique of Enlightenment thought. Nonetheless, there are problems with the adoption of these figures as paradigms through which tragedy becomes a tool to represent the ancient Greek polis and to work through modern political and ethical problems. The repeated returns to certain aspects of Oedipus or Antigone have contributed to a structured silence around the issue of class rela- tions. Along with the increasingly dominant role of neoliberalism and the con- tinuing importance of identity politics, much recent critical theory has contrib- uted to the occlusion of class and labour from public discourse and academic research.2 In such a climate, it is no wonder that historical materialism rarely figures in academic works.
    [Show full text]
  • Lawler, the Dance of the Owl and Its Significance in the History of Greek Religion
    The Dance of the Owl and Its Significance in the History of Greek Religion and the Drama Author(s): Lillian B. Lawler Source: Transactions and Proceedings of the American Philological Association , 1939, Vol. 70 (1939), pp. 482-502 Published by: The Johns Hopkins University Press Stable URL: http://www.jstor.com/stable/283103 REFERENCES Linked references are available on JSTOR for this article: http://www.jstor.com/stable/283103?seq=1&cid=pdf- reference#references_tab_contents You may need to log in to JSTOR to access the linked references. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at https://about.jstor.org/terms The Johns Hopkins University Press and are collaborating with JSTOR to digitize, preserve and extend access to Transactions and Proceedings of the American Philological Association This content downloaded from 199.111.227.97 on Mon, 27 Jul 2020 17:41:40 UTC All use subject to https://about.jstor.org/terms 482 Lillian B. Lawler [1939 XXXIV.-The Dance of the Owl and Its Significance in the History of Greek Religion and the Drama LILLIAN B. LAWLER HUNTER COLLEGE By " the dance of the owl " Greek writers may mean one of seven different things, or a combination of two or more of them.
    [Show full text]
  • Brill's Companion to Sophocles / Edited by Andreas Markantonatos
    Brill’s Companion to Sophocles Brill’s Companion to Sophocles Edited by Andreas Markantonatos LEIDEN • BOSTON 2012 Cover illustration: Etruscan art: Talamone Pediment: Oedipus Kneeling, 180–160BC. Orbetello, Museo Civico. © 2012. Photo Scala, Florence—courtesy of the Ministero Beni e Att. Culturali. Library of Congress Cataloging-in-Publication Data Brill's companion to Sophocles / edited by Andreas Markantonatos. pages. cm. Includes bibliographical references and index. ISBN 978-90-04-18492-3 (hardback : alk. paper) – ISBN 978-90-04-21762-1 (e-book) 1. Sophocles–Criticism and interpretation. I. Markantonatos, Andreas. PA4417.B78 2012 882'.01–dc23 2012021091 This publication has been typeset in the multilingual “Brill” typeface. With over 5,100 characters covering Latin, IPA, Greek, and Cyrillic, this typeface is especially suitable for use in the humanities. For more information, please see www.brill.com/brill-typeface. ISSN 1872-3357 ISBN 978 90 04 18492 3 (hardback) ISBN 978 90 04 21762 1 (e-book) Copyright 2012 by Koninklijke Brill NV, Leiden, The Netherlands. Koninklijke Brill NV incorporates the imprints Brill, Global Oriental, Hotei Publishing, IDC Publishers and Martinus Nijhof Publishers. All rights reserved. No part of this publication may be reproduced, translated, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission from the publisher. Authorization to photocopy items for internal or personal use is granted by Koninklijke Brill NV provided that the appropriate fees are paid directly to The Copyright Clearance Center, 222 Rosewood Drive, Suite 910, Danvers, MA 01923, USA. Fees are subject to change.
    [Show full text]