15Th International Symposium on Bioinformatics Research and Applications

Total Page:16

File Type:pdf, Size:1020Kb

15Th International Symposium on Bioinformatics Research and Applications ISBRA 2019 15th International Symposium on Bioinformatics Research and Applications June 3–6, 2019 Technical University of Catalonia Barcelona, Spain http://alan.cs.gsu.edu/isbra19/ About the Technical University of Catalonia The Technical University of Catalonia (UPC) is a public institution of Higher Education and Research, specialized in the areas of Architecture, Engineering, Science and Technology. The UPC has a wide spread presence in Catalonia, with nine campuses located in Barcelona and nearby towns. The campuses are accessible, well connected by public transport and equipped with the necessary facilities and services to contribute to learning, research and university life. Founded in 1968 by grouping together existing state technical schools of Architecture and Engineering in Barcelona, which date back from the mid-19th century, it gained university status in 1971. The UPC today has a student population of over 30,000, with more than 3,000 teaching and research staff and about 2,000 administrative and service staff. There are 64 bachelor’s degrees, 68 master’s degrees, and 46 doctoral programs offered by 20 schools. Further, there are 50 international double-degree agreements with 30 universities, and about 3,000 students in international mobility programs. Research teams generate e60 million in annual funding, with a total budget of about e300 million. About the Department of Computer Science The Department of Computer Science was created in 1987 and is one of the largest depart- ments at the UPC. It has about 90 full-time faculty and 50 Ph.D. students. The Department is responsible for teaching and research in several disciplines related to the foundations of computing and their applications. The core of the teaching activities is concentrated in the undergraduate and master’s programs of the Barcelona School of Informatics, but the Department is also teaching at several other UPC schools. The Department is coordinating two Ph.D. programs: Computing and Artificial Intelligence. The research activities of the Department are organized in different research groups, covering various Computer Science disciplines. It has an active presence in the main conferences of the field and an intense participation in public and industrial projects, as well as in the creation of spin-off companies. Message from the General Chairs We are delighted to welcome you to the 15th International Symposium on Bioinformatics Research and Applications (ISBRA 2019) being held on the campus of Technical University of Catalonia in Barcelona. We are honored to serve the international community by bringing together scholars, scientists, and students from academia, laboratories, and industry to this premier meeting. We would like to thank Program Chairs Zhipeng Cai, Pavel Skums, and Min Li for putting together a terrific technical program. We thank the program committee and reviewers for their hard work, and also thank all authors who submitted papers and abstracts for their contribu- tions. We believe that you will enjoy all contributed and keynote talks this week! We are grateful to M´ıriam Hernandez´ and Ana Iba´nez˜ from the Technical University of Cat- alonia for help in coordinating ISBRA 2019. We also thank the Publications Chairs Olga Glebova, Mingon Kang, and Fa Zhang and Webmasters Filipp Rondel and Sergey Knyazev for their efforts in publicizing the symposium. We would like to recognize Technical University of Catalonia for hosting ISBRA 2019. We would like to thank Carme Casas and Xavier Masso´ for their help in making these venues available for symposium presentations and on-campus housing of participants. Last but not least, we would like to thank our sponsors: the National Science Foundation, the Department of Computer Science of the Technical University of Catalonia, and the Department of Com- puter Science of Georgia State University. Thanks to their generous support we have been able to award travel fellowships to graduate students and post-doctoral scholars from USA and abroad. We hope you will have a wonderful time at the symposium and enjoy the beautiful Barcelona. Gabriel Valiente, Technical University of Barcelona, Spain Alexander Zelikovsky, Georgia State University, USA Message from the Program Chairs On behalf of the Program Committee, we would like to welcome you to the proceedings of the 15th edition of the International Symposium on Bioinformatics Research and Applica- tions (ISBRA 2019), held in Barcelona, Spain, June 3–6, 2019. The symposium provides a forum for the exchange of ideas and results among researchers, developers, and practitioners working on all aspects of bioinformatics and computational biology and their applications. This year we received 95 submissions in response to the call for extended abstracts. The Pro- gram Committee decided to accept 22 of them for full publication in the proceedings and oral presentation at the symposium. We also accepted 23 for oral presentation. Furthermore, we received 20 submissions in response to the call for short abstracts. The technical program invited keynote talks were given by Prof. Niko Beerenwinkel from ETH Zurich,¨ Prof. Roderic Guigo´ from the Centre for Genomic Regulation, and Prof. Mar- tin Vingron from the Max Planck Institute for Molecular Genetics. We would like to thank the Program Committee members and the additional reviewers for volunteering their time to review and discuss symposium papers. We would like to extend special thanks to the Steer- ing and General Chairs of the symposium for their leadership, and to the Publicity and Local Chairs for their hard work in making ISBRA 2019 a successful event. Last but not least, we would like to thank all authors for presenting their work at the symposium. Zhipeng Cai, Georgia State University Pavel Skums, Georgia State University Min Li, Central South University Keynote Speakers Niko Beerenwinkel Associate Professor, Computational Biology, ETH Zurich Roderic Guigo´ Director, Bioinformatics Program, Centre for Genomic Regulation, Barcelona Martin Vingron Director, Max Planck Institute for Molecular Genetics, Head of Computational Molecular Biology Department Symposium Organizers Steering Committee Yi Pan (Georgia State University, USA), Chair Dan Gusfield (University of California, Davis, USA) Ion Mandoiu (University of Connecticut, USA) Marie-France Sagot (INRIA, France) Ying Xu (University of Georgia, USA) Aidong Zhang (University of Virginia, USA) General Chairs Gabriel Valiente (Technical University of Catalonia, Spain) Alexander Zelikovsky (Georgia State University, USA) Local Chair Gabriel Valiente (Technical University of Catalonia, Spain) Program Chairs Zhipeng Cai (Georgia State University, USA) Pavel Skums (Georgia State University, USA) Min Li (Central South University, China) Publicity Chairs Olga Glebova (Georgia State University, USA) Mingon Kang (Kennesaw State University, USA) Fa Zhang (Chinese Academy of Science, China) Webmasters Filipp Rondel (Georgia State University, USA) Sergey Knyazev (Georgia State University, USA) Program Committee Members Kamal Al Nasr (Tennessee State University) Fenglou Mao (National Institute of Health) Max Alekseyev (George Washington Univer- Yuri Orlovich (Belarusian State Univ.) sity) Andrei Paun (University of Bucharest) Nikita Alexeev (ITMO University) Nadia Pisanti (Universita di Pisa) Mukul S. Bansal (University of Connecticut) Yuri Porozov (ITMO University) Paola Bonizzoni (Universita di Milano- Russell Schwartz (Carnegie Mellon Univer- Bicocca) sity) Zhipeng Cai (Georgia State University) Joao Setubal (University of Sao Paulo) Hongmin Cai (South China University of Yi Shi (Shanghai Jiaotong University) Technology) Xinghua Shi (University of North Carolina at Xing Chen (Chinese Academy of Sciences) Charlotte) Xuefeng Cui (Tsinghua University) Dong Si (University of Washington) Ovidiu Daescu (Univ. of Texas at Dallas) Pavel Skums (Georgia State University) Lei Deng (Central South University) Ileana Streinu (University of Massachusetts Pufeng Du (Tianjin University) Amherst) Oliver Eulenstein (Iowa State University) Emily Su (Taipei Medical University) Xin Gao (King Abdullah University of Sci- Shiwei Sun (Chinese Academy of Sciences) ence and Technology) Sing-Hoi Sze (Texas A&M University) Xuan Guo (University of North Texas) Weitian Tong (Georgia Southern University) Olga Glebova (Georgia State University) Gabriel Valiente (Technical University of Cat- Zengyou He (Dalian University of Technol- alonia) ogy) Jianxin Wang (Central South University) Steffen Heber (North Carolina State Univ.) Li-San Wang (University of Pennsylvania) Jinling Huang (East Carolina University) Guohua Wang (Harbin Institute of Technol- Mingon Kang (Kennesaw State University) ogy) Wooyoung Kim (University of Washington) Seth Weinberg (Virginia Commonwealth Uni- Danny Krizanc (Wesleyan University) versity) Xiujuan Lei (Shaanxi Normal University) Yubao Wu (Georgia State University) Shuai-Cheng Li (City University of Hong Fangxiang Wu (University of Saskatchewan) Kong) Yufeng Wu (University of Connecticut) Yaohang Li (Old Dominion University) Zeng Xiangxiang (Xiamen University) Jing Li (Case Western Reserve University) Guoxian Yu (Southwest University) Min Li (Central South University) Ning Yu (Georgia State University) Xiaowen Liu (Indiana University) Alex Zelikovsky (Georgia State University) Bingqiang Liu (Shandong University) Le Zhang (Southwest University) Ion Mandoiu (University of Connecticut) Xuegong Zhang (Tsinghua University) Igor Mandric (University of California, Los Xing-Ming
Recommended publications
  • Ontology-Based Methods for Analyzing Life Science Data
    Habilitation a` Diriger des Recherches pr´esent´ee par Olivier Dameron Ontology-based methods for analyzing life science data Soutenue publiquement le 11 janvier 2016 devant le jury compos´ede Anita Burgun Professeur, Universit´eRen´eDescartes Paris Examinatrice Marie-Dominique Devignes Charg´eede recherches CNRS, LORIA Nancy Examinatrice Michel Dumontier Associate professor, Stanford University USA Rapporteur Christine Froidevaux Professeur, Universit´eParis Sud Rapporteure Fabien Gandon Directeur de recherches, Inria Sophia-Antipolis Rapporteur Anne Siegel Directrice de recherches CNRS, IRISA Rennes Examinatrice Alexandre Termier Professeur, Universit´ede Rennes 1 Examinateur 2 Contents 1 Introduction 9 1.1 Context ......................................... 10 1.2 Challenges . 11 1.3 Summary of the contributions . 14 1.4 Organization of the manuscript . 18 2 Reasoning based on hierarchies 21 2.1 Principle......................................... 21 2.1.1 RDF for describing data . 21 2.1.2 RDFS for describing types . 24 2.1.3 RDFS entailments . 26 2.1.4 Typical uses of RDFS entailments in life science . 26 2.1.5 Synthesis . 30 2.2 Case study: integrating diseases and pathways . 31 2.2.1 Context . 31 2.2.2 Objective . 32 2.2.3 Linking pathways and diseases using GO, KO and SNOMED-CT . 32 2.2.4 Querying associated diseases and pathways . 33 2.3 Methodology: Web services composition . 39 2.3.1 Context . 39 2.3.2 Objective . 40 2.3.3 Semantic compatibility of services parameters . 40 2.3.4 Algorithm for pairing services parameters . 40 2.4 Application: ontology-based query expansion with GO2PUB . 43 2.4.1 Context . 43 2.4.2 Objective .
    [Show full text]
  • Research Report 2006 Max Planck Institute for Molecular Genetics, Berlin Imprint | Research Report 2006
    MAX PLANCK INSTITUTE FOR MOLECULAR GENETICS Research Report 2006 Max Planck Institute for Molecular Genetics, Berlin Imprint | Research Report 2006 Published by the Max Planck Institute for Molecular Genetics (MPIMG), Berlin, Germany, August 2006 Editorial Board Bernhard Herrmann, Hans Lehrach, H.-Hilger Ropers, Martin Vingron Coordination Claudia Falter, Ingrid Stark Design & Production UNICOM Werbeagentur GmbH, Berlin Number of copies: 1,500 Photos Katrin Ullrich, MPIMG; David Ausserhofer Contact Max Planck Institute for Molecular Genetics Ihnestr. 63–73 14195 Berlin, Germany Phone: +49 (0)30-8413 - 0 Fax: +49 (0)30-8413 - 1207 Email: [email protected] For further information about the MPIMG please see our website: www.molgen.mpg.de MPI for Molecular Genetics Research Report 2006 Table of Contents The Max Planck Institute for Molecular Genetics . 4 • Organisational Structure. 4 • MPIMG – Mission, Development of the Institute, Research Concept. .5 Department of Developmental Genetics (Bernhard Herrmann) . 7 • Transmission ratio distortion (Hermann Bauer) . .11 • Signal Transduction in Embryogenesis and Tumor Progression (Markus Morkel). 14 • Development of Endodermal Organs (Heiner Schrewe) . 16 • Gene Expression and 3D-Reconstruction (Ralf Spörle). 18 • Somitogenesis (Lars Wittler). 21 Department of Vertebrate Genomics (Hans Lehrach) . 25 • Molecular Embryology and Aging (James Adjaye). .31 • Protein Expression and Protein Structure (Konrad Büssow). .34 • Mass Spectrometry (Johan Gobom). 37 • Bioinformatics (Ralf Herwig). .40 • Comparative and Functional Genomics (Heinz Himmelbauer). 44 • Genetic Variation (Margret Hoehe). 48 • Cell Arrays/Oligofingerprinting (Michal Janitz). .52 • Kinetic Modeling (Edda Klipp) . .56 • In Vitro Ligand Screening (Zoltán Konthur). .60 • Neurodegenerative Disorders (Sylvia Krobitsch). .64 • Protein Complexes & Cell Organelle Assembly/ USN (Bodo Lange/Thorsten Mielke). .67 • Automation & Technology Development (Hans Lehrach).
    [Show full text]
  • Next Generation Machine Learning Prediction of Protein Cellular Sorting
    TECHNISCHE UNIVERSITÄT MÜNCHEN Lehrstuhl für Bioinformatik Next Generation Machine Learning Prediction of Protein Cellular Sorting Tatyana Goldberg Vollständiger Abdruck der von der Fakultät für Informatik der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigten Dissertation. Vorsitzender: Univ.-Prof. Dr. Daniel Cremers Prüfer der Dissertation: 1. Univ.-Prof. Dr. Burkhard Rost 2. Prof. Dr. Yana Bromberg, The State University of New Jersey/USA 3. Univ.-Prof. Dr. Iris Antes Die Dissertation wurde am 15.12.2015 bei der Technischen Universität München eingereicht und durch die Fakultät für Informatik am 22.03.2016 angenommen. ii Table of Contents Abstract ............................................................................................................................................. v Zusammenfassung ........................................................................................................................... vii Acknowledgments ............................................................................................................................ ix List of publications ........................................................................................................................... xi List of Figures and Tables ............................................................................................................... xiii 1. Introduction ...............................................................................................................................
    [Show full text]
  • First Analysis Steps
    FirstFirst analysisanalysis stepssteps o quality control and optimization o calibration and error modeling o data transformations Wolfgang Huber Dep. of Molecular Genome Analysis (A. Poustka) DKFZ Heidelberg Acknowledgements Anja von Heydebreck Günther Sawitzki Holger Sültmann, Klaus Steiner, Markus Vogt, Jörg Schneider, Frank Bergmann, Florian Haller, Katharina Finis, Stephanie Süß, Anke Schroth, Friederike Wilmer, Judith Boer, Martin Vingron, Annemarie Poustka Sandrine Dudoit, Robert Gentleman, Rafael Irizarry and Yee Hwa Yang: Bioconductor short course, summer 2002 and many others a microarray slide Slide: 25x75 mm Spot-to-spot: ca. 150-350 µm 4 x 4 or 8x4 sectors 17...38 rows and columns per sector ca. 4600…46000 probes/array sector: corresponds to one print-tip Terminology sample: RNA (cDNA) hybridized to the array, aka target, mobile substrate. probe: DNA spotted on the array, aka spot, immobile substrate. sector: rectangular matrix of spots printed using the same print-tip (or pin), aka print-tip-group plate: set of 384 (768) spots printed with DNA from the same microtitre plate of clones slide, array channel: data from one color (Cy3 = cyanine 3 = green, Cy5 = cyanine 5 = red). batch: collection of microarrays with the same probe layout. Raw data scanner signal resolution: 5 or 10 mm spatial, 16 bit (65536) dynamical per channel ca. 30-50 pixels per probe (60 µm spot size) 40 MB per array Raw data scanner signal resolution: 5 or 10 mm spatial, 16 bit (65536) dynamical per channel ca. 30-50 pixels per probe (60 µm spot size) 40 MB per array Image Analysis Raw data scanner signal resolution: 5 or 10 mm spatial, 16 bit (65536) dynamical per channel ca.
    [Show full text]
  • Probabilistic Models for Gene Silencing Data
    Probabilistic Models for Gene Silencing Data Florian Markowetz Dezember 2005 Dissertation zur Erlangung des Grades eines Doktors der Naturwissenschaften (Dr. rer. nat.) am Fachbereich Mathematik und Informatik der Freien Universit¨atBerlin 1. Referent: Prof. Dr. Martin Vingron 2. Referent: Prof. Dr. Klaus-Robert M¨uller Tag der Promotion: 26. April 2006 Preface Acknowledgements This work was carried out in the Computational Diagnostics group of the Department of Computational Molecular Biology at the Max Planck Institute for Molecular Genetics in Berlin. I thank all past and present colleagues for the good working atmosphere and the scientific—and sometimes maybe not so scientific—discussions. Especially, I am grateful to my supervisor Rainer Spang for suggesting the topic, his scientific support, and the opportunity to write this thesis under his guidance. I thank Michael Boutros for providing the expression data and for introducing me to the world of RNAi when I visited his lab at the DKFZ in Heidelberg. I thank Anja von Heydebreck, J¨org Schultz, and Martin Vingron for their advice and counsel as members of my PhD commitee. During the time I worked on this thesis, I enjoyed fruitful discussions with many people. In particular, I gratefully acknowledge Jacques Bloch, Steffen Grossmann, Achim Tresch, and Chen-Hsiang Yeang for their contributions. Special thanks go to Viola Gesellchen, Britta Koch, Stefanie Scheid, Stefan Bentink, Stefan Haas, Dennis Kostka, and Stefan R¨opcke, who read drafts of this thesis and greatly improved it by their comments. Publications Parts of this thesis have been published before. Chapter 2 grew out of lectures I gave in 2005 at the Instiute for Theoretical Physics and Mathematics (IPM) in Tehran, Iran, and at the German Conference on Bioinformatics (GCB), Hamburg, Germany.
    [Show full text]
  • Research Report 2009 Max Planck Institute for Molecular Genetics, Berlin Imprint | Research Report 2009
    Research Report 2009 Max Planck Institute for Molecular Genetics, Berlin Imprint | Research Report 2009 Published by the Max Planck Institute for Molecular Genetics (MPIMG), Berlin, Germany, December 2009 Editorial Board: B.G. Herrmann, H. Lehrach, H.-H. Ropers, M. Vingron Conception & coordination: Patricia Marquardt Photography: Katrin Ullrich, MPIMG; David Ausserhofer Scientific Illustrations: MPIMG Production: Thomas Didier, Meta Data Contact: Max Planck Institute for Molecular Genetics Ihnestr. 63 – 73 14195 Berlin Germany Phone: +49 (0)30 8413-0 Fax: +49 (0)30 8413-1207 Email: [email protected] For further information about the MPIMG, please visit http://www.molgen.mpg.de MPI for Molecular Genetics Research Report 2009 Research Report 2009 1 Max Planck Institute for Molecular Genetics Berlin, December 2009 The Max Planck Institute for Molecular Genetics 2 MPI for Molecular Genetics Research Report 2009 Table of contents Organisational structure . 6 The Max Planck Institute for Molecular Genetics . 7 Mission . 7 Development of the Institute. 7 Research Concept . 8 Department of Developmental Genetics (Bernhard Herrmann) . 9 Transmission ratio distortion (H. Bauer) . 13 Regulatory Networks of Mesoderm Formation & Somitogenesis (B. Herrmann) . 17 Signal Transduction in Embryogenesis and Tumour Progression (M. Morkel) . 22 Organogenesis (H. Schrewe) . 26 General information about the whole Department . 29 Department of Vertebrate Genomics (Hans Lehrach) . 33 Molecular Embryology and Aging (J. Adjaye) . 40 Neuropsychiatric Genetics (L. Bertram) . 46 Automation (A. Dahl, W. Nietfeld, H. Seitz) . 49 Nucleic Acid-based Technologies (J. Glökler) . 55 Bioinformatics (R. Herwig) . 60 Comparative and Functional Genomics (H. Himmelbauer) . .65 Genetic Variation, Haplotypes & Genetics of Complex Diseases (M. Hoehe) . 69 3 in vitro Ligand Screening (Z.
    [Show full text]
  • The Myth of Junk DNA
    The Myth of Junk DNA JoATN h A N W ells s eattle Discovery Institute Press 2011 Description According to a number of leading proponents of Darwin’s theory, “junk DNA”—the non-protein coding portion of DNA—provides decisive evidence for Darwinian evolution and against intelligent design, since an intelligent designer would presumably not have filled our genome with so much garbage. But in this provocative book, biologist Jonathan Wells exposes the claim that most of the genome is little more than junk as an anti-scientific myth that ignores the evidence, impedes research, and is based more on theological speculation than good science. Copyright Notice Copyright © 2011 by Jonathan Wells. All Rights Reserved. Publisher’s Note This book is part of a series published by the Center for Science & Culture at Discovery Institute in Seattle. Previous books include The Deniable Darwin by David Berlinski, In the Beginning and Other Essays on Intelligent Design by Granville Sewell, God and Evolution: Protestants, Catholics, and Jews Explore Darwin’s Challenge to Faith, edited by Jay Richards, and Darwin’s Conservatives: The Misguided Questby John G. West. Library Cataloging Data The Myth of Junk DNA by Jonathan Wells (1942– ) Illustrations by Ray Braun 174 pages, 6 x 9 x 0.4 inches & 0.6 lb, 229 x 152 x 10 mm. & 0.26 kg Library of Congress Control Number: 2011925471 BISAC: SCI029000 SCIENCE / Life Sciences / Genetics & Genomics BISAC: SCI027000 SCIENCE / Life Sciences / Evolution ISBN-13: 978-1-9365990-0-4 (paperback) Publisher Information Discovery Institute Press, 208 Columbia Street, Seattle, WA 98104 Internet: http://www.discoveryinstitutepress.com/ Published in the United States of America on acid-free paper.
    [Show full text]
  • BMC Bioinformatics Biomed Central
    BMC Bioinformatics BioMed Central Introduction Open Access The Seventh Asia Pacific Bioinformatics Conference (APBC2009) Michael Q Zhang*1,2, Michael S Waterman3,2 and Xuegong Zhang2 Address: 1Cold Spring Harbor Laboratory, NY, USA, 2Tsinghua University, Beijing, PR China and 3University of Southern California, CA, USA Email: Michael Q Zhang* - [email protected]; Michael S Waterman - [email protected]; Xuegong Zhang - [email protected] * Corresponding author from The Seventh Asia Pacific Bioinformatics Conference (APBC 2009) Beijing, China. 13–16 January 2009 Published: 30 January 2009 BMC Bioinformatics 2009, 10(Suppl 1):S1 doi:10.1186/1471-2105-10-S1-S1 <supplement> <title> <p>Selected papers from the Seventh Asia-Pacific Bioinformatics Conference (APBC 2009)</p> </title> <editor>Michael Q Zhang, Michael S Waterman and Xuegong Zhang</editor> <note>Research</note> </supplement> This article is available from: http://www.biomedcentral.com/1471-2105/10/S1/S1 © 2009 Zhang et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Asia Pacific Bioinformatics Conference (APBC) series, • Michael S. Waterman: Sequence analysis using Eulerian founded in 2003, is an annual international forum for graphs exploring research, development and applications of Bio- informatics and Computational Biology. The Seventh Asia • Michael B. Eisen: Understanding and exploiting the evolu- Pacific Bioinformatics Conference (APBC2009) was held tion of Drosophila regulatory sequences at Tsinghua University, Beijing, China during January 13– 16 in 2009.
    [Show full text]
  • Transformer Neural Networks for Protein Prediction Tasks
    bioRxiv preprint doi: https://doi.org/10.1101/2020.06.15.153643; this version posted June 16, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. TRANSFORMING THE LANGUAGE OF LIFE:TRANSFORMER NEURAL NETWORKS FOR PROTEIN PREDICTION TASKS Ananthan Nambiar ∗ Maeve Heflin∗ Department of Bioengineering Department of Computer Science Carl R. Woese Inst. for Genomic Biol. Carl R. Woese Inst. for Genomic Biol. University of Illinois at Urbana-Champaign University of Illinois at Urbana-Champaign Urbana, IL 61801 Urbana, IL 61801 [email protected] Simon Liu∗ Sergei Maslov Department of Computer Science Department Bioengineering Carl R. Woese Inst. for Genomic Biol. Department of Physics University of Illinois at Urbana-Champaign Carl R. Woese Inst. for Genomic Biol. Urbana, IL 61801 University of Illinois at Urbana-Champaign Urbana, IL 61801 Mark Hopkinsy Anna Ritzy Department of Computer Science Department of Biology Reed College Reed College Portland, OR 97202 Portland, OR 97202 June 16, 2020 ABSTRACT The scientific community is rapidly generating protein sequence information, but only a fraction of these proteins can be experimentally characterized. While promising deep learning approaches for protein prediction tasks have emerged, they have computational limitations or are designed to solve a specific task. We present a Transformer neural network that pre-trains task-agnostic sequence representations. This model is fine-tuned to solve two different protein prediction tasks: protein family classification and protein interaction prediction.
    [Show full text]
  • Computational Biology
    Computational Biology Volume 18 Editors-in-Chief Andreas Dress, CAS-MPG Partner Institute for Computational Biology, Shanghai, China Michal Linial, Hebrew University of Jerusalem, Jerusalem, Israel Olga Troyanskaya, Princeton University, Princeton, NJ, USA Martin Vingron, Max Planck Institute for Molecular Genetics, Berlin, Germany Editorial Board Robert Giegerich, University of Bielefeld, Bielefeld, Germany Janet Kelso, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany Gene Myers, Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany Pavel A. Pevzner, University of California, San Diego, CA, USA Advisory Board Gordon Crippen, University of Michigan, Ann Arbor, MI, USA Joe Felsenstein, University of Washington, Seattle, WA, USA Dan Gusfield, University of California, Davis, CA, USA Sorin Istrail, Brown University, Providence, RI, USA Thomas Lengauer, Max Planck Institute for Computer Science, Saarbrücken, Germany Marcella McClure, Montana State University, Bozeman, MO, USA Martin Nowak, Harvard University, Cambridge, MA, USA David Sankoff, University of Ottawa, Ottawa, ON, Canada Ron Shamir, Tel Aviv University, Tel Aviv, Israel Mike Steel, University of Canterbury, Christchurch, New Zealand Gary Stormo, Washington University in St. Louis, St. Louis, MO, USA Simon Tavaré, University of Cambridge, Cambridge, UK Tandy Warnow, University of Texas, Austin, TX, USA Lonnie Welch, Ohio University, Athens, OH, USA The Computational Biology series publishes the very latest, high-quality research devoted
    [Show full text]
  • Research Report 2018
    MPIMG RESEARCH REPORT 2018 Max Planck Institute for Molecular Genetics Impairment of cerebral organoid development induced by microRNA overexpression Shown are day 30 organoids following microRNA overexpression (left), and their matched day 30 untreated organoids (right), stained for the neural stem/progenitor cell marker SOX2 (red) and the neuronal differentiation marker DCX (green). The development of cerebral organoid cytoarchitecture can be clearly seen in control organoids, while it appears stalled following microRNA overexpression. The Elkabetz group is studying the mechanisms by which microRNAs govern cerebral development. Picture: Elkabetz Lab / Human Brain and Neural Stem Cell Studies MPIMG RESEARCH REPORT 2018 Max Planck Institute for Molecular Genetics Michael Müller, Governing Mayor of Berlin, at a visit at the MPIMG during the Long Night of Sciences 2017 (from left to right: Hans Lehrach, Martin Vingron, Michael Müller, Bernhard Herrmann, Alexander Meissner) 4 The Max Planck Institute for Molecular Genetics – Research Report 2018 Editorial We are very pleased to present the 2018 Research Report of the Max Planck Institute for Molecular Genetics (MPIMG), covering the time period from 2015 to mid-2018. The Insti- tute looks back on three challenging and productive years. The first important step in the transition and scientific reorientation following the retire- ment of H.-Hilger Ropers and Hans Lehrach in 2014 has been completed successfully by the appointment of Alexander Meissner to director of the new department “Genome Regulation”. This recruitment marks the beginning of a new important track of research at the MPIMG with a strong emphasis on gene regulatory mechanisms in stem cell differen- tiation, embryogenesis and human disease.
    [Show full text]
  • Genes and Men
    Genes and Men 50 Years of Research at the Max Planck Institute for Molecular Genetics Genes and Men 50 Years of Research at the Max Planck Institute for Molecular Genetics contents 4 Preface martin vingron 6 Molecular biology in Germany in the founding period of the Max Planck Institute for Molecular Genetics hans-jörg rheinberger 16 some questions to olaf pongs und volkmar braun 18 From the Kaiser Wilhelm Institute of Anthropology, Human Heredity and Eugenics to the Max Planck Institute for Molecular Genetics carola sachse 32 some questions to kenneth timmis und reinhard lührmann 34 Remembering Heinz Schuster and 30 years of the Max Planck Institute for Molecular Genetics karin moelling 48 some questions to regine kahmann und klaus bister 50 Ribosome research at the Max Planck Institute for Molecular Genetics in Berlin-Dahlem – The Wittmann era knud h. nierhaus 60 some questions to tomas pieler und albrecht bindereif 62 “ I couldn’t imagine anything better.” thomas a. trautner interviewed by ralf hahn 74 some questions to claus scheidereit und adam antebi 76 50 years of research at the Max Planck Institute for Molecular Genetics – The transition towards human genetics karl sperling 90 some questions to ann ehrenhofer-murray und andrea vortkamp 92 Genome sequencing and the pathway from gene sequences to personalized medicine russ hodge 106 some questions to edda klipp und ulrich stelzl 108 Transformation of biology to an information science jens g. reich 120 some questions to sylvia krobitsch und sascha sauer 122 Understanding the rules behind genes catarina pietschmann 134 some questions to ho-ryun chung und ulf ørom 136 Time line about the development of molecular biology and the Max Planck Institute for Molecular Genetics 140 Imprint preface research in molecular biology and genetics has undergone a tremen- dous development since the middle of the last century, marked in particular by the discovery of the structure of DNA by Watson and Crick and the interpretation of the genetic code by Holley, Khorana and Nirenberg.
    [Show full text]