EMBL-EBI Annual Scientific Report 2012 Table of Contents


EMBL-European Bioinformatics Institute
Annual Scientific Report 2012

For more information about EMBL-EBI please contact: [email protected]
This publication was produced by EMBL-EBI’s External Relations Team.
© 2013 EMBL-European Bioinformatics Institute

Table of contents

Introduction & overview
Organisation
Services
  Genes, genomes and variations
  Molecular atlas
  Proteins and protein families
  Molecular and cellular structures
  Molecular systems
  Chemical biology
  Cross-domain tools and resources
Research
  Bertone group
  Birney group
  Enright group
  Goldman group
  Le Novère group
  Luscombe group
  Marioni group
  Rebholz-Schuhmann group
  Saez-Rodriguez group
  Thornton group
The EMBL International PhD Programme
Support
Financial report
Scientific advisory boards
Major database collaborations
Publications

Foreword

Welcome to the 2012 Annual Scientific Report of the EMBL-European Bioinformatics Institute. This document gives an overview of our institute’s major achievements during the calendar year, reflecting progress in our mission areas of service, research, training, industry and European co-ordination.

If you have visited the Genome Campus over the past year, you will have noticed a new building under construction. We were fortunate to receive funding in 2011 from the UK government's Large Facilities Capital Fund to expand our activities, and in June 2012 EMBL-EBI Director Janet Thornton broke the ground at a landmark ceremony. The rapid progress of construction means that the building is scheduled for completion in autumn 2013.
The South Building will house the ELIXIR Hub as well as an Industry and Innovation Suite, which will promote the translation of biological information to applications in medicine, biotechnology and the environment.

Our funding remained stable in 2012, despite another year of austerity in many countries, and our staff numbers also remain steady. Yet we continued to handle an exponential rise in biological data and unprecedented growth in the use of our resources.

All of our efforts rely on close international collaborations. The deposition of new data, the daily exchange of information between data resources, the joint development of software tools, the sharing of curation tasks and the challenges of collaborative research allow us to build an extensive network of colleagues. We look forward to strengthening these links in the coming year and, as always, to creating new collaborations.

Janet Thornton, Director
Rolf Apweiler, Joint Associate Director
Ewan Birney, Joint Associate Director

EMBL-EBI 2012

It was a year of transition, as we bade a fond farewell to Associate Director Graham Cameron, and welcomed Rolf Apweiler and Ewan Birney as joint Associate Directors. It was also a year of honours, as Drs Apweiler and Birney were appointed members of EMBO and our Director, Professor Janet Thornton, was made Dame Commander of the British Empire in recognition of her contribution to bioinformatics.

Our research programme went through major changes as well, as we gained new group leaders Pedro Beltrao from the University of California, San Francisco; Oliver Stegle from the Max-Planck Institutes in Tübingen; and Sarah Teichmann from the MRC Laboratory of Molecular Biology in Cambridge. In the same year, we were very sad to say farewell to three talented and experienced group leaders, who reached the end of their EMBL contracts: Nick Luscombe, now at University College London and the Cancer Research UK London Research Institute; Nicolas Le Novère, who has moved a short way to the Babraham Institute, Cambridge; and Dietrich Rebholz-Schuhmann, now at the University of Zurich’s Institute of Computational Linguistics.

In 2012 our protein services saw a change of leadership, as Alex Bateman left the Wellcome Trust Sanger Institute and moved across the Genome Campus to take on the responsibilities formerly held by Rolf Apweiler. His team brings with them the Pfam, Rfam, TreeFam and MEROPS databases. A new Variation Team, led by newcomer Justin Paschall, from the National Center for Biotechnology Information (NCBI) in the US, is creating a unified interface for the European Genome-phenome Archive and the Database of Genomic Variants archive.

Perhaps 2012’s largest project, involving every team at EMBL-EBI, was the redesign of the website with user experience as the driver. The new global website, to be launched in 2013, provides an easily navigable interface to EBI services and features consistent functionality which upholds the unique identity of resource brands. Many teams expended significant time and effort creating and testing new user interfaces, for example UniProt and InterPro, and their findings contributed significantly to the overall design strategy.

Figure 1. In 2012 Professor Janet Thornton was awarded the Commander of the British Empire medal.

Research

The ENCODE project dominated scientific news worldwide in September. Co-led by Ewan Birney at EMBL-EBI and funded by the National Human Genome Research Institute in the US, the project comprised over 400 scientists in 32 labs throughout the world who produced a detailed map of genome function. ENCODE’s staggering output inspired a new publishing model: upwards of 30 papers were published under open-access license in several different peer-reviewed journals, with the contents linked by topic and united for optimum exploration in a single interface, provided by Nature (ENCODE Project Consortium, 2012). The launch of ENCODE was all about public engagement, with a press conference featuring an exhibit at the London Science Museum and aerial acrobats re-enacting gene expression. The event captured the imagination of the world’s top-tier television and print media.

The 1000 Genomes Project published a map of normal human variation: a major milestone for the Flicek team, which co-led the project’s data co-ordination centre with the National Center for Biotechnology Information (NCBI). This vast dataset is freely available via the Amazon Web Services Cloud (The 1000 Genomes Project Consortium, 2012).

Petra Schwalie in Paul Flicek’s research group led efforts to develop a new, integrated model of the evolution of the transcription factor CTCF. Her work, carried out with Duncan Odom’s group at the University of Cambridge, explains the origin of some 5000 highly conserved CTCF binding events in mammals and sheds light on how these binding sites are moved and amplified (Schmidt et al., 2012).

Another gene expression study in Nick Luscombe’s lab found that an organism’s most valuable assets are protected most carefully. Their work demonstrated that mutation rates in bacteria vary by more than an order of magnitude, with highly expressed genes having the lowest mutation rates (Martincorena et al., 2012).

Janet Thornton’s group continues its painstaking work to understand the evolution of enzyme mechanisms. Nick Furnham, a postdoc in Janet’s group, worked with Christine Orengo’s group at University College London to develop a new resource called FunTree and used it to study 276 enzyme superfamilies. This allowed them to determine the extent to which enzyme functions have changed over the course of evolution, findings that have important implications for the development of new therapeutics.

Services

After broad consultation with the service teams and support groups, we reorganised our services into ‘clusters’ to reflect the research communities they serve. The clusters provide a better framework to make strategic decisions whilst still enabling a deep engagement with the communities.

The new clusters are: genes, genomes and variation, led by Paul Flicek; molecular atlas, led by Alvis Brazma; proteins and protein families, led by Alex Bateman; molecular and cellular structures, led by Gerard Kleywegt; chemical biology, led by John Overington; molecular systems, also led by John Overington; and cross-domain tools and resources, unified under Rolf Apweiler.

As a result of the exercise, the ArrayExpress Archive, Expression Atlas, PRIDE and MetaboLights resources are now clustered together, in the interests of combining them into a single portal that provides access to integrated views of -omics data.

Figure 2. The new South building, which will house the ELIXIR Technical Hub, under construction on the Genome Campus.

New service developments

• Understanding the basis of crop diseases was the driver behind the launch of PhytoPath, a new portal for plant pathogen data, while the Kersey team’s involvement in the transPLANT project gave rise to a new integrative portal for plant genomics data.
• The EBI Metagenomics resource was officially launched by the Hunter and Cochrane teams, enabling users to submit, archive and analyse genomic information from environments containing many species.
• The Enzyme Portal, launched in 2012, draws together

Training

• Our user-training programme continued to develop alongside our service offerings and in 2012 actively involved more than 150 members of EMBL staff in 232 events, reaching an audience of over 9770 people in 35 countries on six continents. The Train online portal saw a rapidly rising number of users in its first year, and our hands-on programme offered new courses on -omics-based data analysis for plant biologists and a combined experimental/computational metagenomics course, amongst many others. The Cochrane and Brazma teams