Sze-To Hoyin.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Sze-To Hoyin.Pdf Discovering Patterns from Sequences with Applications to Protein-Protein and Protein-DNA Interaction by Ho Yin Sze-To A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy in Systems Design Engineering Waterloo, Ontario, Canada, 2018 c Ho Yin Sze-To 2018 Examining Committee Membership The following served on the Examining Committee for this thesis. The decision of the Examining Committee is by majority vote. External Examiner Luis Rueda Professor Supervisor(s) Andrew K.C. Wong Distinguished Professor Emeritus Daniel Stashuk Professor Internal Member(s) Paul Fieguth Professor Department Chair Alexander Wong Associate Professor, P.Eng. Canada Research Chair in Medical Imaging Systems Internal-external Member Bin Ma Professor University Research Chair ii Author's Declaration This thesis consists of material all of which I authored or co-authored: see Statement of Contributions included in the thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. iii Statement of Contributions Content from 5 papers are used in this thesis. I was the co-author with major contri- butions on designing the methods, implementation and writing the papers: • Sze-To, A., & Wong, A. K. (2017). Pattern-Directed Aligned Pattern Clustering. In Bioinformatics and Biomedicine (BIBM), 2017 IEEE International Conference on. IEEE. Acceptance Rate: 19%. Won IEEE TCCLS Student Award. Incorpo- rated in Chapter 3 of this thesis. [126] • Sze-To, A., & Wong, A. K. (2018). Discovering Patterns from Sequences Using Pattern-Directed Aligned Pattern Clustering. NanoBioScience, IEEE Transactions on. Incorporated in Chapter 3 of this thesis. [127] • Lee, E. S. A., Sze-To, H. Y. A., Wong, M. H., Leung, K. S., Lau, T. C. K., & Wong, A. K. (2017). Discovering protein-dna binding cores by aligned pattern clustering. Computational biology and bioinformatics, IEEE/ACM transactions on, 14(2), 254-263. Incorporated in Chapter 4 of this thesis. [75] • Sze-To, A., Fung, S., Lee, E. S. A., & Wong, A. K. (2015, November). Predicting Protein-protein interaction using co-occurring Aligned Pattern Clusters. In Bioin- formatics and Biomedicine (BIBM), 2015 IEEE International Conference on (pp. 55-60). IEEE. Acceptance Rate: 19%. Incorporated in Chapter 5 of this thesis. [124] • Sze-To, A., Fung, S., Lee, E. S. A., & Wong, A. K. (2016). Prediction of Protein- Protein Interaction via Co-Occurring Aligned Pattern Clusters. Methods. Incorpo- rated in Chapter 5 of this thesis. [125] iv Abstract Understanding Protein-Protein and Protein-DNA interaction is of fundamental impor- tance in deciphering gene regulation and other biological processes in living cells. Tra- ditionally, new interaction knowledge is discovered through biochemical experiments that are often labor-intensive, expensive and time-consuming. Thus, computational approaches are preferred. Due to the abundance of sequence data available today, sequence-based interaction analysis becomes one of the most readily applicable and cost-effective methods. One important problem in sequence-based analysis is to identify the functional regions from a set of sequences within the same family or demonstrating similar biological func- tions in experiments. The rationale is that throughout evolution the functional regions normally remain conserved (intact), allowing them to be identified as patterns from a set of sequences. However, there are also mutations such as substitution, insertion, deletion in these functional regions. Existing methods, such as those based on position weight ma- trices, assume that the functional regions have a fixed width and thus cannot not identify functional regions with mutations, particularly those with insertion or deletion mutations. Recently, Aligned Pattern Clustering (APCn) was introduced to identify functional regions as Aligned Pattern Clusters (APCs) by grouping and aligning patterns with variable width. Nevertheless, APCn cannot discover functional regions with substitution, insertion and/or deletion mutations, since their frequencies of occurrences are too low to be considered as patterns. To overcome such an impasse, this thesis proposes a new APC discovery algorithm known as Pattern-Directed Aligned Pattern Clustering (PD-APCn). By first discovering seed patterns from the input sequence data, with their sequence positions located and recorded on an address table, PD-APCn can use the seed patterns to direct the incremental extension of functional regions with minor mutations. By grouping the aligned extended patterns, PD-APCn can recruit patterns adaptively and efficiently with variable width without relying on exhaustive optimal search. Experiments on synthetic datasets with different sizes and noise levels showed that PD-APCn can identify the implanted pattern with mutations, outperforming the popular existing motif-finding software MEME with much higher recall and Fmeasure over a computational speed-up of up to 665 times. When applying PD-APCn on datasets from Cytochrome C and Ubiquitin protein families, all key binding sites conserved in the families were captured in the APC outputs. In sequence-based interaction analysis, there is also a lack of a model for co-occurring functional regions with mutations, where co-occurring functional regions between interac- tion sequences are indicative of binding sites. This thesis proposes a new representation v model Co-Occurrence APCs to capture co-occurring functional regions with mutations from interaction sequences in database transaction format. Applications on Protein-DNA and Protein-Protein interaction validated the capability of Co-Occurrence APCs. In Protein-DNA interaction, a new representation model, Protein-DNA Co-Occurrence APC, was developed for modeling Protein-DNA binding cores. The new model is more com- pact than the traditional one-to-one pattern associations, as it packs many-to-many associ- ations in one model, yet it is detailed enough to allow site-specific variants. An algorithm, based on Co-Support Score, was also developed to discover Protein-DNA Co-Occurrence APCs from Protein-DNA interaction sequences. This algorithm is 1600x faster in run-time than its contemporaries. New Protein-DNA binding cores indicated by Protein-DNA Co- Occurrence APCs were also discovered via homology modeling as a proof-of-concept. In Protein-Protein interaction, a new representation model, Protein-Protein Co-Occurrence APC, was developed for modeling the co-occurring sequence patterns in Protein-Protein Interaction between two protein sequences. A new algorithm, WeMine-P2P, was developed for sequence-based Protein-Protein Interaction machine learning prediction by construct- ing feature vectors leveraging Protein-Protein Co-Occurrence APCs, based on novel scores such as Match Score, MaxMatch Score and APC-PPI score. Through 40 independent ex- periments, it outperformed the well-known algorithm, PIPE2, which also uses co-occurring functional regions while not allowing variable widths and mutations. Both applications on Protein-Protein and Protein-DNA interaction have indicated the potential use of Co- Occurrence APC for exploring other types of biosequence interaction in the future. vi Acknowledgements I would like to thank all the important people who made this thesis possible. Firstly, I would like to express my gratitude to my supervisors, Professor Andrew K. C. Wong and Professor Daniel Stashuk who guide and support me in my research. Professor Wong, thank you very much for providing me with opportunities in studying in University of Waterloo as an international student. I would not be able to study here without your support and assistance. Thank you for your patience and understanding. Your advice helps me to improve, and guides me throughout the entire Ph.D. process. Professor Daniel Stashuk, thank you very much for your consistent encouragement. Your words always boost my confidence, particularly when I need to face difficulties. I also appreciate your help in providing me with high-end computational equipment to run experiments. Secondly, I would like to thank my Ph.D. committee members Professor Paul Fieguth, from Department of Systems Design Engineering, Prof. Alexander Wong, from Department of Systems Design Engineering, Professor Bin Ma, from Cheriton School of Computer Science, for their time and commitment contributed in reviewing this thesis. I would also like to thank Professor Luis Rueda, from School of Computer Science in University of Windsor, for accepting reviewing this thesis, and sparing his vacation time to participate in my PhD defense. Thirdly, I would like to thank all my colleague whom I have worked and studied with at University of Waterloo, especially Dr. En-Shiun Annie Lee and Sanderz Fung for their outstanding collaborative work, as well as Dr. Dennis Zhuang, for his support in providing me with the relevant toolboxes. I would also like to specially thank Shawn Zhexuan Wang, who supported me tirelessly in my Ph.D study. Last but not least, I would like to thank Professor Fakhri Karray, the director of Centre for Pattern Analysis and Machine Intelligence, who granted me access to the laboratory, allowing me to conduct research in the laboratory. vii Dedication To my beloved parents, My beloved younger sister, My mentors and supporters throughout my life. viii Table of Contents Author's Declaration iii Abstract v Acknowledgements
Recommended publications
  • EYAL AKIVA, Phd
    Eyal Akiva CV, Nov. 2017 EYAL AKIVA, PhD Department of Bioengineering and Therapeutic Sciences Phone +1-650-504-9008 University of California at San Francisco Email [email protected] 1700 4th street, San Francisco, Web www.babbittlab.ucsf.edu/eakiva CA, USA EDUCATION 2012-2017 Post-doctoral fellowship at UCSF, Dept. Of Bioengineering and Therapeutic Sciences. Host: Prof. Patricia Babbitt. 2010-2012 Post-doctoral fellowship at UCSF, Dept. Of Bioengineering and Therapeutic Sciences. Host: Prof. Tanja Kortemme. 2004-2010 PhD at The Hebrew University of Jerusalem (Israel), bioinformatics. Host: Prof. Hanah Margalit. “Various Aspects of Modularity in Protein-Protein Interaction". 2001-2004 MSc at The Hebrew University of Jerusalem (Israel), bioinformatics and human genetics. Host: Prof. Muli Ben-Sasson. “Exploiting the Exploiters: Identification of Virus-Host Pep- tide Mimicry as a Source for Modules of Functional Significance”. MAGNA CUM LAUDE. 1997-2000 BSc at Bar-Ilan University (Israel), biology (major) and computer science (minor). Final project advisor: Prof. Ramit Mehr “Modeling the Evolution of the Immune System: a Sim- ulation of the Evolution of Genes that Encode the Variable Regions of Immunoglobulins”. MAGNA CUM LAUDE. 1996-1997 First year of "Industrial Engineering and Management" studies, Tel-Aviv University, Israel. OTHER WORK EXPERIENCE 2000-01 ‘Do-coop technologies’: Team leader and chemistry/microbiology researcher; development of biological applications and manufacture of proprietary nanoparticles (Or Yehuda, Israel and Tel-Aviv University (Prof. Eshel Ben-Jacob’s lab at the school of physics)). FUNDING, HONORS AND AWARDS 2017 Grant: Co-PI, “Utilizing metagenomic sequences for enzyme function prediction”, Joint Genome Institute (US Department of Energy) (http://jgi.doe.gov/doe-user-facilities-ficus- join-forces-to-tackle-biology-big-data/).
    [Show full text]
  • Annual Scientific Report 2011 Annual Scientific Report 2011 Designed and Produced by Pickeringhutchins Ltd
    European Bioinformatics Institute EMBL-EBI Annual Scientific Report 2011 Annual Scientific Report 2011 Designed and Produced by PickeringHutchins Ltd www.pickeringhutchins.com EMBL member states: Austria, Croatia, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, United Kingdom. Associate member state: Australia EMBL-EBI is a part of the European Molecular Biology Laboratory (EMBL) EMBL-EBI EMBL-EBI EMBL-EBI EMBL-European Bioinformatics Institute Wellcome Trust Genome Campus, Hinxton Cambridge CB10 1SD United Kingdom Tel. +44 (0)1223 494 444, Fax +44 (0)1223 494 468 www.ebi.ac.uk EMBL Heidelberg Meyerhofstraße 1 69117 Heidelberg Germany Tel. +49 (0)6221 3870, Fax +49 (0)6221 387 8306 www.embl.org [email protected] EMBL Grenoble 6, rue Jules Horowitz, BP181 38042 Grenoble, Cedex 9 France Tel. +33 (0)476 20 7269, Fax +33 (0)476 20 2199 EMBL Hamburg c/o DESY Notkestraße 85 22603 Hamburg Germany Tel. +49 (0)4089 902 110, Fax +49 (0)4089 902 149 EMBL Monterotondo Adriano Buzzati-Traverso Campus Via Ramarini, 32 00015 Monterotondo (Rome) Italy Tel. +39 (0)6900 91402, Fax +39 (0)6900 91406 © 2012 EMBL-European Bioinformatics Institute All texts written by EBI-EMBL Group and Team Leaders. This publication was produced by the EBI’s Outreach and Training Programme. Contents Introduction Foreword 2 Major Achievements 2011 4 Services Rolf Apweiler and Ewan Birney: Protein and nucleotide data 10 Guy Cochrane: The European Nucleotide Archive 14 Paul Flicek:
    [Show full text]
  • Automated Function Prediction SIG Meeting July 20Th 2013, Berlin, Germany ISMB/ECCB 2013
    Automated Function Prediction SIG Meeting July 20th 2013, Berlin, Germany ISMB/ECCB 2013 Keynote Speakers Alex Bateman European Bioinformatics Institute, UK. Using protein domains and families for functional prediction Patricia Babbitt University of California, San Francisco, USA. Protein similarity networks: Identification of functional trends from the context of sequence similarity Anna Tramontano University of Rome, “La Sapienza”, Italy. CASP: aging or growing up? Keith Dunker Indiana University, USA. Functions of Intrinsically Disordered Proteins Automated Function Prediction SIG 2013 Programme 08:30 Welcome from the organisers 08:45 Keynote – Keith Dunker, Indiana University, USA. Functions of Intrinsically Disordered Proteins 09:15 Giorgio Valentini, Università degli Studi di Milano, Italy. Scalable Network-based Learning Methods for Automated Function Prediction based on the Neo4j Graph-database 09:35 Joachim Bargsten, Wageningen University, The Netherlands. Integrated network- and sequence-based protein function prediction across the plant kingdom 09:55 Christos Ouzounis, Institute of Applied Biosciences, Greece. We still haz a job: genome annotashuns 10:15 Coffee break 10:45 Keynote – Patricia Babbitt, University of California, San Francisco, USA. Protein similarity networks: Identification of functional trends from the context of sequence similarity 11:15 Rachael Huntley, European Bioinformatics Institute, UK. The Gene Ontology Resource – Common Misconceptions 11:35 Nives Skunca, ETH Zurich, Switzerland. Assessing protein function predictions in light of the Open World Assumption 11:55 Noah Youngs, New York University, USA. Negative Example Selection in Protein Function Prediction 12:15 Lunch 13:30 Keynote – Anna Tramontano, University of Rome “La Sapienza”, Italy. CASP: aging or growing up? 14:00 Indika Kahanda, Colorado State University, USA.
    [Show full text]
  • AFP / Biosapiens 2008 SIG Meeting Program Toronto, Canada, July 2008
    AFP / Biosapiens 2008 SIG Meeting Program Toronto, Canada, July 2008 Dear AFP / Biosapiens 2008 attendee, Welcome to the joint Automated Function Prediction / Biosapiens meeting in ISMB 2008. This is the second time AFP and Biosapiens are joining forces to bring you an engaging and stimulating program. As usual, we strive to bring you the latest in cutting-edge research in computational gene and protein function prediction and annotation delivered by leading international researchers. This year we are holding a joint session with the 3D SIG on predicting function from protein structure. “Structure to Function” is a problem that is rapidly moving to the forefront of life science due to the increasing number of unannotated structures coming from structural genomics projects. We are also addressing a need voiced by the community to learn more about computational function prediction. Kimmen Sjölander from the University of California Berkeley will conduct a workshop on function prediction using phylogenomics. Yanay Ofran from Bar Ilan University, Israel and Predrag Radivojac from Indiana University will present a variety of components, tools and techniques for computational function prediction. Those tutorials are intended for researchers and students that are entering the field, and for those in the field to gain more knowledge and expertise. This is a rare opportunity to interact informally with leading experts in the field and enhance your own research. We would like to thank the members of the Program Committee for carefully reviewing the abstracts sent to this meeting. Thanks to the International Society for Computational Biology for hosting this meeting at ISMB 2008. A special thanks to Prof.
    [Show full text]
  • Reports from the Fifth Edition of CAGI: the Critical Assessment of Genome Interpretation
    Received: 16 July 2019 | Accepted: 19 July 2019 DOI: 10.1002/humu.23876 OVERVIEW Reports from the fifth edition of CAGI: The Critical Assessment of Genome Interpretation Gaia Andreoletti1 | Lipika R. Pal2 | John Moult2,3 | Steven E. Brenner1,4 1Department of Plant and Microbial Biology, University of California, Berkeley, California Abstract 2Institute for Bioscience and Biotechnology Interpretation of genomic variation plays an essential role in the analysis of Research, University of Maryland, Rockville, cancer and monogenic disease, and increasingly also in complex trait disease, with Maryland 3Department of Cell Biology and Molecular applications ranging from basic research to clinical decisions. Many computational Genetics, University of Maryland, College impact prediction methods have been developed, yet the field lacks a clear consensus Park, Maryland on their appropriate use and interpretation. The Critical Assessment of Genome 4Center for Computational Biology, University of California, Berkeley, California Interpretation (CAGI, /'kā‐jē/) is a community experiment to objectively assess computational methods for predicting the phenotypic impacts of genomic variation. Correspondence John Moult, Institute for Bioscience and CAGI participants are provided genetic variants and make blind predictions of Biotechnology Research, University of resulting phenotype. Independent assessors evaluate the predictions by comparing Maryland, 9600 Gudelsky Drive, Rockville, MD 20850. with experimental and clinical data. Email: [email protected] CAGI has completed five editions with the goals of establishing the state of art in Steven E. Brenner, Department of Plant and genome interpretation and of encouraging new methodological developments. This Microbial Biology, University of California, special issue (https://onlinelibrary.wiley.com/toc/10981004/2019/40/9) comprises Berkeley, CA 94720.
    [Show full text]
  • The University of Chicago Interrogating the 3D Structure of Primate Genomes a Dissertation Submitted to the Faculty of the Divis
    THE UNIVERSITY OF CHICAGO INTERROGATING THE 3D STRUCTURE OF PRIMATE GENOMES A DISSERTATION SUBMITTED TO THE FACULTY OF THE DIVISION OF THE BIOLOGICAL SCIENCES AND THE PRITZKER SCHOOL OF MEDICINE IN CANDIDACY FOR THE DEGREE OF DOCTOR OF PHILOSOPHY DEPARTMENT OF HUMAN GENETICS BY ITTAI ETHAN ERES CHICAGO, ILLINOIS DECEMBER 2020 Copyright © 2020 by Ittai Ethan Eres All Rights Reserved Freely available under a CC-BY 4.0 International license \If I am not for myself, who will be for me? But if I am only for myself, who am I? And if not now, when?" Rabbi Hillel Table of Contents LIST OF FIGURES . vi LIST OF TABLES . vii ACKNOWLEDGMENTS . viii ABSTRACT . xi 1 INTRODUCTION . 1 1.1 The evolution of gene regulation . 1 1.2 Gene regulatory evolution insights from comparative primate genomics . 3 1.3 The growing importance of the 3D genome . 11 2 REORGANIZATION OF 3D GENOME STRUCTURE MAY CONTRIBUTE TO GENE REGULATORY EVOLUTION IN PRIMATES . 15 2.1 Abstract . 15 2.2 Introduction . 16 2.3 Results . 18 2.3.1 Inter-species differences in 3D genomic interactions . 19 2.3.2 The relationship between inter-species differences in contacts and gene expression . 26 2.3.3 The chromatin and epigenetic context of inter-species differences in 3D genome structure . 30 2.4 Discussion . 33 2.4.1 Contribution of variation in 3D genome structure to expression diver- gence . 36 2.4.2 Functional annotations . 37 2.5 Materials and methods . 38 2.5.1 Ethics statement . 38 2.5.2 Induced pluripotent stem cells (iPSCs) .
    [Show full text]
  • Full Schedule ILANIT 20 -23 of February 2017
    Full Schedule ILANIT 20th-23rd of February 2017 ILANIT / FISEB Federation of all the Israel Societies for Experimental Biology (FISEB) איגוד האגודות הישראליות לביולוגיה ניסויית )אילנית( ILANIT/FISEB is a federation of 31 Israeli societies of experimental biology. ILANIT’s conference is held every three years in Eilat, with attendance by researchers and students. This conference, held in February 2017, is the culmination of the most exciting research performed in Israel in a variety of biological disciplines. Board President Treasurer Secretary Karen B. Avraham Yaron Shav-Tal Eitan Yefenof (TAU) (BIU) (HUJI) Scientific Organizing Committee Conference President Conference Deputy Conference Vice Conference Vice Orna Amster-Choder President President President (HUJI) Angel Porgador (BGU) Maya Schuldiner (WIS) Eli Pikarsky (HUJI) 2 Last updated 16.02.2017 Scientific Advisory Committee Molecular and Structural Biology and Biochemistry Orna Elroy-Stein, Tel Aviv University (Chair) Ora Furman, Hebrew University of Jerusalem Shula Michaeli, Bar Ilan University Neurobiology and Endocrinology Yuval Dor, Hebrew University of Jerusalem (Chair) Yadin Dudai, Weizmann Institute of Science Assaf Rudich, Ben Gurion University of the Negev Genetics, Genomics, Epigenetics, Bioinformatics and Systems Biology Tzachi Pilpel, Weizmann Institute of Science (Chair) Ohad Birk, Ben Gurion University of the Negev Howard Cedar, Hebrew University of Jerusalem Yael Mandel, Technion – Israel Institute of Technology Medicine, Immunology and Cancer Roni Apte,
    [Show full text]
  • Engineering Proteins from Sequence Statistics: Identifying and Understanding the Roles
    Engineering Proteins from Sequence Statistics: Identifying and Understanding the Roles of Conservation and Correlation in Triosephosphate Isomerase DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By Brandon Joseph Sullivan B.S. Graduate Program in the Ohio State Biochemistry Program The Ohio State University 2011 Dissertation Committee: Thomas J. Magliery, Advisor Mark P. Foster William C. Ray Copyright by Brandon Joseph Sullivan 2011 Abstract The structure, function and dynamics of proteins are determined by the physical and chemical properties of their amino acids. Unfortunately, the information encapsulated within a position or between positions is poorly understood. Multiple sequence alignments of protein families allow us to interrogate these questions statistically. Here, we describe the characterization of bioinformatically-designed variants of triosephosphate isomerase (TIM). First, we review the state-of-the-art for engineering proteins with increased stability. We examine two methodologies that benefit from the availability of large numbers - high-throughput screening and sequence statistics of protein families. Second, we have deconvoluted what properties are encoded within a position (conservation) and between positions (correlations) by designing TIMs in which each position is the most common amino acid in the multiple sequence alignment. We found that a consensus TIM from a raw sequence database performs the complex isomerization reaction with weak activity as a dynamic molten globule. Furthermore, we have confirmed that the monomeric species is the catalytically active conformation despite being designed from 600+ dimeric proteins. A second consensus TIM from a curated dataset is well folded, has wild-type activity and is dimeric, but it only differs from the raw consensus TIM at 35 nonconserved positions.
    [Show full text]
  • Article Reference
    Article Key challenges for the creation and maintenance of specialist protein resources HOLLIDAY, Gemma L, et al. Abstract As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, thus increasing the importance of good access to these data that vary from the supplemental material of individual articles, all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about [...] Reference HOLLIDAY, Gemma L, et al. Key challenges for the creation and maintenance of specialist protein resources. Proteins, 2015, vol. 83, no. 6, p. 1005-13 DOI : 10.1002/prot.24803 PMID : 25820941 Available at: http://archive-ouverte.unige.ch/unige:74127 Disclaimer: layout of this document may differ from the published version. 1 / 1 proteins STRUCTURE O FUNCTION O BIOINFORMATICS PERSPECTIVES Key challenges for the creation and maintenance of specialist protein resources Gemma L.
    [Show full text]
  • Automated Function Prediction SIG Program
    Automated Function Prediction SIG Program Time Speaker Title Page 8:30-8:45 Opening Remarks Session 1: Predicting function from structure Chair: Patricia Babbitt Plenary talk: Imperial College, Prediction of Protein 08:45-9:15 Michael J. Sternberg London Function from Structure On Protein Prediction using ConSurf, ConSeq Nir Ben-Tal Tel Aviv University 09:15-09:30 and QuasiMotiFinder Web-Servers High-Throughput University of Rome, Tor Exploration of Gabriele Ausiello 09:30-9:45 Vergata Functional Residues in Protein Structures Structure to Function 09:45-10:00 Dagmar Ringe Brandeis University Prediction Methods in Structural Biology 10:00-10:30 Coffee Session 2: Context-based predictions Chair: Adam Godzik Plenary talk: Baylor College of TBA 10:30-11:00 Olivier Lichtarge Medicine Prediction and European Bioinformatics Contradiction of Protein Daniela Wieser 11:00-11:15 Institute Function using Annotation Rules The Use of Context- Based Functional Association in Daisuke Kihara Purdue University 11:15-11:30 Automated Protein Function Prediction Methods Practical Evaluation of University of Several Automated 11:30-11:45 Kai Wang Washington, Seattle Function Prediction Methods POSOLE: Automated Los Alamos National Katherine Verspoor Ontological Annotation 11:45-12:00 Laboratory for Function Prediction 12:00-1:00 Lunch Time Speaker Title Page Session 3: Analysis of functional sites Chair: Iddo Friedberg Plenary Talk: The FEATURE System 1:00-1:30 Stanford University for Protein Structure Russ Altman Annotation Structure-Based Function University
    [Show full text]
  • Plos Computational Biology Publishes Research of Exceptional
    A peer-reviewed open-access journal published by the Public Library of Science www.ploscompbiol.org PLoS Computational Biology publishes research of exceptional significance that furthers our understanding of living systems at all scales — from molecules and cells, to patient populations and ecosystems — through the application of computational methods. • Run by an international Editorial Board led by Philip E. Bourne (University of California San Diego, USA). • Featuring high-quality Research Articles, invited Reviews, Tutorials, an outstanding Education section, Image credit: Toma Pigli and our popular Editorial “Ten Simple Rules” series. PLoS Computational Biology (2007) • Funder-compliant — Published articles are Topics include (but are not limited to): immediately deposited in PubMed Central and Molecular Biology subsequently cited in PubMed. Protein-Protein Interactions • Provides constructive peer review and rapid Computational Neuroscience publication. Regulatory Networks • Article-level metrics and web tools to facilitate Computational Immunology community discourse through notes, comments, Sequence Analysis and ratings. Protein Structure & Function Prediction • Highlighted in news outlets and blogs from around Population Biology the world. Cancer Genetics Microarray Data Analysis Gene Expression Synthetic Biology PLoS Computational Biology is published by the Public Machine Learning Library of Science (PLoS), a nonprofit organization committed to making the world’s scientific and medical literature a public resource. Everything we publish is freely available online through- out the world, for anyone to read, download, copy, distribute and use (with attribution). Barrier-free, open access, no permissions required. Image credit: Ryan Davey PLoS Computational Biology (2007) PUBLIC LIBRARY of SCIENCE www.plos.org A peer-reviewed open-access journal published by the Public Library of Science www.ploscompbiol.org Editorial Board Philip E.
    [Show full text]
  • Annual Scientific Report 2011 Annual Scientific Report 2011 Designed and Produced by Pickeringhutchins Ltd
    European Bioinformatics Institute EMBL-EBI Annual Scientific Report 2011 Annual Scientific Report 2011 Designed and Produced by PickeringHutchins Ltd www.pickeringhutchins.com EMBL member states: Austria, Croatia, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, United Kingdom. Associate member state: Australia EMBL-EBI is a part of the European Molecular Biology Laboratory (EMBL) EMBL-EBI EMBL-EBI EMBL-EBI EMBL-European Bioinformatics Institute Wellcome Trust Genome Campus, Hinxton Cambridge CB10 1SD United Kingdom Tel. +44 (0)1223 494 444, Fax +44 (0)1223 494 468 www.ebi.ac.uk EMBL Heidelberg Meyerhofstraße 1 69117 Heidelberg Germany Tel. +49 (0)6221 3870, Fax +49 (0)6221 387 8306 www.embl.org [email protected] EMBL Grenoble 6, rue Jules Horowitz, BP181 38042 Grenoble, Cedex 9 France Tel. +33 (0)476 20 7269, Fax +33 (0)476 20 2199 EMBL Hamburg c/o DESY Notkestraße 85 22603 Hamburg Germany Tel. +49 (0)4089 902 110, Fax +49 (0)4089 902 149 EMBL Monterotondo Adriano Buzzati-Traverso Campus Via Ramarini, 32 00015 Monterotondo (Rome) Italy Tel. +39 (0)6900 91402, Fax +39 (0)6900 91406 © 2012 EMBL-European Bioinformatics Institute All texts written by EBI-EMBL Group and Team Leaders. This publication was produced by the EBI’s Outreach and Training Programme. Contents Introduction Foreword 2 Major Achievements 2011 4 Services Rolf Apweiler and Ewan Birney: Protein and nucleotide data 10 Guy Cochrane: The European Nucleotide Archive 14 Paul Flicek:
    [Show full text]