In Systems Physiology
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Semantic Web
SEMANTIC WEB: REVOLUTIONIZING KNOWLEDGE DISCOVERY IN THE LIFE SCIENCES SEMANTIC WEB: REVOLUTIONIZING KNOWLEDGE DISCOVERY IN THE LIFE SCIENCES Edited by Christopher J. O. Baker1 and Kei-Hoi Cheung2 1Knowledge Discovery Department, Institute for Infocomm Research, Singapore, Singapore; 2Center for Medical Informatics, Yale University School of Medicine, New Haven, CT, USA Kluwer Academic Publishers Boston/Dordrecht/London Contents PART I: Database and Literature Integration Semantic web approach to database integration in the life sciences KEI-HOI CHEUNG, ANDREW K. SMITH, KEVIN Y. L. YIP, CHRISTOPHER J. O. BAKER AND MARK B. GERSTEIN Querying Semantic Web Contents: A case study LOIC ROYER, BENEDIKT LINSE, THOMAS WÄCHTER, TIM FURCH, FRANCOIS BRY, AND MICHAEL SCHROEDER Knowledge Acquisition from the Biomedical Literature LYNETTE HIRSCHMAN, WILLIAM HAYES AND ALFONSO VALENCIA PART II: Ontologies in the Life Sciences Biological Ontologies PATRICK LAMBRIX, HE TAN, VAIDA JAKONIENE, AND LENA STRÖMBÄCK Clinical Ontologies YVES LUSSIER AND OLIVIER BODENREIDER Ontology Engineering For Biological Applications vi Revolutionizing knowledge discovery in the life sciences LARISA N. SOLDATOVA AND ROSS D. KING The Evaluation of Ontologies: Toward Improved Semantic Interoperability LEO OBRST, WERNER CEUSTERS, INDERJEET MANI, STEVE RAY AND BARRY SMITH OWL for the Novice JEFF PAN PART III: Ontology Visualization Techniques for Ontology Visualization XIAOSHU WANG AND JONAS ALMEIDA On Vizualization of OWL Ontologies SERGUEI KRIVOV, FERDINANDO VILLA, RICHARD WILLIAMS, AND XINDONG WU PART IV: Ontologies in Action Applying OWL Reasoning to Genomics: A Case Study KATY WOLSTENCROFT, ROBERT STEVENS AND VOLKER HAARSLEV Can Semantic Web Technologies enable Translational Medicine? VIPUL KASHYAP, TONYA HONGSERMEIER AND SAMUEL J. ARONSON Ontology Design for Biomedical Text Mining RENÉ WITTE, THOMAS KAPPLER, AND CHRISTOPHER J. -
Cryptic Inoviruses Revealed As Pervasive in Bacteria and Archaea Across Earth’S Biomes
ARTICLES https://doi.org/10.1038/s41564-019-0510-x Corrected: Author Correction Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth’s biomes Simon Roux 1*, Mart Krupovic 2, Rebecca A. Daly3, Adair L. Borges4, Stephen Nayfach1, Frederik Schulz 1, Allison Sharrar5, Paula B. Matheus Carnevali 5, Jan-Fang Cheng1, Natalia N. Ivanova 1, Joseph Bondy-Denomy4,6, Kelly C. Wrighton3, Tanja Woyke 1, Axel Visel 1, Nikos C. Kyrpides1 and Emiley A. Eloe-Fadrosh 1* Bacteriophages from the Inoviridae family (inoviruses) are characterized by their unique morphology, genome content and infection cycle. One of the most striking features of inoviruses is their ability to establish a chronic infection whereby the viral genome resides within the cell in either an exclusively episomal state or integrated into the host chromosome and virions are continuously released without killing the host. To date, a relatively small number of inovirus isolates have been extensively studied, either for biotechnological applications, such as phage display, or because of their effect on the toxicity of known bacterial pathogens including Vibrio cholerae and Neisseria meningitidis. Here, we show that the current 56 members of the Inoviridae family represent a minute fraction of a highly diverse group of inoviruses. Using a machine learning approach lever- aging a combination of marker gene and genome features, we identified 10,295 inovirus-like sequences from microbial genomes and metagenomes. Collectively, our results call for reclassification of the current Inoviridae family into a viral order including six distinct proposed families associated with nearly all bacterial phyla across virtually every ecosystem. -
Learning Protein Constitutive Motifs from Sequence Data Je´ Roˆ Me Tubiana, Simona Cocco, Re´ Mi Monasson*
TOOLS AND RESOURCES Learning protein constitutive motifs from sequence data Je´ roˆ me Tubiana, Simona Cocco, Re´ mi Monasson* Laboratory of Physics of the Ecole Normale Supe´rieure, CNRS UMR 8023 & PSL Research, Paris, France Abstract Statistical analysis of evolutionary-related protein sequences provides information about their structure, function, and history. We show that Restricted Boltzmann Machines (RBM), designed to learn complex high-dimensional data and their statistical features, can efficiently model protein families from sequence information. We here apply RBM to 20 protein families, and present detailed results for two short protein domains (Kunitz and WW), one long chaperone protein (Hsp70), and synthetic lattice proteins for benchmarking. The features inferred by the RBM are biologically interpretable: they are related to structure (residue-residue tertiary contacts, extended secondary motifs (a-helixes and b-sheets) and intrinsically disordered regions), to function (activity and ligand specificity), or to phylogenetic identity. In addition, we use RBM to design new protein sequences with putative properties by composing and ’turning up’ or ’turning down’ the different modes at will. Our work therefore shows that RBM are versatile and practical tools that can be used to unveil and exploit the genotype–phenotype relationship for protein families. DOI: https://doi.org/10.7554/eLife.39397.001 Introduction In recent years, the sequencing of many organisms’ genomes has led to the collection of a huge number of protein sequences, which are catalogued in databases such as UniProt or PFAM Finn et al., 2014). Sequences that share a common ancestral origin, defining a family (Figure 1A), *For correspondence: are likely to code for proteins with similar functions and structures, providing a unique window into [email protected] the relationship between genotype (sequence content) and phenotype (biological features). -
Methodology for Predicting Semantic Annotations of Protein Sequences by Feature Extraction Derived of Statistical Contact Potentials and Continuous Wavelet Transform
Universidad Nacional de Colombia Sede Manizales Master’s Thesis Methodology for predicting semantic annotations of protein sequences by feature extraction derived of statistical contact potentials and continuous wavelet transform Author: Supervisor: Gustavo Alonso Arango Dr. Cesar German Argoty Castellanos Dominguez A thesis submitted in fulfillment of the requirements for the degree of Master’s on Engineering - Industrial Automation in the Department of Electronic, Electric Engineering and Computation Signal Processing and Recognition Group June 2014 Universidad Nacional de Colombia Sede Manizales Tesis de Maestr´ıa Metodolog´ıapara predecir la anotaci´on sem´antica de prote´ınaspor medio de extracci´on de caracter´ısticas derivadas de potenciales de contacto y transformada wavelet continua Autor: Tutor: Gustavo Alonso Arango Dr. Cesar German Argoty Castellanos Dominguez Tesis presentada en cumplimiento a los requerimientos necesarios para obtener el grado de Maestr´ıaen Ingenier´ıaen Automatizaci´onIndustrial en el Departamento de Ingenier´ıaEl´ectrica,Electr´onicay Computaci´on Grupo de Procesamiento Digital de Senales Enero 2014 UNIVERSIDAD NACIONAL DE COLOMBIA Abstract Faculty of Engineering and Architecture Department of Electronic, Electric Engineering and Computation Master’s on Engineering - Industrial Automation Methodology for predicting semantic annotations of protein sequences by feature extraction derived of statistical contact potentials and continuous wavelet transform by Gustavo Alonso Arango Argoty In this thesis, a method to predict semantic annotations of the proteins from its primary structure is proposed. The main contribution of this thesis lies in the implementation of a novel protein feature representation, which makes use of the pairwise statistical contact potentials describing the protein interactions and geometry at the atomic level. -
DECIPHER: Harnessing Local Sequence Context to Improve Protein Multiple Sequence Alignment Erik S
Wright BMC Bioinformatics (2015) 16:322 DOI 10.1186/s12859-015-0749-z RESEARCH ARTICLE Open Access DECIPHER: harnessing local sequence context to improve protein multiple sequence alignment Erik S. Wright1,2 Abstract Background: Alignment of large and diverse sequence sets is a common task in biological investigations, yet there remains considerable room for improvement in alignment quality. Multiple sequence alignment programs tend to reach maximal accuracy when aligning only a few sequences, and then diminish steadily as more sequences are added. This drop in accuracy can be partly attributed to a build-up of error and ambiguity as more sequences are aligned. Most high-throughput sequence alignment algorithms do not use contextual information under the assumption that sites are independent. This study examines the extent to which local sequence context can be exploited to improve the quality of large multiple sequence alignments. Results: Two predictors based on local sequence context were assessed: (i) single sequence secondary structure predictions, and (ii) modulation of gap costs according to the surrounding residues. The results indicate that context-based predictors have appreciable information content that can be utilized to create more accurate alignments. Furthermore, local context becomes more informative as the number of sequences increases, enabling more accurate protein alignments of large empirical benchmarks. These discoveries became the basis for DECIPHER, a new context-aware program for sequence alignment, which outperformed other programs on largesequencesets. Conclusions: Predicting secondary structure based on local sequence context is an efficient means of breaking the independence assumption in alignment. Since secondary structure is more conserved than primary sequence, it can be leveraged to improve the alignment of distantly related proteins. -
Ontology-Based Methods for Analyzing Life Science Data
Habilitation a` Diriger des Recherches pr´esent´ee par Olivier Dameron Ontology-based methods for analyzing life science data Soutenue publiquement le 11 janvier 2016 devant le jury compos´ede Anita Burgun Professeur, Universit´eRen´eDescartes Paris Examinatrice Marie-Dominique Devignes Charg´eede recherches CNRS, LORIA Nancy Examinatrice Michel Dumontier Associate professor, Stanford University USA Rapporteur Christine Froidevaux Professeur, Universit´eParis Sud Rapporteure Fabien Gandon Directeur de recherches, Inria Sophia-Antipolis Rapporteur Anne Siegel Directrice de recherches CNRS, IRISA Rennes Examinatrice Alexandre Termier Professeur, Universit´ede Rennes 1 Examinateur 2 Contents 1 Introduction 9 1.1 Context ......................................... 10 1.2 Challenges . 11 1.3 Summary of the contributions . 14 1.4 Organization of the manuscript . 18 2 Reasoning based on hierarchies 21 2.1 Principle......................................... 21 2.1.1 RDF for describing data . 21 2.1.2 RDFS for describing types . 24 2.1.3 RDFS entailments . 26 2.1.4 Typical uses of RDFS entailments in life science . 26 2.1.5 Synthesis . 30 2.2 Case study: integrating diseases and pathways . 31 2.2.1 Context . 31 2.2.2 Objective . 32 2.2.3 Linking pathways and diseases using GO, KO and SNOMED-CT . 32 2.2.4 Querying associated diseases and pathways . 33 2.3 Methodology: Web services composition . 39 2.3.1 Context . 39 2.3.2 Objective . 40 2.3.3 Semantic compatibility of services parameters . 40 2.3.4 Algorithm for pairing services parameters . 40 2.4 Application: ontology-based query expansion with GO2PUB . 43 2.4.1 Context . 43 2.4.2 Objective . -
Ploidetect Enables Pan-Cancer Analysis of the Causes and Impacts of Chromosomal Instability
bioRxiv preprint doi: https://doi.org/10.1101/2021.08.06.455329; this version posted August 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Ploidetect enables pan-cancer analysis of the causes and impacts of chromosomal instability Luka Culibrk1,2, Jasleen K. Grewal1,2, Erin D. Pleasance1, Laura Williamson1, Karen Mungall1, Janessa Laskin3, Marco A. Marra1,4, and Steven J.M. Jones1,4, 1Canada’s Michael Smith Genome Sciences Center at BC Cancer, Vancouver, British Columbia, Canada 2Bioinformatics training program, University of British Columbia, Vancouver, British Columbia, Canada 3Department of Medical Oncology, BC Cancer, Vancouver, British Columbia, Canada 4Department of Medical Genetics, Faculty of Medicine, Vancouver, British Columbia, Canada Cancers routinely exhibit chromosomal instability, resulting in tumors mutate, these variants are considerably more difficult the accumulation of changes in the abundance of genomic ma- to detect accurately compared to other types of mutations terial, known as copy number variants (CNVs). Unfortunately, and consequently they may represent an under-explored the detection of these variants in cancer genomes is difficult. We facet of tumor biology. 20 developed Ploidetect, a software package that effectively iden- While small mutations can be determined through base tifies CNVs within whole-genome sequenced tumors. Ploidetect changes embedded within aligned sequence reads, CNVs was more sensitive to CNVs in cancer related genes within ad- are variations in DNA quantity and are typically determined vanced, pre-treated metastatic cancers than other tools, while also segmenting the most contiguously. -
AI and Bioinformatics
AI Magazine Volume 25 Number 1 (2004) (© AAAI) Articles Editorial Introduction AI and Bioinformatics Janice Glasgow, Igor Jurisica, and Burkhard Rost ■ This article is an editorial introduction to the re- modern-day biology is far more complex than search discipline of bioinformatics and to the articles suggested by the simplified sketch presented in this special issue. In particular, we address the issue here. In fact, researchers in life sciences live off of how techniques from AI can be applied to many of the introduction of new concepts; the discov- the open and complex problems of modern-day mol- ecular biology. ery of exceptions; and the addition of details that usually complicate, rather than simplify, his special issue of AI Magazine focuses the overall understanding of the field. on some areas of research in bioinfor- Possibly the most rapidly growing area of re- Tmatics that have benefited from applying cent activity in bioinformatics is the analysis AI techniques. Undoubtedly, bioinformatics is of microarray data. The article by Michael Mol- a truly interdisciplinary field: Although some la, Michael Waddell, David Page, and Jude researchers continuously affect wet labs in life Shavlik (“Using Machine Learning to Design science through collaborations or provision of and Interpret Gene-Expression Microarrays”) tools, others are rooted in the theory depart- introduces some background information and ments of exact sciences (physics, chemistry, or provides a comprehensive description of how engineering) or computer sciences. This wide techniques from machine learning can be used variety creates many different perspectives and to help understand this high-dimensional and terminologies. One result of this Babel of lan- prolific gene-expression data. -
BIOINFORMATICS APPLICATIONS NOTE Pages 380-381
Vol. 14 no. 4 1998 BIOINFORMATICS APPLICATIONS NOTE Pages 380-381 MView: a web-compatible database search or multiple alignment viewer NigelP. Brown, ChristopheLeroy and Chris Sander European Bioinformatics Institute (EMBLĆEBI), Wellcome Genome Campus, CambridgeCB10 1SD, UK Received on December 10, 1997; revised and accepted on January 15, 1998 Abstract may be hyperlinked to the SRS system (Etzold et al., 1996), a text field, a field of scoring information from searches, and Summary: MView is a tool for converting the results of a a field reporting the per cent identity of each sequence with sequence database search into the form of a coloured multiple respect to a preferred sequence in the alignment, usually the alignment of hits stacked against the query. Alternatively, an query in the case of a search. existing multiple alignment can be processed. In either case, Multiple alignments require minimal parsing and are the output is simply HTML, so the result is platform independent and does not require a separate application or subjected only to formatting stages. Search hits are first applet to be loaded. stacked against the ungapped query sequence and require Availability: Free from http://www.sander.ebi.ac.uk/mview/ special processing. Ungapped search (e.g. BLAST) hit subject to copyright restrictions. fragments are assembled into a single string by overlaying Contact: [email protected] them preferentially by score onto a template string, while gapped search (e.g. FASTA) hits have columns corresponding Often when running FASTA (Pearson, 1990) or BLAST to query gaps excised. Consequently, the stacked alignment is (Altschul et al., 1990), it is desired to visualize the database a patchwork of reconstituted sequences that nevertheless is hits stacked against the query sequence. -
Aggregation and Correlation Toolbox for Analyses of Genome Tracks Justin Jee Yale University
University of Massachusetts eM dical School eScholarship@UMMS Program in Bioinformatics and Integrative Biology Program in Bioinformatics and Integrative Biology Publications and Presentations 4-15-2011 ACT: aggregation and correlation toolbox for analyses of genome tracks Justin Jee Yale University Joel Rozowsky Yale University Kevin Y. Yip Yale University See next page for additional authors Follow this and additional works at: http://escholarship.umassmed.edu/bioinformatics_pubs Part of the Bioinformatics Commons, Computational Biology Commons, and the Systems Biology Commons Repository Citation Jee, Justin; Rozowsky, Joel; Yip, Kevin Y.; Lochovsky, Lucas; Bjornson, Robert; Zhong, Guoneng; Zhang, Zhengdong; Fu, Yutao; Wang, Jie; Weng, Zhiping; and Gerstein, Mark B., "ACT: aggregation and correlation toolbox for analyses of genome tracks" (2011). Program in Bioinformatics and Integrative Biology Publications and Presentations. Paper 26. http://escholarship.umassmed.edu/bioinformatics_pubs/26 This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in Program in Bioinformatics and Integrative Biology Publications and Presentations by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ACT: aggregation and correlation toolbox for analyses of genome tracks Authors Justin Jee, Joel Rozowsky, Kevin Y. Yip, Lucas Lochovsky, Robert Bjornson, Guoneng Zhong, Zhengdong Zhang, Yutao Fu, Jie Wang, Zhiping Weng, and Mark B. Gerstein Comments © The Author(s) 2011. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non- Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non- commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. -
Biocreative II.5 Workshop 2009 Special Session on Digital Annotations
BioCreative II.5 Workshop 2009 Special Session on Digital Annotations The purified IRF-4 was also The main role of BRCA2 shown to be capable of binding appears to involve regulating the DNA in a PU.1-dependent manner function of RAD51 in the repair by by electrophoretic mobility shift homologous recombination . analysis. brca2 irf4 We found that cells ex- Moreover, expression of pressing Olig2, Nkx2.2, and NG2 Carma1 induces phosphorylation were enriched among virus- of Bcl10 and activation of the infected, GFP-positive (GFP+) transcription factor NF-kappaB. cells. carma1 BB I O olig2 The region of VHL medi- The Rab5 effector ating interaction with HIF-1 alpha Rabaptin-5 and its isoform C R E A T I V E overlapped with a putative Rabaptin-5delta differ in their macromolecular binding site within ability to interact with the rsmallab5 the crystal structure. GTPase Rab4. vhl Translocation RCC, bearing We show that ERBB2-dependenterbb2 atf1 TFE3 or TFEB gene fusions, are Both ATF-1 homodimers and tfe3 medulloblastoma cell invasion and ATF-1/CREB heterodimers bind to recently recognized entities for prometastatic gene expression can the CRE but not to the related which risk factors have not been be blocked using the ERBB tyrosine phorbol ester response element. identified. kinase inhibitor OSI-774. C r i t i c a l A s s e s s m e n t o f I n f o r m a t i o n E x t r a c t i o n i n B i o l o g y October 7th - 9th, 2009 www.BioCreative.org BioCreative II.5 Workshop 2009 special session | Digital Annotations Auditorium of the Spanish National -
Are You an Invited Speaker? a Bibliometric Analysis of Elite Groups for Scholarly Events in Bioinformatics
Are You an Invited Speaker? A Bibliometric Analysis of Elite Groups for Scholarly Events in Bioinformatics Senator Jeong, Sungin Lee, and Hong-Gee Kim Biomedical Knowledge Engineering Laboratory, Seoul National University, 28–22 YeonGeon Dong, Jongno Gu, Seoul 110–749, Korea. E-mail: {senator, sunginlee, hgkim}@snu.ac.kr Participating in scholarly events (e.g., conferences, work- evaluation, but it would be hard to claim that they have pro- shops, etc.) as an elite-group member such as an orga- vided comprehensive lists of evaluation measurements. This nizing committee chair or member, program committee article aims not to provide such lists but to add to the current chair or member, session chair, invited speaker, or award winner is beneficial to a researcher’s career develop- practices an alternative metric that complements existing per- ment.The objective of this study is to investigate whether formance measures to give a more comprehensive picture of elite-group membership for scholarly events is represen- scholars’ performance. tative of scholars’ prominence, and which elite group is By one definition (Jeong, 2008), a scholarly event is the most prestigious. We collected data about 15 global “a sequentially and spatially organized collection of schol- (excluding regional) bioinformatics scholarly events held in 2007. We sampled (via stratified random sampling) ars’ interactions with the intention of delivering and shar- participants from elite groups in each event. Then, bib- ing knowledge, exchanging research ideas, and performing liometric indicators (total citations and h index) of seven related activities.” As such, scholarly events are communica- elite groups and a non-elite group, consisting of authors tion channels from which our new evaluation tool can draw who submitted at least one paper to an event but were its supporting evidence.