Integrative Genomic Analysis by Interoperation Of

Total Page:16

File Type:pdf, Size:1020Kb

Integrative Genomic Analysis by Interoperation Of BRIEF COMMUNICATIONS format converters, relieving scientists of the burden of identi- Integrative genomic fying and scripting the conversions. The GenomeSpace Recipe Resource is a growing set of high-utility use cases that demon- analysis by interoperation strate how to leverage multiple tools and serve as quick guides to analysis tasks. The website serves as a knowledge base, newsstand of bioinformatics tools and point of contact and help for the community of users and tool developers. in GenomeSpace Initially seeded by a consortium of biology research labs and development teams of six popular bioinformatics tools Kun Qu1,12, Sara Garamszegi2,12, Felix Wu2,12, (Cytoscape1,2, Galaxy3, GenePattern4, Genomica5, the Integrative Helga Thorvaldsdottir2, Ted Liefeld2,3, Marco Ocana2,3, Genomics Viewer (IGV)6 and the UCSC Table Browser7), Diego Borges-Rivera4, Nathalie Pochet2,5, GenomeSpace now connects 20 tools and data resources. Our James T Robinson2,3, Barry Demchak3, Tim Hull3, consortium labs provided biological projects and analytical needs 6,7 8 9 to drive GenomeSpace design and development. For example, we Gil Ben-Artzi , Daniel Blankenberg , Galt P Barber , 8,9 9 9 8 recapitulated the steps and results of published analyses within Brian T Lee , Robert M Kuhn , Anton Nekrutenko , GenomeSpace (Supplementary Figs. 1 and 2), dissecting and 6 3 2,3 2,4,10 Eran Segal , Trey Ideker , Michael Reich , Aviv Regev , visualizing the gene regulatory networks in human cancer stem Howard Y Chang1,11 & Jill P Mesirov2,3 cells (Supplementary Note 1, Supplementary Figs. 2–5). The study required diverse data types, analytical steps and methods Complex biomedical analyses require the use of multiple and multiple data transfers between tools. While originally software tools in concert and remain challenging for much of requiring substantial scripting, this work can now be performed the biomedical research community. We introduce GenomeSpace by non-programming biologists using only the GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative platform and tools within it. community resource that currently supports the streamlined From a user’s perspective (Fig. 1 and Supplementary Fig. 6), interaction of 20 bioinformatics tools and data resources. GenomeSpace has several features that together facilitate integra- To facilitate integrative analysis by non-programmers, Nature America, Inc. All rights reserved. America, Inc. Nature tive analysis with a low barrier to user entry: (i) the collection of 6 it offers a growing set of ‘recipes’, short workflows to guide resident tools spanning a broad range of applications (Table 1); investigators through high-utility analysis tasks. (ii) easy dataset management in a variety of cloud storage types, © 201 alongside data-sharing capabilities (all account holders receive The integrative analysis of diverse data types with multiple soft- an allocation of cloud storage, and GenomeSpace also supports ware tools remains an enormous challenge for many biologists. connections to other cloud accounts (Dropbox, Google Drive, npg There is an ever-growing gap between the need to use various Amazon S3)); (iii) the ability to launch tools and to move data analysis and visualization tools and the complications of getting and analyses between tools, all facilitated by ‘behind-the-scenes’ tools from different sources to work together. Moreover, it is dif- file format converters (each tool retains its native environment ficult—even for experts, but especially for less computationally and presents the same user interface and functionality as when oriented biologists—to keep up with all of the available tools and launched outside of GenomeSpace); and (iv) a lightweight, simple, to identify the right recipes to follow, particularly in the absence unifying web interface. In summary, from the web interface of an accepted ‘laboratory manual’ for analytic protocols. a researcher can launch a tool and simultaneously feed it input Here, we present GenomeSpace, an open-source interoperability data files, move analysis results into other tools as needed through platform and community resource to enable non-programming simple launching operations, and collect additional processed scientists to work easily across data types and analysis methods data within cloud accounts or local storage. (http://www.genomespace.org). GenomeSpace provides a ‘tool We developed the GenomeSpace Recipe Resource (http://www. launch pad’ into which tools can be seamlessly added, and a genomespace.org/recipes) to aid researchers in identifying the ‘data highway’ that handles transfers between tools through steps required to perform a genomic analysis—a challenging task 1Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA. 2The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA. 3Department of Medicine, University of California, San Diego, La Jolla, California, USA. 4Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA. 5Program in Translational NeuroPsychiatric Genomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, USA. 6Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot, Israel. 7Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel. 8Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA. 9UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California, USA. 10Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA. 11Howard Hughes Medical Institute, Stanford University, Stanford, California, USA. 12These authors contributed equally to this work. Correspondence should be addressed to J.P.M. ([email protected]). RECEIVED 15 APRIL 2015; ACCEPTED 4 DECEMBER 2015; PUBLISHED ONLINE 18 JANUarY 2016; DOI:10.1038/NMETH.3732 NATURE METHODS | VOL.13 NO.3 | MARCH 2016 | 245 BRIEF COMMUNICATIONS Tool Current recipes cover diverse genomic analyses as well as basic Dropbox Tool and data Data utilities for using GenomeSpace itself (Supplementary Table 1). connection and source transfer manager We are adding social media vehicles to make recipe collection a Tool crowdsourced, collaborative effort, and we encourage suggestions Google Drive Data for new multitool recipes and ideas to improve existing recipes. source An example from the Recipe Resource illustrates how to “Find Tool User interface subnetworks of differentially expressed genes and identify associ- Amazon S3 Data ated biological functions.” Briefly, given a gene expression dataset, source this recipe identifies network interactions between differentially expressed genes and annotates the biological functions within sub- Cloud-based storage GenomeSpace components Bioinformatics tools networks via the Gene Ontology (GO) (Supplementary Fig. 7). platforms for user data and data resources The example dataset provided with this recipe is gene expression Figure 1 | The GenomeSpace environment for interoperation of data from a study in which granulocyte-macrophage progenitor bioinformatics tools. cells were transformed into leukemia stem cells by the introduction of an oncogene, MLL-AF9 (ref. 11). Applying the recipe identifies processes that are correlated with transformation from a normal even for short analyses. Although preconstructed pipelines can to a leukemic phenotype (Supplementary Fig. 8), such as embody the entire workflow of a study, they may be insufficiently SMAD1-dependent signaling, a process associated with the reg- flexible for exploratory research. We took an alternative approach ulation of hematopoietic differentiation by TGF-β and BMP12. by providing a collection or ‘cookbook’ of recipes: i.e., compre- We also describe a second recipe example, “Identify biological hensive descriptions of cross-tool analysis workflows. Recipes are functions for genes in copy number variation (CNV) regions” generally short, involving two or three tools, but commoditize (Supplementary Note 2 and Supplementary Figs. 9 and 10). important research tasks that investigators can employ as part An important design goal was to facilitate rapid addition of more complex analyses. The notion of our Recipe Resource of diverse tools contributed by the developer community. is modeled after the classical lab guide Molecular Cloning: This provides mutual benefit by extending the capabilities of A Laboratory Manual10, which used a similar approach to democ- GenomeSpace while also giving independent developers’ tools ratize molecular biology three decades ago. simultaneous access to all GenomeSpace-connected tools and Each GenomeSpace recipe contains a motivating biological data sources, circumventing the need to connect to each one problem, a relevant example dataset, detailed recipe steps and one individually. Recent cross-tool interoperability efforts have used possible interpretation of the results. Recipes include screenshot one of several approaches: aggregators host a large number of guides and videos that help walk users through the workflow. command line tools (Galaxy, GenePattern); plug-in architectures Nature America, Inc. All rights reserved. America, Inc. Nature 6 Table 1 | GenomeSpace provides access to a diverse set of bioinformatics tools and resources Tool name Organization Project website © 201 Analysis and visualization tools Cistrome Dana-Farber Cancer Institute
Recommended publications
  • Java Jdk1.4 Software Free Download
    Java jdk1.4 software free download click here to download Downloading and installating Java Development Kit JDK for SVD Downloading and installing JDK for SVD applet and application. The JDK includes tools useful for developing and testing programs written in the links on the top of this page to learn more about it and register for one for free. For current Java releases, please consult the Oracle Software Download page. For information on how to download the JDK for use on Solaris 9 and Solaris 10 with Sun software please see SunSolve Document - Infodoc ref: The Java Development Kit is a collection of tools that developers use to Free Sun Microsystems Windows XP//Vista/Server /7. The Java Development Kit contains the software and tools needed to Free Sun Microsystems Windows XP//Vista/Server /7 0 stars Be the first to review this product. All Versions. out of 1 votes. 5 star. 1; 4 star. Java Development Kit, free and safe download. Java Development Kit latest version: Compile and run Java apps on bit systems. You can check if you are running bit Windows before downloading bit Java for Users should download bit Java software, if they are using bit browser on Go to www.doorway.ru; Click on the Free Java Download button, and start the. Download the latest version of the top software, games, programs and apps Download Java for Windows XP - Best Software & Apps. Filter by: Free . License: buy Download; Platform: Windows. Execute your JavaScript with this free tool. 4. JavaExe, free and safe download. JavaExe latest version: Launch your Java application as a Windows file (if you know how).
    [Show full text]
  • Cytoscape: the Network Visualization Tool for Genomespace Workflows
    F1000Research 2014, 3:151 Last updated: 21 JUL 2021 SOFTWARE TOOL ARTICLE Cytoscape: the network visualization tool for GenomeSpace workflows [version 2; peer review: 3 approved] Barry Demchak1, Tim Hull1, Michael Reich2, Ted Liefeld2, Michael Smoot3, Trey Ideker1, Jill P. Mesirov2 1Department of Medicine, University of California, San Diego, La Jolla, CA, 92093-0688, USA 2Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA , 02142, USA 3Qualcomm, Inc, San Diego, CA , 92121, USA v2 First published: 01 Jul 2014, 3:151 Open Peer Review https://doi.org/10.12688/f1000research.4492.1 Latest published: 26 Aug 2014, 3:151 https://doi.org/10.12688/f1000research.4492.2 Reviewer Status Invited Reviewers Abstract Modern genomic analysis often requires workflows incorporating 1 2 3 multiple best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms version 2 that create data flows between them. One such tool is Cytoscape 3, a (revision) popular application that enables analysis and visualization of graph- 26 Aug 2014 oriented genomic networks. As Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required version 1 special care in creating a seamless user experience and enabling 01 Jul 2014 report report report appropriate data flows. In this paper, we present the design and operation of the Cytoscape GenomeSpace app, which accomplishes 1. Aris Floratos, Columbia University, New this integration, thereby providing critical analysis and visualization functionality for GenomeSpace users. It has been downloaded over York, NY, USA 850 times since the release of its first version in September, 2013.
    [Show full text]
  • Cytoscape: the Network Visualization Tool for Genomespace Workflows
    UC San Diego UC San Diego Previously Published Works Title Cytoscape: the network visualization tool for GenomeSpace workflows. Permalink https://escholarship.org/uc/item/1qr8t9n8 Authors Demchak, Barry Hull, Tim Reich, Michael et al. Publication Date 2014 DOI 10.12688/f1000research.4492.2 Peer reviewed eScholarship.org Powered by the California Digital Library University of California F1000Research 2014, 3:151 Last updated: 15 SEP 2014 SOFTWARE TOOL Cytoscape: the network visualization tool for GenomeSpace workflows [v2; ref status: indexed, http://f1000r.es/47f] Barry Demchak1, Tim Hull1, Michael Reich2, Ted Liefeld2, Michael Smoot3, 1 2 Trey Ideker , Jill P. Mesirov 1Department of Medicine, University of California, San Diego, La Jolla, 92093-0688, USA 2Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, 02142, USA 3Qualcomm, Inc, San Diego, 92121, USA v2 First published: 01 Jul 2014, 3:151 (doi: 10.12688/f1000research.4492.1) Open Peer Review Latest published: 26 Aug 2014, 3:151 (doi: 10.12688/f1000research.4492.2) Invited Referee Responses Abstract Modern genomic analysis often requires workflows incorporating multiple 1 2 3 best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows version 1 between them. One such tool is Cytoscape 3, a popular application that published report report report 01 Jul 2014 enables analysis and visualization of graph-oriented genomic networks. As 1 1 1 Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows.
    [Show full text]
  • Data Visualization by Nils Gehlenborg
    Data Visualization Nils Gehlenborg ([email protected]) Center for Biomedical Informatics / Harvard Medical School Cancer Program / Broad Institute of MIT and Harvard ISMB/ECCB 2011 http://www.biovis.net Flyers at ISCB booth! Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg A good sketch is better than a long speech. Napoleon Bonaparte Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg Minard 1869 Napoleon’s March on Moscow Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 4 I believe when I see it. Unknown Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg Anscombe 1973, The American Statistician Anscombe’s Quartet mean(X) = 9, var(X) = 11, mean(Y) = 7.5, var(Y) = 4.12, cor(X,Y) = 0.816, linear regression line Y = 3 + 0.5*X Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 6 Anscombe 1973, The American Statistician Anscombe’s Quartet Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 7 Exploration: Hypothesis Generation trends gaps outliers clusters - A large data set is given and the goal is to learn something about it. - Visualization is employed to perform pattern detection using the human visual system. - The goal is to generate hypotheses that can be tested with statistical methods or follow-up experiments. Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 8 Visualization Use Cases Presentation Confirmation Exploration Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 9 Definition The use of computer-supported, interactive, visual representations of data to amplify cognition. Stu Card, Jock Mackinlay & Ben Shneiderman Computer-based visualization systems provide visual representations of datasets intended to help people carry out some task more effectively.effectively.
    [Show full text]
  • Modeling Genomic Regulatory Networks with Big Data
    TIGS-1108; No. of Pages 10 Review Modeling genomic regulatory networks with big data Hamid Bolouri Division of Human Biology, Fred Hutchinson Cancer Research Center (FHCRC), 1100 Fairview Avenue North, PO Box 19024, Seattle, WA 98109, USA High-throughput sequencing, large-scale data generation throughput sequencing-based technologies and to compu- projects, and web-based cloud computing are changing tational modeling and analysis. At the same time, GRNs how computational biology is performed, who performs are both complex (i.e., can exhibit hard-to-predict/nonline- it, and what biological insights it can deliver. I review here ar behaviors) and complicated (i.e., they are composed of the latest developments in available data, methods, and large numbers of component parts and interactions). For software, focusing on the modeling and analysis of the this reason, mathematical and computational approaches gene regulatory interactions in cells. Three key findings are essential in GRN research. are: (i) although sophisticated computational resources Cellular behaviors have traditionally been character- are increasingly available to bench biologists, tailored ized as being mediated through highly distinct processes ongoing education is necessary to avoid the erroneous (e.g., DNA replication) and pathways (e.g., the canonical use of these resources. (ii) Current models of the regula- WNT signaling pathway). However, because of widespread tion of gene expression are far too simplistic and need interactions among cellular processes and pathways, the updating. (iii) Integrative computational analysis of large- use of unbiased, genome-wide technologies is essential to scale datasets is becoming a fundamental component of the discovery and characterization of GRNs.
    [Show full text]
  • Integrative Genomic Analysis by Interoperation of Bioinformatics Tools in Genomespace
    Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Qu, Kun et al. “Integrative Genomic Analysis by Interoperation of Bioinformatics Tools in GenomeSpace.” Nature Methods 13.3 (2016): 245–247. As Published http://dx.doi.org/10.1038/nmeth.3732 Publisher Nature Publishing Group Version Author's final manuscript Citable link http://hdl.handle.net/1721.1/105733 Terms of Use Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Nat Methods Manuscript Author . Author manuscript; Manuscript Author available in PMC 2016 July 18. Published in final edited form as: Nat Methods. 2016 March ; 13(3): 245–247. doi:10.1038/nmeth.3732. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace Kun Qu#1, Sara Garamszegi#2, Felix Wu#2, Helga Thorvaldsdottir2, Ted Liefeld2,3, Marco Ocana2,3, Diego Borges-Rivera4, Nathalie Pochet2,5, James T. Robinson2,3, Barry Demchak3, Tim Hull3, Gil Ben-Artzi6,7, Daniel Blankenberg8, Galt P. Barber9, Brian T. Lee9, Robert M. Kuhn9, Anton Nekrutenko8, Eran Segal6, Trey Ideker3, Michael Reich2,3, Aviv Regev2,4,10, Howard Y. Chang1,11, and Jill P. Mesirov2,3 1Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA, USA 2The
    [Show full text]
  • Cytoscape: the Network Visualization Tool for Genomespace Workflows[V2
    F1000Research 2014, 3:151 Last updated: 15 SEP 2014 SOFTWARE TOOL Cytoscape: the network visualization tool for GenomeSpace workflows [v2; ref status: indexed, http://f1000r.es/47f] Barry Demchak1, Tim Hull1, Michael Reich2, Ted Liefeld2, Michael Smoot3, 1 2 Trey Ideker , Jill P. Mesirov 1Department of Medicine, University of California, San Diego, La Jolla, 92093-0688, USA 2Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, 02142, USA 3Qualcomm, Inc, San Diego, 92121, USA v2 First published: 01 Jul 2014, 3:151 (doi: 10.12688/f1000research.4492.1) Open Peer Review Latest published: 26 Aug 2014, 3:151 (doi: 10.12688/f1000research.4492.2) Invited Referee Responses Abstract Modern genomic analysis often requires workflows incorporating multiple 1 2 3 best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows version 1 between them. One such tool is Cytoscape 3, a popular application that published report report report 01 Jul 2014 enables analysis and visualization of graph-oriented genomic networks. As 1 1 1 Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows. In this paper, we present the design and version 2 operation of the Cytoscape GenomeSpace app, which accomplishes this published 26 Aug 2014 integration, thereby providing critical analysis and visualization functionality for GenomeSpace
    [Show full text]
  • Nathalie Pochet, Diego Borges-Rivera, Thorin Tabor, Helga Thorvaldsdóttir, Aviv Regev, Jill P
    : An Environment for Frictionless Bioinformatics Michael Reich, Ted Liefeld, Marco Ocana, Dongkeun Jang, Jon Bistline, James Robinson, Peter Carr, Barbara Hill www.genomespace.org Nathalie Pochet, Diego Borges-Rivera, Thorin Tabor, Helga Thorvaldsdóttir, Aviv Regev, Jill P. Mesirov Features Interface Recipes Using Galaxy with GenomeSpace GenomeSpace Enabled Tools Customize Organize and GenomeSpace makes it easy for researchers to use the Easily manage your display create groups of A collection of "recipes" provides quick guides to Galaxy users can send data easily between Galaxy and your files and One-click according to users (e.g. your Get help with Galaxy (Penn State University) tools they already know to perform analyses and to find directories and launching of the types of project team) the analysis accomplishing tasks using the GenomeSpace tools: their GenomeSpace cloud storage: Scientific workflow and data and analysis preview files analysis tools analyses you and add your tools and platform providing a large number of other tools that can help them extend their research into resident in the on your wish to local tools to GenomeSpace Importing Data from GenomeSpace to Galaxy sequence and genome analysis tools. cloud. datasets. perform. GenomeSpace. itself. new areas. Registration is free and includes 20GB of 1 2 3 4 5 cloud based storage. Preprocess and quality check RNA-Seq data (UCSD) 4 Cytoscape GenomeSpace features include: 6 Visualize and analyze molecular interaction Manage your Identify and annotate coding variants from Data is account igv networks and biological pathways whole exome sequencing (WES) data 2 Register or log in to your imported Seamless transfer of data between tools information.
    [Show full text]
  • Visualizing Genome and Systems Biology: Technologies, Tools, Implementation Techniques and Trends, Past, Present and Future Georgios A
    Pavlopoulos et al. GigaScience (2015) 4:38 DOI 10.1186/s13742-015-0077-2 REVIEW Open Access Visualizing genome and systems biology: technologies, tools, implementation techniques and trends, past, present and future Georgios A. Pavlopoulos1*, Dimitris Malliarakis2, Nikolas Papanikolaou1, Theodosis Theodosiou1, Anton J. Enright3 and Ioannis Iliopoulos1* Abstract “Α picture is worth a thousand words.” This widely used adage sums up in a few words the notion that a successful visual representation of a concept should enable easy and rapid absorption of large amounts of information. Although, in general, the notion of capturing complex ideas using images is very appealing, would 1000 words be enough to describe the unknown in a research field such as the life sciences? Life sciences is one of the biggest generators of enormous datasets, mainly as a result of recent and rapid technological advances; their complexity can make these datasets incomprehensible without effective visualization methods. Here we discuss the past, present and future of genomic and systems biology visualization. We briefly comment on many visualization and analysis tools and the purposes that they serve. We focus on the latest libraries and programming languages that enable more effective, efficient and faster approaches for visualizing biological concepts, and also comment on the future human-computer interaction trends that would enable for enhancing visualization further. Keywords: Biological data visualization, Network biology, Genomics, Systems biology, Multivariate analysis Review for de novo assembly of a new genome to acceptable Introduction coverage is still high, probably at least $40,000, we can In the current ‘big data’ era [1], the magnitude of data now resequence a human genome for $1000 and can explosion in life science research is undeniable.
    [Show full text]
  • Cytoscape3 6 0Manual/Pdf
    Cytoscape User Manual Release 3.6.0 The Cytoscape Consortium Nov 16, 2017 Contents 1 Introduction 3 2 Launching Cytoscape 5 2.1 System Requirements..........................................5 2.2 Getting Started..............................................5 3 Command Line Arguments 9 4 Quick Tour of Cytoscape 11 4.1 Starter Panel............................................... 11 4.2 Basic Features.............................................. 12 4.3 The Menus................................................ 14 4.4 Network Management.......................................... 21 4.5 The View Navigator........................................... 24 5 Creating Networks 25 5.1 Import Fixed-Format Network Files................................... 25 5.2 Import Networks from Unformatted Table Files............................. 26 5.3 Import Networks from Public Databases................................ 31 5.4 Create a New Network or Edit One Manually.............................. 37 6 Nested Networks 43 6.1 Creating Nested Networks........................................ 43 6.2 Visualization of Nested Networks.................................... 43 7 Supported Network File Formats 45 7.1 SIF Format................................................ 45 7.2 NNF................................................... 47 7.3 GML Format............................................... 51 7.4 XGMML Format............................................. 51 7.5 SBML (Systems Biology Markup Language) Format.......................... 52 7.6 BioPAX (Biological PAthways
    [Show full text]
  • Genomespace Workshop, FGED 15, Seattle
    Outline • Introduction to GenomeSpace • GenomeSpace Tools and Recipes • GenomeSpace User Interface • Integrative analysis exercise • Other GenomeSpace Tools • GenomeSpace development • Q and A The vision: Integrave Translaonal Genomics GenePattern Cytoscape IGV/UCSC Genomica CMAP iii 1 Differentially Expressed Genes Compendium Load compendium Arrests Show module G2/M 3 map Show i 2 GSEA test network enrichment Network 5 Show Extract 4 module Expand +1 Chromosome Idea (include ii atcgcgtttattcgataagg! neighbors) atcgcgttttttcgataagg! 6 Add ! Transcription Learn p53 Factor track site/score on Alterations Added to GenePattern from UCSC promoter Pathway iv activation v 8 7 Test for Looks close similarity of p53 and gene Expression Conclusion to p53 site location vi Online community to share diverse computational tools Seed Tools Driving Biological Projects Cytoscape Galaxy lincRNAs GenePaern Genomica Cancer stem cells IGV Paent Straficaon UCSC Browser Outreach to new DBPs Outreach to new tools www.genomespace.org GenomeSpace: a connection layer between integrative analysis tools • Support for all types of resource: Web- based, desktop, etc. • Automac conversion of data formats between tools • Easy access to data from any locaon • Ease of entry into the environment GenomeSpace Components GenePattern Galaxy Cytoscape Integrative Genomics Viewer GenomeSpace-Enabled Tools Authentication and Authorization Analysis and Data Manager Tool Manager Genome Space GenomeSpace Server Project Data GenomeSpace Server Register www.genomespace.org Register
    [Show full text]
  • 11 April 2016
    Bioinformatics discovery of new disease drivers from genomic, proteomics, and metabolomics data Andrew P. Hodges, Ph.D. [email protected] SBP 3 May 2016 Viper-KR* • Viper-KR applied to human tissue (obesity / adipose tissue) and cell culture transcriptome. • Data from the Translational Research Institute (TRI) for Metabolism and Diabetes, SBP-Florida Hospital (PI Steven Smith) • The results are promising: multiple computationally predicted gene targets were validated experimentally (7 out of 12) Viper - Virtual Inference of Protein-activity by Enriched Regulon analysis (Mariano J. Alvarez, Federico Giorgi, Andrea Califano). KR – K-nearest neighbor and Random Forest. 5/3/2016 BIOINFORMATIC CORE TOOLS 2 Overview • Pt1 - Introduction/Background • Pt2 - Resources – Public and proprietary data sources – Selected software and underlying algorithms • Pt3 – Example Service: Master regulator/driver predictions – Example models from in vitro and clinical studies – New gene/protein drivers for follow-up confirmation – Integration against knowledge repositories/other resources Pt1 - Introduction Bioinformatics • Major driver in current cancer research • Oftentimes an afterthought • *Should be the driver/guide for your studies! • Quest for drivers/controllers of disease – Not cancer-specific… conceptually any disease – Molecular signatures guiding disease processes and drug responses – Limited by type(s) of data & available samples NCI projects (2015) Major question today: What are the drivers of pancreatic cancer progression in gemcitabine-treated
    [Show full text]