ENCODE: Understanding the Genome Michael Snyder May 8, 2013
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Semantic Web
SEMANTIC WEB: REVOLUTIONIZING KNOWLEDGE DISCOVERY IN THE LIFE SCIENCES SEMANTIC WEB: REVOLUTIONIZING KNOWLEDGE DISCOVERY IN THE LIFE SCIENCES Edited by Christopher J. O. Baker1 and Kei-Hoi Cheung2 1Knowledge Discovery Department, Institute for Infocomm Research, Singapore, Singapore; 2Center for Medical Informatics, Yale University School of Medicine, New Haven, CT, USA Kluwer Academic Publishers Boston/Dordrecht/London Contents PART I: Database and Literature Integration Semantic web approach to database integration in the life sciences KEI-HOI CHEUNG, ANDREW K. SMITH, KEVIN Y. L. YIP, CHRISTOPHER J. O. BAKER AND MARK B. GERSTEIN Querying Semantic Web Contents: A case study LOIC ROYER, BENEDIKT LINSE, THOMAS WÄCHTER, TIM FURCH, FRANCOIS BRY, AND MICHAEL SCHROEDER Knowledge Acquisition from the Biomedical Literature LYNETTE HIRSCHMAN, WILLIAM HAYES AND ALFONSO VALENCIA PART II: Ontologies in the Life Sciences Biological Ontologies PATRICK LAMBRIX, HE TAN, VAIDA JAKONIENE, AND LENA STRÖMBÄCK Clinical Ontologies YVES LUSSIER AND OLIVIER BODENREIDER Ontology Engineering For Biological Applications vi Revolutionizing knowledge discovery in the life sciences LARISA N. SOLDATOVA AND ROSS D. KING The Evaluation of Ontologies: Toward Improved Semantic Interoperability LEO OBRST, WERNER CEUSTERS, INDERJEET MANI, STEVE RAY AND BARRY SMITH OWL for the Novice JEFF PAN PART III: Ontology Visualization Techniques for Ontology Visualization XIAOSHU WANG AND JONAS ALMEIDA On Vizualization of OWL Ontologies SERGUEI KRIVOV, FERDINANDO VILLA, RICHARD WILLIAMS, AND XINDONG WU PART IV: Ontologies in Action Applying OWL Reasoning to Genomics: A Case Study KATY WOLSTENCROFT, ROBERT STEVENS AND VOLKER HAARSLEV Can Semantic Web Technologies enable Translational Medicine? VIPUL KASHYAP, TONYA HONGSERMEIER AND SAMUEL J. ARONSON Ontology Design for Biomedical Text Mining RENÉ WITTE, THOMAS KAPPLER, AND CHRISTOPHER J. -
The for Report 07-08
THE CENTER FOR INTEGRATIVE GENOMICS REPORT 07-08 www.unil.ch/cig Table of Contents INTRODUCTION 2 The CIG at a glance 2 The CIG Scientific Advisory Committee 3 Message from the Director 4 RESEARCH 6 Richard Benton Chemosensory perception in Drosophila: from genes to behaviour 8 Béatrice Desvergne Networking activity of PPARs during development and in adult metabolic homeostasis 10 Christian Fankhauser The effects of light on plant growth and development 12 Paul Franken Genetics and energetics of sleep homeostasis and circadian rhythms 14 Nouria Hernandez Mechanisms of basal and regulated RNA polymerase II and III transcription of ncRNA in mammalian cells 16 Winship Herr Regulation of cell proliferation 18 Henrik Kaessmann Mammalian evolutionary genomics 20 Sophie Martin Molecular mechanisms of cell polarization 22 Liliane Michalik Transcriptional control of tissue repair and angiogenesis 24 Alexandre Reymond Genome structure and expression 26 Andrzej Stasiak Functional transitions of DNA structure 28 Mehdi Tafti Genetics of sleep and the sleep EEG 30 Bernard Thorens Molecular and physiological analysis of energy homeostasis in health and disease 32 Walter Wahli The multifaceted roles of PPARs 34 Other groups at the Génopode 37 CORE FACILITIES 40 Lausanne DNA Array Facility (DAFL) 42 Protein Analysis Facility (PAF) 44 Core facilities associated with the CIG 46 EDUCATION 48 Courses and lectures given by CIG members 50 Doing a PhD at the CIG 52 Seminars and symposia 54 The CIG annual retreat 62 The CIG and the public 63 Artist in residence at the CIG 63 PEOPLE 64 1 Introduction The Center for IntegratiVE Genomics (CIG) at A glance The Center for Integrative Genomics (CIG) is the newest depart- ment of the Faculty of Biology and Medicine of the University of Lausanne (UNIL). -
PREDICTD: Parallel Epigenomics Data Imputation with Cloud-Based Tensor Decomposition
bioRxiv preprint doi: https://doi.org/10.1101/123927; this version posted April 4, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. PREDICTD: PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition Timothy J. Durham Maxwell W. Libbrecht Department of Genome Sciences Department of Genome Sciences University of Washington University of Washington J. Jeffry Howbert Jeff Bilmes Department of Genome Sciences Department of Electrical Engineering University of Washington University of Washington William Stafford Noble Department of Genome Sciences Department of Computer Science and Engineering University of Washington April 4, 2017 Abstract The Encyclopedia of DNA Elements (ENCODE) and the Roadmap Epigenomics Project have produced thousands of data sets mapping the epigenome in hundreds of cell types. How- ever, the number of cell types remains too great to comprehensively map given current time and financial constraints. We present a method, PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition (PREDICTD), to address this issue by computationally im- puting missing experiments in collections of epigenomics experiments. PREDICTD leverages an intuitive and natural model called \tensor decomposition" to impute many experiments si- multaneously. Compared with the current state-of-the-art method, ChromImpute, PREDICTD produces lower overall mean squared error, and combining methods yields further improvement. We show that PREDICTD data can be used to investigate enhancer biology at non-coding human accelerated regions. PREDICTD provides reference imputed data sets and open-source software for investigating new cell types, and demonstrates the utility of tensor decomposition and cloud computing, two technologies increasingly applicable in bioinformatics. -
Aggregation and Correlation Toolbox for Analyses of Genome Tracks Justin Jee Yale University
University of Massachusetts eM dical School eScholarship@UMMS Program in Bioinformatics and Integrative Biology Program in Bioinformatics and Integrative Biology Publications and Presentations 4-15-2011 ACT: aggregation and correlation toolbox for analyses of genome tracks Justin Jee Yale University Joel Rozowsky Yale University Kevin Y. Yip Yale University See next page for additional authors Follow this and additional works at: http://escholarship.umassmed.edu/bioinformatics_pubs Part of the Bioinformatics Commons, Computational Biology Commons, and the Systems Biology Commons Repository Citation Jee, Justin; Rozowsky, Joel; Yip, Kevin Y.; Lochovsky, Lucas; Bjornson, Robert; Zhong, Guoneng; Zhang, Zhengdong; Fu, Yutao; Wang, Jie; Weng, Zhiping; and Gerstein, Mark B., "ACT: aggregation and correlation toolbox for analyses of genome tracks" (2011). Program in Bioinformatics and Integrative Biology Publications and Presentations. Paper 26. http://escholarship.umassmed.edu/bioinformatics_pubs/26 This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in Program in Bioinformatics and Integrative Biology Publications and Presentations by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. ACT: aggregation and correlation toolbox for analyses of genome tracks Authors Justin Jee, Joel Rozowsky, Kevin Y. Yip, Lucas Lochovsky, Robert Bjornson, Guoneng Zhong, Zhengdong Zhang, Yutao Fu, Jie Wang, Zhiping Weng, and Mark B. Gerstein Comments © The Author(s) 2011. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non- Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non- commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. -
Biocreative II.5 Workshop 2009 Special Session on Digital Annotations
BioCreative II.5 Workshop 2009 Special Session on Digital Annotations The purified IRF-4 was also The main role of BRCA2 shown to be capable of binding appears to involve regulating the DNA in a PU.1-dependent manner function of RAD51 in the repair by by electrophoretic mobility shift homologous recombination . analysis. brca2 irf4 We found that cells ex- Moreover, expression of pressing Olig2, Nkx2.2, and NG2 Carma1 induces phosphorylation were enriched among virus- of Bcl10 and activation of the infected, GFP-positive (GFP+) transcription factor NF-kappaB. cells. carma1 BB I O olig2 The region of VHL medi- The Rab5 effector ating interaction with HIF-1 alpha Rabaptin-5 and its isoform C R E A T I V E overlapped with a putative Rabaptin-5delta differ in their macromolecular binding site within ability to interact with the rsmallab5 the crystal structure. GTPase Rab4. vhl Translocation RCC, bearing We show that ERBB2-dependenterbb2 atf1 TFE3 or TFEB gene fusions, are Both ATF-1 homodimers and tfe3 medulloblastoma cell invasion and ATF-1/CREB heterodimers bind to recently recognized entities for prometastatic gene expression can the CRE but not to the related which risk factors have not been be blocked using the ERBB tyrosine phorbol ester response element. identified. kinase inhibitor OSI-774. C r i t i c a l A s s e s s m e n t o f I n f o r m a t i o n E x t r a c t i o n i n B i o l o g y October 7th - 9th, 2009 www.BioCreative.org BioCreative II.5 Workshop 2009 special session | Digital Annotations Auditorium of the Spanish National -
Manolis Kellis Piotr Indyk
6.095 / 6.895 Computational Biology: Genomes, Networks, Evolution Manolis Kellis Rapid database search Courtsey of CCRNP, The National Cancer Institute. Piotr Indyk Protein interaction network Courtesy of GTL Center for Molecular and Cellular Systems. Genome duplication Courtesy of Talking Glossary of Genetics. Administrivia • Course information – Lecturers: Manolis Kellis and Piotr Indyk • Grading: Part. Problem sets 50% Final Project 25% Midterm 20% 5% • 5 problem sets: – Each problem set: covers 4 lectures, contains 4 problems. – Algorithmic problems and programming assignments – Graduate version includes 5th problem on current research •Exams – In-class midterm, no final exam • Collaboration policy – Collaboration allowed, but you must: • Work independently on each problem before discussing it • Write solutions on your own • Acknowledge sources and collaborators. No outsourcing. Goals for the term • Introduction to computational biology – Fundamental problems in computational biology – Algorithmic/machine learning techniques for data analysis – Research directions for active participation in the field • Ability to tackle research – Problem set questions: algorithmic rigorous thinking – Programming assignments: hands-on experience w/ real datasets • Final project: – Research initiative to propose an innovative project – Ability to carry out project’s goals, produce deliverables – Write-up goals, approach, and findings in conference format – Present your project to your peers in conference setting Course outline • Organization – Duality: -
Tporthmm : Predicting the Substrate Class Of
TPORTHMM : PREDICTING THE SUBSTRATE CLASS OF TRANSMEMBRANE TRANSPORT PROTEINS USING PROFILE HIDDEN MARKOV MODELS Shiva Shamloo A thesis in The Department of Computer Science Presented in Partial Fulfillment of the Requirements For the Degree of Master of Computer Science Concordia University Montréal, Québec, Canada December 2020 © Shiva Shamloo, 2020 Concordia University School of Graduate Studies This is to certify that the thesis prepared By: Shiva Shamloo Entitled: TportHMM : Predicting the substrate class of transmembrane transport proteins using profile Hidden Markov Models and submitted in partial fulfillment of the requirements for the degree of Master of Computer Science complies with the regulations of this University and meets the accepted standards with respect to originality and quality. Signed by the final examining commitee: Examiner Dr. Sabine Bergler Examiner Dr. Andrew Delong Supervisor Dr. Gregory Butler Approved Dr. Lata Narayanan, Chair Department of Computer Science and Software Engineering 20 Dean Dr. Mourad Debbabi Faculty of Engineering and Computer Science Abstract TportHMM : Predicting the substrate class of transmembrane transport proteins using profile Hidden Markov Models Shiva Shamloo Transporters make up a large proportion of proteins in a cell, and play important roles in metabolism, regulation, and signal transduction by mediating movement of compounds across membranes but they are among the least characterized proteins due to their hydropho- bic surfaces and lack of conformational stability. There is a need for tools that predict the substrates which are transported at the level of substrate class and the level of specific substrate. This work develops a predictor, TportHMM, using profile Hidden Markov Model (HMM) and Multiple Sequence Alignment (MSA). -
Prof. Manolis Kellis April 15, 2008
Chromosomes inside the cell Introduction to Algorithms 6.046J/18.401J • Eukaryote cell LECTURE 18 • Prokaryote Computational Biology cell • Bio intro: Regulatory Motifs • Combinatorial motif discovery - Median string finding • Probabilistic motif discovery - Expectation maximization • Comparative genomics Prof. Manolis Kellis April 15, 2008 DNA packaging DNA: The double helix • Why packaging • The most noble molecule of our time – DNA is very long – Cell is very small • Compression – Chromosome is 50,000 times shorter than extended DNA • Using the DNA – Before a piece of DNA is used for anything, this compact structure must open locally ATATTGAATTTTCAAAAATTCTTACTTTTTTTTTGGATGGACGCAAAGAAGTTTAATAATCATATTACATGGCATTACCACCATATA ATATTGAATTTTCAAAAATTCTTACTTTTTTTTTGGATGGACGCAAAGAAGTTTAATAATCATATTACATGGCATTACCACCATATA ATCCATATCTAATCTTACTTATATGTTGTGGAAATGTAAAGAGCCCCATTATCTTAGCCTAAAAAAACCTTCTCTTTGGAACTTTC ATCCATATCTAATCTTACTTATATGTTGTGGAAATGTAAAGAGCCCCATTATCTTAGCCTAAAAAAACCTTCTCTTTGGAACTTTC AATACGCTTAACTGCTCATTGCTATATTGAAGTACGGATTAGAAGCCGCCGAGCGGGCGACAGCCCTCCGACGGAAGACTCTCCTC AATACGCTTAACTGCTCATTGCTATATTGAAGTACGGATTAGAAGCCGCCGAGCGGGCGACAGCCCTCCGACGGAAGACTCTCCTC GCGTCCTCGTCTTCACCGGTCGCGTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAATAAAGATTCTACAATACT GCGTCCTCGTCTTCACCGGTCGCGTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAATAAAGATTCTACAATACT TTTTATGGTTATGAAGAGGAAAAATTGGCAGTAACCTGGCCCCACAAACCTTCAAATTAACGAATCAAATTAACAACCATAGGATG TTTTATGGTTATGAAGAGGAAAAATTGGCAGTAACCTGGCCCCACAAACCTTCAAATTAACGAATCAAATTAACAACCATAGGATG AATGCGATTAGTTTTTTAGCCTTATTTCTGGGGTAATTAATCAGCGAAGCGATGATTTTTGATCTATTAACAGATATATAAATGGAA -
MPGM: Scalable and Accurate Multiple Network Alignment
MPGM: Scalable and Accurate Multiple Network Alignment Ehsan Kazemi1 and Matthias Grossglauser2 1Yale Institute for Network Science, Yale University 2School of Computer and Communication Sciences, EPFL Abstract Protein-protein interaction (PPI) network alignment is a canonical operation to transfer biological knowledge among species. The alignment of PPI-networks has many applica- tions, such as the prediction of protein function, detection of conserved network motifs, and the reconstruction of species’ phylogenetic relationships. A good multiple-network align- ment (MNA), by considering the data related to several species, provides a deep understand- ing of biological networks and system-level cellular processes. With the massive amounts of available PPI data and the increasing number of known PPI networks, the problem of MNA is gaining more attention in the systems-biology studies. In this paper, we introduce a new scalable and accurate algorithm, called MPGM, for aligning multiple networks. The MPGM algorithm has two main steps: (i) SEEDGENERA- TION and (ii) MULTIPLEPERCOLATION. In the first step, to generate an initial set of seed tuples, the SEEDGENERATION algorithm uses only protein sequence similarities. In the second step, to align remaining unmatched nodes, the MULTIPLEPERCOLATION algorithm uses network structures and the seed tuples generated from the first step. We show that, with respect to different evaluation criteria, MPGM outperforms the other state-of-the-art algo- rithms. In addition, we guarantee the performance of MPGM under certain classes of net- work models. We introduce a sampling-based stochastic model for generating k correlated networks. We prove that for this model if a sufficient number of seed tuples are available, the MULTIPLEPERCOLATION algorithm correctly aligns almost all the nodes. -
ENCODE Consortium Meeting
ENCODE Consortium Meeting June 17-19, 2008 Hilton Washington DC/Rockville Executive Meeting Center Rockville, Maryland PARTICIPANTS Bradley Bernstein Piero Carninci, Ph.D. Molecular Pathology Unit Leader Massachusetts General Hospital Functional Genomics Technology Team and 149 13th Street Omics Resource Development Unit Charlestown, MA 02129 Deputy Project Director (617) 726-6906 LSA Technology Development Group (617) 726-5684 Fax Omics Science Center [email protected] RIKEN Yokohama Institute 1-7-22 Suehiro-cho, Tsurumi-ku Ewan Birney Yokohama 230-0045 Joint Team Leader Japan Panda Group Nucleotides +81-(0)901-709-2277 Panda Coordination and Outreach [email protected] Panda Metabolism European Molecular Biology Laboratory Philip Cayting European Bioinformatics Institute Gerstein Laboratory Hinxton Outstation Department of Molecular Biophysics and Wellcome Trust Genome Campus Biochemistry Hinxton, Cambridge CB10 1SD Yale University United Kingdom P.O. Box 208114 +44-(0)1223-494 444, ext. 4420 New Haven, CT 06520-8114 +44-(0)1223-494 494 Fax (203) 432-6337 [email protected] [email protected] Michael Brent, Ph.D. Howard Y. Chang, M.D., Ph.D. Professor Assistant Professor Center for Genome Sciences Stanford University Washington University Center for Clinical Sciences Research, Campus Box 8510 Room 2155C 4444 Forest Park 269 Campus Drive Saint Louis, MO 63108 Stanford, CA 94305 (314) 286-0210 (650) 736-0306 [email protected] [email protected] James Bentley Brown Mike Cherry, Ph.D. Graduate Student Researcher Associate Professor Graduate Program in Applied Science and Department of Genetics Technology Stanford University Bickel Group 300 Pasteur Drive University of California, Berkeley Stanford, CA 94305-5120 Room 2 (650) 723-7541 1246 Hearst Avenue [email protected] Berkeley, CA 94702 (510) 703-4706 [email protected] Francis S. -
You Are Cordially Invited to a Talk in the Edmond J. Safra Center for Bioinformatics Distinguished Speaker Series
You are cordially invited to a talk in the Edmond J. Safra Center for Bioinformatics Distinguished Speaker Series. The speaker is Prof. Alfonso Valencia, Spanish National Cancer Research Centre, CNIO. Title: "A mESC epigenetics network that combines chromatin localization data and evolutionary information" Time: Wednesday, November 11 2015, at 11:00 (refreshments from 10:50) Place: Green building seminar room, ground floor, Biotechnology Department. Host: Prof. Ron Shamir, [email protected], School of Computer Science Abstract: The description of biological systems in terms of networks is by now a well- accepted paradigm. Networks offer the possibility of combining complex information and the system to analyse the properties of individual components in relation to the ones of their interaction partners. In this context, we have analysed the relations between the known components of mouse Embryonic Stem Cells (mESC) epigenome at two orthogonal levels of information, i.e., co-localization in the genome and concerted evolution (co-evolution). Public repositories contain results of ChIP-Seq experiments for more than 60 Chromatin Related Proteins (CRPs), 14 for Histone modifications, and three different types of DNA methylation, including 5-Hydroxymethylcytosine (5hmc). All this information can be summarized in a network in which the nodes are the CRPs, histone marks and DNA modifications and the arcs connect components that significantly co-localize in the genome. In this network co-localization preferences are specific of chromatin states, such as promoters and enhancers. A second network was build by connecting proteins that are part of the mESC epigenetic regulatory system and show signals of co-evolution (concerted evolution of the CRPs protein families). -
News Archive Table of Contents
News Archive Table of Contents The Institute in the News 2 2020 2 2019 3 2018 5 2017 6 2016 6 2015 7 2014 10 2013 12 2012 12 2011 13 2010 14 2009 14 2008 15 2007 15 2006 15 2005 15 2004 16 2003 16 2001 16 2000 17 1997 17 1996 17 David Haussler in the News 18 News Archive: The UC Santa Cruz Genomics Institute in the News - Beginnings to the present | Page 1 of 32 Jim Kent in the News 20 UCSC Genome Browser in the News 21 UCSC Cancer Research In The News 23 Evolutionary and Conservation Genomics in the News 25 Video Library 30 2020 30 2019 30 2018 30 2015 31 2014 31 2013 32 2012 32 2010 32 The Institute in the News Research and accomplishments of the UC Santa Cruz Genomics Institute and its faculty, seen through the eyes of journalists. 2020 Slugs lauded as Hometown Heroes. Marc DesJardins. December 2, 2020. UCSC. List of most highly cited researchers features 19 UCSC scientists and engineers. Tim Stephens. November 18, 2020. UCSC. ARCS Foundation scholarships support UCSC graduate students. Tim Stephens. September 14, 2020. UCSC. UCSC genomics scientist wins fellowship to clarify genetic risk for sudden heart failure. September 24, 2020. NIH. News Archive: The UC Santa Cruz Genomics Institute in the News - Beginnings to the present | Page 2 of 32 The (near) complete sequence of a human genome. Adam Phillippy. September 22, 2020. Genome Informatics Section. NSF advances 25 projects to explore bold ideas for transformative research. September 15, 2020.