Visualization and Exploration of Transcriptomics Data Nils Gehlenborg

Total Page:16

File Type:pdf, Size:1020Kb

Visualization and Exploration of Transcriptomics Data Nils Gehlenborg Visualization and Exploration of Transcriptomics Data 05 The identifier 800 year identifier Nils Gehlenborg Sidney Sussex College To celebrate our 800 year history an adaptation of the core identifier has been commissioned. This should be used on communications in the time period up to and including 2009. The 800 year identifier consists of three elements: the shield, the University of Cambridge logotype and the 800 years wording. It should not be redrawn, digitally manipulated or altered. The elements should not be A dissertation submitted to the University of Cambridge used independently and their relationship should for the degree of Doctor of Philosophy remain consistent. The 800 year identifier must always be reproduced from a digital master reference. This is available in eps, jpeg and gif format. Please ensure the appropriate artwork format is used. File formats European Molecular Biology Laboratory, eps: all professionally printed applications European Bioinformatics Institute, jpeg: Microsoft programmes Wellcome Trust Genome Campus, gif: online usage Hinxton, Cambridge, CB10 1SD, Colour United Kingdom. The 800 year identifier only appears in the five colour variants shown on this page. Email: [email protected] Black, Red Pantone 032, Yellow Pantone 109 and white October 12, 2010 shield with black (or white name). Single colour black or white. Please try to avoid any other colour combinations. Pantone 032 R237 G41 B57 Pantone 109 R254 G209 B0 To Maureen. This dissertation is my own work and contains nothing which is the outcome of work done in collaboration with others, except as specified in the text and acknowledgements. This dissertation is not substantially the same as any I have submit- ted for a degree, diploma or other qualification at any other university, and no part has already been, or is currently being submitted for any degree, diploma or other qualification. This dissertation does not exceed the specified word limit of 60,000 words as defined by the Biology Degree Committee. This dissertation has been typeset in 12 pt Palatino using LATEX2# ac- cording to the specifications defined by the Board of Graduate Studies and the Biology Degree Committee. October 12, 2010 Nils Gehlenborg Visualization and Exploration of Transcriptomics Data Summary October 12, 2010 Nils Gehlenborg Sidney Sussex College During the last decade, high-throughput analysis of transcriptomes with microarrays and other technologies has become increasingly mature and affordable, which has led to rapid growth of the number and size of avail- able transcriptomics data sets. This in turn has created new challenges for bioinformatics to provide adequate methods for data exploration and visualization, which are the topic of this dissertation. The first research question that I address is the efficient retrieval of transcriptomics data sets from large databases, which is motivated by the rapid growth of public repositories for transcriptomics data. This makes the exploration of such repositories increasingly challenging, but also pro- vides opportunities for biological discovery. I describe a knowledge- driven approach for exploration of transcriptomics data repositories based on ontology visualization and data-driven approaches based on gene set enrichment analysis and generative probabilistic models. The second research question of this dissertation deals with the vi- sualization of large transcriptomics data sets. This work has been driven by the observation that there is a growing number of data sets with hun- dreds or thousands of samples but a lack of suitable methods to visualize such data sets. To address this problem, I first present an analysis of the visualization tasks in this context, and then describe the design of an in- teractive visualization method based on pixel-oriented visualizations and tree maps. This design study is complemented with a description of the implementation of the method and a discussion of practical aspects that need to be taken into account when visualizing data sets of such scale. This dissertation includes several case studies in which I describe the application of the proposed methods to a range of real-world data sets and discuss my findings. Preface This dissertation marks the end of my student life, which has been a beau- tiful ten year journey that took me halfway around the globe, and during which I have met many inspiring people who have shaped my view of the world. My decision to pursue a PhD grew out of the wonderful experi- ence I had as an undergraduate researcher in the group of Kay Nieselt in Tubingen¨ and I’m very thankful that she gave me the opportunity to take some first steps in the big, great world of science. I’m also very indebted to my former mentors at the Institute for Systems Biology, Inyoul Lee and Daehee Hwang, who inspired me with their perseverance and work ethic and for whose support over the years I am ever so grateful. For the last four years, I have had Alvis Brazma as my advisor to whom I am most thankful for his support and advice as well as for allow- ing me the freedom to develop my scientific interests and define my own area of research. I also thank the members of my thesis advisory commit- tee, Nick Luscombe, Gos Micklem and Lars Steinmetz for their guidance and encouragement. I’m indebted to the members of the Functional Genomics team at the EBI, who supported so many aspects of my research during last four years. I want to acknowledge in particular the help of Nikolay Kolesnikov, who worked with me on the ArrayExpress Explorer interface, Margus Lukk, who provided me with the data set that ultimately led to the de- velopment of the Space Maps visualization, Gabriella Rustici, who was so helpful with everything related to actual wet lab biology, James Malone and Tomasz Adamusiak, who answered many, many questions about the Experimental Factor Ontology, Ele Holloway, who solved a few curation mysteries for me, Misha Kapushesky, who provided me with access to the ArrayExpress Atlas data, and Helen Parkinson, who helped out whenever I had a questions about ArrayExpress, annotation or ontologies. I also want to thank my fellow PhD students and office mates Katherine Lawler ii and Angelaˆ Gonc¸alves for sharing their knowledge with me. In particu- lar, I want to thank Garth Ilsley for the many conversations we had over the last four years and for joining me for some time in A3-118. Richard Coulson also deserves special mention for many insightful discussions about biology and, even more so, for fun evenings at the pub followed by mandatory late night Indian dinners. Much of the research I undertook during the four years of my PhD was in collaboration with the group of Samuel Kaski in Helsinki. I thank Sami for making me feel so welcome during my visits, and for the great discussions we had during our meetings. Very special thanks go to Jose´ Caldas, from whom I have learned a lot, and whom I greatly respect as a person, a scientist and a colleague. Furthermore, I’m indebted to Ali Faisal, Jaakko Peltonen and Helena Aidos for being such good collabo- rators and co-authors. I also have fond memories of good times spent outside the lab with Leo Lathi and, in particular, with Gayle Leen. If I had to name one single reason why it took me four instead of three and a half years to finish my PhD, I would blame it on the time spent on the organization of the “Visualization of Biological Data” workshop. Nonetheless, I’m enormously grateful for having been part of this, as well as for the opportunities that arose from it, and I’m deeply grateful to Sean´ O’Donoghue and Jim Procter for going through this experience with me. In particular, I want to thank Sean,´ and also Anne-Claude Gavin, for their help with the review on visualization of systems biology data. Of course, this would not have been possible without the support of our co-authors Nitin Baliga, Alexander Goesmann, Hiroaki Kitano, Oliver Kohlbacher, Heiko Neuweger, Reinhard Schneider, Dan Tenenbaum, and in particular Matt Hibbs, whose comments and feedback were especially helpful. Cydney Nielsen and Miriah Meyer have been such wonderful friends and colleagues, and I appreciate our many insightful discussions about visualization in biology. I also thank Joe Parry, my local connec- tion to the visual analytics community, for teaching me a lot about visu- alization in the intelligence analysis field, and about good real ale pubs in Cambridge. I’m grateful also to Bang Wong, Alan Blackwell and Roy Ruddle for conversations about visualization that shaped my views and influenced my work, as well as for coming to the EBI to speak at the “EBI Interfaces” seminars. In this context, I also want to thank Francis Rowland, Eamonn Maguire and Jenny Cham for helping me to organize these sem- inars and for growing a sizable visualization community on the Genome Campus. Finally, I want to thank all of my friends, and especially my fam- iii ily, in both Germany and the United States, for being so encouraging and helpful during my life as a student. And despite all the support I have received from the many people mentioned above, I could not have made it to this point without the encouragement and love of my wife, Maureen. I’m immensely grateful to her for being such an understanding and sup- portive partner. iv Contents Summaryi Preface ii Contentsv List of Figures ix List of Tables xi List of Acronyms xii 1 Introduction1 1.1 Transcriptomics Data.......................2 1.1.1 Gene Expression.....................2 1.1.2 Measurement Technologies...............5 1.1.3 Experimental Design...................8 1.1.4 Data Representation................... 10 1.1.5 Repositories.......................
Recommended publications
  • Research Article Advanced Heat Map and Clustering Analysis Using Heatmap3
    Hindawi Publishing Corporation BioMed Research International Volume 2014, Article ID 986048, 6 pages http://dx.doi.org/10.1155/2014/986048 Research Article Advanced Heat Map and Clustering Analysis Using Heatmap3 Shilin Zhao, Yan Guo, Quanhu Sheng, and Yu Shyr Center for Quantitative Sciences, Vanderbilt University, Nashville, TN 37232, USA Correspondence should be addressed to Yu Shyr; [email protected] Received 6 June 2014; Accepted 2 July 2014; Published 16 July 2014 Academic Editor: Leng Han Copyright © 2014 Shilin Zhao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Heat maps and clustering are used frequently in expression analysis studies for data visualization and quality control. Simple clustering and heat maps can be produced from the “heatmap” function in R. However, the “heatmap” function lacks certain functionalities and customizability, preventing it from generating advanced heat maps and dendrograms. To tackle the limitations of the “heatmap” function, we have developed an R package “heatmap3” which significantly improves the original “heatmap” function by adding several more powerful and convenient features. The “heatmap3” package allows users to produce highly customizable state of the art heat maps and dendrograms. The “heatmap3” package is developed based on the “heatmap” function in R, and itis completely compatible with it. The new features of “heatmap3” include highly customizable legends and side annotation, a wider range of color selections, new labeling features which allow users to define multiple layers of phenotype variables, and automatically conducted association tests based on the phenotypes provided.
    [Show full text]
  • Searching the Genomes of Inbred Mouse Strains for Incompatibilities That Reproductively Isolate Their Wild Relatives
    Journal of Heredity 2007:98(2):115–122 ª The American Genetic Association. 2007. All rights reserved. doi:10.1093/jhered/esl064 For permissions, please email: [email protected]. Advance Access publication January 5, 2007 Searching the Genomes of Inbred Mouse Strains for Incompatibilities That Reproductively Isolate Their Wild Relatives BRET A. PAYSEUR AND MICHAEL PLACE From the Laboratory of Genetics, University of Wisconsin, Madison, WI 53706. Address correspondence to the author at the address above, or e-mail: [email protected]. Abstract Identification of the genes that underlie reproductive isolation provides important insights into the process of speciation. According to the Dobzhansky–Muller model, these genes suffer disrupted interactions in hybrids due to independent di- vergence in separate populations. In hybrid populations, natural selection acts to remove the deleterious heterospecific com- binations that cause these functional disruptions. When selection is strong, this process can maintain multilocus associations, primarily between conspecific alleles, providing a signature that can be used to locate incompatibilities. We applied this logic to populations of house mice that were formed by hybridization involving two species that show partial reproductive isolation, Mus domesticus and Mus musculus. Using molecular markers likely to be informative about species ancestry, we scanned the genomes of 1) classical inbred strains and 2) recombinant inbred lines for pairs of loci that showed extreme linkage disequi- libria. By using the same set of markers, we identified a list of locus pairs that displayed similar patterns in both scans. These genomic regions may contain genes that contribute to reproductive isolation between M. domesticus and M.
    [Show full text]
  • Supplement 1 Microarray Studies
    EASE Categories Significantly Enriched in vs MG vs vs MGC4-2 Pt1-C vs C4-2 Pt1-C UP-Regulated Genes MG System Gene Category EASE Global MGRWV Pt1-N RWV Pt1-N Score FDR GO Molecular Extracellular matrix cellular construction 0.0008 0 110 genes up- Function Interpro EGF-like domain 0.0009 0 regulated GO Molecular Oxidoreductase activity\ acting on single dono 0.0015 0 Function GO Molecular Calcium ion binding 0.0018 0 Function Interpro Laminin-G domain 0.0025 0 GO Biological Process Cell Adhesion 0.0045 0 Interpro Collagen Triple helix repeat 0.0047 0 KEGG pathway Complement and coagulation cascades 0.0053 0 KEGG pathway Immune System – Homo sapiens 0.0053 0 Interpro Fibrillar collagen C-terminal domain 0.0062 0 Interpro Calcium-binding EGF-like domain 0.0077 0 GO Molecular Cell adhesion molecule activity 0.0105 0 Function EASE Categories Significantly Enriched in Down-Regulated Genes System Gene Category EASE Global Score FDR GO Biological Process Copper ion homeostasis 2.5E-09 0 Interpro Metallothionein 6.1E-08 0 Interpro Vertebrate metallothionein, Family 1 6.1E-08 0 GO Biological Process Transition metal ion homeostasis 8.5E-08 0 GO Biological Process Heavy metal sensitivity/resistance 1.9E-07 0 GO Biological Process Di-, tri-valent inorganic cation homeostasis 6.3E-07 0 GO Biological Process Metal ion homeostasis 6.3E-07 0 GO Biological Process Cation homeostasis 2.1E-06 0 GO Biological Process Cell ion homeostasis 2.1E-06 0 GO Biological Process Ion homeostasis 2.1E-06 0 GO Molecular Helicase activity 2.3E-06 0 Function GO Biological
    [Show full text]
  • Bioinformatics Analyses of Genomic Imprinting
    Bioinformatics Analyses of Genomic Imprinting Dissertation zur Erlangung des Grades des Doktors der Naturwissenschaften der Naturwissenschaftlich-Technischen Fakultät III Chemie, Pharmazie, Bio- und Werkstoffwissenschaften der Universität des Saarlandes von Barbara Hutter Saarbrücken 2009 Tag des Kolloquiums: 08.12.2009 Dekan: Prof. Dr.-Ing. Stefan Diebels Berichterstatter: Prof. Dr. Volkhard Helms Priv.-Doz. Dr. Martina Paulsen Vorsitz: Prof. Dr. Jörn Walter Akad. Mitarbeiter: Dr. Tihamér Geyer Table of contents Summary________________________________________________________________ I Zusammenfassung ________________________________________________________ I Acknowledgements _______________________________________________________II Abbreviations ___________________________________________________________ III Chapter 1 – Introduction __________________________________________________ 1 1.1 Important terms and concepts related to genomic imprinting __________________________ 2 1.2 CpG islands as regulatory elements ______________________________________________ 3 1.3 Differentially methylated regions and imprinting clusters_____________________________ 6 1.4 Reading the imprint __________________________________________________________ 8 1.5 Chromatin marks at imprinted regions___________________________________________ 10 1.6 Roles of repetitive elements ___________________________________________________ 12 1.7 Functional implications of imprinted genes _______________________________________ 14 1.8 Evolution and parental conflict ________________________________________________
    [Show full text]
  • Visualization of Biomedical Data
    Visualization of Biomedical Data Corresponding author: Seán I. O’Donoghue; email: [email protected] • Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Eveleigh NSW 2015, Australia • Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia • School of Biotechnology and Biomolecular Sciences, UNSW, Kensington NSW 2033, Australia Benedetta Frida Baldi; email: [email protected] • Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia Susan J Clark; email: [email protected] • Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia Aaron E. Darling; email: [email protected] • The ithree institute, University of Technology Sydney, Ultimo NSW 2007, Australia James M. Hogan; email: [email protected] • School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane QLD, 4000, Australia Sandeep Kaur; email: [email protected] • School of Computer Science and Engineering, UNSW, Kensington NSW 2033, Australia Lena Maier-Hein; email: [email protected] • Div. Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany Davis J. McCarthy; email: [email protected] • European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, UK • St. Vincent’s Institute of Medical Research, Fitzroy VIC 3065, Australia William
    [Show full text]
  • Aneuploidy: Using Genetic Instability to Preserve a Haploid Genome?
    Health Science Campus FINAL APPROVAL OF DISSERTATION Doctor of Philosophy in Biomedical Science (Cancer Biology) Aneuploidy: Using genetic instability to preserve a haploid genome? Submitted by: Ramona Ramdath In partial fulfillment of the requirements for the degree of Doctor of Philosophy in Biomedical Science Examination Committee Signature/Date Major Advisor: David Allison, M.D., Ph.D. Academic James Trempe, Ph.D. Advisory Committee: David Giovanucci, Ph.D. Randall Ruch, Ph.D. Ronald Mellgren, Ph.D. Senior Associate Dean College of Graduate Studies Michael S. Bisesi, Ph.D. Date of Defense: April 10, 2009 Aneuploidy: Using genetic instability to preserve a haploid genome? Ramona Ramdath University of Toledo, Health Science Campus 2009 Dedication I dedicate this dissertation to my grandfather who died of lung cancer two years ago, but who always instilled in us the value and importance of education. And to my mom and sister, both of whom have been pillars of support and stimulating conversations. To my sister, Rehanna, especially- I hope this inspires you to achieve all that you want to in life, academically and otherwise. ii Acknowledgements As we go through these academic journeys, there are so many along the way that make an impact not only on our work, but on our lives as well, and I would like to say a heartfelt thank you to all of those people: My Committee members- Dr. James Trempe, Dr. David Giovanucchi, Dr. Ronald Mellgren and Dr. Randall Ruch for their guidance, suggestions, support and confidence in me. My major advisor- Dr. David Allison, for his constructive criticism and positive reinforcement.
    [Show full text]
  • Gene Expression During Normal and FSHD Myogenesis Tsumagari Et Al
    Gene expression during normal and FSHD myogenesis Tsumagari et al. Tsumagari et al. BMC Medical Genomics 2011, 4:67 http://www.biomedcentral.com/1755-8794/4/67 (27 September 2011) Tsumagari et al. BMC Medical Genomics 2011, 4:67 http://www.biomedcentral.com/1755-8794/4/67 RESEARCHARTICLE Open Access Gene expression during normal and FSHD myogenesis Koji Tsumagari1, Shao-Chi Chang1, Michelle Lacey2,3, Carl Baribault2,3, Sridar V Chittur4, Janet Sowden5, Rabi Tawil5, Gregory E Crawford6 and Melanie Ehrlich1,3* Abstract Background: Facioscapulohumeral muscular dystrophy (FSHD) is a dominant disease linked to contraction of an array of tandem 3.3-kb repeats (D4Z4) at 4q35. Within each repeat unit is a gene, DUX4, that can encode a protein containing two homeodomains. A DUX4 transcript derived from the last repeat unit in a contracted array is associated with pathogenesis but it is unclear how. Methods: Using exon-based microarrays, the expression profiles of myogenic precursor cells were determined. Both undifferentiated myoblasts and myoblasts differentiated to myotubes derived from FSHD patients and controls were studied after immunocytochemical verification of the quality of the cultures. To further our understanding of FSHD and normal myogenesis, the expression profiles obtained were compared to those of 19 non-muscle cell types analyzed by identical methods. Results: Many of the ~17,000 examined genes were differentially expressed (> 2-fold, p < 0.01) in control myoblasts or myotubes vs. non-muscle cells (2185 and 3006, respectively) or in FSHD vs. control myoblasts or myotubes (295 and 797, respectively). Surprisingly, despite the morphologically normal differentiation of FSHD myoblasts to myotubes, most of the disease-related dysregulation was seen as dampening of normal myogenesis- specific expression changes, including in genes for muscle structure, mitochondrial function, stress responses, and signal transduction.
    [Show full text]
  • Induction by Lupus Immune Complexes Differentially Regulate
    Plasmacytoid Dendritic Cells and C1q Differentially Regulate Inflammatory Gene Induction by Lupus Immune Complexes This information is current as Deanna M. Santer, Alice E. Wiedeman, Thomas H. Teal, of September 23, 2021. Pradipta Ghosh and Keith B. Elkon J Immunol published online 5 December 2011 http://www.jimmunol.org/content/early/2011/12/04/jimmun ol.1102797 Downloaded from Supplementary http://www.jimmunol.org/content/suppl/2011/12/05/jimmunol.110279 Material 7.DC1 http://www.jimmunol.org/ Why The JI? Submit online. • Rapid Reviews! 30 days* from submission to initial decision • No Triage! Every submission reviewed by practicing scientists • Fast Publication! 4 weeks from acceptance to publication by guest on September 23, 2021 *average Subscription Information about subscribing to The Journal of Immunology is online at: http://jimmunol.org/subscription Permissions Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Email Alerts Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts The Journal of Immunology is published twice each month by The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2011 by The American Association of Immunologists, Inc. All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. Published December 5, 2011, doi:10.4049/jimmunol.1102797 The Journal of Immunology Plasmacytoid Dendritic Cells and C1q Differentially Regulate Inflammatory Gene Induction by Lupus Immune Complexes Deanna M. Santer,* Alice E. Wiedeman,* Thomas H. Teal,† Pradipta Ghosh,† and Keith B. Elkon*,† Immune complexes (ICs) play a pivotal role in causing inflammation in systemic lupus erythematosus (SLE).
    [Show full text]
  • Data Visualization by Nils Gehlenborg
    Data Visualization Nils Gehlenborg ([email protected]) Center for Biomedical Informatics / Harvard Medical School Cancer Program / Broad Institute of MIT and Harvard ISMB/ECCB 2011 http://www.biovis.net Flyers at ISCB booth! Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg A good sketch is better than a long speech. Napoleon Bonaparte Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg Minard 1869 Napoleon’s March on Moscow Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 4 I believe when I see it. Unknown Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg Anscombe 1973, The American Statistician Anscombe’s Quartet mean(X) = 9, var(X) = 11, mean(Y) = 7.5, var(Y) = 4.12, cor(X,Y) = 0.816, linear regression line Y = 3 + 0.5*X Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 6 Anscombe 1973, The American Statistician Anscombe’s Quartet Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 7 Exploration: Hypothesis Generation trends gaps outliers clusters - A large data set is given and the goal is to learn something about it. - Visualization is employed to perform pattern detection using the human visual system. - The goal is to generate hypotheses that can be tested with statistical methods or follow-up experiments. Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 8 Visualization Use Cases Presentation Confirmation Exploration Data Visualization / ISMB/ECCB 2011 / Nils Gehlenborg 9 Definition The use of computer-supported, interactive, visual representations of data to amplify cognition. Stu Card, Jock Mackinlay & Ben Shneiderman Computer-based visualization systems provide visual representations of datasets intended to help people carry out some task more effectively.effectively.
    [Show full text]
  • Role and Regulation of the P53-Homolog P73 in the Transformation of Normal Human Fibroblasts
    Role and regulation of the p53-homolog p73 in the transformation of normal human fibroblasts Dissertation zur Erlangung des naturwissenschaftlichen Doktorgrades der Bayerischen Julius-Maximilians-Universität Würzburg vorgelegt von Lars Hofmann aus Aschaffenburg Würzburg 2007 Eingereicht am Mitglieder der Promotionskommission: Vorsitzender: Prof. Dr. Dr. Martin J. Müller Gutachter: Prof. Dr. Michael P. Schön Gutachter : Prof. Dr. Georg Krohne Tag des Promotionskolloquiums: Doktorurkunde ausgehändigt am Erklärung Hiermit erkläre ich, dass ich die vorliegende Arbeit selbständig angefertigt und keine anderen als die angegebenen Hilfsmittel und Quellen verwendet habe. Diese Arbeit wurde weder in gleicher noch in ähnlicher Form in einem anderen Prüfungsverfahren vorgelegt. Ich habe früher, außer den mit dem Zulassungsgesuch urkundlichen Graden, keine weiteren akademischen Grade erworben und zu erwerben gesucht. Würzburg, Lars Hofmann Content SUMMARY ................................................................................................................ IV ZUSAMMENFASSUNG ............................................................................................. V 1. INTRODUCTION ................................................................................................. 1 1.1. Molecular basics of cancer .......................................................................................... 1 1.2. Early research on tumorigenesis ................................................................................. 3 1.3. Developing
    [Show full text]
  • WO 2010/127399 Al
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (10) International Publication Number (43) International Publication Date 11 November 2010 (11.11.2010) WO 2010/127399 Al (51) International Patent Classification: (74) Agent: MONGER, Carmela; Walter and Eliza Hall In C12Q 1/68 (2006.01) GOlN 35/00 (2006.01) stitute of Medical Research, IG Royal Parade, Parkville, GOlN 33/48 (2006.01 ) Melbourne, Victoria 3052 (AU). (21) International Application Number: (81) Designated States (unless otherwise indicated, for every PCT/AU20 10/000524 kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BR, BW, BY, BZ, (22) Date: International Filing CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, DO, 6 May 2010 (06.05.2010) DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, (25) Filing Language: English HN, HR, HU, ID, IL, IN, IS, JP, KE, KG, KM, KN, KP, KR, KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, (26) Publication Language: English ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, (30) Priority Data: NO, NZ, OM, PE, PG, PH, PL, PT, RO, RS, RU, SC, SD, 2009901989 6 May 2009 (06.05.2009) AU SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (71) Applicant (for all designated States except US): WAL¬ TER AND ELIZA HALL INSTITUTE OF MEDICAL (84) Designated States (unless otherwise indicated, for every RESEARCH [AU/AU]; IG Royal Parade, Parkville, kind of regional protection available): ARIPO (BW, GH, Melbourne, Victoria 3052 (AU).
    [Show full text]
  • Bringing 'Bee-Cological' Data to Life Through a Relational Database and an Interactive Visualization Tool By
    Bringing 'Bee-cological' Data to Life through a Relational Database and an Interactive Visualization Tool by Xiaojun Wang A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degree of Master of Science in Bioinformatics and Computational Biology Aug 2018 APPROVED BY: Dr. Carolina Ruiz Dr. Robert J. Gegear Dr. Elizabeth F. Ryder 1 Abstract Over the past decade, bumblebees have rapidly declined in abundance and geographic distribution at an alarming rate, raising major social, economic and ecological concern worldwide. However, we presently lack effective bumblebee conservation strategies due to a lack of information on the specific ecological needs of each species. The ‘Beecology Project’ was created to fill this knowledge gap by utilizing citizen scientists to collect data on floral resource use patterns of foraging bees in naturally occurring mixed species communities across Massachusetts. In addition to its research goals, the Beecology Project also has the educational goal of providing a modular, integrated biology - computer science framework (a BIO-CS bridge) to assist teachers in developing curricula to meet the next generation biology and computer science standards at the high school level. The Beecology team has developed Android and Web mobile apps to assist citizen scientists to collect and submit field data on bumblebee and plant species interactions. Other Beecology team members also collected a substantial amount of bumblebee data through field research and online digital museum collections. However, there was no central location dedicated to the storage of such data. There was also no way for users such as researchers, educators, and the general public to access all of the collected data in an ecologically-meaningful way.
    [Show full text]