Supplemental Data Heidel Et Al

Total Page:16

File Type:pdf, Size:1020Kb

Supplemental Data Heidel Et Al Supplemental data Heidel et al. Table of Contents 1. Sequencing strategy and statistics ...................................................................................................... 2 2. Genome structure ............................................................................................................................... 2 2.1 Extrachromosal elements .............................................................................................................. 2 2.2 Chromosome structure ................................................................................................................. 3 2.3 Repetitive elements ...................................................................................................................... 5 3. Coding sequences ................................................................................................................................ 5 3.1 Homopolymer tracts ..................................................................................................................... 5 3.2 Gene families and orthology relationships ................................................................................... 7 3.3 Synteny analysis .......................................................................................................................... 11 4. Protein functional domains ............................................................................................................... 12 5. Protein families .................................................................................................................................. 13 5.1 Primary metabolism .................................................................................................................... 13 5.2 Secondary metabolism ................................................................................................................ 16 5.3 Cell shape, organization and motility .......................................................................................... 19 5.3.1 Kinesins ................................................................................................................................. 19 5.3.2 Myosins ................................................................................................................................ 21 5.3.3 The microfilament system .................................................................................................... 22 5.4 Gene transcription ....................................................................................................................... 32 5.5 Cell adhesion ............................................................................................................................... 34 5.6 Cell signaling ................................................................................................................................ 37 5.6.1 Seven transmembrane domain receptors including G‐protein coupled receptors ............. 37 5.6.2 Cyclic nucleotide synthesis, detection and degradation ...................................................... 40 5.6.3 Sensor histidine kinases ....................................................................................................... 43 5.6.4 Monomeric G‐proteins ......................................................................................................... 45 5.6.5 ABC transporters .................................................................................................................. 49 6. Supplemental Methods ..................................................................................................................... 52 6.1 DNA isolation ............................................................................................................................... 52 6.2 Sequencing and Assembly ........................................................................................................... 52 6.3 Chromosome structure analysis .................................................................................................. 53 6.4 Gene prediction, Blast and synteny analysis ............................................................................... 53 6.5 Gene family detection using domain analysis ............................................................................. 53 6.6 Protein variation .......................................................................................................................... 53 6.7 Phylogeny and species split dating .............................................................................................. 54 7. References ......................................................................................................................................... 57 1. Sequencing strategy and statistics The sequencing projects for both the P. pallidum (PP) and D. fasciculatum (DF) genomes were initiated by paired-end Sanger sequencing of 1-2 kb sheared genomic DNA inserts cloned in pUC18, soon to be followed by four runs of more cost-effective pyrosequencing using the 454 Roche platform, yielding a total coverage of over 15 x for each genome (Table S1.1). A dense physical map of both genomes was prepared by paired-end sequencing of ~30 kb genomic DNA fragments inserted into the fosmid vector pCC2FOS. The gaps within supercontigs were filled in by primer walking, until the sequence at both sides of the gap became too repetitive for primer design. At this point only 52 and 33 gaps remained in the PP and DF genomes respectively, which compares very favorably against the current state of 226 gaps in the DD genome, 5 years after completion and continued polishing (Table 1, main text). Table S1.1: Sequencing statistics P. pallidum D. fasciculatum reads from fosmid clones 7937 5033 reads from small insert library clones 112268 80433 454 runs/MB raw data 4/416 4/418 Contigs from initial newbler assembly 6352 7292 gap closing reads 1068 1165 genome coverage 14.4 x 14.9 x Final assembly: contigs/supercontigs 52/41 33/25 2. Genome structure 2.1 Extrachromosal elements Metazoans and plants integrate tandem arrays of rRNA genes into existing chromosomes (Schwarzacher and Wachtler 1993) to provide a large number of copies for simultaneous transcription. In contrast, some protists, such as Tetrahymena (Blomberg et al. 1997) amplify their rRNA genes on extrachromosomal palindromes. All analysed Dictyostelia contain an amplified extrachromosomal sequence that codes for rRNA genes. Sequences of extrachromosomal elements are highly overrepresented in the sequencing reads due to their high abundance. Yet, repetitiveness prevented their assembly from 454 read data alone. Thus, small insert library reads were used to assemble the full length palindrome arms. rRNA genes were defined based on homology to the DD counterparts and the common eukaryote set of rRNA genes. Based on the relative abundance of the sequencing reads that match the palindromes, compared to unique parts of the genome, the PP, DF and DD palindromes make up around 5, 9 and 25% of the genomic DNA content, respectively. This difference is mainly attributable to the shorter palindrome arms in PP (15 kb) and DF (26 kb) compared to 45 kb for DD. Thus, the number of palindrome molecules is in the same range in all species. The palindrome organization is the same as in DD (Sucgang et al. 2003): the rRNA genes reside at the ends of the arms with transcription directed towards the telomeric regions. The observed plasticity in palindrome size is thus due to different amounts of repeated sequences in their central regions. To highlight conserved regions in extrachromosomal palindromic elements of DD, PP and DF, the full (DD) or half palindromes (PP and DF) were aligned and pairwise represented in a dot-matrix using Tuple_plot (Szafranski et al. 2006), an algoritm which reduces noise caused by repetitive sequence. Only the regions that contain the ribosomal RNA genes are conserved between the three palindromes and these regions are typically located at the end of the palindrome arms (Fig. S2.1) 2 A Comparison between the extrachromosomal elements of DD (X-axis) and DF (Y-axis). B Comparison between DD (X-axis) and PP (Y-axis). C Comparison between DF (X-axis) and PP (Y-axis). Figure S2.1: Pairwise alignments between DD, PP and DF extrachromosomal elements represented as Tuple_plots. The full palindrome is shown only for DD. Yellow shadowing denotes the positions of the rRNA genes. Black dots represent tuples in the same direction, red dots inverse tuples. Representations are drawn to scale so that the figures are comparable. 2.2 Chromosome structure In DD no telomere structures common to eukaryotes are present. However, when we searched the assembled genomes of PP and DF for the common TTTAGA motif and variations thereof, we found repeated structures of such motifs in both species (14 in PP and 12 in DF). The motifs were all located at contig ends indicating a role as telomeres. In case of DF half of these structures were directly associated with DIRS transposable elements reflecting the location of DIRS elements at telomer ends in DD. The repeated telomere motifs in PP were found in the neighborhood of complex structures (see main text) but no DIRS elements could be found in the entire PP genome. Figure S2.2 shows the chromosomes of PP separated by Pulsed Field Gel Electrophoresis (PFGE), confirming the prediction of 7 chromosomes from 14 telomeres. 3 Figure S2.2: Pulsed-field gel separation of Pp PN500
Recommended publications
  • Deregulated Gene Expression Pathways in Myelodysplastic Syndrome Hematopoietic Stem Cells
    Leukemia (2010) 24, 756–764 & 2010 Macmillan Publishers Limited All rights reserved 0887-6924/10 $32.00 www.nature.com/leu ORIGINAL ARTICLE Deregulated gene expression pathways in myelodysplastic syndrome hematopoietic stem cells A Pellagatti1, M Cazzola2, A Giagounidis3, J Perry1, L Malcovati2, MG Della Porta2,MJa¨dersten4, S Killick5, A Verma6, CJ Norbury7, E Hellstro¨m-Lindberg4, JS Wainscoat1 and J Boultwood1 1LRF Molecular Haematology Unit, NDCLS, John Radcliffe Hospital, Oxford, UK; 2Department of Hematology Oncology, University of Pavia Medical School, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; 3Medizinische Klinik II, St Johannes Hospital, Duisburg, Germany; 4Division of Hematology, Department of Medicine, Karolinska Institutet, Stockholm, Sweden; 5Department of Haematology, Royal Bournemouth Hospital, Bournemouth, UK; 6Albert Einstein College of Medicine, Bronx, NY, USA and 7Sir William Dunn School of Pathology, University of Oxford, Oxford, UK To gain insight into the molecular pathogenesis of the the World Health Organization.6,7 Patients with refractory myelodysplastic syndromes (MDS), we performed global gene anemia (RA) with or without ringed sideroblasts, according to expression profiling and pathway analysis on the hemato- poietic stem cells (HSC) of 183 MDS patients as compared with the the French–American–British classification, were subdivided HSC of 17 healthy controls. The most significantly deregulated based on the presence or absence of multilineage dysplasia. In pathways in MDS include interferon signaling, thrombopoietin addition, patients with RA with excess blasts (RAEB) were signaling and the Wnt pathways. Among the most signifi- subdivided into two categories, RAEB1 and RAEB2, based on the cantly deregulated gene pathways in early MDS are immuno- percentage of bone marrow blasts.
    [Show full text]
  • Biomarker Discovery for Chronic Liver Diseases by Multi-Omics
    www.nature.com/scientificreports OPEN Biomarker discovery for chronic liver diseases by multi-omics – a preclinical case study Daniel Veyel1, Kathrin Wenger1, Andre Broermann2, Tom Bretschneider1, Andreas H. Luippold1, Bartlomiej Krawczyk1, Wolfgang Rist 1* & Eric Simon3* Nonalcoholic steatohepatitis (NASH) is a major cause of liver fbrosis with increasing prevalence worldwide. Currently there are no approved drugs available. The development of new therapies is difcult as diagnosis and staging requires biopsies. Consequently, predictive plasma biomarkers would be useful for drug development. Here we present a multi-omics approach to characterize the molecular pathophysiology and to identify new plasma biomarkers in a choline-defcient L-amino acid-defned diet rat NASH model. We analyzed liver samples by RNA-Seq and proteomics, revealing disease relevant signatures and a high correlation between mRNA and protein changes. Comparison to human data showed an overlap of infammatory, metabolic, and developmental pathways. Using proteomics analysis of plasma we identifed mainly secreted proteins that correlate with liver RNA and protein levels. We developed a multi-dimensional attribute ranking approach integrating multi-omics data with liver histology and prior knowledge uncovering known human markers, but also novel candidates. Using regression analysis, we show that the top-ranked markers were highly predictive for fbrosis in our model and hence can serve as preclinical plasma biomarkers. Our approach presented here illustrates the power of multi-omics analyses combined with plasma proteomics and is readily applicable to human biomarker discovery. Nonalcoholic fatty liver disease (NAFLD) is the major liver disease in western countries and is ofen associated with obesity, metabolic syndrome, or type 2 diabetes.
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • Dual Proteome-Scale Networks Reveal Cell-Specific Remodeling of the Human Interactome
    bioRxiv preprint doi: https://doi.org/10.1101/2020.01.19.905109; this version posted January 19, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Dual Proteome-scale Networks Reveal Cell-specific Remodeling of the Human Interactome Edward L. Huttlin1*, Raphael J. Bruckner1,3, Jose Navarrete-Perea1, Joe R. Cannon1,4, Kurt Baltier1,5, Fana Gebreab1, Melanie P. Gygi1, Alexandra Thornock1, Gabriela Zarraga1,6, Stanley Tam1,7, John Szpyt1, Alexandra Panov1, Hannah Parzen1,8, Sipei Fu1, Arvene Golbazi1, Eila Maenpaa1, Keegan Stricker1, Sanjukta Guha Thakurta1, Ramin Rad1, Joshua Pan2, David P. Nusinow1, Joao A. Paulo1, Devin K. Schweppe1, Laura Pontano Vaites1, J. Wade Harper1*, Steven P. Gygi1*# 1Department of Cell Biology, Harvard Medical School, Boston, MA, 02115, USA. 2Broad Institute, Cambridge, MA, 02142, USA. 3Present address: ICCB-Longwood Screening Facility, Harvard Medical School, Boston, MA, 02115, USA. 4Present address: Merck, West Point, PA, 19486, USA. 5Present address: IQ Proteomics, Cambridge, MA, 02139, USA. 6Present address: Vor Biopharma, Cambridge, MA, 02142, USA. 7Present address: Rubius Therapeutics, Cambridge, MA, 02139, USA. 8Present address: RPS North America, South Kingstown, RI, 02879, USA. *Correspondence: [email protected] (E.L.H.), [email protected] (J.W.H.), [email protected] (S.P.G.) #Lead Contact: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/2020.01.19.905109; this version posted January 19, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder.
    [Show full text]
  • NVND-2019 Book
    2nd International Conference on Neurovascular and Neurodegenerative Diseases October 28-30 2019 Media Partner Venue Paris Marriott Charles de Gaulle Airport Hotel 5 Allee du Verger, Zone Hoteliere Roissy en France, 95700 France Day- 1 Monday | October 28, 2019 Keynote Presentations Cerebrovascular Lesions in Pick Complex Diseases: A Neuropathological Study with a 7.0-tesla Magnetic Resonance Imaging Jacques De Reuck Université Lille 2, INSERM U1171, Degenerative & Vascular Cognitive Disorders, CHR Lille, France Abstract Introduction: Pick complex refers to a spectrum of diseases that have in common the presence of tau inclusions. The main neuropathological phenotypes comprise tau-frontotemporal lobar degeneration (Tau-FTLD), progressive supranuclear palsy (PSP) and corticobasal degeneration (CBD). The present neuropathological study investigates their incidence of cerebrovascular lesions. Material and Methods: seventy patients underwent an autopsy and post-mortem MRI. The brains consisted of 14 with Tau-FTLD, 22 with PSP, 6 with CBD and 28 controls, who had no history of a brain disease. A whole coronal section of a cerebral hemisphere, at the level of the mamillary body, was taken for the semi-quantitative evaluation of the small cerebrovascular lesions such as white matter changes (WMCs), cortical micro-bleeds (CoMBs), and cortical micro-infarcts (CoMIs). In addition the severity and the distribution of WMCs, CoMIs and CoMBs were examined with 7.0-tesla MRI on three coronal sections of a cerebral hemisphere. Results: on neuropathological examination severe WMCs and more CoMBs are observed in Tau-FTLD, while the latter are also more frequent in CBD. The MRI examination shows that severe WMCs are present in the frontal sections not only of the Tau-FTLD but also to a lesser degree of the PSP and CBD brains.
    [Show full text]
  • Cellular and Molecular Signatures in the Disease Tissue of Early
    Cellular and Molecular Signatures in the Disease Tissue of Early Rheumatoid Arthritis Stratify Clinical Response to csDMARD-Therapy and Predict Radiographic Progression Frances Humby1,* Myles Lewis1,* Nandhini Ramamoorthi2, Jason Hackney3, Michael Barnes1, Michele Bombardieri1, Francesca Setiadi2, Stephen Kelly1, Fabiola Bene1, Maria di Cicco1, Sudeh Riahi1, Vidalba Rocher-Ros1, Nora Ng1, Ilias Lazorou1, Rebecca E. Hands1, Desiree van der Heijde4, Robert Landewé5, Annette van der Helm-van Mil4, Alberto Cauli6, Iain B. McInnes7, Christopher D. Buckley8, Ernest Choy9, Peter Taylor10, Michael J. Townsend2 & Costantino Pitzalis1 1Centre for Experimental Medicine and Rheumatology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK. Departments of 2Biomarker Discovery OMNI, 3Bioinformatics and Computational Biology, Genentech Research and Early Development, South San Francisco, California 94080 USA 4Department of Rheumatology, Leiden University Medical Center, The Netherlands 5Department of Clinical Immunology & Rheumatology, Amsterdam Rheumatology & Immunology Center, Amsterdam, The Netherlands 6Rheumatology Unit, Department of Medical Sciences, Policlinico of the University of Cagliari, Cagliari, Italy 7Institute of Infection, Immunity and Inflammation, University of Glasgow, Glasgow G12 8TA, UK 8Rheumatology Research Group, Institute of Inflammation and Ageing (IIA), University of Birmingham, Birmingham B15 2WB, UK 9Institute of
    [Show full text]
  • Investigation of the Underlying Hub Genes and Molexular Pathogensis in Gastric Cancer by Integrated Bioinformatic Analyses
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Investigation of the underlying hub genes and molexular pathogensis in gastric cancer by integrated bioinformatic analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karanataka, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001 , Karanataka, India bioRxiv preprint doi: https://doi.org/10.1101/2020.12.20.423656; this version posted December 22, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Abstract The high mortality rate of gastric cancer (GC) is in part due to the absence of initial disclosure of its biomarkers. The recognition of important genes associated in GC is therefore recommended to advance clinical prognosis, diagnosis and and treatment outcomes. The current investigation used the microarray dataset GSE113255 RNA seq data from the Gene Expression Omnibus database to diagnose differentially expressed genes (DEGs). Pathway and gene ontology enrichment analyses were performed, and a proteinprotein interaction network, modules, target genes - miRNA regulatory network and target genes - TF regulatory network were constructed and analyzed. Finally, validation of hub genes was performed. The 1008 DEGs identified consisted of 505 up regulated genes and 503 down regulated genes.
    [Show full text]
  • Supp Material.Pdf
    Simon et al. Supplementary information: Table of contents p.1 Supplementary material and methods p.2-4 • PoIy(I)-poly(C) Treatment • Flow Cytometry and Immunohistochemistry • Western Blotting • Quantitative RT-PCR • Fluorescence In Situ Hybridization • RNA-Seq • Exome capture • Sequencing Supplementary Figures and Tables Suppl. items Description pages Figure 1 Inactivation of Ezh2 affects normal thymocyte development 5 Figure 2 Ezh2 mouse leukemias express cell surface T cell receptor 6 Figure 3 Expression of EZH2 and Hox genes in T-ALL 7 Figure 4 Additional mutation et deletion of chromatin modifiers in T-ALL 8 Figure 5 PRC2 expression and activity in human lymphoproliferative disease 9 Figure 6 PRC2 regulatory network (String analysis) 10 Table 1 Primers and probes for detection of PRC2 genes 11 Table 2 Patient and T-ALL characteristics 12 Table 3 Statistics of RNA and DNA sequencing 13 Table 4 Mutations found in human T-ALLs (see Fig. 3D and Suppl. Fig. 4) 14 Table 5 SNP populations in analyzed human T-ALL samples 15 Table 6 List of altered genes in T-ALL for DAVID analysis 20 Table 7 List of David functional clusters 31 Table 8 List of acquired SNP tested in normal non leukemic DNA 32 1 Simon et al. Supplementary Material and Methods PoIy(I)-poly(C) Treatment. pIpC (GE Healthcare Lifesciences) was dissolved in endotoxin-free D-PBS (Gibco) at a concentration of 2 mg/ml. Mice received four consecutive injections of 150 μg pIpC every other day. The day of the last pIpC injection was designated as day 0 of experiment.
    [Show full text]
  • Kinesin Family Member 6 (Kif6) Is Necessary for Spine Development in Zebrafish 8 9 10 11 12 Jillian G
    Page 1 of 40 Developmental Dynamics 1 2 3 4 5 6 7 Kinesin family member 6 (kif6) is necessary for spine development in zebrafish 8 9 10 11 12 Jillian G. Buchan1, Ryan S. Gray2, John M. Gansner6, David M. Alvarado3, Lydia Burgert4, 13 14 Jonathan D. Gitlin7, Christina A. Gurnett3,4,5, Matthew I. Goldsmith1,4,* 15 16 17 Departments of 1Genetics, 2Developmental Biology, 3Orthopaedic Surgery, 4Pediatrics, 18 5 6 19 Neurology, Washington University School of Medicine, St. Louis, MO, 63110; Dana-Farber 20 Cancer Institute, Boston, MA, 02215; 7Eugene Bell Center for Regenerative Biology, Marine 21 Biological Laboratory, Woods Hole, MA, 02543 22 23 24 25 26 *Corresponding author: Matthew I. Goldsmith, MD 27 28 Department of Pediatrics 29 Washington University School of Medicine 30 [email protected] 31 660 S Euclid Ave, Campus Box 8208 32 St Louis, MO 63110 33 Phone: (314) 286-2769 34 35 Fax: (314) 286-2784 36 37 38 39 Running Title: Kif6 in zebrafish spine development 40 41 Keywords: kinesin, scoliosis, Danio rerio 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 Developmental Dynamics Page 2 of 40 1 2 3 ABSTRACT 4 5 6 Background: Idiopathic scoliosis is a form of spinal deformity that affects 2-3% of children and 7 8 results in curvature of the spine without structural defects of the vertebral units. The 9 10 11 pathogenesis of idiopathic scoliosis remains poorly understood, in part due to the lack of a 12 13 relevant animal model.
    [Show full text]
  • Molecular Genetics of Microcephaly Primary Hereditary: an Overview
    brain sciences Review Molecular Genetics of Microcephaly Primary Hereditary: An Overview Nikistratos Siskos † , Electra Stylianopoulou †, Georgios Skavdis and Maria E. Grigoriou * Department of Molecular Biology & Genetics, Democritus University of Thrace, 68100 Alexandroupolis, Greece; [email protected] (N.S.); [email protected] (E.S.); [email protected] (G.S.) * Correspondence: [email protected] † Equal contribution. Abstract: MicroCephaly Primary Hereditary (MCPH) is a rare congenital neurodevelopmental disorder characterized by a significant reduction of the occipitofrontal head circumference and mild to moderate mental disability. Patients have small brains, though with overall normal architecture; therefore, studying MCPH can reveal not only the pathological mechanisms leading to this condition, but also the mechanisms operating during normal development. MCPH is genetically heterogeneous, with 27 genes listed so far in the Online Mendelian Inheritance in Man (OMIM) database. In this review, we discuss the role of MCPH proteins and delineate the molecular mechanisms and common pathways in which they participate. Keywords: microcephaly; MCPH; MCPH1–MCPH27; molecular genetics; cell cycle 1. Introduction Citation: Siskos, N.; Stylianopoulou, Microcephaly, from the Greek word µικρoκεϕαλi´α (mikrokephalia), meaning small E.; Skavdis, G.; Grigoriou, M.E. head, is a term used to describe a cranium with reduction of the occipitofrontal head circum- Molecular Genetics of Microcephaly ference equal, or more that teo standard deviations
    [Show full text]
  • Mitoxplorer, a Visual Data Mining Platform To
    mitoXplorer, a visual data mining platform to systematically analyze and visualize mitochondrial expression dynamics and mutations Annie Yim, Prasanna Koti, Adrien Bonnard, Fabio Marchiano, Milena Dürrbaum, Cecilia Garcia-Perez, José Villaveces, Salma Gamal, Giovanni Cardone, Fabiana Perocchi, et al. To cite this version: Annie Yim, Prasanna Koti, Adrien Bonnard, Fabio Marchiano, Milena Dürrbaum, et al.. mitoXplorer, a visual data mining platform to systematically analyze and visualize mitochondrial expression dy- namics and mutations. Nucleic Acids Research, Oxford University Press, 2020, 10.1093/nar/gkz1128. hal-02394433 HAL Id: hal-02394433 https://hal-amu.archives-ouvertes.fr/hal-02394433 Submitted on 4 Dec 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License Nucleic Acids Research, 2019 1 doi: 10.1093/nar/gkz1128 Downloaded from https://academic.oup.com/nar/advance-article-abstract/doi/10.1093/nar/gkz1128/5651332 by Bibliothèque de l'université la Méditerranée user on 04 December 2019 mitoXplorer, a visual data mining platform to systematically analyze and visualize mitochondrial expression dynamics and mutations Annie Yim1,†, Prasanna Koti1,†, Adrien Bonnard2, Fabio Marchiano3, Milena Durrbaum¨ 1, Cecilia Garcia-Perez4, Jose Villaveces1, Salma Gamal1, Giovanni Cardone1, Fabiana Perocchi4, Zuzana Storchova1,5 and Bianca H.
    [Show full text]
  • A Bioinformatics Tool for Prioritizing Biological Leads from ‘Omics Data Using Literature Retrieval and Data Mining
    International Journal of Molecular Sciences Article OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from ‘Omics Data Using Literature Retrieval and Data Mining Pascal Steffen 1,2,3 , Jemma Wu 2, Shubhang Hariharan 1, Hannah Voss 3, Vijay Raghunath 4, Mark P. Molloy 1,2,* and Hartmut Schlüter 3,* 1 Bowel Cancer and Biomarker Research, Kolling Institute, The University of Sydney, Sydney 2065, Australia; pascal.steff[email protected] (P.S.); [email protected] (S.H.) 2 Department of Molecular Sciences, Macquarie University, Sydney 2109, Australia; [email protected] 3 Mass Spectrometric Proteomics Group, Department of Clinical Chemistry and Laboratory Medicine, University Medical Center Hamburg-Eppendorf (UKE), Hamburg 20246, Germany; [email protected] 4 Sydney Informatics Hub, The University of Sydney, Sydney 2008, Australia; [email protected] * Correspondence: [email protected] (M.P.M.); [email protected] (H.S.) Received: 7 November 2019; Accepted: 13 February 2020; Published: 19 February 2020 Abstract: Proteomics and genomics discovery experiments generate increasingly large result tables, necessitating more researcher time to convert the biological data into new knowledge. Literature review is an important step in this process and can be tedious for large scale experiments. An informed and strategic decision about which biomolecule targets should be pursued for follow-up experiments thus remains a considerable challenge. To streamline and formalise this process of literature retrieval and analysis of discovery based ‘omics data and as a decision-facilitating support tool for follow-up experiments we present OmixLitMiner, a package written in the computational language R. The tool automates the retrieval of literature from PubMed based on UniProt protein identifiers, gene names and their synonyms, combined with user defined contextual keyword search (i.e., gene ontology based).
    [Show full text]