Next-Generation Sequencing Data for Use in Risk Assessment Bruce Alexander Merrick

Total Page:16

File Type:pdf, Size:1020Kb

Next-Generation Sequencing Data for Use in Risk Assessment Bruce Alexander Merrick Available online at www.sciencedirect.com Current Opinion in ScienceDirect Toxicology Next-generation sequencing data for use in risk assessment Bruce Alexander Merrick Abstract fluorescently labeled nucleotides and capillary electro- Next-generation sequencing (NGS) represents several phoresis that gave rise to automated sequencing in- powerful platforms that have revolutionized RNA and DNA struments such as the Applied Biosystems, Inc. model analysis. The parallel sequencing of millions of DNA molecules 370A sequencers and others, on the basis of Sanger can provide mechanistic insights into toxicology and provide chemistries [2]. Read length per each sample was at 500e new avenues for biomarker discovery with growing relevance 800 nucleotides, and sample throughput was limited. for risk assessment. The evolution of NGS technologies has improved over the last decade with increased sensitivity and Sanger-based DNA sequencing instruments are accuracy to foster new biomarker assays from tissue, blood, considered first-generation platforms. Instruments that and other biofluids. NGS technologies can identify transcrip- perform multiple sequencing reactions simultaneously tional changes and genomic targets with base pair precision in in a ‘massively parallel’ fashion have been dubbed, response to chemical exposure. Furthermore, there are ‘NextGeneration’ or NextGen Sequencing [3]. How several exciting movements within the toxicology community NGS came about compared with other genomic plat- that incorporate NGS platforms into new strategies for more forms is interlinked with microarray technology rapid toxicological characterizations. These include the Tox21 (Figure 1). Both platforms can provide whole genomic in vitro high-throughput transcriptomic screening program, approaches to research problems. Microarrays are a development of organotypic spheroids, alternative animal fluorescent probe hybridization-based technology with models, mining archival tissues, liquid biopsy, and epige- origins in the mid-1990s that are now a mature genomic nomics. This review will describe NGS-based technologies, platform with a well-established data analysis pipeline. demonstrate how they can be used as tools for target dis- Downsides of microarrays are that a prior genomic covery in tissue and blood, and suggest how they might be knowledge is needed to generate probes which are applied for risk assessment. species-specific with a limited dynamic range for dif- ferential expression. Addresses Molecular and Genomic Toxicology Group, Biomolecular Screening Branch, Division National Toxicology Program, National Institute of Development of the second wave of sequencing tech- Environmental Health Sciences, Research Triangle Park, NC, 27709, nologies, termed NextGen sequencing (NGS) technol- USA ogies, has overlapped with microarray platforms Corresponding author: Merrick, B. Alex ([email protected]) (Figure 1). NGS began in the new millennium as exem- plified by the massively parallel signature sequencing (MPSS) system that came from university research. Current Opinion in Toxicology 2019, 18:18–26 Improvements in sequencing chemistries, detection, and This review comes from a themed issue on Genomic Toxicology automation over the next decade promoted a rapid Available online 8 March 2019 development of NGS platforms. From 2010 to 2015, many NGS instruments became commercially available For a complete overview see the Issue and the Editorial that could produce millions of reads from 100 to 1000 https://doi.org/10.1016/j.cotox.2019.02.010 bases in length. A read is a short piece of sequence (e.g. 2468-2020/Published by Elsevier B.V. 100 nucleotides) that can be aligned to a transcript, and it also serves as a quantitative measure of a transcript when Keywords summed up with other aligning reads. These second- Next generation sequencing, RNA-seq, DNA-seq, Risk assessment, generation sequencers include the Roche ‘454 FLX,’ Biomarkers, High throughput transcriptomics, Liquid biopsy, Life Technologies‘Ion Torrent,’Applied Biosystems, Inc. Transcriptomics. ‘SOLiD’, and Illumina family of sequencers including the HiSeq 2000 series, MiSeq, X-Ten, and NovaSeq [4]. Introduction Further advances, such as single-molecule real-time The Sanger sequencing method was developed in the sequencing, in NGS sequencing technology have led to late 1970s to analyze DNA sequence using 32P-labeled longer read sequencers such as the Pacific Biosciences nucleotides separated by polyacrylamide gels for ‘PacBio RS II’ instrument that produces reads greater autoradiograms [1]. Radionuclides were replaced by than 10,000 bases [3]. Current Opinion in Toxicology 2019, 18:18–26 www.sciencedirect.com NGS in risk assessment Merrick 19 Figure 1 Timeline for development of microarray and next-generation sequencing (NGS) technology platforms. Microarray developments are above the timeline and NGS activities are below. For microarray development, Brown’s laboratory at Stanford was one of the first to develop a multigene expression measurement system using fluorescent detection. The term, toxicogenomics (Tgmx) was first defined by Nuwaysir et al., in 1999 [67]. Commercialized platforms such as Affymetrix, Agilent, and NimbleGen matured through 2010. For NGS, the massively parallel signature sequencing (MPSS) was developed in 2000 by the Brenner lab at Lynx Therapeutics. 454 Life Sciences developed a massively parallel pyrosequencing method in 2006 followed by a commercial instrument put out by Roche. The Solexa short-read platform was acquired by Illumina in 2008 and has undergone continued devel- opment and improvement. Commercialization of NGS platforms continues with various speeds of analysis, read lengths, and sequencing capacities. More recent developments include BRB-seq or ‘Bulk RNA Barcoding and Sequencing’ and TempO-Seq by BioSpyder as a library of bar-coded probes that hybridize to representative gene transcripts as a targeted NGS approach to transcript expression. Advantages and disadvantages are summarized and discussed further in the text. A more recent sequencing technology has been available [7]. Test articles can include chemicals and advanced by Oxford Nanopore Technologies. Nanopore many other agents, including pharmaceuticals, drugs, instruments read bases directly from single DNA or natural products, particles such as asbestos, nano- RNA molecules through a biological nanopore particles, physical factors such as radiation, metals, and channelda nanoscale biological tube that sequences by many others. The type of toxicity or hazard can be sensing changes in ionic current as the nucleic acid widely defined as macroscopic or microscopic lesions molecule passes through [5]. The sequencing devices and pathologies, altered pharmacologic, immunologic, can provide rapid analysis (hours), and some units are functional, and behavioral reactions, changes in portable (size of an USB flash drive) that can be readily biochemistry and physiology, or any measurable applied to teaching laboratories, medical offices, and response that is considered adverse or outside of normal field work. Reads lengths can be in the tens to hundreds health. Study of the underlying molecular changes of kb in length. A primary advantage of long read length contributing to toxicity has been greatly facilitated by is to reduce the ambiguity of highly homologous genes, omics technologies, particularly transcriptomics, while splice variants, and repetitive regions in the genome standardization of data analysis and interpretation where alignment is inherently more difficult using short continue to be refined [8]. There are many in vitro assays reads. The high sequence resolution of NGS in- and screens (e.g. anticholinesterase activity or bacterial struments has come at the expense of relatively low mutagenesis) that support the mode of action in the risk sample throughput. This issue has been addressed by assessment process, but new initiatives such as Tox21 creating libraries of targeted probe sets that analyze the aim to develop new assays incorporating NGS platforms complete transcriptome (e.g. TempO-Seq [6] and bulk for a larger role in risk determination [9]. RNA coding-seq [BRB-seq], reviewed later), rapidly and at relatively low cost. A brief depiction of NGS appli- The dynamic nature of gene expression (tran- cations is shown in Figure 2. scriptomics) in response to a chemical or test article exposure makes it well suited as part of the hazard NGS and risk assessment identification and dose-setting process for risk assess- Traditional risk assessment often involves identifying ment [10,11]. There are approximately 15,000 coding hazard(s) in a doseeresponse manner after chemical or genes and probably an equal number of noncoding genes test article exposure in animal models or human data if expressed at any one time in a specific cell type. Splice www.sciencedirect.com Current Opinion in Toxicology 2019, 18:18–26 20 Genomic Toxicology Figure 2 Many applications for measuring RNA and DNA in toxicogenomics are supported by NGS platforms. Whole genome or transcriptome analysis or targeted portions of each can be measured by NGS. seq, sequencing; WG, whole genome; ATAC, assay for transposase-accessible chromatin; ChIP, chromatin immunoprecipitation; Ribo, ribosome; miRNA, microRNA. RNA analysis platforms are indicated by blue arrows, and DNA analysis platforms are shown by red arrows. Further description of these applications is provided in the text. variants also add more complexity to response. Of those the transcriptome
Recommended publications
  • Toxicogenomics Applications of New Functional Genomics Technologies in Toxicology
    \-\w j Toxicogenomics Applications of new functional genomics technologies in toxicology Wilbert H.M. Heijne Proefschrift ter verkrijging vand egraa dva n doctor opgeza gva nd e rector magnificus vanWageninge n Universiteit, Prof.dr.ir. L. Speelman, in netopenbaa r te verdedigen op maandag6 decembe r200 4 des namiddagst e half twee ind eAul a - Table of contents Abstract Chapter I. page 1 General introduction [1] Chapter II page 21 Toxicogenomics of bromobenzene hepatotoxicity: a combined transcriptomics and proteomics approach[2] Chapter III page 48 Bromobenzene-induced hepatotoxicity atth etranscriptom e level PI Chapter IV page 67 Profiles of metabolites and gene expression in rats with chemically induced hepatic necrosis[4] Chapter V page 88 Liver gene expression profiles in relation to subacute toxicity in rats exposed to benzene[5] Chapter VI page 115 Toxicogenomics analysis of liver gene expression in relation to subacute toxicity in rats exposed totrichloroethylen e [6] Chapter VII page 135 Toxicogenomics analysis ofjoin t effects of benzene and trichloroethylene mixtures in rats m Chapter VII page 159 Discussion and conclusions References page 171 Appendices page 187 Samenvatting page 199 Dankwoord About the author Glossary Abbreviations List of genes Chapter I General introduction Parts of this introduction were publishedin : Molecular Biology in Medicinal Chemistry, Heijne etal., 2003 m NATO Advanced Research Workshop proceedings, Heijne eral., 2003 81 Chapter I 1. General introduction 1.1 Background /.1.1 Toxicologicalrisk
    [Show full text]
  • Genomic Dose Response
    Genomic Dose Response: The Picture NTP Genomic Dose Response Modeling Expert Panel Meeting October 23, 2017 Russell Thomas Director National Center for Computational Toxicology The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA Scott May Want To Rethink Asking Me To Be The “Big Picture Guy”… National Center for Computational Toxicology It is a Well Known Fact that Toxicology Continues to Have a Data Problem US National Research Council, 1984 Size of Estimate Mean Percent Category Category In the Select Universe Pesticides and Inert Ingredients of Pesticides 3,350 Formulations 10 24 2 26 38 Cosmetic Ingredients 3,410 2 14 10 18 56 Drugs and Excipients Used in Drug Formulations 1,815 18 18 3 36 25 Food Additives 8,627 • Major challenge is too many 5 14 1 34 46 Chemicals in Commerce: chemicals and not enough At Least 1 Million 12,860 Pounds/Year data 11 11 78 Chemicals in Commerce: • Total # chemicals = 65,725 Less than 1 Million 13,911 Pounds/Year • Chemicals with no toxicity 12 12 76 data of any kind = ~46,000 Chemicals in Commerce: Production Unknown or 21,752 Inaccessible 10 8 82 Complete Partial Minimal Some No Toxicity Health Health Toxicity Toxicity Information Hazard Hazard Information Information Available Assessment Assessment Available Available 2 Possible Possible (But Below Minimal) National Center for Computational Toxicology It is a Well Known Fact that Toxicology Continues to Have a Data Problem 70 60 50 40 30 20 Percent of Chemicals <1% 10 0 Acute Cancer Gentox Dev Tox Repro Tox EDSP Tier 1 Modified from Judson et al., EHP 2009 3 National Center for Computational Toxicology For Those With Data, Have We Been Truly Predictive or Just Protective? …data compiled from 150 compounds with 221 human toxicity events reported.
    [Show full text]
  • In Vitro RNA-Seq-Based Toxicogenomics Assessment Shows
    www.nature.com/scientificreports OPEN In vitro RNA-seq-based toxicogenomics assessment shows reduced biological efect of tobacco Received: 11 August 2017 Accepted: 5 January 2018 heating products when compared Published: xx xx xxxx to cigarette smoke Linsey E. Haswell, Sarah Corke, Ivan Verrastro, Andrew Baxter, Anisha Banerjee, Jason Adamson, Tomasz Jaunky, Christopher Proctor, Marianna Gaça & Emmanuel Minet The battery of regulatory tests used to evaluate the risk of novel tobacco products such as heated tobacco products (THPs) presents some limitations including a bias towards the apical endpoint tested, and limited information on the mode of action. This is driving a paradigm shift to more holistic systems biology approaches. In this study, we used RNA-sequencing to compare the transcriptomic perturbations following acute exposure of a 3D airway tissue to the aerosols from two commercial THPs and a reference 3R4F cigarette. 2809 RNAs were diferentially expressed for the 3R4F treatment and 115 and 2 RNAs for the two THPs (pFDR < 0.05, FC > 1.5), respectively. The relationship between the identifed RNA features and gene ontologies were mapped showing a strong association with stress response, xenobiotics metabolism, and COPD-related terms for 3R4F. In contrast, fewer ontologies were found enriched for the THPs aerosols. “Response to wounding” was a common COPD-related term over-represented for the two THPs but at a reduced signifcance. Quantifcation of a cytokine panel post-exposure confrmed a pro-infammatory efect of cigarette smoke but not for THPs. In conclusion, THPs have a reduced impact on gene expression compared to 3R4F. Te health risk associated with smoking cigarettes and tobacco regulations have been driving a wave of develop- ment and innovation in next generation tobacco and nicotine delivery systems1.
    [Show full text]
  • IPA Analysis of Metabolomics Data Including Cross-Platform Integration with Transcriptomics Data from a Diabetic Mouse Model
    IPA Analysis of Metabolomics Data Including Cross-Platform Integration with Transcriptomics Data from a Diabetic Mouse Model CONTENTS Overview ........................................................................................................................................................................ 3 Introduction ................................................................................................................................................................... 3 Metabolomics Background ........................................................................................................................................ 3 Integration of Metabolomic and Transcriptomic data .............................................................................................. 5 The Ingenuity Platform .............................................................................................................................................. 5 Case Study Background ............................................................................................................................................. 5 Methods ........................................................................................................................................................................ 6 Establishing a Metabolite profile for db/db mice and identification of exploratory biomarkers .............................. 6 IPA-Metabolomics™ Analysis: ...................................................................................................................................
    [Show full text]
  • Toxicogenomics: Toward the Future of Toxic Tort Causation
    NORTH CAROLINA JOURNAL OF LAW & TECHNOLOGY VOLUME 5, ISSUE 1: FALL 2003 Toxicogenomics: Toward the Future of Toxic Tort Causation Jon R. Pierce and Terrence Sexton1 I. Introduction Plaintiff suspects that a chemical in her city’s water supply has caused her to develop a rare form of liver cancer. Defendant, a company in Plaintiff’s city, has been discharging the chemical into the water for a number of years. Both parties in the toxic tort litigation are at the mercy of an unevenly developed and often- insufficient body of science to establish or rebut the required causation element. This article will examine the current causation paradigm in toxic tort litigation, pointing out its specific weaknesses. The article will then introduce an emerging discipline, toxicogenomics, which will eventually make it possible to specifically describe the molecular pathways leading from exposure to injury, and in so doing will greatly improve the reliability of causation evidence in toxic tort cases to the benefit of both plaintiffs and defendants. To illustrate its potential usefulness, this article will walk through a hypothetical toxicogenomics experiment involving a suspected liver toxin. The article will conclude by suggesting that judges controlled by Daubert v. Merrell Dow Pharmaceuticals, Inc.2 would be wise not to admit such evidence until more research can definitively link the described molecular pathways to the specific injury. 1 Jon R. Pierce, J.D. Candidate, University of North Carolina School of Law, 2004. Terrence Sexton is a trial attorney with the international law firm of Shook Hardy & Bacon LLP. He represents clients nationwide in cases involving toxic torts, products liability, white collar crime and commercial disputes.
    [Show full text]
  • Transcriptomics in Toxicogenomics, Part III: Data Modelling for Risk Assessment
    nanomaterials Review Transcriptomics in Toxicogenomics, Part III: Data Modelling for Risk Assessment Angela Serra 1,2 , Michele Fratello 1,2 , Luca Cattelani 1,2 , Irene Liampa 3 , Georgia Melagraki 4 , Pekka Kohonen 5,6 , Penny Nymark 5,6 , Antonio Federico 1,2 , Pia Anneli Sofia Kinaret 1,2,7 , Karolina Jagiello 8,9, My Kieu Ha 10,11,12, Jang-Sik Choi 10,11,12, Natasha Sanabria 13 , Mary Gulumian 13,14, Tomasz Puzyn 8,9 , Tae-Hyun Yoon 10,11,12 , Haralambos Sarimveis 3 , Roland Grafström 5,6, Antreas Afantitis 4 and Dario Greco 1,2,7,* 1 Faculty of Medicine and Health Technology, Tampere University, FI-33014 Tampere, Finland; angela.serra@tuni.fi (A.S.); michele.fratello@tuni.fi (M.F.); luca.cattelani@tuni.fi (L.C.); antonio.federico@tuni.fi (A.F.); pia.kinaret@helsinki.fi (P.A.S.K.) 2 BioMediTech Institute, Tampere University, FI-33014 Tampere, Finland 3 School of Chemical Engineering, National Technical University of Athens, 157 80 Athens, Greece; [email protected] (I.L.); [email protected] (H.S.) 4 Nanoinformatics Department, NovaMechanics Ltd., Nicosia 1065, Cyprus; [email protected] (G.M.); [email protected] (A.A.) 5 Institute of Environmental Medicine, Karolinska Institutet, 171 77 Stockholm, Sweden; [email protected] (P.K.); [email protected] (P.N.); [email protected] (R.G.); 6 Division of Toxicology, Misvik Biology, 20520 Turku, Finland 7 Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland 8 QSAR Lab Ltd., Aleja Grunwaldzka 190/102, 80-266 Gdansk, Poland; [email protected]
    [Show full text]
  • Genomics and Toxic Substances: Part I—Toxicogenomics by Gary E
    Copyright © 2003 Environmental Law Institute®, Washington, DC. reprinted with permission from ELR®, http://www.eli.org, 1-800-433-5120. 1-2003 ELR 33 ELR 10071 NEWS&ANALYSIS Genomics and Toxic Substances: Part I—Toxicogenomics by Gary E. Marchant dvances in genomics, the study of the structure and ticle will address the application of toxicogenomics to Afunction of our genetic make-up, are fundamentally toxic torts and environmental regulation; a subsequent transforming toxicology, the science of how toxic sub- companion article will address toxicogenetic applications. stances affect our bodies. These changes will inevitably spill After first describing the scientific background of toxico- over into the legal regimes that frequently rely on toxicolog- genomics, this Article explores some potential uses of ical data, including toxic torts and environmental regula- toxicogenomic data in regulation and litigation involving tion.1 Genomic data, and the techniques with which they are toxic substances. generated, have the potential to make toxic torts and envi- ronmental regulation more effective, efficient, and fair, but Scientific Background at the same time will present many new doctrinal, eviden- tiary, and ethical challenges. Toxicogenomics is defined “as the study of the relationship Two key applications of genomic data for toxic torts and between the structure and activity of the genome (the cellu- environmental regulation are2: (i) the study of the expres- lar complement of genes) and the adverse biological effects sion of genes in
    [Show full text]
  • Review TOXICITY STUDIES of DRUGS AND
    Bulgarian Journal of Veterinary Medicine, 2016 ONLINE FIRST ISSN 1311-1477; DOI: 10.15547/bjvm.983 Review TOXICITY STUDIES OF DRUGS AND CHEMICALS IN ANIMALS: AN OVERVIEW S. A. SAGANUWAN Department of Veterinary Physiology, Pharmacology and Biochemistry, College of Veterinary Medicine, University of Agriculture, Makurdi, Benue State, Nigeria Summary Saganuwan, S. A., 2016. Toxicity studies of drugs and chemicals in animals: An overview. Bulg. J. Vet. Med. (online first). Toxicity study is the investigation of either short or long-term toxic effects of a drug or chemical on animals. The toxicity is dose-dependent as asserted by Paracelsus over 500 years ago. However, short-term toxic effect is determined using median lethal dose (LD50) first introduced by Trevan in 1927 and revised many times. Presently there is a growing preponderance of rejection of scientific papers on acute toxicity study, simply because of the belief that in the current hazard and safety as- sessment of drugs and chemicals, LD50 values are no longer used. In view of this, literature search was carried out with a view to investigating the relevance of LD50 in development and assessment of drugs and chemicals. The findings revealed that in the past, many animals had been used for LD50 determination. OECD has reduced the number of test animals to 5–15 and presently it is further re- duced to 2–6. Acute toxicity study is being carried out in medicinal plants research and in the study of patent medicine. Although the application of LD50 has been drastically reduced, it is still applied and accepted in some parts of the world.
    [Show full text]
  • Public Data Sources to Support Systems Toxicology Applications Allan Peter Davis1, Jolene Wiegers1, Thomas C
    Available online at www.sciencedirect.com Current Opinion in ScienceDirect Toxicology Public data sources to support systems toxicology applications Allan Peter Davis1, Jolene Wiegers1, Thomas C. Wiegers1 and Carolyn J. Mattingly1,2 Abstract metabolic pathways, interaction networks, and/or bio- Public databases provide a wealth of freely available infor- logical processes; and the outcome can be the perturbed mation about chemicals, genes, proteins, biological networks, biological system trying to return back to a normal state phenotypes, diseases, and exposure science that can be in- (resilience) or a new, and potentially, adverse state due tegrated to construct pathways for systems toxicology appli- to toxicity [2]. The toxicant may trigger multiple cations. Relating this disparate information from public different pathways simultaneously by interacting with repositories, however, can be challenging because databases different biological mediators, complicating under- use a variety of ways to represent, describe, and make avail- standing about the specific connectivity between indi- able their content. The use of standard vocabularies to anno- vidual events, mediators, and outcomes. As well, in the tate key data concepts, however, allows the information to be real world, humans are typically exposed simultaneously more easily exchanged and combined for discovery of new to mixtures of chemicals and drugs, further complicating findings. We explore some of the many public data sources understanding of toxic mechanisms of actions [3,4]. currently available to support systems toxicology and demon- strate the value of standardizing data to help construct The goal of systems toxicology is to discern and organize chemical information pathways. exposure and toxicogenomic events to help formalize an understanding for drug therapy, risk assessment, expo- Addresses e 1 Department of Biological Sciences, North Carolina State University, sure hazards, and toxicity to humans [5 9].
    [Show full text]
  • OECD Environment, Health and Safety Publications
    OECD Environment, Health and Safety Publications Series on Testing and Assessment No. 39 DRAFT GUIDANCE DOCUMENT ON ACUTE INHALATION TOXICITY TESTING Environment Directorate Organisation for Economic Co-operation and Development 28 November 2008 1 Also published in the Series on Testing and Assessment: No. 1, Guidance Document for the Development of OECD Guidelines for Testing of Chemicals (1993; reformatted 1995, revised 2006) No. 2, Detailed Review Paper on Biodegradability Testing (1995) No. 3, Guidance Document for Aquatic Effects Assessment (1995) No. 4, Report of the OECD Workshop on Environmental Hazard/Risk Assessment (1995) No. 5, Report of the SETAC/OECD Workshop on Avian Toxicity Testing (1996) No. 6, Report of the Final Ring-test of the Daphnia magna Reproduction Test (1997) No. 7, Guidance Document on Direct Phototransformation of Chemicals in Water (1997) No. 8, Report of the OECD Workshop on Sharing Information about New Industrial Chemicals Assessment (1997) No. 9, Guidance Document for the Conduct of Studies of Occupational Exposure to Pesticides during Agricultural Application (1997) No. 10, Report of the OECD Workshop on Statistical Analysis of Aquatic Toxicity Data (1998) No. 11, Detailed Review Paper on Aquatic Testing Methods for Pesticides and industrial Chemicals (1998) No. 12, Detailed Review Document on Classification Systems for Germ Cell Mutagenicity in OECD Member Countries (1998) No. 13, Detailed Review Document on Classification Systems for Sensitising Substances in OECD Member Countries 1998) No. 14, Detailed Review Document on Classification Systems for Eye Irritation/Corrosion in OECD Member Countries (1998) No. 15, Detailed Review Document on Classification Systems for Reproductive Toxicity in OECD Member Countries (1998) 2 No.
    [Show full text]
  • Prioritizing Candidate Diseases-Related Metabolites Based
    Wang et al. BMC Bioinformatics 2019, 20(Suppl 18):574 https://doi.org/10.1186/s12859-019-3127-4 RESEARCH Open Access Prioritizing candidate diseases-related metabolites based on literature and functional similarity Yongtian Wang1, Liran Juan2, Jiajie Peng3, Tianyi Zang1* and Yadong Wang1* From Biological Ontologies and Knowledge bases workshop at IEEE BIBM 2018 Madrid, Spain. 3-6 December 2018 Abstract Background: As the terminal products of cellular regulatory process, functional related metabolites have a close relationship with complex diseases, and are often associated with the same or similar diseases. Therefore, identification of disease related metabolites play a critical role in understanding comprehensively pathogenesis of disease, aiming at improving the clinical medicine. Considering that a large number of metabolic markers of diseases need to be explored, we propose a computational model to identify potential disease-related metabolites based on functional relationships and scores of referred literatures between metabolites. First, obtaining associations between metabolites and diseases from the Human Metabolome database, we calculate the similarities of metabolites based on modified recommendation strategy of collaborative filtering utilizing the similarities between diseases. Next, a disease-associated metabolite network (DMN) is built with similarities between metabolites as weight. To improve the ability of identifying disease-related metabolites, we introduce scores of text mining from the existing database of chemicals and proteins into DMN and build a new disease-associated metabolite network (FLDMN) by fusing functional associations and scores of literatures. Finally, we utilize random walking with restart (RWR) in this network to predict candidate metabolites related to diseases. Results: We construct the disease-associated metabolite network and its improved network (FLDMN) with 245 diseases, 587 metabolites and 28,715 disease-metabolite associations.
    [Show full text]
  • Exploring Mechanistic Toxicity of Mixtures Using PBPK Modeling and Computational Systems Biology
    Agency for Toxic Substances & Disease Registry Exploring mechanistic toxicity of mixtures using PBPK modeling and computational systems biology Patricia Ruiz, PhD Simulation Science Section Office of Innovation and Analytics ICCVAM Communities of Practice January 2021 The findings and conclusions in this presentation are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention/the Agency for Toxic Substances and Disease Registry. Outline . Background . Single and Multiple Chemical Risk Assessment . Current State of Affairs . Methods . Results and discussion . Conclusions Background Exposome Adverse health . Often toxicity assessment of a outcomes single chemical is conducted . Three approaches can be used for mixtures risk assessment: Whole mixture, similar mixture and the components-based hazard index (HI)[ATSDR, 2018]. Molecular initiating event Toxicity effect Modified from Kortenkamp et al.,Science(2018). DOI: 10.1126/science.aat9219 HAZARD QUOTIENTS (HQ) The HQ for each chemical component of a mixture is calculated for only its critical effect using its exposure level (E) and health guidance value (HGV) HQ = E /HGV Hazard Quotients (HQ) and Hazard Index (HI) Component Based Hazard Index (Hl)= E1 /HGV1 + E2 /HGV2 + .. En /HGVn Assessments Hazard Mixture HGV Exposure Component Quotient A 2 1 0.5 HQ = E /HGV B 16 8 0.5 C 1 1 1.0 D 30 10 0.3 Hl (MIXTURE) 2.3 Current State of Affairs Chemical Library RNA RNA . Integration of Data Library Phenotype Chemical from Multiple Streams databases databases DNA Expression array Microbiome . Transcripts Microbiota Systems Biology databases databases Genomics Bio- Gene . Adverse Outcome monitoring databases databases Pathways array Population data Proteins Proteins Clinical databases databases proteomics Clinical data Modified from resource: Leung et al., 2013.
    [Show full text]