Comparison Study of Microarray and RNA- Seq for Differential Expression

Total Page:16

File Type:pdf, Size:1020Kb

Comparison Study of Microarray and RNA- Seq for Differential Expression DOT/FAA/AM-20/09 Office of Aerospace Medicine Washington, D.C. 20591 Comparison Study of Microarray and RNA- seq for Differential Expression Susan K. Munster1, Vicky L. White1, David C. Hutchings2, Dennis M. Burian1, Scott J. Nicholson1 1Federal Aviation Administration Civil Aerospace Medical Institute, Oklahoma City, OK 2Venesco, LLC, Chantilly, VA Civil Aerospace Medical Institute Federal Aviation Administration Oklahoma City, OK 73125 October 2018 Final Report NOTICE This document is disseminated under the sponsorship of the U.S. Department of Transportation in the interest of information exchange. The United States Government assumes no liability for the contents thereof. ___________ This publication and all Office of Aerospace Medicine technical reports are available in full-text from the Civil Aerospace Medical Institute’s publications Web site: http://www.faa.gov/library/reports/medical/oamtechreports/ 1. Report No. 2. Government Accession No. 3. Recipient's Catalog No. DOT/FAA/AM-20/09 4. Title and Subtitle 5. Report Date Comparison Study of Microarray and RNA-seq for Differential October, 2018 Expression 6. Performing Organization Code 7. Author(s) 8. Performing Organization Report No. Munster SK1, White VL1, Hutchings, DC2, Burian DM1, Nicholson SJ1 9. Performing Organization Name and Address 10.Work Unit No. (TRAIS) 1FAA Civil Aerospace Medical Institute, P. O. Box 25082 Oklahoma City, OK 73126 11. Contract or Grant No. 2Venesco LLC, 14801 Murdock St., Ste. 125, Chantilly, VA 20151 12. Sponsoring Agency Name and Address 13. Type of Report and Period Covered Office of Aerospace Medicine Federal Aviation Administration 800 Independence Ave., S.W. Washington, DC 20591 14. Sponsoring Agency Code AAM-1 15. Supplemental Notes CAMI Aerospace Medical Research Division Project No. 2017-AAM-612-GEN-10020 16. Abstract As costs for performing RNA-seq approach the costs affiliated with utilizing microarray technology, it is worthwhile to determine which methodology is the most efficient, cost effective, and accurate. In this study, we examined the relative capacities of microarrays and total RNA-seq to detect differential gene expression between total RNA samples containing the following RNA mixtures: blood, brain, 2:1 blood:brain, and 1:2 blood:brain. Two microarray hybridization library amplification methods were compared to optimize microarray results, which were then compared to total RNA-seq results for the same samples. Bioinformatic analysis was performed using commonly accepted analysis software for microarray and RNA-seq data to determine which method produced the greatest number of significant differentially expressed genes. It was determined that total RNA-seq outperformed microarrays in efficiency, result quality, and total numbers of detected genes and that cost was similar to utilizing microarrays for quantifying RNA expression levels from mixtures of brain and blood tissue. 17. Key Words 18. Distribution Statement Microarray, total RNA-seq, differential expression Document is available to the public through the Defense Technical Information Center, Ft. Belvior, VA 22060; and the National Technical Information Service, Springfield, VA 22161 19. Security Classif. (of this report) 20. Security Classif. (of this page) 21. No. of Pages 22. Price Unclassified Unclassified 59 Form DOT F 1700.7 (8-72) Reproduction of completed page is authorized i CONTENTS INTRODUCTION ...........................................................................................................................1 MATERIALS AND METHODS .....................................................................................................4 Sample preparation and processing ..............................................................................................4 Microarray preparation and processing ........................................................................................5 Microarray data QC ......................................................................................................................6 Microarray data filtering ..............................................................................................................6 Microarray differential expression comparisons .........................................................................7 RNA-seq preparation ....................................................................................................................8 RNA-seq data QC and processing ................................................................................................8 RNA-seq differential expression processing ...............................................................................9 Comparisons between microarray and RNA-seq differential expression ...................................9 RESULTS ......................................................................................................................................10 DISCUSSION ................................................................................................................................33 CONCLUSION ..............................................................................................................................37 REFERENCES ..............................................................................................................................38 APPENDIX ....................................................................................................................................42 ii LIST OF FIGURES Figure 1: PCA plots of microarray expression values ............................................................. 13-14 Figure 2: Violin plots of microarray antigenomic-filtered value ranges .......................................15 Figure 3: Violin plots of microarray median-filtered value ranges ...............................................16 Figure 4: Volcano plots of microarray antigenomic-filtered differential expression values ... 18-19 Figure 5: Volcano plots of microarray median-filtered differential expression values ........... 19-20 Figure 6: Volcano plots of microarray antigenomic-filtered biomarker subset differential expression values ..................................................................................................................... 21-22 Figure 7: Volcano plots of microarray median-filtered biomarker subset differential expression values .............................................................................................................................................23 Figure 8: PCA plot of Affymetrix microarray and RNA-seq values common to both datasets ....28 Figure 9: Violin plot of entrez IDs common to both RNA-seq and Affymetrix median-filtered microarray dataset log fold change ranges .....................................................................................28 Figure 10: Violin plot of significant RNA-seq and Affymetrix median-filtered microarray differential expression ranges ........................................................................................................29 Figure 11: Violin plot of RNA-seq and Affymetrix median-filtered microarray biomarker subset differential expression ranges ........................................................................................................30 Figure 12: Volcano plot of RNA-seq and Affymetrix median-filtered microarray biomarker subset differential expression values ....................................................................................... 30-31 iii LIST OF TABLES Table 1: Microarray transcript cluster counts ................................................................................12 Table 2: Differentially expressed transcript cluster counts ............................................................16 Table 3: Microarray tissue comparison of log fold change counts .......................................... 17-18 Table 4: Microarray tissue comparison of potential biomarker log fold change counts ...............21 Table 5: Affymetrix tissue comparison of filtering method overlap counts ..................................24 Table 6; Affymetrix tissue comparison biomarker subset differential expression counts ....... 24-25 Table 7: RNA-seq read trimming, mapping, annotation, and featureCount counts ................ 25-26 Table 8: RNA-seq and Affymetrix median-filtered biomarker subset differential expression counts .............................................................................................................................................32 Table 9: Cost comparison for Affymetrix, NuGEN, and RNA-seq processing methods ..............33 iv ABBREVIATIONS Abbreviation Explanation cDNA complementary DNA DE differential expression FDR false discovery rate LFC log2 fold change all-blood a sample comprised of 100% blood-derived RNA 2/3-blood a sample mixture comprised of 67% blood-derived RNA and 33% brain- derived RNA 1/3-blood a sample mixture comprised of 33% blood-derived RNA and 67% brain- derived RNA all-brain a sample comprised of 100% brain-derived RNA PCA principal components analysis PC1 principal component 1 PC2 principal component 2 [Sequence data has been submitted to NCBI’s Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) and to NCBI’s Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under BioProject accession number PRJNA492492] v COMPARISON STUDY OF MICROARRAY AND RNA-SEQ FOR DIFFERENTIAL EXPRESSION INTRODUCTION In the past four decades, since Sanger sequencing (Sanger et al.,
Recommended publications
  • Limma: Linear Models for Microarray and RNA-Seq Data User’S Guide
    limma: Linear Models for Microarray and RNA-Seq Data User's Guide Gordon K. Smyth, Matthew Ritchie, Natalie Thorne, James Wettenhall, Wei Shi and Yifang Hu Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia First edition 2 December 2002 Last revised 14 July 2021 This free open-source software implements academic research by the authors and co-workers. If you use it, please support the project by citing the appropriate journal articles listed in Section 2.1. Contents 1 Introduction 5 2 Preliminaries 7 2.1 Citing limma ......................................... 7 2.2 Installation . 9 2.3 How to get help . 9 3 Quick Start 11 3.1 A brief introduction to R . 11 3.2 Sample limma Session . 12 3.3 Data Objects . 13 4 Reading Microarray Data 15 4.1 Scope of this Chapter . 15 4.2 Recommended Files . 15 4.3 The Targets Frame . 15 4.4 Reading Two-Color Intensity Data . 17 4.5 Reading Single-Channel Agilent Intensity Data . 19 4.6 Reading Illumina BeadChip Data . 19 4.7 Image-derived Spot Quality Weights . 20 4.8 Reading Probe Annotation . 21 4.9 Printer Layout . 22 4.10 The Spot Types File . 22 5 Quality Assessment 24 6 Pre-Processing Two-Color Data 26 6.1 Background Correction . 26 6.2 Within-Array Normalization . 28 6.3 Between-Array Normalization . 30 6.4 Using Objects from the marray Package . 33 7 Filtering unexpressed probes 34 1 8 Linear Models Overview 36 8.1 Introduction . 36 8.2 Single-Channel Designs . 37 8.3 Common Reference Designs .
    [Show full text]
  • Protein Microarray Technology: Assisting Personalized Medicine in Oncology (Review)
    WORLD ACADEMY OF SCIENCES JOURNAL 1: 113-124, 2019 Protein microarray technology: Assisting personalized medicine in oncology (Review) MONICA NEAGU1-3, MARINELA BOSTAN1,4 and CAROLINA CONSTANTIN1,2 1Department of Immunology, ‘Victor Babes’ National Institute of Pathology, 050096 Bucharest; 2Research Core of The Pathology Department, Colentina University Hospital, Bucharest 020125; 3Faculty of Biology, University of Bucharest, 050095 Bucharest; 4Immunology Center, ‘Stefan Nicolau’ Institute of Virology, 030304 Bucharest, Romania Received April 23, 2019; Accepted June 11, 2019 DOI: 10.3892/wasj.2019.15 Abstract. Among proteomics technologies, protein microarray, into the future. We present personalized medicine goals and over the past last years, has gained an increased momentum discuss how protein microarray can aid in these goals, with an in the biomarkers discovery domain. The characteristics of emphasis on several oncological diseases. We also discuss how protein microarray, namely that it is a high-throughput tool, it protein technology has been used in diseases, such as lung, provides a high specificity and only requires a minute amount breast cancers, as well as in other diseases that, over the past of biological samples, render it a suitable tool for searching, last years, have taken advantage of this proteomic endeavor. quantifying and validating biomarkers in various pathologies. Protein microarray is based on the specific antigen‑antibody reaction, such as any enzyme-linked immunosorbent assay, Contents the specific reaction occurring on a miniaturized support (chip or slide), thus having the advantage of simultaneous eval- 1. Introduction uation of tens to thousands of molecules in small samples with 2. Personalized medicine: A step forward in improving a highly specific recognition for the detection system.
    [Show full text]
  • Microarray Bioinformatics and Applications in Oncology
    Microarray Bioinformatics and Applications in Oncology Justine Kate Peeters The work in this thesis was performed in the Department of Bioinformatics, Erasmus University Medical Center, Rotterdam, The Netherlands. The printing of this thesis was financially supported by J.E. Juriaanse Stichting, Rotterdam, The Netherlands The Netherlands Bioinformatics Center (NBIC) Affymetrix Europe Cover: painted by Anne Karin Pettersen Arvola, with compliments of Therese Sorlie Print: Printpartners Ipskamp, Enschede www.ppi.nl Lay-out: Legatron Electronic Publishing, Rotterdam ISBN: 978-90-9023048-1 Copyright © J.K. Peeters All right reserved. No part of this thesis may be reproduced or transmitted in any form, by any means, electronic or mechanical, without the prior written permission of the author, or where appropirate, of the publisher of the articles. Microarray Bioinformatics and Applications in Oncology Toepassingen van bioinformatica en microarray’s in oncologie Proefschrift ter verkrijging van de graad van doctor aan de Erasmus Universiteit Rotterdam op gezag van de Prof. dr. S.W.J. Lamberts en volgens besluit van het college voor Promoties De openbare verdediging zal plaatsvinden op Woensdag 11 juni 2008 om 13.45 uur door Justine Kate Peeters geboren te Melbourne, Australia Promotiecommissie Promotor: Prof.dr. P.J. van der Spek Co-promotor: Dr. A.E.M. Schutte Overige leden: Prof.dr. F.G. Grosveld Prof.dr. P.A.E. Sillevis Smitt Prof.dr. L.H.J. Looijenga This thesis is dedicated to my Oma ‘Wilhelmena Johanna Peeters’ (1918-2004). Table of Contents
    [Show full text]
  • Biomems Applications Overview Primary Knowledge (PK) Participant Guide
    BioMEMS Applications Overview Primary Knowledge (PK) Participant Guide Description and Estimated Time to Complete This learning module is an overview of some of the major biomedical developments and applications of BioMEMS. There are three activities that provide further exploration into some of these applications, how they are used and how they work. Biochip slide for testing protein arrays [Image courtesy of Argonne National Laboratories] This unit provides an overview of Microelectromechanical Systems (MEMS) applications in the biomedical field. Such devices are referred to as bioMEMS. The size of the medical industry and the number of applications that could benefit from MEMS devices, make the medical industry an ideal market to take advantage of the MEMS already on the market and those in research and development. Estimated Time to Complete Allow approximately 30 minutes. Southwest Center for Microsystems Education (SCME) Page 1 of 19 App_BioMEM_PK14_PG_August2017.docx BioMEMS Applications PK Introduction Imagine a device that restores sight when implanted on the retina of the eye, or a "skin patch" chip capable of detecting an invading microorganism before symptoms develop. A type of retinal implant was recently approved (Feb 2011) for commercialization by both Europe and the U.S. “Skin patch chips” are being developed and tested for a variety of applications which include diagnostics as well as therapeutics. One company, MicroCHIPS, is developing both active and passive drug delivery systems based on its proprietary reservoir arrays. The figure shows one of MicroCHIPS' implantable active drug delivery systems. This product contains a reservoir, microprocessor, and sensor feedback system which allows for a controlled 1 release of a drug.
    [Show full text]
  • DNA Microarray Technology in Dermatology Manfred Kunz, MD
    DNA Microarray Technology in Dermatology Manfred Kunz, MD In recent years, DNA microarray technology has been used for the analysis of gene expression patterns in a variety of skin diseases, including malignant melanoma, psoriasis, lupus erythematosus, and systemic sclerosis. Many of the studies described herein con- firmed earlier results on individual genes or functional groups of genes. However, a plethora of new candidate genes, gene patterns, and regulatory pathways have been identified. Major progresses were reached by the identification of a prognostic gene pattern in malignant melanoma, an immune signaling cluster in psoriasis, and a so-called interferon signature in systemic lupus erythematosus. In future, interference with genes or regulatory pathways with the use of different RNA interference technologies or targeted therapy may not only underscore the functional significance of microarray data but also may open interesting therapeutic perspectives. Large-scale gene expression analyses may also help to design more individualized treatment approaches of cutaneous diseases. Semin Cutan Med Surg 27:16-24 © 2008 Elsevier Inc. All rights reserved. KEYWORDS genomics, melanoma/skin tumors, inflammatory skin diseases, autoimmune diseases ene microarray technology started in the early couldnineties be used to measure mRNA molecules within a wide G of the last century. It was demonstrated that linearpeptides range of 3 to 4 orders of magnitude, with a sensitiv- may be synthesized on small silicon chips by photolitho-ity of a few molecules per cell. Indeed, later studies con- graphic synthesis.1,2 This technique was then applied to firmedshort that the detection lower limit of current microarray DNA fragments, generating so-called DNA microarrays technology(un- appears to be around ten copies of mRNA per less otherwise stated, in the present review the termcell.
    [Show full text]
  • Transcriptomics Technologies
    TOPIC PAGE Transcriptomics technologies Rohan Lowe1, Neil Shirley2, Mark Bleackley1, Stephen Dolan3, Thomas Shafee1* 1 La Trobe Institute for Molecular Science, La Trobe University, Melbourne, Australia, 2 ARC Centre of Excellence in Plant Cell Walls, University of Adelaide, Adelaide, Australia, 3 Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom * [email protected] Abstract Transcriptomics technologies are the techniques used to study an organism's transcriptome, the sum of all of its RNA transcripts. The information content of an organism is recorded in the DNA of its genome and expressed through transcription. Here, mRNA serves as a transient intermediary molecule in the information network, whilst noncoding RNAs perform additional diverse functions. A transcriptome captures a snapshot in time of the total transcripts present in a cell. The first attempts to study the whole transcriptome began in the early 1990s, and techno- logical advances since the late 1990s have made transcriptomics a widespread discipline. a1111111111 Transcriptomics has been defined by repeated technological innovations that transform the a1111111111 field. There are two key contemporary techniques in the field: microarrays, which quantify a a1111111111 a1111111111 set of predetermined sequences, and RNA sequencing (RNA-Seq), which uses high- a1111111111 throughput sequencing to capture all sequences. Measuring the expression of an organism's genes in different tissues, conditions, or time points gives information on how genes are regulated and reveals details of an organism's biology. It can also help to infer the functions of previously unannotated genes. Transcrip- OPEN ACCESS tomic analysis has enabled the study of how gene expression changes in different organ- isms and has been instrumental in the understanding of human disease.
    [Show full text]
  • Current Applications of Antibody Microarrays Ziqing Chen1, Tea Dodig‑Crnković2, Jochen M
    Chen et al. Clin Proteom (2018) 15:7 https://doi.org/10.1186/s12014-018-9184-2 Clinical Proteomics REVIEW Open Access Current applications of antibody microarrays Ziqing Chen1, Tea Dodig‑Crnković2, Jochen M. Schwenk2* and Sheng‑ce Tao1,3,4* Abstract The concept of antibody microarrays is one of the most versatile approaches within multiplexed immunoassay tech‑ nologies. These types of arrays have increasingly become an attractive tool for the exploratory detection and study of protein abundance, function, pathways, and potential drug targets. Due to the properties of the antibody microarrays and their potential use in basic research and clinical analytics, various types of antibody microarrays have already been developed. In spite of the growing number of studies utilizing this technique, few reviews about antibody microar‑ ray technology have been presented to refect the quality and future uses of the generated data. In this review, we provide a summary of the recent applications of antibody microarray techniques in basic biology and clinical studies, providing insights into the current trends and future of protein analysis. Keywords: Antibody microarray, Signalling, Drug mechanism, Clinical application, Systems biology, Technology advances Background binding events are reported directly by fuorescent label- Antibody microarrays are built on immobilizing anti- ling of the sample or by the addition of a secondary bodies for a parallel analysis of multiple targets in a detection reagent. given sample [1]. Today’s antibody and afnity reagent- Te attractiveness of antibody microarrays is that they engineering methods have helped to advance the meth- can be used to study a diverse number of biological pro- odology [2, 3].
    [Show full text]
  • Current Knowledge on Microarray Technology - an Overview
    Naidu & Suneetha Tropical Journal of Pharmaceutical Research February 2012; 11 (1): 153-164 © Pharmacotherapy Group, Faculty of Pharmacy, University of Benin, Benin City, 300001 Nigeria. All rights reserved . Available online at http://www.tjpr.org http://dx.doi.org/10.4314/tjpr.v11i1.20 Review Article Current Knowledge on Microarray Technology - An Overview * Chitrala Kumaraswamy Naidu and Yeguvapalli Suneetha Department of Zoology, Sri Venkateswara University, Tirupati -517502, India. Abstract The completion of whole genome sequencing projects has led to a rapid increase in the availability of genetic information. In the field of transcriptomics, the emergence of microarray-based technologies and the design of DNA biochips allow high-throughput studies of RNA expression in cell and tissue at a given moment. It has emerged as one of the most important technology in the field of molecular biology and transcriptomics. Arrays of oligonucleotide or DNA sequences are being used for genome-wide genotyping and expression profiling, and several potential clinical applications have begun to emerge as our understanding of these techniques and the data they generate improves. From its emergence to date, several database, software and technology updates have been developed in the field of microarray technology. This paper reviews basics and updates of each microarray technology and serves to introduce newly compiled resources that will provide specialist information in this area. Keywords: Microarray, Databases, cDNA array, Oligonucleotide array Received: 19 April 2011 Revised accepted: 19 December 2011 *Corresponding author: Email: [email protected]; Tel: +91 9985563644 Trop J Pharm Res, February2012;11 (1): 153 Naidu & Suneetha INTRODUCTION or radioactive tag and a brighter signal is detected when more copies bind [4].
    [Show full text]