The Jackson Laboratory The Mouseion at the JAXlibrary
Faculty Research 2021 Faculty Research
1-8-2021
The mouse Gene Expression Database (GXD): 2021 update.
Richard M. Baldarelli
Constance M. Smith
Jacqueline H. Finger
Terry F. Hayamizu
Ingeborg J. McCright
See next page for additional authors
Follow this and additional works at: https://mouseion.jax.org/stfb2021
Part of the Life Sciences Commons, and the Medicine and Health Sciences Commons Authors Richard M. Baldarelli, Constance M. Smith, Jacqueline H. Finger, Terry F. Hayamizu, Ingeborg J. McCright, Jingxia Xu, David R. Shaw, Jonathan S Beal, Olin Blodgett, Jeff Campbell, Lori E. Corbani, Pete J. Frost, Sharon C Giannatto, Dave Miers, James A. Kadin, Joel E Richardson, and Martin Ringwald D924–D931 Nucleic Acids Research, 2021, Vol. 49, Database issue Published online 26 October 2020 doi: 10.1093/nar/gkaa914 The mouse Gene Expression Database (GXD): 2021 update Richard M. Baldarelli, Constance M. Smith, Jacqueline H. Finger, Terry F. Hayamizu, Ingeborg J. McCright, Jingxia Xu, David R. Shaw, Jonathan S. Beal, Olin Blodgett, Jeffrey Campbell, Lori E. Corbani, Pete J. Frost, Sharon C. Giannatto, Dave B. Miers, Downloaded from https://academic.oup.com/nar/article/49/D1/D924/5940496 by Jackson Laboratory user on 27 January 2021 James A. Kadin, Joel E. Richardson and Martin Ringwald *
The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
Received September 14, 2020; Revised October 01, 2020; Editorial Decision October 02, 2020; Accepted October 02, 2020
ABSTRACT INTRODUCTION The Gene Expression Database (GXD; www. The longstanding objective of GXD has been to capture informatics.jax.org/expression.shtml) is an exten- and integrate different types of mouse developmental ex- sive and well-curated community resource of mouse pression information, with a focus on endogenous gene ex- developmental gene expression information. For pression in wild-type and mutant mice, and to make these many years, GXD has collected and integrated data data readily accessible to researchers via biologically- and biomedically-relevant searches. As an integral component from RNA in situ hybridization, immunohistochem- of the larger Mouse Genome Informatics (MGI) resource istry, RT-PCR, northern blot, and western blot ex- (1–3), GXD combines expression data with genetic, func- periments through curation of the scientific litera- tional, phenotypic and disease-oriented data, thereby en- ture and by collaborations with large-scale expres- abling unique and powerful search capabilities that fos- sion projects. Since our last report in 2019, we ter insights into the molecular mechanisms of human de- have continued to acquire these classical types velopment, differentiation, and disease. In addition, GXD of expression data; developed a searchable index maintains links to external expression resources, including of RNA-Seq and microarray experiments that al- gene-based links to expression data from other vertebrate lows users to quickly and reliably find specific model organisms that are highly relevant for developmen- mouse expression studies in ArrayExpress (https:// tal research: zebrafish (ZFIN; 4), Xenopus (Xenbase; 5)and www.ebi.ac.uk/arrayexpress/)andGEO(https://www. chicken (GEISHA; 6). For many years, GXD has collected RNA and protein expression information from RNA in ncbi.nlm.nih.gov/geo/); and expanded GXD to in- situ hybridization, immunohistochemistry, in situ reporter clude RNA-Seq data. Uniformly processed RNA-Seq (knock in), RT-PCR, northern blot, and western blot exper- data are imported from the EBI Expression Atlas iments. These complex, heterogeneous expression data, gen- and then integrated with the other types of ex- erated by many laboratories and distributed through thou- pression data in GXD, and with the genetic, func- sands of publications, are acquired through curation of the tional, phenotypic and disease-related information in scientific literature and by collaborations with large-scale Mouse Genome Informatics (MGI). This integration expression projects. GXD curators annotate these data in has made the RNA-Seq data accessible via GXD’s standardized ways by making extensive use of controlled enhanced searching and filtering capabilities. Fur- vocabularies and ontologies (7–10). Since our last report in ther, we have embedded the Morpheus heat map util- the NAR Database issue (2), we have (i) continued to cu- ity into the GXD user interface to provide additional rate these classical types of expression data, (ii) expanded GXD to represent RNA-Seq and microarray experiments tools for display and analysis of RNA-Seq data, in- in searchable form, (iii) begun to fully integrate RNA-Seq cluding heat map visualization, sorting, filtering, hi- data into GXD and into the larger MGI system, and (iv) de- erarchical clustering, nearest neighbors analysis and veloped new user interface utilities for the display and anal- visual enrichment. ysis of expression data. In the following, we will report on our progress, putting the main emphasis on the description of new types of data and new search and display capabilities in GXD.
*To whom correspondence should be addressed. Tel: +1 207 288 6436; Email: [email protected]