Region Based Gene Expression Via Reanalysis of Publicly Available Microarray Data Sets

Total Page:16

File Type:pdf, Size:1020Kb

Region Based Gene Expression Via Reanalysis of Publicly Available Microarray Data Sets University of Louisville ThinkIR: The University of Louisville's Institutional Repository Electronic Theses and Dissertations 5-2018 Region based gene expression via reanalysis of publicly available microarray data sets. Ernur Saka University of Louisville Follow this and additional works at: https://ir.library.louisville.edu/etd Part of the Bioinformatics Commons, Computational Biology Commons, and the Other Computer Sciences Commons Recommended Citation Saka, Ernur, "Region based gene expression via reanalysis of publicly available microarray data sets." (2018). Electronic Theses and Dissertations. Paper 2902. https://doi.org/10.18297/etd/2902 This Doctoral Dissertation is brought to you for free and open access by ThinkIR: The University of Louisville's Institutional Repository. It has been accepted for inclusion in Electronic Theses and Dissertations by an authorized administrator of ThinkIR: The University of Louisville's Institutional Repository. This title appears here courtesy of the author, who has retained all other copyrights. For more information, please contact [email protected]. REGION BASED GENE EXPRESSION VIA REANALYSIS OF PUBLICLY AVAILABLE MICROARRAY DATA SETS By Ernur Saka B.S. (CEng), University of Dokuz Eylul, Turkey, 2008 M.S., University of Louisville, USA, 2011 A Dissertation Submitted To the J. B. Speed School of Engineering in Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Computer Science and Engineering Department of Computer Engineering and Computer Science University of Louisville Louisville, Kentucky May 2018 Copyright 2018 by Ernur Saka All rights reserved REGION BASED GENE EXPRESSION VIA REANALYSIS OF PUBLICLY AVAILABLE MICROARRAY DATA SETS By Ernur Saka B.S. (CEng), University of Dokuz Eylul, Turkey, 2008 M.S., University of Louisville, USA, 2011 A Dissertation Approved On April 20, 2018 by the following Committee __________________________________ Dissertation Director Dr. Eric C. Rouchka __________________________________ Dr. Olfa Nasraoui __________________________________ Dr. Dar-Jen Chang ___________________________________ Dr. Juw Won Park ___________________________________ Dr. Jeffrey C. Petruska ii DEDICATION Dedicated to my parents Nuran Saka and Erol Saka and to my sister Esin Saka iii ACKNOWLEDGEMENTS First, I would like to thank my advisor Dr. Rouchka for his guidance, patience and support throughout my PhD. Without the guidance and the support that he has provided, this experience would not have been the same. I would also like to thank the rest of my dissertation committee members, Dr. Chang, Dr. Park, and Dr. Petruska for giving me the opportunity to learn from them. In particular, I would like to thank Dr. Nasraoui for her constant support and encouragement. I appreciate all my professors who have helped me progress. I also would like to thank Dr. Elmaghraby, Chairman of the Department of Computer Engineering and Computer Science for his support. My experience at the bioinformatics lab would not have been the same without our discussions, collaboration and seminars. Thanks to my former and current lab members Abdallah Eteleeb, Dazhuo Li, Fahim Mohammad, Mohammed Sayed, Mohamed Chaabane, Aanchal Malhotra, and Aryan Neupane. I would love to thank Dr. Harrison for his contributions to biological analysis and interpretation of our research outcomes. I also would like to thank Dr. Chariker for her helpful discussions and pleasant conversations we had. Through the years, several friends have provided support. I would like to thank James Walter Moore and Caroline Fortin for always being on my side, their encouragements and listening ears. iv Most of all, I am so grateful to my family, my mother Nuran, my father Erol, my sister Esin and my brother in love Levent for supporting me in every possible way unconditionally. v ABSTRACT REGION BASED GENE EXPRESSION VIA REANALYSIS OF PUBLICLY AVAILABLE MICROARRAY DATA SETS Ernur Saka April 20, 2018 A DNA microarray is a high-throughput technology used to identify relative gene expression. One of the most widely used platforms is the Affymetrix® GeneChip® technology which detects gene expression levels based on probe sets composed of a set of twenty-five nucleotide probes designed to hybridize with specific gene targets. Given a particular Affymetrix® GeneChip® platform, the design of the probes is fixed. However, the method of analysis is dynamic in nature due to the ability to annotate and group probes into uniquely defined groupings. This is particularly important since publicly available repositories of microarray datasets, such as ArrayExpress and NCBI’s Gene Expression Omnibus (GEO) have made millions of samples readily available to be reanalyzed computationally without the need for new biological experiments. One way in which the analysis can dynamically change is by correcting the mapping between probe sets and targets by creating custom Chip Description Files (CDFs) to arrange which probes belong to which probe set based on the latest genomic information or specific annotations of interest. vi Since default probe sets in Affymetrix® GeneChip® platforms are specific for a gene, transcript or exon, the analyses are then limited to profile differential expression at the gene, transcript or individual exon level. However, it has been revealed that untranslated regions (UTRs) of mRNA have important impacts on the regulation of proteins. We therefore developed a new probe mapping protocol that addresses three issues of Affymetrix® GeneChip® data analyses: removing nonspecific probes, updating probe target mapping based on the latest genome information and grouping the probes into region (UTR, individual exon), gene and transcript level targets of interest to support a better understanding of the effect of UTRs and individual exons on gene expression levels. Furthermore, we developed an R package, affyCustomCdf, for users to dynamically create custom CDFs. The affyCustomCdf tool takes annotations in a General/Gene Transfer Format File (GTF), aligns probes to gene annotations via Nested Containment List (NCList) indexing and generates a custom Chip Description File (CDF) to regroup probes into probe sets based on a region (UTR and individual exon), transcript or gene level. Our results indicate that removing probes that no longer align to the genome without mismatches or align to multiple locations can help to reduce false-positive differential expression, as can removal of probes in regions overlapping multiple genes. Moreover, our method based on regions can detect changes that would have been missed by analysis based on gene and transcript. It also allows for a better understanding of 3’ UTR dynamics through the reanalysis of publicly available data. vii TABLE OF CONTENTS DEDICATION iii ACKNOWLEDGEMENTS iv ABSTRACT vi LIST OF TABLES xiii LIST OF FIGURES xv 1 INTRODUCTION 1 1.1 Motivation ..................................................................................................................2 1.2 Organization of the Document ...................................................................................8 2 OVERVIEW OF MOLECULAR BIOLOGY 10 2.1 DNA .........................................................................................................................11 2.2 RNA .........................................................................................................................15 2.3 mRNA Structure ......................................................................................................19 2.3.1 Untranslated Regions (UTR) .......................................................................19 2.3.2 Poly(A) Tail .................................................................................................20 2.4 Protein ......................................................................................................................20 2.5 Chromosome ...........................................................................................................21 2.6 Gene ........................................................................................................................22 2.7 Gene Regions ...........................................................................................................22 2.3.1 Intron .............................................................................................................22 viii 2.3.2 Exon ..............................................................................................................23 2.3.3 Open Reading Frame (ORF) .........................................................................23 2.3.4 Coding sequence (CDS) ................................................................................23 2.8 Gene Expression .......................................................................................................24 2.9 Central Dogma of Molecular Biology ......................................................................25 2.9.1 General Transfers ..........................................................................................26 2.9.2 Special Transfers ...........................................................................................29 2.10 Genetic Code ..........................................................................................................30 2.11 Alternative Splicing ................................................................................................31 2.12 Transcript ..............................................................................................................33
Recommended publications
  • PARSANA-DISSERTATION-2020.Pdf
    DECIPHERING TRANSCRIPTIONAL PATTERNS OF GENE REGULATION: A COMPUTATIONAL APPROACH by Princy Parsana A dissertation submitted to The Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy Baltimore, Maryland July, 2020 © 2020 Princy Parsana All rights reserved Abstract With rapid advancements in sequencing technology, we now have the ability to sequence the entire human genome, and to quantify expression of tens of thousands of genes from hundreds of individuals. This provides an extraordinary opportunity to learn phenotype relevant genomic patterns that can improve our understanding of molecular and cellular processes underlying a trait. The high dimensional nature of genomic data presents a range of computational and statistical challenges. This dissertation presents a compilation of projects that were driven by the motivation to efficiently capture gene regulatory patterns in the human transcriptome, while addressing statistical and computational challenges that accompany this data. We attempt to address two major difficulties in this domain: a) artifacts and noise in transcriptomic data, andb) limited statistical power. First, we present our work on investigating the effect of artifactual variation in gene expression data and its impact on trans-eQTL discovery. Here we performed an in-depth analysis of diverse pre-recorded covariates and latent confounders to understand their contribution to heterogeneity in gene expression measurements. Next, we discovered 673 trans-eQTLs across 16 human tissues using v6 data from the Genotype Tissue Expression (GTEx) project. Finally, we characterized two trait-associated trans-eQTLs; one in Skeletal Muscle and another in Thyroid. Second, we present a principal component based residualization method to correct gene expression measurements prior to reconstruction of co-expression networks.
    [Show full text]
  • Genetic Confirmation and Identification of Novel Variants For
    G C A T T A C G G C A T genes Article Genetic Confirmation and Identification of Novel Variants for Glanzmann Thrombasthenia and Other Inherited Platelet Function Disorders: A Study by the Korean Pediatric Hematology Oncology Group (KPHOG) Eu Jeen Yang 1 , Ye Jee Shim 2,* , Heung Sik Kim 3, Young Tak Lim 1, Ho Joon Im 4, Kyung-Nam Koh 4, Hyery Kim 4 , Jin Kyung Suh 5, Eun Sil Park 6, Na Hee Lee 7, Young Bae Choi 8, Jeong Ok Hah 9, Jae Min Lee 10 , Jung Woo Han 11, Jae Hee Lee 12, Young-Ho Lee 13, Hye Lim Jung 14, Jung-Sook Ha 15, Chang-Seok Ki 16 and on behalf of the Benign Hematology Committee of the Korean Pediatric Hematology Oncology Group (KPHOG) † 1 Department of Pediatrics, Pusan National University School of Medicine, Pusan National University Children’s Hospital, Yangsan 50612, Korea; [email protected] (E.J.Y.); [email protected] (Y.T.L.) 2 Department of Pediatrics, Keimyung University School of Medicine, Keimyung University Dongsan Hospital, Daegu 42601, Korea 3 Department of Pediatrics, Keimyung University School of Medicine, Keimyung University Daegu Dongsan Hospital, Daegu 41931, Korea; [email protected] 4 Department of Pediatrics, University of Ulsan College of Medicine, Asan Medical Center Children’s Hospital, Seoul 05505, Korea; [email protected] (H.J.I.); [email protected] (K.-N.K.); [email protected] (H.K.) 5 Department of Pediatrics, Korea Cancer Center Hospital, Seoul 01812, Korea; [email protected] Citation: Yang, E.J.; Shim, Y.J.; Kim, 6 Department of Pediatrics, Gyeongsang National University College of Medicine, H.S.; Lim, Y.T.; Im, H.J.; Koh, K.-N.; Gyeongsang National University Hospital, Jinju 52727, Korea; [email protected] Kim, H.; Suh, J.K.; Park, E.S.; Lee, 7 Department of Pediatrics, Cha Bundang Medical Center, Cha University, Seongnam 13496, Korea; N.H.; et al.
    [Show full text]
  • Computational Analysis of DNA Methylation and Gene Expression Patterns in Prostate Cancer
    Ieva Rauluševičiūtė Computational Analysis of DNA Methylation and Gene Expression Patterns in Prostate Cancer Master’s thesis in Molecular Medicine Trondheim, June 2018 Supervisors: dr. Morten Beck Rye and prof. Finn Drabløs Norwegian University of Science and Technology Faculty of Medicine and Health Sciences Department of Clinical and Molecular Medicine ABSTRACT DNA methylation is an important contributor for prostate cancer development and progression. It has been studied experimentally for years, but, lately, high-throughput technologies are able to produce genome-wide DNA methylation data that can be analyzed using various computational approaches. Thus, this study aims to bioinformatically investigate different DNA methylation and gene expression patterns in prostate cancer. DNA methylation data from three datasets (TCGA, Absher and Kirby) was correlated with gene expression data in order to distinguish different regulation patterns. Classically, increased DNA methylation in promoter regions is being associated with gene expression downregulation. Although, results of the present project demonstrate another robust pattern, where DNA hypermethylation in promoter regions of 1,058 common genes is accompanied by upregulated expression. After analyzing expression and methylation values in the same samples from TCGA dataset, expression overcompensation in a dataset as an explanation for upregulation was excluded. Further reasons behind the pattern were investigated using TCGA DNA methylation data with extended list of probes and includes the presence of methylated positions in CpG islands, distance to transcription start sites and alternative TSSs. As compared with the downregulated genes in the classical pattern, upregulated genes were shown to have more positions in CpG islands and closer to TSSs. Moreover, the presence of alternative TSS in prostate was demonstrated, also disclosing the limitations of methylation detection systems.
    [Show full text]
  • A Negative-Feedback Loop Regulating ERK1/2 Activation and Mediated by Rasgpr2 Phosphorylation
    HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Biochem Manuscript Author Biophys Res Commun Manuscript Author . Author manuscript; available in PMC 2017 May 20. Published in final edited form as: Biochem Biophys Res Commun. 2016 May 20; 474(1): 193–198. doi:10.1016/j.bbrc.2016.04.100. A negative-feedback loop regulating ERK1/2 activation and mediated by RasGPR2 phosphorylation Jinqi Ren#1, Aaron A. Cook#2, Wolfgang Bergmeier2, and John Sondek1,2,3 1Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599 2Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599 3Lineberger Comprehensive Cancer Center, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599 # These authors contributed equally to this work. Abstract The dynamic regulation of ERK1 and −2 (ERK1/2) is required for precise signal transduction controlling cell proliferation, differentiation, and survival. However, the underlying mechanisms regulating the activation of ERK1/2 are not completely understood. In this study, we show that phosphorylation of RasGRP2, a guanine nucleotide exchange factor (GEF), inhibits its ability to activate the small GTPase Rap1 that ultimately leads to decreased activation of ERK1/2 in cells. ERK2 phosphorylates RasGRP2 at Ser394 located in the linker region implicated in its autoinhibition. These studies identify RasGRP2 as a novel substrate of ERK1/2 and define a negative-feedback loop that regulates the BRaf–MEK–ERK signaling cascade. This negative- feedback loop determines the amplitude and duration of active ERK1/2. Keywords RasGRP2; ERK1/2 signaling; negative-feedback loop; phosphorylation; GEF; CalDAG-GEFI Introduction The Ras-Raf-MEK-ERK signaling pathway is essential for many cellular processes, including growth, cell-cycle progression, differentiation, and apoptosis.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • The Capacity of Long-Term in Vitro Proliferation of Acute Myeloid
    The Capacity of Long-Term in Vitro Proliferation of Acute Myeloid Leukemia Cells Supported Only by Exogenous Cytokines Is Associated with a Patient Subset with Adverse Outcome Annette K. Brenner, Elise Aasebø, Maria Hernandez-Valladares, Frode Selheim, Frode Berven, Ida-Sofie Grønningsæter, Sushma Bartaula-Brevik and Øystein Bruserud Supplementary Material S2 of S31 Table S1. Detailed information about the 68 AML patients included in the study. # of blasts Viability Proliferation Cytokine Viable cells Change in ID Gender Age Etiology FAB Cytogenetics Mutations CD34 Colonies (109/L) (%) 48 h (cpm) secretion (106) 5 weeks phenotype 1 M 42 de novo 241 M2 normal Flt3 pos 31.0 3848 low 0.24 7 yes 2 M 82 MF 12.4 M2 t(9;22) wt pos 81.6 74,686 low 1.43 969 yes 3 F 49 CML/relapse 149 M2 complex n.d. pos 26.2 3472 low 0.08 n.d. no 4 M 33 de novo 62.0 M2 normal wt pos 67.5 6206 low 0.08 6.5 no 5 M 71 relapse 91.0 M4 normal NPM1 pos 63.5 21,331 low 0.17 n.d. yes 6 M 83 de novo 109 M1 n.d. wt pos 19.1 8764 low 1.65 693 no 7 F 77 MDS 26.4 M1 normal wt pos 89.4 53,799 high 3.43 2746 no 8 M 46 de novo 26.9 M1 normal NPM1 n.d. n.d. 3472 low 1.56 n.d. no 9 M 68 MF 50.8 M4 normal D835 pos 69.4 1640 low 0.08 n.d.
    [Show full text]
  • Chuanxiong Rhizoma Compound on HIF-VEGF Pathway and Cerebral Ischemia-Reperfusion Injury’S Biological Network Based on Systematic Pharmacology
    ORIGINAL RESEARCH published: 25 June 2021 doi: 10.3389/fphar.2021.601846 Exploring the Regulatory Mechanism of Hedysarum Multijugum Maxim.-Chuanxiong Rhizoma Compound on HIF-VEGF Pathway and Cerebral Ischemia-Reperfusion Injury’s Biological Network Based on Systematic Pharmacology Kailin Yang 1†, Liuting Zeng 1†, Anqi Ge 2†, Yi Chen 1†, Shanshan Wang 1†, Xiaofei Zhu 1,3† and Jinwen Ge 1,4* Edited by: 1 Takashi Sato, Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of 2 Tokyo University of Pharmacy and Life Cardio-Cerebral Diseases, Hunan University of Chinese Medicine, Changsha, China, Galactophore Department, The First 3 Sciences, Japan Hospital of Hunan University of Chinese Medicine, Changsha, China, School of Graduate, Central South University, Changsha, China, 4Shaoyang University, Shaoyang, China Reviewed by: Hui Zhao, Capital Medical University, China Background: Clinical research found that Hedysarum Multijugum Maxim.-Chuanxiong Maria Luisa Del Moral, fi University of Jaén, Spain Rhizoma Compound (HCC) has de nite curative effect on cerebral ischemic diseases, *Correspondence: such as ischemic stroke and cerebral ischemia-reperfusion injury (CIR). However, its Jinwen Ge mechanism for treating cerebral ischemia is still not fully explained. [email protected] †These authors share first authorship Methods: The traditional Chinese medicine related database were utilized to obtain the components of HCC. The Pharmmapper were used to predict HCC’s potential targets. Specialty section: The CIR genes were obtained from Genecards and OMIM and the protein-protein This article was submitted to interaction (PPI) data of HCC’s targets and IS genes were obtained from String Ethnopharmacology, a section of the journal database.
    [Show full text]
  • Essential Genes and Their Role in Autism Spectrum Disorder
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2017 Essential Genes And Their Role In Autism Spectrum Disorder Xiao Ji University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Bioinformatics Commons, and the Genetics Commons Recommended Citation Ji, Xiao, "Essential Genes And Their Role In Autism Spectrum Disorder" (2017). Publicly Accessible Penn Dissertations. 2369. https://repository.upenn.edu/edissertations/2369 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/2369 For more information, please contact [email protected]. Essential Genes And Their Role In Autism Spectrum Disorder Abstract Essential genes (EGs) play central roles in fundamental cellular processes and are required for the survival of an organism. EGs are enriched for human disease genes and are under strong purifying selection. This intolerance to deleterious mutations, commonly observed haploinsufficiency and the importance of EGs in pre- and postnatal development suggests a possible cumulative effect of deleterious variants in EGs on complex neurodevelopmental disorders. Autism spectrum disorder (ASD) is a heterogeneous, highly heritable neurodevelopmental syndrome characterized by impaired social interaction, communication and repetitive behavior. More and more genetic evidence points to a polygenic model of ASD and it is estimated that hundreds of genes contribute to ASD. The central question addressed in this dissertation is whether genes with a strong effect on survival and fitness (i.e. EGs) play a specific oler in ASD risk. I compiled a comprehensive catalog of 3,915 mammalian EGs by combining human orthologs of lethal genes in knockout mice and genes responsible for cell-based essentiality.
    [Show full text]
  • Queryor: a Comprehensive Web Platform for Genetic Variant Analysis
    Bertoldi et al. BMC Bioinformatics (2017) 18:225 DOI 10.1186/s12859-017-1654-4 SOFTWARE Open Access QueryOR: a comprehensive web platform for genetic variant analysis and prioritization Loris Bertoldi1, Claudio Forcato1, Nicola Vitulo1,5, Giovanni Birolo1, Fabio De Pascale1, Erika Feltrin1, Riccardo Schiavon2, Franca Anglani3, Susanna Negrisolo4, Alessandra Zanetti4, Francesca D’Avanzo4, Rosella Tomanin4, Georgine Faulkner2, Alessandro Vezzi2 and Giorgio Valle1,2* Abstract Background: Whole genome and exome sequencing are contributing to the extraordinary progress in the study of human genetic variants. In this fast developing field, appropriate and easily accessible tools are required to facilitate data analysis. Results: Here we describe QueryOR, a web platform suitable for searching among known candidate genes as well as for finding novel gene-disease associations. QueryOR combines several innovative features that make it comprehensive, flexible and easy to use. Instead of being designed on specific datasets, it works on a general XML schema specifying formats and criteria of each data source. Thanks to this flexibility, new criteria can be easily added for future expansion. Currently, up to 70 user-selectable criteria are available, including a wide range of gene and variant features. Moreover, rather than progressively discarding variants taking one criterion at a time, the prioritization is achieved by a global positive selection process that considers all transcript isoforms, thus producing reliable results. QueryOR is easy to use and its intuitive interface allows to handle different kinds of inheritance as well as features related to sharing variants in different patients. QueryOR is suitable for investigating single patients, families or cohorts. Conclusions: QueryOR is a comprehensive and flexible web platform eligible for an easy user-driven variant prioritization.
    [Show full text]
  • Proteome Profiling of Developing Murine Lens Through Mass
    Lens Proteome Profiling of Developing Murine Lens Through Mass Spectrometry Shahid Y. Khan,1 Muhammad Ali,1 Firoz Kabir,1 Santosh Renuse,2 Chan Hyun Na,2 C. Conover Talbot Jr,3 Sean F. Hackett,1 and S. Amer Riazuddin1 1The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States 2Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States 3Institute for Basic Biomedical Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States Correspondence: S. Amer Riazuddin, PURPOSE. We previously completed a comprehensive profile of the mouse lens transcriptome. The Wilmer Eye Institute, Johns Here, we investigate the proteome of the mouse lens through mass spectrometry–based Hopkins University School of Medi- protein sequencing at the same embryonic and postnatal time points. cine, 600 N. Wolfe Street; Maumenee 840, Baltimore, MD 21287, USA; METHODS. We extracted mouse lenses at embryonic day 15 (E15) and 18 (E18) and postnatal [email protected]. day 0 (P0), 3 (P3), 6 (P6), and 9 (P9). The lenses from each time point were preserved in three Submitted: February 2, 2017 distinct pools to serve as biological replicates for each developmental stage. The total cellular Accepted: October 13, 2017 protein was extracted from the lens, digested with trypsin, and labeled with isobaric tandem mass tags (TMT) for three independent TMT experiments. Citation: Khan SY, Ali M, Kabir F, et al. Proteome profiling of developing mu- RESULTS. A total of 5404 proteins were identified in the mouse ocular lens in at least one rine lens through mass spectrometry.
    [Show full text]
  • Whole Exome Sequencing in Families at High Risk for Hodgkin Lymphoma: Identification of a Predisposing Mutation in the KDR Gene
    Hodgkin Lymphoma SUPPLEMENTARY APPENDIX Whole exome sequencing in families at high risk for Hodgkin lymphoma: identification of a predisposing mutation in the KDR gene Melissa Rotunno, 1 Mary L. McMaster, 1 Joseph Boland, 2 Sara Bass, 2 Xijun Zhang, 2 Laurie Burdett, 2 Belynda Hicks, 2 Sarangan Ravichandran, 3 Brian T. Luke, 3 Meredith Yeager, 2 Laura Fontaine, 4 Paula L. Hyland, 1 Alisa M. Goldstein, 1 NCI DCEG Cancer Sequencing Working Group, NCI DCEG Cancer Genomics Research Laboratory, Stephen J. Chanock, 5 Neil E. Caporaso, 1 Margaret A. Tucker, 6 and Lynn R. Goldin 1 1Genetic Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 2Cancer Genomics Research Laboratory, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 3Ad - vanced Biomedical Computing Center, Leidos Biomedical Research Inc.; Frederick National Laboratory for Cancer Research, Frederick, MD; 4Westat, Inc., Rockville MD; 5Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; and 6Human Genetics Program, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA ©2016 Ferrata Storti Foundation. This is an open-access paper. doi:10.3324/haematol.2015.135475 Received: August 19, 2015. Accepted: January 7, 2016. Pre-published: June 13, 2016. Correspondence: [email protected] Supplemental Author Information: NCI DCEG Cancer Sequencing Working Group: Mark H. Greene, Allan Hildesheim, Nan Hu, Maria Theresa Landi, Jennifer Loud, Phuong Mai, Lisa Mirabello, Lindsay Morton, Dilys Parry, Anand Pathak, Douglas R. Stewart, Philip R. Taylor, Geoffrey S. Tobias, Xiaohong R. Yang, Guoqin Yu NCI DCEG Cancer Genomics Research Laboratory: Salma Chowdhury, Michael Cullen, Casey Dagnall, Herbert Higson, Amy A.
    [Show full text]
  • Early Growth Response 1 Regulates Hematopoietic Support and Proliferation in Human Primary Bone Marrow Stromal Cells
    Hematopoiesis SUPPLEMENTARY APPENDIX Early growth response 1 regulates hematopoietic support and proliferation in human primary bone marrow stromal cells Hongzhe Li, 1,2 Hooi-Ching Lim, 1,2 Dimitra Zacharaki, 1,2 Xiaojie Xian, 2,3 Keane J.G. Kenswil, 4 Sandro Bräunig, 1,2 Marc H.G.P. Raaijmakers, 4 Niels-Bjarne Woods, 2,3 Jenny Hansson, 1,2 and Stefan Scheding 1,2,5 1Division of Molecular Hematology, Department of Laboratory Medicine, Lund University, Lund, Sweden; 2Lund Stem Cell Center, Depart - ment of Laboratory Medicine, Lund University, Lund, Sweden; 3Division of Molecular Medicine and Gene Therapy, Department of Labora - tory Medicine, Lund University, Lund, Sweden; 4Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands and 5Department of Hematology, Skåne University Hospital Lund, Skåne, Sweden ©2020 Ferrata Storti Foundation. This is an open-access paper. doi:10.3324/haematol. 2019.216648 Received: January 14, 2019. Accepted: July 19, 2019. Pre-published: August 1, 2019. Correspondence: STEFAN SCHEDING - [email protected] Li et al.: Supplemental data 1. Supplemental Materials and Methods BM-MNC isolation Bone marrow mononuclear cells (BM-MNC) from BM aspiration samples were isolated by density gradient centrifugation (LSM 1077 Lymphocyte, PAA, Pasching, Austria) either with or without prior incubation with RosetteSep Human Mesenchymal Stem Cell Enrichment Cocktail (STEMCELL Technologies, Vancouver, Canada) for lineage depletion (CD3, CD14, CD19, CD38, CD66b, glycophorin A). BM-MNCs from fetal long bones and adult hip bones were isolated as reported previously 1 by gently crushing bones (femora, tibiae, fibulae, humeri, radii and ulna) in PBS+0.5% FCS subsequent passing of the cell suspension through a 40-µm filter.
    [Show full text]