Identification of Two Novel Biomarkers of Rectal Carcinoma Progression and Prognosis Via Co-Expression Network Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Identification of Two Novel Biomarkers of Rectal Carcinoma Progression and Prognosis Via Co-Expression Network Analysis www.impactjournals.com/oncotarget/ Oncotarget, 2017, Vol. 8, (No. 41), pp: 69594-69609 Research Paper Identification of two novel biomarkers of rectal carcinoma progression and prognosis via co-expression network analysis Min Sun1,*, Taojiao Sun2,*, Zhongshi He3 and Bin Xiong1 1Department of Oncology, Zhongnan Hospital of Wuhan University, Hubei Key Laboratory of Tumor Biological Behaviors & Hubei Cancer Clinical Study Center, Wuhan 430071, P.R. China 2Department of Stomatology, Zhongnan Hospital of Wuhan University, Wuhan 430071, P.R. China 3Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan 430071, P.R. China *These authors have contributed equally to this work Correspondence to: Bin Xiong, email: [email protected] Keywords: rectal cancer, The Cancer Genome Atlas, weighted gene co-expression network analysis, prognosis, disease progression Received: March 06, 2017 Accepted: May 22, 2017 Published: June 27, 2017 Copyright: Sun et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License 3.0 (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. ABSTRACT mRNA expression profiles provide important insights on a diversity of biological processes involved in rectal carcinoma (RC). Our aim was to comprehensively map complex interactions between the mRNA expression patterns and the clinical traits of RC. We employed the integrated analysis of five microarray datasets and The Cancer Genome Atlas rectal adenocarcinoma database to identify 2118 consensual differentially expressed genes (DEGs) in RC and adjacent normal tissue samples, and then applied weighted gene co-expression network analysis to parse DEGs and eight clinical traits in 66 eligible RC samples. A total of 16 co-expressed gene modules were identified. The green-yellow and salmon modules were most appropriate to the pathological stage (R = 0.36) and the overall survival (HR =13.534, P = 0.014), respectively. A diagnostic model of the five pathological stage hub genes (SCG3, SYP, CDK5R2, AP3B2, and RUNDC3A) provided a powerful classification accuracy between localized RC and non-localized RC. We also found increased Secretogranin III (SCG3) expression with higher pathological stage and poorer prognosis in the test and validation set. The increased Homer scaffolding protein 2 (HOMER2) expression with the favorable survival prediction efficiency significantly correlated with the markedly reduced overall survival of RC patients and the higher pathological stage during the test and validation set. Our findings indicate that theSCG3 and HOMER2 mRNA levels should be further evaluated as predictors of pathological stage and survival in patients with RC. INTRODUCTION for RC, especially for modules having a strong correlation between genes with similar expression patterns, to predict Accurate staging of rectal cancer (RC) is essential the pathological stage and survival outcome, which could for carrying out precise therapies that increase the help to understand disease pathogenesis and provide survival rate of patients [1]. Previous studies have personalized treatment, have been rarely reported. found no differences in specific clinicopathological risk However, previous research based on examining genetic factors between patients with a high or low risk of RC mutations and different expression patterns associating progression [2]. Recent technological breakthroughs with colorectal carcinogenesis has largely ignored the in genome-wide sequencing have shed new insights relationship between the genes and clinical characteristics on deregulated mRNAs that have been identified and [4, 5]. In addition, RC progression involves several critical characterized in the past few years [3]. Genetic biomarkers stages, although most studies have only evaluated the www.impactjournals.com/oncotarget 69594 Oncotarget Table 1: Quality control results of RC in the different datasets No. Study IQC EQC CQCg CQCp AQCg AQCp Rank 1 GSE75548 3.92 2 1.8* 2.23 1.9* 1.13* 1.83 2 GSE12225 4.74 2 0.68* 1.66* 1.08* 2.38 2.00 3 GSE35982 5.62 2 0.02* 10.03 0* 0.07* 2.67 4 GSE34472 1.3* 0.4* 0.24* 0.36* 0.56* 0.11* 3.50 RC, rectal cancer; GSE, GEO dataset; IQC, internal quality control indexes; EQC, external quality control indexes; CQCg and CQCp, consistency of differential expression quality control indexes for genes and pathways; AQCg and AQCp, accuracy quality control indexes for genes and pathways. * P-value not significant after Bonferroni correction. differences between RC and adjacent normal tissue (ANT), colon cancer, there was very limited mRNA expression regardless of the intermediate stages [6]. Therefore, there data relating to rectal cancer. We investigated and is an urgency to include stage and prognostic predictive manually curated four public datasets (GSE12225, modules to the current staging system, which could GSE34472, GSE35982, and GSE75548) as the training be achieved by combining information on the clinical set (Supplementary Table 1) [12–15]. characteristics and validated gene-specific biomarkers. The training set calculated the six quantitative For these reasons, we aimed to establish comprehensive quality control (QC) measures by standardized mean ranks mRNA expression patterns of the module genes and and principal component analysis (PCA) biplots within clinical traits, especially those relating to multi-stage MetaQC (Table 1 and Figure 1A) [16]. GSE34472 was disease progression which directly affects the prognosis detected in low quality RC samples and was omitted from through weighted gene co-expression network analysis the training set [17]. The training set finally consisted of 65 (WGCNA) [7]. RC samples and 42 ANTs, and it had 8153 gene symbols WGCNA results in the construction of free-scale in common. A total of 6603 gene symbol sets passed the gene co-expression networks that explore the relationships filtering criteria. We identified 4091 mRNAs that were between the gene sets and clinical features [8]. WGCNA consistently DEGs using the moderated t test, Fisher’s groups prerogative genes into modules based on their co- method by summarizing -log(p-value) across studies and expression similarities and analogous functions across a running 300 permutations for the meta-analysis to infer population of samples [9–11]. the P-values (Figure 1B) [18]. As expected, hierarchical In the present work, we carried out the pooled clustering of the three datasets using the remaining 4091 analysis of RC mRNA raw microarrays and the network DEGs distinguished RC from ANT samples (Figure 2A). level analysis of The Cancer Genome Atlas (TCGA) The raw level-3 RNAseq data from 20,501 rectal adenocarcinoma (READ) data to decipher the mRNAs in 95 RC samples and 10 ANT samples were relationships between the module genes and clinical traits. downloaded from TCGA [1, 8, 10]. A total of 2177 DEGs To the best of our knowledge, we are the first group to use were identified by linear models within the microarray meta-analysis and WGCNA to identify modules displaying analysis (LIMMA), among which 1092 were up-regulated a nominal evidence association with RC clinical traits. By and 1085 were down-regulated in RC versus ANT characterizing module content and topology, we identified (Supplementary Table 2). This mRNA signature allowed clinical traits, modules, and network concepts that play for the separation of RC samples from ANT samples in important roles in the regulation of RC at the level of the the 2-way hierarchical cluster (Figure 2B) and in the PCA differentially expressed genes (DEGs). Moreover, to define plot (Figure 2C). A total of 2118 overlapping DEGs were the prognostic value of the cancer-specific module, which chosen by both training and test sets (Figure 2D). was related to tumor progression, further analysis was performed to validate candidate markers by combining Co-expression network construction and module survival analysis with an independent validation cohort. preservation analysis RESULTS We included 2118 DEGs from 70 RC patients with complete clinical traits and prognostic information of the Identification of consensual DEGs in training TCGA-READ test set to construct co-expression networks and test sets in rectal cancer patients via WGCNA. After four outlier samples were discarded, the connections between the genes in the gene network The flow chart outlining the methods used in this were in line with a scale-free network distribution when study is shown in Supplementary Figure 1. In contrast to the soft threshold power β was set at 3 (Supplementary www.impactjournals.com/oncotarget 69595 Oncotarget Figure 2A-2D). The dynamic tree cut method identified yielded significant PCC with the pathological stage (R modules with similar expression profiles (Figure 3A, = 0.25, P = 0.04) and pathology T stage (R = 0.33, P = 3B). After highly similar modules were merged (Figure 0.007). The red (126 genes) module yielded significant 3B), a total of 16 co-expressed modules were identified, PCC with the pathological stage (R = 0.37, P = 0.002), ranging in size from 11 to 623 genes, whereas the “grey” the pathology T stage (R = 0.29, P = 0.02) and pathology module was reserved for genes that were not co-expressed N stage (R = 0.31, P = 0.01). The tan (44 genes) module (Figure 3C). yielded significant PCC with the pathological stage (R = After comparing the TCGA-READ test set with the 0.31, P = 0.01), the pathology T stage (R = 0.26, P = 0.03), validation set (GSE29621) [19], the summary preservation pathology N stage (R = 0.38, P = 0.002), pathology M statistics used in determining whether a reference network stage (R = 0.32, P = 0.009) and lymphatic invasion (R is found in another test network were visualized [20]. The = 0.27, P = 0.03). The green-yellow (54 genes) module green-yellow and salmon modules were well preserved, yielded significant PCC with the pathological stage (R with low median Rank statistics and Z-summary statistics = 0.38, P = 0.002), the pathology T stage (R = 0.34, P larger than 10 (Supplementary Figure 3) [7, 21].
Recommended publications
  • Loss of MAGEL2 in Prader-Willi Syndrome Leads to Decreased Secretory Granule and Neuropeptide Production
    Loss of MAGEL2 in Prader-Willi syndrome leads to decreased secretory granule and neuropeptide production Helen Chen, … , Lawrence T. Reiter, Patrick Ryan Potts JCI Insight. 2020;5(17):e138576. https://doi.org/10.1172/jci.insight.138576. Research Article Cell biology Neuroscience Graphical abstract Find the latest version: https://jci.me/138576/pdf RESEARCH ARTICLE Loss of MAGEL2 in Prader-Willi syndrome leads to decreased secretory granule and neuropeptide production Helen Chen,1 A. Kaitlyn Victor,2 Jonathon Klein,1 Klementina Fon Tacer,1 Derek J.C. Tai,3,4,5 Celine de Esch,3,4,5 Alexander Nuttle,3,4,5 Jamshid Temirov,1 Lisa C. Burnett,6,7 Michael Rosenbaum,7 Yiying Zhang,7 Li Ding,8 James J. Moresco,9 Jolene K. Diedrich,9 John R. Yates III,9 Heather S. Tillman,10 Rudolph L. Leibel,7 Michael E. Talkowski,3,4,5 Daniel D. Billadeau,8 Lawrence T. Reiter,2 and Patrick Ryan Potts1 1Department of Cell and Molecular Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee, USA. 2Department of Neurology, Department of Pediatrics, and Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, USA. 3Center for Genomic Medicine, Department of Neurology, Department of Pathology, and Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts, USA. 4Department of Neurology, Harvard Medical School, Boston, Massachusetts, USA. 5Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute, Cambridge, Massachusetts, USA. 6Levo Therapeutics, Inc., Skokie, Illinois, USA. 7Division of Molecular Genetics, Department of Pediatrics, and Naomi Berrie Diabetes Center, Vagelos College of Physicians and Surgeons, Columbia University Irving Medical Center, New York, New York, USA.
    [Show full text]
  • A High Throughput, Functional Screen of Human Body Mass Index GWAS Loci Using Tissue-Specific Rnai Drosophila Melanogaster Crosses Thomas J
    Washington University School of Medicine Digital Commons@Becker Open Access Publications 2018 A high throughput, functional screen of human Body Mass Index GWAS loci using tissue-specific RNAi Drosophila melanogaster crosses Thomas J. Baranski Washington University School of Medicine in St. Louis Aldi T. Kraja Washington University School of Medicine in St. Louis Jill L. Fink Washington University School of Medicine in St. Louis Mary Feitosa Washington University School of Medicine in St. Louis Petra A. Lenzini Washington University School of Medicine in St. Louis See next page for additional authors Follow this and additional works at: https://digitalcommons.wustl.edu/open_access_pubs Recommended Citation Baranski, Thomas J.; Kraja, Aldi T.; Fink, Jill L.; Feitosa, Mary; Lenzini, Petra A.; Borecki, Ingrid B.; Liu, Ching-Ti; Cupples, L. Adrienne; North, Kari E.; and Province, Michael A., ,"A high throughput, functional screen of human Body Mass Index GWAS loci using tissue-specific RNAi Drosophila melanogaster crosses." PLoS Genetics.14,4. e1007222. (2018). https://digitalcommons.wustl.edu/open_access_pubs/6820 This Open Access Publication is brought to you for free and open access by Digital Commons@Becker. It has been accepted for inclusion in Open Access Publications by an authorized administrator of Digital Commons@Becker. For more information, please contact [email protected]. Authors Thomas J. Baranski, Aldi T. Kraja, Jill L. Fink, Mary Feitosa, Petra A. Lenzini, Ingrid B. Borecki, Ching-Ti Liu, L. Adrienne Cupples, Kari E. North, and Michael A. Province This open access publication is available at Digital Commons@Becker: https://digitalcommons.wustl.edu/open_access_pubs/6820 RESEARCH ARTICLE A high throughput, functional screen of human Body Mass Index GWAS loci using tissue-specific RNAi Drosophila melanogaster crosses Thomas J.
    [Show full text]
  • Anti-Scg3 Therapeutic Antibody
    Protheragen Inc. Anti-Scg3 Therapeutic Antibody A Selective Angiogenesis Blocker to Treat Multiple Angiogenesis-Dependent Diseases Overview Drug Name Anti-Scg3 antibody Anti-Scg3 antibody aims at the target secretogranin III (Scg3). It alleviates disease vascular leakage with high efficacy and no side-effects. In addition to curing diabetic retinopathy, anti-Scg3 antibody has the utility to treat other angiogenesis-dependent diseases such as retinopathy of prematurity, choroidal neovascularization, and tumor growth and metastasis. Data implicates that Scg3 has minimal binding and angiogenic activity in normal vessels but it Description markedly increases the binding and angiogenic activity in disease conditions. Different from vascular endothelial growth factor (VEGF), anti-Scg3 antibody can selectively bind and inhibit angiogenic activity in disease micro- environment, thus avoiding the damage to normal vasculature. In this case, anti-Scg3 antibody has great potentials to be widely applied in the treatment of various angiogenesis-dependent diseases with high safety. All the research results have been patented. Diabetic retinopathy (DR); Cancers; Retinopathy of prematurity (ROP); Diabetic Indication macula edema (DME); Choroidal neovascularization (CNV) Target Scg3 Product Category Humanized monoclonal antibody; Cancer immunotherapy; Fertility regulation Mechanism of Antibody; Anti-angiogenesis Action Status Pre-clinical Patent Two patents have been filed. * This program was developed by Protheragen’s partner Everglades Biopharma. Cooperation
    [Show full text]
  • Peripheral Nerve Single-Cell Analysis Identifies Mesenchymal Ligands That Promote Axonal Growth
    Research Article: New Research Development Peripheral Nerve Single-Cell Analysis Identifies Mesenchymal Ligands that Promote Axonal Growth Jeremy S. Toma,1 Konstantina Karamboulas,1,ª Matthew J. Carr,1,2,ª Adelaida Kolaj,1,3 Scott A. Yuzwa,1 Neemat Mahmud,1,3 Mekayla A. Storer,1 David R. Kaplan,1,2,4 and Freda D. Miller1,2,3,4 https://doi.org/10.1523/ENEURO.0066-20.2020 1Program in Neurosciences and Mental Health, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada, 2Institute of Medical Sciences University of Toronto, Toronto, Ontario M5G 1A8, Canada, 3Department of Physiology, University of Toronto, Toronto, Ontario M5G 1A8, Canada, and 4Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5G 1A8, Canada Abstract Peripheral nerves provide a supportive growth environment for developing and regenerating axons and are es- sential for maintenance and repair of many non-neural tissues. This capacity has largely been ascribed to paracrine factors secreted by nerve-resident Schwann cells. Here, we used single-cell transcriptional profiling to identify ligands made by different injured rodent nerve cell types and have combined this with cell-surface mass spectrometry to computationally model potential paracrine interactions with peripheral neurons. These analyses show that peripheral nerves make many ligands predicted to act on peripheral and CNS neurons, in- cluding known and previously uncharacterized ligands. While Schwann cells are an important ligand source within injured nerves, more than half of the predicted ligands are made by nerve-resident mesenchymal cells, including the endoneurial cells most closely associated with peripheral axons. At least three of these mesen- chymal ligands, ANGPT1, CCL11, and VEGFC, promote growth when locally applied on sympathetic axons.
    [Show full text]
  • Downloaded 01/08/2020)
    www.aging-us.com AGING 2021, Vol. 13, No. 16 Research Paper Proteomic characterization of secretory granules in dopaminergic neurons indicates chromogranin/secretogranin-mediated protein processing impairment in Parkinson’s disease Gehua Wen1, Hao Pang1, Xu Wu1, Enzhu Jiang1, Xique Zhang2, Xiaoni Zhan1 1School of Forensic Medicine, China Medical University, Shenyang, PR China 2Department of Geriatrics, The First Affiliated Hospital of China Medical University, Shenyang, PR China Correspondence to: Xiaoni Zhan; email: [email protected] Keywords: Parkinson’s disease, secretory granules, secretogranin, proteomics, lipid metabolism, neuropeptide Received: May 12, 2021 Accepted: August 3, 2021 Published: August 21, 2021 Copyright: © 2021 Wen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. ABSTRACT Parkinson’s disease (PD) is an aging disorder related to vesicle transport dysfunctions and neurotransmitter secretion. Secretory granules (SGs) are large dense-core vesicles for the biosynthesis of neuropeptides and hormones. At present, the involvement of SGs impairment in PD remains unclear. In the current study, we found that the number of SGs in tyrosine hydroxylase-positive neurons and the marker proteins secretogranin III (Scg3) significantly decreased in the substantia nigra and striatum regions of 1-methyl-4-phenyl-1, 2, 3, 6-tetrahydropyridine (MPTP) exposed mice. Proteomic study of SGs purified from the dopaminergic SH-sy5Y cells under 1-methyl-4-phenylpyridinium (MPP+) treatments (ProteomeXchange PXD023937) identified 536 significantly differentially expressed proteins. The result indicated that disabled lysosome and peroxisome, lipid and energy metabolism disorders are three characteristic features.
    [Show full text]
  • Computational Studies of the Genome Dynamics of Mammalian Transposable Elements and Their Relationships to Genes
    COMPUTATIONAL STUDIES OF THE GENOME DYNAMICS OF MAMMALIAN TRANSPOSABLE ELEMENTS AND THEIR RELATIONSHIPS TO GENES by Ying Zhang M.Sc., Katholieke Universiteit Leuven (BELGIUM), 2004 B.E., Harbin Institute of Technology (CHINA), 1993 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY in THE FACULTY OF GRADUATE STUDIES (Genetics) THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver) May, 2012 © Ying Zhang, 2012 Abstract Sequences derived from transposable elements (TEs) comprise nearly 40 - 50% of the genomic DNA of most mammalian species, including mouse and human. However, what impact they may exert on their hosts is an intriguing question. Originally considered as merely genomic parasites or “selfish DNA”, these mobile elements show their detrimental effects through a variety of mechanisms, from physical DNA disruption to epigenetic regulation. On the other hand, evidence has been mounting to suggest that TEs sometimes may also play important roles by participating in essential biological processes in the host cell. The dual-roles of TE-host interactions make it critical for us to understand the relationship between TEs and the host, which may ultimately help us to better understand both normal cellular functions and disease. This thesis encompasses my three genome-wide computational studies of TE-gene dynamics in mammals. In the first, I identified high levels of TE insertional polymorphisms among inbred mouse strains, and systematically analyzed their distributional features and biological effects, through mining tens of millions of mouse genomic DNA sequences. In the second, I examined the properties of TEs located in introns, and identified key factors, such as the distance to the intron-exon boundary, insertional orientation, and proximity to splice sites, that influence the probability that TEs will be retained in genes.
    [Show full text]
  • SCG3 Antibody (C-Term) Affinity Purified Rabbit Polyclonal Antibody (Pab) Catalog # Ap17492b
    10320 Camino Santa Fe, Suite G San Diego, CA 92121 Tel: 858.875.1900 Fax: 858.622.0609 SCG3 Antibody (C-term) Affinity Purified Rabbit Polyclonal Antibody (Pab) Catalog # AP17492b Specification SCG3 Antibody (C-term) - Product Information Application WB,E Primary Accession Q8WXD2 Other Accession NP_037375.2, NP_001158729.1 Reactivity Human Host Rabbit Clonality Polyclonal Isotype Rabbit Ig Calculated MW 53005 Antigen Region 378-406 SCG3 Antibody (C-term) - Additional Information SCG3 Antibody (C-term) (Cat. #AP17492b) Gene ID 29106 western blot analysis in CEM cell line lysates (35ug/lane).This demonstrates the SCG3 Other Names antibody detected the SCG3 protein (arrow). Secretogranin-3, Secretogranin III, SgIII, SCG3 SCG3 Antibody (C-term) - Background Target/Specificity This SCG3 antibody is generated from The protein encoded by this gene is a member rabbits immunized with a KLH conjugated of the synthetic peptide between 378-406 amino chromogranin/secretogranin family of acids from the C-terminal region of human SCG3. neuroendocrine secretory proteins. Granins may serve as precursors for Dilution biologically active WB~~1:1000 peptides. Some granins have been shown to function as helper Format proteins in sorting and proteolytic processing Purified polyclonal antibody supplied in PBS of prohormones; with 0.09% (W/V) sodium azide. This however, the function of this protein is antibody is purified through a protein A unknown. Two transcript column, followed by peptide affinity variants encoding different isoforms have been purification. found for this gene. Storage SCG3 Antibody (C-term) - References Maintain refrigerated at 2-8°C for up to 2 weeks. For long term storage store at -20°C Li, X., et al.
    [Show full text]
  • Gene Ontology Analysis of Arthrogryposis (Multiple Congenital Contractures)
    Received: 5 March 2019 Revised: 13 June 2019 Accepted: 17 July 2019 DOI: 10.1002/ajmg.c.31733 RESEARCH ARTICLE Gene ontology analysis of arthrogryposis (multiple congenital contractures) Jeff Kiefer1 | Judith G. Hall2,3 1Systems Oncology, Scottsdale, Arizona Abstract 2Department of Medical Genetics, University of British Columbia and BC Children's In 2016, we published an article applying Gene Ontology Analysis to the genes that had Hospital, Vancouver, British Columbia, Canada been reported to be associated with arthrogryposis (multiple congenital contractures) (Hall 3Department of Pediatrics, University of & Kiefer, 2016). At that time, 320 genes had been reported to have mutations associated British Columbia and BC Children's Hospital, Vancouver, British Columbia, Canada with arthrogryposis. All were associated with decreased fetal movement. These 320 genes were analyzed by biological process and cellular component categories, and yielded 22 Correspondence Judith G. Hall, Department of Medical distinct groupings. Since that time, another 82 additional genes have been reported, now Genetics, BC Children's Hospital, 4500 Oak totaling 402 genes, which when mutated, are associated with arthrogryposis (arthrogryposis Street, Room C234, Vancouver, British Columbia V6H 3N1, Canada. multiplex congenita). So, we decided to update the analysis in order to stimulate further Email: [email protected] research and possible treatment. Now, 29 groupings can be identified, but only 19 groups have more than one gene. KEYWORDS arthrogryposis, developmental pathways, enrichment analysis, gene ontology, multiple congenital contractures 1 | INTRODUCTION polyhydramnios, decreased gut mobility and shortened gut, short umbili- cal cord, skin changes, and multiple joints with limitation of movement, Arthrogryposis is the term that has been used for the last century including limbs, jaw, and spine).
    [Show full text]
  • Biomedical Informatics
    BIOMEDICAL INFORMATICS Abstract GENE LIST AUTOMATICALLY DERIVED FOR YOU (GLAD4U): DERIVING AND PRIORITIZING GENE LISTS FROM PUBMED LITERATURE JEROME JOURQUIN Thesis under the direction of Professor Bing Zhang Answering questions such as ―Which genes are related to breast cancer?‖ usually requires retrieving relevant publications through the PubMed search engine, reading these publications, and manually creating gene lists. This process is both time-consuming and prone to errors. We report GLAD4U (Gene List Automatically Derived For You), a novel, free web-based gene retrieval and prioritization tool. The quality of gene lists created by GLAD4U for three Gene Ontology terms and three disease terms was assessed using ―gold standard‖ lists curated in public databases. We also compared the performance of GLAD4U with that of another gene prioritization software, EBIMed. GLAD4U has a high overall recall. Although precision is generally low, its prioritization methods successfully rank truly relevant genes at the top of generated lists to facilitate efficient browsing. GLAD4U is simple to use, and its interface can be found at: http://bioinfo.vanderbilt.edu/glad4u. Approved ___________________________________________ Date _____________ GENE LIST AUTOMATICALLY DERIVED FOR YOU (GLAD4U): DERIVING AND PRIORITIZING GENE LISTS FROM PUBMED LITERATURE By Jérôme Jourquin Thesis Submitted to the Faculty of the Graduate School of Vanderbilt University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE in Biomedical Informatics May, 2010 Nashville, Tennessee Approved: Professor Bing Zhang Professor Hua Xu Professor Daniel R. Masys ACKNOWLEDGEMENTS I would like to express profound gratitude to my advisor, Dr. Bing Zhang, for his invaluable support, supervision and suggestions throughout this research work.
    [Show full text]
  • Antibodies Reactive to Non-HLA Antigens in Transplant Glomerulopathy
    CLINICAL RESEARCH www.jasn.org Antibodies Reactive to Non-HLA Antigens in Transplant Glomerulopathy Rajani Dinavahi,*† Ajish George,‡ Anne Tretin,* Enver Akalin,*† Scott Ames,† Jonathan S. Bromberg,† Graciela DeBoccardo,*† Nicholas DiPaola,§ Susan M. Lerner,† ʈ Anita Mehrotra,* Barbara T. Murphy,*† Tibor Nadasdy, Estela Paz-Artal,¶ Daniel R. Salomon,** Bernd Schro¨ppel,*† Vinita Sehgal,*† Ravi Sachidanandam,‡ and Peter S. Heeger*† *Department of Medicine, †Recanati Miller Transplantation Institute, and ‡Department of Genetics and Genomics ʈ Sciences, Mount Sinai School of Medicine, New York, New York; §Histocompatibility Laboratory and Department of Pathology, Ohio State Medical Center, Columbus, Ohio; ¶Hospital 12 de Octubre, Madrid, Spain; and **The Scripps Research Institute, La Jolla, California ABSTRACT Although T and B cell alloimmunity contribute to transplant injury, autoimmunity directed at kidney- expressed, non-HLA antigens may also participate. Because the specificity, prevalence, and importance of antibodies to non-HLA antigens in late allograft injury are poorly characterized, we used a protein microarray to compare antibody repertoires in pre- and post-transplant sera from several cohorts of patients with and without transplant glomerulopathy. Transplantation routinely induced changes in antibody repertoires, but we did not identify any de novo non-HLA antibodies common to patients with transplant glomerulopathy. The screening studies identified three reactivities present before transplan- tation that persisted after transplant and strongly associated with transplant glomerulopathy. ELISA confirmed that reactivity against peroxisomal-trans-2-enoyl-coA-reductase strongly associated with the development of transplant glomerulopathy in independent validation sets. In addition to providing insight into effects of transplantation on non-HLA antibody repertoires, these results suggest that pretransplant serum antibodies to peroxisomal-trans-2-enoyl-coA-reductase may predict prognosis in kidney transplantation.
    [Show full text]
  • 1 Supplementary Material Figure S1. Volcano Plot of Differentially
    Supplementary material Figure S1. Volcano Plot of differentially expressed genes between preterm infants fed own mother’s milk (OMM) or pasteurized donated human milk (DHM). Table S1. The 10 most representative biological processes filtered for enrichment p- value in preterm infants. Biological Processes p-value Quantity of DEG* Transcription, DNA-templated 3.62x10-24 189 Regulation of transcription, DNA-templated 5.34x10-22 188 Transport 3.75x10-17 140 Cell cycle 1.03x10-13 65 Gene expression 3.38x10-10 60 Multicellular organismal development 6.97x10-10 86 1 Protein transport 1.73x10-09 56 Cell division 2.75x10-09 39 Blood coagulation 3.38x10-09 46 DNA repair 8.34x10-09 39 Table S2. Differential genes in transcriptomic analysis of exfoliated epithelial intestinal cells between preterm infants fed own mother’s milk (OMM) and pasteurized donated human milk (DHM). Gene name Gene Symbol p-value Fold-Change (OMM vs. DHM) (OMM vs. DHM) Lactalbumin, alpha LALBA 0.0024 2.92 Casein kappa CSN3 0.0024 2.59 Casein beta CSN2 0.0093 2.13 Cytochrome c oxidase subunit I COX1 0.0263 2.07 Casein alpha s1 CSN1S1 0.0084 1.71 Espin ESPN 0.0008 1.58 MTND2 ND2 0.0138 1.57 Small ubiquitin-like modifier 3 SUMO3 0.0037 1.54 Eukaryotic translation elongation EEF1A1 0.0365 1.53 factor 1 alpha 1 Ribosomal protein L10 RPL10 0.0195 1.52 Keratin associated protein 2-4 KRTAP2-4 0.0019 1.46 Serine peptidase inhibitor, Kunitz SPINT1 0.0007 1.44 type 1 Zinc finger family member 788 ZNF788 0.0000 1.43 Mitochondrial ribosomal protein MRPL38 0.0020 1.41 L38 Diacylglycerol
    [Show full text]
  • Jcem1145.Pdf
    0021-972X/07/$15.00/0 The Journal of Clinical Endocrinology & Metabolism 92(3):1145–1154 Printed in U.S.A. Copyright © 2007 by The Endocrine Society doi: 10.1210/jc.2006-1808 Functional Single-Nucleotide Polymorphisms in the Secretogranin III (SCG3) Gene that Form Secretory Granules with Appetite-Related Neuropeptides Are Associated with Obesity Atsushi Tanabe, Takahiro Yanagiya, Aritoshi Iida, Susumu Saito, Akihiro Sekine, Atsushi Takahashi, Downloaded from https://academic.oup.com/jcem/article/92/3/1145/2597987 by guest on 28 September 2021 Takahiro Nakamura, Tatsuhiko Tsunoda, Seika Kamohara, Yoshio Nakata, Kazuaki Kotani, Ryoya Komatsu, Naoto Itoh, Ikuo Mineo, Jun Wada, Tohru Funahashi, Shigeru Miyazaki, Katsuto Tokunaga, Kazuyuki Hamaguchi, Tatsuo Shimada, Kiyoji Tanaka, Kentaro Yamada, Toshiaki Hanafusa, Shinichi Oikawa, Hironobu Yoshimatsu, Toshiie Sakata, Yuji Matsuzawa, Naoyuki Kamatani, Yusuke Nakamura, and Kikuko Hotta Laboratories for Obesity (A.Tan., T.Y., K.Ho.), Pharmacogenetics (A.I.), SNP Analysis (S.S.), SNP Genotyping (A.S.), Statistical Analysis (A.Tak., T.N., N.K.), and Medical Informatics (T.T.), SNP Research Center, RIKEN, Kanagawa 230-0045, Japan; Medicine and Health Science Institute (S.K.), Tokyo Medical University, Tokyo 101-0062, Japan; Institute of Health and Sport Sciences (Y.Nakat., K.T.), University of Tsukuba, Ibaraki 305-8574, Japan; Department of Internal Medicine and Molecular Science (K.K., T.F., Y.M.), Graduate School of Medicine, Osaka University, Osaka 565-0871, Japan; Rinku General Medical Center
    [Show full text]