Identification of Pathogenic Genes and Transcription Factors in Glaucoma

Total Page:16

File Type:pdf, Size:1020Kb

Identification of Pathogenic Genes and Transcription Factors in Glaucoma 216 MOLECULAR MEDICINE REPORTS 20: 216-224, 2019 Identification of pathogenic genes and transcription factors in glaucoma JIE FENG and JING XU Department of Ophthalmology, The First People's Hospital of Jining, Jining, Shandong 272011, P.R. China Received March 20, 2018; Accepted April 3, 2019 DOI: 10.3892/mmr.2019.10236 Abstract. Glaucoma is a group of eye diseases characterized Introduction by alterations in the contour of the optic nerve head, with corresponding visual field defects and progressive loss of Glaucoma is a widely known, multi-factorial disease, which retinal ganglion cells. The present study aimed to identify the may result in apoptosis of retinal ganglion cells. According key genes and upstream regulators in glaucoma. To screen the to the World Health Organization, glaucoma is the second pathogenic genes involved in glaucoma, an integrated analysis principal cause of blindness and the most common cause of was performed by using the microarray datasets in glaucoma irreversible blindness in the world (1,2). In glaucoma, the ante- derived from the Gene Expression Omnibus (GEO) database. rior and posterior segments of the eye are affected, and serious The functional annotation and potential pathways of differen- damage may be detected in the trabecular meshwork (3). tially expressed genes (DEGs) were additionally examined by Oxidative stress is considered to be responsible for the molec- Gene Ontology (GO) and Kyoto Encyclopedia of Genes and ular damage in the anterior chamber. Primary open-angle Genomes (KEGG) enrichment analyses. A glaucoma‑specific glaucoma (POAG) is the most common type of glaucoma, transcriptional regulatory network was constructed to identify accounting for 60-70% all glaucoma (4). A candidate protein crucial transcriptional factors that target the DEGs in glau- that may be associated with POAG is myocilin (MYOC), coma. From two GEO datasets, 1,935 DEGs (951 upregulated encoded by the MYOC gene. MYOC mutations are common and 984 downregulated genes) between glaucoma and normal in patients with POAG with high levels of intraocular pressure controls were identified. GO and KEGG analyses identified that (IOP) (5,6). Additionally, mutations in optineurin were identi- ‘eye development’ [false discovery rate (FDR)=0.00415533] fied in patients with POAG (7). Previous studies suggested and ‘visual perception’ (FDR=0.00713283) were significantly that an abnormal expression of serine/threonine-protein enriched pathways for DEGs. The expression of lipocalin 2 kinase TBK1 is a cause of normal-tension glaucoma (8-10). (LCN2), monoamine oxidase A (MAOA), hemoglobin Furthermore, a previous study suggested that the calcium subunit β (HBB), paired box 6 (PAX6), fibronectin (FN1) and load-activated calcium channel was involved in glaucoma and cAMP responsive element binding protein 1 (CREB1) were that cyclin-dependent kinase 4 inhibitor B antisense RNA 1 demonstrated to be involved in the pathogenesis of glaucoma. was upregulated in the retina of a rat model of glaucoma (11). In conclusion, LCN2, MAOA, HBB, PAX6, FN1 and CREB1 However, even substantial decreases in IOP are not able may serve roles in glaucoma, regulated by PAX4, solute carrier to prevent the development and progression of glaucoma in family 22 member 1, hepatocyte nuclear factor 4 α and ELK1, a number of clinical cases (12). Glaucoma-associated cell ETS transcription factor. These data may contribute to the death is primarily caused by apoptosis, which is triggered development of novel potential biomarkers, reveal the under- by oxidative stress via mitochondrial damage, inflammation, lying pathogenesis and additionally identify novel therapeutic endothelial dysregulation and dysfunction, and hypoxia (13). targets for glaucoma. In general, glaucoma is not preventable; however, the vast majority of patients may maintain useful visual function for life if they have early detection and appropriate treatment (14). Therefore, for the prevention of glaucoma, emphasis must be placed on early detection, and early diagnosis and treatment. The rapid development and application of high-throughput sequencing technology has provided a comprehensive and rapid Correspondence to: Ms. Jing Xu, Department of Ophthalmology, The First People's Hospital of Jining, 6 Jiankang Road, Jining, analytical method for the study of the pathogenesis of glaucoma, Shandong 272011, P.R. China and provide novel ideas for the future treatment of glaucoma (15). E‑mail: [email protected] The present study aimed to analyze high-throughput transcrip- tome data from tissue samples of patients with glaucoma and a Key words: glaucoma, transcription factors, differentially expressed normal control group. The data was used in bioinformatics anal- genes, integrated analysis yses to identify key transcription factors (TFs) associated with glaucoma, to examine the pathogenesis of glaucoma and provide a basis for the diagnosis of glaucoma and drug development. FENG and XU: PATHOGENIC GENES AND TFs IN GLAUCOMA 217 Materials and methods FACtor (TRANSFAC) website match tool (gene-regulation. com/pub/databases.html) was subsequently used to analyze Microarray expression profiling in Gene Expression Omnibus TFs capable of binding to the promoter region of the DEGs. (GEO). The GEO is the largest database of high-throughput TFs that exhibited altered expression in glaucoma with gene expression data that was developed and is maintained FDR<0.001 were selected. Position Weight Matrix scanning by the National Center for Biotechnology Information (16). was used to scan the human genome sequence to obtain the The GEO was searched to obtain gene expression profiling protein-coding genes that were regulated by the differentially studies of glaucoma subjects. The following key search terms expressed TFs. Following removal of redundant information, were used: [ʻglaucoma’ (Medical subject headings Terms) OR the glaucoma‑specific transcriptional regulatory network was ʻglaucoma’ (All Fields)] AND ʻHomo sapiens’ (porgn) AND constructed using Cytoscape software. ʻgse’ (Filter). The selection criteria were as follows: i) The selected dataset must include genome-wide mRNA tran- In silico validation of DEGs using GEO. The GEO database scriptome data; ii) the data was obtained from the trabecular (GSE9944) was used to validate the expression of selected meshwork tissue samples of glaucoma and normal control glaucoma DEGs. The expression levels of these genes were trabecular meshwork tissue samples; and iii) normalized and compared between the glaucoma cases and the normal group. raw datasets were considered. Following selection, two sets of The expression of five genes, cAMP responsive element GSE27276 (17) and GSE4316 (18) glaucoma mRNA data were binding protein 1 (CREB1), fibronectin 1 (FN1), keratin 19 obtained (19,20). (KRT19), lipocalin 2 (LCN2) and paired box 6 (PAX6) was detected, with the difference of expression levels presented as Identification of differentially expressed genes (DEGs) in box-plots. glaucoma compared with normal controls. Background correction was performed on the raw data. The normaliza- Results tion was performed using the Linear Models for Microarray (Limma version 3.30.13) Data package in R (21). Subsequently, Differential expression analysis of genes in glaucoma. The two-tailed Student’s t-tests were performed to calculate indi- probes corresponding to multiple genes were removed, and vidual P-values. Stouffer's test was used to merge individual the average gene expression to which multiple probes corre- P-values, and multiple comparison correction was performed sponded with was calculated. Finally, the intersection of using the Benjamini and Hochberg method to obtain the 15,757 genes was obtained. false discovery rate (FDR) (22). Genes with FDR<0.001 were A total of two gene expression microarray datasets selected as DEGs. Finally, the DEGs in glaucoma vs. normal (GSE27276 and GSE4316) were used for the analysis. were identified. Compared with the normal controls, 1,935 DEGs in glau- coma were obtained (P<0.05); among these, 951 genes were Functional annotation of DEGs. Gene Ontology (GO) (23) upregulated and 984 genes were downregulated. The top 40 and Kyoto Encyclopedia of Genes and Genomes (KEGG) (24) most significantly up‑ or downregulated genes are summa- pathway enrichment analysis were performed to detect rized in Table I. Among which, PAX6 (26), LCN2 (27), and the biological functions and potential pathways associated MAOA (28) were downregulated and were associated with with DEGs using GeneCoDis3 (http://genecodis.cnb.csic. glaucoma. The DEGs were screened for clustering analysis. es/analysis) as previously described (19). The GO functions of The heatmap produced by cluster analysis of the two sets of the DEGs were determined according to the three categories cDNA microarray data is presented in Fig. 1. of: ‘Biological process’; ‘molecular functions’; and ‘cellular component’. Pathway enrichment analysis was based on the Functional annotation. In Fig. 2, GO enrichment demonstrated KEGG database, as previously described (25). that the DEGs were significantly enriched the ‘Biological processes’ categories: ‘eye development’ (FDR=4.15x10-3); Protein‑protein interaction (PPI) network construction. In ‘visual perception’ (FDR=7.13x10-3); ‘negative regulation order to identify candidate genes involved in the formation of of insulin receptor signaling pathway’ (FDR=1.47x10-2); glaucoma, PPI networks of significant DEGs were constructed, the ‘Cellular components’ categories:
Recommended publications
  • PARSANA-DISSERTATION-2020.Pdf
    DECIPHERING TRANSCRIPTIONAL PATTERNS OF GENE REGULATION: A COMPUTATIONAL APPROACH by Princy Parsana A dissertation submitted to The Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy Baltimore, Maryland July, 2020 © 2020 Princy Parsana All rights reserved Abstract With rapid advancements in sequencing technology, we now have the ability to sequence the entire human genome, and to quantify expression of tens of thousands of genes from hundreds of individuals. This provides an extraordinary opportunity to learn phenotype relevant genomic patterns that can improve our understanding of molecular and cellular processes underlying a trait. The high dimensional nature of genomic data presents a range of computational and statistical challenges. This dissertation presents a compilation of projects that were driven by the motivation to efficiently capture gene regulatory patterns in the human transcriptome, while addressing statistical and computational challenges that accompany this data. We attempt to address two major difficulties in this domain: a) artifacts and noise in transcriptomic data, andb) limited statistical power. First, we present our work on investigating the effect of artifactual variation in gene expression data and its impact on trans-eQTL discovery. Here we performed an in-depth analysis of diverse pre-recorded covariates and latent confounders to understand their contribution to heterogeneity in gene expression measurements. Next, we discovered 673 trans-eQTLs across 16 human tissues using v6 data from the Genotype Tissue Expression (GTEx) project. Finally, we characterized two trait-associated trans-eQTLs; one in Skeletal Muscle and another in Thyroid. Second, we present a principal component based residualization method to correct gene expression measurements prior to reconstruction of co-expression networks.
    [Show full text]
  • Identification of Potential Key Genes and Pathway Linked with Sporadic Creutzfeldt-Jakob Disease Based on Integrated Bioinformatics Analyses
    medRxiv preprint doi: https://doi.org/10.1101/2020.12.21.20248688; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Identification of potential key genes and pathway linked with sporadic Creutzfeldt-Jakob disease based on integrated bioinformatics analyses Basavaraj Vastrad1, Chanabasayya Vastrad*2 , Iranna Kotturshetti 1. Department of Biochemistry, Basaveshwar College of Pharmacy, Gadag, Karnataka 582103, India. 2. Biostatistics and Bioinformatics, Chanabasava Nilaya, Bharthinagar, Dharwad 580001, Karanataka, India. 3. Department of Ayurveda, Rajiv Gandhi Education Society`s Ayurvedic Medical College, Ron, Karnataka 562209, India. * Chanabasayya Vastrad [email protected] Ph: +919480073398 Chanabasava Nilaya, Bharthinagar, Dharwad 580001 , Karanataka, India NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. medRxiv preprint doi: https://doi.org/10.1101/2020.12.21.20248688; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission. Abstract Sporadic Creutzfeldt-Jakob disease (sCJD) is neurodegenerative disease also called prion disease linked with poor prognosis. The aim of the current study was to illuminate the underlying molecular mechanisms of sCJD. The mRNA microarray dataset GSE124571 was downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) were screened.
    [Show full text]
  • 1 Mutational Heterogeneity in Cancer Akash Kumar a Dissertation
    Mutational Heterogeneity in Cancer Akash Kumar A dissertation Submitted in partial fulfillment of requirements for the degree of Doctor of Philosophy University of Washington 2014 June 5 Reading Committee: Jay Shendure Pete Nelson Mary Claire King Program Authorized to Offer Degree: Genome Sciences 1 University of Washington ABSTRACT Mutational Heterogeneity in Cancer Akash Kumar Chair of the Supervisory Committee: Associate Professor Jay Shendure Department of Genome Sciences Somatic mutation plays a key role in the formation and progression of cancer. Differences in mutation patterns likely explain much of the heterogeneity seen in prognosis and treatment response among patients. Recent advances in massively parallel sequencing have greatly expanded our capability to investigate somatic mutation. Genomic profiling of tumor biopsies could guide the administration of targeted therapeutics on the basis of the tumor’s collection of mutations. Central to the success of this approach is the general applicability of targeted therapies to a patient’s entire tumor burden. This requires a better understanding of the genomic heterogeneity present both within individual tumors (intratumoral) and amongst tumors from the same patient (intrapatient). My dissertation is broadly organized around investigating mutational heterogeneity in cancer. Three projects are discussed in detail: analysis of (1) interpatient and (2) intrapatient heterogeneity in men with disseminated prostate cancer, and (3) investigation of regional intratumoral heterogeneity in
    [Show full text]
  • Whole Exome Sequencing in Families at High Risk for Hodgkin Lymphoma: Identification of a Predisposing Mutation in the KDR Gene
    Hodgkin Lymphoma SUPPLEMENTARY APPENDIX Whole exome sequencing in families at high risk for Hodgkin lymphoma: identification of a predisposing mutation in the KDR gene Melissa Rotunno, 1 Mary L. McMaster, 1 Joseph Boland, 2 Sara Bass, 2 Xijun Zhang, 2 Laurie Burdett, 2 Belynda Hicks, 2 Sarangan Ravichandran, 3 Brian T. Luke, 3 Meredith Yeager, 2 Laura Fontaine, 4 Paula L. Hyland, 1 Alisa M. Goldstein, 1 NCI DCEG Cancer Sequencing Working Group, NCI DCEG Cancer Genomics Research Laboratory, Stephen J. Chanock, 5 Neil E. Caporaso, 1 Margaret A. Tucker, 6 and Lynn R. Goldin 1 1Genetic Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 2Cancer Genomics Research Laboratory, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; 3Ad - vanced Biomedical Computing Center, Leidos Biomedical Research Inc.; Frederick National Laboratory for Cancer Research, Frederick, MD; 4Westat, Inc., Rockville MD; 5Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD; and 6Human Genetics Program, Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA ©2016 Ferrata Storti Foundation. This is an open-access paper. doi:10.3324/haematol.2015.135475 Received: August 19, 2015. Accepted: January 7, 2016. Pre-published: June 13, 2016. Correspondence: [email protected] Supplemental Author Information: NCI DCEG Cancer Sequencing Working Group: Mark H. Greene, Allan Hildesheim, Nan Hu, Maria Theresa Landi, Jennifer Loud, Phuong Mai, Lisa Mirabello, Lindsay Morton, Dilys Parry, Anand Pathak, Douglas R. Stewart, Philip R. Taylor, Geoffrey S. Tobias, Xiaohong R. Yang, Guoqin Yu NCI DCEG Cancer Genomics Research Laboratory: Salma Chowdhury, Michael Cullen, Casey Dagnall, Herbert Higson, Amy A.
    [Show full text]
  • Plasma Cells in Vitro Generation of Long-Lived Human
    Downloaded from http://www.jimmunol.org/ by guest on September 24, 2021 is online at: average * The Journal of Immunology , 32 of which you can access for free at: 2012; 189:5773-5785; Prepublished online 16 from submission to initial decision 4 weeks from acceptance to publication November 2012; doi: 10.4049/jimmunol.1103720 http://www.jimmunol.org/content/189/12/5773 In Vitro Generation of Long-lived Human Plasma Cells Mario Cocco, Sophie Stephenson, Matthew A. Care, Darren Newton, Nicholas A. Barnes, Adam Davison, Andy Rawstron, David R. Westhead, Gina M. Doody and Reuben M. Tooze J Immunol cites 65 articles Submit online. Every submission reviewed by practicing scientists ? is published twice each month by Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts http://jimmunol.org/subscription http://www.jimmunol.org/content/suppl/2012/11/16/jimmunol.110372 0.DC1 This article http://www.jimmunol.org/content/189/12/5773.full#ref-list-1 Information about subscribing to The JI No Triage! Fast Publication! Rapid Reviews! 30 days* Why • • • Material References Permissions Email Alerts Subscription Supplementary The Journal of Immunology The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2012 by The American Association of Immunologists, Inc. All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. This information is current as of September 24, 2021. The Journal of Immunology In Vitro Generation of Long-lived Human Plasma Cells Mario Cocco,*,1 Sophie Stephenson,*,1 Matthew A.
    [Show full text]
  • A Transcriptional Signature of Postmitotic Maintenance in Neural Tissues
    Neurobiology of Aging 74 (2019) 147e160 Contents lists available at ScienceDirect Neurobiology of Aging journal homepage: www.elsevier.com/locate/neuaging Postmitotic cell longevityeassociated genes: a transcriptional signature of postmitotic maintenance in neural tissues Atahualpa Castillo-Morales a,b, Jimena Monzón-Sandoval a,b, Araxi O. Urrutia b,c,*, Humberto Gutiérrez a,** a School of Life Sciences, University of Lincoln, Lincoln, UK b Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK c Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México, Mexico article info abstract Article history: Different cell types have different postmitotic maintenance requirements. Nerve cells, however, are Received 11 April 2018 unique in this respect as they need to survive and preserve their functional complexity for the entire Received in revised form 3 October 2018 lifetime of the organism, and failure at any level of their supporting mechanisms leads to a wide range of Accepted 11 October 2018 neurodegenerative conditions. Whether these differences across tissues arise from the activation of Available online 19 October 2018 distinct cell typeespecific maintenance mechanisms or the differential activation of a common molecular repertoire is not known. To identify the transcriptional signature of postmitotic cellular longevity (PMCL), Keywords: we compared whole-genome transcriptome data from human tissues ranging in longevity from 120 days Neural maintenance Cell longevity to over 70 years and found a set of 81 genes whose expression levels are closely associated with Transcriptional signature increased cell longevity. Using expression data from 10 independent sources, we found that these genes Functional genomics are more highly coexpressed in longer-living tissues and are enriched in specific biological processes and transcription factor targets compared with randomly selected gene samples.
    [Show full text]
  • Region Based Gene Expression Via Reanalysis of Publicly Available Microarray Data Sets
    University of Louisville ThinkIR: The University of Louisville's Institutional Repository Electronic Theses and Dissertations 5-2018 Region based gene expression via reanalysis of publicly available microarray data sets. Ernur Saka University of Louisville Follow this and additional works at: https://ir.library.louisville.edu/etd Part of the Bioinformatics Commons, Computational Biology Commons, and the Other Computer Sciences Commons Recommended Citation Saka, Ernur, "Region based gene expression via reanalysis of publicly available microarray data sets." (2018). Electronic Theses and Dissertations. Paper 2902. https://doi.org/10.18297/etd/2902 This Doctoral Dissertation is brought to you for free and open access by ThinkIR: The University of Louisville's Institutional Repository. It has been accepted for inclusion in Electronic Theses and Dissertations by an authorized administrator of ThinkIR: The University of Louisville's Institutional Repository. This title appears here courtesy of the author, who has retained all other copyrights. For more information, please contact [email protected]. REGION BASED GENE EXPRESSION VIA REANALYSIS OF PUBLICLY AVAILABLE MICROARRAY DATA SETS By Ernur Saka B.S. (CEng), University of Dokuz Eylul, Turkey, 2008 M.S., University of Louisville, USA, 2011 A Dissertation Submitted To the J. B. Speed School of Engineering in Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Computer Science and Engineering Department of Computer Engineering and Computer Science University of Louisville Louisville, Kentucky May 2018 Copyright 2018 by Ernur Saka All rights reserved REGION BASED GENE EXPRESSION VIA REANALYSIS OF PUBLICLY AVAILABLE MICROARRAY DATA SETS By Ernur Saka B.S. (CEng), University of Dokuz Eylul, Turkey, 2008 M.S., University of Louisville, USA, 2011 A Dissertation Approved On April 20, 2018 by the following Committee __________________________________ Dissertation Director Dr.
    [Show full text]
  • Apoptotic Cells Inflammasome Activity During the Uptake of Macrophage
    Downloaded from http://www.jimmunol.org/ by guest on September 29, 2021 is online at: average * The Journal of Immunology , 26 of which you can access for free at: 2012; 188:5682-5693; Prepublished online 20 from submission to initial decision 4 weeks from acceptance to publication April 2012; doi: 10.4049/jimmunol.1103760 http://www.jimmunol.org/content/188/11/5682 Complement Protein C1q Directs Macrophage Polarization and Limits Inflammasome Activity during the Uptake of Apoptotic Cells Marie E. Benoit, Elizabeth V. Clarke, Pedro Morgado, Deborah A. Fraser and Andrea J. Tenner J Immunol cites 56 articles Submit online. Every submission reviewed by practicing scientists ? is published twice each month by Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts http://jimmunol.org/subscription http://www.jimmunol.org/content/suppl/2012/04/20/jimmunol.110376 0.DC1 This article http://www.jimmunol.org/content/188/11/5682.full#ref-list-1 Information about subscribing to The JI No Triage! Fast Publication! Rapid Reviews! 30 days* Why • • • Material References Permissions Email Alerts Subscription Supplementary The Journal of Immunology The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2012 by The American Association of Immunologists, Inc. All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. This information is current as of September 29, 2021. The Journal of Immunology Complement Protein C1q Directs Macrophage Polarization and Limits Inflammasome Activity during the Uptake of Apoptotic Cells Marie E.
    [Show full text]
  • Open Data for Differential Network Analysis in Glioma
    International Journal of Molecular Sciences Article Open Data for Differential Network Analysis in Glioma , Claire Jean-Quartier * y , Fleur Jeanquartier y and Andreas Holzinger Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, 8036 Graz, Austria; [email protected] (F.J.); [email protected] (A.H.) * Correspondence: [email protected] These authors contributed equally to this work. y Received: 27 October 2019; Accepted: 3 January 2020; Published: 15 January 2020 Abstract: The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes.
    [Show full text]
  • Downloaded 10 April 2020)
    Breeze et al. Genome Medicine (2021) 13:74 https://doi.org/10.1186/s13073-021-00877-z RESEARCH Open Access Epigenome-wide association study of kidney function identifies trans-ethnic and ethnic-specific loci Charles E. Breeze1,2,3* , Anna Batorsky4, Mi Kyeong Lee5, Mindy D. Szeto6, Xiaoguang Xu7, Daniel L. McCartney8, Rong Jiang9, Amit Patki10, Holly J. Kramer11,12, James M. Eales7, Laura Raffield13, Leslie Lange6, Ethan Lange6, Peter Durda14, Yongmei Liu15, Russ P. Tracy14,16, David Van Den Berg17, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium, TOPMed MESA Multi-Omics Working Group, Kathryn L. Evans8, William E. Kraus15,18, Svati Shah15,18, Hermant K. Tiwari10, Lifang Hou19,20, Eric A. Whitsel21,22, Xiao Jiang7, Fadi J. Charchar23,24,25, Andrea A. Baccarelli26, Stephen S. Rich27, Andrew P. Morris28, Marguerite R. Irvin29, Donna K. Arnett30, Elizabeth R. Hauser15,31, Jerome I. Rotter32, Adolfo Correa33, Caroline Hayward34, Steve Horvath35,36, Riccardo E. Marioni8, Maciej Tomaszewski7,37, Stephan Beck2, Sonja I. Berndt1, Stephanie J. London5, Josyf C. Mychaleckyj27 and Nora Franceschini21* Abstract Background: DNA methylation (DNAm) is associated with gene regulation and estimated glomerular filtration rate (eGFR), a measure of kidney function. Decreased eGFR is more common among US Hispanics and African Americans. The causes for this are poorly understood. We aimed to identify trans-ethnic and ethnic-specific differentially methylated positions (DMPs) associated with eGFR using an agnostic, genome-wide approach. Methods: The study included up to 5428 participants from multi-ethnic studies for discovery and 8109 participants for replication. We tested the associations between whole blood DNAm and eGFR using beta values from Illumina 450K or EPIC arrays.
    [Show full text]
  • Genome-Wide Prediction and Prioritization of Human Aging Genes by Data Fusion: a Machine Learning Approach
    Arabfard et al. BMC Genomics (2019) 20:832 https://doi.org/10.1186/s12864-019-6140-0 RESEARCH ARTICLE Open Access Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach Masoud Arabfard1,2, Mina Ohadi3*, Vahid Rezaei Tabar4, Ahmad Delbari3 and Kaveh Kavousi2* Abstract Background: Machine learning can effectively nominate novel genes for various research purposes in the laboratory. On a genome-wide scale, we implemented multiple databases and algorithms to predict and prioritize the human aging genes (PPHAGE). Results: We fused data from 11 databases, and used Naïve Bayes classifier and positive unlabeled learning (PUL) methods, NB, Spy, and Rocchio-SVM, to rank human genes in respect with their implication in aging. The PUL methods enabled us to identify a list of negative (non-aging) genes to use alongside the seed (known age-related) genes in the ranking process. Comparison of the PUL algorithms revealed that none of the methods for identifying a negative sample were advantageous over other methods, and their simultaneous use in a form of fusion was critical for obtaining optimal results (PPHAGE is publicly available at https://cbb.ut.ac.ir/pphage). Conclusion: We predict and prioritize over 3,000 candidate age-related genes in human, based on significant ranking scores. The identified candidate genes are associated with pathways, ontologies, and diseases that are linked to aging, such as cancer and diabetes. Our data offer a platform for future experimental research on the genetic and biological aspects of aging. Additionally, we demonstrate that fusion of PUL methods and data sources can be successfully used for aging and disease candidate gene prioritization.
    [Show full text]
  • Understanding and Improving the Identification of Somatic Variants
    Understanding and Improving the Identification of Somatic Variants Vinaya Vijayan Dissertation submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Doctor of Philosophy In Genetics, Bioinformatics and Computational Biology Chair: Liqing Zhang Christopher Franck Lenwood S. Heath Xiaowei Wu August 12th, 2016 Blacksburg, VA, USA Keywords: Somatic variants, Somatic variant callers, Somatic point mutations, Short tandem repeat variation, Lung squamous cell carcinoma Understanding and Improving the Identification of Somatic Variants Vinaya Vijayan ABSTRACT It is important to understand the entire spectrum of somatic variants to gain more insight into mutations that occur in different cancers for development of better diagnostic, prognostic and therapeutic tools. This thesis outlines our work in understanding somatic variant calling, improving the identification of somatic variants from whole genome and whole exome platforms and identification of biomarkers for lung cancer. Integrating somatic variants from whole genome and whole exome platforms poses a challenge as variants identified in the exonic regions of the whole genome platform may not be identified on the whole exome platform and vice-versa. Taking a simple union or intersection of the somatic variants from both platforms would lead to inclusion of many false positives (through union) and exclusion of many true variants (through intersection). We develop the first framework to improve the identification of somatic variants on whole genome and exome platforms using a machine learning approach by combining the results from two popular somatic variant callers. Testing on simulated and real data sets shows that our framework identifies variants more accurately than using only one somatic variant caller or using variants from only one platform.
    [Show full text]