Robust Hypergraph Regularized Non-Negative Matrix Factorization for Sample Clustering and Feature Selection in Multi-View Gene E

Total Page:16

File Type:pdf, Size:1020Kb

Robust Hypergraph Regularized Non-Negative Matrix Factorization for Sample Clustering and Feature Selection in Multi-View Gene E Yu et al. Human Genomics 2019, 13(Suppl 1):46 https://doi.org/10.1186/s40246-019-0222-6 RESEARCH Open Access Robust hypergraph regularized non- negative matrix factorization for sample clustering and feature selection in multi- view gene expression data Na Yu1, Ying-Lian Gao2, Jin-Xing Liu1*, Juan Wang1* and Junliang Shang1 From IEEE International Conference on Bioinformatics and Biomedicine 2018 Madrid, Spain. 3-6 December 2018 Abstract Background: As one of the most popular data representation methods, non-negative matrix decomposition (NMF) has been widely concerned in the tasks of clustering and feature selection. However, most of the previously proposed NMF-based methods do not adequately explore the hidden geometrical structure in the data. At the same time, noise and outliers are inevitably present in the data. Results: To alleviate these problems, we present a novel NMF framework named robust hypergraph regularized non-negative matrix factorization (RHNMF). In particular, the hypergraph Laplacian regularization is imposed to capture the geometric information of original data. Unlike graph Laplacian regularization which captures the relationship between pairwise sample points, it captures the high-order relationship among more sample points. Moreover, the robustness of the RHNMF is enhanced by using the L2,1-norm constraint when estimating the residual. This is because the L2,1-norm is insensitive to noise and outliers. Conclusions: Clustering and common abnormal expression gene (com-abnormal expression gene) selection are conducted to test the validity of the RHNMF model. Extensive experimental results on multi-view datasets reveal that our proposed model outperforms other state-of-the-art methods. Keywords: Non-negative matrix decomposition, Hypergraph Laplacian, L2,1-norm, Clustering, Common abnormal gene selection, Multi-view gene expression data Background well known that gene expression data can be down- Due to the development of sequencing technology, loaded from The Cancer Genome Atlas (TCGA) plat- more and more gene expression data have been de- form. We then integrated the gene expression data tected. At the same time, there are many meaningful into multi-view data for different diseases with the biological information in gene expression data. The same genes. Multi-view data will provide a new per- effective analysis and research of this information are spective to mine the connections between multiple of great significance to the prevention and treatment cancers. of diseases. And multi-view data obtained by integrat- To meet the demand for studying explosive gene ing data from different sources have gained much expression data, modern biologists are increasingly attention in the field of machine learning [1]. It is concerned with clustering and feature selection. Clustering is the process of dividing a series of genes * Correspondence: [email protected]; [email protected] or samples into different subsets, and the genes or 1School of Information Science and Engineering, Qufu Normal University, Rizhao 276826, China samples in the same subset are similar [2]. Generally Full list of author information is available at the end of the article speaking, feature selection can not only find useful © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Yu et al. Human Genomics 2019, 13(Suppl 1):46 Page 2 of 10 information and eliminate noise, but also reduce the influences of noise and outliers. The main contributions complexity of the computation. In this paper, we of RHNMF are summarized as follows: performed the selection of com-abnormal genes to study the relationship between genes and multiple (i) To capture high-order relationship between more cancers [3]. sample points, hypergraph regularization is applied As an effective matrix decomposition method, to the objective function. This makes sense for non-negative matrix factorization (NMF) [4]is enhancing the performance of NMF-based widely prevalent in bioinformatics [5], image repre- methods. sentation [6], and other fields [7]. NMF can learn (ii) The L2, 1-norm instead of the Frobenius norm is part-based representations of objects. This is consist- used to estimate the residual approximation, so that ent with the human brain’s perception mechanism. the error term for each data point is no longer Some extensions to NMF have been proposed from squared form. This will greatly suppress the effects different perspectives. For example, the non-negative of noise and outliers. And L2, 1-norm is suitable for local coordinate factorization (NLCF) was presented clustering and feature selection because it produces by imposing the locality coordinate constraint into sparse rows. the original NMF [8]. Kim et al. presented the sparse (iii)Scientific and comprehensive experiments are non-negative matrix factorization (NMFs) method in designed on the multi-view datasets to prove the combination with sparse constraints [9]. In practical effectiveness of the RHNMF and achieved applications, the data are sometimes negative, so semi- satisfactory results. non-negative matrix factorization (Semi-NMF) and convex non-negative matrix factorization (Convex-NMF) The rest of the paper is arranged as follows. In the are derived to solve the problem of positive and negative “Methods” section, we introduce the NMF, L2, 1- data [10]. norm, and hypergraph regularization. The proposed As we mentioned above, these methods have enhanced RHNMF method, the solution process, its conver- the performance of NMF, but there also exist the follow- gence, and computational complexity analysis are ing limitations: (1) In fact, there is an intrinsic geomet- also described in detail. Experimental results are rical information in the high-dimensional data. But these demonstrated in the section “Results and discussion.” methods ignore the nonlinear low-dimensional geomet- The conclusion is drawn in the section rical structure in the original data. (2) There are always “Conclusions.” noise and outliers in real data. Therefore, we need a ro- bust NMF-based approach to effectively suppress noise Methods and outliers. Non-negative matrix factorization For the first question, the graph regularized non-nega- In the field of bioinformatics, gene expression data tive matrix factorization (GNMF) was presented to dis- are usually expressed in the form of a matrix. The cover the manifold structure of raw data [11]. However, sample is represented by a column of matrices, and graph regularization is based on constructing k-nearest the level of gene expression is represented by the neighbors in a simple graph, which explores only the rows of the matrix. Given a data matrix X =[x1, x2, pairwise relationship between two sample points. Zeng m × n …, xn] ∈ R , the column vector xj isasamplevec- et al. introduced hypergraph regularized non-negative tor. NMF aims at finding two non-negative matrices matrix factorization (HNMF) to encode the relationship m × k k × n U =[u1, u2, …, uk] ∈ R and V =[v1, v2, …, vn] ∈ R between two or more than two sample points [12]. Un- whose products are similar to the data matrix X [14]. like simple graphs, the hyperedge of a hypergraph con- U represents a basis matrix, and V represents a coef- tains a series of related vertices. Therefore, high-order ficient matrix. The minimizing objective function of relationship of the data can be found. GNMF and the NMF is as below: HNMF consider important manifold information, but they are exceptionally sensitive to noise and outliers. For Xn 2 2 the second problem, using the L2, 1-norm when estimat- min kkX−UV ¼ x −Uv s:t:U≥0; V≥0; U;V F j j ing the residual can be effectively alleviated [13]. j¼1 Inspired by these work, this paper presents a novel ð1Þ NMF model called robust hypergraph regularized non- negative matrix factorization (RHNMF). It adds hyper- graph regularization and L2, 1-norm to the traditional where ‖⋅‖F represents the Frobenius norm of matrix. xj NMF. So it has the advantage of considering the higher- can be seen as a linear combination of columns of U, pa- order relationship among samples and controlling the rameterized by each column of V [15]. Yu et al. Human Genomics 2019, 13(Suppl 1):46 Page 3 of 10 L2, 1-norm represents the hypergraph’s incidence matrix. Then, m × n Given any matrix X ∈ R ,the‖X‖2, 1 is to first we can calculate it as below: calculate L -norm for rows to form a column matrix, 2 ∈ ; and then calculate L -norm for column matrix [13], HðÞ¼; 1 if v e ð Þ 1 v e : 3 i.e., 0 otherwise vffiffiffiffiffiffiffiffiffiffiffiffiffiffi u Xm uXn For any hyperedge ei, its weight Wi is denoted as kkX ¼ t x2 2;1 i; j follows: i¼1 j¼1 m X Wi ¼ WðÞei i ! ¼ x X 2 2 vi−v j i¼1 ¼ − 2 ; exp δ ∈ ð2Þ v j ei ð4Þ . As shown above, L -norm will cause row sparsity X 2, 1 1 2 where δ ¼ kvi−v jk , k represents the value of k [16]. At the same time, the L2, 1-norm is not susceptible k 2 v j∈ei to noise and outliers, so the robustness of the algorithm nearest neighbors for each vertex. d(v) represents the can be improved. degree of vertex v and is expressed as follows: X Hypergraph regularization dvðÞ¼ weðÞHðÞv; e : ð5Þ Inspired by the simple graph theory, hypergraph e∈E came into being [17]. In the sample graph, one edge And the degree of each hyperedge can be denoted as: is connected by two data samples and the weight of the edge denotes the pairwise relationship between X ðÞ¼ HðÞ; : ð Þ two sample points [11].
Recommended publications
  • The Future of Dermatopathology
    Modern Pathology (2006) 19, S155–S163 & 2006 USCAP, Inc All rights reserved 0893-3952/06 $30.00 www.modernpathology.org The future of dermatopathology A Neil Crowson Departments of Dermatology, Pathology, and Surgery, University of Oklahoma and Regional Medical Laboratory, St John Medical Center, Tulsa, OK, USA Of the major issues that dermatopathology will face in the immediate future, two powerful challenges loom large. The first is the application of novel nondestructive imaging technologies to in vivo diagnosis in humans. The second is the application of molecular technologies to a diagnostic arena which formerly belonged exclusively to the light microscopist. The first to be considered in this context is the application of near infrared spectroscopy to the noninvasive in vivo diagnosis of neoplastic skin disease. The second will be a discussion of application, methodology and the current state of the art in microarray technologies as they apply to neoplastic dermatopathology and, in particular, the diagnosis and prognostication of melanoma. Modern Pathology (2006) 19, S155–S163. doi:10.1038/modpathol.3800513 Keywords: in vivo microscope; tissue microarray; basal cell carcinoma; infrared spectroscopy Noninvasive assessment of skin lesions keratoses and squamous cell carcinomata in vitro. by near infrared (IR) spectroscopy Melanocytic nevi could be subdivided into banal vs dysplastic nevi based upon their spectral differences In the late 1990s, working with Dr Laura McIntosh and melanomas could be separately recognized as and colleagues at the National Research Council of well. Furthermore, the different types of lesion were Canada, the University of Manitoba, Central Medical shown to have distinct mid-IR signatures when Laboratories and the Misericordia General Hospital compared to adjacent normal epidermal and dermal in Winnipeg, Canada, we designed and patented a compartments.
    [Show full text]
  • Deer Thymosin Beta 10 Functions As a Novel Factor for Angiogenesis And
    Zhang et al. Stem Cell Research & Therapy (2018) 9:166 https://doi.org/10.1186/s13287-018-0917-y RESEARCH Open Access Deer thymosin beta 10 functions as a novel factor for angiogenesis and chondrogenesis during antler growth and regeneration Wei Zhang1,2†, Wenhui Chu1,2†, Qingxiu Liu1,2†, Dawn Coates3, Yudong Shang1,2 and Chunyi Li1,2* Abstract Background: Deer antlers are the only known mammalian organ with vascularized cartilage that can completely regenerate. Antlers are of real significance as a model of mammalian stem cell-based regeneration with particular relevance to the fields of chondrogenesis, angiogenesis, and regenerative medicine. Recent research found that thymosin beta 10 (TMSB10) is highly expressed in the growth centers of growing antlers. The present study reports here the expression, functions, and molecular interactions of deer TMSB10. Methods: The TMSB10 expression level in both tissue and cells in the antler growth center was measured. The effects of both exogenous (synthetic protein) and endogenous deer TMSB10 (lentivirus-based overexpression) on antlerogenic periosteal cells (APCs; nonactivated antler stem cells with no basal expression of TMSB10) and human umbilical vein endothelial cells (HUVECs; endothelial cells with no basal expression of TMSB10) were evaluated to determine whether TMSB10 functions on chondrogenesis and angiogenesis. Differences in deer and human TMSB10 in angiogenesis and molecular structure were determined using animal models and molecular dynamics simulation, respectively. The molecular mechanisms underlying deer TMSB10 in promoting angiogenesis were also explored. Results: Deer TMSB10 was identified as a novel proangiogenic factor both in vitro and in vivo. Immunohistochemistry revealed that TMSB10 was widely expressed in the antler growth center in situ, with the highest expression in the reserve mesenchyme, precartilage, and transitional zones.
    [Show full text]
  • RNA Sequencing Reveals the Alteration of the Expression of Novel Genes in Ethanol-Treated Embryoid Bodies
    RESEARCH ARTICLE RNA Sequencing Reveals the Alteration of the Expression of Novel Genes in Ethanol-Treated Embryoid Bodies Chanchal Mandal1, Sun Hwa Kim1, Jin Choul Chai1, Seon Mi Oh1, Young Seek Lee1, Kyoung Hwa Jung2*, Young Gyu Chai1,3* 1 Department of Molecular and Life Science, Hanyang University, Ansan, Republic of Korea, 2 Institute of Natural Science and Technology, Hanyang University, Ansan, Republic of Korea, 3 Department of Bionanotechnology, Hanyang University, Seoul, Republic of Korea * [email protected] (YGC); [email protected] (KHJ) Abstract Fetal alcohol spectrum disorder is a collective term representing fetal abnormalities associ- OPEN ACCESS ated with maternal alcohol consumption. Prenatal alcohol exposure and related anomalies are well characterized, but the molecular mechanism behind this phenomenon is not well Citation: Mandal C, Kim SH, Chai JC, Oh SM, Lee characterized. In this present study, our aim is to profile important genes that regulate cellu- YS, Jung KH, et al. (2016) RNA Sequencing Reveals the Alteration of the Expression of Novel Genes in lar development during fetal development. Human embryonic carcinoma cells (NCCIT) are Ethanol-Treated Embryoid Bodies. PLoS ONE 11(3): cultured to form embryoid bodies and then treated in the presence and absence of ethanol e0149976. doi:10.1371/journal.pone.0149976 (50 mM). We employed RNA sequencing to profile differentially expressed genes in the eth- Editor: Shihui Yang, National Renewable Energy anol-treated embryoid bodies from NCCIT vs. EB, NCCIT vs. EB+EtOH and EB vs. EB Lab, UNITED STATES +EtOH data sets. A total of 632, 205 and 517 differentially expressed genes were identified Received: July 13, 2015 from NCCIT vs.
    [Show full text]
  • The Human Gene Connectome As a Map of Short Cuts for Morbid Allele Discovery
    The human gene connectome as a map of short cuts for morbid allele discovery Yuval Itana,1, Shen-Ying Zhanga,b, Guillaume Vogta,b, Avinash Abhyankara, Melina Hermana, Patrick Nitschkec, Dror Friedd, Lluis Quintana-Murcie, Laurent Abela,b, and Jean-Laurent Casanovaa,b,f aSt. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065; bLaboratory of Human Genetics of Infectious Diseases, Necker Branch, Paris Descartes University, Institut National de la Santé et de la Recherche Médicale U980, Necker Medical School, 75015 Paris, France; cPlateforme Bioinformatique, Université Paris Descartes, 75116 Paris, France; dDepartment of Computer Science, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel; eUnit of Human Evolutionary Genetics, Centre National de la Recherche Scientifique, Unité de Recherche Associée 3012, Institut Pasteur, F-75015 Paris, France; and fPediatric Immunology-Hematology Unit, Necker Hospital for Sick Children, 75015 Paris, France Edited* by Bruce Beutler, University of Texas Southwestern Medical Center, Dallas, TX, and approved February 15, 2013 (received for review October 19, 2012) High-throughput genomic data reveal thousands of gene variants to detect a single mutated gene, with the other polymorphic genes per patient, and it is often difficult to determine which of these being of less interest. This goes some way to explaining why, variants underlies disease in a given individual. However, at the despite the abundance of NGS data, the discovery of disease- population level, there may be some degree of phenotypic homo- causing alleles from such data remains somewhat limited. geneity, with alterations of specific physiological pathways under- We developed the human gene connectome (HGC) to over- come this problem.
    [Show full text]
  • Dissecting Heterogeneous Cell Populations Across Drug and Disease Conditions with Popalign
    Dissecting heterogeneous cell populations across drug and disease conditions with PopAlign Sisi Chena,b,1 , Paul Rivauda,b , Jong H. Parka,b, Tiffany Tsoua,b , Emeric Charlesc, John R. Haliburtond, Flavia Pichiorrie, and Matt Thomsona,b,1 aDivision of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125; bBeckman Center for Single-Cell Profiling and Engineering, California Institute of Technology, Pasadena, CA 91125; cDepartment of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720; dAugmenta Bioworks Inc., Menlo Park, CA 94025; and eDepartment of Hematologic Malignancies Translational Science, City of Hope, Monrovia, CA 91016 Edited by Jonathan S. Weissman, University of California, San Francisco, CA, and approved September 25, 2020 (received for review March 31, 2020) Single-cell measurement techniques can now probe gene expres- The key conceptual advance underlying PopAlign is repre- sion in heterogeneous cell populations from the human body sentational: we model the distribution of gene-expression states across a range of environmental and physiological conditions. within a heterogeneous cell population using a probabilistic However, new mathematical and computational methods are mixture model that we infer from single-cell data. PopAlign required to represent and analyze gene-expression changes that identifies and represents subpopulations of cells as indepen- occur in complex mixtures of single cells as they respond to sig- dent Gaussian densities within a reduced
    [Show full text]
  • Robust Sampling of Altered Pathways for Drug Repositioning Reveals
    Fernández-Martínez JL, Álvarez O, De Andrés EJ, de la Viña JFS, Huergo L. Robust Sam- Journal of pling of Altered Pathways for Drug Repositioning Reveals Promising Novel Therapeutics for Rare Diseases Research Inclusion Body Myositis. J Rare Dis Res Treat. (2019) 4(2): 7-15 & Treatment www.rarediseasesjournal.com Research Article Open Access Robust Sampling of Altered Pathways for Drug Repositioning Reveals Promising Novel Therapeutics for Inclusion Body Myositis Juan Luis Fernández-Martínez*, Oscar Álvarez, Enrique J. DeAndrés-Galiana, Javier Fernández-Sánchez de la Viña, Leticia Huergo Group of Inverse Problems, Optimization and Machine Learning. Department of Mathematics. University of Oviedo, Oviedo, 33007, Asturias, Spain. Article Info ABSTRACT Article Notes In this paper we present a robust methodology to deal with phenotype Received: January 28, 2019 prediction problems associated to drug repositioning in rare diseases, which Accepted: April 3, 2019 is based on the robust sampling of altered pathways. We show the application *Correspondence: to the analysis of IBM (Inclusion Body Myositis) providing new insights about Dr. Juan Luis Fernández-Martínez, Group of Inverse the mechanisms involved in its development: cytotoxic CD8 T cell-mediated Problems, Optimization and Machine Learning. Department immune response and pathogenic protein accumulation in myofibrils related of Mathematics. University of Oviedo, Oviedo, 33007, to the proteasome inhibition. The originality of this methodology consists of Asturias, Spain; Email: [email protected]. performing a robust and deep sampling of the altered pathways and relating © 2019 Fernández-Martínez JL. This article is distributed under these results to possible compounds via the connectivity map paradigm. the terms of the Creative Commons Attribution 4.0 International The methodology is particularly well-suited for the case of rare diseases License.
    [Show full text]
  • Program in Human Neutrophils Fails To
    Downloaded from http://www.jimmunol.org/ by guest on September 25, 2021 is online at: average * The Journal of Immunology Anaplasma phagocytophilum , 20 of which you can access for free at: 2005; 174:6364-6372; ; from submission to initial decision 4 weeks from acceptance to publication J Immunol doi: 10.4049/jimmunol.174.10.6364 http://www.jimmunol.org/content/174/10/6364 Insights into Pathogen Immune Evasion Mechanisms: Fails to Induce an Apoptosis Differentiation Program in Human Neutrophils Dori L. Borjesson, Scott D. Kobayashi, Adeline R. Whitney, Jovanka M. Voyich, Cynthia M. Argue and Frank R. DeLeo cites 28 articles Submit online. Every submission reviewed by practicing scientists ? is published twice each month by Receive free email-alerts when new articles cite this article. Sign up at: http://jimmunol.org/alerts http://jimmunol.org/subscription Submit copyright permission requests at: http://www.aai.org/About/Publications/JI/copyright.html http://www.jimmunol.org/content/suppl/2005/05/03/174.10.6364.DC1 This article http://www.jimmunol.org/content/174/10/6364.full#ref-list-1 Information about subscribing to The JI No Triage! Fast Publication! Rapid Reviews! 30 days* • Why • • Material References Permissions Email Alerts Subscription Supplementary The Journal of Immunology The American Association of Immunologists, Inc., 1451 Rockville Pike, Suite 650, Rockville, MD 20852 Copyright © 2005 by The American Association of Immunologists All rights reserved. Print ISSN: 0022-1767 Online ISSN: 1550-6606. This information is current as of September 25, 2021. The Journal of Immunology Insights into Pathogen Immune Evasion Mechanisms: Anaplasma phagocytophilum Fails to Induce an Apoptosis Differentiation Program in Human Neutrophils1 Dori L.
    [Show full text]
  • A Genomic View of Estrogen Actions in Human Breast Cancer Cells by Expression Profiling of the Hormone-Responsive Transcriptome
    719 A genomic view of estrogen actions in human breast cancer cells by expression profiling of the hormone-responsive transcriptome Luigi Cicatiello1, Claudio Scafoglio1, Lucia Altucci1, Massimo Cancemi1, Guido Natoli1, Angelo Facchiano2, Giovanni Iazzetti3, Raffaele Calogero4, Nicoletta Biglia6, Michele De Bortoli5,7, Christian Sfiligoi7, Piero Sismondi6,7, Francesco Bresciani1 and Alessandro Weisz1 1Dipartimento di Patologia generale, Seconda Università degli Studi di Napoli, Vico L. De Crecchio 7, 80138 Napoli, Italy 2Istituto di Scienze dell’Alimentazione del Consiglio Nazionale delle Ricerche, Avellino, Italy 3Dipartimento di Genetica, Biologia generale e molecolare, Università di Napoli ‘Federico II’, Napoli, Italy 4Dipartimento di Scienze cliniche e biologiche, Università degli Studi di Torino, Torino, Italy 5Dipartimento di Scienze oncologiche, Università degli Studi di Torino, Torino, Italy 6Dipartimento di Discipline ostetriche e ginecologiche, Università degli Studi di Torino, Torino, Italy 7Laboratorio di Ginecologia oncologica, Istituto per la Ricerca e la Cura del Cancro, Candiolo, Italy (Requests for offprints should be addressed to A Weisz; Email: [email protected]) Abstract Estrogen controls key cellular functions of responsive cells including the ability to survive, replicate, communicate and adapt to the extracellular milieu. Changes in the expression of 8400 genes were monitored here by cDNA microarray analysis during the first 32 h of human breast cancer (BC) ZR-75·1 cell stimulation with a mitogenic dose of 17-estradiol, a timing which corresponds to completion of a full mitotic cycle in hormone-stimulated cells. Hierarchical clustering of 344 genes whose expression either increases or decreases significantly in response to estrogen reveals that the gene expression program activated by the hormone in these cells shows 8 main patterns of gene activation/inhibition.
    [Show full text]
  • Novel and Highly Recurrent Chromosomal Alterations in Se´Zary Syndrome
    Research Article Novel and Highly Recurrent Chromosomal Alterations in Se´zary Syndrome Maarten H. Vermeer,1 Remco van Doorn,1 Remco Dijkman,1 Xin Mao,3 Sean Whittaker,3 Pieter C. van Voorst Vader,4 Marie-Jeanne P. Gerritsen,5 Marie-Louise Geerts,6 Sylke Gellrich,7 Ola So¨derberg,8 Karl-Johan Leuchowius,8 Ulf Landegren,8 Jacoba J. Out-Luiting,1 Jeroen Knijnenburg,2 Marije IJszenga,2 Karoly Szuhai,2 Rein Willemze,1 and Cornelis P. Tensen1 Departments of 1Dermatology and 2Molecular Cell Biology, Leiden University Medical Center, Leiden, the Netherlands; 3Department of Dermatology, St Thomas’ Hospital, King’s College, London, United Kingdom; 4Department of Dermatology, University Medical Center Groningen, Groningen, the Netherlands; 5Department of Dermatology, Radboud University Nijmegen Medical Center, Nijmegen, the Netherlands; 6Department of Dermatology, Gent University Hospital, Gent, Belgium; 7Department of Dermatology, Charite, Berlin, Germany; and 8Department of Genetics and Pathology, Rudbeck Laboratory, University of Uppsala, Uppsala, Sweden Abstract Introduction This study was designed to identify highly recurrent genetic Se´zary syndrome (Sz) is an aggressive type of cutaneous T-cell alterations typical of Se´zary syndrome (Sz), an aggressive lymphoma/leukemia of skin-homing, CD4+ memory T cells and is cutaneous T-cell lymphoma/leukemia, possibly revealing characterized by erythroderma, generalized lymphadenopathy, and pathogenetic mechanisms and novel therapeutic targets. the presence of neoplastic T cells (Se´zary cells) in the skin, lymph High-resolution array-based comparative genomic hybridiza- nodes, and peripheral blood (1). Sz has a poor prognosis, with a tion was done on malignant T cells from 20 patients. disease-specific 5-year survival of f24% (1).
    [Show full text]
  • Transcriptome Analysis of Ageing in Uninjured Human Achilles Tendon
    Peffers et al. Arthritis Research & Therapy (2015) 17:33 DOI 10.1186/s13075-015-0544-2 RESEARCH ARTICLE Open Access Transcriptome analysis of ageing in uninjured human Achilles tendon Mandy Jayne Peffers1*, Yongxiang Fang2, Kathleen Cheung3, Tim Koh Jia Wei4, Peter David Clegg1 and Helen Lucy Birch5 Abstract Introduction: The risk of tendon injury and disease increases significantly with increasing age. The aim of the study was to characterise transcriptional changes in human Achilles tendon during the ageing process in order to identify molecular signatures that might contribute to age-related degeneration. Methods: RNA for gene expression analysis using RNA-Seq and quantitative real-time polymerase chain reaction analysis was isolated from young and old macroscopically normal human Achilles tendon. RNA sequence libraries were prepared following ribosomal RNA depletion, and sequencing was undertaken by using the Illumina HiSeq 2000 platform. Expression levels among genes were compared by using fragments per kilobase of exon per million fragments mapped. Differentially expressed genes were defined by using Benjamini-Hochberg false discovery rate approach (P <0.05, expression ratios 1.4 log2 fold change). Alternative splicing of exon variants were also examined by using Cufflinks. The functional significance of genes that showed differential expression between young and old tendon was determined by using ingenuity pathway analysis. Results: In total, the expression of 325 transcribed elements, including protein-coding transcripts and non-coding transcripts (small non-coding RNAs, pseudogenes, long non-coding RNAs and a single microRNA), was significantly different in old compared with young tendon (±1.4 log2 fold change, P <0.05). Of these, 191 were at higher levels in older tendon and 134 were at lower levels in older tendon.
    [Show full text]
  • Molecular Signatures Associated with HCV-Induced Hepatocellular Carcinoma and Liver Metastasis
    Molecular Signatures Associated with HCV-Induced Hepatocellular Carcinoma and Liver Metastasis Valeria De Giorgi2*, Luigi Buonaguro1, Andrea Worschech3, Maria Lina Tornesello1, Francesco Izzo4, Francesco M. Marincola2, Ena Wang2, Franco M. Buonaguro1* 1 Molecular Biology and Viral Oncogenesis and AIDS Refer. Center, Ist. Naz. Tumori "Fond. G. Pascale", Naples, Italy, 2 Infectious Disease and Immunogenetics Section (IDIS), Department of Transfusion Medicine, Clinical Center and Trans-NIH Center for Human Immunology (CHI), National Institutes of Health, Bethesda, Maryland, United States of America, 3 Department of Biochemistry, Biocenter, University of Wuerzburg, Am Hubland, Wuerzburg, Germany, 4 Hepato-biliary Surgery Department, Ist. Naz. Tumori "Fond. G. Pascale", Naples, Italy Abstract Hepatocellular carcinomas (HCCs) are a heterogeneous group of tumors that differ in risk factors and genetic alterations. In Italy, particularly Southern Italy, chronic hepatitis C virus (HCV) infection represents the main cause of HCC. Using high- density oligoarrays, we identified consistent differences in gene-expression between HCC and normal liver tissue. Expression patterns in HCC were also readily distinguishable from those associated with liver metastases. To characterize molecular events relevant to hepatocarcinogenesis and identify biomarkers for early HCC detection, gene expression profiling of 71 liver biopsies from HCV-related primary HCC and corresponding HCV-positive non-HCC hepatic tissue, as well as gastrointestinal liver metastases paired
    [Show full text]
  • Dissecting Transcriptomic Signatures of Neuronal Differentiation and Maturation Using Ipscs
    ARTICLE https://doi.org/10.1038/s41467-019-14266-z OPEN Dissecting transcriptomic signatures of neuronal differentiation and maturation using iPSCs Emily E. Burke1,11, Joshua G. Chenoweth1,11, Joo Heon Shin1, Leonardo Collado-Torres 1, Suel-Kee Kim1, Nicola Micali1, Yanhong Wang 1, Carlo Colantuoni1, Richard E. Straub1, Daniel J. Hoeppner1, Huei-Ying Chen1, Alana Sellers1, Kamel Shibbani1, Gregory R. Hamersky1, Marcelo Diaz Bustamante1, BaDoi N. Phan 1, William S. Ulrich1, Cristian Valencia1, Amritha Jaishankar1, Amanda J. Price 1,2, Anandita Rajpurohit1, Stephen A. Semick1, Roland W. Bürli3, James C. Barrow1, Daniel J. Hiler1, Stephanie C. Page1, Keri Martinowich1,4,5, Thomas M. Hyde1,4,6, Joel E. Kleinman1,4, Karen F. Berman7, Jose A. Apud7, Alan J. Cross8, 8 1,2,4,5,6 1,4,5 1 1234567890():,; Nicholas J. Brandon , Daniel R. Weinberger , Brady J. Maher , Ronald D.G. McKay *& Andrew E. Jaffe 1,2,4,5,9,10* Human induced pluripotent stem cells (hiPSCs) are a powerful model of neural differentiation and maturation. We present a hiPSC transcriptomics resource on corticogenesis from 5 iPSC donor and 13 subclonal lines across 9 time points over 5 broad conditions: self-renewal, early neuronal differentiation, neural precursor cells (NPCs), assembled rosettes, and differentiated neuronal cells. We identify widespread changes in the expression of both individual features and global patterns of transcription. We next demonstrate that co-culturing human NPCs with rodent astrocytes results in mutually synergistic maturation, and that cell type-specific expression data can be extracted using only sequencing read alignments without cell sorting. We lastly adapt a previously generated RNA deconvolution approach to single-cell expres- sion data to estimate the relative neuronal maturity of iPSC-derived neuronal cultures and human brain tissue.
    [Show full text]