The Bioassay Network and Its Implications to Future Therapeutic Discovery

Total Page:16

File Type:pdf, Size:1020Kb

The Bioassay Network and Its Implications to Future Therapeutic Discovery Zhang et al. BMC Bioinformatics 2011, 12(Suppl 5):S1 http://www.biomedcentral.com/1471-2105/12/S5/S1 PROCEEDINGS Open Access The BioAssay network and its implications to future therapeutic discovery Jintao Zhang1, Gerald H Lushington2, Jun Huan3* From IEEE International Conference on Bioinformatics and Biomedicine 2010 Hong Kong, PR China. 18-21 December 2010 Abstract Background: Despite intense investment growth and technology development, there is an observed bottleneck in drug discovery and development over the past decade. NIH started the Molecular Libraries Initiative (MLI) in 2003 to enlarge the pool for potential drug targets, especially from the “undruggable” part of human genome, and potential drug candidates from much broader types of drug-like small molecules. All results are being made publicly available in a web portal called PubChem. Results: In this paper we construct a network from bioassay data in PubChem, apply network biology concepts to characterize this bioassay network, integrate information from multiple biological databases (e.g. DrugBank, OMIM, and UniHI), and systematically analyze the potential of bioassay targets being new drug targets in the context of complex biological networks. We propose a model to quantitatively prioritize this druggability of bioassay targets, and literature evidence was found to confirm our prioritization of bioassay targets at a roughly 70% accuracy. Conclusions: Our analysis provide some measures of the value of the MLI data as a resource for both basic chemical biology research and future therapeutic discovery. Background Under these circumstances, NIH launched the Mole- Intense growth in drug development investment in the cular Libraries Initiative (MLI) in 2003 for identifying past decade has not yet produced significant progress chemical probes to enhance the chemical biology under- on the discovery of novel drugs and the validation of standing of therapeutically interesting genes and path- new drug targets: the average number of novel drugs ways [7], and for the purpose of expanding availability, entering the global market each year has remained flexibility, and utility of small-molecule bioassay screen- roughly constant (approximately 26), along with only ing data. The MLI especially focuses on genes in the about 6-7 new drug targets introduced annually [1-3]. “undruggable” part of human genome that has not been Moreover, the success rate of translating new drug can- well investigated in private sectors for identifying their didates into US Food and Drug Administration (FDA)- functions and potential therapeutics [7]. The MLI has approved drugs significantly decreased [4], mainly due also been synthesizing and screening much broader to lack of efficacy and discovered adverse drug reactions, types of compounds for increasing the diversity of each of which accounts for 30% of late-stage drug fail- selecting potential drug candidates in chemical space (i. ures in clinical development [5]. The increasing rate of e., the set of all possible small organic molecules). All drug attrition challenges the traditional drug design MLI results are being made freely available to research- paradigm and makes the current “one gene, one drug, ers in both public and private sectors via a web portal and one disease” assumption [6] questionable. called PubChem (http://pubchem.ncbi.nlm.nih.gov) [8]. PubChem provides valuable chemical genomics informa- tion in studying genes, pathways, cells and diseases, * Correspondence: [email protected] 3Department of Electrical Engineering & Computer Science, University of however, these data are noisy, high dimensional, with Kansas, Lawrence, KS 66045, USA large volume, and contain outliers and errors. For Full list of author information is available at the end of the article © 2011 Zhang et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Zhang et al. BMC Bioinformatics 2011, 12(Suppl 5):S1 Page 2 of 8 http://www.biomedcentral.com/1471-2105/12/S5/S1 instance, the activity score, which measure the biological data from the Online Mendelian Inheritance in Man activity level of screened compounds, are normalized in (OMIM) database [14], Goh et al.[15] built a bipartite many different approaches without a consensus. Hence human disease network, and then generated two biologi- PubChem data deserve a careful investigation for the cally relevant network projections: human disease net- related research communities. work and disease gene network. This network-based There are numerous criteria that chemical biologists approach revealed that genes associated similar disor- might place on an initiative that aims to foster new ders are more likely to have interactions between their paradigms in therapeutic discovery. Some obvious tar- products and higher expression profiling similarity get-related benchmarks should include whether the between their transcripts, indicating the existence of dis- assay priorities focus on the systems that stand a good ease-specific functional modules. In this study we use chance of being potential druggable targets in their network-based analysis approaches to integrate Pub- own right (i.e., share favorable attributes with known Chem data and existing research results on drug-target drug targets, and augment current biomedical capabil- network [2], human diseasome [15], human protein ities). Our assessment of MLI bioassay targets thus interactome [9] for a better understanding of the corre- focuses on a variety of their attributes relative to lations and interrelationships between disease genes, known drug targets. Given our interest in analyzing genetic disorders and drugs by (1) constructing a bioas- specific relationships with phenotypically interesting say network for data in PubChem, visualizing complex pathways, we choose to focus our target analysis data in a network view and characterizing the network strictly on target-based assays, whose target proteins using a variety of statistical tools, (2) mapping bioassay are referred to herein as bioassay targets. As our basis targets into the human PPI networks, and investigating for contrasting targets, we use target protein interac- the interrelationships between bioassay targets, and drug tion profiles in the human PPI network, to evaluate targets, disease genes, and essential genes, and (3) quali- whether the bioassay target selection was progressing tatively analyzing the potential of bioassay targets to be effectively toward augmenting existing chemical geno- potential therapeutic targets, quantitatively modeling mics knowledge. this potential, and confirming our results using literature Biological network analysis approaches have gained survey. Our analyses should provide some measures of popularity for organizing complex biological systems so the value of the MLI data as a resource for both basic that data retrieval, analysis, and visualization can be chemical biology research and future therapeutic highly efficient. It can also reveal important biological discovery. patterns and functions that are deeply hidden in mass data repository. For instance, Stelzl et al.[9]usedthe Results and discussion Y2H system to generate and analyze a human PPI net- We download all bioassay screening data from the Pub- work, and calculated many interesting and critical pat- Chem BioAssay Database. As of January 2009, 1306 ternsandcharacteristics.Thiswasviewedasan bioassays (1,126 with at least one active compound) and important step toward the complete human protein-pro- more than 30 million compounds have been deposited tein interactome. Chaurasia et al.[10,11] built a compre- into PubChem by a variety of screening centers, and the hensive web platform - the Unified Human Interactome size of PubChem data keeps increasing continuously. database (UniHI, http://www.unihi.org), for querying For each bioassay, tens to hundreds of thousand of com- and accessing human protein-protein interaction (PPI) pounds are tested either against specific target proteins data. The latest update of UniHI includes over 250,000 in vitro or within a cell for investigating disease-related interactions between 22,300 unique proteins collected mechanisms. There are totally 151,930 compounds that from 14 major PPI sources [11]. Using association data are active in at least one bioassay, and 555,859 bioassay- of approved drugs and drug targets obtained from the active compound pairs across all the bioassays. On aver- DrugBank database [12,13], Yildirim et al.[2] built a age each active compound is active in 3.7 bioassays, and bipartite network composed FDA-approved drugs and each bioassay has 493.7 active compounds. Therefore a their drug targets, which was an important step toward very sparse bipartite network of bioassays and com- the complete characterization of the global relationship pounds can be observed. In addition, 680 bioassays are between protein targets of all drugs and all disease-gene found associated with at least one target protein and are products in the human protein interactome. Quantita- considered target-based, and the rest are hence assumed tive topological analyses of this drug-target network cell-based bioassays. Moreover, we have found 289 dis- revealed that the targets of current drugs are highly tinct protein GI (gene identifier) numbers
Recommended publications
  • A De Novo 2Q35-Q36.1 Deletion Incorporating IHH in a Chinese Boy (47,XYY) with Syndactyly, Type III Waardenburg Syndrome, and Congenital Heart Disease
    A de novo 2q35-q36.1 deletion incorporating IHH in a Chinese boy (47,XYY) with syndactyly, type III Waardenburg syndrome, and congenital heart disease D. Wang1,2*, G.F. Ren3*, H.Z. Zhang1, C.Y. Yi4 and Z.J. Peng3 1Medical Institute of Pediatrics, Qilu Children’s Hospital of Shandong University, Jinan, Shandong, China 2Institute of Cardiovascular Disease, General Hospital of Jinan Military Region, Jinan, Shandong, China 3Department of Pediatrics, Qilu Children’s Hospital of Shandong University, Jinan, Shandong, China 4Children’s Medical Laboratory Diagnosis Center, Qilu Children’s Hospital of Shandong University, Jinan, Shandong, China *These authors contributed equally to this study. Corresponding authors: C.Y. Yi / Z.J. Peng E-mail: [email protected] / [email protected] Genet. Mol. Res. 15 (4): gmr15049060 Received August 5, 2016 Accepted November 8, 2016 Published December 2, 2016 DOI http://dx.doi.org/10.4238/gmr15049060 Copyright © 2016 The Authors. This is an open-access article distributed under the terms of the Creative Commons Attribution ShareAlike (CC BY-SA) 4.0 License. ABSTRACT. Reports of terminal and interstitial deletions of the long arm of chromosome 2 are rare in the literature. Here, we present a case report concerning a Chinese boy with a 47,XYY karyotype and a de novo deletion comprising approximately 5 Mb between 2q35 and q36.1, along with syndactyly, type III Waardenburg syndrome, and congenital heart disease. High-resolution chromosome analysis Genetics and Molecular Research 15 (4): gmr15049060 D. Wang et al. 2 to detect copy number variations was carried out using an Affymetrix microarray platform, and the genes affected by the patient’s deletion, including IHH, were determined.
    [Show full text]
  • Page 1 Exploring the Understudied Human Kinome For
    bioRxiv preprint doi: https://doi.org/10.1101/2020.04.02.022277; this version posted June 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Exploring the understudied human kinome for research and therapeutic opportunities Nienke Moret1,2,*, Changchang Liu1,2,*, Benjamin M. Gyori2, John A. Bachman,2, Albert Steppi2, Rahil Taujale3, Liang-Chin Huang3, Clemens Hug2, Matt Berginski1,4,5, Shawn Gomez1,4,5, Natarajan Kannan,1,3 and Peter K. Sorger1,2,† *These authors contributed equally † Corresponding author 1The NIH Understudied Kinome Consortium 2Laboratory of Systems Pharmacology, Department of Systems Biology, Harvard Program in Therapeutic Science, Harvard Medical School, Boston, Massachusetts 02115, USA 3 Institute of Bioinformatics, University of Georgia, Athens, GA, 30602 USA 4 Department of Pharmacology, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA 5 Joint Department of Biomedical Engineering at the University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC 27599, USA Key Words: kinase, human kinome, kinase inhibitors, drug discovery, cancer, cheminformatics, † Peter Sorger Warren Alpert 432 200 Longwood Avenue Harvard Medical School, Boston MA 02115 [email protected] cc: [email protected] 617-432-6901 ORCID Numbers Peter K. Sorger 0000-0002-3364-1838 Nienke Moret 0000-0001-6038-6863 Changchang Liu 0000-0003-4594-4577 Ben Gyori 0000-0001-9439-5346 John Bachman 0000-0001-6095-2466 Albert Steppi 0000-0001-5871-6245 Page 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.04.02.022277; this version posted June 30, 2020.
    [Show full text]
  • A Revised Nomenclature for the Human and Rodent Α-Tubulin Gene Family ⁎ Varsha K
    Genomics 90 (2007) 285–289 www.elsevier.com/locate/ygeno Short Communication A revised nomenclature for the human and rodent α-tubulin gene family ⁎ Varsha K. Khodiyar a, , Lois J. Maltais b, Katherine M.B. Sneddon a, Jennifer R. Smith c, Mary Shimoyama c, Fernando Cabral d, Charles Dumontet e, Susan K. Dutcher f, Robert J. Harvey g, Laurence Lafanechère h, John M. Murray i, Eva Nogales j, David Piquemal k, Fabio Stanchi l, Sue Povey a, Ruth C. Lovering a a HUGO Gene Nomenclature Committee, Department of Biology, University College London, Wolfson House, 4 Stephenson Way, London NW1 2HE, UK b Mouse Genomic Nomenclature Committee, Mouse Genome Informatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA c Rat Genome Database, Bioinformatics Research Center, Medical College of Wisconsin, Milwaukee, WI 53226, USA d Department of Integrative Biology and Pharmacology, University of Texas Medical School, Houston, TX 77030, USA e Unité Institut National de la Sante et de la Recherche Medicale 590, Laboratoire de Cytologie Analytique, Faculté de Médecine, Université Claude Bernard, 8 Avenue Rockefeller, 69373 Lyon Cedex 08, France f Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA g Department of Pharmacology, The School of Pharmacy, 29-39 Brunswick Square, London WC1N1AX, UK h Centre de Criblage pour Molécules Bio-Actives, Institut de Recherches en Technologies et Sciences pour le Vivant, CEA–Grenoble, 17 Rue des Martyrs, 38054 Grenoble, France i Department of Cell and Developmental Biology, University of Pennsylvania, Philadelphia, PA 19104-6058, USA j Howard Hughes Medical Institute, Molecular and Cell Biology Department, and Lawrence Berkeley National Laboratory, 355 LSA, University of California at Berkeley, Berkeley, CA 94720-3200, USA k Skuld-Tech Laboratories, Bioinformatics Center, Montpellier, France l Institute of Signaling, Developmental Biology, and Cancer Research, CNRS UMR6543, Centre A.
    [Show full text]
  • A Resource for Exploring the Understudied Human Kinome for Research and Therapeutic
    bioRxiv preprint doi: https://doi.org/10.1101/2020.04.02.022277; this version posted March 11, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. A resource for exploring the understudied human kinome for research and therapeutic opportunities Nienke Moret1,2,*, Changchang Liu1,2,*, Benjamin M. Gyori2, John A. Bachman,2, Albert Steppi2, Clemens Hug2, Rahil Taujale3, Liang-Chin Huang3, Matthew E. Berginski1,4,5, Shawn M. Gomez1,4,5, Natarajan Kannan,1,3 and Peter K. Sorger1,2,† *These authors contributed equally † Corresponding author 1The NIH Understudied Kinome Consortium 2Laboratory of Systems Pharmacology, Department of Systems Biology, Harvard Program in Therapeutic Science, Harvard Medical School, Boston, Massachusetts 02115, USA 3 Institute of Bioinformatics, University of Georgia, Athens, GA, 30602 USA 4 Department of Pharmacology, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA 5 Joint Department of Biomedical Engineering at the University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC 27599, USA † Peter Sorger Warren Alpert 432 200 Longwood Avenue Harvard Medical School, Boston MA 02115 [email protected] cc: [email protected] 617-432-6901 ORCID Numbers Peter K. Sorger 0000-0002-3364-1838 Nienke Moret 0000-0001-6038-6863 Changchang Liu 0000-0003-4594-4577 Benjamin M. Gyori 0000-0001-9439-5346 John A. Bachman 0000-0001-6095-2466 Albert Steppi 0000-0001-5871-6245 Shawn M.
    [Show full text]
  • Use of Network Pharmacology to Investigate the Mechanism of the Compound Xuanju Capsule in the Treatment of Rheumatoid Arthritis
    Hindawi BioMed Research International Volume 2021, Article ID 5568791, 14 pages https://doi.org/10.1155/2021/5568791 Research Article Use of Network Pharmacology to Investigate the Mechanism of the Compound Xuanju Capsule in the Treatment of Rheumatoid Arthritis Wenyang Wei, Wanpeng Lu, Xiaolong Chen, Yunfeng Yang, and Mengkai Zheng Academic Research and Development Center of Zhejiang Strong Pharmaceutical Co., Ltd., Hangzhou, 310053 Zhejiang, China Correspondence should be addressed to Mengkai Zheng; [email protected] Received 24 February 2021; Accepted 24 July 2021; Published 10 August 2021 Academic Editor: Nadeem Sheikh Copyright © 2021 Wenyang Wei et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Objective. To clarify the therapeutic mechanisms of compound Xuanju capsule-treated rheumatoid arthritis (RA) based on network pharmacology tactics. Method. The TCMSP, TCMID and STITCH databases were used to screen the active ingredients and targets in the compound Xuanju capsule; the OMIM, TTD, PharmGKB and GeneCards databases were applied to screen the RA-related disease targets. Then, the obtained targets were imported into Cytoscape 3.7.1 software to construct the active ingredient-target network and the RA-related disease-target network. The active ingredient-target PPI network, the RA-related disease-target PPI network and the common target PPI network were built by using the STRING platform and Cytoscape 3.7.1 software. The GO and KEGG analyses of the common targets were analyzed by using the Metascape and Bioinformatics online tools.
    [Show full text]
  • Serine/Threonine Protein Kinase STK16
    International Journal of Molecular Sciences Review Serine/Threonine Protein Kinase STK16 Junjun Wang 1,2 , Xinmiao Ji 1 , Juanjuan Liu 3,* and Xin Zhang 1,2,4,* 1 High Magnetic Field Laboratory, Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China; [email protected] (J.W.); xinmiaoji@hmfl.ac.cn (X.J.) 2 Science Island Branch of Graduate School, University of Science and Technology of China, Hefei 230026, China 3 School of Life Sciences, Anhui University, Hefei 230601, China 4 Institutes of Physical Science and Information Technology, Anhui University, Hefei 230601, China * Correspondence: [email protected] (J.L.); xinzhang@hmfl.ac.cn (X.Z.) Received: 25 March 2019; Accepted: 8 April 2019; Published: 10 April 2019 Abstract: STK16 (Ser/Thr kinase 16, also known as Krct/PKL12/MPSK1/TSF-1) is a myristoylated and palmitoylated Ser/Thr protein kinase that is ubiquitously expressed and conserved among all eukaryotes. STK16 is distantly related to the other kinases and belongs to the NAK kinase family that has an atypical activation loop architecture. As a membrane-associated protein that is primarily localized to the Golgi, STK16 has been shown to participate in the TGF-β signaling pathway, TGN protein secretion and sorting, as well as cell cycle and Golgi assembly regulation. This review aims to provide a comprehensive summary of the progress made in recent research about STK16, ranging from its distribution, molecular characterization, post-translational modification (fatty acylation and phosphorylation), interactors (GlcNAcK/DRG1/MAL2/Actin/WDR1), and related functions.
    [Show full text]
  • By Submitted in Partial Satisfaction of the Requirements for Degree of in in the GRADUATE DIVISION of the UNIVERSITY of CALIFORN
    by Submitted in partial satisfaction of the requirements for degree of in in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA, SAN FRANCISCO Approved: ______________________________________________________________________________ Chair ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ ______________________________________________________________________________ Committee Members Copyright 2020 by Hang Yin Karen Wong ii ACKNOWLEDGEMENTS This dissertation would not have been possible without the many individuals who have inspired me along the way. First, I would like to thank my PhD advisor Dr. Pui-Yan Kwok for guiding me and providing support throughout my scientific career and beyond. Amid his busy schedule, he is always available and ready to discuss my work with great enthusiasm. He has built an incredible network of talented people, always giving me the opportunities to connect and learn from those with different backgrounds. The projects in the lab are constantly at the forefront of scientific development and I cannot thank him enough for giving me the resources and freedom to grow as an independent scientist. I owe a debt of gratitude most of all to my dissertation and qualifying exam committee members: Drs. Neil Risch, Jeff Wall, Nadav Ahituv, Joseph Shieh, and Elad Ziv. They generously shared their time to give me guidance on my projects and welcomed me to discuss new ideas. I am extremely grateful to the many members of the Kwok lab, past and present, from whom I have learned valuable knowledge. I must thank Michal Levy-Sakin and Yulia Mostovoy for their generosity and patience in mentoring me. The long discussions, from conceptualizing ideas to debugging, had sparked my fundamental interests and curiosity in bioinformatics and human genomics.
    [Show full text]
  • Annotation of the Understudied Kinome and Preliminary Teseting of Kinase Inhibitor Combinations
    ANNOTATION OF THE UNDERSTUDIED KINOME AND PRELIMINARY TESETING OF KINASE INHIBITOR COMBINATIONS Claire Reisig Hall A thesis submitted to the faculty at the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the degree of Master of Science in the Joint Program of Biomedical Engineering in the School of Medicine. Chapel Hill 2017 Approved by: Shawn Gomez Jeffrey Macdonald Glenn Walker © 2017 Claire Reisig Hall ALL RIGHTS RESERVED ii ABSTRACT Claire Reisig Hall: Annotation of the Understudied Kinome and Preliminary Testing of Kinase Inhibitor Combinations (Under the direction of Shawn Gomez) A technique utilizing multiplexed inhibitor beads and mass spectrometry (MIB/MS) detects functional protein kinases in breast cancer cell lines. Data from this technique was used to shed light on the understudied kinome, a portion of which is captured by the MIB/MS method. Regression analysis was performed to find correlations in kinase activity. The functional linkages were then used to annotate the understudied kinases. Annotations revealed new possible functions and disease relations for many understudied kinases. Kinase inhibitor combinations were suggested by principle components analysis (PCA) results performed on MIB/MS data from treated breast cancer cell lines. The combinations were preliminarily tested for signs of effectiveness. Dose curves and growth assays were performed to compare drug combinations in the SKBR3 cell line. The interpretation of in vitro experiment results was impeded because of poor
    [Show full text]
  • Discovering Meaning from Biological Sequences: Focus on Predicting
    Iowa State University Capstones, Theses and Graduate Theses and Dissertations Dissertations 2013 Discovering meaning from biological sequences: focus on predicting misannotated proteins, binding patterns, and G4-quadruplex secondary Carson Michael Andorf Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/etd Part of the Bioinformatics Commons, and the Computer Sciences Commons Recommended Citation Andorf, Carson Michael, "Discovering meaning from biological sequences: focus on predicting misannotated proteins, binding patterns, and G4-quadruplex secondary" (2013). Graduate Theses and Dissertations. 13533. https://lib.dr.iastate.edu/etd/13533 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Graduate Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Discovering meaning from biological sequences: focus on predicting misannotated proteins, binding patterns, and G4-quadruplex secondary structures by Carson Michael Andorf A dissertation submitted to the graduate faculty in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Major: Computer Science Program of Study Committee: Vasant Honavar, Co-major Professor Drena Dobbs, Co-major Professor David Fernandez-Baca Robert Jernigan Taner Sen Guang Song Iowa State University
    [Show full text]
  • 1 Supplemental Table 1. Comparison Among Q-RT-PCR, ISH-TMA And
    Supplemental Table 1. Comparison among Q-RT-PCR, ISH-TMA and IHC for the detection of ErbB-2 in breast cancers. IHC ISH-TMA Q-RT-PCR Q-RT-PCR range Q-RT-PCR score 0 0 0* 0 0 0 0* 0 0 0 1 0 0 0 1 0 0 0 1 0 0 1 2 0 1 1 2 0 0 0 3 0 0 0 3 0 0 0 3 0 0 0 3 0 0 0 3 x<10 0 1 1 3 0 0 0 4 0 0 0 5 0 0 0 5 0 0 0 5 0 1 1 5 0 0 0 6 0 0 0 8 0 0 0 8 0 1 1 8 0 0 1 8 0 1 1 11 1 1 1 11 1 1 1 11 1 1 0 12 1 1 1 13 10<x<20 1 0 1 13 1 1 1 16 1 1 1 17 1 2 1 17 1 2 2 22 2 2 2 43 2 20<x<100 3 2 74 2 3 2 86 2 3 3 213 3 3 3 259 x>100 3 3 3 362 3 Legend to Supplemental Table 1. This experiment was set up to demonstrate that there is good semi-quantitative correlation between the levels of expression detected by ISH-TMA, IHC and Q-RT-PCR. We compared the three methods on levels of ErbB-2 expression in breast cancer, since ErbB-2 is overexpressed in breast cancers, over a wide range of levels.
    [Show full text]
  • Attractor Metafeatures and Their Application in Biomolecular Data Analysis
    Attractor Metafeatures and Their Application in Biomolecular Data Analysis Tai-Hsien Ou Yang Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2018 ©2018 Tai-Hsien Ou Yang All rights reserved ABSTRACT Attractor Metafeatures and Their Application in Biomolecular Data Analysis Tai-Hsien Ou Yang This dissertation proposes a family of algorithms for deriving signatures of mutually associated features, to which we refer as attractor metafeatures, or simply attractors. Specifically, we present multi-cancer attractor derivation algorithms, identifying correlated features in signatures from multiple biological data sets in one analysis, as well as the groups of samples or cells that exclusively express these signatures. Our results demonstrate that these signatures can be used, in proper combinations, as biomarkers that predict a patient’s survival rate, based on the transcriptome of the tumor sample. They can also be used as features to analyze the composition of the tumor. Through analyzing large data sets of 18 cancer types and three high-throughput platforms from The Cancer Genome Atlas (TCGA) PanCanAtlas Project and multiple single-cell RNA-seq data sets, we identified novel cancer attractor signatures and elucidated the identity of the cells that express these signatures. Using these signatures, we developed a prognostic biomarker for breast cancer called the Breast Cancer Attractor Metagenes (BCAM) biomarker as well as a software platform
    [Show full text]