AUCTSP: an improved biomarker gene pair class predictor

Dimitri Kagaris1*, Alireza Khamesipour1 and Constantin T. Yiannoutsos2

Kagaris et al. BMC Bioinformatics (2018) 19:244
https://doi.org/10.1186/s12859-018-2231-1

RESEARCH ARTICLE. Open Access.

Abstract

Background: The Top Scoring Pair (TSP) classifier, based on the concept of relative ranking reversals in the expressions of pairs of genes, has been proposed as a simple, accurate, and easily interpretable decision rule for classification and class prediction of gene expression profiles. The idea that differences in gene expression ranking are associated with presence or absence of disease is compelling and has strong biological plausibility. Nevertheless, the TSP formulation ignores significant available information which can improve classification accuracy and is vulnerable to selecting genes which do not have differential expression in the two conditions (“pivot” genes).

Results: We introduce the AUCTSP classifier as an alternative rank-based estimator of the magnitude of the ranking reversals involved in the original TSP. The proposed estimator is based on the Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) and as such, takes into account the separation of the entire distribution of gene expression levels in gene pairs under the conditions considered, as opposed to comparing gene rankings within individual subjects as in the original TSP formulation. Through extensive simulations and case studies involving classification in ovarian, leukemia, colon, breast and prostate cancers and diffuse large B-cell lymphoma, we show the superiority of the proposed approach in terms of improving classification accuracy, avoiding overfitting and being less prone to selecting non-informative (pivot) genes.
Conclusions: The proposed AUCTSP is a simple yet reliable and robust rank-based classifier for gene expression classification. While the AUCTSP works by the same principle as TSP, its ability to determine the top scoring gene pair based on the relative rankings of two marker genes across all subjects as opposed to each individual subject results in significant performance gains in classification accuracy. In addition, the proposed method tends to avoid selection of non-informative (pivot) genes as members of the top-scoring pair.

Keywords: Microarray data analysis, Gene expression, Gene selection, Receiver operating characteristic (ROC) curve, AUC, Leukemia, Breast cancer, Ovarian cancer, Colon cancer, Prostate cancer, Diffuse large B-cell lymphoma

*Correspondence: [email protected]
1 Department of Electrical and Computer Engineering, Southern Illinois University, 1230 Lincoln Drive, 62901 Carbondale, IL, USA
Full list of author information is available at the end of the article

© The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Background

Microarray data analysis is a high throughput method used to gain information about gene functions inside cells. This information is in turn used to detect the presence or absence of disease [1–3], and gain a better understanding of a disease mechanism [4].

A particularly useful application of microarray technology uses microarray data to detect the presence of disease by combining gene expression levels from a number of genes, to provide information on whether disease is present (classification) or the risk for the occurrence of disease in the future (prediction). While very complex classifiers can be constructed, a number of authors have expressed concern with the “black box” nature of these approaches [5], preferring simpler, more interpretable classifiers in clinical applications [6, 7]. It is noted that the preference for the latter kind of classifiers should not be at the expense of their performance.

Classification involves, at its most fundamental level, a comparison between expression levels in one or more genes between two or more conditions (e.g., disease versus no disease). This comparison can be based on a fairly heuristic criterion (e.g., fold-change in gene expression [8]), or by using parametric or non-parametric statistical methods [9–12]. There are several advantages and disadvantages with each of these methods. For example, it is biologically plausible that genes with large differential expression levels should be part of a classification criterion. However, the fold-change criterion does not take gene expression variability into account and determining a cutoff is an arbitrary exercise [13]. On the other hand, parametric statistical methods, which are based on some variant of the t-test, provide some sense of one’s confidence in the gene expression difference, but frequently lose the intuitive appeal of heuristic methods like fold-change (e.g., when even small differences are statistically significant). In addition, parametric methods make strong and frequently untenable assumptions regarding the distribution of gene expression levels [13]. Non-parametric methods, which are based on ranking gene expression levels, are expected to lose some information because of the use of ranks instead of actual gene-expression data. However, such methods are robust to deviations from parametric assumptions [13], and are less vulnerable to biases arising from data normalization and other pre-processing steps [14], which are plausibly assumed to be rank-preserving [6, 7].

The fact that the TSP provides classifiers based on only two genes is also an attractive compromise in the so-called “bias-variance” tradeoff [15]. As a classifier’s performance is a combination of variance (random error) and bias (systematic error), in many cases, high-dimensional classifiers with low bias (due to good performance in the current sample) have large variances (i.e., poor precision) in new samples. By contrast, simpler (and thus more rigid) classifiers, while possibly having higher levels of bias, are less influenced by a specific sample and may have better overall performance (smaller variance) in multiple samples. The simple TSP classifiers, it was hoped, would perform sufficiently well both in the current sample as well as in new samples. The TSP is a rank-based classifier in the sense that it uses the rankings of gene expression levels within a gene profile rather than the levels themselves, an approach with significant advantages due to the nonparametric nature of the classification technique. The central idea behind the TSP classifier is that it identifies two genes whose gene expression ranking changes between the two conditions under consideration. This change lends itself to a simple biological interpretation as an inversion of mRNA abundance of the two genes in the two conditions under consideration.

The pair of genes selected by the TSP [6], referred to as the top scoring pair (TSP), is found by the following approach: Consider $G$ genes which have been profiled by microarray analysis. Let $n_1$ be the number of experiments from the first class with expression levels $Y_i = \{Y_{i,1}, Y_{i,2}, \cdots, Y_{i,n_1}\}$, and let $n_2$ be the number of experiments from the second class with expression levels $Y_i = \{Y_{i,n_1+1}, Y_{i,n_1+2}, \cdots, Y_{i,n}\}$, where $n = n_1 + n_2$. Given a pair of genes $(i, j)$, $1 \le i \ne j \le G$, the reversal score of the pair was defined in [6] as

$$\Delta_{ij} = P(Y_i > Y_j \mid C = 1) - P(Y_i > Y_j \mid C = 2) \qquad (1)$$

where $P(Y_i > Y_j \mid C = m)$ denotes the probability that the expression level of gene $i$ is larger than the expression level of gene $j$ in samples from class $C$, with $C$ being equal to $m = 1, 2$. The score $\Delta_{ij}$ can be empirically approximated by the expression [6]

$$\Delta_{ij} = \frac{\sum_{\ell=1}^{n_1} I_1(Y_{i,\ell} > Y_{j,\ell})}{n_1} - \frac{\sum_{\ell=n_1+1}^{n} I_2(Y_{i,\ell} > Y_{j,\ell})}{n_2} \qquad (2)$$

where index $\ell$ indicates the $\ell$th subject, $1 \le \ell \le n$, and $I_m(Y_{i,\ell} > Y_{j,\ell}) = 1$ if $Y_{i,\ell} > Y_{j,\ell}$ in class $m = 1, 2$, and 0 otherwise.

Obviously, the larger the $\Delta_{ij}$, the higher the probability that the expression levels of genes $i$ and $j$ have reverse relative rankings in the two groups, and it is exactly this property that is used for classification by the TSP. More specifically, let $(\alpha, \beta)$ be the pair of genes that yields the maximum score $\Delta_{\alpha\beta} = \max\{\Delta_{ij}\}$ (referred to as the Top Scoring Pair (TSP) [6]). Then the classification is performed as follows. Assume that

$$P(Y_\alpha > Y_\beta \mid C = 1) > P(Y_\alpha > Y_\beta \mid C = 2) \qquad (3)$$

i.e.,

$$\frac{\sum_{\ell=1}^{n_1} I_1(Y_{\alpha,\ell} > Y_{\beta,\ell})}{n_1} > \frac{\sum_{\ell=n_1+1}^{n} I_2(Y_{\alpha,\ell} > Y_{\beta,\ell})}{n_2} \qquad (4)$$

Then a new subject $s$ whose measured expression levels for genes $\alpha$ and $\beta$ are $Y_{\alpha,s}$ and $Y_{\beta,s}$ respectively, will be classified as belonging to the first class if $Y_{\alpha,s} > Y_{\beta,s}$, and to the second class otherwise.

The genes in the top scoring pair as selected by the TSP method may have a problem, as Lin et al. [5] also point out: the selected genes may not be a pair of genuinely up-regulated and down-regulated genes; instead, one of the selected genes in the pair may serve only as a reference or “pivot” gene, which can lead to a high TSP score but a rather non-informative gene pair. Most researchers have used more complicated methods or selected more features in order to overcome these problems.

In the proposed method we employ a simple statistic associated with the Receiver Operating Characteristic (ROC) curve that is commonly known as the Area Under the ROC curve (AUROC) or the Area Under the Curve (AUC), for short. The ROC curve and the AUC in particular have been widely used as a measure for microarray classification and other medical diagnostic tests (see, e.g., [16–23]).

The proposed method, referred to as AUCTSP (AUC-based TSP), uses similar ideas as the TSP, thus benefiting […]

The probability $P_{AUCTSP}$ can be calculated by the Area Under the ROC Curve (AUC) [23].