A Machine Learning Tool for Predicting Protein Function Anna

Total Page:16

File Type:pdf, Size:1020Kb

A Machine Learning Tool for Predicting Protein Function Anna The Woolf Classifier Building Pipeline: A Machine Learning Tool for Predicting Protein Function Anna Farrell-Sherman Submitted in Partial Fulfillment of the Prerequisite for Honors in Computer Science under the advisement of Eni Mustafaraj and Vanja Klepac-Ceraj May 2019 © 2019 Anna Farrell-Sherman Abstract Proteins are the machinery that allow cells to grow, reproduce, communicate, and create multicellular organisms. For all of their importance, scientists still have a hard time understanding the function of a protein based on its sequence alone. For my honors thesis in computer science, I created a machine learning tool that can predict the function of a protein based solely on its amino acid sequence. My tool gives scientists a structure in which to build a � Nearest Neighbor or random forest classifier to distinguish between proteins that can and cannot perform a given function. Using default Min-Max scaling, and the Matthews Correlation Coefficient for accuracy assessment, the Woolf Pipeline is built with simplified choices to guide users to success. i Acknowledgments There are so many people who made this thesis possible. First, thank you to my wonderful advisors, Eni and Vanja, who never gave up on me, and always pushed me to try my hardest. Thank you also to the other members of my committee, Sohie Lee, Shikha Singh, and Rosanna Hertz for supporting me through this process. To Kevin, Sophie R, Sophie E, and my sister Phoebe, thank you for reading my drafts, and advising me when the going got tough. To all my wonderful friends, who were always there with encouraging words, warm hugs, and many congratulations, I cannot thank you enough. Jocelyn, Hershel, Fiona, Anneli, Lydia, Linda, and Claire, I could not have done this without all of you. And finally, thank you to my parents, who have always believed in me more than I could have thought possible, and have shown me how much I am capable of. ii A Note on the Name Woolf The Woolf Classifier Building Pipeline is named after the lesbian feminist author Virginia Woolf. She is known for her use of stream of consciousness, and is considered one of the most important 20th century modernist authors. The tribute to her here is a reminder that science alone can never capture all of the wonders of life. iii Table of Contents Abstract i Acknowledgments ii A Note on the Name Woolf iii List of Figures vi List of Tables vii 1 Introduction 1 2 Anaerobic Manganese Oxidation: A Case Study 3 2.1 Manganese Oxides in the Geologic Record 3 2.2 Biological Mn(II) Oxidation 4 2.3 Genes and Proteins 6 3 The Field of Machine Learning 8 3.1 What is Machine Learning? 8 3.2 Classification in Machine Learning 9 3.2.1 The Class Distribution Problem 9 3.2.2 The Multi-Class Problem 13 3.3 Algorithm Choice 14 3.3.1 Naïve Bayes 15 3.3.2 Support Vector Machine (SVM) 16 3.3.3 Neural Nets 17 3.3.4 k Nearest Neighbors (kNN) 18 3.3.5 Decision Trees and Random Forests 19 3.4 Machine Learning in Biology 20 4 The Woolf Classifier Building Pipeline 22 4.1 Introduction 22 4.2 How is the Woolf Pipeline Different? 22 4.3 Pipeline Composition 23 4.3.1 Input 24 4.3.2 Creating a Feature Table 25 4.3.3 Model Creation with Default Settings 26 4.3.4 Changing the Pipeline Parameters 27 4.3.5 Error Evaluation 28 4.3.6 Prediction 29 4.3.7 Implementation 29 4.4 Design Decisions 29 4.4.1 Why the Command Line? 29 4.4.2 Machine Learning Features 30 4.4.3 Scalar Operations 31 4.4.4 Accuracy Metrics 32 iv 4.4.5 Algorithm Choices 33 4.5 When are Woolf Classifiers Useful 34 5 Model Building with the Woolf Pipeline 35 5.1 Introduction 35 5.2 �-Lactamases for Woolf Model Testing 35 5.3 Evaluating Algorithm Implementations 36 5.3.1 Accuracy Metrics 36 5.3.2 Scaling Types 39 5.3.3 Hyperparameter Values 40 5.4 Detecting Anaerobic Manganese Oxidation Genes 41 5.4.1 kNN Model 42 5.4.2 Random Forest Model 42 5.4.3 Prediction of Green Lake Manganese Oxide Genes 43 5.5 Conclusion 44 6 Conclusions and Future Work 45 6.1 The Success of the Woolf Classifier Building Pipeline 45 6.2 Improvements to the Woolf Tool 45 6.3 Future Applications 46 7 References 48 Appendices i Appendix A: User Manual i Appendix B: Manganese Oxidizing Peroxidases xiv Appendix C: Non-Manganese Oxidizing Peroxidases xv v List of Figures 2.1 Manganese Oxides and the Build-up of Oxygen in Earth’s Atmosphere 4 2.2: Anaerobic Manganese Oxide Producing Bacterial Communities in Fayetteville 5 Green Lake 3.3.1 Bayes’ Rule 15 3.3.2 Support Vector Machine in Two Dimensions 16 3.3.3 Neural Network Diagram 17 3.3.4 kNN Classification in Two Dimensions 19 3.3.5 A Random Forest with Three Trees 20 4.2 Genome to Gene to Protein 23 4.3.1 FASTA and FASTQ format 24 4.3.2 The Woolf Classifier Building Pipeline 26 4.4.2 The Twenty Amino Acids 31 5.1 The 4 �-Lactamase Comparisons Used to Test the Woolf Classification Pipeline 36 5.3.1 Three Accuracy Metrics to Evaluate Woolf Classifiers 37 5.3.2 Effect of Scaling on kNN-based Model MCC 39 5.3.3 Effect of Hyperparameters on MCC 40 vi List of Tables 3.2.1 The Confusion Matrix 11 4.3.4 User Specified Parameters 28 4.4.3 Possible Scaler Types 32 4.4.4 Possible Accuracy Metrics 33 5.4.3 Predicted Manganese Binding Function of Green Lake Peroxidases 44 vii 1 Introduction Until recently, scientists did not know that bacteria were capable of producing manganese oxides without oxygen. In fact, manganese oxide minerals have been used to date both the formation of Earth’s modern oxygenated atmosphere and the evolution of oxygenic photosynthesis, based on the idea that these minerals could not form without oxygen in the atmosphere [1, 2]. Yet, despite these claims, recent research has discovered bacterial communities capable of forming manganese oxides without oxygen [3]. This discovery has the potential to change our understanding of the early Archean world. However, the formation of these minerals is still a mystery. We have yet to identify any gene that could enable bacteria to produce them without oxygen. The function of genes can be predicted with algorithms used to annotate genomes after whole genome sequencing, and proven with laboratory experiments that show chemically how proteins work within cells [4, 5]. We have full genome sequences of both the manganese oxide producing bacteria, but because anaerobic manganese oxidation is a novel process, there are no known protein sequences to which we can compare the new sequences. The functional gene annotation produced by current algorithms helps narrow down protein types, but none of the annotations are specific enough to identify genes that might be involved in manganese oxidation. Without a way to narrow down the possibilities, it is difficult to know where to start with laboratory experiments. Machine learning has the potential to provide a solution: a more specific computational tool to differentiate between specific protein functions. To be successful, such a tool would need to be more precise than the current protein differentiation algorithms and at the same time, require less training data. This requires simpler machine learning algorithms, more careful treatment of the data to avoid adding bias, and the ability to biologically interpret the results on the level of individual genes. The biological background of the project will be discussed in Chapter Two. The machine learning required for such a tool is complex, and is explored in detail in Chapter Three, as a prelude to the main work of this thesis: the creation of a machine learning tool like the one described above, called the Woolf Classifier Building Pipeline. The Woolf Pipeline, described in Chapter Four, allows researchers to collect sequence data on genes with and without a given function, choose either a � Nearest Neighbor (kNN) or random forest classification algorithm, and predict the functionality of novel protein sequences. Users can select the type of scaling used on the input dataset, the accuracy metric used to assess the classification ability of the models, and give hyperparameter ranges to the tool that will be tested with cross-validation. The Woolf Pipeline user manual will guide users through these decisions to make sure that the pipeline is accessible even to researchers who do not have an extensive machine learning background (Appendix A). 1 An example set of models created by the Woolf Pipeline is presented in Chapter Five. These models show that Woolf-built classifiers are effective at differentiating between classes of �-lactamase antibiotic resistance genes with both kNN and random forest based classification. Both algorithms are feature based; they model the classes using amino acid percentage composition for each of the 20 amino acids and protein length for a total of 21 machine learning features. The default parameters are tuned for the problem of protein differentiation; Min-Max scaling for the kNN algorithm combats the sparsity of the amino acid composition data, and the MCC accuracy metric deals with the small dataset size and unbalanced class distributions. All four example classifiers with default Min-Max scaling achieved Matthews correlation coefficient (MCC) accuracies of over 0.8, better than any previously published model [6]. Most broadly, this thesis is an investigation into the practicality of using machine learning to create simple classification models to understand the function of biological proteins.
Recommended publications
  • Micrornas As New Regulators of Neutrophil Extracellular Trap Formation
    International Journal of Molecular Sciences Review MicroRNAs as New Regulators of Neutrophil Extracellular Trap Formation Sonia Águila † , Ascensión M. de los Reyes-García †, María P. Fernández-Pérez, Laura Reguilón-Gallego, Laura Zapata-Martínez, Inmaculada Ruiz-Lorente, Vicente Vicente , Rocío González-Conejero *,‡ and Constantino Martínez *,‡ Department of Hematology and Medical Oncology, Morales Meseguer University Hospital, Centro Regional de Hemodonación, Universidad de Murcia, IMIB, C/Ronda de Garay S/N, 30003 Murcia, Spain; [email protected] (S.Á.); [email protected] (A.M.d.l.R.-G.); [email protected] (M.P.F.-P.); [email protected] (L.R.-G.); [email protected] (L.Z.-M.); [email protected] (I.R.-L.); [email protected] (V.V.) * Correspondence: [email protected] (R.G.-C.); [email protected] (C.M.); Tel.: +34-968341990 (R.G.-C. & C.M.); Fax: +34-968261914 (R.G.-C. & C.M.) † These authors contributed equally to this work. ‡ These authors shared senior authorship. Abstract: Neutrophil extracellular traps (NETs) are formed after neutrophils expelled their chromatin content in order to primarily capture and eliminate pathogens. However, given their characteristics due in part to DNA and different granular proteins, NETs may induce a procoagulant response linking inflammation and thrombosis. Unraveling NET formation molecular mechanisms as well Citation: Águila, S.; de los as the intracellular elements that regulate them is relevant not only for basic knowledge but also Reyes-García, A.M.; Fernández-Pérez, to design diagnostic and therapeutic tools that may prevent their deleterious effects observed in M.P.; Reguilón-Gallego, L.; several inflammatory pathologies (e.g., cardiovascular and autoimmune diseases, cancer).
    [Show full text]
  • Prokaryotic Origins of the Non-Animal Peroxidase Superfamily and Organelle-Mediated Transmission to Eukaryotes
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Genomics 89 (2007) 567–579 www.elsevier.com/locate/ygeno Prokaryotic origins of the non-animal peroxidase superfamily and organelle-mediated transmission to eukaryotes Filippo Passardi a, Nenad Bakalovic a, Felipe Karam Teixeira b, Marcia Margis-Pinheiro b,c, ⁎ Claude Penel a, Christophe Dunand a, a Laboratory of Plant Physiology, University of Geneva, Quai Ernest-Ansermet 30, CH-1211 Geneva 4, Switzerland b Department of Genetics, Institute of Biology, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil c Department of Genetics, Federal University of Rio Grande do Sul, Rio Grande do Sul, Brazil Received 16 June 2006; accepted 18 January 2007 Available online 13 March 2007 Abstract Members of the superfamily of plant, fungal, and bacterial peroxidases are known to be present in a wide variety of living organisms. Extensive searching within sequencing projects identified organisms containing sequences of this superfamily. Class I peroxidases, cytochrome c peroxidase (CcP), ascorbate peroxidase (APx), and catalase peroxidase (CP), are known to be present in bacteria, fungi, and plants, but have now been found in various protists. CcP sequences were detected in most mitochondria-possessing organisms except for green plants, which possess only ascorbate peroxidases. APx sequences had previously been observed only in green plants but were also found in chloroplastic protists, which acquired chloroplasts by secondary endosymbiosis. CP sequences that are known to be present in prokaryotes and in Ascomycetes were also detected in some Basidiomycetes and occasionally in some protists.
    [Show full text]
  • The C718T Polymorphism in the 3″-Untranslated
    Hypertension Research (2012) 35, 507–512 & 2012 The Japanese Society of Hypertension All rights reserved 0916-9636/12 www.nature.com/hr ORIGINAL ARTICLE The C718T polymorphism in the 3¢-untranslated region of glutathione peroxidase-4 gene is a predictor of cerebral stroke in patients with essential hypertension Alexey V Polonikov1, Ekaterina K Vialykh2, Mikhail I Churnosov3, Thomas Illig4, Maxim B Freidin5, Oksana V Vasil¢eva1, Olga Yu Bushueva1, Valentina N Ryzhaeva1, Irina V Bulgakova1 and Maria A Solodilova1 In the present study we have investigated the association of three single nucleotide polymorphisms in glutathione peroxidase (GPx) genes GPX1 rs1050450 (P198L), GPX3 rs2070593 (G930A) and GPX4 rs713041 (T718C) with the risk of cerebral stroke (CS) in patients with essential hypertension (EH). A total of 667 unrelated EH patients of Russian origin, including 306 hypertensives (the EH–CS group) who suffered from CS and 361 people (the EH–CS group) who did not have cerebrovascular accidents, were enrolled in the study. The variant allele 718C of the GPX4 gene was found to be significantly associated with an increased risk of CS in hypertensive patients (odds ratio (OR) 1.53, 95% confidence interval (CI) 1.23–1.90, Padj¼0.0003). The prevalence of the 718TC and 718CC genotypes of the GPX4 gene was higher in the EH–CS group than the EH-alone group (OR¼2.12, 95%CI 1.42–3.16, Padj¼0.0018). The association of the variant GPX4 genotypes with the increased risk of CS in hypertensives remained statistically significant after adjusting for confounding variables such as sex, body mass index (BMI), blood pressure and antihypertensive medication use (OR¼2.18, 95%CI 1.46–3.27, P¼0.0015).
    [Show full text]
  • Bioresources.Com
    PEER-REVIEWED REVIEW ARTICLE bioresources.com BIOLOGICAL PRETREATMENT OF LIGNOCELLULOSES WITH WHITE-ROT FUNGI AND ITS APPLICATIONS: A REVIEW a,b c, d Isroi, Ria Millati, * Siti Syamsiah, Claes Niklasson,b Muhammad Nur Cahyanto,c f Knut Lundquist,e and Mohammad J. Taherzadeh Lignocellulosic carbohydrates, i.e. cellulose and hemicellulose, have abundant potential as feedstock for production of biofuels and chemicals. However, these carbohydrates are generally infiltrated by lignin. Breakdown of the lignin barrier will alter lignocelluloses structures and make the carbohydrates accessible for more efficient bioconversion. White-rot fungi produce ligninolytic enzymes (lignin peroxidase, manganese peroxidase, and laccase) and efficiently mineralise lignin into CO2 and H2O. Biological pretreatment of lignocelluloses using white-rot fungi has been used for decades for ruminant feed, enzymatic hydrolysis, and biopulping. Application of white-rot fungi capabilities can offer environmentally friendly processes for utilising lignocelluloses over physical or chemical pretreatment. This paper reviews white-rot fungi, ligninolytic enzymes, the effect of biological pretreatment on biomass characteristics, and factors affecting biological pretreatment. Application of biological pretreatment for enzymatic hydrolysis, biofuels (bioethanol, biogas and pyrolysis), biopulping, biobleaching, animal feed, and enzymes production are also discussed. Keywords: Biological pretreatment; White-rot fungi; Lignocellulose; Solid-state fermentation; Lignin peroxidase;
    [Show full text]
  • Textile Dye Biodecolorization by Manganese Peroxidase: a Review
    molecules Review Textile Dye Biodecolorization by Manganese Peroxidase: A Review Yunkang Chang 1,2, Dandan Yang 2, Rui Li 2, Tao Wang 2,* and Yimin Zhu 1,* 1 Institute of Environmental Remediation, Dalian Maritime University, Dalian 116026, China; [email protected] 2 The Lab of Biotechnology Development and Application, School of Biological Science, Jining Medical University, No. 669 Xueyuan Road, Donggang District, Rizhao 276800, China; [email protected] (D.Y.); [email protected] (R.L.) * Correspondence: [email protected] (T.W.); [email protected] (Y.Z.); Tel.: +86-063-3298-3788 (T.W.); +86-0411-8472-6992 (Y.Z.) Abstract: Wastewater emissions from textile factories cause serious environmental problems. Man- ganese peroxidase (MnP) is an oxidoreductase with ligninolytic activity and is a promising biocatalyst for the biodegradation of hazardous environmental contaminants, and especially for dye wastewater decolorization. This article first summarizes the origin, crystal structure, and catalytic cycle of MnP, and then reviews the recent literature on its application to dye wastewater decolorization. In addition, the application of new technologies such as enzyme immobilization and genetic engineering that could improve the stability, durability, adaptability, and operating costs of the enzyme are highlighted. Finally, we discuss and propose future strategies to improve the performance of MnP-assisted dye decolorization in industrial applications. Keywords: manganese peroxidase; biodecolorization; dye wastewater; immobilization; recombi- nant enzyme Citation: Chang, Y.; Yang, D.; Li, R.; Wang, T.; Zhu, Y. Textile Dye Biodecolorization by Manganese Peroxidase: A Review. Molecules 2021, 26, 4403. https://doi.org/ 1. Introduction 10.3390/molecules26154403 The textile industry produces large quantities of wastewater containing different types of dyes used during the dyeing process, which cause great harm to the environment [1,2].
    [Show full text]
  • Oxidative Bio-Desulfurization by Nanostructured Peroxidase Mediator System
    catalysts Article Oxidative Bio-Desulfurization by Nanostructured Peroxidase Mediator System Eliana Capecchi 1,*, Davide Piccinino 1, Bruno Mattia Bizzarri 1, Lorenzo Botta 1 , Marcello Crucianelli 2 and Raffaele Saladino 1,* 1 Department of Biological and Ecological Sciences, University of Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy; [email protected] (D.P.); [email protected] (B.M.B.); [email protected] (L.B.) 2 Department of Physical and Chemical Sciences, University of L’Aquila, Via Vetoio I, Coppito, 67100 L’Aquila, Italy; [email protected] * Correspondence: [email protected] (E.C.); [email protected] (R.S.) Received: 21 February 2020; Accepted: 6 March 2020; Published: 9 March 2020 Abstract: Bio-desulfurization is an efficient technology for removing recalcitrant sulfur derivatives from liquid fuel oil in environmentally friendly experimental conditions. In this context, the development of heterogeneous bio-nanocatalysts is of great relevance to improve the performance of the process. Here we report that lignin nanoparticles functionalized with concanavalin A are a renewable and efficient platform for the layer-by-layer immobilization of horseradish peroxidase. The novel bio-nanocatalysts were applied for the oxidation of dibenzothiophene as a well-recognized model of the recalcitrant sulfur derivative. The reactions were performed with hydrogen peroxide as a green primary oxidant in the biphasic system PBS/n-hexane at 45 ◦C and room pressure, the highest conversion of the substrate occurring in the presence of cationic polyelectrolyte layer and hydroxy-benzotriazole as a low molecular weight redox mediator. The catalytic activity was retained for more transformations highlighting the beneficial effect of the support in the reusability of the heterogeneous system.
    [Show full text]
  • Characterization of Lignin-Modifying Enzymes of the Selective White-Rot
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Helsingin yliopiston digitaalinen arkisto Characterization of the lignin-modifying enzymes of the selective white-rot fungus Physisporinus rivulosus Terhi K. Hakala Department of Applied Chemistry and Microbiology Division of Microbiology Faculty of Agriculture and Forestry and Viikki Graduate School in Biosciences University of Helsinki Academic dissertation To be presented, with the permission of the Faculty of Agriculture and Forestry of the University of Helsinki, for public criticism in Auditorium 1, Viikki Infocenter, Viikinkaari 11, on October 19th 2007 at 12 o’clock noon. Helsinki 2007 Supervisor: Professor Annele Hatakka Department of Applied Chemistry and Microbiology University of Helsinki Finland Reviewers: Docent Kristiina Kruus VTT Espoo, Finland Professor Martin Hofrichter Environmental Biotechnology International Graduate School Zittau, Germany Opponent: Professor Kurt Messner Institute of Chemical Engineering Vienna University of Technology Wien, Austria ISSN 1795-7079 ISBN 978-952-10-4172-3 (paperback) ISBN 978-952-10-4173-0 (PDF) Helsinki University Printing House Helsinki 2007 Cover photo: Spruce wood chips (KCL, Lea Kurlin) 2 Contents Contents..................................................................................................................................................3 List of original publications....................................................................................................................4
    [Show full text]
  • Lignocellulolytic Enzyme Production from Wood Rot Fungi Collected in Chiapas, Mexico, and Their Growth on Lignocellulosic Material
    Journal of Fungi Article Lignocellulolytic Enzyme Production from Wood Rot Fungi Collected in Chiapas, Mexico, and Their Growth on Lignocellulosic Material Lina Dafne Sánchez-Corzo 1 , Peggy Elizabeth Álvarez-Gutiérrez 1 , Rocío Meza-Gordillo 1 , Juan José Villalobos-Maldonado 1, Sofía Enciso-Pinto 2 and Samuel Enciso-Sáenz 1,* 1 National Technological of Mexico-Technological Institute of Tuxtla Gutiérrez, Carretera Panamericana, km. 1080, Boulevares, C.P., Tuxtla Gutiérrez 29050, Mexico; [email protected] (L.D.S.-C.); [email protected] (P.E.Á.-G.); [email protected] (R.M.-G.); [email protected] (J.J.V.-M.) 2 Institute of Biomedical Research, National Autonomous University of Mexico, Circuito, Mario de La Cueva s/n, C.U., Coyoacán, México City 04510, Mexico; sofi[email protected] * Correspondence: [email protected]; Tel.: +52-96-150461 (ext. 304) Abstract: Wood-decay fungi are characterized by ligninolytic and hydrolytic enzymes that act through non-specific oxidation and hydrolytic reactions. The objective of this work was to evaluate the production of lignocellulolytic enzymes from collected fungi and to analyze their growth on lignocellulosic material. The study considered 18 species isolated from collections made in the state Citation: Sánchez-Corzo, L.D.; of Chiapas, Mexico, identified by taxonomic and molecular techniques, finding 11 different families. Álvarez-Gutiérrez, P.E.; The growth rates of each isolate were obtained in culture media with African palm husk (PH), coffee Meza-Gordillo, R.; husk (CH), pine sawdust (PS), and glucose as control, measuring daily growth with images analyzed Villalobos-Maldonado, J.J.; in ImageJ software, finding the highest growth rate in the CH medium.
    [Show full text]
  • Bioaugmentation with Vaults: Novel in Situ Remediation Strategy for Transformation of Perfluoroalkyl Compounds
    FINAL REPORT Bioaugmentation with Vaults: Novel In Situ Remediation Strategy for Transformation of Perfluoroalkyl Compounds SERDP Project ER-2422 JANUARY 2016 Shaily Mahendra, PhD Leonard H. Rome, PhD Valerie A. Kickhoefer, PhD Meng Wang, MS University of California, Los Angeles Distribution Statement A Page Intentionally Left Blank This report was prepared under contract to the Department of Defense Strategic Environmental Research and Development Program (SERDP). The publication of this report does not indicate endorsement by the Department of Defense, nor should the contents be construed as reflecting the official policy or position of the Department of Defense. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily constitute or imply its endorsement, recommendation, or favoring by the Department of Defense. Page Intentionally Left Blank Form ApprowJd REPORTDOCUMENTATJONPAGE 0MB No.070UJ1BB The publicror repartlngbunlen 1h11 C0lec:liDIIlnf0rmat100 af Is estlmaled ID a-age1haurpr.-flllPIIIIN,lncludlng ror lhelime nlYillwlng lrwlructions, .-dq alstlng dalll -. gathering needed,and makltakmglhedata and camplrl&lgilformalbL andflMIIIWIIIthec:dledl0nof Send aimmenll '9g8l'dlng thisbunlenestlmaleo,anyat!.- asped rl llisi1bmallcx,, C0lllldi0ncf Including suggestiarmDepar".mr11d farl'llduc:lnglheburden,ID ot Defame,Semcm, W8stlinglDl'IHwlqmr1ln Dinlclaralll lnfannatian far Operltiara(0704-0188}, andRllparta 1215 JefferlonD11'111 Highway, Sulla 1204+ ArllnglDn.VA 22202-<1302.. Rmp011de,itastia.lld be awm9 thatnalwlltlaandlnganyDlhlll' provillan at law, 1111 pr,rsan lhall be IUbjr,c:ta ID8l'lfpenallyfarfallinglocarnplywllhacalecllar!afblfor11iiilkx,If dotil nat dilplay0MB acumtnllyvalid c:anlnll number. PLEASENOT DO RETURNYOUR FORMTO TIIE ABOVE ADDRESS, 1. REPORTDATE (DD-MM-YVVY) REPORTTYPE 3. DATES COVERED(From - To) 15/01/2016 12.Technical 20/08/2014 - 20/08/2016 4.
    [Show full text]
  • Characterization of Cytosolic Glutathione Peroxidase And
    Aquatic Toxicology 130–131 (2013) 97–111 Contents lists available at SciVerse ScienceDirect Aquatic Toxicology jou rnal homepage: www.elsevier.com/locate/aquatox Characterization of cytosolic glutathione peroxidase and phospholipid-hydroperoxide glutathione peroxidase genes in rainbow trout (Oncorhynchus mykiss) and their modulation by in vitro selenium exposure a a b a d c a,∗ D. Pacitti , T. Wang , M.M. Page , S.A.M. Martin , J. Sweetman , J. Feldmann , C.J. Secombes a Scottish Fish Immunology Research Centre, Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen AB24 2TZ, United Kingdom b Integrative and Environmental Physiology, Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen AB24 2TZ, United Kingdom c Trace Element Speciation Laboratory, Department of Chemistry, University of Aberdeen, Aberdeen AB24 3UE, United Kingdom d Alltech Biosciences Centre, Sarney, Summerhill Rd, Dunboyne, Country Meath, Ireland a r t i c l e i n f o a b s t r a c t Article history: Selenium (Se) is an oligonutrient with both essential biological functions and recognized harmful effects. Received 4 July 2012 As the selenocysteine (SeCys) amino acid, selenium is integrated in several Se-containing proteins Received in revised form (selenoproteins), many of which are fundamental for cell homeostasis. Nevertheless, selenium may exert 19 December 2012 toxic effects at levels marginally above those required, mainly through the generation of reactive oxygen Accepted 20 December 2012 species (ROS). The selenium chemical speciation can strongly affect the bioavailability of this metal and its impact on metabolism, dictating the levels that can be beneficial or detrimental towards an organism.
    [Show full text]
  • Molecular Characterization, Protein–Protein Interaction Network, and Evolution of Four Glutathione Peroxidases from Tetrahymena Thermophila
    antioxidants Article Molecular Characterization, Protein–Protein Interaction Network, and Evolution of Four Glutathione Peroxidases from Tetrahymena thermophila Diana Ferro 1,2, Rigers Bakiu 3 , Sandra Pucciarelli 4, Cristina Miceli 4 , Adriana Vallesi 4 , Paola Irato 5 and Gianfranco Santovito 5,* 1 BIO5 Institute, University of Arizona, Tucson, AZ 85719, USA; [email protected] 2 Department of Pediatrics, Children’s Mercy Hospital and Clinics, Kansas City, MO 64108, USA 3 Department of Aquaculture and Fisheries, Agricultural University of Tirana, 1000 Tiranë, Albania; [email protected] 4 School of Biosciences and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy; [email protected] (S.P.); [email protected] (C.M.); [email protected] (A.V.) 5 Department of Biology, University of Padova, 35131 Padova, Italy; [email protected] * Correspondence: [email protected] Received: 6 September 2020; Accepted: 1 October 2020; Published: 2 October 2020 Abstract: Glutathione peroxidases (GPxs) form a broad family of antioxidant proteins essential for maintaining redox homeostasis in eukaryotic cells. In this study, we used an integrative approach that combines bioinformatics, molecular biology, and biochemistry to investigate the role of GPxs in reactive oxygen species detoxification in the unicellular eukaryotic model organism Tetrahymena thermophila. Both phylogenetic and mechanistic empirical model analyses provided indications about the evolutionary relationships among the GPXs of Tetrahymena and the orthologous enzymes of phylogenetically related species. In-silico gene characterization and text mining were used to predict the functional relationships between GPxs and other physiologically-relevant processes. The GPx genes contain conserved transcriptional regulatory elements in the promoter region, which suggest that transcription is under tight control of specialized signaling pathways.
    [Show full text]
  • Extracellular Glutathione Peroxidase Gpx3 and Its Role in Cancer
    cancers Review Extracellular Glutathione Peroxidase GPx3 and Its Role in Cancer Caroline Chang 1, Beth L. Worley 2,Rébécca Phaëton 3 and Nadine Hempel 2,* 1 Department of Comparative Medicine, Penn State University College of Medicine, Hershey, PA 17033, USA; [email protected] 2 Department of Pharmacology, Penn State University College of Medicine, Hershey, PA 17033, USA; [email protected] 3 Department of Obstetrics & Gynecology & Department of Microbiology and Immunology, Penn State University College of Medicine, Hershey, PA 17033, USA; [email protected] * Correspondence: [email protected]; Tel.: +1-717-531-4037 Received: 29 June 2020; Accepted: 4 August 2020; Published: 6 August 2020 Abstract: Mammalian cells possess a multifaceted antioxidant enzyme system, which includes superoxide dismutases, catalase, the peroxiredoxin/thioredoxin and the glutathione peroxidase systems. The dichotomous role of reactive oxygen species and antioxidant enzymes in tumorigenesis and cancer progression complicates the use of small molecule antioxidants, pro-oxidants, and targeting of antioxidant enzymes as therapeutic approaches for cancer treatment. It also highlights the need for additional studies to investigate the role and regulation of these antioxidant enzymes in cancer. The focus of this review is on glutathione peroxidase 3 (GPx3), a selenoprotein, and the only extracellular GPx of a family of oxidoreductases that catalyze the detoxification of hydro- and soluble lipid hydroperoxides by reduced glutathione. In addition to summarizing the biochemical function, regulation, and disease associations of GPx3, we specifically discuss the role and regulation of systemic and tumor cell expressed GPx3 in cancer. From this it is evident that GPx3 has a dichotomous role in different tumor types, acting as both a tumor suppressor and pro-survival protein.
    [Show full text]