The University of Chicago Machine Learning for The

Total Page:16

File Type:pdf, Size:1020Kb

The University of Chicago Machine Learning for The THE UNIVERSITY OF CHICAGO MACHINE LEARNING FOR THE GENOTYPE TO PHENOTYPE PROBLEM A DISSERTATION SUBMITTED TO THE FACULTY OF THE DIVISION OF THE PHYSICAL SCIENCES IN CANDIDACY FOR THE DEGREE OF DOCTORATE OF PHILOSOPHY DEPARTMENT OF COMPUTER SCIENCE BY JOHN WILLIAM SANTERRE CHICAGO, ILLINOIS DRAFTDECEMBER 2017 Copyright c 2017 by John William Santerre DRAFTAll Rights Reserved TABLE OF CONTENTS LIST OF FIGURES . v LIST OF TABLES . vii ACKNOWLEDGMENTS . x ABSTRACT . xi INTRODUCTION . xii I GENOTYPE TO PHENOTYPE: ANTIMICROBIAL RESISTANCE 1 OVERVIEW . 2 2 DETECTING ANTIMICROBIAL RESISTANCE IN THE LAB . 6 2.1 Biological concepts . .6 2.2 AMR Protocol . .8 3 ANTIMICROBIAL RESISTANCE AS A SUPERVISED MACHINE LEARNING PROBLEM . 10 3.1 k-mer Notation . 12 3.1.1 Classification Using k-mer Matrices . 17 3.2 Model Selection and Tuning . 19 3.3 Model Considerations . 20 3.3.1 Hyperparameter Selection . 21 3.4 Metrics . 22 3.4.1 Classification performance . 22 3.4.2 Feature Importance Calculation . 23 3.4.3 Gene Region Identification . 24 3.5 Model Comparison . 25 3.5.1 Naive Bayes . 25 3.5.2 AdaBoost . 26 3.5.3 Logistic Regression . 27 3.5.4 Support Vector Machines . 27 3.5.5 Random Forest . 27 4 ANALYSIS . 31 4.1 Data Sets . 31 4.1.1 Acinetobacter baumannii . 31 4.1.2 Staphylococcus aureus . 34 4.1.3 Mycobacterium tuberculosis . 34 4.1.4 Klebsiella pneumoniae . 35 DRAFTiii 4.2 Classifier Comparison . 36 4.2.1 Classifier Performance . 36 4.2.2 Speed . 39 4.3 Random Forest . 39 4.3.1 RF subtrees . 46 4.3.2 RF on subsets of the data . 47 4.4 Feature Importance Calculation . 49 4.4.1 Biological relevance . 49 4.4.2 Feature Importance Stability . 54 5 COMPRESSED MATRIX FORMULATION . 59 5.1 Compressed Matrix Construction . 59 5.2 Experiments . 63 A SUPPLEMENTARY MATERIAL . 70 A.1 AMR: Classification . 70 A.2 AMR: Gene Stability . 86 A.3 AMR: Code . 101 A.4 PGR . 103 DRAFTiv LIST OF FIGURES 3.1 Example dataset from which the k-mer matrix in (3.1) is constructed. SUS genomes are labelled 0. RES genomes are labelled 1 to denote the presence of a mutation conferring resistance to a specific antibiotic. 12 3.2 Schematic of the AMR phenotype classification workflow using k-mers. 16 3.3 An example RF tree. 28 4.1 Overview of the A. baumannii dataset. The k-mer size (shown on the x axis) affects different metrics of the k-mer matrix identically. 32 4.2 Histogram of the entries in the k-mer matrix for the (a) A. baumannii and (b) S. aureus dataset. Note that increasing the size k can have a dramatic effect on the distribution of entries of the k-mer matrix. 33 4.3 Overview of the A. baumannii dataset. The k-mer size (shown on the x axis) affects different metrics of the k-mer matrix identically. 34 4.4 ROC curves for classifiers trained on each of the A. baumannii and S. aureus datasets (k = 15). 39 4.5 ROC curves for classifiers trained on each of the M. tuberculosis datasets listed in Table 4.1. 40 4.6 ROC curves for different classifiers trained on the A. baumannii dataset. Each ROC curve corresponds to a different size k..................... 41 4.7 ROC curves for different classifiers trained on the S. aureus dataset. Each ROC curve corresponds to a different size k........................ 42 4.8 ROC curves for different classifiers trained on the binary A. baumannii dataset. Each ROC curve corresponds to a different size k.................. 43 4.9 ROC curves for different classifiers trained on the binary S. aureus dataset. Each ROC curve corresponds to a different size k..................... 44 4.10 Accuracy of different classifiers on the (binary) A. baumannii and S. aureus datasets. Additional performance metrics can be found in the supplementary Tables A.1, A.2, A.3 and A.4 in Appendix A.1. 45 4.11 Execution times for different classifiers ran on the A. baumannii dataset. NB denotes Naive Bayes. 46 4.12 ROC curves for the RF classifier trained on the K. pneumoniae dataset. 46 4.13 ROC curves for the RF classifier trained with different number of trees on each of the A. baumannii and S. aureus datasets. 48 4.14 ROC curves for the RF classifier trained on training sets of increasing sizes. In particular, RF was trained on subsets of the M. tuberculosis rifampicin and streptomycin datasets from Table 4.1. The subset size denotes the total number of isolates subsampled from each dataset in each experiment (with number of RES and SUS isolates equal to n=2 in each case). 49 4.15 Plot of the ranking (y axis) of each PEG (x axis) by the aggregate score computed by A1, as described in the text. Additional information about the PEGs can be found in supplementary Tables A.12 and A.11 in Appendix A.2. 51 DRAFTv 4.16 Plot of the ranking (y axis) of each PEG (x axis) by the aggregate score computed by A2, as described in the text. Additional information about the PEGs can be found in supplementary Tables A.13 and A.14 in Appendix A.2. 52 4.17 k-mer ranking according to the feature importance computed by RF on the full or the compressed k-mer matrix. Collapsing by column ID refers to calculating the feature importance separately on the full k-mer matrix and then summing the feature importance for all columns that have an identical column identity in the k-mer matrix. 58 5.1 Python pseudocode for constructing a k-mer matrix. 60 5.2 Pseudocode for constructing a compressed k-mer matrix. 62 5.3 Overview of the compressed A. baumannii and S. aureus datasets. Similarly to the full k-mer matrix case (see Figures 4.1 and 4.3), the k-mer size (shown on the x axis) affects different metrics of the k-mer matrix identically. 64 5.4 Accuracy of different classifiers on the compressed (binary) A. baumannii and S. aureus datasets. Additional performance metrics can be found in the supplemen- tary Tables A.5, A.6, A.7 and A.8 in Appendix A.1. 65 5.5 ROC curves for different classifiers trained on the compressed A. baumannii dataset. Each ROC curve corresponds to a different size k............. 66 5.6 ROC curves for different classifiers trained on the compressed S. aureus dataset. Each ROC curve corresponds to a different size k.................. 67 5.7 ROC curves for different classifiers trained on the compressed binary A. bauman- nii dataset. Each ROC curve corresponds to a different size k........... 68 5.8 ROC curves for different classifiers trained on the compressed binary S. aureus dataset. Each ROC curve corresponds to a different size k............. 69 DRAFTvi LIST OF TABLES 4.1 List of the antibiotics used for the M. tuberculosis isolates. For each antibiotic we list the total number of isolates used for classification with the specific number of resistant and susceptible isolates listed in the fourth and fifth column, respectively. Additional details about the dataset can be found in (Davis et al., 2016). The last column shows the number of features when the k = 15. 35 4.2 List of antibiotics used for the K. pneumoniae isolates. For each antibiotic we list the total number of isolates which have a labelled (i.e., resistant or susceptible) for that antibiotic. The specific number of resistant and susceptible isolates are listed in the fourth and fifth column, respectively. 36 4.3 Classifier comparison on the A. baumannii dataset (classification of resistance to carbapenem) and the S. aureus dataset (classification of resistance to methicillin). 37 4.4 Classifier comparison on each of the seven M. tuberculosis datasets listed in Table 4.1............................................ 38 4.5 Comparison of RF performance on the K. pneumoniae dataset. 47 4.6 Comparison of RF classifier trained with different number of trees on the A. baumannii dataset (classification of resistance to carbapenem) and the S. aureus dataset (classification of resistance to methicillin). 48 4.7 Comparison of RF performance on training sets of increasing sizes. In particular, we trained RF on subsets of the M. tuberculosis rifampicin and streptomycin datasets from Table 4.1 . The subset size n denotes the total number of isolates subsampled from each dataset in each experiment (with number of RES and SUS isolates equal to n=2 in each case). All statistics are averaged over three independent runs, on each one of them we performed 5-fold cross-validation. 50 4.8 The top 20 k-mer matrix features computed by training RF on the k-mer matrix of the M. tuberculosis rifampicin dataset. Each classifier consists of 10 trees. FI denotes feature importance. [Change col index?]................. 54 4.9 The top 20 k-mer matrix features computed by training RF on the k-mer matrix of the M. tuberculosis rifampicin dataset. Each classifier consists of 1000 trees. FI denotes feature importance. [Change col index?]................ 55 4.10 The top 20 k-mer matrix features computed by training RF on the compressed k-mer matrix of the M. tuberculosis rifampicin dataset. Each classifier consists of 10 trees. FI denotes feature importance. 56 4.11 The top 20 k-mer matrix features computed by training RF on the compressed k-mer matrix of the M. tuberculosis rifampicin dataset. Each classifier consists of 1000 trees. FI denotes feature importance. 57 A.1 Performance of different classifiers on the A. baumannii dataset for different k- mer sizes.
Recommended publications
  • Table of Contents
    Table of contents SUMMARY IX ZUSAMMENFASSUNG XI 1. INTRODUCTION 1 1.1 The immune system 1 1.1.1 Function 1 1.1.2 Overview 1 1.1.3 Function of MHC molecules 1 1.1.4 The Major Histocompatibility Complex (MHC) Genes 2 1.1.5 Structure of MHC molecules 2 1.2 Antigen presentation 5 1.2.1 Outline of the conventional MHC class I pathway 5 1.3 The components of the peptide loading complex 7 1.3.1 Transporter associated with antigen presentation (TAP) 7 1.3.2 Calreticulin and calnexin 9 1.3.4 ERp 57/ER60 10 1.3.5 Tapasin (Tpn) 11 1.4 (a) The proteasomes 12 1.4 (b) Cytosolic peptidase 15 1.5 Endoplasmic Reticulum Associated Peptidase/s 16 1.6 Protein sorting 17 1.6.2 Vesicular transport in the Golgi apparatus 20 1.6.3 Endocytic pathway 20 1.6.4 Endocytic proteases 24 1.6.4.1 Biosynthesis and transport of cathepsins 25 1.6.4.2 Activation of cathepsins 26 1.6.4.3 Inhibitors of endosomal proteases. 26 1.7 Antigen processing by alternative pathways 27 1.8 Aim of the thesis 30 i 2. MATERIALS AND METHODS 31 2.1 Materials 31 2.1.1 Bacterial strains 31 2.1.2 Plasmids 31 2.1.2.1 Construction of plasmids 31 2.1.3 Primer list 32 2.1.4 PCR conditions 33 2.1.5 Antibiotics 34 2.1.6 Chemicals 34 2.1.7 Media for bacterial cells 34 2.1.8 Protein Chemicals 35 2.1.9 Animal cell culture media 37 2.1.11 Antibodies 39 2.1.11 siRNA oligonucleotides.
    [Show full text]
  • Articles Catalytic Cycling in Β-Phosphoglucomutase: a Kinetic
    9404 Biochemistry 2005, 44, 9404-9416 Articles Catalytic Cycling in â-Phosphoglucomutase: A Kinetic and Structural Analysis†,‡ Guofeng Zhang, Jianying Dai, Liangbing Wang, and Debra Dunaway-Mariano* Department of Chemistry, UniVersity of New Mexico, Albuquerque, New Mexico 87131-0001 Lee W. Tremblay and Karen N. Allen* Department of Physiology and Biophysics, Boston UniVersity School of Medicine, Boston, Massachusetts 02118-2394 ReceiVed March 26, 2005; ReVised Manuscript ReceiVed May 18, 2005 ABSTRACT: Lactococcus lactis â-phosphoglucomutase (â-PGM) catalyzes the interconversion of â-D-glucose 1-phosphate (â-G1P) and â-D-glucose 6-phosphate (G6P), forming â-D-glucose 1,6-(bis)phosphate (â- G16P) as an intermediate. â-PGM conserves the core domain catalytic scaffold of the phosphatase branch of the HAD (haloalkanoic acid dehalogenase) enzyme superfamily, yet it has evolved to function as a mutase rather than as a phosphatase. This work was carried out to identify the structural basis underlying this diversification of function. In this paper, we examine â-PGM activation by the Mg2+ cofactor, â-PGM activation by Asp8 phosphorylation, and the role of cap domain closure in substrate discrimination. First, the 1.90 Å resolution X-ray crystal structure of the Mg2+-â-PGM complex is examined in the context of + + previously reported structures of the Mg2 -R-D-galactose-1-phosphate-â-PGM, Mg2 -phospho-â-PGM, and Mg2+-â-glucose-6-phosphate-1-phosphorane-â-PGM complexes to identify conformational changes that occur during catalytic turnover. The essential role of Asp8 in nucleophilic catalysis was confirmed by demonstrating that the D8A and D8E mutants are devoid of catalytic activity.
    [Show full text]
  • Enzymatic Encoding Methods for Efficient Synthesis Of
    (19) TZZ__T (11) EP 1 957 644 B1 (12) EUROPEAN PATENT SPECIFICATION (45) Date of publication and mention (51) Int Cl.: of the grant of the patent: C12N 15/10 (2006.01) C12Q 1/68 (2006.01) 01.12.2010 Bulletin 2010/48 C40B 40/06 (2006.01) C40B 50/06 (2006.01) (21) Application number: 06818144.5 (86) International application number: PCT/DK2006/000685 (22) Date of filing: 01.12.2006 (87) International publication number: WO 2007/062664 (07.06.2007 Gazette 2007/23) (54) ENZYMATIC ENCODING METHODS FOR EFFICIENT SYNTHESIS OF LARGE LIBRARIES ENZYMVERMITTELNDE KODIERUNGSMETHODEN FÜR EINE EFFIZIENTE SYNTHESE VON GROSSEN BIBLIOTHEKEN PROCEDES DE CODAGE ENZYMATIQUE DESTINES A LA SYNTHESE EFFICACE DE BIBLIOTHEQUES IMPORTANTES (84) Designated Contracting States: • GOLDBECH, Anne AT BE BG CH CY CZ DE DK EE ES FI FR GB GR DK-2200 Copenhagen N (DK) HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI • DE LEON, Daen SK TR DK-2300 Copenhagen S (DK) Designated Extension States: • KALDOR, Ditte Kievsmose AL BA HR MK RS DK-2880 Bagsvaerd (DK) • SLØK, Frank Abilgaard (30) Priority: 01.12.2005 DK 200501704 DK-3450 Allerød (DK) 02.12.2005 US 741490 P • HUSEMOEN, Birgitte Nystrup DK-2500 Valby (DK) (43) Date of publication of application: • DOLBERG, Johannes 20.08.2008 Bulletin 2008/34 DK-1674 Copenhagen V (DK) • JENSEN, Kim Birkebæk (73) Proprietor: Nuevolution A/S DK-2610 Rødovre (DK) 2100 Copenhagen 0 (DK) • PETERSEN, Lene DK-2100 Copenhagen Ø (DK) (72) Inventors: • NØRREGAARD-MADSEN, Mads • FRANCH, Thomas DK-3460 Birkerød (DK) DK-3070 Snekkersten (DK) • GODSKESEN,
    [Show full text]
  • Structure of Human Aspartyl Aminopeptidase Complexed With
    Chaikuad et al. BMC Structural Biology 2012, 12:14 http://www.biomedcentral.com/1472-6807/12/14 RESEARCH ARTICLE Open Access Structure of human aspartyl aminopeptidase complexed with substrate analogue: insight into catalytic mechanism, substrate specificity and M18 peptidase family Apirat Chaikuad1, Ewa S Pilka1, Antonio De Riso2, Frank von Delft1, Kathryn L Kavanagh1, Catherine Vénien-Bryan2, Udo Oppermann1,3 and Wyatt W Yue1* Abstract Backround: Aspartyl aminopeptidase (DNPEP), with specificity towards an acidic amino acid at the N-terminus, is the only mammalian member among the poorly understood M18 peptidases. DNPEP has implicated roles in protein and peptide metabolism, as well as the renin-angiotensin system in blood pressure regulation. Despite previous enzyme and substrate characterization, structural details of DNPEP regarding ligand recognition and catalytic mechanism remain to be delineated. Results: The crystal structure of human DNPEP complexed with zinc and a substrate analogue aspartate-β- hydroxamate reveals a dodecameric machinery built by domain-swapped dimers, in agreement with electron microscopy data. A structural comparison with bacterial homologues identifies unifying catalytic features among the poorly understood M18 enzymes. The bound ligands in the active site also reveal the coordination mode of the binuclear zinc centre and a substrate specificity pocket for acidic amino acids. Conclusions: The DNPEP structure provides a molecular framework to understand its catalysis that is mediated by active site loop swapping, a mechanism likely adopted in other M18 and M42 metallopeptidases that form dodecameric complexes as a self-compartmentalization strategy. Small differences in the substrate binding pocket such as shape and positive charges, the latter conferred by a basic lysine residue, further provide the key to distinguishing substrate preference.
    [Show full text]
  • Supplemental Material
    Supplemental Table B ARGs in alphabetical order Symbol Title 3 months 6 months 9 months 12 months 23 months ANOVA Direction Category 38597 septin 2 1557 ± 44 1555 ± 44 1579 ± 56 1655 ± 26 1691 ± 31 0.05219 up Intermediate 0610031j06rik kidney predominant protein NCU-G1 491 ± 6 504 ± 14 503 ± 11 527 ± 13 534 ± 12 0.04747 up Early Adult 1G5 vesicle-associated calmodulin-binding protein 662 ± 23 675 ± 17 629 ± 16 617 ± 20 583 ± 26 0.03129 down Intermediate A2m alpha-2-macroglobulin 262 ± 7 272 ± 8 244 ± 6 290 ± 7 353 ± 16 0.00000 up Midlife Aadat aminoadipate aminotransferase (synonym Kat2) 180 ± 5 201 ± 12 223 ± 7 244 ± 14 275 ± 7 0.00000 up Early Adult Abca2 ATP-binding cassette, sub-family A (ABC1), member 2 958 ± 28 1052 ± 58 1086 ± 36 1071 ± 44 1141 ± 41 0.05371 up Early Adult Abcb1a ATP-binding cassette, sub-family B (MDR/TAP), member 1A 136 ± 8 147 ± 6 147 ± 13 155 ± 9 185 ± 13 0.01272 up Midlife Acadl acetyl-Coenzyme A dehydrogenase, long-chain 423 ± 7 456 ± 11 478 ± 14 486 ± 13 512 ± 11 0.00003 up Early Adult Acadvl acyl-Coenzyme A dehydrogenase, very long chain 426 ± 14 414 ± 10 404 ± 13 411 ± 15 461 ± 10 0.01017 up Late Accn1 amiloride-sensitive cation channel 1, neuronal (degenerin) 242 ± 10 250 ± 9 237 ± 11 247 ± 14 212 ± 8 0.04972 down Late Actb actin, beta 12965 ± 310 13382 ± 170 13145 ± 273 13739 ± 303 14187 ± 269 0.01195 up Midlife Acvrinp1 activin receptor interacting protein 1 304 ± 18 285 ± 21 274 ± 13 297 ± 21 341 ± 14 0.03610 up Late Adk adenosine kinase 1828 ± 43 1920 ± 38 1922 ± 22 2048 ± 30 1949 ± 44 0.00797 up Early
    [Show full text]
  • Diagnosis, Treatment and Follow Up
    DOI: 10.1002/jimd.12024 REVIEW International clinical guidelines for the management of phosphomannomutase 2-congenital disorders of glycosylation: Diagnosis, treatment and follow up Ruqaiah Altassan1,2 | Romain Péanne3,4 | Jaak Jaeken3 | Rita Barone5 | Muad Bidet6 | Delphine Borgel7 | Sandra Brasil8,9 | David Cassiman10 | Anna Cechova11 | David Coman12,13 | Javier Corral14 | Joana Correia15 | María Eugenia de la Morena-Barrio16 | Pascale de Lonlay17 | Vanessa Dos Reis8 | Carlos R Ferreira18,19 | Agata Fiumara5 | Rita Francisco8,9,20 | Hudson Freeze21 | Simone Funke22 | Thatjana Gardeitchik23 | Matthijs Gert4,24 | Muriel Girad25,26 | Marisa Giros27 | Stephanie Grünewald28 | Trinidad Hernández-Caselles29 | Tomas Honzik11 | Marlen Hutter30 | Donna Krasnewich18 | Christina Lam31,32 | Joy Lee33 | Dirk Lefeber23 | Dorinda Marques-da-Silva9,20 | Antonio F Martinez34 | Hossein Moravej35 | Katrin Õunap36,37 | Carlota Pascoal8,9 | Tiffany Pascreau38 | Marc Patterson39,40,41 | Dulce Quelhas14,42 | Kimiyo Raymond43 | Peymaneh Sarkhail44 | Manuel Schiff45 | Małgorzata Seroczynska29 | Mercedes Serrano46 | Nathalie Seta47 | Jolanta Sykut-Cegielska48 | Christian Thiel30 | Federic Tort27 | Mari-Anne Vals49 | Paula Videira20 | Peter Witters50,51 | Renate Zeevaert52 | Eva Morava53,54 1Department of Medical Genetic, Montréal Children's Hospital, Montréal, Québec, Canada 2Department of Medical Genetic, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia 3Department of Human Genetics, KU Leuven, Leuven, Belgium 4LIA GLYCOLAB4CDG (International
    [Show full text]
  • Oxido-Reductive Regulation of Vascular Remodeling by Receptor Tyrosine Kinase ROS1
    Oxido-reductive regulation of vascular remodeling by receptor tyrosine kinase ROS1 Ziad A. Ali, … , Thomas Quertermous, Euan A. Ashley J Clin Invest. 2014;124(12):5159-5174. https://doi.org/10.1172/JCI77484. Research Article Angioplasty and stenting is the primary treatment for flow-limiting atherosclerosis; however, this strategy is limited by pathological vascular remodeling. Using a systems approach, we identified a role for the network hub gene glutathione peroxidase-1 (GPX1) in pathological remodeling following human blood vessel stenting. Constitutive deletion ofG px1 in atherosclerotic mice recapitulated this phenotype of increased vascular smooth muscle cell (VSMC) proliferation and plaque formation. In an independent patient cohort, gene variant pair analysis identified an interaction of GPX1 with the orphan protooncogene receptor tyrosine kinase ROS1. A meta-analysis of the only genome-wide association studies of human neointima-induced in-stent stenosis confirmed the association of the ROS1 variant with pathological remodeling. Decreased GPX1 expression in atherosclerotic mice led to reductive stress via a time-dependent increase in glutathione, corresponding to phosphorylation of the ROS1 kinase activation site Y2274. Loss of GPX1 function was associated with both oxidative and reductive stress, the latter driving ROS1 activity via s-glutathiolation of critical residues of the ROS1 tyrosine phosphatase SHP-2. ROS1 inhibition with crizotinib and deglutathiolation of SHP-2 abolished GPX1-mediated increases in VSMC proliferation while leaving endothelialization intact. Our results indicate that GPX1-dependent alterations in oxido-reductive stress promote ROS1 activation and mediate vascular remodeling. Find the latest version: https://jci.me/77484/pdf The Journal of Clinical Investigation RESEARCH ARTICLE Oxido-reductive regulation of vascular remodeling by receptor tyrosine kinase ROS1 Ziad A.
    [Show full text]
  • Supplementary Table S1. Table 1. List of Bacterial Strains Used in This Study Suppl
    Supplementary Material Supplementary Tables: Supplementary Table S1. Table 1. List of bacterial strains used in this study Supplementary Table S2. List of plasmids used in this study Supplementary Table 3. List of primers used for mutagenesis of P. intermedia Supplementary Table 4. List of primers used for qRT-PCR analysis in P. intermedia Supplementary Table 5. List of the most highly upregulated genes in P. intermedia OxyR mutant Supplementary Table 6. List of the most highly downregulated genes in P. intermedia OxyR mutant Supplementary Table 7. List of the most highly upregulated genes in P. intermedia grown in iron-deplete conditions Supplementary Table 8. List of the most highly downregulated genes in P. intermedia grown in iron-deplete conditions Supplementary Figures: Supplementary Figure 1. Comparison of the genomic loci encoding OxyR in Prevotella species. Supplementary Figure 2. Distribution of SOD and glutathione peroxidase genes within the genus Prevotella. Supplementary Table S1. Bacterial strains Strain Description Source or reference P. intermedia V3147 Wild type OMA14 isolated from the (1) periodontal pocket of a Japanese patient with periodontitis V3203 OMA14 PIOMA14_I_0073(oxyR)::ermF This study E. coli XL-1 Blue Host strain for cloning Stratagene S17-1 RP-4-2-Tc::Mu aph::Tn7 recA, Smr (2) 1 Supplementary Table S2. Plasmids Plasmid Relevant property Source or reference pUC118 Takara pBSSK pNDR-Dual Clonetech pTCB Apr Tcr, E. coli-Bacteroides shuttle vector (3) plasmid pKD954 Contains the Porpyromonas gulae catalase (4)
    [Show full text]
  • Serine Proteases with Altered Sensitivity to Activity-Modulating
    (19) & (11) EP 2 045 321 A2 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: (51) Int Cl.: 08.04.2009 Bulletin 2009/15 C12N 9/00 (2006.01) C12N 15/00 (2006.01) C12Q 1/37 (2006.01) (21) Application number: 09150549.5 (22) Date of filing: 26.05.2006 (84) Designated Contracting States: • Haupts, Ulrich AT BE BG CH CY CZ DE DK EE ES FI FR GB GR 51519 Odenthal (DE) HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI • Coco, Wayne SK TR 50737 Köln (DE) •Tebbe, Jan (30) Priority: 27.05.2005 EP 05104543 50733 Köln (DE) • Votsmeier, Christian (62) Document number(s) of the earlier application(s) in 50259 Pulheim (DE) accordance with Art. 76 EPC: • Scheidig, Andreas 06763303.2 / 1 883 696 50823 Köln (DE) (71) Applicant: Direvo Biotech AG (74) Representative: von Kreisler Selting Werner 50829 Köln (DE) Patentanwälte P.O. Box 10 22 41 (72) Inventors: 50462 Köln (DE) • Koltermann, André 82057 Icking (DE) Remarks: • Kettling, Ulrich This application was filed on 14-01-2009 as a 81477 München (DE) divisional application to the application mentioned under INID code 62. (54) Serine proteases with altered sensitivity to activity-modulating substances (57) The present invention provides variants of ser- screening of the library in the presence of one or several ine proteases of the S1 class with altered sensitivity to activity-modulating substances, selection of variants with one or more activity-modulating substances. A method altered sensitivity to one or several activity-modulating for the generation of such proteases is disclosed, com- substances and isolation of those polynucleotide se- prising the provision of a protease library encoding poly- quences that encode for the selected variants.
    [Show full text]
  • Proteomic Analysis of Ascocotyle Longa (Trematoda: Heterophyidae) T Metacercariae Karina M
    Molecular & Biochemical Parasitology 239 (2020) 111311 Contents lists available at ScienceDirect Molecular & Biochemical Parasitology journal homepage: www.elsevier.com/locate/molbiopara Proteomic analysis of Ascocotyle longa (Trematoda: Heterophyidae) T metacercariae Karina M. Rebelloa,c,*, Juliana N. Borgesb, André Teixeirac, Jonas Peralesc, Cláudia P. Santosb,** a Laboratório de Estudos Integrados em Protozoologia, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil b Laboratório de Avaliação e Promoção da Saúde Ambiental, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil c Laboratório de Toxinologia, Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil ARTICLE INFO ABSTRACT Keywords: Ascocotyle longa is parasitic trematode with wide distribution throughout America, Europe, Africa, and Middle Mugil liza East. Despite the fact that this fish-borne pathogen has been considered an agent of human heterophyiasis in Fish-born pathogen Brazil, the molecules involved in the host-parasite interaction remain unknown. The present study reports the Proteins proteome profile ofA. longa metacercariae collected from the fishMugil liza from Brazil. This infective stage for Parasite humans, mammals and birds was analyzed using nLC-MS/MS approach. We identified a large repertoire of Helminth proteins, which are mainly involved in energy metabolism and cell structure. Peptidases and immunogenic Heterophyiasis proteins were also identified, which might play roles in host-parasite interface. Our data provided unprecedented insights into the biology of A. longa and represent a first step to understand the natural host-parasite interaction. Moreover, as the first proteome characterized in this trematode, it will provide an important resource for future studies. 1. Introduction find their way into their final hosts [14].
    [Show full text]
  • Letters to Nature
    letters to nature Received 7 July; accepted 21 September 1998. 26. Tronrud, D. E. Conjugate-direction minimization: an improved method for the re®nement of macromolecules. Acta Crystallogr. A 48, 912±916 (1992). 1. Dalbey, R. E., Lively, M. O., Bron, S. & van Dijl, J. M. The chemistry and enzymology of the type 1 27. Wolfe, P. B., Wickner, W. & Goodman, J. M. Sequence of the leader peptidase gene of Escherichia coli signal peptidases. Protein Sci. 6, 1129±1138 (1997). and the orientation of leader peptidase in the bacterial envelope. J. Biol. Chem. 258, 12073±12080 2. Kuo, D. W. et al. Escherichia coli leader peptidase: production of an active form lacking a requirement (1983). for detergent and development of peptide substrates. Arch. Biochem. Biophys. 303, 274±280 (1993). 28. Kraulis, P.G. Molscript: a program to produce both detailed and schematic plots of protein structures. 3. Tschantz, W. R. et al. Characterization of a soluble, catalytically active form of Escherichia coli leader J. Appl. Crystallogr. 24, 946±950 (1991). peptidase: requirement of detergent or phospholipid for optimal activity. Biochemistry 34, 3935±3941 29. Nicholls, A., Sharp, K. A. & Honig, B. Protein folding and association: insights from the interfacial and (1995). the thermodynamic properties of hydrocarbons. Proteins Struct. Funct. Genet. 11, 281±296 (1991). 4. Allsop, A. E. et al.inAnti-Infectives, Recent Advances in Chemistry and Structure-Activity Relationships 30. Meritt, E. A. & Bacon, D. J. Raster3D: photorealistic molecular graphics. Methods Enzymol. 277, 505± (eds Bently, P. H. & O'Hanlon, P. J.) 61±72 (R. Soc. Chem., Cambridge, 1997).
    [Show full text]
  • The Reaction Mechanism of Phosphomannomutase in Plants
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Elsevier - Publisher Connector FEBS 18031 FEBS Letters 401 (1997) 35-37 The reaction mechanism of phosphomannomutase in plants Christine Oesterhelt, Claus Schnarrenberger, Wolfgang Gross* Institut für Pflanzenphysiologie und Mikrobiologie, Freie Universität Berlin, Königin-Luise-Str. 12-16a, D-14195 Berlin, Germany Received 11 November 1996 the presence of an excess of GIC-I.6-P2, purified PMM from G. sul- Abstract The enzyme phosphomannomutase catalyzes the phuraria, pig brain, and yeast was incubated with 1 mM GIC-I.6-P2 interconversion of mannose-1-phosphate (Man-l-P) and man- and 0.1 mM Man-l-P for 3 h at room temperature. The reaction nose-6-phosphate (Man-6-P). In mammalian cells the enzyme products were separated by TLC at pH 10 as described [8]. The has to be activated by transfer of a phosphate group from a corresponding regions for Man-l-P, Man-6-P, and Glc-6-P were sugar-1.6-P2 (Guha, S.K. and Rose, Z.B. (1985) Arch. Biochem. scraped off, the sugar phosphates eluted, and identified enzymatically. Biophys. 243, 168). In contrast, in the red alga Galdieria The concentration of Glc-6-P was determined by the addition of Glc- sulphuraria the co-substrate (Man-1.6-P2 or GIC-I.6-P2) is 6-P dehydrogenase and NADP. For Man-6-P determination PGI and PMI were included and for Man-l-P purified PMM from G. sulphu- converted to the corresponding sugar monophosphate while the raria was added.
    [Show full text]