Glucdrp) and L-Lyxonate Dehydratase Proteins (Lyxd)
Total Page:16
File Type:pdf, Size:1020Kb
MINING THE ENOLASE SUPERFAMILY FOR NEW FUNCTIONS: INVESTIGATIONS OF D-GLUCARATE DEHYDRATASE RELATED PROTEINS (GLUCDRP) AND L-LYXONATE DEHYDRATASE PROTEINS (LYXD) BY SALEHE GHASEM PUR DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Biochemistry in the Graduate College of the University of Illinois at Urbana-Champaign, 2014 Urbana, Illinois Doctoral Committee: Professor John A. Gerlt, Chair Professor Wilfred A. van der Donk Professor James H. Morrissey Professor Emad Tajkhorshid ABSTRACT Genomic era begins with development of sequencing methods. Genome sequencing is now cost-effective and fast, giving rise to increasing amounts of genomic data. However, the function of 1% of the deposited sequences have been experimentally characterized. There is no robust method of functional assignment for these sequences. Functional assignments are now performed using a variety of software tools to utilize the known biochemical data of characterized proteins to annotate similar sequences in genome databases. With this large scale automatic annotations of genome databases, annotations were transferred from homologs regardless of their reliability which result in propagation of errors and transfer misleading information to scientific community. Nevertheless, annotating the homologs within a superfamily is a valid approach. To this end, the enolase superfamily is an excellent model system for functional assignment. Structurally, this superfamily contain substrate specificity residues in the N-terminal capping domain and catalytic residues in the C-terminal barrel domain. These proteins with the common structural fold have the ability to abstract a proton α to carboxylate on the substrate before proceeding to dehydration, epimerization, deamination, racemization or cycloisomerization. There have been enough studies of this superfamily to provide valuable insight into the types of reactions performed based on catalytic residues and substrate specificity residues, establishing basis for functional characterization of unknown members. In this thesis, I discuss my efforts on characterizing an enolase superfamily member: D- glucarate dehydratase related protein (GlucDRP). GlucDRP share more than 60% sequence identity with GlucD, a well-characterized protein. I showed that there is a protein-protein interaction between GlucD and GlucDRP which form a detectable heterospecies. The kinetics of heterospecies, isolated from wild type E. coli, was equal to an average of the activity of a GlucD ii and a GlucDRP. Additionally, to determine what percent of GlucDRPs involves in the formation of the heterospecies, a hexahistidine-tag was introduces upstream of the GlucDRP gene in the E. coli chromosome and it was shown that more than half of expressed GlucDRP forms heterospecies. Finally, the three-dimensional crystal structure of the heterospecies was determined which confirmed the formation of α2/β2 tetramers of GlucD and GlucDRP. At a BLAST e-value of 10-175, three clusters of GlucDRPs segregated from the authentic GlucD. Structural alignments of GlucD and GlucDRP proteins showed good superposition between these structures except in 100s loop, which interacts with the substrate and closes the active site upon entrance of the substrates. This structural differences may be responsible for differences in catalytic activity. I investigated the protein-protein interaction between GlucD and GlucDRP from the three GlucDRP clusters in three organisms: Burkholderia cepacia, Actinobacillus succinogenes 130Z, and Ralstonia pickettii 12j and the formation of heterospecies in these proteins were confirmed. Additionally, I showed that GlucDRP promotes faster growth on glucarate media. D-glucarate is a component of urine, and as an occasional colonizer of the bladder and urinary tract, E. coli might have evolved mechanisms to rapidly catabolize a less favored source of energy like D-glucarate and outcompete other resident microorganisms. In this thesis, I assigned a novel function, L-lyxonate dehydratase activity, to a mandelate racemase (MR) subgroup of the enolase superfamily. In vitro and in vivo data were combined here to show that the dehydration of L-lyxonate is the biological role of the enzymes in this protein family. Through in vitro experiment, catalytic efficiency of ~ 104 M-1s-1 was measured for L- lyxonate dehydratase protein. In vivo growth studies revealed that L-lyxonate is a carbon source for Pseudomonas aeruginosa and transcriptome analysis showed that the L-lyxonate dehydratase iii gene along with some neighboring genes were expressed in L-lyxonate grown cells. The neighboring genes were cloned and purified to be tested on the L-lyxonate degradation intermediates and it was shown that upon dehydration of L-lyxonate and formation of 2-keto-3- deoxy-L-lyxonate, a second dehydratase act to convert the product to yield α-ketoglutarate semialdehyde. In the final step a dehydrogenase oxidizes α-ketoglutarate semialdehyde to α- ketoglutarate, an intermediate in the citric acid cycle. Mutational studies revealed that L-lyxonate dehydratase proteins possess a catalytic His-Asp dyad at the end of the seventh and six β-strands of the (β/α)7β-barrel domain and a KxR motif at the end of second β-strand. This is the first example of an L-lyxonate dehydratase in the enolase superfamily and the first example of a pathway for degradation of L-lyxonate. iv For my dear dad and mom For my dear sisters, Atefeh and Vida And For my darling daughter and husband v ACKNOWLEDGEMENTS I would like to express the deepest appreciation to my thesis advisor, Dr. John A. Gerlt, whom without this research would not have been possible. He conveyed a spirit of adventure in regard to research and continually helped me to continue to explore. I also would like to thank my committee members for their time and support throughout my graduate school career; they inspired me to be a better scientist. In particular, I would like to thank Dr. Emad Tajkhorshid whose enthusiasm for science and support have had a lasting effect on my life. I would also like to thank Dr. John Cronan for providing me an opportunity to consult with him and learn from him. I consider it an honor to have worked with all of the wonderful colleagues during my time at UIUC: Dr. Fatemah Hermes, Dr. Steve Lin, Dr. Tobias J. Erb, Dr. John F. Rakus, Dr. Fiona Poe, Dr. B. McKay Wood, Dr. Benjamin P. Warlick, Dr. Xinshaui Zhang, Dr. Bijoy Desai, Dr. Dan Wichelecki, Dr. Brain San Francisco, Dr. Ritesh Kumar, Dr. Jose Solbiati, and Jason Bouvier. I would like to extend a special thank you to Dr. Tiit Lukk who trained me when I first joined the lab and for being a wonderful person. I also enjoyed my daily lunch with all of my IGB friends and colleagues including Joel Cioni, Courtney Cox, Michelle Goettege, Joel Melby, and Katie Molohon. I would like to thank my close friends Sara Bahramian, Ehsan Shafiei, Saleh Yousefi, Mohammad Naghnaian, and Negin Sattari whose friendships made my life more beautiful and for the fun times we had outside of lab. I would like to thank my family especially my parents who were my first teachers. I also would like to thank my grandparents who have been my heroes throughout my life. I also thank my very kind sisters, Atefeh and Vida for their everlasting love and joy. Finally thank you to my husband Seyed Fakhreddin Torabi for his unending love and support and my darling daughter who I am waiting to come and delight our worlds with her charm and love. vi Table of Contents LIST OF FIGURES…………………………………………………………………….xii LIST OF TABLES……………………………………………………………………..xiv CHAPTER 1. INTRODUCTION .................................................................................... 1 1.1 Advances in sequencing technology in genomic era ............................................ 1 Post-genomic challenges .................................................................................. 2 Genome annotations in databases .................................................................... 3 Studying superfamilies..................................................................................... 4 Enolase superfamily as a model system........................................................... 5 1.2 Structural and mechanistic features of enolase superfamily ................................. 7 1.3 Subgroups within the enolase superfamily.......................................................... 10 The enolase subgroup .................................................................................... 10 The β-methylaspartate ammonia lyase subgroup ........................................... 12 The muconate lactonizing enzyme subgroup ................................................. 12 The mandelate racemase subgroup ................................................................ 13 The D-glucarate dehydratase subgroup ......................................................... 13 The D-mannonate dehydratase subgroup ....................................................... 13 Galactarate dehydratase-II subgroup ............................................................. 14 1.4 Assigning function to unknown enolase superfamily members.......................... 14 1.5 Conclusion ........................................................................................................... 16 1.6 References ........................................................................................................... 17 CHAPTER 2. INVESTIGATION OF E. COLI GLUCARATE