Deep Sequencing Driven Protein Engineering: New Methods and Applications in Studying the Constraints of Functional Enzyme Evolution
Total Page:16
File Type:pdf, Size:1020Kb
DEEP SEQUENCING DRIVEN PROTEIN ENGINEERING: NEW METHODS AND APPLICATIONS IN STUDYING THE CONSTRAINTS OF FUNCTIONAL ENZYME EVOLUTION By Emily Elizabeth Wrenbeck A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of Chemical Engineering – Doctor of Philosophy 2017 ABSTRACT DEEP SEQUENCING DRIVEN PROTEIN ENGINEERING: NEW METHODS AND APPLICATIONS IN STUDYING THE CONSTRAINTS OF FUNCTIONAL ENZYME EVOLUTION By Emily Elizabeth Wrenbeck Chemical engineers have long sought enzymes as alternatives to traditional chemocatalytic routes as they are highly selective and have evolved to function under mild conditions (physiological temperature, neutral pH, and atmospheric pressure). Enzymes, the workhorses of biological chemistry, represent a vast catalogue of chemical transformations. This feature lends their use in a variety of industrial applications including food processing, biofuels, engineered biosynthetic pathways, and as biocatalysts for preparing specialty chemicals (e.g. pharmaceutical building blocks). The totality of an enzymatic bioprocess is a function of its catalytic efficiency (specificity and turnover), product profile (i.e. regio- and enantio-selectivity), and thermodynamic and kinetic stability. For native enzymes, these parameters are seldom optimal. Importantly, they can be modified using protein engineering techniques, which generally involves introducing mutation(s) to a protein sequence and screening for beneficial effects. However, robust enzyme engineering and design based on first principles is extremely challenging, as mutations that improve one parameter often yield undesired tradeoffs with one or more other parameters. In this thesis, deep mutational scanning - the testing of all possible single-amino acid substitutions of a protein sequence using high-throughput screens/selections and DNA counting via deep sequencing - was used to address two fundamental constraints on functional enzyme evolution. First, how do enzymes encode substrate specificity? To address this question, deep mutational scanning of an amidase on multiple substrates was performed using growth-based selections. Comparison of the resulting datasets revealed that mutations benefiting function on a given substrate were globally distributed in both protein sequence and structure. Additionally, our massive datasets permitted the most rigorous testing to date of theoretical models of adaptive molecular evolution. These results have implications for both design of biocatalysts and in understanding how natural enzymes function and evolve. Another fundamental constraint of enzyme engineering is that mutations improving stability (folding probability) of an enzyme are often inactivating for catalytic function, and vice versa. Towards overcoming this activity-stability constraint, I sought to improve the heterologous expression and maintain the catalytic function of a Type III polyketide synthase from Atropa belladonna. This was accomplished using deep mutational scanning and high-throughput GFP- fusion stability screening, followed by novel filtering methods to only accept beneficial mutations with high probability for maintaining function. Lastly, deep mutational scanning relies on the construction of user-defined DNA libraries, however current available techniques are limited by accessibility or poor coverage. To address these limitations, I will present the development of Nicking Mutagenesis, a new method for the construction of comprehensive single-site saturation mutagenesis libraries that requires only double-stranded plasmid DNA as input substrate. This method has been validated on several gene targets and plasmids and is currently being used in academic, government, and industry laboratories worldwide. I dedicate this thesis to the family and friends who have inspired, encouraged, and delighted in my pursuit of understanding our world through science. iv ACKNOWLEDGMENTS I want to acknowledge the significant impact my graduate advisor, Tim Whitehead, has had on my life and scientific career. Thank you for training me how to think quantitatively, design clever experiments, and for initiating my journey in a lifelong obsession with proteins. Thank you for your assistance in acquiring various fellowships and national conference experiences. I express my deepest appreciation for your patience, support, and for challenging me to do my best. To my labmates – Caitlin Stein, Justin Klesmith, James Stapleton, Matthew Faber, and Carolyn Haarmeyer – thank you for your mentorship, support, and for sharing laughs. I want to thank the Plant Biotechnology for Health and Sustainability graduate training program for providing financial support and an enriching graduate experience. Lastly, I want to thank my parents for striving to provide me with opportunities to nourish my brain and for their never-ending love and support. My siblings, for shaping who I am. My friends, for thinking of me as cool for being a scientist. And finally, my husband Phil, for always believing me and for your patience, love, and support throughout my graduate studies. v TABLE OF CONTENTS LIST OF TABLES ....................................................................................................................... ix LIST OF FIGURES ..................................................................................................................... xi KEY TO ABBREVIATIONS ................................................................................................... xiii CHAPTER ONE Introduction to deep sequencing driven protein engineering ......................1 ABSTRACT .........................................................................................................................2 INTRODUCTION ...............................................................................................................3 ENGINEERING PROTEIN MOLECULAR RECOGNITION ..........................................4 Deep sequencing for screening protein binder libraries ..........................................5 Paratope optimization for affinity and specificity ...................................................5 Epitope mapping ......................................................................................................8 MEMBRANE PROTEIN ENGINEERING ........................................................................8 ENZYME ENGINEERING .................................................................................................9 High-throughput screening and selection for enzyme function ...............................9 From fitness landscapes to enzyme engineering ....................................................10 METHODOLOGICAL ADVANCES AND CURRENT LIMITATIONS .......................12 Mutagenic library preparation ................................................................................12 DNA read length restrictions .................................................................................13 Sequencing analysis ...............................................................................................14 CONCLUSION ..................................................................................................................15 REFERENCES ..................................................................................................................17 CHAPTER TWO Nicking Mutagenesis: a plasmid-based, one-pot saturation mutagenesis method ...........................................................................................................................................23 ABSTRACT .......................................................................................................................24 INTRODUCTION .............................................................................................................25 RESULTS ..........................................................................................................................26 DISCUSSION ....................................................................................................................30 MATERIALS AND METHODS .......................................................................................31 Reagents .................................................................................................................31 Plasmid construction ..............................................................................................31 Comprehensive nicking mutagenesis optimization ...............................................32 Comprehensive nicking mutagenesis of amiE and bla ..........................................33 Single and multi-site nicking mutagenesis ............................................................35 DNA deep sequencing and analysis .......................................................................36 Statistics .................................................................................................................36 APPENDIX ........................................................................................................................38 REFERENCES ..................................................................................................................51 vi CHAPTER THREE Exploring the sequence-determinants to specificity of an enzyme using deep mutational scanning ............................................................................................................54 ABSTRACT .......................................................................................................................55