BIOCATALYSIS AND BIOPROCESS ENGINEERING FOR TERPENOID
PRODUCTION
by
Sonal Ayakar
B. Pharm., Mumbai University, 2012
M. Tech., Institute of Chemical Technology, 2014
A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF
THE REQUIREMENTS FOR THE DEGREE OF
DOCTOR OF PHILOSOPHY
in
THE FACULTY OF GRADUATE AND POSTDOCTORAL STUDIES
(Chemical and Biological Engineering)
THE UNIVERSITY OF BRITISH COLUMBIA
(Vancouver)
May 2019
© Sonal Ayakar, 2019 The following individuals certify that they have read, and recommend to the Faculty of Graduate and Postdoctoral Studies for acceptance, the dissertation entitled:
Biocatalysis and bioprocess engineering for terpenoid production
submitted by Sonal Ayakar in partial fulfillment of the requirements for
the degree of Doctor of Philosophy
in Chemical and Biological Engineering
Examining Committee:
Vikramaditya Yadav, Chemical and Biological Engineering Supervisor
Charles Haynes, Chemical and Biological Engineering Supervisory Committee Member
Joerg Bohlmann, Botony Supervisory Committee Member
Fariborz Taghipour, Chemical and Biological Engineering University Examiner
Nobuhiko Tokuriki, Biochemistry and Molecular Biology University Examiner
ii
Abstract
The biomanufacturing of terpenoids is limited by the low yields of heterologously expressed biosynthetic pathway and challenges associated with recovering these products at commercial scales. To enhance the flux through methylerythritol phosphate (MEP) pathway for terpenoid biosynthesis, I screened soil metagenomes for more active and stable orthologs of the rate-limiting enzymes. I successfully identified three entirely novel, natural fusions of IspD and
IspF, one of which improved production of lycopene from 235 mg/L to 275 mg/L and production of isoprene from 3.6 mg/L to 6.3 mg/L when compared to the native enzyme overexpression. A comprehensive study of the role of the linking domain revealed the higher activity of each of the catalytic domains and the absence of substrate channeling. Moreover, the non-natural fusions of
E. coli enzymes catalyzing consecutive steps were constructed. One such a fusion of IspD and
IspE yielded 281 mg/L of lycopene, whereas the best performing fusion of IspE and IspF only yielded 39 mg/L of lycopene. Further investigation of the sequence of this biocatalytic cascade concluded the commencement with the activity of IspE, followed IspD and IspF suggesting the reactive plasticity in MEP pathway.
I probed the promiscuous nature of terpene synthases (TSs) through the systematic study of monoterpene synthases from Picea abies in vivo and in vitro . I uncovered the influence of intracellular expression and oxygen supply on the promiscuity of TSs. Computational analysis revealed the putative roles of the amino acid residues within the active sites and their evolutionary trajectory. Finally, the fermentation of engineered E. coli strains for carene and myrcene were scaled up to 1 L and a newer technique was developed for efficient product capture using a fluidized bed capture device (FBCD) using a hydrophobic resin. The device was easy to integrate iii
into the existing bioreactor set up. It yielded 2-fold higher carene titers and 17-fold higher myrcene titers.
In conclusion, the three aspects of the terpenoid biomanufacture studied in this work address some of the biggest challenges facing the industry and lay strong foundations for commercialization of terpenoid biomanufacturing processes that employ genetically engineered microorganisms.
iv
Lay Summary
Terpenoids are the plant natural products having applications as pharmaceutical agents, flavors, fragrances, surfactants, fuel additives, polymer additives etc. There are various challenges in efficient production and purification of these terpenoids from plants. In this work, E. coli are genetically engineered to perform the terpenoid biosynthesis route identified from Norway spruce.
Efforts of metabolic engineering in this area led to discoveries of novel biochemical pathways that operate at higher activity. Moreover, the fermentative production of the engineered E. coli provided insights into the evolution and activity of the enzymes from the plant to guide higher terpenoid production rates. An efficient method was developed for recovery of the terpenoid products from the fermentative process. This work will aid discovery of newer plant-based terpenoids that have wider applications as well as facilitate the development of their manufacturing in genetically engineered microbes.
v
Preface
This Ph.D. dissertation consists of seven chapters. The Ph.D. study was conducted by Sonal
Ayakar under the direct supervision of Dr. Vikramaditya G. Yadav in the Department of Chemical and Biological Engineering at UBC. The literature review, experimental planning was done by
Sonal Ayakar with direct inputs from Vikramaditya Yadav.
Chapter 2
The metagenomic database was generated by Steven Hallam Lab members, UBC and the screen was conducted with Sandip Pawar. Construction of pSASDFI was conducted by Sonal
Ayakar and Protiva Roy. Other natural fusion constructs were completed by Sonal Ayakar.
Construction of non-natural fusion genes was done by Carmen Bayly. Expression of the enzyme chimera was done by Sonal Ayakar. Analysis, quantification of terpenoids and interpretation of the data for natural fusions was performed by Sonal Ayakar. The natural fusion discovery and application were presented as:
S. R. Ayakar, S. V. Pawar. S. J. Hallam and V. G. Yadav, “Multiplying productivity using
metagenomics: beyond canonical metabolic engineering”, Synthetic biology for natural
products conference , Cancun, Mexico (2017).
The patent application has been filed to protect the natural fusions.
S. R. Ayakar and V. G. Yadav et al., Metabolic Engineering of E. Coli for the
Biosynthesis of Cannabinoid Products, USPTO PCT/CA2018/051073, Sept. 2018
(Technology transfer to Inmed Pharmaceticals, Canada)
vi
Homology models, analysis and quantification of terpenoids for non-natural fusion was conducted by Sonal Ayakar and Adhithi Raghavan. The data interpretation was done by Sonal
Ayakar. The patent application for non-natural fusions has been filed as:
S. R. Ayakar and V. G. Yadav et al., Compositions and methods for biosynthesis of
cannabinoids products in a prokaryote, International PCT Appl. No. PCT/CA2018/
051074, March 2019 (Technology transferred to Inmed Pharmaceuticals, Canada)
The entire chapter was written by Sonal Ayakar.
Chapter 3
The experimental design, experimental work and data interpretation were done by Sonal
Ayakar. The LC-MS analysis was done by Lina Madilao, UBC. The docked substrates were generated by Vikramaditya Yadav. The entire chapter was written by Sonal Ayakar. This work was presented at:
S. R. Ayakar and V. G. Yadav, “Metagenomics yields new insights about metabolic flux
through the non-mevalonate pathway”, DECHEMA Bioflavours , Frankfurt, Germany
(2018).
Chapter 4
The experimental design, experimental work and data interpretation were done by Sonal
Ayakar. The monoterpene gene sequences and monoterpene standards were procured from Joerg
Bohlmann. The docked substrates were generated by Vikramaditya Yadav. The computational modelling and figures were generated by Sonal Ayakar. The entire chapter was written by Sonal
Ayakar. vii
Chapter 5
The research idea was conceived by Sonal Ayakar. The initial planning of experiments for terpenoid capture were conducted by Sonal Ayakar and Azin Amiri. The fluidized bed capture device printing would not have been possible without Logan Phillips. The bioreactor run and optimization experiments were done by Azin Amiri. The data interpretation and product spectra analysis were done by Sonal Ayakar. The entire chapter was written by Sonal Ayakar.
This work has now become Azin Amiri’s PhD project.
Chapter 6
Following work was equally contributed by the co-authors. The application of lycopene producing strain developed and mentioned in chapter 2 was used in construction of biogenic photovoltaic cell. This work was published as:
S. K. Srivastava, P. Piwek, S. R. Ayakar, A. Bonakdarpour, D. Wilkinson, V. G. Yadav,
"A biogenic photovoltaic material", Small, 14, 1800729 (2018).
A metric was developed to gauge efficiency of biomass valorization and is accepted for publication as:
S.C. Patankar, S.R. Ayakar, V.G. Yadav, and S. Renneckar, “The V-factor: A new metric
for gauging the efficiency & profitability of manufacturing processes in the bio-
economy”, Bioproducts Business , 2018.bpb.047, accepted manuscript (2019).
viii
A chemical catalytic process was developed for lignin vaporization with highest reported yield for vanillin:
S.C. Patankar, L. Liu, L. Ji, S.R. Ayakar, V.G. Yadav, and S. Renneckar, Isolation of
phenolic monomers from kraft lignin using a magnetically recyclable TEMPO
nanocatalyst. Green Chem. 21, 785–791 (2019).
The study on characterization of microbial degradation of nanofibrillated cellulose.
Manuscript on this work is under preparation.
ix
Table of Contents
Abstract ...... iii
Lay Summary ...... v
Preface ...... vi
Table of Contents ...... x
List of Tables ...... xviii
List of Figures ...... xx
List of Equations ...... xxviii
List of Abbreviations ...... xxix
Glossary ...... xxxii
Acknowledgments ...... xxxiii
Dedication ...... xxxv
Chapter 1: Introduction ...... 1
1.1 Terpenoid chemical space ...... 1
1.2 Applications of terpenoids ...... 2
1.3 Isoprenoid synthesis ...... 3
Chemical synthesis ...... 3
Semi-synthesis ...... 3
Biosynthesis in heterologous hosts ...... 4
1.4 Biosynthetic pathways of isoprenoid production ...... 4
C5 precursor synthesis ...... 4
Chain elongation ...... 6 x
Pyrophosphate group removal ...... 7
Scaffold activation/modification ...... 7
1.5 MEP pathway ...... 8
Dxs (1-deoxy-d-xylulose 5-phosphate synthase) ...... 8
IspC (Dxr, 1-deoxy-d-xylulose 5-phosphate reductoisomerase) ...... 9
IspD (YgbP, 4-diphosphocytidyl-2-C-methylerythritol synthase) ...... 9
IspE (YchB, 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase) ...... 10
IspF (YgbB, 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase) ...... 11
IspG (GcpE, 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase) ...... 12
IspH (LytB, 4-hydroxy-3-methylbut-2-en-1-yl diphosphate reductase) ...... 12
Idi (Isopentenyl diphosphate delta-isomerase) ...... 13
1.6 Properties of isoprenoids...... 15
Chemical properties ...... 15
Physical properties ...... 16
1.7 Bioprocess development for isoprene production ...... 16
1.8 Challenges in terpenoid production ...... 17
1.9 Research questions ...... 19
1.10 Organization of thesis ...... 21
Chapter 2: Study of enzyme fusions to improve MEP pathway yield ...... 22
2.1 Introduction ...... 22
Improvements in the precursor supply for the MEP pathway ...... 22
MEP pathway optimization...... 23
2.1.2.1 Homologous expression of MEP pathway enzymes ...... 23 xi
2.1.2.2 Heterologous expression of MEP pathway enzymes ...... 23
2.1.2.3 Improvement in genomic MEP control ...... 23
2.1.2.4 Evolution of MEP pathway genes ...... 24
Effect of culture conditions on the solubility of pathway enzymes ...... 24
Bifunctional enzyme discovery and applications ...... 24
2.1.4.1 IspDF discovery and structure ...... 24
2.1.4.2 Interactions of IspD, IspE and IspF ...... 25
Naturally occurring enzyme fusions catalyzing non-consecutive pathway steps ..... 27
Knowledge gap in the understanding of the MEP pathway ...... 27
Construction of non-natural fusions ...... 28
2.1.7.1 Demonstration of non-native function by fusions ...... 29
2.1.7.2 Non-natural fusions of MEP pathway enzymes ...... 29
2.2 Materials and methods ...... 30
Metagenomic screening for orthologs of MEP pathway enzymes ...... 30
Strains, genes and plasmids ...... 31
Non-natural protein fusions ...... 35
Construction of plasmids and strains for analysis of non-natural fusions ...... 36
Culture conditions ...... 38
Isoprene analysis ...... 38
Lycopene analysis ...... 38
2.3 Results ...... 39
Discovery of three IspDF fusions ...... 39
Construction of MEP pathway operon ...... 44 xii
Influence of novel fusion enzymes on the MEP pathway ...... 46
Role of IspE ...... 48
Effect of linkers on IspDF 1 activity ...... 49
Effect of linkers on non-natural fusions of monofunctional IspD and IspF ...... 51
Effect of XL on non-natural fusions of monofunctional E. coli IspD and IspF ...... 52
Expression of domains of IspDF 1 as IspD 1 and IspF 1 ...... 52
Non-natural fusions of IspE and their effects on MEP pathway flux ...... 53
Categorical inferences ...... 54
2.4 Discussions ...... 55
Influence of native Dxs, IspD, IspF and Idi on MEP pathway flux ...... 55
IspDF fusion activity and the influence of IspE on MEP pathway flux ...... 56
Study on linker types ...... 57
Influence of IspDF 1 domain separation on MEP pathway flux ...... 58
Improvement in the flux on IspDE fusion ...... 59
Chapter 3: Study of MEP pathway biochemistry ...... 61
3.1 Introduction ...... 61
Kinetics of MEP pathway cascade from MEP to MEcPP ...... 61
Canonical MEP pathway and plausible bifurcation step ...... 62
3.2 Materials and methods ...... 64
Strains, plasmid and genes ...... 64
Media and growth conditions...... 65
Protein extraction and IspE purification ...... 66
IspE in vitro reactions ...... 66 xiii
Bioluminescent assay for ATP consumption ...... 67
HPLC analysis of ATP, ADP and AMP ...... 67
LC-MS analysis of MEP and ME-2,4-PP ...... 67
Computational analysis ...... 68
3.3 Results ...... 68
Protein expression and quantification ...... 68
Analysis of IspE reaction by ATP bioluminescent assay ...... 69
HPLC analysis of IspE reactions ...... 71
LC-MS analyses of IspE reaction mixture ...... 74
Computational analysis ...... 76
3.4 Discussions ...... 78
Chapter 4: Study of catalytic promiscuity of terpene synthases ...... 79
4.1 Introduction ...... 79
Catalytic promiscuity of TSs ...... 80
Factors responsible for TS promiscuity ...... 81
Study of TS activity and promiscuity ...... 82
Research variables and scope ...... 85
4.2 Materials and methods ...... 86
Strains and plasmid construction ...... 86
Media and growth conditions...... 87
In vitro reactions ...... 88
Metabolite analyses ...... 89
Computational analysis ...... 89 xiv
4.3 Results ...... 91
Plasmid construction and protein expression ...... 91
Study of parameters on SACar fermentations ...... 93
Study of strain and fermentation parameters on monoterpene production ...... 96
Aerobic and micro-aerobic fermentations ...... 104
Metabolite analyses ...... 107
Study of terpene synthase catalytic mechanisms ...... 109
Cyclization of primary geranyl carbocation (cisoid) to the α-terpinyl carbocation 112
Formation of carene from the α-terpinyl carbocation ...... 115
Pinyl cation formation by 2,7-closure of the α-terpinyl carbocation ...... 118
Acyclic terpenoid mechanisms from cisoid primary geranyl carbocation ...... 119
4.4 Discussions ...... 121
Optimization of fermentation conditions ...... 121
Optimization of shake flask fermentations ...... 121
Product profile characterization ...... 124
Computational analyses ...... 126
Comparison of product profile with computational analysis ...... 129
Chapter 5: Designing an efficient bioprocess for terpenoid recovery ...... 130
5.1 Introduction ...... 130
Challenges in terpenoid bioprocess development ...... 130
Continuous ex situ recovery (CESR) ...... 131
Drawbacks of state of the art ...... 131
The objective of the study ...... 132 xv
5.2 Materials and methods ...... 132
Strains and fermentation conditions ...... 132
Bioreactor set up ...... 133
CESR for terpenoid ...... 133
Resins used for terpenoid capture ...... 134
Screening resin for carene capture ...... 136
5.3 Results and discussions ...... 137
Adsorption based carene recovery ...... 137
Bioreactor parameter optimization ...... 138
Terpenoid product analyses ...... 139
Terpenoid recovery comparison ...... 140
Chapter 6: Other scholarly contributions ...... 142
6.1 A Biogenic Photovoltaic Material 166 ...... 142
6.2 Isolation of phenolic monomers from kraft lignin using magnetically recyclable TEMPO
nanocatalyst 173 ...... 145
6.3 The V-factor: Towards a new metric for gauging the efficiency & profitability of
manufacturing processes for the bioeconomy ...... 149
6.4 Microbial growth and its characterization on nanocellulose synthesized from cherry
veneer 153
Chapter 7: Conclusions ...... 158
7.1 Improvement in the intracellular pool of C5 precursors ...... 159
7.2 Understanding the biocatalytic sequence of steps in the MEP pathway ...... 161
7.3 Study of terpene synthase biocatalysis ...... 162 xvi
7.4 Bioprocess development for terpenoid recovery ...... 163
7.5 Future work ...... 165
Improvement in MEP pathway flux (Chapter 2) ...... 165
MEP pathway biochemistry (Chapter 3) ...... 166
Terpene synthase study (Chapter 4) ...... 166
Terpenoid recovery from the bioprocess (Chapter 5) ...... 167
Bibliography ...... 168
Appendices ...... 179
Appendix A Fusion protein sequences ...... 179
A.1 IspDF 1 ...... 179
A.2 IspDF 2 ...... 179
A.3 IspDF 3 ...... 180
Appendix B HPLC chromatograms for ATP, ADP and AMP analysis ...... 181
Appendix C GC-MS analysis for terpenes ...... 182
xvii
List of Tables
Table 1.1 Theoretical yield of MEP and MVA pathway 27 ...... 6
Table 1.2 Kinetic parameters of MEP pathway reactions ...... 14
Table 1.3 Classes of terpenoids ...... 15
Table 2.1 Strains, genes and plasmids used for MEP pathway study ...... 32
Table 2.2 Types of linkers used in the study and their sequences ...... 35
Table 2.3 List of non-natural protein fusions ...... 36
Table 2.4 Strains and plasmid expressing non-natural fusion proteins ...... 36
Table 2.5 Protein alignment analysis of the bifunctional enzymes against E. coli IspD-IspF and cjIspDF using the online BLASTN search tool ...... 40
Table 2.6 Protein alignment analysis of each domain of the bifunctional enzymes against corresponding E. coli monofunctional enzymes using the online BLASTN search tool ...... 42
Table 2.7 Protein alignment analysis of each domain of the bifunctional enzymes against corresponding cjIspDF enzyme domains using the online BLASTN search tool ...... 43
Table 3.1 Enzyme amount and kinetics for MEP pathway steps (data extracted from 118 ) ...... 62
Table 3.2 Strains, plasmids and genes ...... 65
Table 3.3 Reaction conditions for different experimental sets ...... 69
Table 4.1 Strains, plasmids and genes ...... 86
Table 4.2 Summary of fermentation conditions selected ...... 104
Table 4.3 Monoterpene product distribution and percent total ...... 108
Table 4.4 Physical properties of monoterpenes ...... 123
Table 5.1 Resins employed in the study and their properties ...... 135 xviii
Table 5.2 Screening resin for carene recovery ...... 137
Table 5.3 Product distribution comparison of shake flask and bioreactor scale fermentations .. 140
Table 6.1 Comparison of vanillin yield derived through oxidative depolymerization of kraft lignin using Fe@MagTEMPO catalyst with values in literature ...... 147
Table 6.2 Predicted global averages of fractional dollar outputs of selected sectors ...... 152
xix
List of Figures
Figure 1-1 Suite of isoprenoid products ...... 2
Figure 1-2 MEP pathway ...... 9
Figure 1-3 Challenges in commercialization of terpenoid bioprocess ...... 18
Figure 2-1 Expression of the novel IspDFs as (His) 6 tagged protein expressed in E. coli BL21(DE3) and their analyses on SDS/PAGE. Lanes 1 and 5: total and purified IspDF 1 extract respectively, lanes 2 and 6: total and purified IspDF 2 extract respectively, lanes 4 and 7: total and purified
IspDF 3 extract respectively, lanes 2 and 8: protein ladder. The bands corresponding with the specific protein are highlighted with red box...... 40
Figure 2-2 Protein sequence alignment generated from the Centre for Genomic Regulation (CRG)
M-Coffee tool. IspD-IspF is the sequences for E. coli IspD and IspF...... 41
Figure 2-3 SDS/PAGE image of the soluble protein fraction of pSASDFI. Lane 1: E. coli
BL21(DE3), lane 2: protein ladder, lane 3 and 4: SASDFI. The bands corresponding to protein
are: Dxs (band a, 68.2 kDa), IspD (band b, 25.7 kDa), IspF (band d, 16.9 kDa) and Idi (band c,
21.2 kDa)...... 44
Figure 2-4 Influence of rate-limiting steps on MEP pathway flux. (a) Lycopene production, (b)
Isoprene production. The IPTG concentrations used for induction are denoted in the legends.
Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 45
Figure 2-5 Influence of novel IspDF fusions on MEP pathway flux. (a) Lycopene production, (b)
Isoprene production. The IPTG concentrations used for induction are denoted in the legends.
Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 47
xx
Figure 2-6 Homology models for the fusion proteins generated by SWISS-MODEL tool. (a)
66 cjIspDF , (b) IspDF 1, (c) IspDF 2 and (d) IspDF 3. The IspD domain is in pink, the IspF domain is in blue and linker is in green. The N-terminal residue is colored black and C-terminal residue is colored orange...... 48
Figure 2-7 Effect of IspE overexpression on lycopene production. The IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 49
Figure 2-8 Linkers for IspDF 1 and their effect on MEP pathway flux. (a) Strains overexpressing
Dxs, IspDF chimeras and Idi, (b) strains overexpressing Dxs, IspDF chimeras, IspE and Idi. The
IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 50
Figure 2-9 Linkers for non-natural fusions of E. coli IspD and IspF; and their effect on MEP pathway flux. (a) Strains overexpressing Dxs, IspDF chimeras and Idi, (b) strains overexpressing
Dxs, IspDF chimeras, IspE and Idi. The IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 51
Figure 2-10 Linkers for non-natural fusions of E. coli IspD and IspF on MEP pathway flux. The
IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 52
Figure 2-11 Effect of domain separation of IspDF 1 on MEP pathway flux. The IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 53
xxi
Figure 2-12 Non-natural fusions of IspE and their effect on MEP pathway flux. The IPTG concentrations used for induction are denoted in the legends. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer...... 54
Figure 2-13 Categorical comparison of lycopene production for the various linker...... 55
Figure 3-1 Schematic for the flow of metabolites through the MEP pathway for IspDF, IspE. (a)
Canonical MEP pathway, (b)MEP pathway bifurcation. A: MEP, B: CDP-ME, C:CDP-MEP,
D:MEcPP, B’: ME-2,4-PP ...... 63
Figure 3-2 Bifurcation in the MEP pathway with the promiscuity of IspE. The pathway highlighted with blue arrow is the canonical MEP pathway and the step highlighted by yellow arrow marks the bifurcation step from MEP...... 64
Figure 3-3 IspE expression and analysis on SDS/PAGE. Lane 1: Total soluble protein fraction of
E. coli BL21(DE3), lane 2: total soluble protein fraction of SAHisIspE induced with 0.5 mM IPTG, lane 3: protein ladder, lanes 4 and 5: purified IspE. The bands corresponding to IspE are highlighted with red box...... 69
Figure 3-4 Bioluminescent assay monitoring ATP and ADP in the IspE reactions...... 70
Figure 3-5 HPLC analysis of set 1 reactions to study ATP consumption...... 71
Figure 3-6 HPLC analysis of set 2 reactions with higher IspE concentration to study ATP consumption...... 72
Figure 3-7 HPLC analysis of set 3 reactions when MEP is rate limiting reactant to study ATP consumption...... 73
Figure 3-8 HPLC analysis of set 4 reactions when ADP is used as a phosphate donor...... 74
Figure 3-9 Mass spectra of the peak identified as ME-2,4-PP in IspE reactions...... 75
Figure 3-10 The major mass fragments of ME-3,4-PP...... 75 xxii
Figure 3-11 Canonical substrate (MEP) and alternate substrate (ME-2,4-PP) docked on IspD. .. 76
Figure 3-12 Substrate alignments in the active site for canonical MEP pathway and bifurcation pathway. Top left image: MEP and CTP in IspE, top right image: ME-2,4-PP and CTP in IspE, bottom left image: CDP-ME and ATP in IspE, bottom right image: MEP and ATP in IspE...... 77
Figure 3-13 (a) Canonical substrate (CDP-ME) and (b)alternate substrate (MEPP) docked on IspE.
...... 78
Figure 4-1 Enzyme evolution trajectory to achieve the newer function (reproduced 133 ) ...... 81
Figure 4-2 Protein expression and analysis on SDS/PAGE. (a) Gel image for GPPS expression, lane 1: total protein extract of E. coli BL21(DE3), lane 2: protein ladder, lane 3: total protein extract of engineered E. coli for GPPS uninduced, lane 4: total protein extract of engineered E. coli for GPPS with band corresponding GPPS (32.4 kDa) in the red box; (b) Gel image for (His) 6-
MTS (induction with 0.5 mM IPTG) purified by Ni-NTA column, lane 1: myrcene synthase induced, lane 2: carene synthase induced, lane 3: myrcene synthase uninduced, lane 4: carene synthase uninduced, lane 5: protein ladder, lane 6: limonene synthase uninduced, lane 7: linalool synthase uninduced, lane 8: limonene synthase induced, lane 9: linalool synthase induced. The bands corresponding with the specific protein are highlighted with red box. The bands corresponding with the specific protein are highlighted with red box...... 91
Figure 4-3 Protein expression of (His) 6-GPPS and analysis on SDS/PAGE. Lane 1: protein ladder, lane 2: purified uninduced GPPS (5 ug), lane 3: purified uninduced GPPS (10 ug), lane 4: purified induced (with 0.5 mM IPTG) GPPS (5 ug), lane 5: purified induced (with 0.5 mM IPTG) GPPS
(10 ug) ...... 92
Figure 4-4 Plasmid maps for MTS expression system. (a) Myrcene synthase, (b) Linalool synthase,
(c) Carene synthase, (d) Limonene synthase ...... 93 xxiii
Figure 4-5 Images of cultures (a) aerobic tube culture, (b) microaerobic tube culture, (c) aerobic flask culture, (d) microaerobic flask culture...... 94
Figure 4-6 Effect of culturing conditions on SACar. TA: aerobic tube culture, TM: microaerobic tube culture, FA: aerobic flask culture, FM: micro-aerobic flask culture. Primary Y-axis is carene titer and secondary Y-axis is normalized carene titer. The other culture conditions were: 30 °C temperature, LBY media (supplemented with the antibiotic and 2 mM MgCl 2), 18 h incubation time and 10 % dodecane overlay...... 95
Figure 4-7 Effect of incubation temperature on carene production. Primary Y-axis is carene titer and secondary Y-axis is normalized carene titer. The other culture conditions were: micro-aerobic flask cultures, LBY media (supplemented with the antibiotic and 2 mM MgCl 2), 18 h incubation time and 10 % dodecane overlay...... 96
Figure 4-8 Effect of growth media on terpene production. (a) SACar, (b) SAMyr, (c) SALin, (d)
SALim+. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer. The fermentation conditions were: micro-aerobic flask cultures, 2 mM MgCl 2, 18 h incubation time and 10 % dodecane overlay...... 98
+2 Figure 4-9 Effect of Mg concentration on terpene production. (a) SACar in LBY media, (b)
SAMyr in LBY media, (c) SALin in TB media, (d) SALim+ in LBY media. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer. The fermentation conditions were: micro-aerobic flask cultures, 18 h incubation time and 10 % dodecane overlay...... 99
Figure 4-10 Effect of dodecane overlay on terpene production. (a) SACar in LBY media, (b)
SAMyr in LBY media, (c) SALin in TB media, (d) SALim+ in LBY media. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer. The fermentation conditions were: micro-aerobic flask cultures, 2 mM MgCl 2 and 18 h incubation time...... 101 xxiv
Figure 4-11 Effect of IPTG induction on terpene production. (a) SACar, (b) SAMyr and (d)
SALim+ in LBY media with 10 % dodecane overlay; (c) SALin in TB media with 30 % dodecane overlay. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer. The fermentation conditions were: micro-aerobic flask cultures, 2 mM MgCl 2 and 18 h incubation time.
...... 103
Figure 4-12 Fermentative production of monoterpenes over time. (a) SACar, (b) SAMyr, (c)
SALin, (d) SALim+. Primary Y-axis is terpene titer and secondary Y-axis is normalized terpene titer. Fermentation conditions are mentioned in Table 4.2...... 106
Figure 4-13 Isomeric structures of carbocations. (a) Cisoid geranyl carbocation, (b) transoid geranyl carbocation, (c) α-terpinyl carbocation and (d) β-terpinyl carbocation...... 110
Figure 4-14 Reaction mechanisms of monoterpene synthases. Monoterpene structures’ color is based on its penultimate carbocation origin...... 111
Figure 4-15 Changes in the orientation of primary geranyl cation on 6,1-closure to form the α- terpinyl cation. (a) Orientation of primary geranyl cation, (b) orientation of α-terpinyl cation. The
TSSs are highlighted in green...... 113
Figure 4-16 Role of amino acid residues in TS active site towards cisoid primary geranyl cation reactivity. The TS is identified in the left top corner of each box. The cisoid primary geranyl cation is highlighted in green, the favorable interactions for 1,6-closure are highlighted in green dotted lines and unfavorable interactions are highlighted in purple dotted lines with distance measurements...... 114
Figure 4-17 TS active site contour towards formations of carene from the α-terpinyl cation. The
TS is identified in the left top corner of each box. The cisoid primary geranyl cation is highlighted in purple and the α-terpinyl cation is highlighted in green.The α-terpinyl cation interactions are xxv
highlighted in purple dotted lines and carene interactions are highlighted in green dotted lines with distance measurements...... 117
Figure 4-18 TS active site contour towards formations of pinyl cation from the α-terpinyl cation.
The TS is identified in the left top corner of each box. The α-terpinyl cation is highlighted in purple
and the pinyl cation is highlighted in green. The α-terpinyl cation interactions are highlighted in purple dotted lines with distance measurements...... 118
Figure 4-19 TS active site contour towards formations of tertiary geranyl cation from cisoid primary geranyl cation. The TS is identified in the left top corner of each box. The cisoid primary geranyl cation is highlighted in purple and the tertiary geranyl cation is highlighted in green.The primary geranyl cation interactions are highlighted in purple dotted lines and tertiary geranyl cation interactions are highlighted in green dotted lines with distance measurements...... 120
Figure 4-20 TS active site contour and participating amino acid residues. The TS is identified in
the left top corner of each box. The surface is highlighted in grey...... 128
Figure 4-21 Predictions for roles of the amino acid residues present in TS active site and structure-
function correlations ...... 129
Figure 5-1 Schematic for construction of fluidized bed capture device (FBCD) ...... 133
Figure 5-2 Schematic of CESR setup ...... 134
Figure 5-3 Bioreactor parameter optimization for efficient carene production and recovery .... 138
Figure 5-4 Experimental apparatus ...... 139
Figure 6-1 Sequential representation of whole cell bio-PV materials. (a) molecular cloning of E.
coli for expression of lycopene, (b) non-covalent surface binding of TiO2 nanoparticles resulting
in core@shell-like morphology and (c) deployment of biogenic PV material towards DSSC
fabrication...... 143 xxvi
Figure 6-2 Electrochemical measurements of bio-PV DSSC. (a) I-V curves, (b) open-circuit photovoltage response (time dependent) and (c) cyclic voltammetry curves ...... 144
Figure 6-3 Oxidative depolymerization of kraft lignin using the Fe@MagTEMPO catalyst .... 146
Figure 6-4 Comparison between E-factor and V-factor ...... 149
Figure 6-5 AFM height profile of nanocellulose samples synthesized from cherry veneer using
Fe@MagTEMPO catalyst. (a) Freshly synthesized nanocellulose sample and (b) nanocellulose sample stored at 4 °C for 10 months ...... 156
Figure 6-6 AFM image showing assembly of microbes on nanocellulose fibrils synthesized from cherry veneer using Fe@MagTEMPO catalyst ...... 157
Figure 0-1 HPLC chromatogram for set 2 IspE in vitro reactions. The peaks for ATP (2.49 min),
ADP (2.60 min) and AMP (2.72 min) are labelled...... 181
Figure 0-2 GC-MS Chromatograms of aerobic shake flask fermentation analysis for SACar,
SAMyr, SALin and SALim+. The monoterpene retention times are: pinene 6.48 min, sabinene
7.42 min, myrcene 7.86 min, carene 8.42 min, limonene 8.88 min, ocimene 9.36 min, terpinene
9.61 min, terpinolene 10.20 min, linalool 10.35 min...... 182
xxvii
List of Equations
Equation 1-1 MVA pathway stoichiometry ...... 5
Equation 1-2 MEP pathway stoichiometry ...... 5
Equation 6-1 The unabridged V-factor equation ...... 150
Equation 6-2 V-factor equation ...... 150
xxviii
List of Abbreviations
ADP Adenosine diphosphate
AFM Atomic force microscopy
AMP Adenosine monophosphate
ATP Adenosine triphosphate
CDP-ME 4-Diphosphocytidyl-2-C-methylerythritol
CESR Continuous ex situ recovery
CTP Cytidine triphosphate
DHAP Dihydroxyacetone phosphate
DMA Dimethylallyl alcohol
DMAP Dimethylallyl monophosphate
DMAPP Dimethylallyl pyrophosphate
DO Dissolved oxygen
DOX 1-Deoxy-d-xylulose
DOXP 1-Deoxy-d-xylulose 5-phosphate
Dxs 1-Deoxy-d-xylulose 5-phosphate synthase
ED Entner-Doudoroff
EMP Embden–Meyerhof–Parnas
ESI Electrospray ionization
FPP Farnesyl pyrophosphate
GAP Glyceraldehyde-3-phosphate
xxix
GGPP Geranylgeranyl pyrophosphate
GPP Geranyl pyrophosphate
GRAS Generally recognized as safe
HMBPP 1-Hydroxy-2-methyl-2-(E)-butenyl-4-diphosphate
ICH International Council for Harmonisation
Idi Isopentenyl diphosphate delta-isomerase
IPK Isopentenyl monophosphate kinase
IPP Isopentenyl pyrophosphate
ISO Isopentenol
IspC or Dxr 1-Deoxy-d-xylulose 5-phosphate reducto-isomerase
IspD or YgbP 4-Diphosphocytidyl-2-C-methylerythritol synthase
IspE or YchB 4-Diphosphocytidyl-2-C-methyl-D-erythritol kinase
IspF or YgbB 2-C-Methyl-D-erythritol 2,4-cyclodiphosphate synthase
IspG or GcpE 4-Hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
IspH or LytB 4-Hydroxy-3-methylbut-2-en-1-yl diphosphate reductase
LB Luria Bertani log P o/w Logarithm of partition coefficient of a compound in oil from water
MD Molecular dynamic
ME 2-Methyl-d-erythritol
MEcPP 2-C-Methyl-D-erythritol-2,4-cyclodiphosphate
MEP 2-C-Methyl-d-erythritol-4-phosphate
MTS Monoterpene synthase
xxx
MVA Mevalonate/mevalonic acid
MWCO Molecular weight cut off
NAD Nicotinamide adenine dinucleotide
NADH Reduced nicotinamide adenine dinucleotide
NADP Nicotinamide adenine dinucleotide phosphate
NADPH Reduced nicotinamide adenine dinucleotide phosphate
NMR Nuclear magnetic resonance
OD 600 Absorbance at 600 nm wavelength
ORF Open reading frame
PDB Protein data bank
PCR Polymerase chain reaction
PDB Protein data bank
RBS Ribosome binding site
SDS/PAGE Sodium dodecyl sulphate poly acrylamide gel electrophoresis
TPP Thiamine pyrophosphate
TS Terpene synthase
TSS Transition state structure vvm Volume of air passed through 1 volume of liquid per minute (in bioreactor)
xxxi
Glossary
Cn This refers to a chemical structure containing ‘n’ number of carbon atoms.
For example, C10 refers to a compound containing 10 carbon atoms.
th Cn This refers to the carbon atom at n position in the structure.
th For example, C 10 refers to the carbon atom at 10 position in the structure.
Error bars The error bars in the figures denote standard deviation.
xxxii
Acknowledgments
I express my gratitude towards my supervisor Prof. Vikramaditya G. Yadav. He continually encouraged me at every step to lead the work and gave me the independence to take up new challenges. His guidance, educational expertise, providing resources required to conduct the research and scholarly feedback gave a direction to my work. I learned many things from him and being a Biofoundry group member that will take me a long way in my academic career. I thank
Prof. Charles Haynes for his support and constructive feedback at every step of my research progress and enriching my work with his enormous experience in the field.
I thank Joe and Sandip who taught me ABC of molecular biology techniques and trained me in cloning experiments. Protiva and I worked together on my first independent molecular biology project of construction of MEP pathway operon and I am grateful for her guidance and support. Discovery of natural fusions would have not been possible without the support from Prof.
Steven Hallam who provided us access to the genomic library and Sandip for working together in developing metagenomic screen. I thank Carmen for answering my doubts regarding cloning techniques and constructing non-natural enzyme fusions. I thank Adhithi for her willingness to take the lycopene work forward and help in analysis. I thank Lina Madilao for her expertise and help in carrying our LC-MS analysis and interpretation of its data.
Prof. Joerg Bohlmann encouraged me to study terpene synthases and provided all logistic support in terms of enzymes and their products required for the work. Every discussion with him gave me a new perspective on the activity of these enzymes and I deeply acknowledge his support and his work in the field. I thank Azin for her support in investigating the bioprocessing challenges and showing a keen interest in continuing the study in detail. The work also required engineering xxxiii
expertise that was provided by Logan. I thank the contributions of Vicente and Rodrigo who were the undergraduate students worked with me at different stages helping me in cloning.
I thank Prof. Scott Renneckar and Saurabh who gave me opportunities to employ my expertise to tackle the challenges in the biomass valorization field. I thank Sarvesh and Prof. David
Wilkinson who realized the potential of my recombinant lycopene strain as a biogenic photovoltaic material. Both these inter-field academic collaborations stimulated my thinking and widened my research approach.
I also thank the industrial collaborator Inmed Pharmaceuticals for their interest in commercializing the platform strains I developed for cannabinoid biosynthesis. I appreciate inputs from their chief scientific officer Eric Hsu in the process. This experience led to the filing of two patent applications. I thank Protiva and Benson who are now working on scale-up studies of my work. My stay in UBC would have never been enjoyable without my labmates Julia, Parisa, Roza,
Gaurav, Amir and Maryam apart from other members already mentioned. I would like to thank the workshop and administrate staff members of the department for the support. I appreciate all the relations I wove during my tenure at UBC, every stitch of which is unique and colorful.
Lastly, there are few people in my life whom I can never thank enough. My parents have always supported me through thick and thin. Their philosophy encouraged me to pursue higher education and seek excellence. My husband Saurabh has not only encouraged me but also reflected it every day through his deeds. My younger brother Harish always kept the innocence and charm alive in me.
xxxiv
Dedication
Dedicated to my Vibrant In dia
xxxv
Chapter 1: Introduction
Chemical engineering and sciences have been catering to the needs of mankind. This industry is operating at a massive scale of $5.1 trillion per year of global sales 1. These include petrochemicals, polymers, bulk chemicals, consumer products and specialty chemicals. Natural products (NP), their derivatives, semi-synthetic analogs and synthetic derivatives are used in sectors like pharmaceuticals, nutraceuticals, flavors, fragrances, cosmetics, energy, agrochemicals etc., and their share in the market is increasing exponentially. Some of the examples of these natural products are isoprenoids, polyketides, alkaloids, phenylpropanoids etc.
1.1 Terpenoid chemical space
Isoprenoids are one of the largest and most diverse classes of natural products with more than 80000 characterized structures 2. Isoprenoids are also known as terpenoids or terpenes though terpenes are regarded as the hydrocarbon moieties which on rearrangements and functionalization lead to terpenoids containing oxygen moieties. The word ‘terpene’ was named by Otto Wallach who received the Nobel prize in 1910 for his pioneering work on terpene characterization 3.
Figure 1-1 depicts the isoprenoid compounds and biosynthetic interlinks that are summarized from sources on the Kyoto Encyclopedia of Genes and Genomes 4. The isoprenoid backbone also forms the structural basis for many other classes of natural products such as zeatin, quinone, alkaloids, polyketides etc.
1
Figure 1-1 Suite of isoprenoid products
1.2 Applications of terpenoids
Terpenoids are involved in a wide variety of biological functions of growth and development in their natural producers. Terpenoids have been evolved to function as defense molecules in plants and hence are naturally tailored to bind to physiological targets. Isoprenoids have applications in pharmaceutical, nutraceutical, food, flavor and fragrance industry. Terpenes possess high calorific value due to their hydrocarbon backbone and serve as fuel. Artemisinin is used as an antimalarial drug, taxol is used as an anticancer agent. Sandalwood oil and patchouli oil are used as fragrances, isoprene is used in polymers; and myrcene, farnesene are used as fuel additives5.
2
1.3 Isoprenoid synthesis
There are the following three routes for isoprenoid synthesis:
Chemical synthesis
Total chemical synthesis of terpenoids involves multistep catalytic processes that suffer
from poor yields due to complexity in terpenoid chemical structure. Total taxol biosynthesis
involves 42 steps with 0.75 % overall yield 6. Isodon polycyclic diterpene synthesis from a non-
natural starting material takes 12 steps involving rigorous conditions of temperature and pressure 7.
(-)-Lingzhiol takes 17 steps with multiple intermediary purifications from a substrate that needs to be synthesized from petrochemical resources 8.
Semi-synthesis
Although natural hosts such as plants produce terpenoids in very low quantities, they
selectively accumulate some of the stable key precursors at higher quantities. Bacteria and yeast
are engineered to produce such acyclic or cyclic polyprenic intermediates in bulk which are then
purified and converted to the desired isoprenoid via chemical steps. These reactions involve one
or more derivatization steps of the precursor by electronegative groups like alkoxy, phenyl or
halogen followed by selective cyclization reactions 9. This biogenic chemical reactions 10 have lower yields and are difficult to engineer for regioselectivity. These cyclization steps utilize Lewis and Brønsted acid-promoted mechanism 11 . Continuous flow synthesis of artemisinin, an endoperoxide ring containing diterpene, has shown to improve the yield and lower manufacturing cost 12 .
3
Biosynthesis in heterologous hosts
Harvesting terpenoids directly from their natural producers have several consequences.
Their abundance in the natural host is low requiring large biomass processing. For example, 20 kg
of bark from 5 full grown Pacific Yew trees yields 1 kg of an anticancer drug, taxol 13 . Moreover,
the terpenoids usually occur as a mixture in the host from where the desired product needs to be
purified making the downstream process time intensive and cost ineffective, for example,
extraction of sweet steviosides from other bitter and salty analogs in Stevia plant 14 . Apart from these challenges, the recovery of the desired product from the natural producer can often be inefficient such as volatility of isoprene leads to its loss from plants before its capture 15 . Hence,
for majority of these class of products, the growing demand exceeds the supply that can be
sustained by isolation from their natural sources. Hence, alternative sources of expression of the
biosynthetic pathways in heterologous hosts that provide ease of engineering and processing have
gained importance. Microbial fermentations with recombinant strain has shown to produce
selective terpenoids at high titers and productivities16–18 . Pathway engineering tools have further improved yields 19–21 .
1.4 Biosynthetic pathways of isoprenoid production
Biosynthesis of terpenoids in the natural host is a multistep process involving a wide range of enzymes catalyzing reactions that could be summarized as follows:
C5 precursor synthesis
Two isoprenoid biosynthetic pathways exist that synthesize the C5 precursors, IPP and its
isomer DMAPP. Archea and eukaryotes other than plants use the mevalonate-dependent (MVA) 4
pathway exclusively to convert glucose to IPP, which is subsequently isomerized to DMAPP.
Plants use both the MVA and the mevalonate-independent (MEP) pathways for isoprenoid synthesis whereas MEP is dominant in prokaryotes 19,22,23 . There are evidences of the existence of subcellular level compartmentalization of the isoprenoid biosynthetic pathways in higher organisms like plants. The mevalonate pathway is located in the cytoplasm and MEP pathway is located in plastids. The former leads to the formation of sesquiterpenes, triterpenes, squalene, sterols and higher polyprenoids. On the other hand, the MEP pathway is responsible for the formation of the monoterpenes, diterpenes and terpenoids required for the photosynthetic machinery (carotenoids, phytol, prenyl chain of plastoquinone) 24,25 .