University of Kentucky UKnowledge

Theses and Dissertations--Chemistry Chemistry

2017

APPLICATION OF HIGH-RESOLUTION ACCURATE MASS (HRAM) MASS SPECTROMETRY FOR ANALYSIS OF MODEL COMPOUNDS AND THE POST-PRETREATMENT PRODUCTS

Fan Huang University of Kentucky, [email protected] Digital Object Identifier: https://doi.org/10.13023/ETD.2017.149

Right click to open a feedback form in a new tab to let us know how this document benefits ou.y

Recommended Citation Huang, Fan, "APPLICATION OF HIGH-RESOLUTION ACCURATE MASS (HRAM) MASS SPECTROMETRY FOR ANALYSIS OF LIGNIN MODEL COMPOUNDS AND THE POST-PRETREATMENT PRODUCTS" (2017). Theses and Dissertations--Chemistry. 74. https://uknowledge.uky.edu/chemistry_etds/74

This Doctoral Dissertation is brought to you for free and open access by the Chemistry at UKnowledge. It has been accepted for inclusion in Theses and Dissertations--Chemistry by an authorized administrator of UKnowledge. For more information, please contact [email protected]. STUDENT AGREEMENT:

I represent that my thesis or dissertation and abstract are my original work. Proper attribution has been given to all outside sources. I understand that I am solely responsible for obtaining any needed copyright permissions. I have obtained needed written permission statement(s) from the owner(s) of each third-party copyrighted matter to be included in my work, allowing electronic distribution (if such use is not permitted by the fair use doctrine) which will be submitted to UKnowledge as Additional File.

I hereby grant to The University of Kentucky and its agents the irrevocable, non-exclusive, and royalty-free license to archive and make accessible my work in whole or in part in all forms of media, now or hereafter known. I agree that the document mentioned above may be made available immediately for worldwide access unless an embargo applies.

I retain all other ownership rights to the copyright of my work. I also retain the right to use in future works (such as articles or books) all or part of my work. I understand that I am free to register the copyright to my work.

REVIEW, APPROVAL AND ACCEPTANCE

The document mentioned above has been reviewed and accepted by the student’s advisor, on behalf of the advisory committee, and by the Director of Graduate Studies (DGS), on behalf of the program; we verify that this is the final, approved version of the student’s thesis including all changes required by the advisory committee. The undersigned agree to abide by the statements above.

Fan Huang, Student

Dr. Bert C. Lynn, Major Professor

Dr. Mark A. Lovell, Director of Graduate Studies APPLICATION OF HIGH-RESOLUTION ACCURATE MASS (HRAM) MASS SPECTROMETRY FOR ANALYSIS OF LIGNIN MODEL COMPOUNDS AND THE POST-PRETREATMENT PRODUCTS

DISSERTATION

A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the college of Arts and Sciences at the University of Kentucky

By

Fan Huang

Lexington, Kentucky

Director: Dr. Bert C Lynn, Professor of Chemistry

Lexington, Kentucky

2017

Copyright © Fan Huang 2017

ABSTRACT OF DISSERTATION

APPLICATION OF HIGH-RESOLUTION ACCURATE MASS (HRAM) MASS SPECTROMETRY FOR ANALYSIS OF LIGNIN MODEL COMPOUNDS AND THE POST-PRETREATMENT PRODUCTS

Lignin, one of main components in the woody cell walls, is a complex heterogeneous biopolymer, which provides structural support and transportation of water in plants. It is highly recalcitrant to degradation (both chemically and environmentally) and protects cellulose from being degraded/hydrolyzed. Due to the structural complexity of native lignin, complete characterization and elucidation of lignin’s structure remains very challenging. The overarching goal of this work is to develop mass spectrometry- based analytical methods to contribute to a better understanding of lignin structures. This dissertation will focus on the development and application of High- Resolution Accurate-Mass (HRAM) Mass Spectrometry (MS) as main analytical technique for studying lignin model compounds, including understanding the ionization behavior, studying corresponding fragmentation patterns and extracting structural information for structural elucidation eventually. Analytical methods were also developed to study the post-pretreatment products of the synthetic trimeric model compound using High-Performance Liquid Chromatography (HPLC) coupled with High-Resolution Accurate Mass (HRAM) Mass Spectrometry (MS). The first project of this dissertation focuses on mass spectral the characterization of lignin models from the in-vitro oxidative coupling reactions. Three specific trimeric compounds were isolated and their ionization behaviors were investigated using HRAM- MS via electrospray ionization (ESI). The reaction parameters of the in vitro oxidative coupling reaction were critical in alternating the linkage profiles of resulting dehydrogenation polymers (DHPs). Reaction parameters were tuned to obtain desired DHP linkages profile. Upon the isolation of three different trimeric compounds, a systematic comparison of ionization efficiency of three trimeric compounds was carried out using ESI-HRAM-MS under different ionization conditions. The second project was aimed to design a synthetic route for a lignin model compound that will be a good representation for native lignin during the pretreatment process. The model compound of interest has not been obtained previously through chemical synthesis. Due to the reactivity of cinnamyl alcohol, which contains the unsaturated side chain, this new synthesis strategy was developed based on the known aldol-type reaction route. A versatile synthesis procedure for preparation of b-O-4 oligomeric compounds was designed and implemented to include the most important functional groups (phenolic alcohol, aryl glycerol b-aryl ether bond and unsaturated side chain) in the resulting model compound. This new synthesis route also allowed incorporation of different . In the third project, Fenton chemistry was applied to a synthetic lignin model compound. Due to the non-specificity in the post pretreatment product profile, non- targeted analytical strategy was developed and applied to study the post-pretreatment products of the model compound using HPLC-HRMS. The results from this dissertation showed a significant difference in ionization behavior between three structurally different model compounds and indicated that primary structures of lignin compounds can largely affect corresponding electrospray ionization properties as well as fragmentation pattern. The work in this dissertation provides analytical techniques for non-targeted analysis of complex lignin samples and an insightful understanding of Fenton’s reaction pretreatment upon lignin model compound.

KEYWORDS: High-Resolution Mass Spectrometry, Lignin, Biofuel, Liquid Chromatography, Electrospray Ionization

Fan Huang

4/25/2017

APPLICATION OF HIGH-RESOLUTION ACCURATE MASS (HRAM) MASS SPECTROMETRY FOR ANALYSIS OF LIGNIN MODEL COMPOUNDS AND THE POST-PRETREATMENT PRODUCTS

By

Fan Huang

Bert C. Lynn

(Director of Dissertation)

Mark A. Lovell

(Director of Graduate Studies)

4/25/2017

Dedicated to my friends and family who supported me through this journey ACKNOWLEDGEMENTS

I would like to thank my parents for their unconditional love and trust for me every moment of my life. Their love and support have carried me throughout my whole graduate life. It is such a blessing for me to grow up in such a supportive and loving family, which has grown me into who I am today.

All these would not happen without my advisor, Dr. Bert C. Lynn, who deserves my sincerest gratitude. He has been giving me invaluable support and mentoring during this journey in seeking my PhD degree. I am always amazed by how much and how diverse his knowledge is. It was definitely a wonderful and incredible experience to learn from him. He has taught me to take initiatives, to think independent and to apply critical- thinking in becoming a scientist. I am so grateful for him being such an open-minded and patient advisor whenever I overwhelm him with hundreds of “why’s”. Not only has he given me professional guidance throughout the years in his group, but he has also supported me and given me invaluable advice regarding my future career decisions. Dr.

Lynn is the best advisor and mentor I could ever have. I would also like to sincerely thank all my committee members, Dr. Yinan Wei, Dr. Jason DeRouchey and Dr. Robert

Lodder for all their supports and insightful guidance in helping me throughout my graduate years. I would like to thank Dr. Michael Flythe for being allowing me to work in his lab and teaching me skills to work with bacteria in the beginning of this journey, it has been an enjoyable experience to work with him. I would like to thank Dr. Sue Nokes for serving as my outside examiner; it has also been a pleasant experience to collaborate with her. She has also been very supportive. I would like to express my gratitude to John

Layton and Dr. Anne-Frances Miller for providing me incredible guidance in running

iii NMR experiments. I would also like to thank Dr. Sean Parkin for helping me obtaining

X-ray crystal structure data whenever I ask for.

I want to thank all the past and present group members in Dr. Lynn’s group. I would like to specifically thank Jennifer Ashley, Dr. Dawn Kato, Dr. Brent Casper and

Dr. Sanja Trajkovic for their supporting from the beginning when I joined Dr. Lynn’s group. I have learned many diverse analytical skills from their experience which allowed me to quickly adapt into the life of graduate school. I am grateful to have them around me during the highs and lows throughout the process, since they understand this journey the best. I also want to express my gratitude to all my organic chemistry friends in the

Department of Chemistry, who have been my resources whenever I have organic questions.

Last but certainly not the least, I would also like to thank all my friends, both in

China and here in the States. They have been encouraging and supporting me through this journey. I am so grateful to know all these wonderful people during my time at

University of Kentucky. Their love and support have made me feel as if I am home even though I am on the other side of the earth.

iv TABLE OF CONTENTS

ACKNOWLEDGEMENT ...... iii

LIST OF TABLES ...... vii

LIST OF FIGURES ...... viii

LIST OF ABBREVIATIONS AND SYMBOLS ...... xi

Chapter 1: Introduction ...... 1 1.1 Background ...... 1 1.2 Lignocellulosic Biomass ...... 6 1.3 Conversion of Lignocellulosic Biomass to Biofuels ...... 10 1.4 High-Performance Liquid Chromatography ...... 13 1.5 Electrospray Ionization-Orbitrap Mass Analyzer ...... 14

Chapter 2: Ionization Behavior of Lignin Model Compounds ...... 24 2.1 Introduction ...... 24 2.1.1 Formation and Structure of Lignin ...... 24 2.1.2 Dehydrogenation Polymers (DHPs) ...... 25 2.1.3 Methods for Characterization of Lignin ...... 31 2.2 Materials and Methods ...... 39 2.3 Results and Discussion ...... 46 2.3.1 Characterization of Two Different DHPs ...... 46 2.3.2 Negative Ionization Behavior of Trilignols ...... 53 2.3.3 Positive Ionization Behavior of Trilignols ...... 60 2.3.4 HCD/MS/MS Experiment of Trilignols ...... 67 2.4 Conclusion ...... 77

Chapter 3: Synthesis of Trimeric Lignin Model Compounds ...... 78 3.1 Introduction ...... 78 3.1.1 Lignin Model Compounds ...... 78 3.1.2 Condensation Reaction of Aldehydes and Ketones ...... 82 3.2 Materials and Methods ...... 84 3.3 Results and Discussion ...... 91

v 3.3.1 Synthesis of Building Blocks 3,4 and 6 ...... 91 3.3.2 Synthesis of the Dimeric Intermediate ...... 92 3.3.3 Synthesis of the Trimeric Model ...... 93 3.4 Conclusion ...... 99

Chapter 4: Un-Targeted Analysis of Pretreatment Products Of b-O-4 Model Trilignol with High-Resolution Accurate Mass (HRAM) Mass Spectrometry Coupled with High Performance Liquid Chromatography (HPLC) ...... 100 4.1 Introduction ...... 100 4.1.1 Pretreatment Strategies ...... 100 4.1.2 Untargeted Analysis by High-Resolution Accurate Mass (HRAM) Mass Spectrometry ...... 104 4.2 Materials and Methods ...... 107 4.3 Results and Discussion ...... 109 4.3.1 High-Performance Liquid Chromatography (HPLC) Method Development ...... 109 4.3.2 Analysis of Unknown Compounds from Fenton’s Reaction of Trilignolic Model Compound (GGG) ...... 114 4.4 Conclusion ...... 129

Chapter 5: Conclusion ...... 131

References ...... 142

Vita ...... 146

Appendices ...... 147

vi

LIST OF TABLES Table 2.1 A brief review of different chemical degradation methods for structural

characterization of native lignin…………………………………………32

Table 2.2 NMR chemical shift assignments of three trimers……………………....45 Table 2.3 List of observed fragment ions for G(b-O-4)G(b-5)G and their chemical formula based on high resolution MS result of [M+Li]+ and [M+Na]+…73

Table 2.4 List of observed fragment ions for G(b-O-4)G(b-b) and their chemical formula based on high resolution MS result of [M+Li]+ and [M+Na]+....76 Table 4.1 Summary of various pretreatment strategies…………………………...103 Table 4.2 LC method used in the pretreatment study……………………………..111 Table 4.3 List of 20 features with their corresponding elemental composition and retention time based on the result from MZmine 2…………………….116

vii

LIST OF FIGURES

Figure 1.1 Total Fossil Fuels Productions and primary energy production by source (Data collected by EIA, Annual Energy Outlook, 2015)………………..3 Figure 1.2 Total Renewable Energy Productions and primary energy production by sources (Data collected by EIA, Annual Energy Outlook, 2015)………..4 Figure 1.3 Primary Energy Consumption by source in 2015 (Data collected by EIA, Annual Energy Outlook, 2015)…………………………………………..5 Figure 1.4 Simplified schematics of lignocellulose structure……………………….8 Figure 1.5 Common linkage types presented in lignin………………………………9 Figure 1.6 Typical bioconversion process of biomass-to-biofuel………………….12 Figure 1.7 Schematic depiction of an ESI source operated in the positive ion mode ………………………………………………………………………….17 Figure 1.8 Simplified Schematics of an Orbitrap…………………………………..20 Figure 1.9 Schematic of Q-Exactive mass spectrometer (Thermo Scientific) (source: http://planetorbitrap.com/q-exactive#tab:schematic)…………………..23 Figure 2.1 Structures of three monolignols: a) coniferyl alcohol(G); b) p-coumaryl alcohol(H); c) (S)………………………………………26 Figure 2.2 Resonance structure of phenoxy radical of lignin monomer…………..27 Figure 2.3 Three Examples of common bonding types presented in lignin; a) b-aryl ether (b-O-4); b) resinol (b-b); c) phenylcoumaran (b-5)……………..28 Figure 2.4 Example of b-O-4 linkage formation via radical coupling…………….29 Figure 2.5 Structures of three trimers: a) Guaiacylglycerol-b, g-bis-coniferyl ether [G(4-O-a)G(b-O-4)G]; b) Guaiacylglycerol-b-dehydrodiconiferyl ether [G(β-O-4)G(β-5)G]; c) Guaiacylglycerol- ether [G(β-O- 4)G(β-β)G]……………………………………………………………..44 + Figure 2.6 Full scan mass spectrum of DHP in Positive ion mode ESI, [M+NH4] ; a) DHP-CT; b) DHP-ZT………………………………………………….47 + Figure 2.7 Full scan mass spectrum of DHP in Negative ion mode ESI, [M+NH4] ; a) DHP-CT; b) DHP-ZT……………………………………………….48 Figure 2.8 GPC chromatogram of isolated DHPs; from top to bottom were coniferyl alcohol, dimers, trimers, tetramers and pentamers…………………….51 Figure 2.9 GPC chromatogram of DHPs from two different reaction conditions: DHP-CT products (top) and DHP-ZT products (bottom)………………52 Figure 2.10 a) Deprotonated ion intensity comparison of three trimers; b) full scan mass spectrum of G(4-O-a)G(b-O-4)G, Error bars indicate ± one standard deviation ……………………………………………………...55

viii Figure 2.11 Liquid chromatograms of G(4-O-a)G(b-O-4)G, a) BPC of trimer 537.213 ; b) EIC of 537.213; c) EIC of dimer, m/z 357.134; d) EIC of monomer, m/z 179.071; e) EIC of coniferyl alcohol, 179.071…………56 Figure 2.12 Proposed fragmentation mechanism of deprotonated G(4-O-a)G(b-O- 4)G ion………………………………………………………………….58 Figure 2.13 Ion intensity comparison of three trimers with chlorine ion adduct, [M+Cl]- in negative ion mode ESI, error bars indicate ± one standard deviation.………………………………………………………………..59 Figure 2.14 a) Relative ionization response of G(4-O-a)G(b-O-4)G in different salt solution. b) Cation selectivity of G(4-O-a)G(b-O-4)G in solution of

equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl, error bars indicate ± one standard deviation………………………….…………………………...61 Figure 2.15 a) Relative ionization response of G(b-O-4)G(b-5)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-5)G in solution of equal

molar of NH4Cl/LiCl/NaCl/KCl/RbCl, error bars indicate ± one standard deviation……………………………..………………………………….64 Figure 2.16 a) Relative ionization response of G(b-O-4)G(b-b)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-b)G in solution of equal

molar of NH4Cl/LiCl/NaCl/KCl/RbCl, error bars indicate ± one standard deviation ……………………………...………………………………...65 Figure 2.17 Positive ion mode ESI response comparison of three trilignols with the presence of different cations, error bars indicate ± one standard deviation ………………….……………………………………………66 Figure 2.18 HCD/MS/MS spectra of three trimers with ammonium adduct, + [M+NH4] , a) G(4-O-a)G(b-O-4)G; b) G(b-O-4)G(b-5)G; c) G(b-O- 4)G(b-b)G………………………………………………………………68 Figure 2.19 HCD/MS/MS spectra of G(4-O-a)G(b-O-4)G: a) [M-H]-; b) [M+Li]+; c) [M+Na]+………………………………………………………………...70 Figure 2.20 HCD/MS/MS spectra of G(b-O-4)G(b-5)G: a) [M-H]-; b) [M+Li]+; c) [M+Na]+………………………………………………………………...72 Figure 2.21 HCD/MS/MS spectra of G(b-O-4)G(b-b)G: a) [M-H]-; b) [M+Li]+; c) [M+Na]+………………………………………………………………...75 Figure 3.1 Synthetic route of target trimer G(b-O-4)G(b-O-4)G………………….81 Figure 3.2 General mechanism of a typical aldol reaction…………………………83 Figure 3.3 The synthesis of the three building blocks for the synthesis of target trimer……………………………………………………………………86 Figure 3.4 X-ray crystal structure of compound 9…………………………………94 Figure 3.5 a) Reaction pathway of nucleophilic addition of di-isopropanol amine to compound 6 (purple color); nucleophilic addition within compound 6

ix (blue color); b) Possible byproduct 10 resulting from elimination……..95 Figure 3.6 a) ESI-MS of targeted trimer in negative ion mode, [M-H]-; b) HCD/MS/MS spectrum of the trimer, [M-H]-………………………….98 Figure 4.1 Schematic of pretreatment step of biomass…………………………101 Figure 4.2 Liquid chromatogram of mixture of standards; a) Base peak chromatogram (BPC); b) Extracted Ion Chromatogram (EIC) of 3- hydroxy-4-methoxybenzoic acid; c) EIC of ferulic acid; d) EIC of coniferyl alcohol; e) EIC of vanillin…………………………………..112 Figure 4.3 EIC of starting trilignol GGG…………………………………………113 Figure 4.4 Time course measurement of starting material GGG, error bars indicate ± one standard deviation ……………………..…………...…………….117 Figure 4.5 Degradation of trimeric unknown feature, [M-H]- 551.1909………………………………..………………….………….118 Figure 4.6 a) HCD/MS/MS spectrum of m/z 551.1909; b) putative structures of m/z 551.1909………………………………………………...…………….120 Figure 4.7 possible fragmentation structures for corresponding trimeric unknown m/z 551.1909………………………………………………………….121 Figure 4.8 Time course plots of fifteen unknown features, error bars indicate ± one standard deviation …………………………………………………….122 Figure 4.9 Extract ion chromatogram (EIC) and HCD/MS/MS spectrum of vanillin (upper panel), EIC and HCD/MS/MS spectrum of unknown m/z 151.0397 (lower panel)………………………………….…………….124 Figure 4.10 EIC and HCD/MS/MS spectrum of vanillic acid (upper panel), EIC and HCD/MS/MS spectrum of m/z 167.0346 (lower panel) ..…………….126 Figure 4.11 EIC and HCD/MS/MS spectrum of synthetic G(b-O-4)G(b-O-4)G (upper panel), EIC and HCD/MS/MS spectrum of m/z 375.1439 (lower panel)…………………………………………………..…………….127 Figure 4.12 HCD/MS/MS spectra of three unknown features with putative structures………………………………………………...…………….128

x LIST OF ABBREVIATION AND SYMBOLS AFEX Ammonium fiber expansion APCI Atmospheric Pressure Chemical Ionization AU Arbitrary Units BPC Base Peak Chromatogram Btu British thermal unit ˚C Degree Celsius C-trap Curved quadrupole ion trap C18 octadecyl carbon chain CAD Collision Activated Dissociation CID Collision induce dissociation COSY Homonuclear correlation spectroscopy (CTA)2SO4 Cetyltrimethylammonium sulfate Da Dalton DFRC Derivatization Followed by Reductive Cleavage DHP-CT Dehydrogenation Polymer- Cetyltrimethylammonium sulfate DHP-ZT Dehydrogenation Polymer- Zutropf DHPs Dehydrogenation Polymers DIBAL-H Diisobutylaluminum Hydride Dpmo Number Degree of Polymerization EI Electron Ionization EIA Energy Information Administration EIC Extracted Ion Chromatogram ESI Electrospray Ionization ESI-MS Electrospray Ionization-Mass Spectrometry FT-ICR Fourier Transform Ion Cyclotron Resonance G Guaiacyl GC Gas Chromatography GC/MS Gas Chromatography/Mass Spectrometry GGG G(b-O-4)G(b-O-4)G GOH Coniferyl Alcohol GPC Gel Permeation Chromatography H Hydroxyphenyl HCD High Energy Collision Cell HCD/MS/MS High-Energy Collision Dissociation/Tandem Mass Spectrometry HPLC High-Performance Liquid Chromatography HRAM High-Resolution Accurate Mass HRMS High-Resolution Mass Spectrometry

xi HRP Horseradish peroxidase HSQC Heteronuclear Single Quantum Correlation IR Infrared kV Kilovolt LC Liquid Chromatography LC-MS Liquid Chromatography-Mass Spectrometry LDA Lithium Diisopropylamide m/z Mass-to-charge ratio MALDI Matrix Laser Desorption Assisted Ionization MALDI-MS Matrix Assisted Laser Desorption Ionization-Mass Spectrometry MS Mass Spectrometry MS/MS Tandem Mass Spectrometry MSW Municipal Solid Waste MW Molecular Weight MWL Mill Lignin NaHMDS Sodium Bis(trimethylsilyl)amide NIR Near Infrared NMR Nuclear Magnetic Resonance ppm Parts Per Million PPP Para-phenylphenol RP Reversed-phase rt Retention Time S Syringyl SEC Size Exclusion Chromatography STDEV Standard Deviation TBDMS t-butyldimethylsilyl TIPS Triisopropylsilyl TLC Thin Layer Chromatography UV Ultraviolet ZL Zulauf ZT Zutropf

xii

Chapter 1

Introduction

1.1 Background

According to U.S. Energy Information Administration (EIA), combined energy consumption rose from a low of 32 quadrillion Btu to a high of 97 quadrillion Btu since

1949 to 20151-2. This is an approximate 200% increase in energy consumption since

1949. There has been a downward trend in the production and consumption of coal and petroleum since 2006. The global energy production highly depends on the production of fossil fuels, which are the major energy sources. As global energy production and consumption have been growing increasingly, it may eventually lead to the depletion of fossil fuels. Alternative energy resources that meet the requirements of being renewable, sustainable and ecofriendly are the fundamental solution to the current concerns regarding both energy sustainability as well as global environmental problems created from conventional energy consumption. Solar energy is the most sustainable and abundant form of renewable energy on earth. Currently, solar is being utilizing in four ways: 1) direct photovoltaic conversion of sunlight to electricity; 2) via wind turbines to produce electricity; 3) by using mirrors to concentrate solar to power stirling engine which produces electricity; 4) by harvesting plants that can be burned to generate energy3. Among these different methods, plants are the most cost-effective one in converting photons into chemically stored energy4. Although the production of biomass, wind, solar and other renewable energies are trending upward, the production of natural gas has grown the most in the past ten years. As shown in Figure 1.1-1.3, according to the data collected by EIA, there is an upward trend of total energy production from 1949 to

1

2015. Natural gas and crude oil production are accounted for the increasing trend of total fossil fuels production. Nuclear electric power and biomass-based energy are two major sources accounted for total renewable energy production.

The second-largest biomass uses are wood and wood waste for production of electric and industrial power. For liquid transportation fuels production, current primary industrial biomass resources are predominantly corn grain for bioethanol and soybean oil for biodiesel. In general, technologies that convert corn grain and soybean oil to liquid fuel are referred as “first generation”2. However, the first-generation biomass, such as corn and soybean, are also important food resources. A concern has been that the food based-biofuel production contributes significantly increased in food prices and in the agricultural cultivation cost and land concentration for food based biomass, which results in the increase of total cost for food-based biofuel production. There is also an ethical concern that conventional biofuel production might reduce access to food and increase the food price, ultimately increasing global hunger.

Advanced technologies now enable the production of biofuel from cellulosic biomass, which is referred as second-generation, non-grain, and non-food based feedstock. There are several types of cellulosic biomass: 1) agricultural residues, which are non-food based by-products (e.g. corn stover, cotton gin trash); 2) energy crops, including woody energy crops (e.g., hybrid poplars, shrub willows) and herbaceous energy crops (e.g., switchgrass, miscanthus, sorghum, energycane); 3) forest resources: wood waste and woody biomass, including existing and re-purposed pulp and paper products, logging residues and forest thinning; 4) industrial and other waste, such as municipal solid waste (MSW) and urban renewal wood;

2

Figure 1.1. Total Fossil Fuels Productions and Primary Energy Production by source

(Data collected by EIA, Annual Energy Outlook, 2015)

3

Figure 1.2. Total Renewable Energy Productions and Primary Energy Production by sources (Data collected by EIA, Annual Energy Outlook, 2015).

4

40

35

30

25

20

15 udilo Btu Quadrillion

10

5

0 Coal Natural Gas Petroleum(Excluding Nuclear Electric Hydroelectric Power Geothermal Energy Solar Energy Wind Energy Biomass Energy (Excluding Biofuels) Power Supplemental Gaseous Fuels)

Figure 1.3. Primary Energy Consumption by source in 2015 (Data collected by EIA,

Annual Energy Outlook, 2015).

5

5) algae, which is a diverse group of primarily aquatic photosynthetic algae and cyanobacteria5.

1.2 Lignocellulosic Biomass

Bioenergy plants generally fall into two categories: 1) gymnosperms, such as soft (e.g. pine, spruce and cedar); 2) angiosperms including perennial grasses (e.g., switchgrass, miscanthus and sorghum), herbaceous species (e.g., corn, wheat), flowering plants (e.g., alfalfa, soybean and tobacco) and hard woods (e.g., poplar, willow and black locust)4. The major component of plant biomass is lignocellulose, which forms the plant cell walls, supporting and providing plant a protective barrier from degradation by physiological environment. Lignocelluloses are mainly composed of cellulose, hemicellulose and lignin. These three components interact in very intricate manner, resulting in heavily recalcitrant cell wall that is difficult to degradation or decomposition

(Figure 1.4).

Cellulose, the major component of lignocellulosic biomass (35-50%), is a linear polymer of several hundred to thousands of cellobiose units, which consists of two D- glucose molecules linked through b- (1,4)-glycosidic bonds. The cellulose strains are held together firmly through intra- and inter- molecular hydrogen bonding to form cellulose fibrils with high tensile strength. The cellulose fibers are further linked together via hydrogen bonds resulting in high rigidity and crystallinity of dominant cellulose.

Therefore, cellulose is insoluble in water and most organic solvents6.

Hemicelluloses (20-35%) are heterogeneous branched polymers that include several polysaccharides, such as arabinoxylan, xylan, glucuronoxylan, glucomannan and

6

xyloglucan. Hemicelluloses contain many different sugar monomers, for instance, xylose, mannose, galactose and arabinose. In contrast to crystallinity and rigidity of cellulose, hemicelluloses are easier to be hydrolyzed due to their amorphous, branched structure with short polysaccharide chain. However, hemicelluloses reduce the accessibility of cellulose protecting cellulose from hydrolysis6.

Lignin (15-20%) is a class of complex aromatic biopolymers that plays an important role in the support of vascular plants. Lignin is a polymer with three basic phenylpropane units: p-coumaryl alcohol, coniferyl alcohol and sinapyl alcohol. The content of lignin depends on the type of species. Lignin structures can also differ within the same plant, for instance, primary xylem, compression wood or early versus late wood cell. The monomeric composition in lignin also varies in different type of plants: p- coumaryl alcohol mainly exists in compression wood and grasses; coniferyl alcohol largest exists both in and softwoods; sinapyl alcohol mostly presents in hardwoods. Biosynthesis of lignin is through radical oxidative coupling of three monolignols. It results in the extreme complex amorphous polymers that contain variety chemical linkages: phenylpropane-b-aryl ether (b-O-4), phenylcoumaran (b-5), resinol

(b-b), biphenyl (5-5), dibenzodioxocin, 1,3-diarylpropane (b-1), a-aryl ether (a-O-4) and diaryl ether (4-O-5). The frequency of different linkage types differs between species

(showed in Figure 1.5). b-aryl ether (b-O-4) is considered to be the most abundant linkages in lignin (~ 60% in hard wood, 50% in soft wood). Phenylcoumaran (b-5) is about 11% in soft wood and 6% in hard wood. a-aryl ether (a-O-4) is considered to be ar relatively labile linkage in lignin compared with other linkages, it could present in dibenzodioxocin linkage or a, b-diaryl ether linkage7.

7

Lignin

Hemicellulose

Cellulose

Figure 1.4. Simplified schematics of lignocellulose structure.

8

HO

O

HO O

OCH3 O

β-aryl ether Resinol

OCH3 H3CO

O

O OCH3 O HO

OH

Phenylcoumaran Dibenzodioxocin

OCH3

OH

CH2OH

O O

HO

H3CO OCH3 OCH3 α, β-diaryl ether Biphenyl

H3CO O

O OCH3

Diaryl ether

Figure 1.5. Common linkage types presented in lignin.

9

1.3 Conversion of Lignocellulosic Biomass to Biofuels

The techniques for conversion of biomass to biofuels fall into two different methods: thermochemical and biochemical conversion.

Thermochemical conversion usually requires high-temperature deconstruction of biomass, such as pyrolysis, gasification and hydrothermal liquefaction. Pyrolysis decomposes biomass without the presence of oxygen to produce bio-oil intermediate at high temperature with or without the presence of catalysts. The resulting bio-oil contains mixtures of hydrocarbons with various lengths. The product requires further upgrading before it can be finished into a fuel or used in a refinery because it contains more oxygenated compounds than petroleum crude oils. Gasification typically deconstructs of biomass at more than 700 degree Celsius in the presence of sub-stoichiometric air generating clean synthesis gas or syngas, a mixture of CO2, CO, CH4, N2, and H2. The syngas can be further converted by catalyst to various other molecules such as methanol, ethanol and dimethyl ether. Hydrothermal liquefaction is a thermal deconstruction process of wet feedstock slurry under elevated temperature and pressure to generate hydrothermal liquefaction bio-oil. This process is mostly applicable to algal feedstock.3, 8

Biochemical conversion techniques require much lower temperature compared with thermochemical conversion. In general, biochemical conversion includes pretreatment, enzymatic hydrolysis, fermentation and product recovery (Figure 1.6) 8.

Bioconversion technologies appear to be more attractive over thermochemical method for several reasons: 1) it is relatively scale-neutral and may have lower capital costs than thermal conversion; 2) it is possible that technical development can reduce the costs of

10

production of cellulosic fuels so that farmers can participate in the ownership of the facilities.

As described in previous section, due to the intricate biological structure of lignocellulosic feedstock, biomass remains naturally recalcitrant, protecting themselves from decomposition. Therefore, one of the key steps in bioconversion is pretreatment of biomass. The goal of pretreatment is to use biological or chemical methods to increase the porosity of biomass feedstock and degrade/remove/modify lignin to increase the accessibility of cellulose and other polysaccharides for enzymatic hydrolysis in the following step. During the enzymatic hydrolysis, the solubilized cellulose and other polysaccharides are subjected to further hydrolysis to simple sugars such as glucose.

More efforts have been put on search for more active glycosyl hydrolases from incompletely explored sources (e.g. termites)3. The simple sugars are then fermented to liquid fuels such as ethanol, 1-butanol and acetone by the industrial biofuel-producing microorganism during the fermentation step. The generated liquid fuels are then recovered and separated from the fermentation step and ready for commercial purpose.

The performances of different pretreatment methods are typically evaluated by the level increased in the bioavailability of cellulose after the pretreatment and the level of total lignin content changes according to National Renewable Energy Laboratory (NREL) standard protocols. However, this method does not provide insights regarding the structural changes in lignin caused by the pretreatment process. Chemical degradation techniques coupled with gas chromatography (GC) have been applied in analysis of monomeric units of lignin. However, GC analysis has limitations on the size of the molecules.

11

Figure 1.6. Typical bioconversion process of biomass-to-biofuel

12

Typically, for analysis of lignin compounds, GC fails when the molecular weight higher than dimeric compounds.

High-Performance Liquid chromatography (HPLC) is commonly used to provide versatile separation abilities for complex sample mixtures with high dynamic range. Thus, high-performance liquid chromatography (HPLC) coupled with high-resolution mass spectrometry (HRMS) analytical method is developed and used in this dissertation for analysis lignin related compounds.

1.4 High-Performance Liquid Chromatography

High-performance Liquid chromatography (HPLC) is used in this dissertation as the main analytical separation technique. It is optimum for the separation variety of compounds that are non-volatile. HPLC is a type of liquid chromatography that provides good separation performance and it is run under high pressure. A typical HPLC system consists of a pump, an injector, and a separation column, reservoirs for liquid mobile phase and a detector. A separation column is usually packed with the stationary phase, which are very small porous particles (1-5 µm) with chemically modified surface resulting in the types of LC separation. Different chemical modification on the stationary phase generates a wide variety of LC separation techniques that are suitable for different analyte of interests. Among these, reversed-phase chromatography is widely used for many small organic molecules as well as peptides when using mass spectrometry as detector. Reserved-phase HPLC (RP-

HPLC) is the most common HPLC technique because a large number of compounds can be separated with RP-HPLC. The chromatographic separation is performed on a non-

13

polar stationary phase with a polar mobile phase. There is a wide range of non-polar stationary phase available, which provides various options for choosing the correct separation system according to the physical and chemical properties of the compounds in order to achieve good separation. The stationary phase for RP-HPLC is usually obtained through chemically modified on the solid surface, such as silica. The most common one is C18 column, which is putting long chain hydrocarbon containing eighteen carbons giving very high hydrophobicity. Modification of solid surface with phenyl group is often used in order to separate aromatic compounds. The mobile phase used in RP-HPLC is usually a mixture of an organic solvent (acetonitrile, methanol, etc.) and water. A gradient of mobile phase is usually used when separation of complex compound mixture is needed. During RP-HPLC, a small volume of liquid sample is injected and loaded on the top of the column. The sample components then interact with the stationary phase through hydrophobic interaction. The process of the separation of the sample each component is determined by the choice of the chemical property of the column packing as well as the mobile phases

1.5 Electrospray Ionization- Orbitrap Mass Analyzer

The applications of mass spectrometry on lignin biopolymers are mainly focused on gas chromatogram (GC)/mass spectrometry coupled with chemical degradation or pyrolysis. It has drawn a lot of interest in developing MS-based techniques that would extend the research on studying the intact lignin polymers. Along with the development of soft-ionization techniques, such as electrospray ionization (ESI) and Matrix Laser

Desorption Assisted Ionization (MALDI), application of mass spectrometry has been

14

extended to various compounds with wide range of molecular weight and provided useful structural information. In this dissertation, ESI-MS is the major analytical technique used to study and characterize the compounds of interest.

Electrospray Ionization

Electrospray ionization (ESI) is one of most extensively used ionization methods in mass spectrometry (MS). ESI has been mostly coupled with high-performance liquid chromatograph (HPLC) for the following reasons: a) the relative simplicity of ESI in introducing ions from solution phase to gas phase; b) wide range of eluent flow rate from a few µl/min to several hundred µl/min and c) applicability to a large number of analytes with diverse chemical and physical properties. Electrospray ionization takes place at atmospheric pressure. It is a “soft-ionization” method since there is little or no fragmentation during the electrospray ionization process. ESI can be used in both positive ion mode and negative ion mode. Here is the example of ESI process in positive ion mode. During ionization process (Figure 1.7), a solution of analyte is passed through a thin metal capillary which is applied several kV voltage (e.g., 500V to 4.5kV)9. The metal capillary is held positive potential vs. ground on the mass spectrometer, the high electric field at the emitter tip drives positively charged analytes to move toward the tip of the capillary emitter, resulting in an increasing density of positive ions at the liquid/air interface at the capillary tip. Under the effects of the applied electric field, once the coulomb repulsion between positive ions exceeds the surface tension of the liquid, a jet of positively charged solution emerges from the emitter tip, known as a Taylor cone10. As a result of formed Taylor cone, charged droplets ejects from the emitter. The sizes of these

15

droplets are around mm diameter range. The solvent then continuously evaporates off from the droplets owing to the high surface area-to-volume ratio, assisted by nebulizer gas flow and super-heated interface. The evaporation of solvent from initial charged droplets results in the shrinking of the droplets. As the droplets become smaller and smaller, the charge-to-volume ratio increases until the resulting coulomb repulsion overcomes the surface tension of the droplet, so called Rayleigh limit. The droplet reaching the Rayleigh limit forms a jet of offspring droplets, which are smaller and highly charged. The offspring droplets then undergo numbers of evaporation/jet fission cycles until a radius of about 5-10 nm is reached. Eventually gas-phase ions are formed10-

11. Several important parameters, such as spray voltage, flow rate, emitter tapering, surface hydrophobicity and fluid conductivity, play important roles on ionization efficiency of analytes.

16

Mass Spectrometry

ESI Spray Emitter

------+ + - + - - + + + + + ++ - + - + ++ + + + + + ++ + + + + + + Analyte Solution + + + + + - + +

+ + + + + + +

+ + +

+ + +

- ++ +

+ + + + + +

+ + +

+ + + +++ + + + + + + + + + + + + + + +

+ - + + + + + + + +

+ + + + + + + +

- + + + + + + + + + + + + +

+ + + + +

+ + + + + + + + + - + + ++ ++ + + + + + - - + ++ + - + + + + + ++ - + + + + + + + + + +++ + - + + + - - ++ Mass Spectrometry + - + + + + + + + ESI Spray Emitter

------+ + + + + + + + ++ - + - + ++ + + + + + ++ + + + + + + Analyte Solution + + + + + - + +

+ + + + + + +

+ + +

+ + +

- ++ +

+ + + + + +

+ + +

+ + + +++ + + + + + + + + + + + + + + +

+ - + + + + + + + +

+ + + + + + + +

- + + + + + + + + + + + + +

+ + + + +

+ + + + + + - + + ++ + + + + + - + + + + + + - + + ++ - + - + + + + ++ - + + + + + + + + + ++ + - + + + - - ++ + - + + + + + + + ------

e- e- e- + e- +

Figure 1.7 Schematic depiction of an ESI source operated in the positive ion mode.

17

Orbitrap mass analyzer

Mass spectrometry is quite unique among other analytical instrumental methods, as several different physical principles of ions form the foundation for various types of mass analyzers. For instance, mass-to-charge ratio can be measured based on trajectories in a magnetic sector mass analyzer, path stability in a quadrupole mass filter mass analyzer, orbital frequency in an ion cyclotron resonance mass analyzer and velocity in a time-of-flight mass analyzer. Each mass analyzer has its own advantages and disadvantages12.

The Orbitrap mass analyzer, used extensively in this dissertation, was first commercialized in 2005 by Thermo Fisher Scientific. It is a relatively new mass analyzer compared with other types, however, it quickly dominates the market because it can provide much higher mass accuracy and mass resolution compared to linear ion trap and is at a much lower cost than Fourier Transform Ion Cyclotron Resonance (FT-ICR).

In 1923, Kingdon first built an ion-trapping device using only electrostatic field based on a straight wire along the axis of outer cylindrical electrode. Later Knight refined the shapes of outer electrode. This trap design is often called “ideal Kingdon trap”.

Makarov invented and filed a patent of the new type of mass analyzer, the Orbitrap, in

199912.

The ion-trapping in Orbitrap also uses electrostatic field but with specially shaped inner and outer electrodes compared with Kingdon trap. It has a spindle-like central electrode and a barrel-like outer electrode (Figure 1.8). The ions trapped in the Orbitrap can be described with potential distribution:

18

* & ' ) & ' ) ! ", $ = $ - + (-. ) ln + 4 (1.1) ' ' ' 23

Where r and z are cylindrical coordinates, C is a constant, k is field curvature and

Rm is the characteristic radius. This field is quadrologarithmic because it is a sum of a quadrupole field of the ion trap and a logarithmic field of the cylindrical capacitor13.

In the field (1.1), ions move in an intricate spiral trajectory. By solving the equation of motion in polar coordinates (r, j, z) for ions with mass-to-charge ratio m/q, three characteristic frequencies of the ion can be derived:

# ! = %, (1.2) $ where w is the frequency of axial oscillation13

% ( &)(-* ! = ! % , (1.3) " +

13 where wr is the frequency of radial oscillations

% ( &)(-* ! = ! % (1.4) " +

13 where wj is the frequency of rotation .

Out of the three characteristic frequencies, only the frequency of the harmonic axial oscillation w, which is inversely proportional to the square root of the mass-to- charge ratio m/q of the ions, is completely independent of the energy and the position of ions. Therefore, only this frequency is used for mass analysis.

19

r

z

Figure 1.8. Simplified Schematics of an Orbitrap

20

Similar with FT-ICR instrument, image current detection is used as detector in

Orbitrap. Image current detection relies in that an ion cloud attracts (positive ions) or repels (negative ions) the electrons of the detection electrodes when it passes split outer electrodes. The image current induced by the oscillation of ions can then be amplified, transformed into a voltage signal and translated into frequency domain by Fourier

Transform resulting in accurate reading of ions’ mass-to-charge ratios. Because frequency can be measured at the highest accuracy among other physical quantities,

Orbitrap can offer very high mass measurement accuracy. With the use of an internal calibration ion, mass accuracy can be achieved with 3 ppm. As the harmonic axial oscillation frequency of each m/z is translated from image current, the more transients are acquired in each scan, the higher resolving power of mass analyzer can be achieved.

Thus, the time of analysis increases proportionately with the resolution required. Not only the analysis time is critical for high resolving power in Orbitrap, but it also requires ultra- high vacuum (<2x10-10 mbar) in order to allow ions to travel sufficiently long transients.

The ultra-high vacuum is needed to provide the sufficient mean free path of hundreds of ions revolution undisturbed around the center electrode. However, ion collides with background gas molecules even at this ultra-high vacuum level, resulting in loss of phase coherence of ion packet, causing a decrease in the signal intensity of the transient9, 12.

The Orbitrap mass spectrometer used in this dissertation is Thermo Scientific Q-

Exactive Orbitrap. The schematic of the mass spectrometer is showed in Figure 1.9). Q-

Exactive Orbitrap consists with several components: ESI ion source, S-lens, transfer octopole, quadrupole mass filter, a curve RF-only quadrupole known as C-trap, Orbitrap mass analyzer and high-energy collision cell (HCD). As shown in the schematic, ions

21

generated at the ion sources need to pass through a series of components before analyzed in the Orbitrap. C-trap, which is a bent RF-only quadrupole, is used to accumulate, store and thermalize ions prior to the injection to Orbitrap. It is important component for the high performance of Orbitrap. Ions get collisional damping in the C-trap resulting in efficiently capturing and cooling of the ions. Accumulated ions then can be injected from

C-trap to Orbitrap with narrow energy distribution and injection angle9, 14. The Q-

Exactive can acquire MS data at several different mass resolutions according to the specific analytical requirements. The higher the mass resolution, the lower the MS scan speed, vice versa. As mentioned previously, frequency is the physical property that can be measured at the highest accuracy. With the use of internal calibration ion, the mass accuracy is usually within 2 ppm. The elemental composition information can be achieved with high-resolution accurate mass data with the corresponding mass error. This is extremely useful in providing another level of information regarding identifying unknown compounds. In this dissertation, the high-resolution accurate mass data obtained has been greatly aided in eliminating the unrelated ions and structural identification and confirmation.

22

Figure 1.9 Schematic of Q-Exactive mass spectrometer (Thermo Scientific) (source: http://planetorbitrap.com/q-exactive#tab:schematic)

23

Chapter 2

Ionization Behavior of Lignin Model Compounds

2.1 Introduction

2.1.1 Formation and Structure of Lignin

As mentioned in the previous chapter, lignin is a complex heterogeneous biopolymer. It is mainly composed of three monolignols: 4-hydroxy cinnamyl alcohol, coniferyl alcohol and sinapyl alcohol (shown in Figure 2.1). When incorporated into lignin, they are referred as hydroxyphenyl (H), guaiacyl (G) and syringyl (S). Conventional numbering on the ring (C1-C6) and Greek letters on the side chain (a, b, g) are used to describe the different chemical linkage types. The amount of three monolignols varies in different species. The dominant in gymnosperm is G-unit with minor amount of H-unit, whereas angiosperm dicot lignins contain both G- and S- units. H-units are mainly present in softwood lignin and slightly higher in grasses15. These units are linked through carbon-oxygen ether and carbon-carbon linkages in various bonding types. During lignin biosynthesis, peroxidases initiate dehydrogenation of monolignols at the 4-OH position. The resulting phenoxy radical is resonance stabilized by delocalizing to O-4, C-1, C-3, C-5 and C-b positions. Resonance structures of the phenoxy radicals for coniferyl alcohol are depicted in Figure 2.2. Two phenoxy radicals are involved in dimer formation via radical coupling reaction between active radical positions. The C-b appears to be the most reactive site as the most abundant linkages in lignin involve in C-b position (e.g. b-O-4, b-b and b-5)7 (Figure 2.3). An

24

example of the oxidative coupling of coniferyl alcohol to form the most abundant b-O-4 bond is shown in Figure 2.4. This linkage involves a two-step reaction: 1) radicals at O-4 and C-b position couple and form a Cb-O4 ether, 2) the quinone methide intermediate generated from the coupling is quite reactive and can easily accept additions of another nucleophiles to the C1=Ca double bond. In this case, one water molecule acts as the nucleophile, which introduces an aliphatic alcohol at Ca position. Because the random and nonspecific chemistry of radical coupling reaction, biosynthesized lignin polymers are very complex with unknown combination of all different possible linkage as well as different frequencies of monolignols7.

2.1.2 Dehydrogenation Polymers (DHPs)

Klason first pointed out that lignin is a derivative from the oxidation of coniferyl alcohol, even though he didn’t have direct experimental data to support his view in

190816. Freudenberg and his coworkers then established the fundamental understanding of lignin chemistry. Freudenberg and his coworkers first successfully synthesized lignin- like dehydrogenation polymers (DHPs) of coniferyl alcohol in the presence of laccase and hydrogen peroxide16-17. DHPs have been widely used as a good model system for researchers to understand the chemistry of lignin for several reasons: 1) compared with native lignin, DHP does not have interference from other plant components, such as carbohydrates; 2) by controlling the polymerization condition, one can alter the DHP product profiles which provide deeper understanding of the biosynthesis of natural lignin;

25

OH

H3CO

a) OH

OH

b) OH

OH

H3CO OCH3

c) OH

Figure 2.1. Structures of three monolignols: a) coniferyl alcohol(G); b) p-coumaryl alcohol(H); c) sinapyl alcohol(S)

26

γ γ γ γ CH2OH γ CH2OH CH2OH CH2OH CH2OH

β β β β β α α α α α 1 1 1 1 1 6 2 6 2 6 2 6 2 6 2 3 3 3 3 3 5 5 5 5 5 OCH3 OCH OCH3 OCH3 4 4 OCH3 4 3 4 4 O O O O O

Figure 2.2. Resonance structure of phenoxy radical of lignin monomer.

27

HO

HO

O

H3CO OH

a) HO OCH3

OCH3

O OH H3CO

HO b) O

H3CO

O

H3CO OH

OH c) HO

Figure 2.3. Three Examples of common bonding types presented in lignin; a) b-aryl ether

(b-O-4); b) resinol (b-b); c) phenylcoumaran (b-5).

28

OH

HO CH2OH O H O OH H H3CO O

OCH3

H3CO

O H3CO

O H+

H

O HO CH2OH H HO CH OH H 2

O HO H O O

OCH3 OCH3

H3CO H3CO OH OH

Figure 2.4. Example of b-O-4 linkage formation via radical coupling

29

3) by isolating individual oligomers, one can obtain well-characterized lignin model compounds to be used for further studies.

The literature teaches that the product profiles of DHP, including frequency of each linkage type, molecular distributions and etc., heavily depend on the reaction conditions. There are generally two different reaction modes: 1) the reactants are added in a slow and continuous manner, which is the so called Zutropf (ZT) method; 2) The

Zulauf (ZL) method has all the reactants mixed at the same time. The reaction solvent, pH, temperature and the presence of polymerization templates also have large impacts on the molecular weight distribution and primary structures of DHPs. The slow and continuous way of adding monolignols and hydrogen peroxide is considered to be more similar with the process occurring in the plant. ZT method results in a higher content of b-O-4 linkages than ZL method18-19. The in vitro polymerization is done in an aqueous buffer system, such as phosphate buffer with pH 6.5. As the polymerization proceeds, the resulting oligomers precipitate from aqueous buffer solution because of low solubility of oligomers, creating an inhomogeneous reaction system. The precipitation of the oligomers limits the production of high degrees of polymerization. De Angelis and coworkers developed a polymerization reaction in the presence of cationic surfactant in phosphate buffer. The surfactant keeps the oligomers in the solution so that a higher degree of polymerization can be achieved20-21. The structures of DHPs synthesized with or without surfactant are distinctly different from each other.

30

2.1.3 Methods for Characterization of Lignin

Due to the heterogeneous nature of lignin polymers, extensive efforts have been investigated in order to characterize lignin. Early analytical techniques for characterization of lignins usually involved in the isolation and degradation of lignin into small pieces and then the polymer structure was deduced from characterization of the pieces. Chemical degradation methods were the only way to obtain structural information before the advent of various sophisticated analytical instrumental methods, which have greatly aided the knowledge of lignin structure.

Chemical degradation reactions are used in determination of the monomeric composition of lignin, both qualitatively and quantitatively. Table 2.1 gives a review of different chemical degradation methods for characterization of lignin7.

Spectroscopic methods in lignin study

There is no singular method to characterize lignins. Lignins have been extensively investigated with a variety of spectroscopic techniques: 1) Ultraviolet (UV) - Visible spectroscopic spectra provide adequately understanding of the substituent effect of benzene ring in lignin; 2) Vibrational spectroscopic methods, including infrared (IR), near infrared (NIR) and Raman spectroscopy, have been important in analysis of the functional groups in lignin; 3) Nuclear Magnetic Resonance (NMR) spectroscopy has been extensively applied to complex lignin polymers in providing structural information.

The two-dimensional (2D) NMR techniques are very powerful in determining the relative abundance of each monomeric units and relative frequencies of each types of bonding linkage7, but does not provide information regarding the specific sequence of lignin.

31

Table 2.1 A brief review of different chemical degradation methods for structural characterization of native lignin

Degradation Condition Advantages Disadvantages

Oxidative Alkaline Lignins are oxidized by First used to confirm Poor inter-laboratory

Degradation Nitrobenzene nitrobenzene in 2M NaOH the aromatic nature of reproducibility of the yield of

Oxidation and high temperature (160- lignins by monomeric products; little

(NBO) 180˚C) Freudenberg; simple information regarding side chain

reaction mixture; yield functionality and bonding 32

satisfactory amount of pattern; lack of specificity

benzaldehyde of H, G

and S units;

High reproducibility

of S/G ratios

Permanganate Initial peralkylation of Evaluation of free Low throughput; difficult and

Oxidation phenolic hydroxyl groups phenolic groups in multi-step procedure

with diethylsulfate or lignin

dimethylsulfate, followed

by oxidation with

permanganate and

hydrogen peroxide

Thioacidolysis Acid-catalyzed reaction Simple; informative; It is not an easy technique to

using ethanethiol and boron high yield of desulfurize the thioacidolysis

trifluoride etherate results monomer; routine and products; Stinky reagent 33

in b-O-4 cleavage robust; information of

unusual units

Derivatization Followed by Bromination of the Mild reaction Lower monomer recovery than

Reductive Cleavage (DFRC) benzylic position and condition; routine and thioacidolysis; lack of

acetylation of free hydroxyl simplicity; quantitatively recovery;

group, followed by Preserve the

reductive cleavage of b-O- stereochemistry of C-

4 linkage with zinc dust, b;

acetylation of final Provide new structural

products for GC analysis data of lignin

chemistry, such as g-

ester groups remain

unmodified during

degradation method

Ozonation This method preserves the Exclusively target on 34

side chains and oxidizes lignin side chains;

aromatic rings to provide insight on the

carboxylic groups stereo structure of

lignin side chains;

primary structure; 4) Mass Spectrometry (MS) is extensively used when coupled with gas-chromatography (GC) by analysis of chemical degradation products of lignin for monomeric composition information. This method has limited applications on small volatile degradation products.

Along with the development of soft ionization techniques coupled with increasingly sophisticated MS hardware, mass spectrometry shows promising application in extending the studies on high molecular weight lignin compounds, which cannot be analyzed using GC/MS. Moreover, structural elucidation of lignin compounds is possible using tandem mass spectrometry (MS/MS).

Ionization of Lignin compounds

Traditionally, mass spectrometry (MS) was most commonly coupled with gas chromatography (GC) to identify volatile components in lignin degradation products using electron ionization (EI) technique. However, for the lignin samples with higher molecular weights and high polarities, alternative ionization and separation methods were required to characterize those complex lignin compounds.

In spite of the increasing popularity of the soft-ionization mass spectrometry methods for a variety of synthetic and biological molecules, the analysis of lignin products using different soft ionization techniques is under-developed by comparison 22.

Matrix-assisted laser desorption ionization (MALDI) is also a “soft-ionization” technique coupled with time-of-flight (TOF) mass analyzer. MALDI-TOF instrument is widely used in the study of large biopolymers for molecular weight (MW) determination.

Several groups have published articles using MALDI-MS to determine the MW

35

distribution of the natural extracted lignin and synthetic lignin model compounds and to monitor the degrees of enzymatic polymerization during in vitro synthesis of lignin-like polymers20, 23-26. However, the quality and reproducibility of MALDI-MS data are largely associated with sample preparation, the choice of MALDI matrix, and interferences from matrix ions, which has led to poor inter-laboratory consistency27.

The application of mass spectrometry in lignin chemistry has been greatly expanded by direct analysis in solution using electrospray ionization, which minimizes the inhomogeneity in sample preparation and interferences between analyte and matrix observed in MALDI-MS.

De Angelis et al. applied ESI-MS to characterize an in vitro synthesized model of dehydrogenation polymers (DHPs) in 199920. They successfully observed up to octamer as ammonium adduct ion in positive ion mode. Each oligomer showed definite mass and high regularity of the oligomers, differing by 178 and 180 mass units, which indicated that there were two different reaction mechanisms involved in the polymerization, one was radical coupling and the other was nucleophilic addition of monolignol. Evtuguin et al. used ESI to characterize extracted lignin and commercial lignin monomeric and dimeric model compounds in the same year28. Evtuguin et al. acquired ESI mass spectrum in the negative ion mode in the presence of ammonium hydroxide to aid the deprotonation during ESI. The lignin monomeric and dimeric models used in those studies all contained free phenoxy or carboxyl group, which gave good response in the negative ion mode with simple deprotonation. Due to the high complexity of natural lignin sample, the corresponding ESI mass spectra simply showed as a molecular weight distribution curve instead of discrete peaks. The clusters of ions with 164 - 197 mass unit

36

difference were observed in the mass range from 500 to 1500, which was consistent with average molecular weight of monomeric units28. It was obvious that negative ion mode would be a preferable method due to the free, weak acidic phenoxy group in lignin, as the phenoxy alcohol can be easily deprotonated and provide [M-H]- ion. Önnerud et al.29 applied both MALDI and ESI-Fourier Transform (FT)- Ion Cyclotron Resonance (ICR) to study the thioacidolysis products of lignin model compounds and mill wood lignin

(MWL) with both positive ion mode and negative ion mode. Interestingly, they have reported the observation of [M+H]+, [M+Na]+, and [M+K]+ ions of thioacidolysis products with and without the presence of free phenoxy group. It was clear that these dimeric compounds can easily form adduct ions with Na+ and K+ 29. Morreel et al. studied the fragmentation pattern of dimeric model compounds with characteristic linkage types in negative ion mode using ESI-MS30-31. Their work shows the promising application in elucidation of primary lignin structures with negative ion mode ESI-MS.

As for electrospray ionization, the nature of the analyte (e.g. functional group, polarity and etc.) and the spray solution have huge impacts on the performance and ionization efficiency of the analytes of interest. However, little effort has been expended to investigate into the evaluation of different ESI conditions for lignin. Haupert et al. published on the characterization of monomeric and dimeric lignin model compounds under different ionization conditions, including ESI and atmospheric pressure chemical ionization (APCI)29, 32. Even though the presence of the weak acidic phenoxy group would prefer the negative ion formation via deprotonation, in-source fragmentation can negatively affect the ionization response of model compounds in negative ion mode using traditional ESI ion source. Those model compounds formed sodiated adduct ions in the

37

positive ion mode with no effect of in-source fragmentation. However, the study32 also concluded that no structural information obtained by tandem mass spectrometry as the sodiated cations were very difficult to be fragmented by collision activated dissociation

(CAD). They have suggested that negative ion mode ESI with NaOH as dopant gives better ionization response with no obvious in-source fragmentation. Dean et al. published the study of the interaction of alkali metal ions with dimeric model compounds using optical spectroscopy coupled with mass spectrometry in 201533. This work provided insight regarding the configuration of binding site of lithium or sodium cations with dilignols each containing unique bonding type (b-O-4 and b-b)33. DeBlase et al.34 later applied the same approach to study the alkali cation interaction with tetralignol containing pure b-O-4 linkage. They reported that there was size effect of alkali cations on the three-dimensional structure of the tetralignol model compound34.

It is common that ionization response during ESI process can be very different for different compounds as they have different structural properties. Although lignins are polymers of phenylpropanoid units sharing similar functionalities, one should be careful in examining and interpreting the mass spectrometric data as the presence of “random” linkages in lignin. The complexity and “randomness” of lignin polymers are likely to give rise to different response during MS analysis. Further understanding regarding the ionization behavior of lignin is needed in order to confirm the molecular weight study when using mass spectrometry. The goal of this chapter is to characterize the average ESI response of model trilignols consisting with different distinct bonding motifs. The hypothesis of the first project contains two parts: 1) the electrospray ionization response study of lignin model compounds with distinct chemical linkages will give better

38

understanding in the structural effect on the ionization behavior; 2) by comparative analysis of high-energy collision dissociation (HCD) tandem mass spectrometry

(MS/MS) of trilignols in both positive and negative ion mode, linkage-specific fragmentation pattern maybe obtained, which can lead to useful structural information in

“lignomic” sequencing.

2.2 Materials and Methods

2.2.1 Materials

Coniferyl aldehyde, horseradish peroxidase (Type VI, 250-330 units/mg solid), cetyltrimethylammonium bromide were obtained from Sigma Aldrich (St. Louis, MO).

Sodium phosphate, monobasic and sodium phosphate, dibasic were obtained from Fisher

Scientific (Pittsburgh, PA). Cetyltrimethylammonium sulfate (CTA)2SO4 was synthesized according to Feitosa, E.35. Gel permeation chromatography (GPC) was accomplished with two different columns; a preparative (1 cm x 100 cm (Kontes, Fisher

Scientific, Pittsburgh, PA)) column packed with 90 cm Bio-Beads S-X1 (Bio-Rad,

Hercules, CA) and a 300 mm x 7.5 mm PLgel analytical GPC column (Agilent, Santa

Clara, CA). Isolated oligomers were analyzed on a PolymerX (100 X 4mm, 3um) HPLC column (Phenomenex, Torrance, CA). All the MS analyses were done on a Q-Exactive

Orbitrap mass spectrometer (ThermoScientific, Waltham, MA). Nuclear magnetic resonance (NMR) spectra were acquired on Varian Model NOVA400 (400MHz) spectrometer or Varian Model NOVA600 (600MHz) if otherwise mentioned.

39

2.2.2 Methods

Synthesis of Coniferyl alcohol

Coniferyl alcohol was prepared by the reduction of coniferyl aldehyde36 using

NaBH4. The product was purified by SiO2 column to yield a pure light yellow solid with

85% yield. 1H NMR (400 MHz, Chloroform-d) δ 6.96 – 6.79 (m, 3H), 6.53 (d, J = 15.8

Hz, 1H), 6.22 (dt, J = 15.8, 6.0 Hz, 1H), 5.67 (s, 1H), 4.30 (dd, J = 6.0, 1.5 Hz, 2H), 3.90

(s, 3H).

Synthesis of Dehydrogenation Polymers (DHPs)

Synthesis Condition 1: DHP in the presence of Cetyltrimethylammonium sulfate

(DHP-CT)

Coniferyl alcohol (GOH) was dissolved in 27 mM cetyltrimethylammonium sulfate [(CTA)2SO4]] (CTAS) solution (made in 10mM sodium phosphate buffer) to a final concentration of 5 mg/ml. Cetyltrimethylammonium sulfate was prepared according to procedure35, 37. Horseradish peroxidase (HRP type VI, 250-330 unit/mg) was added at a ratio of 1 mg HRP/1000 mg coniferyl alcohol. Then 10% hydrogen peroxide (1mg

GOH/2 µl H2O2 solution) was added and stirred for 5 min. The reaction was quenched by

21 adding several drops of NaHSO3 and extracted with ethyl acetate .

Synthesis Condition 2: DHP synthesis using Zutropf (ZT) method (DHP-ZT)

Synthesis of b-O-4 linkage enriched DHP followed reported methods18 with slight modification. In general, three different solutions were prepared: Solution A: 3 mg/ml coniferyl alcohol dissolved in acetone: 10 mM phosphate buffer (1:9), Solution B: 0.15%

40

wt of hydrogen peroxide in 10 mM phosphate buffer, with the same final volume of solution A, Solution C: 1µg/ml of horseradish peroxidase (HRP type VI, 250-330 unit/mg, amount of HRP used: 0.5 µg HRP per 1mg starting coniferyl alcohol) solution in

10 mM sodium phosphate buffer (pH 6.5). Solution A and B were added at a rate of 50 ml/hour by a syringe pump to solution C, stirred at 0˚C. The reaction was quenched after

5 min with 5% NaHSO3. The precipitate that formed was centrifuged to remove the supernatant. The precipitate was dried using speed-vac. The aqueous portion was then extracted with EtOAc and brine solution.

Isolation of trilignols

Guaiacylglycerol-b, g-bis-coniferyl ether [G(4-O-a)G(b-O-4)G] was isolated from reaction condition 1. The DHP was separated by semi-preparative gel permeation chromatography (GPC) column, packed with Bio-beads S-X1, 85 cm X 1 cm, THF was used as the elution solvent. The flow rate was at 0.2 ml/min38. Fractions were collected every 10 min. Because the large amount of the surfactant [(CTA)2SO4]] (CTAS) used in the reaction, the isolated trimeric fraction was further purified on silica gel to remove

+ residual surfactant. HRMS: [M+NH4] = 556.2544, Elemental composition: C30H38O9N, calculated: C30H38O9N, m/z 556.2541, ∆ppm=0.5149 ppm.

Guaiacylglycerol-b-dehydrodiconiferyl ether [G(β-O-4)G(β-5)G] and guaiacylglycerol-pinoresinol ether [G(β-O-4)G(β-β)G] were both isolated as a mixture from reaction done in condition 2 from GPC column, using the same condition as described above. The two trilignols were further separated by silica gel, eluted with ethyl

+ acetate/ acetone= 6/4. HRMS: G(β-O-4)G(β-5)G: [M+NH4] = 572.2494, Elemental

41

composition: C30H38O10N, calculated: C30H38O10N, m/z 572.2490, ∆ppm=0.6412 ppm;

+ G(β-O-4)G(β-β)G: [M+NH4] = 572.2493, Elemental composition: C30H38O10N, calculated: C30H38O10N, m/z 572.2490, ∆ppm=0.4279 ppm

The structures of isolated trilignols (shown in Figure 2.5) were confirmed with NMR.

The trimers were dissolved in 500 µl acetone-d6 for NMR analysis. Acetone solvent peaks used as internal reference peaks, δH/δC 2.05 (5)/29.62 (7), 206.68 (1).

Assignments for the spectra are according the database of lignin model compounds39-40.

Structural identification was performed by 1D and 2D NMR experiment (1H and HSQC).

The chemical shift of carbon was obtained from HSQC experiment. Table 2.2 shows the assignments of main 13C-1H cross signals of three trilignols of corresponding 1H and

HSQC spectra.

High-performance liquid chromatography (HPLC)-MS analysis

HPLC separation was carried out by using PolymerX-TM RP-1(100 X 4 mm, 3 mm) at the flow rate of 0.4 ml/min coupled to a Thermo Scientific Q-Exactive mass spectrometer with the electrospray ionization(ESI) in positive ion mode. Spray voltage:

3.0 kV, sheath gas: 20 arbitrary units (AU), capillary temperature: 280, full scan range: m/z 150-2000. All data were analyzed using Thermo Xcalibur.

Direct infusion MS analysis

All MS experiments were run on a Thermo Q-Exactive mass spectrometer equipped with ESI source. To standardize the MS analysis, all the solutions were made in

H2O/ACN=1:1 with 0.5 mM final salt concentration, with 0.1 µg/ml n-hexyl triphenyl

42

phosphonium bromide as internal standard in positive ion mode. For cation selectivity study, the sample was prepared in the solution of H2O/ACN=1:1, with 0.5 mM mixed salts (NH4Cl, LiCl, NaCl, KCl and RbCl). For the negative ion mode via deprotonation, the solution was made in H2O/ACN=1:1 with 0.5 mM NH4OH. For the negative ion mode via chlorine adduct, the solution was made in H2O/ACN=1:1 with 0.5 mM NH4Cl.

The solution was directly infused into MS using ESI at a flow rate of 3 µl/min with a micro-syringe pump. ESI condition: a) negative ion mode: spray voltage: 3.1 kV, sheath gas: 2 arbitrary units (AU), capillary temperature: 250, full scan range: m/z 100-1500; b) positive ion mode: spray voltage: 3.8 kV, sheath gas: 2 AU, capillary, temperature:

200˚C, full scan range: m/z 100-1500. All data were analyzed using Thermo Xcalibur.

43

a)

OCH3

α γ C O CH2OH γ β γ

HOH2C β CH2OH β α O B α

H3CO A

H3CO

HO G(4-O-α)G(β-O-4)G

Chemical Formula: C30H34O9 b) Exact Mass: 538.2203

OCH3 H3CO

O

HO O C B α β β α α γ

CH2OH HOH2C γ β γ A CH2OH

G(β-O-4)G(β-5)G HO OCH3 Chemical Formula: C30H34O10 Exact Mass: 554.2152

c)

OCH3

γ O HO CH OH γ 2 α C OH β β α β O B α γ H3CO A O

H3CO

HO G(β-O-4)G(β-β)G

Chemical Formula: C30H34O10 Exact Mass: 554.2152

Figure 2.5. Structures of three trimers: a) Guaiacylglycerol-b, g-bis-coniferyl ether [G(4-

O-a)G(b-O-4)G]; b) Guaiacylglycerol-b-dehydrodiconiferyl ether [G(β-O-4)G(β-5)G]; c)

Guaiacylglycerol-pinoresinol ether [G(β-O-4)G(β-β)G]

44

Table 2.2. NMR chemical shift assignments of three trimers

G(4-O-a)G(b-O-4)G

dC/dH (ppm) Assignments 80.86/5.48 Aa 85.10/4.57 Ab

61.61/3.94 Ag1

61.61/3.85 Ag2 129.65/6.50 Ba/Ca 129.10/6.20 Bb/Cb

G(b-O-4)G(b-5)G 73.96/4.89 Aa 88.649/4.20 Ab 88.50/5.60 Ba 54.80/3.53 Bb NA/6.50 Ca 130.95/6.23 Cb

G(b-O-4)G(b-b)G 73.96/4.88 Aa 88.649/4.20 Ab 88.65/4.67 Ba 55.62/3.09 Bb

72.39/3.80 Bg1

72.39/4.19 Bg2

45

2.3 Result and Discussion

2.3.1 Characterization of Two Different DHPs

As discussed above that the structural profiles of in vitro synthesized DHPs are highly dependent on reaction conditions. From comparison of DHPs by ZL and ZT synthetic conditions, both DHPs contain similar bonding types but differ in relative abundance of each linkage types. However, DHPs synthesized in the presence of CTAS

(DHP-CT) gives rise to distinctly different linkage types and corresponding frequencies of the linkages compared to ZL and ZT methods. Figure 2.6 shows the mass spectra of

DHP-CT and DHP-ZT in the positive ion mode. Each oligomer forms ammonium adduct

+ [M+NH4] species during the direct infusion in the presence of ammonium acetate.

Figure 2.6.a shows that DHP-CT displayed a highly regular oligomeric mass pattern, which differs by 178 and 180 amu alternatively. The increment of 178 amu indicates two dehydrogenated radical coupling. Increment of 180 amu indicates that ionic reactions occurred by the addition of monomer to a quinone methide intermediate. This leads to the formation of a, b-diaryl linkage in the lignin. In contrast to DHP-CT, DHP-ZT contains higher percentage of b-O-4 linkage. Figure 2.6.b is the mass spectra of DHP-ZT at the same ionization condition with that of DHP-CT, it appears to be a more complex product profile compared with DHP-CT. The cluster-like mass spectra in DHP-ZT results from oligomers that have same number degree of polymerization (DPn) but contain different numbers of b-O-4 linkage.

DHP-CT and -ZT samples were first analyzed in the negative ion mode ESI with simple deprotonation, shown in Figure 2.7. Interestingly, dramatic difference of the

46

GDHP_CTAS_121714_NH4_rep1 12/19/2014 3:53:36 PM GDHP_CTAS_121714_NH4_rep#1 a)0.5mg/ml GDHP_CTAS replicate # 1 in 5mM NH4OAc/ACN=1:1 GDHP_CTAS_121714_NH4_rep1 #70-189 RT: 1.83-4.95 AV: 120 NL: 5.53E7 T: FTMS + p ESI Full lock ms [150.00-1800.00] 556.2550 100

95

90 312.3627 85

80

75

70

65

60

55 494.5664 50

45 RelativeAbundance 40 284.2962 35 200.0691 30

25 734.3181 20 359.1492

15

10 382.4410 914.3974 603.6325 1094.4763 5 1272.5404 0 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 m/z

GDHP_ZT_121614_NH4_rep2 12/18/2014 4:06:52 PM GDHP_zt_121614_replicate #2 b)0.5mg/ml GDHP_zt_121614_replicate #2 in 5mM NH4OAc/ACN=1:1 GDHP_ZT_121614_NH4_rep2 #61-134 RT: 1.60-3.51 AV: 74 NL: 1.29E7 T: FTMS + p ESI Full lock ms [150.00-1800.00] 376.3421 100

95

90

85

80

75

70

65

60

55

50

45 734.3175

RelativeAbundance 40 397.2715 35 341.1384 30 436.1967 25 572.2494 755.5800 20 912.3802 614.2598 15 519.2017 654.3321 932.4020 10 235.1329 1108.4540 282.2791 5 1012.6406 1288.5318 1466.5944 1646.6698 810.3336 1150.4647 1330.5422 0 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 m/z

+ Figure 2.6. Full scan mass spectrum of DHP in Positive ion mode ESI, [M+NH4] ; a)

DHP-CT; b) DHP-ZT

47

GDHP_CTAS_121714_NEG_rep1 12/19/2014 2:37:53 PM GDHP_CTAS_121714_NEG_rep#1 a)0.4975mg/ml GDHP_CTAS replicate # 1 in 0.3%NH4OH/ACN=1:1, with lock mass GDHP_CTAS_121714_NEG_rep1 #33-129 RT: 0.86-3.38 AV: 97 NL: 6.95E7 T: FTMS - p ESI Full lock ms [150.00-1800.00] 179.0713 100

95

90

85

80

75

70

65

60

55

50 357.1342 45 393.1109

RelativeAbundance 327.1235 40

35

30 255.2327

25

20 420.1663 15

10 732.3026

5 478.2081 768.2795 0 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 m/z

GDHP_ZT_121614_NEG_rep2 12/18/2014 2:53:40 PM GDHP_zt_121614_replicate #2 b)0.4975mg/ml GDHP_zt_121614_replicate #2 in 0.3%NH4OH/ACN=1:1, with lock mass GDHP_ZT_121614_NEG_rep2 #10-105 RT: 0.26-2.75 AV: 96 NL: 8.87E6 T: FTMS - p ESI Full lock ms [150.00-1800.00] 255.2327 100

95

90

85

80

75 357.1341 70

65

60

55

50

45

RelativeAbundance 40 179.0713 376.1762 35

30

25 553.2080 713.2602

20 732.2747

15 527.1557 909.3356 10

5 587.1768 1071.4035 1266.4929 0 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 1700 1800 m/z

Figure 2.7. Full scan mass spectrum of DHP in negative ion mode ESI, [M-H]-; a) DHP-

CT; b) DHP-ZT

48

molecular distribution of DHP-CT sample was observed compared with that in the positive ion mode. For DHP-ZT samples, the molecular distribution was similar in positive and negative ion mode, but an overall higher response in positive ion mode than that in negative ion mode was observed.

These observations prompted the question of which ionization conditions were better represented the actual composition of the sample mixtures. Size exclusion chromatography (SEC), also known as gel permeation chromatography (GPC), is often used for analysis of molecular distribution of lignin. Figures 2.8 shows the GPC chromatograms of separated oligomer components from coniferyl alcohol standard, dimer, trimer, tetramer and pentamer. The separation efficiency of GPC column was not high enough to have baseline separation of each oligomer. The peak at rt =5.85 min in the top chromatogram in Figure 2.8 is a polystyrene standard, which has a MW around 1500. As shown in the GPC chromatogram of DHP-CT after 10min reaction (Figure 2.9 top), the most abundant oligomer is trimer, there are some dimer and monomer, and for the higher

DP oligomers could not achieve baseline separation with the GPC column used in this experiment. DHP-ZT, obtained from a 5 min reaction, shows similar molecular weight distribution with DHP-CT (Figure 2.9). GPC of DHP-ZT shows similar molecular weight distribution as observed in both positive ion mode and negative ion mode in ESI-MS.

Positive ion mode ESI-MS of DHP-CT shows closer pattern with GPC result, while negative ion mass spectrum is significantly under-presented the molecular weight profiles of DHPs. This important observation motivated this study into what difference between two DHPs gave rise to the difference in ESI response. In order to quantitatively study the ionization response, trilignols from DHPs were isolated and used to specifically compare

49

the ionization response with the presence of unique linkage types. Because trilignols contains more degrees of freedom than that of monomer and dimer, ionization behavior differences could be more significant than that of dimers. Moreover, trimers were separable using GPC and more abundant than other higher oligomers, because of the specialty of the GPC column used in this study. It is worth to point out that the each of the three isolated trilignols contains a mixture of diastereomers, which would be a good representation for natural lignin, as the goal of this study is aimed to characterization the average response of DHP with electrospray ionization.

50

D:\Fan\...\PS_GOH_102114_01 10/21/2014 1:00:44 PM

RT: 0.00 - 9.00 7.42 NL: 40000 4.27E4 Total Scan 30000 PDA PS_GOH_10211 20000 4_01

10000 5.83 0.02 0.33 6.87 7.92 8.12 0 6.88 NL: 15000 1.53E4 Total Scan PDA 10000 goh_oligo_gpc_ 2_051614_fracti 5000 on_15 6.52 8.15 0.05 0.38 1.15 1.47 1.77 2.00 2.42 2.68 3.23 3.43 3.73 3.93 4.52 4.77 5.25 5.53 6.07 6.22 7.38 8.02 8.37 8.68 0 6.52 NL: 1.12E5 100000 Total Scan PDA GOH_oligo_trim er_052114_03

uAU 50000

0.07 0.58 1.10 1.32 1.65 1.93 5.43 6.13 0 6.32 NL: 80000 8.38E4 Total Scan 60000 PDA GOH_oligo_GP_ 40000 052014_frac_11 20000 0.32 0.80 1.13 1.35 1.83 2.52 2.85 3.02 3.35 3.85 4.22 4.43 4.78 5.05 5.48 5.85 6.78 7.10 7.38 8.12 8.30 8.80 0 6.17 NL: 7.72E4 60000 Total Scan PDA GOH_oligo_GP_ 40000 052014_frac_9 20000 0.45 0.72 1.13 1.48 1.80 2.13 2.45 2.73 3.08 3.42 3.73 4.07 4.30 4.58 5.00 5.37 5.65 6.95 7.18 7.70 8.13 8.67 8.97 0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 8.5 9.0 Time (min)

Figure 2.8. GPC chromatogram of isolated DHPs; from top to bottom were coniferyl alcohol, dimers, trimers, tetramers and pentamers.

51

D:\Fan\...\GOH_oligo_051714_01 5/17/2014 11:45:58 AM

RT: 0.00 - 10.98 6.52 NL: 80000 8.30E4 Total Scan PDA 70000 6.17 GOH_oligo _051714_0 60000 1

50000

uAU 40000

30000 6.88 20000 7.43

10000 5.08 8.08 8.80 0.20 3.42 4.27 4.43 9.27 9.88 10.05 0 6.20 6.55 NL: 90000 9.34E4 6.03 Total Scan 80000 PDA goh_oligo_ 050514_01 70000

60000

50000

40000 5.05 30000 4.55 6.92 20000

10000 7.37 7.98 8.15 9.92 0.07 0.87 1.07 1.68 2.08 2.42 2.67 3.53 8.38 8.77 9.47 10.22 0 0 1 2 3 4 5 6 7 8 9 10 Time (min)

Figure 2.9. GPC chromatogram of DHPs from two different reaction conditions: DHP-

CT products (top) and DHP-ZT products (bottom)

52

2.3.2 Negative Ionization Behavior of Trilignols

Simple deprotonation [M-H]-

As shown in Figure 2.10.a, the absolute ion intensity of G(4-O-a)G(b-O-4)G is significantly lower than other two trilignols at the same concentration, while G(b-O-

4)G(b-b)G gives the highest ionization response. Mass spectrum of G(4-O-a)G(b-O-4)G shows high abundance of ion 357.1341 and 179.0712 (m/z consistent with the presence of dimer and monomer, respectively), while a low ion signal of the target trimeric ion is observed at m/z 537.2130 (Figure 2.10.b). Ions of 179.0712 and 357.1341 are likely resulting from in-source fragmentation during the ionization process. In order to support the hypothesis of in-source fragmentation of G(4-O-a)G(b-O-4)G, an LC-MS experiment was conducted on G(4-O-a)G(b-O-4)G. The EIC chromatograms (Figure 2.11) shows that m/z 357.1341 and m/z 179.0712 co-elute with target trimer m/z 537.2130 and have identical chromatographic peak shape. Analysis of a standard of monomer coniferyl alcohol shows that coniferyl alcohol has an earlier retention time than that observed in the

LC run of G(4-O-a)G(b-O-4)G. The comparison result of LC runs confirms that m/z

179.0712 observed in the spectrum of G(4-O-a)G(b-O-4)G is not an actual coniferyl alcohol but most likely from a fragment ion. Unlike G(4-O-a)G(b-O-4)G trimer, mass spectra of G(b-O-4)G(b-b)G and G(b-O-4)G(b-5)G show a very low amount of dimer ion, which could be because of the higher stability of cyclic structural moiety existing in these two trimers. Therefore, deprotonated G(4-O-a)G(b-O-4)G must undergo a charge- driven in-source fragmentation pathway31 because the phenolic monomeric moiety

53

serving as a very good leaving group compared to a-alcoholic group in typical b-O-4 group.

A proposed fragmentation pathway of G(4-O-a)G(b-O-4)G in negative ion mode30-31 is shown in Figure 2.12. The lone pair of electron resonated to a-C and the moiety C, shown in Figure 2.5.a), results in labile 4-O-a linkage during ionization. Even though for other two trilignols, it can also undergo this charge-driven fragmentation, the alcoholic group at a-C is much poorer leaving group compared to phenolic group at a-C in G(4-O-a)G(b-O-4)G. As a result, the in-source fragmentation has less effect on G(b-

O-4)G(b-5)G and G(b-O-4)G(b-5)G than that on G(4-O-a)G(b-O-4)G.

Chlorine ion adduct formation [M+Cl]-

To test the hypothesis of in-source fragmentation, MS spectrum of G(4-O-a)G(b-

- O-4)G in the presence of NH4Cl was acquired where [M+Cl] was the dominant ion and only a few dimeric ion and monomeric ion were observed. When forming chlorine ion adduct of G(4-O-a)G(b-O-4)G, unlike deprotonation, there is no free lone pair of electrons generated. As a result, the chlorine adduct anion does not undergo the proposed charge-driven fragmentation pathway. Chlorine adduct experiment supported the hypothesis regarding the activation of fragmentation via the deprotonation at the phenolic site.

As the preliminary data show that negative ionization by forming Cl- adduct greatly decreased the extent of in source fragmentation of G(4-O-a)G(b-O-4)G. It is interesting to see whether three trimers will behave differently with formation of Cl- adduct by comparing the Cl- adduct of all three trilignols

54

a)

600000

500000

400000

300000 Intensity

200000

100000

0 G(4-O-a)G(b-O-4)G G(b-O-4)G(b-5)G G(b-O-4)G(b-b)G

GDHP_ab_trimer_021815_NEG_rep2 2/18/2015 3:34:40 PM ` b)20ug/ml GDHP_ab_trimer in 0.5mM NH4OH/ACN=1:1 GDHP_ab_trimer_021815_NEG_rep2 #75 RT: 1.96 AV: 1 NL: 2.18E7 T: FTMS - p ESI Full lock ms [100.00-1500.00] 357.134 100 179.071 95

90

85

80

75

70

65

60

55

50 327.123 151.040 45

RelativeAbundance 40

35

30 255.233 269.212 25

20

15 375.144 10 403.139 195.066 715.276 5 455.102 537.213 895.357 635.179 778.308 941.363 1071.404 1149.173 1253.497 1339.590 1485.567 0 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 m/z

Figure 2.10. a) Deprotonated ion intensity comparison of three trimers; b) full scan mass

spectrum of G(4-O-a)G(b-O-4)G. Error bars indicate ± one standard deviation.

55

Figure 2.11. Liquid chromatograms of G(4-O-a)G(b-O-4)G, a) BPC of trimer 537.213 ; b) EIC of 537.213; c) EIC of dimer, m/z 357.134; d) EIC of monomer, m/z 179.071; e)

EIC of coniferyl alcohol, 179.071

56

Unlike with deprotonation, all trilignols have similar ionization response by forming Cl- adduct, shown in Figure 2.13. [M-H]- of G(b-O-4)G(b-5)G and G(b-O-

4)G(b-b)G are still observed in Cl- solution, which result from in-source fragmentation ions. Compared to previous deprotonation ionization, ion behavior of Cl- adducts shows better ionization response across all three trimers. The HCD/MS/MS of [M+Cl]- ions contain essentially the same fragmentation pattern with that of deprotonated ions. The loss of HCl gives rise to the main fragment ion of [M-H]-. As the increasing of collision energy, [M-H]- is further under fragmentation, which has the same fragmentation pathway as those from deprotonated precursor ions. Formation of Cl- adducts gives overall better response than simple deprotonation in the negative ion mode. It also seems promising in facilitating the formation of gas-phase negative adduct ions, all three trimers result in good ionization response with some extent of in-source fragmentation shown in

Figure 2.13.

57

HO O

C O HO O

- H O Chemical Formula: C10H11O3 Exact Mass: 179.0714 O C O

O B O

O A O OH

O - Chemical Formula: C30H33O9 O B Exact Mass: 537.2130 O A OH

- Chemical Formula: C20H21O6 O Exact Mass: 357.1344

Figure 2.12. Proposed fragmentation mechanism of deprotonated G(4-O-a)G(b-O-4)G ion.

58

3500000

3000000

2500000

2000000

Intensity 1500000

1000000

500000

0 G(4-O-a)G(b-O-4)G G(b-O-4)G(b-5)G G(b-O-4)G(b-b)G

Figure 2.13. Ion intensity comparison of three trimers with chlorine ion adduct, [M+Cl]- in negative ion mode ESI. Error bars indicate ± one standard deviation.

59

2.3.3 Positive Ionization Behavior of Trilignols

Previous ionization experiment of both different DHPs gives pretty good ion response in positive ion mode. Haupert et al. 32has reported that sodium adduct give good response because of no in-source fragmentation. It is worthwhile to compare ionization

+ + + + and fragmentation behavior of trimers across a series cations (NH4 , Li , Na , K and

Rb+).

(a) G(4-O-a)G(b-O-4)G

Comparison of ionization in NH4Cl and NH4OAc was first done to see whether a counter ion would have effect on the ionization response. There is no significant

+ - difference in [M+NH4] formation in presence of two different counter anions Cl and

OAc- at p<0.05. The ion intensities of both lithiated and sodiated ions are higher than the other three cation adducts (shown in Figure 2.14.a). In the positive ion mode, very few dimeric and monomeric ions are observed, which is opposite to what is observed in the negative ion mode. It is likely that cationization increases the rigidity of G(4-O-a)G(b-O-

4)G trimer, preventing it from in-source fragmentation32. Figure 2.14.b shows the ion competition studies which G(4-O-a)G(b-O-4)G is dissolved in a mixture of different cations with equal molar concentration. The ionization competition experiment shows similar response pattern with the individual cation experiment, where Li+ and Na+ are preferred in the ion competition experiment.

60

a)

120

100

80

60

40 Normalized Normalized Intensity

20

0 NH4 Li Na K Rb

b)

120

100

80

60

40 Normalized Normalized Intensity

20

0 NH4 Li Na K Rb

Figure 2.14. a) Relative ionization response of G(4-O-a)G(b-O-4)G in different salt solution. b) Cation selectivity of G(4-O-a)G(b-O-4)G in solution of equal molar of

NH4Cl/LiCl/NaCl/KCl/RbCl. Error bars indicate ± one standard deviation.

61

(b) G(b-O-4)G(b-5)G

Figure 2.15a shows the ion response of G(b-O-4)G(b-5)G in different salt solutions. Again, there is no significant effect in counter ions when compared the ion response of ammonium adduct formation in the presence of NH4Cl and NH4OAc. No

+ + + + + significant difference is observed in NH4 , Li , Na and K adducts, while Rb adduct formation has the lowest response among all cations.

Figure 2.15b shows the ion competition result of G(b-O-4)G(b-5)G exposed to the mixture of salts. The relative abundance of [M+Li]+, [M+Na]+, [M+K]+ and [M+Rb]+ is similar to the ionization comparison in individual solution (Figure 2.15.a). G(b-O-

4)G(b-5)G shows most affinity to Li+, Na+ and K+. [M+Rb]+ formation is less favorable compared to the formation of [M+Li]+, [M+Na]+, [M+K]+. Unlike the individual

+ ionization comparison, [M+NH4] gives lowest ion abundance compared to the other four cation adducts.

(c) G(b-O-4)G(b-b)G

The ionization response of G(b-O-4)G(b-b)G in different salt solutions is shown in Figure 2.16.a. Again, no significant counter ion (OAc- and Cl-) effect is observed by comparing ammonium adduct formation on G(b-O-4)G(b-b)G. Ion responses of lithiated

+ and sodiated adduct ions show no difference. They both give higher response than NH4 ,

+ + + K and Rb . Among all cationization, NH4 resulted in the lowest ion response. Figure

2.16.b shows that the cation competition results follow similar trends with the individual

+ ionization comparison. NH4 adduct formation is the least favored with G(b-O-4)G(b- b)G.

62

The data shows that different trimeric structures give different cationic response in the presence of different cations. Not only it is worthwhile to see whether individual trimer has a response preference toward particular cation, but it is more interesting to compare whether there is difference between three trilignols in the cationization response from each other due to their structural difference. Such comparison is shown in Figure

2.17. In contrast to the ionization response in the negative ion mode ESI ([M-H]-), G(4-

O-a)G(b-O-4)G is much more responsive compared to the other two trilignols in most of cations. In general, it shows that the relative ionization response of three trilignols for all the cations followe the order of G(4-O-a)G(b-O-4)G > G(b-O-4)G(b-b)G > G(b-O-

4)G(b-5)G

63

a)

120

100

80

60

40 Normalized Normalized Intensity

20

0 NH4 LI Na K Rb

b)

120

100

80

60

40 Nromalized Nromalized Intensity

20

0 NH4 Li Na K Rb

Figure 2.15. a) Relative ionization response of G(b-O-4)G(b-5)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-5)G in solution of equal molar of

NH4Cl/LiCl/NaCl/KCl/RbCl. Error bars indicate ± one standard deviation.

64

a)

120

100

80

60

40 Normalized Normalized Intensity

20

0 NH4 LI Na K Rb

b)

120

100

80

60

40 Normalized Normalized Intensity

20

0 NH4 Li Na K Rb

Figure 2.16. a) Relative ionization response of G(b-O-4)G(b-b)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-b)G in solution of equal molar of

NH4Cl/LiCl/NaCl/KCl/RbCl. Error bars indicate ± one standard deviation.

65

G(4-O-a)G(b-O-4)G G(b-O-4)G(b-5)G G(b-O-4)G(b-b)G

120

100

80

60

Normalized Normalized Intensity 40

20

0 NH4 Li Na K Rb

Figure 2.17. Positive ion mode ESI response comparison of three trilignols with the presence of different cations. Error bars indicate ± one standard deviation.

66

2.3.4 HCD/MS/MS Experiment of Trilignols

Tandem mass spectrometry has been successfully applied in sequencing the primary structure of proteins, similar strategy to sequence lignin is proposed by using tandem mass spectrometry. Morreel et al. has done wonderful work in studying the signature fragment ions of the dimeric model compounds containing different linkage types in the negative ion mode30-31.

Previous experiments have shown that positive ion mode gives overall better response than negative ion mode. However, Haupert et al. has found it difficult to obtain fragmentation ions with sodiated adduct analyte with collision activated dissociation

(CAD) in the tandem mass spectrometry32. Moreover, alkali metal adduct ions are considered to be problematic in tandem mass spectrometry previously using CID. No useful fragmentation ions can be obtained because the dissociation of alkali metal ion itself is preferred during collision induced dissociation (CID). Because the difference in the fundamental dissociation mechanism between CID and high-energy collision dissociation (HCD), different fragmentation of alkali metal adducts may be observed using HCD. It will be beneficial to study whether high-energy collision dissociation/tandem mass spectrometry (HCD/MS/MS) experiment will provide useful structural information. If so, one may then distinguish the linkages via the HCD/MS/MS spectra. The accurate MS data are acquired which provide additional confident in

+ analyzing the MS/MS data. For all three trilignols, HCD/MS/MS spectra of [M+NH4] contain most complicate fragment ions than other alkaline metal ion adducts. NH3 is easily knocked off from ammonium adduct ion, the resulting fragment ions are protonated. Compared with HCD/MS/MS of other cation adducts, MS/MS of

67

GDHP_ab_trimer_021815_NH4_rep2 2/18/2015 4:16:48 PM GDHP_ab_diaryl trimer_021815_NH4_rep2 20ug/ml GDHP_ab_trimer in 0.5mM NH4OAc/ACN=1:1, rep#2 GDHP_ab_trimer_021815_NH4_rep2 #137 RT: 3.59 AV: 1 NL: 5.42E7 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 359.1487 C 20 H23 O6 -0.6495 ppm 100

95

90

85

80

75

70

65

60 341.1383 55 C 20 H21 O5 -0.2201 ppm 50

45 137.0598 RelativeAbundance 40 C 8 H9 O2 0.3455 ppm 35

30 179.0703 25 C H O 235.0965 10 11 3 329.1383 C H O 20 0.2333 ppm 13 15 4 C H O 0.1123 ppm 19 21 5 163.0754 -0.2281 ppm 15 C 10 H11 O2 0.2302 ppm 291.1015 10 C 19 H15 O3 -0.2384 ppm 5

0 100 150 200 250 300 350 400 450 500 550 600 a) m/z GDHP_b5_trimer_021915_NH4_rep1 2/19/2015 11:08:00 AM GDHP_beta5 trimer_021915_NH4_rep1 19ug/ml GDHP_beta5_trimer in 0.5mM NH4OAc/ACN=1:1, rep#1 GDHP_b5_trimer_021915_NH4_rep1 #126 RT: 3.30 AV: 1 NL: 6.94E6 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 327.1227 C 19 H19 O5 0.1270 ppm 100

95

90

85

80

75

70

65

60 137.0598 C 8 H9 O2 55 0.7908 ppm

50 187.0755 C 12 H11 O2 45 0.6901 ppm

RelativeAbundance 40 303.1228 537.2122 C 17 H19 O5 35 217.0861 0.3384 ppm C 30 H33 O9 C 13 H13 O3 0.5775 ppm 30 0.9204 ppm 489.1911 235.0966 25 C H O C H O 29 29 7 13 15 4 0.5946 ppm 20 0.6315 ppm 285.1122 C 17 H17 O4 471.1805 15 161.0598 0.3793 ppm 341.1385 C 29 H27 O6 C H O C 20 H21 O5 0.6940 ppm 10 10 9 2 273.1122 0.7677 ppm 0.3167 ppm 555.2228 C H O 383.1491 16 17 4 C 30 H35 O10 5 C 22 H23 O6 0.2842 ppm 0.6587 ppm 0.4267 ppm 0 100 150 200 250 300 350 400 450 500 550 600 b) m/z GDHP_bb_trimer_021915_NH4_rep1 2/19/2015 2:39:29 PM GDHP_beta_beta trimer_021915_NH4_rep1 15ug/ml GDHP_beta_beta_trimer in 0.5mM NH4OAc/ACN=1:1, rep#1 GDHP_bb_trimer_021915_NH4_rep1 #126 RT: 3.30 AV: 1 NL: 1.44E7 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 235.0955 C 13 H15 O4 -4.1065 ppm 100

95

90

85

80

75 205.0852 C 12 H13 O3 70 -3.3410 ppm 65

60

55

50

45 137.0592 RelativeAbundance 40 C 8 H9 O2 519.1995 175.0748 C H O 35 -3.4398 ppm 30 31 8 C 11 H11 O2 -3.6237 ppm 30 -3.2718 ppm

25

20 507.1996 C H O 15 161.0592 383.1475 29 31 8 413.1579 -3.3485 ppm 537.2100 C H O C H O 10 9 2 22 23 6 C 23 H25 O7 C H O 10 -3.2114 ppm -3.7948 ppm 30 33 9 259.0956 321.1110 -3.7542 ppm 459.1785 -3.6262 ppm 5 C 15 H15 O4 C 20 H17 O4 C 28 H27 O6 -3.4906 ppm -3.6548 ppm -3.6743 ppm 0 100 150 200 250 300 350 400 450 500 550 600 c) m/z

+ Figure 2.18. HCD/MS/MS spectra of three trimers with ammonium adduct, [M+NH4] , a)

G(4-O-a)G(b-O-4)G; b) G(b-O-4)G(b-5)G; c) G(b-O-4)G(b-b)G.

68

+ [M+NH4] are more complicated to study signature fragment pattern of different

+ linkages, so the MS/MS spectra of [M+NH4] for three trilignols will not be further discussed (shown in Figure 2.18).

HCD/MS/MS spectra of G(4-O-a)G(b-O-4)G

[M-H]-

As discussed previously, a, b-diaryl ether bond is quite labile compared with other C-C condensed linkages; it is easily fragmented to monomeric and dimeric fragment ions (shown in Figure 2.19.a.) at lower collision energy compared with other two trimeric compounds. The dominant fragmentation ions, m/z 357 and 179 observed in the HCD/MS/MS experiment are the same with the ones from in-source fragmentation of

[M-H]- ion. Besides the major dimeric (m/z 357) and monomeric (m/z 179) fragmentation ions, small neutral losses of 18 Da (H2O), 30 Da (CH2O) are also observed. The loss of water (18 Da) indicates the presence of aliphatic alcohol group. The loss of formaldehyde (30 Da) also indicates the existence of aliphatic alcohol group31.

[M+Li]+ and [M+Na]+

Fragmentation of lithiated and sodiated ions are required higher collision energy than that of [M-H]-. As shown in Figure 2.19.b-c, HCD/MS/MS spectra of both lithiated and sodiated ions give essentially the same fragmentation ion. Both MS/MS spectra are pretty clean, containing only two major fragment ions, which correspond to the loss of a monomeric unit and the loss of a dimeric unit (loss of 180 and 359). The fragmentation patterns of

69

GDHP_ab_trimer_021815_NEG_rep3 2/18/2015 4:03:37 PM GDHP_ab_diaryl trimer_021815_NEG_rep3 20ug/ml GDHP_ab_trimer in 0.5mM NH4OH/ACN=1:1, rep#3 GDHP_ab_trimer_021815_NEG_rep3 #173 RT: 4.69 AV: 1 NL: 3.52E5 T: FTMS - p ESI Full ms2 [email protected] [100.00-600.00] 357.1338 179.0711 C 20 H21 O6 C 10 H11 O3 1.4679 ppm 100 4.7494 ppm

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40

35

30

25

20

15 164.0476 537.2125 C 30 H33 O9 10 C 9 H8 O3 327.1232 1.1456 ppm 4.9485 ppm C 19 H19 O5 5 146.0372 1.6196 ppm

0 100 150 200 250 300 350 400 450 500 550 600 a) m/z GDHP_ab_trimer_021815_Li_rep2 2/18/2015 4:41:17 PM GDHP_ab_diaryl trimer_021815_Li_rep2 20ug/ml GDHP_ab_trimer in 0.5mM LiCl/ACN=1:1, rep#2 GDHP_ab_trimer_021815_Li_rep2 #125 RT: 3.27 AV: 1 NL: 1.23E8 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 545.2358 365.1571 C 30 H34 O9 Li C 20 H22 O6 Li 0.1868 ppm 100 -0.0545 ppm 95

90

85

80

75

70

65

60

55

50

45 186.0863 C 10 H11 O3 Li RelativeAbundance 40 0.3197 ppm

35

30

25

20

15 204.0970 C 10 H13 O4 Li 10 171.0629 0.9371 ppm 5 C 9 H8 O3 Li 0.5676 ppm 0 100 150 200 250 300 350 400 450 500 550 600 m/z b)GDHP_ab_trimer_021815_Na_rep2 2/18/2015 5:06:29 PM GDHP_ab_diaryl trimer_021815_Na_rep2 20ug/ml GDHP_ab_trimer in 0.5mM NaCl/ACN=1:1, rep#2 GDHP_ab_trimer_021815_Na_rep2 #121 RT: 3.17 AV: 1 NL: 1.42E8 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 202.0600 C 10 H11 O3 Na -0.3632 ppm 100

95

90 561.2092 85 C 30 H34 O9 Na -0.4904 ppm 80

75

70

65

60

55

50

45

RelativeAbundance 40

35

30 381.1306 25 C 20 H22 O6 Na -0.8012 ppm 20

15

10

5

0 100 150 200 250 300 350 400 450 500 550 600 c) m/z

Figure 2.19. HCD/MS/MS spectra of G(4-O-a)G(b-O-4)G: a) [M-H]-; b) [M+Li]+; c)

[M+Na]+.

70

lithiated and sodiated ions are very similar with that of deprotonated ions. However, there are no loss of 18 Da and 30 Da observed in these two HCD/MS/MS spectra.

HCD/MS/MS spectra of G(b-O-4)G(b-5)G

[M-H]-

Compared with G(4-O-a)G(b-O-4)G, G(b-O-4)G(b-5)G is more stable when subjected to fragmentation. The tandem spectrum is also more complicated than that of

G(4-O-a)G(b-O-4)G. At the same collision energy, shown in Figure 2.20.a, the base peak

- is [M-H] , m/z 553.2074(C30H33O10, Δppm= -1.06, 100%). m/z 195.0661(C10H11O4,

Δppm= -1.0846, 82.36%), which indicates the existing of b-O-4 at phenolic end group of guaiacyl unit. m/z 339.1232 (C20H19O5, Δppm= -1.85, 8.27%), m/z 327.1234(C19H19O5,

Δppm= -1.17, 28.94%) are two characteristic fragment ions corresponding to the presence of G(b-5)G linkage30. The common fragmentation ions with loss of 18 Da and

30 Da are also observed which indicates the presence of aliphatic alcohol groups as discussed previously.

[M+Li]+ and [M+Na]+

Lithiated and sodiated of G(b-O-4)G(b-5)G generally are more difficult to be fragmented, thus they require higher collision energy than that of a, b-diaryl trimer. As showed in Table 2.3, fragmentation ions of lithiated and sodiated ion overlap with each other pretty well. Not only fragmentation ions of alkaline cation adduct ions are obtained, the characteristic fragmentation ions are also observed in the HCD/MS/MS of lithiated

71

GDHP_b5_trimer_021915_NEG_rep1 2/19/2015 10:30:34 AM GDHP_beta5 trimer_021915_NEG_rep1 19ug/ml GDHP_beta5_trimer in 0.5mM NH4OH/ACN=1:1, rep#1 GDHP_b5_trimer_021915_NEG_rep1 #127 RT: 3.33 AV: 1 NL: 4.77E6 T: FTMS - p ESI Full ms2 [email protected] [100.00-600.00] 553.2073 C 30 H33 O10 -1.0556 ppm 100

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40

35

30

25 301.1078 195.0660 505.1866 C 17 H17 O5 C H O C 10 H11 O4 29 29 8 20 -1.1917 ppm -1.3193 ppm -0.4694 ppm 15 339.1232 523.1969 C 20 H19 O5 165.0555 C 29 H31 O9 10 -1.8519 ppm C 9 H9 O3 -0.8722 ppm 5 -1.6178 ppm

0 100 150 200 250 300 350 400 450 500 550 600 m/z a) GDHP_b5_trimer_021915_Li_rep1 2/19/2015 11:51:00 AM GDHP_beta5 trimer_021915_Li_rep1 19ug/ml GDHP_beta5_trimer in 0.5mM LiCl/ACN=1:1, rep#1 GDHP_b5_trimer_021915_Li_rep1 #189 RT: 4.96 AV: 1 NL: 1.64E7 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 561.2308 C 30 H34 O10 Li 0.2161 ppm 100

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40

35

30

25 543.2205 309.1309 C 30 H32 O9 Li 20 C H O Li 17 18 5 0.6830 ppm -0.0639 ppm 15 259.1153 327.1415 C H O Li C 17 H20 O6 Li 513.2099 10 143.0680 13 16 5 C 8 H8 O2 Li 203.0891 0.3738 ppm 0.2024 ppm C 29 H30 O8 Li C H O Li 5 0.7558 ppm 10 12 4 0.7338 ppm 0.6277 ppm 0 100 150 200 250 300 350 400 450 500 550 600 m/z b)GDHP_b5_trimer_021915_Na_rep3 2/19/2015 1:06:32 PM GDHP_beta5 trimer_021915_Na_rep3 19ug/ml GDHP_beta5_trimer in 0.5mM NaCl/ACN=1:1, rep#3 GDHP_b5_trimer_021915_Na_rep3 #131 RT: 3.43 AV: 1 NL: 5.03E6 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 577.2047 C 30 H34 O10 Na 0.4029 ppm 100

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40 325.1047 C 17 H18 O5 Na 35 0.0937 ppm

30

25 350.1125 C 19 H19 O5 Na 20 0.1384 ppm 202.0602 547.1941 294.0863 380.1231 15 C 10 H11 O3 Na C 20 H21 O6 Na C 29 H32 O9 Na C 16 H15 O4 Na 0.6185 ppm 0.2733 ppm 0.4352 ppm 10 0.0614 ppm 146.0339 175.0367 245.0785 529.1836 C H O Na C H O Na C H O Na 5 C 7 H7 O2 Na 8 8 3 12 14 4 29 30 8 0.4387 ppm 0.7544 ppm 0.4361 ppm 0.5759 ppm 0 100 150 200 250 300 350 400 450 500 550 600 c) m/z

Figure 2.20. HCD/MS/MS spectra of G(b-O-4)G(b-5)G: a) [M-H]-; b) [M+Li]+; c)

[M+Na]+.

72

Table 2.3. List of observed fragment ions for G(b-O-4)G(b-5)G and their chemical formula based on high resolution MS result of [M+Li]+ and [M+Na]+

Observed Ion [X+Li] [X+Na]

C30 H34 O10 561.2308(100%) 577.2047(100%)

C30 H32 O9 543.2201(16.99%) 559.1942(6.68%)

C29 H32 O9 531.2205(14.29%) 547.1941(10.44%)

C29 H30 O8 513.2099(4.49%) 529.1836(1.78%)

C20 H21 O6 - 380.1231(10.14%)

C20 H19 O5 346.1388(3.56%) 362.1125(8.92)

C19 H19 O5 - 350.1125(6.92%)

C17 H20 O6 327.1415(7.29%) -

C17 H18 O5 309.1309(15.99%) 325.1047(33.94%)

C16 H15 O4 - 294.0863(8.92%)

C13 H16 O5 259.1153(6.92%) 275.089(3.59%)

C10 H12 O4 203.0891(2.06%) -

C10 H11 O3 186.0864(1.93%) 202.0602(11.04%)

C8 H8 O3 - 175.0367(1.89%)

C8 H8 O2 143.068(4.31%) -

73

and sodiated cations, which allowed identification and elucidation of detailed structure.

Similar characteristic fragment ions are obtained in the negative ion mode for structure identification31.

HCD/MS/MS spectra of G(b-O-4)G(b-b)G

[M-H]-

Morreel, K et al.30 has studied the fragmenation pattern of deprotonated G(b-O-

4)S(b-b)G, which it has a syringyl unit in the molecules. Again, the major fragment ions of deprotonated G(b-O-4)G(b-b)G follow the similar fragmentation pattern with their study. Because the HCD/MS/MS data are obtained in the HCD cell, there may have difference in the relative abundance of fragment ions compared with CID/MS/MS.

In Figure 2.21.a, which is the HCD/MS/MS spectrum of deprotonated ion [M-H]-,

- the base peak is [M-H] , m/z 553.2074(C30H33O10, Δppm= -0.9453, 100%),

503.1867(C29H29O8, Δppm= -.2277, 32.22%), the signature fragment ions are 357.1339

(C20H21O6, Δppm= -1.18, 11.33%), 343.1182(C19H19O6, Δppm= -1.42, 0.26%),

327.1234(C19H19O5, Δppm= -1.17, 0.92%) and 195.066(C10H11O4, Δppm= -1.4, 29.07%), which again indicates the existing of b-O-4 at phenolic end group.

[M+Li]+ and [M+Na]+

Similar to G(b-O-4)G(b-5)G trimer, the fragmentation of lithiated and sodiated

G(b-O-4)G(b-b)G ions also require higher collision energy than that of G(4-O-a)G(b-O-

4)G. Table 2.4 shows the HCD/MS/MS comparison of [M+Li]+ and [M+Na]+ of G(b-O-

4)G(b-b)G . Different with G(b-O-4)G(b-5)G trimer, HCD/MS/MS spectrum of

74

GDHP_bb_trimer_021915_NEG_rep1 2/19/2015 1:59:47 PM GDHP_beta_beta trimer_021915_NEG_rep1 15ug/ml GDHP_beta_beta_trimer in 0.5mM NH4OH/ACN=1:1, rep#1 GDHP_bb_trimer_021915_NEG_rep1 #132 RT: 3.45 AV: 1 NL: 6.51E6 T: FTMS - p ESI Full ms2 [email protected] [100.00-600.00] 553.2074 C 30 H33 O10 -0.9453 ppm 100

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40 505.1867 C 29 H29 O8 35 195.0660 -0.2277 ppm C 10 H11 O4 30 -1.3975 ppm

25

20 357.1339 165.0555 15 C 20 H21 O6 C 9 H9 O3 -1.1760 ppm 10 -1.5254 ppm 150.0321 535.1973 5 C 8 H6 O3 C 30 H31 O9 -1.1225 ppm -0.1684 ppm 0 100 150 200 250 300 350 400 450 500 550 600 m/z a) GDHP_bb_trimer_021915_Li_rep1 2/19/2015 3:21:49 PM GDHP_beta_beta trimer_021915_Li_rep1 15ug/ml GDHP_beta_beta_trimer in 0.5mM LiCl/ACN=1:1, rep#1 GDHP_bb_trimer_021915_Li_rep1 #143 RT: 3.75 AV: 1 NL: 3.00E7 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 561.2308 C 30 H34 O10 Li 0.2161 ppm 100

95

90

85

80

75

70

65

60

55

50

45 RelativeAbundance 40 513.2098 C H O Li 35 29 30 8 0.4960 ppm 30

25

20 203.0891 C H O Li 15 10 12 4 0.4023 ppm 365.1572 C 20 H22 O6 Li 10 169.0836 131.0679 309.1308 0.1963 ppm 525.2097 C H O Li C 10 H10 O2 Li 7 8 2 C 17 H18 O5 Li C H O Li 5 0.3108 ppm 30 30 8 0.0100 ppm -0.1627 ppm 0.2522 ppm 0 100 150 200 250 300 350 400 450 500 550 600 m/z b)GDHP_bb_trimer_021915_Na_rep2 2/19/2015 4:16:15 PM GDHP_beta_beta trimer_021915_Na_rep2 15ug/ml GDHP_beta_beta_trimer in 0.5mM NaCl/ACN=1:1, rep#2 GDHP_bb_trimer_021915_Na_rep2 #140 RT: 3.67 AV: 1 NL: 1.81E7 T: FTMS + p ESI Full ms2 [email protected] [100.00-600.00] 577.2049 C 30 H34 O10 Na = 577.2044 0.8259 ppm 100

95

90

85

80

75

70

65

60

55

50

45

RelativeAbundance 40

35

30

25

20 202.0603 380.1233 C 10 H11 O3 Na = 202.0600 15 C 20 H21 O6 Na = 380.1230 1.1471 ppm 0.5945 ppm 10

5

0 100 150 200 250 300 350 400 450 500 550 600 c) m/z

Figure 2.21. HCD/MS/MS spectra of G(b-O-4)G(b-b)G: a) [M-H]-; b) [M+Li]+; c)

[M+Na]+.

75

Table 2.4. List of observed fragment ions for G(b-O-4)G(b-b) and their chemical formula based on high resolution MS result of [M+Li]+ and [M+Na]+

Observed ions [X+Li] [X+Na]

C30 H34 O10 561.2308(100%) 577.2049(100%)

C29 H30 O8 513.2098(33.88%)

C20 H24 O7 383.1677(6.44%)

C20 H22 O6 365.1572(7.29%)

C20 H21 O6 380.1233(10.33%)

C19 H18 O6 349.1258(2.1) 365.0999(2.65%)

C17 H18 O5 309.1308(1.82%)

C12 H12 O3 221.0997(3.31%)

C10 H12 O4 203.0891(12.15%)

C10 H11 O3 186.0864(6.02%) 202.0603(11.5%)

76

lithiated ion contains more characteristic fragment ions than that of sodiated ion. Almost all the ions observed in MS/MS of [M+Na]+ are also observed in [M+Li]+, shown in

Figure 2.21.b-c.

Comparison of Fragmentation of three trilignols

Fragmentation in HCD/MS/MS experiments takes much higher collision energy for alkali metal adducts than that of ammonium adduct ions in the positive ion mode. The

+ + HCD/MS/MS of [M+NH4] contains predominant [M+H] fragmentation ions, as ammonium adduct is easily fragmented to H+ and NH3 leaves as neutral. Although alkali metal adducts require higher collision energy to obtain informative fragmentation ion, the

HCD/MS/MS spectrum is relatively less complicated to interpret. Compared with

HCD/MS/MS spectra of three trilignols, the lithiated and sodiated trimers have generated characteristic fragmentation ions and they match with the fragmentation pattern obtained in the negative ion mode (both [M-H]- and [M+Cl]-). The relative abundance of fragment ions is quite different from that of deprotonated ion. The HCD/MS/MS results of the trilignols cation adducts indicates that the fragmentation pathway varies with the species of cation as well as the structures of trilignols. By interpreting the fragmentation pattern of the lithiated and sodiated G(b-O-4)G(b-5)G and G(b-O-4)G(b-b)G adducts, one can also be able to differentiate these two isomers.

2.4 Conclusion

Because of the heterogeneity of lignin, an efficient ionization method for mass spectrometric analysis is needed to improve the sensitivity as well as obtaining structural

77

information of lignin. By studying trilignols as lignin model compounds, it has been demonstrated that how the characteristic structural has an effect on traditional negative ion mode ESI. In this study, it is shown that the ion response in the positive ion mode forming various cations adducts is overall more responsive than that in the negative mode. In general, Li+ and Na+ adduct ions give the highest ion response across three different trilignols, they can provide molecule weight information and be less effected by in-source fragmentation during the ionization process. Not only are [M+Li]+ and

[M+Na]+ fragment ions successfully observed in HCD/MS/MS experiments, they also provide informative structural and sequence information for potential structural elucidation application. Cationization study of these model compounds containing phenolic alcohol may be extended for ionization of lignin oligomers whose phenol groups are occupied resulting difficulties in deprotonation. Application of cationization shows promise with good ionization response as well as structurally informative HCD/MS/MS data on lignin oligomers that contain various chemical linkages.

78

Chapter 3

Synthesis of Trimeric Lignin Model Compound

3.1 Introduction

3.1.1 Lignin model compounds

Lignin is the second most abundant biopolymer on the earth. As described in the previous chapter, lignin has very complex polymeric structure due to: 1) heterogeneous biopolymer consisting of three phenylpropanoid monomeric units: 4-hydroxy-cinnamyl alcohol, coniferyl alcohol and sinapyl alcohol; 2) biosynthesis through oxidative radical coupling reaction resulting in non-specific pattern of bonding types (b-O-4, b-5, b-1, b-b and etc.); and 3) intra-molecular bonding with carbohydrates. For pulp/paper and cellulosic biofuel industries, cellulose is the component of interest. As an unwanted pretreatment byproduct, lignin is often burned for fuel. The complexity of the isolated lignin itself makes it very difficult to be commercialized as a highly valuable product.

However, lignin has drawn a lot of attention because of the valuable aromaticity of lignin as a potential alternative resource of aromatic rings other than non-renewable fossil fuels.

This potential for lignin to replace petroleum as a source of aromatics motivates a growing interest in understanding the fundamental chemistry of lignin degradation processes in order to obtain the highly valuable lignin-derived products.

As discussed previously, b-O-4 ether linkage is considered to be the most abundant linkage in lignin comprising from 40-60% of lignin depending on the type of plants. Compared with other C-C condensed linkage, b-O-4 is more labile, which makes it a reactive target site in lignin degradation in many degradation strategies.

79

Dimeric model compounds containing b-O-4 have been widely studied, but these cannot properly represent the polymeric property of lignin. Dehydrogenation polymer

(DHP) as models have been discussed in Chapter two of this research program as a more

“controllable” model but its structures are still too complicated to be used as a proper model to study lignin degradation process. Several groups have synthesized oligomeric model compounds containing b-O-441-45. Some of these model compounds do not contain phenoxy functionality; some of them do not have the end group that maintains the unsaturated side chain. Thus, the objective of this chapter is to develop a synthetic method so that one can obtain oligomeric models that would contain important functionalities (phenolic alcohol end group, b-O-4 ether linkage as well as a, b- unsaturated side chain with g-OH end group). Successful synthesis of a model compound with the previously mentioned features would have a significant impact on understanding the reactivity of the lignin model compounds towards degradation methods.

The strategy used to generate b-O-4 ether linkage is through an aldol-type reaction44. Ciofi-Baffoni et al. used lithium diisopropylamide as strong base to generate a carbanion as nucleophile addition to the aldehyde group. However, this condition will not work if one wants to introduce a, b-unsaturated side chain to the oligomeric models. The aim of this work is to develop the new synthesis strategy to prepare b-O-4 oligomeric model compounds that contain the most important functional groups: phenolic alcohol, b-

O-4 and unsaturated side chain, which would be a good model for studying the reactivity and chemistry of lignin degradation process. The reaction synthesis route designed in this chapter is shown in Figure 3.1.

80

O OH O O O O O O Si O a Si O O + O O O O

O O O Chemical Formula: C H O Si 4 17 28 3 3 Chemical Formula: C14H18O6 Exact Mass: 308.1808 Exact Mass: 282.1103 7 Chemical Formula: C31H46O9Si Exact Mass: 590.2911

Si OH O O O

O O

b Si O c Si O O O O O O O

O O

Chemical Formula: C H O Si 8 29 42 8 9 Chemical Formula: C35H56O8Si2 Exact Mass: 546.2649 Exact Mass: 660.3514

Si O O O

O O

O d Si OH O + O OH OH

O O O O 6 Chemical Formula: C16H20O6 Exact Mass: 308.1260 OH O

12 Chemical Formula: C45H70O11Si2 Exact Mass: 842.4457

e

Si O O

O O

Si OH O O O O

O O O

O O

11 Chemical Formula: C51H76O14Si2 Exact Mass: 968.4774

f

OH O

O

OH HO OH OH

O O

OH O

13 Chemical Formula: C30H36O11 Exact Mass: 572.2258

Figure 3.1. Synthetic route of target trimer G(b-O-4)G(b-O-4)G

81

3.1.2 Condensation Reaction of Aldehydes and Ketones

The idea of generating b-O-4 linkage is achieved by the reduction of a b-hydroxy ketone, where the b-hydroxy ketone comes from an aldol reaction. In the carbonyl condensation reaction, the carbonyl compounds behave both as nucleophile and as electrophile. In the presence of base, one carbonyl with a-hydrogen is converted into its enolate ion, which acts as nucleophile to attack the second carbonyl compound (the electrophile). The general condensation reaction for aldehydes and ketones with a- hydrogen is called aldol reaction. The aldol reaction is a reversible reaction (mechanism depicted in Figure 3.2). The equilibrium of reaction depends on both reaction conditions and the structure of substrates. A condensation product is generally favored with no a substituent (RCH2CHO). A mixture of products will be generated from an aldol reaction of two similar aldehydes or ketones. However, if one of the carbonyl compounds contains no a-hydrogen but contains a non-steric hindered carbonyl group, such as benzaldehyde or formaldehyde, which would play a role as good electrophilic acceptor.

82

O O O Base:- C R1 C H C R1 C R3 C R3 C R3

R1 R2 R2 R2

O

C

R4 H

H2O

H OH O

C C

R4 C R3

R2 R1

Figure 3.2. General mechanism of a typical aldol reaction

83

3.2 Materials and Methods

Organic solvents were purchased from Thermo Fisher Scientific (Waltham, MA,

USA) otherwise noticed. All the chemicals were purchased from VWR (Radnor, PA,

USA) otherwise noticed. Silica gel column chromatography was used for purification.

Synthesized compounds were characterized by thin layer chromatography (TLC), 1H nuclear magnetic resonance (NMR), gas-chromatography-mass spectrometry (GC-MS) and high-resolution accurate mass (HRAM) mass spectrometry (MS). GC/MS spectra were acquired on Shimadzu QP5000 with DB5-MS capillary column with helium (He) as carrier gas. For monomeric compounds, following GC thermal gradient was used: initial temperature, 100˚C, held for 5 min, increased from 100˚C to 280˚C at rate of 12˚C/min, held for 10 min at 280˚C. GC samples were derivatized with BSTFA prior to analysis by

GC/MS. NMR spectra were acquired on a Varian Model NOVA400 (400 MHz) spectrometer. Acetone-d6 was used as NMR solvent. High-resolution mass spectra were acquired by Q Exactive Orbitrap mass spectrometer (Thermo Scientific, San Jose, CA,

USA) with electrospray ionization (ESI) in positive ion mode and negative ion mode.

Positive ion mode: spray voltage: 3.8 kV, capillary temperature: 225˚C, sheath gas: 2.0, probe heater temperature: 45˚C. Negative ion mode: spray voltage: 3.2 kV, capillary temperature: 320˚C, sheath gas: 10.0, probe heater temperature: 30˚C, infusion flow rate for both modes: 5 µl/min. The syntheses of all the building block are shown in Figure 3.3

Synthesis of Compound 2

To a solution of vanillin (5 g, 32.9 mmol) in 50 ml acetone, 1.5 equiv. of ethyl bromoacetate (1M, 5.47 ml) and 1.5 equiv. of potassium carbonate (K2CO3) (6.8 g) were

84

added at room temperature. The reaction mixture was refluxed for 3 hours. TLC was used for monitoring the reaction progress. The reaction mixture was cooled to room temperature, and filtered. The filtrate was evaporated under vacuum to give clear oil residue. The residue was crystalized from ethanol to give pure 2 (6.88 g, yield 88%). 1H

NMR (400 MHz, Acetone-d6) 9.87 (s, 1H), 7.59 – 7.36 (m, 2H), 7.08 (d, J = 8.2 Hz,

1H), 4.87 (s, 2H), 4.22 (q, J = 7.1 Hz, 2H), 3.92 (s, 3H), 1.25 (t, J = 7.1 Hz, 3H).

Synthesis of Compound 3

To a solution of 2 (5 g, 21 mmol) in 125 ml toluene, 10 equiv. of ethylene glycol

(11.7 ml) and catalytic amount of p-toluenesulfonic acid (TsOH) were added at room temperature. The reaction mixture was refluxed for 4 hours using a Dean-Stark apparatus to remove water. Thin layer chromatography (TLC) was used to monitor the reaction progress. The reaction mixture was cooled to room temperature, quenched with 5ml

K2CO3 solution, extracted with ethyl acetate (EtOAc) (100 ml x 3), washed with brine and the organic layer was dried over Na2SO4, concentrated in vacuo to give light yellow residue. The residue was crystalized from ethanol to give pure 3 (5.5 g, yield 93%). 1H

NMR (400 MHz, Acetone-d6) δ 7.08 (d, J = 1.9 Hz, 1H), 6.97 (ddd, J = 8.2, 1.9, 0.5 Hz,

1H), 6.90 (d, J = 8.2 Hz, 1H), 5.67 (s, 1H), 4.71 (s, 2H), 4.20 (q, J = 7.1 Hz, 2H), 4.09 –

4.05 (m, 2H), 3.97 – 3.94 (m, 2H), 3.84 (s, 3H), 1.24 (t, J = 7.1 Hz, 3H).

Synthesis of Compound 4

To a solution of vanillin 1 (5 g, 32.9 mmol) and imidazole (4.4 equiv., 9.8 g) in

100 ml CH2Cl2, TipsCl (1.5 equiv. 10.5 ml) was added slowly. The reaction was stirred

85

O O

O Si

O O

OH 4 Chemical Formula: C17H28O3Si Chemical Formula: C8H8O3 Exact Mass: 308.1808 Exact Mass: 152.0473

O O O

O O O

O O O O O O O OH 2 Chemical Formula: C12H14O5 3 Chemical Formula: C14H18O6 Chemical Formula: C8H8O3 Exact Mass: 238.0841 Exact Mass: 282.1103 Exact Mass: 152.0473

O O O

O OH O

O HO HO O

O O O O

5 Chemical Formula: C12H14O4 6 Chemical Formula: C10H10O4 Exact Mass: 222.0892 Chemical Formula: C16H20O6 Exact Mass: 194.0579 Exact Mass: 308.1260

Figure 3.3 The synthesis of the three building blocks for the synthesis of target trimer.

86

overnight, then extracted with CH2Cl2 (100 ml x 3), dried in vacuo to give colorless oil residue to white amorphous solid. The residue was purified with silica gel column to remove unreacted vanillin, mobile phase: EtOAc: Hexane= 1:546. 1H NMR (400 MHz,

Acetone-d6) 9.88 (s, 1H), 7.53 – 7.41 (m, 2H), 7.08 (d, J = 8.5 Hz, 1H), 3.92 (s, 3H),

1.31 (dq, J = 14.2, 7.4 Hz, 3H), 1.11 (d, J = 7.4 Hz, 18H). High-Resolution MS: [M+H]+

= 309.1877, Elemental composition: C17H29O3Si, calculated: C17H29O3Si, m/z 309.1886,

∆ppm= -0.9820 ppm.

Synthesis of Compound 5

38 Compound 5 was synthesized according to the reference . Briefly, 5 g of ferulic acid was dissolved in 50 ml ethanol, 2.5 ml acetyl bromide was added dropwisely. The reaction mixture was stirred overnight, extracted with 50 ml potassium carbonate and ethyl acetate (50 ml x 3). The organic layer was dried over Na2SO4 and concentrated in vacuo to give light yellow oil. The solid was recrystallized in -20˚C to give light yellow

1 solid (5.2 g, yield 91%). H NMR (400 MHz, Acetone-d6) 8.12 (s, 1H), 7.59 (d, J =

15.9 Hz, 1H), 7.33 (d, J = 2.0 Hz, 1H), 7.14 (ddd, J = 8.1, 2.0, 0.5 Hz, 1H), 6.87 (d, J =

8.2 Hz, 1H), 6.39 (d, J = 15.9 Hz, 1H), 4.19 (q, J = 7.1 Hz, 2H), 3.92 (s, 3H), 1.27 (t, J =

7.1 Hz, 3H).

Synthesis of Compound 6

To a solution of 5 (10 g, 51.5 mmol) in 200 ml acetone, 1.5 equiv. of ethyl bromoacetate (1M in THF, 8.57 ml) and 1.5 equiv. of potassium carbonate (K2CO3) (10.6 g) were added at room temperature. The reaction mixture was refluxed for 4 hours,

87

monitored using TLC, cooled to room temperature, and extracted with CH2Cl2 (100 ml x

3). The organic layer was dried with Na2SO4 and concentrated in vacuo to give light orange solid. The solid is recrystallized using THF in -20˚C to give white solids (6.3 g,

1 yield 90.7%). H NMR (400 MHz, Acetone-d6) NMR (400 MHJ = 15.9 Hz, 1H), 7.35 (d,

J = 2.1 Hz, 1H), 7.20 – 7.12 (m, 1H), 6.93 (d, J = 8.3 Hz, 1H), 6.44 (d, J = 15.9 Hz, 1H),

4.77 (s, 2H), 4.20 (qd, J = 7.1, 5.3 Hz, 4H), 3.91 (s, 3H), 1.30 – 1.26 (m, 3H), 1.26 – 1.22

+ (m, 3H). High-Resolution MS: [M+H] = 309.1333, Elemental composition: C16H21O6, calculated: C16H21O6, m/z 309.1338, ∆ppm= 0.2111 ppm.

Synthesis of Compound 7

To a solution of sodium bis(trimethylsilyl)amide (1.3 equiv. 6.9 g) in 55 ml anhydrous THF at -78˚C under N2, a solution of 3 in 18 ml anhydrous THF (7.73 g, 1.0 equiv) was added drop-wisely over 1 hour at -78˚C. The reaction mixture was kept stirring for another 1hr at -78˚C, then changed to -45˚C. A solution of 4 in 20 ml anhydrous THF (10.13 g, 1.2 equiv) was added over 1hr at -45˚C. After stirring for another 1hr, the reaction was quenched and extracted by addition of EtOAc (100 ml x 3) and K2CO3 solution. The organic layer was filtered through Na2SO4 and concentrated in vacuo to give light yellow oil residue (23 g, yield 142%). The crude oil residue was used

+ without further purification. HRMS: [M+NH4] = 608.3256, Elemental composition:

C31H50O9NSi, calculated: C31H50O9NSi, m/z 608.3256, ∆ppm=1.027 ppm

Synthesis of Compound 8

To a solution of 7 (crude residue, 23 g, 4.0 equiv.) in 150 ml Acetone: H2O (4:1, v: v), pyridinium p-toluenesulfonate (PPTS) (4.9 g, 1.0 equiv.) was added. The reaction

88

mixture was refluxed for 4 hours and cooled to room temperature. Acetone was removed in vacuo. The residue was extracted with EtOAc (100 ml x3), washed with K2CO3. The organic layer was dried with Na2SO4 and concentrated under vacuum. The crude oil residue was used without further purification (7.21 g, crude yield 64%). HRMS:

+ [M+NH4] = 564.2992, Elemental composition: C29H46O8NSi, calculated: C29H46O8NSi, m/z 564.2993, ∆ppm=0.832 ppm.

Synthesis of Compound 9

To a solution of 8 (5.7 g, 1.0 equiv) in 80 ml CH2Cl2, add imidazole (3.1 g, 4.4 equiv) and t-butyldimethylsilyl chloride (TBDMSCl) (3 g, 2.0 equiv). The reaction mixture was stirred overnight at room temperature. The reaction was quenched with 20 ml H2O and extracted with EtOAc (50 ml x 3), washed with brine. The organic layer was concentrated in vacuo to give oil residue, purified by flash chromatography using silica gel, mobile phase: EtOAc/Hexane=1:8 give pure 9 consisted of the two diastereomers (3

1 g, 44%). White crystal formed in 4˚C. H NMR (400 MHz, Acetone-d6) 9.83 (s, 1H),

7.43 – 7.38 (m, 2H), 7.20 (d, J = 2.0 Hz, 1H), 6.97 (s, 1H), 6.92 (d, J = 8.7 Hz, 1H), 6.86

(d, J = 8.1 Hz, 1H), 5.14 (d, J = 6.7 Hz, 1H), 4.81 (d, J = 6.7 Hz, 1H), 4.21 – 4.16 (m,

2H), 3.87 (s, 3H), 3.85 (s, 3H), 1.24 (t, J = 7.1 Hz, 6H), 1.08 (d, J = 7.3 Hz, 18H), 0.88 (s,

+ 9H), 0.10 (s, 3H), -0.09 (s, 3H). HRMS: [M+NH4] = 678.3861, Elemental composition:

C35H60O8NSi2, calculated: C35H60O8NSi2, m/z 678.3857, ∆ppm=1.346 ppm.

Synthesis of Compound 10

Solution of 6 (932 mg, 2.0 equiv in 12 ml anhydrous THF) was added dropwisely to a solution of NaHMDS (751 mg, 2.6 equiv in 60ml anhydrous THF) at -78˚C under N2.

89

After 30 min at -78˚C, a solution of 9 (1 g, 1.0 equiv in 20 ml anhydrous THF) was added slowly. The reaction was quenched with saturated NH4Cl after 15min, extracted with

EtOAc (100 ml x 3), washed with brine. The organic layer was dried over Na2SO4 and concentrated under vacuum to give yellow color residue. Compound 10 was purified by flash chromatography using silica gel, gradient mobile phase: EtOAc/Hexane 1:9 to 1:3.

The purified 10 which consisted of a mixture of diastereomers were further separated

+ (440mg, yield 30%). HRMS: [M+NH4] = 986.5114, Elemental composition:

C51H80O14NSi2, calculated: C51H80O14NSi2, m/z 986.5117, ∆ppm=0.481 ppm.

Synthesis of Compound 12

To a solution of 10 (700 mg, 1.0 equiv) in 45 ml CH2Cl2 at -78˚C under nitrogen, a solution of diisobutylaluminum hydride (DIBAl-H) (13.7 ml, 1M in THF, 19.0 equiv) was added dropwisely over 20 min. After 1.5 hours, the reaction mixture was brought to

0˚C in ice-water bath, quenched by addition of 8 ml MeOH, extracted with EtOAc (100 ml x 3), washed with brine and dried through Na2SO4. The organic layer was concentrated to give viscous oil (596 mg, crude oil yield 98%). The crude residue was

+ used without further purification. HRMS: [M+NH4] = 860.4797, Elemental composition:

C45H74O11NSi2, calculated: C45H74O11NSi2, m/z 860.4800, ∆ppm=0.219 ppm.

Synthesis of Compound 13

To a solution of 12 (crude oil, about 500 mg, 1.0 equiv) in 80 ml THF, 5 equiv of

TBAF (3 ml, 1M in THF) was added at 0˚C for 5 min, then switched to room temperature.

The reaction is monitor with TLC. After 4 hours, THF was evaporated under vacuum.

90

Purification by flash chromatography, gradient mobile phase was used. 1) EtOAc with

5% TEA; 2) Acetone: EtOAc =1:9; and 3) Acetone: EtOAc= 1:4. (96.3 mg, yield 28.4%).

1H NMR (400 MHz, Acetone-d6) 7.12 – 7.01 (m, 3H), 6.90 – 6.83 (m, 5H), 6.77 –

6.72 (m, 1H), 6.56 – 6.39 (m, 1H), 6.24 (dt, J = 15.8, 5.6 Hz, 1H), 4.94 – 4.79 (m, 2H),

4.38 – 4.21 (m, 2H), 4.16 (d, J = 3.6 Hz, 1H), 3.92 – 3.75 (m, 11H), 3.74 – 3.64 (m, 2H).

+ HRMS: [M+NH4] = 590.2598, Elemental composition: C30H40O11N, calculated:

C30H40O11N, m/z 590.2601, ∆ppm=0.411 ppm.

3.3 Results and Discussion

3.3.1 Synthesis of Building Blocks 3, 4 and 6

Compound 3 was obtained in high overall yield (81%) from vanillin in two straightforward steps: nucleophilic substitution with ethyl bromoacetate and continually protection of aldehyde group with ethylene glycol.

Compound 4 is to protect the free phenolic alcohol of vanillin. Traditional benzyl group is not used in this step because the terminating group 6 contains unsaturated side chain, which would also be reduced during the hydrogenolysis of the benzyl group with

Pd/C and H2. Silyl ether was found to be able to overcome this problem with reasonable yield and also it cut additional steps out in removing the protection group in later steps.

Triisopropylsilyl (TIPS) was chosen to protect phenoxy group because it shows high stability in survival of aldol reaction compared with other silyl ether.

Compound 6 was obtained in overall satisfied yield from ferulic acid in two consecutive steps: first, esterification of ferulic acid with acetyl bromide to obtain

91

compound 5, the detailed information can be found in the previous report38. Following step was the substitution of compound 5 with ethyl bromoacetate

3.3.2 Synthesis of The Dimeric Intermediate

As discussed previously, a dimeric intermediate containing b-hydroxy ketone was synthesized through aldol reaction. In the presence of strong base, the acidic a-hydrogen in compound 3 was removed to generate the enolate ion as a nucleophile. Non- nucleophilic base was used so that the base will not compete with enolate ions in the aldol reaction as a nucleophile. Bases such as lithium diisopropylamide (LDA) or sodium bis(trimethylsilyl)amide (NaHMDS) with bulky substituents are poor nucleophile.

Addition of the sodium enolate of compound 3 to compound 4 at -78˚C in dry THF results in a mixture of diastereomers 7. Although TIPS group showed much better stability than trimethylsilyl (TMS) group, small amount of vanillin was observed resulting from loss of TIPS- from compound 4 in the presence of NaHMDS. This is due to the presence of H2O with strong base in the reaction; even amount of water was only at trace level. To avoid the deprotection of compound 4, excess amount of NaHMDS was used to consume the residual water content. By this way, generation of vanillin during this step was successfully minimized. As no loss of TIPS from dimeric product 7 was observed, it indicated that TIPS on compound 7 was more stable than that on compound 4.

The formation of b-hydroxyl group in compound 7 disrupted the conjugation system in

TIPS-vanillin (compound 4) with aldehyde group contributing extra conjugation to the aromatic ring. As a result, vanillin may have more acid phenoxy group than compound 7, which leads to a less stable TIPS.

92

The mixture of diastereomers 7 was continually treated with PPTS to remove the

1,3-dioxolane protecting group, which led to compound 8. Bis(tert-butyldimethylsilyl)

(TBDMS) ether was used directly to protect a-hydroxyl group. TBDMS provided a satisfied solution to protect b-hydroxyl group compared with acetal 44 by reducing the number of steps that would be needed for protection and deprotection. The protected dimeric intermediate 9 formed nice white crystal in the mixtures of hexane and ethyl acetate. The X-ray crystal structure of compound 9 was collected by Dr. Sean Parkin from department of Chemistry at University of Kentucky (shown in Figure 3.4).

3.3.3 Synthesis of The Trimeric Model

Similar with previous dimer synthesis, trimeric compound 13 was synthesized via aldol reaction using dimeric intermediate unit 9 and terminating unit 6. Compound 6 was first treated with strong base to generate enolate ion. LDA is a very common non- nucleophile base used in aldol reaction.

The terminating unit 6 contains a a, b-unsaturated ester group. The electronegative oxygen from carbonyl group causes b carbon to be more electrophilic than a typical alkene carbon and results in a resonance stabilized enolate ion. Because of additional reactivity of 6, the resulting enolate ion can react with different components leading to un-wanted by-product. The possible reaction pathways are shown in Figure 3.5.a.

93

Figure 3.4 X-ray crystal structure of compound 9

94

a)

O O

O O O TBDMS O O O O 6

TIPS O O O O O

O

O 9 O O

O O N 6

b)

O

O

Si O O O O

O

10 Chemical Formula: C29H40O7Si Exact Mass: 528.2543

Figure 3.5. a) Reaction pathway of nucleophilic addition of di-isopropanol amine to compound 6 (purple color); nucleophilic addition within compound 6 (blue color); b)

Possible byproduct 10 resulting from elimination.

95

Due to the steric effect, conjugate addition of enolate ion to b carbon on side chain (Figure 3.5.a, pathway showed in blue trace) is less favored than addition to the aldehyde group on dimer 9 (Figure 3.5.a, pathway showed in red trace). The trimeric product is preferred than a self-conjugation addition of compound 6 (Figure 3.5a. blue color). However, the predominant by-products with addition of diisopropylamide group, compound 14 was observed47 in the coupling reaction. Although LDA is considered to be a non-nucleophilic base, the conjugate addition of diisopropyl amide to a, b-unsaturated carbonyl group of compound 6 occurred resulting in forming predominate by-product of compound 1447 (Figure 3.5a pathway in pink color). Thus, alternative organic bases that contain bulkier group are considered for this reaction. NaHMDS turned out to be a satisfied non-nucleophilic base to overcome this problem. Due to the steric effect of the bulky bis(trimethylsilyl)amide (HMDS) group, there was no conjugate addition of

HMDS to b-carbon observed.

No loss of TIPS from compound 9 was observed in the later nucleophilic addition with compound 6. As discussed previously, that the disrupting conjugation system of compound 9 led to a more stable TIPS protection. The residue from this step required no purification for the use in the following step.

Finer separation of diastereomers can be achieved by using silica gel column. The reduction of the ethyl ester group of compound 10 with diisobutylaluminum hydride

(DIBAL) at -78˚C afforded good yield. The reaction went to completion within an hour.

The reaction temperature and work-up procedure were crucial in obtaining quantitative yield. In this step, undesired by-product 10 formed in higher reaction temperature, which was resulted from the elimination of TBDMS group of dimeric compound 9 in the basic

96

condition (structure shown in Figure 3.5.b). By keeping reaction temperature at -78˚C, one can quantitatively reduce compound 10 in an hour. During work-up process, it was found that the reaction needed to be warmed up near 0˚C before quenching with methanol, otherwise, the target products could convert to other by-products (structures unknown) immediately. The mechanism of this conversion was unclear, but these byproducts have higher molecular weight that target trimer. Since silyl ether (TIPS and TBDMS) was used as protecting group for both phenolic and b-hydroxyl groups, they can be removed under the same condition at the same time with tetrabutylammonium fluoride (TBAF) in THF.

It was important to keep this order of reduction and deprotection in order to obtain better overall yield and simplify the additional purification process. Each procedure involving the synthesis of trimer was pretty clean reaction, and there is only one purification step needed for the target trimeric b-O-4 compound 13. Figure 3.6 shows the mass spectrum of deprotonated compound 13 in the negative ion mode as well as its HCD/MS/MS spectrum.

97

a)D:\Xcalibur\...\GGGOH_02062017 2/6/2017 11:37:55 AM GGGOH 5mM NH4OH/ACN=1:1 GGGOH_02062017 #129-142 RT: 3.72-4.09 AV: 14 NL: 5.21E8 T: FTMS - p ESI Full lock ms [100.00-1500.00] 571.2178 C 30 H35 O 11 -1.2334 ppm 100

95

90

85

80

75

70

65

60

55

50

45 Relative Abundance 40

35

30 283.2636 C 18 H35 O 2 25 135.0450 -2.2872 ppm C 8 H7 O 2 20 -1.0354 ppm

15 157.1230 10 C 9 H17 O 2 -2.5301 ppm 385.1649 593.2002 C H O 1143.4437 5 C 22 H25 O 6 32 33 11 -2.0323 ppm -4.5216 ppm 0 100 200 300 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 m/z

b)D:\Xcalibur\...\GGGOH_02062017 2/6/2017 11:37:55 AM GGGOH 5mM NH4OH/ACN=1:1 GGGOH_02062017 #164-190 RT: 4.71-5.44 AV: 27 NL: 4.65E7 T: FTMS - p ESI Full ms2 [email protected] [50.00-750.00] 195.0656 C 10 H11 O 4 -3.4869 ppm 100

95 179.0709 90 C 10 H11 O 3 -2.6020 ppm 85

391.1387 80 C 20 H23 O 8 -2.9941 ppm 75

70

65

60

55 523.1953 C 29 H31 O 9 343.1176 50 -4.0224 ppm 571.2162 C 19 H19 O 6 C 30 H35 O 11 -3.3500 ppm 45 -3.9237 ppm Relative Abundance 40

35

30 165.0553 C 9 H9 O 3 25 -2.5521 ppm 327.1229 375.1438 C 19 H19 O 5 C H O 20 -2.6124 ppm 20 23 7 -3.0949 ppm 15

10 151.0398 5 C 8 H7 O 3 -1.9518 ppm 0 50 100 150 200 250 300 350 400 450 500 550 600 m/z

Figure 3.6. a) ESI-MS of targeted trimer in negative ion mode, [M-H]-; b) HCD/MS/MS

spectrum of the trimer, [M-H]-.

98

3.4 Conclusion

A trimeric b-O-4 lignin model compound is synthesized in 7 steps from vanillin and ferulic acid, which gives a repeating guaiacyl unit in the trimeric compound. The purpose of this synthesis route is to incorporate additional a, b-unsaturated side chain to b-O-4 lignin. This new trimeric compound includes phenoxy end group, b-O-4 moiety and a, b-unsaturated side chain, which will be an interesting model to study its physical and chemical properties. No previous work has been done in introducing different functionalities including unsaturated side chain in the synthesis of lignin model compound using this strategy. Considering the reactivity of a, b-unsaturated side chain in the terminating group, new organic base and protecting group have been investigated to replace LDA and benzyl bromide, which are commonly used in synthesizing b-O-4 linkage model compounds44-45. In the procedure developed in this chapter, silyl ether as protecting group was used for phenolic and aliphatic alcohol. Silyl ether showed very good stability under conditions containing NaHMDS and DIBAl-H. As both alcohols were protected with silyl ether, all these silyl ethers for different alcohol groups could be removed at the same time, which reduces of the reaction steps. This synthesis procedure is also feasible for preparing higher order oligomers and incorporating different monomeric units, such as syringyl (S) and 4-hydroxyphenyl (H).

99

Chapter 4

Un-Targeted Analysis of Pretreatment Products Of b-O-4 Model Trilignol with

High-Resolution Accurate Mass (HRAM) Mass Spectrometry Coupled with High

Performance Liquid Chromatography (HPLC)

4.1 Introduction

4.1.1 Pretreatment Strategies

Lignocellulosic biomass is quite complex, composed of cellulose, hemicellulose and lignin. As a result, plants are quite resilient to degradation under normal conditions.

Naturally, biomass can undergo decomposition by microbes in the moist conditions, however natural degradations are usually quite slow and may take several years to complete. In order to use biomass in biofuel production, a wide variety of pretreatment techniques have been studied to break the hemicellulose-lignin complex cross-links and increase the rate of biomass-biofuel conversion, shown in Figure 4.14.

There are generally two categories of techniques for conversion of biomass to biofuels: thermochemical conversion and biochemical conversion. Biochemical conversion techniques require much lower temperatures compared with thermochemical conversion. As discussed previously that the natural properties of lignin, such as, aromaticity, extensive carbon-carbon cross-linkage, and structural heterogeneity make lignin the most recalcitrant in protecting biomass from decomposition. As a result, the presence of lignin largely hinders the accessibility of cellulose and other polysaccharides to enzymatic hydrolysis step during the biochemical conversion. The first key step in biochemical conversion is the pretreatment of lignocellulosic biomass. The goal of

100

Figure 4.1. Schematic of pretreatment step of biomass

101

pretreatment is to change the physical and chemical properties of biomass material

(surface area, cellulose crystallinity index, degree of polymerization, lignin content and etc.) to increase the rate of enzymes hydrolysis.

Pretreatment methods can be divided into several groups in general: mechanical

(milling and grinding), physicochemical (steam pretreatment, hydrothermolysis and wet oxidation), chemical (alkali, dilute acid, oxidation and organic solvents), and biological

(fungi)48.

The purpose of physical pretreatments is to reduce the particle size and increase the surface area of biomass. For instance, disk milling/ball milling/grinding usually decrease the size of biomass to 0.2-2 mm size. But the milling process normally requires intensive energy input. Chemical pretreatments offer a wide variety of pretreatment methods and conditions. The pretreatment can be carried out at acidic, neutral or basic conditions. For acidic conditions, such as mineral acids (H2SO4, HCl and HNO3) or organic acids (formic acid, acetic acid and maleic acid), the pretreatment temperature, pressure and concentration of acids largely affect the product compositions. Alkaline pretreatments usually swell and increase the surface area of cellulose, disrupt the crystallinity of cellulose and hydrolyze the lignin-carbohydrate cross-linkage, which leave hemicellulose and cellulose behind. Strong alkali species, such as NaOH and KOH can cleave ester and ether bonds, while weak base like ammonia cleaves only ester bonds.

Table 4.1 is the brief summary of the conditions, advantages and disadvantage of different pretreatment methods4, 48.

102

Table 4.1. Summary of various pretreatment strategies

Pretreatment Chemicals Needed Post pretreatment Post Pretreatment Advantages Disadvantages Solid pretreatment conditions liquid Physical Disk milling NA Whole Biomass NA Milling (10-30mm) No chemicals Poor biomass Grinding, particle required, scalable, conversion, low sugar size (0.2-2µm) no water used hydrolysis rate, energy intensive Process Acidic Dilute Sulfuric acid Dilute sulfuric acid Enriched cellulose Xylose 140-190 ˚C, 0.4-2% Applicable to Require expensive and some sulfuric acid, 1-40 wide variety of haste-alloy reactor, hemicellulose min materials, expensive to recycling Produce water, toxic hydrolyzed xylose degradation product during pretreatment Organic acid Acetic acid, fumaric Enriched cellulose Soluble 130- 190˚C, 50-90 Separation of Expensive to recover 103 acid, formic acid and some hemicellulose mM of organic acid biomass to lignin organic acid, high hemicellulose and lignin and hemicellulose amount of water rich fraction and requires to wash post cellulose rich pretreatment substrate fraction, low pressure Concentrated acid H2SO3, HF, HCl, Condensed lignin Soluble glucose Short reaction time Hydrolysis of Corrosive reagents, H3PO4, HNO3 and xylose or cellulose during Highly energy soluble pretreatment, required for acid cellulose which largely decrease recovery is a precipitate the crystallinity of cellulose to amorphous cellulose, effective on soft wood Acidic organosolv Methanol, ethanol, Enriched cellulose Lignin and Acetone- water (1:1 Fractionation of High risk of dealing acetone, ethylene and majority of the some soluble ratio) pretreatment at lignin stream, with high-pressure glycol and hemicellulose hemicellulose 195˚C, pH 2.0, high cellulose reactor, flammable tetrahydrofurfuryl pressure required digestibility and volatile organic alcohol, water increases due to solvents

mixture, organic or removal of lignin inorganic acid (e.g., HCl, H2SO4) Neutral Ionic liquid 1-ally-3- Enriched Cellulose Lignin and 100-150˚C, several Low in the loss of High solvent cost, methylimidazolium- and hemicellulose some min to hours carbohydrate, few high cost in chloride ([AMIM]Cl), content hemicellulose degradation regeneration of ionic 1-ethyl-3- products except at liquid. methylimidazolium- severe conditions acetate ([EMIM]Ac). Alkaline Ammonium fiber Liquid or gaseous Whole biomass NA 100-140˚ C, 1:1- 2:1 Ammonia can be Safety precautions for expansion (AFEX) anhydrous ammonia ammonia to biomass recovered and dealing with volatile loading, 30-60 min reusable, few ammonia, not efficient residence time, 60- degradation for biomass 100% moisture product, dry content process NaOH NaOH Amorphous Soluble lignin NaOH Decrease the Longer residence time cellulose and some and crystallinity of required, large amount hemicellulose hemicellulose cellulose to highly of water used, difficult reactive in scaling up, amorphous expensive recovery of 104 cellulose, NaOH Solubilized lignin Lime Calcium monoxide Whole biomass NA 25- 160 ˚C, 120min Inexpensive Large amount of water with and without to weeks, 0.07-0.2 g pretreatment used, high cost in oxygen CaO/g biomass reactor system recovery of catalyst, long residence time Alkaline peroxide NaOH, H2O2 Enriched cellulose Soluble 0.5-2% NaOH, 0.125 Mild pretreatment Large amount of water and some Degraded lignin g H2O2 /g biomass, condition, used, high cost in hemicellulose and 22˚C, atmospheric commercially catalyst recovery, loss hemicellulose pressure for 2days used in paper of lignin due to industry oxidation Physiochem Steam explosion Steam and SO2 Enriched cellulose Soluble 180-210˚C, 1- Suitable for High cost in reactor ical hemicellulose 120min hardwood and due to high pressure herbaceous operation biomass Biological Fermentation with Microbes like fungi Whole biomass NA 25-30˚C solid state Mild pretreatment Slow process, not very Fungi or bacteria and bacteria with loss of fermentation, 80- condition, low high in sugar cellulose and 120% moisture energy required conversion, need hemicellulose content, 10-15 days continuous monitoring

Biological pretreatment methods usually require less expensive reaction systems compared with chemical or physical pretreatment process. Basidiomycetes has drawn a lot of interest in application of biodegradation of lignocellulose, as it plays an important role in decomposing woody material for carbon cycling in the ecosystem49. There are two different groups of lignocellulolytic Basidiomycetes: white-rot fungi and brown-rot fungi.

Brown-rot fungi usually decompose wood by degrading/removing carbohydrates and leaving lignin as modified without further degradation. White-rot fungi, such as

Phaneochaete chrysosporium, show ability to mineralize lignin, open up cell walls and increase carbohydrate accessibility50. The studies have shown that highly oxidative

. 2+ hydroxyl radical ( OH), generated by extracellular Fenton chemistry (Fe and H2O2), causes the destruction of lignin at close proximity49. Lignin is likely to undergo de- polymerization and re-polymerization forming different chemical structures, which overall results in the increasing accessibility to the polysaccharides in the cell wall.

However, biological pretreatment is usually a time-intensive process with lower throughput. In order to mimic white-rot fungi degradation of lignocellulosic biomass, solution phase Fenton chemistry opens up possibilities to generate hydroxyl radical and non-selectively oxidize organic species in the in-vitro condition. Typically, iron serves as an electron donor to hydrogen peroxide, which causes the catalytic decomposition of hydrogen peroxide in the generation of hydroxyl radicals. The classic Fenton’s reaction has been widely used in industrial wastewater treatment for degradation of toxic organics, decomposition of pesticides and herbicides51-52. Only a few studies apply the chemical

Fenton’s reaction onto the pretreatment of plant material. Because the simplicity, low cost of Fenton’s reaction, it provides potential application for high

103

throughput and efficient pretreatment in biomass-to-biofuel conversion compared with biochemical Fenton chemistry using white fungi. Fenton’s reaction has shown the ability to open up plant cell wall and increase cellulose accessibility on various biomass feedstock53. However, previous analysis on the Fenton-pretreated biomass shows that there is no lignin content difference in the post-treated biomass. No lignin-like degradation products were detected in the supernatant of pretreatment solution by

Fenton’s reaction as well.

4.1.2 Untargeted analysis by High-resolution Accurate Mass (HRAM) Mass

Spectrometry

The types of degradation reaction involved in lignin breakdown highly depend on the reactive functional groups within the molecules. As discussed in the previous chapter, lignin contains several important types of chemical groups: 1) aromatic rings are chemically stable and ring opening is usually achieved by inserting oxygen onto ring to yield carboxylic acid by enzymes like intra-molecular dioxygenases54; 2) ether linkages, such b-O-4, are major linkage in lignin and are stable and resistant to acid or base hydrolysis; 3) primary hydroxyl group on the g-C and the secondary alcohol on the a-C resulted from dominant b-O-4 group; the free phenolic group is at a relative lower content because it extensively involves in lignin polymerization reaction. All these hydroxyl groups have potential to be oxidized to form carboxylic acid, ketone or quinone groups in the biochemical reactions54.

#$ )$ - 49 Fe + H#O# → Fe +∙ OH + OH (1)

104

The hydroxyl radical generated from Fenton chemistry is highly reactive and can randomly attack organic species at close proximity. As the random and non-specific properties of reactive hydroxyl radical, the innovative aspect of this project is to study the oxidation profile of the products from Fenton’s reaction in an untargeted strategy using bioinformatics tools to screen out the possible lignin related product features. High- performance liquid chromatography (HPLC) coupled with high-resolution mass spectrometry (HRMS) provides additional separation step for analyzing complex sample mixture, together with mass-to-charge and ion intensity information. Thus, handling such complex HPLC-HRMS datasets is a critical step for identification and quantification of compounds of interest from complex sample systems. Many bioinformatics tools have been developed in order to handle such complex datasets for proteomics and metabolomics studies, such as XCMS55 and MZmine256. Those bioinformatics tools are widely used in data processing and analysis of metabolomics studies for detection of biologically meaningful metabolites as well as their dynamics related to environmental changes. Such bioinformatics tools are consist of a typical data processing platform, including several stages: filtering, detection of features, alignment and normalization57.

Filtering aims to remove interference signals, such as matrix backgrounds or random noise based on user define parameters. Feature detection step is to detect all the possible signals that are caused by true ions. In this step, both m/z and retention time information are pulled out to define chromatographic peak. During feature detection, isotopic pattern comparison can also be used to improve the detection accuracy by reducing the number of false positive assignments. Feature detection step is critical in accurately quantify the concentration of ions. It is common that retention time variation happens in LC

105

techniques. Alignment step is used for correcting the drifting in retention time between different runs and samples. The aim of normalization is to eliminate the undesired systematic deviation in ion intensities between runs and samples. This can be done by addition of internal standard compounds in the samples57.

In this study, high-resolution mass spectrometer, Thermo Scientific Q-Exactive

Orbitrap, is used as mass detector coupled with HPLC for analysis of complex post- pretreatment samples. HRMS measurements provide several advantages in feature detection as well as identification: 1) high sensitivity for identifying low abundant species because of high mass accuracy and 2) assignation of molecular formula with high accurate mass data. HRMS is able to measure accurate mass of analyte within ppm accuracy range. Additionally, high-resolution mass measurement allows highly resolved isotopic peaks, which largely increases the confidence in determining the elemental composition unambiguously. Further analysis, including isotope pattern, tandem mass spectrometry enables the selection and identification of ions of interest.

In this chapter, synthetic trimeric model compound was used for the pretreatment study. Because of the complexity of native lignin, it is extremely difficult to understand

Fenton chemistry at the molecular level when dealing with natural lignin. The problem is much less complicated when working with lignin model compounds. The trimer model used in this work was synthesized through a series of steps described in Chapter 3. As discussed, the trimeric model compound used in this chapter contains several important function groups, which will be a good representation for lignin as different functional groups may have different reactivity during the pretreatments. As discussed previously that the non-specificity and randomness in destructive reaction property of strong oxidant

106

-hydroxyl radicals in the Fenton chemistry, the product profile of post-oxidation with hydroxyl radical is expected to be challenging in complete prediction. Thus, the analytical strategy to investigate the product profile is to apply the similar untargeted analysis strategy in attempt to seeking and identifying the degradation products of trilignol model compound. The study will provide further understanding regarding the behavior of trimeric lignin molecules during the degradation process of Fenton chemistry and provide clues to help explain the observation during in the pretreatment of biomass feedstock with Fenton’s reaction.

4.2 Materials and methods

Materials

All chemicals and reagents were obtained and used without further purification.

Hydrogen peroxide (H2O2, 50% wt.), ferrous chloride tetrahydrate (reagent grade), sodium thiosulfate, LC/MS grade of water, acetonitrile and methanol were purchased from Thermo Fisher Scientific (Waltham, MA, USA). Vanillin, ferulic acid, para- phenylphenol, , and 3-hydroxy-4-methoxybenzoic acid (Isovanillic acid) were purchased from Sigma Aldrich (Milwaukee, WI, USA). Coniferyl alcohol was synthesized in house from coniferyl aldehyde using published methods36. Trilignolic mode compound [G(b-O-4) G(b-O-4)G] (GGG) was synthesized as described in Chapter

3 that consisted with a mixture of diastereomers. A Kromasil® EternityXT Hexylphenyl column (2.1X100mm, 2.5µm particle size) was purchased from Sigma Aldrich

(Milwaukee, WI, USA).

107

Fenton’s reaction pretreatment on trilignol

The GGG trimer was dissolved in 200 µl ethanol (EtOH) and then into10ml distilled water (18 MΩ), followed by 250 µl, 50 mM FeCl2 and 25.8 µl 50% H2O2

2+ resulting in [GGG]0 = 1 mM, [H2O2]0 = 50 mM and [Fe ]0 = 1.25 mM. The reaction was stirred at room temperature. Reaction samples (200 µl) were taken at the following time points: 0 min, 5 min, 15 min, 35 min, 75 min, 155 min, and 315 min. The reaction samples were quenched with 0.2 mL, 0.1M Na2S2O3 (aq), acidified with 0.2 ml, 1N HCl

(aq), and extracted with 200 µL ethyl acetate containing 0.25 µg/ml 4-phenylphenol as internal standard. This reaction was done in three replicates. 100 µL of above ethyl acetate layer was taken, dried under N2. The residue was then dissolved in 5% ACN, 5%

THF and 90% H2O for LC-MS analysis.

Sample Preparation for HPLC-HRMS analysis

For HPLC/MS analysis, a standard solution of 20µg/ml of mixed model compounds (vanillin, ferulic acid, isovanillic acid, coniferyl alcohol and synthesized trilignol GGG) was made in 95% H2O, 5% ACN. The mixed standard solution was used to develop LC separation method for analysis of post Fenton pretreatment samples. A

Kromasil® EternityXT HexylPhenyl column (100 x 2.1mm, 2.5µm) was used for all

HPLC separations, at a flow rate of 0.3 ml/min. Mobile phase: A: H2O, B: ACN. The eluent was introduced into Q-Exactive Orbitrap mass spectrometer (Thermo Fisher

Scientific) by electrospray ionization (ESI) in the negative ion mode. Raw MS data were processed by MZmine2 software package58. Briefly, after importing raw data to

MZmine2, mass detection function was first used to detect individual ions in each scan

108

and then a list of ions for each scan was created, followed by chromatogram builder step.

In the chromatogram builder step, a chromatogram was reconstructed for each mass which was previously detected. Each chromatogram built previously was then deconvoluted into individual peaks. After grouping isotopic peaks and filtering the ions based on the given restriction. A final peak list contains all features was generated, including m/z, retention time and peak area59. The peak area of each feature was manually integrated using Xcalibur and normalized to internal standard.

4.3 Results and Discussion

4.3.1 High-Performance Liquid Chromatography (HPLC) Method Development

To develop and evaluate the HPLC/MS methods for post Fenton pretreatment products analysis, mixture of pure standard compounds were used and separated by a reversed phase Hexylphenyl column. Hexylphenyl column was chosen because of its good efficiency in separating aromatic compounds. Several mobile phase gradients were tested with standard compounds. In order to be compatible with negative ion mode ESI, formic acid was not added to the mobile phase. The Kromasil® EternityXT Hexylphenyl column is packed with porous silica microspheres, which are functionalized with hexylphenyl stationary phase. After comparison of different gradients, the optimal nonlinear gradient elution is used for this column at a flow rate of 0.3ml/min (see Table

4.2).

Figure 4.2.b-e. show the EIC of each deprotonated model compounds. Vanillic acid, ferulic acid and coniferyl alcohol are well separated from each other. Peaks of

109

vanillin and coniferyl alcohol are very close to each other. With accurate mass EIC, these two peaks are able to be distinguished from each other. Figure 4.3 shows the EIC of the deprotonated trilignol (GGG). There are four peaks accounted for trilignol. The mixture of diastereomers are not able to be completely resolved in this LC condition. Since only the relative quantification of total amount of trilignol is interested in this study, it is acceptable that the diastereomers of trilignol are not fully separated from each other.

110

Table 4.2. LC method used in the pretreatment study

Time (min) %A: H2O %B: ACN

0.00 90 10

5.00 90 10

20.00 10 90

23.00 10 90

25.00 50 50

27.00 90 10

111

mix_std_051316_01 5/13/2016 1:48:36 PM

RT: 0.00 - 27.01 SM: 7G 20.56 NL: 100 a) Base Peak Chromatogram 1.44E8 20.47 2.11 8.96 21.45 Base Peak MS 15.78 mix_std_051 50 316_01 6.35 11.75 11.20 12.65 18.02 25.62 25.84 0.73 15.62 17.47 20.29 21.91 23.69 2.69 3.69 4.76 5.21 6.69 8.65 10.28 0 2.11 NL: 100 b) 3-hydroxy-4-methoxybenzoic acid 1.08E8 m/z= 167.03- 167.04 MS 50 mix_std_051 316_01 0.92 2.54 3.47 4.29 5.20 6.69 7.30 9.09 9.59 11.21 12.36 14.51 16.65 17.19 18.35 19.78 21.48 22.82 23.77 25.10 26.96 0 6.35 NL: 100 c) Ferulic acid 3.91E7 m/z= 193.05- 193.05 MS 50 mix_std_051 316_01

0.71 2.44 3.21 3.69 5.98 6.90 7.30 8.86 9.78 11.23 11.85 13.65 15.05 15.74 17.19 18.35 20.31 21.19 23.07 24.70 25.23 27.00

RelativeAbundance 0 8.83 NL: 100 3.54E7 d) Coniferyl alcohol m/z= 179.07- 179.07 MS 50 mix_std_051 316_01

1.33 2.29 3.65 4.46 4.98 6.70 8.03 9.09 10.11 12.40 13.27 14.15 15.34 17.06 17.76 18.86 20.06 22.42 23.74 24.58 26.91 0 8.96 NL: 100 1.01E8 e) Vanillin m/z= 151.04- 151.04 MS 50 mix_std_051 316_01

0.92 2.11 3.69 4.43 5.21 6.38 7.72 8.57 9.26 10.74 12.72 14.51 16.65 17.77 18.35 19.49 20.77 23.20 23.96 25.26 26.96 0 0 2 4 6 8 10 12 14 16 18 20 22 24 26 Time (min)

Figure 4.2. Liquid chromatogram of mixture of standards; a) Base peak chromatogram

(BPC); b) Extracted Ion Chromatogram (EIC) of 3-hydroxy-4-methoxybenzoic acid; c)

EIC of ferulic acid; d) EIC of coniferyl alcohol; e) EIC of vanillin.

112

\\EXACTIVE\Xcalibur\...\GGG_051316_01 5/13/2016 4:02:49 PM

RT: 0.00 - 27.01 SM: 7G 10.86 NL: 100 8.44E7

\\EXACTIVE\Xcalibur\...\GGG_051316_01 5/13/2016 4:02:49 PM m/z= 95 571.21- RT: 10.08 - 11.66 SM: 7G 10.86 NL: 571.23 90 100 8.44E7 m/z= MS 95 571.21- 571.23 GGG_0513 90 10.83 MS 85 GGG_051316_01 85 16_01 OH 10.95 10.95 80 80 O O HO O 75 OH 75 70 65 HO HO O O 70 60 OH 55

Chemical Formula: C30H36O11 50 65 Exact Mass: 572.2258 45 RelativeAbundance 60 40 35 55 30 25 50 20 15 11.03

10 45 5 11.12 11.21 11.25 11.35 11.46 11.57 0 RelativeAbundance 40 10.1 10.2 10.3 10.4 10.5 10.6 10.7 10.8 10.9 11.0 11.1 11.2 11.3 11.4 11.5 11.6 Time (min) 35

30

25

20

15 11.03

10

5 0.35 6.07 8.42 11.12 11.81 14.21 15.10 15.89 17.89 19.15 20.02 21.75 22.58 23.21 24.47 25.70 0 0 2 4 6 8 10 12 14 16 18 20 22 24 26 Time (min)

Figure 4.3. Extracted Ion Chromatogram (EIC) of starting trilignol GGG.

113

4.3.2 Analysis of Unknown Products from Fenton’s Reaction of Trilignolic Model

Compound (GGG)

Thermo Scientific Q-Exactive Orbitrap mass spectrometer was coupled with

HPLC system for high resolution MS data acquisition. All the standard model compounds formed stable deprotonated ions [M-H]-. The high-resolution accurate mass data acquisition allows the determination of the elemental compositions for unknown post-pretreatment products as well as the standard compounds. To obtain structural information, data-dependent tandem mass spectrometric experiment (MS/MS) was performed. In data-dependent MS/MS experiment, one full scan at high-resolution

(R=70,000) was carried out to determine the subsequent MS/MS experiment. Five most abundant ions in each full scan were then subjected to high-energy collision dissociation

(HCD) cell for MS/MS experiment. The products ions were analyzed at a resolution of

17,500 in the Orbitrap mass analyzer. MS/MS experiments and high-resolution mass measurements can be used to propose the molecular structures to the detected unknown compounds. Upon MS/MS experiments, the deprotonated molecules of lignin related compounds share common fragmentation patterns, which indicate the existing of particular functional groups and chemical linkages30-31, 60. For example, loss of 15Da,

18Da, and 30Da are corresponding to the loss of methyl radical, water and formaldehyde group which indicates the presence of methoxy group on the aromatic ring and aliphatic alcohol group at g-C, respectively. Elemental composition assignment with high- resolution analysis provides another level of confidence in interpreting the MS/MS spectrum as well.

114

Para-phenylphenol (PPP) was used as the internal standards for all the LC/MS analysis. PPP is chosen because of its similar aromaticity and phenoxy group, which gives reasonable response in the negative ion mode. PPP also showed good stability during the work-up process of solution phase Fenton chemistry. The condition of

Fenton’s reaction used in this study was modified based a previous publish work52, 61. In this study, initial concentrations of three reagents were used as following: [GGG]0 = 1

2+ mM, [H2O2]0 = 50 mM and [Fe ]0 = 1.25 mM. Reaction mixtures were sampled at fixed intervals.

Data processing was performed using MZmine2 software58. The peak list generated by MZmine2 contains all the features with m/z value, retention time and the peak area information. All the features were manually screened, based on the mass defect and MS/MS information. Table 4.3 is a summary of twenty features that have reasonable mass defect to be potential lignin-like compounds. The time course measurements of sixteen features were used to plot the concentration change curves. Figure 4.4 shows the concentration changes of starting trilignol over 5 hours.

As shown in Figure 4.4, there is a dramatically decreasing of GGG in the first 5 min, the degradation rate drops as the amount of the remaining GGG decreases. When there is only GGG and FeCl2 presence in the reaction mixture at t = 0 min, the generation of unknown peaks, which are around trimeric and tetrameric molecular weight ranges, are observed. The mechanism of production of trimeric and higher MW molecules is still unclear. Figure 4.5 shows the time course measurement of major trimeric unknown

- product [M-H] 551.1909 (C30H31O10, -1.6784 ppm).

115

Table 4.3. List of 20 features with their corresponding elemental composition and retention time based on the result from MZmine 2.

m/z [M-H]- ∆PPM rt(min) 745.3051 C38H49O15 -1.4951 12.97 629.2594 C33H41O12 -1.3164 11.09 593.1428 C16H33O23 0.986 12.43 571.2179 C30H35O11 -1.0321 10.82 569.2023 C30H33O11 -0.8845 11.43 563.2131 C28H35O12 -1.3373 12.39 553.2064 C30H33O10 -0.6143 12.44 551.1909 C30H31O10 -1.6784 12.78 523.1597 C28H27O10 -2.6059 13.23 433.1856 C23H29O8 -3.0131 12.46 375.1439 C20H23O7 -2.68 12.17 375.1439 C20H23O7 -2.68 10.51 373.1282 C20H21O7 -2.8727 11.43 373.1282 C20H21O7 -2.3002 12.8 353.1597 C18H25O7 -2.745 12.98 213.0762 C10H13O5 -2.523 0.94 209.0812 C11H13O4 -2.9563 13.05 209.0812 C11H13O4 -3.0293 14.35 197.0449 C9H9O5 -3.4031 0.94 195.0292 C9H7O5 -3.6228 0.6 151.0397 C8H7O3 -2.5113 8.93 167.0346 C8H7O4 -2.0628 1.71

116

Figure 4.4. Time course measurement of starting material GGG. Error bars indicate ± one standard deviation.

117

Figure 4.5. Degradation of trimeric unknown feature, [M-H]- 551.1909. Error bars indicate ± one standard deviation.

118

Similar to degradation pattern of trimer GGG, upon the addition of H2O2, trimeric compound ([M-H]- 551.1909) is subjected to degradation rapidly in the first 5 min and goes to completion less than 3 hours (Figure 4.5). Two possible structures of unknown

551.1909 is proposed according the fragmentation pattern as well as the elemental composition of fragment ions (shown in Figure 4.6). The fragmentation pattern of ion

551.1909 is very similar to original trimer GGG. They both contain fragment ion

195.0656 with the same elemental composition, which indicates that the unknown trimer with m/z 551.1909 contains same phenolic end group as in trimer GGG. Figure 4.7 depicts the possible fragmentation pathways of the putative structures that will lead to the fragmentation ions, which were observed in the HCD/MS/MS spectrum (Figure 4.6.a).

Time Course Curve of Other Unknown Features

The peak area of individual features at different time point was extracted out manually using Xcalibur. The time course measurements of a few unknown features are showed in Figure 4.8. The generation of the unknown features with monomeric and dimeric molecular weights are observed as Fenton reaction proceeds, such as [M-H]-

151.0397, 375.1489, and 373.1282. Concentration of several dimeric and monomeric unknown compounds, such as m/z 375.1439 (12.17 min) and m/z 151.0397, increases as the reaction started and decreased after about 1hr, which indicates a further degradation occurs among these intermediate products as shown in Figure 4.8.

119

a)GGG_Fenton_051916_mixed_target_MS2_05... 5/23/2016 3:50:20 PM GGG_Fenton_051916_mixed_target_MS2_052316 #757 RT: 12.75 AV: 1 NL: 1.78E6 T: FTMS - p ESI Full ms2 [email protected] [50.00-580.00] 355.1173 C 20 H19 O6 11.5 RDBE -3.8621 ppm 100

95

90

85

80

75

70

65

60 503.1699 55 C 29 H27 O8 16.5 RDBE 50 195.0656 -2.5442 ppm C H O 45 10 11 4 5.5 RDBE

RelativeAbundance 40 -3.5096 ppm

35

30 165.0550 25 C 9 H9 O3 5.5 RDBE 20 -4.2063 ppm 551.1909 C H O 15 30 31 10 15.5 RDBE 341.1015 -2.4535 ppm 10 C H O 61.9875 19 17 6 5 11.5 RDBE -4.5736 ppm 0 50 100 150 200 250 300 350 400 450 500 550 m/z

b)

OH

O O O O OH

HO HO O O

Chemical Formula: C30H32O10 Exact Mass: 552.1995

OH

O O HO O O

HO HO O O

Chemical Formula: C30H32O10 Exact Mass: 552.1995

Figure 4.6. a) HCD/MS/MS spectrum of m/z 551.1909; b) putative structures of m/z

551.1909.

120

OH

O O HO O O

-O HO O O

- Chemical Formula: C30H31O10 Exact Mass: 551.1923

OH -O HO O O O

O O -O HO Chemical Formula: C H O - 10 11 4 - Exact Mass: 195.0663 Chemical Formula: C20H19O6 Exact Mass: 355.1187

OH

O O O O OH

-O HO O O

- Chemical Formula: C30H31O10 Exact Mass: 551.1923

OH -O O O O OH

O O -O HO Chemical Formula: C H O - 10 11 4 - Exact Mass: 195.0663 Chemical Formula: C20H19O6 Exact Mass: 355.1187

Figure 4.7. possible fragmentation structures for corresponding trimeric unknown m/z

551.1909

121

0.6

0.5

0.4 375.1439(12.17min)

151.0397

745.3068 0.3 629.2593

569.2021

Normalized Peak Area Normalized 563.213 0.2 553.2067

209.0812(13.03min)

0.1

0 0 50 100 150 200 250 300 350 Time(min)

0.07

0.06

0.05

0.04 523.1597 373.1282(11.43min) 0.03 373.1282(12.8min)

Normalized Peak Area Normalized 375.1439(10.51) 0.02

0.01

0 0 50 100 150 200 250 300 350 Time(min)

0.09

0.08

0.07

0.06

0.05 209.0812(14.36min) 195.0292 0.04 167.0345

Normalized Peak Area Normalized 0.03

0.02

0.01

0 0 50 100 150 200 250 300 350 Time(min)

Figure 4.8. Time course plots of fifteen unknown features. Error bars indicate ± one standard deviation.

122

Identification of unknown features

Among these unknown features, three of them are identified and confirmed with standard compounds. Figure 4.9. shows the extracted ion chromatogram of [M-H]- with exact m/z 151.0397 at retention time of 8.96 min, corresponding to an elemental composition of C8H7O3, ∆ppm= -2.5 ppm and a double bond equivalent (DBE) of 5, which can be a compound with one aromatic ring and one double bond. Upon

HCD/MS/MS experiment, this unknown m/z 151.0397 is fragmented via a methyl radical loss (15 Da) and gives rise to m/z 136.0161 (C7H4O3, ∆ppm=-3.6341), which indicates there is a methoxy group on the aromatic ring31. The exact m/z value, elemental composition (EC) assignment as well as the fragmentation patterns are identical to those of the deprotonated vanillin. The confirmation of structure of m/z 151.0397 ion is done by comparison of the retention time and HCDMS/MS data with commercial vanillin, which indeed shows the same with the unknown peak from post Fenton reaction extract, shown in Figure 4.9 upper panel. Hence, this unknown peak at rt 8.96 min in the trilignol degradation product mixture is identified as vanillin. There are another two clusters of chromatographic peaks shown in the Figure.4.9 lower panel that have the ions of

151.0397 at different retention time than standard vanillin. One of the possible explanations is that observation of ion 151.0397 could be the result of in-source fragmentation of the other post-pretreatment products.

123

Vtips_GOH_TBAF_050416_SiO2_frac_41_Neg 5/9/2016 11:17:23 AM Vtips_GOH_TBAF_050416_SiO2_frac_41(combined) 5mM NH4OH/ACN=1:1 D:\Data\...\GGG_Fenton_051916_LC_04 5/19/2016Vtips_GOH_TBAF_050416_SiO2_frac_41_Neg 4:16:25 PM #29 RT: 0.82 AV: 1 NL: 7.97E7 T: FTMS - p ESI Full ms2 [email protected] [50.00-750.00] 136.0159 C 7 H4 O3 HO 6.0 RDBE -4.7560 ppm RT: 6.00 - 18.00 SM: 7G 100 8.96 95 NL: 100 90 1.78E8 85 O m/z= 90 80 75 O 151.0382- 70 151.0412 80 65 MS 60 Chemical Formula: C8H8O3 mix_std_051 70 55 50 Exact Mass: 152.0473 316_03 45

60 RelativeAbundance 40 35 151.0396 C 8 H7 O3 30 5.5 RDBE 50 -3.1174 ppm 25

20 40 15 10

RelativeAbundance 30 5 0 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 m/z 20

10 9.21 6.29 6.57 7.33 7.84 8.30 8.62 9.40 9.84 10.51 11.00 11.69 12.24 13.16 13.97 14.51 15.29 15.92 16.57 17.23 17.89 0 8.96 GGG_Fenton_051216_MS2_02 5/13/2016 6:53:57 PM NL: 100 GGG_Fenton_051216_MS2_02 #1080 RT: 9.05 AV: 1 NL: 2.23E7 T: FTMS - p ESI d Full ms2 [email protected] [50.00-175.00] 2.73E7 136.0161 C 7 H4 O3 6.0 RDBE m/z= 90 -3.6341 ppm 100 151.0382- 95 151.0412 80 90 85 MS 80 GGG_Fenton 70 75 _051916_LC 70

65 _04

60 60 8.91 55 50 50 45

RelativeAbundance 40

35 40 151.0394 30 C 8 H7 O3 12.45 5.5 RDBE 10.43 25 -4.1277 ppm 30 10.51 12.74 20 15

10.67 10

20 5

0 50 60 70 80 90 100 110 120 130 140 150 160 170 10 11.26 11.70 13.16 m/z 9.20 10.29 13.28 14.00 14.42 6.16 6.73 7.33 8.16 8.60 14.72 15.76 15.82 16.69 17.02 17.77 0 6 7 8 9 10 11 12 13 14 15 16 17 18 Time (min)

Figure 4.9. Extract ion chromatogram (EIC) and HCD/MS/MS spectrum of vanillin

(upper panel), EIC and HCD/MS/MS spectrum of unknown m/z 151.0397 (lower panel)

124

Similarly, another two unknown peaks with retention time 1.73min and 10.54min are identified as vanillic acid and dimeric G(b-O-4)G (Figure 4.10 and 4.11) by comparing the retention times as well as HCD/MS/MS spectra with standard compounds.

As showed in Figure 4.11, there are two clusters peak around 10.43 and 12.45 min that have the same exact m/z value as well as MS/MS spectrum, which indicates these are likely to be the isomers of vanillin that have very similar structure with vanillin. Similar observation happens to dimeric G(b-O-4)G, a pair of peaks at 12.18 min is not able to distinguish from pair of peaks at 10.53 min by the HCD/MS/MS experiment other than the different in their retention times. It could be result of being different diastereomers of dimer. The putative structures of other three unknown molecules in the trilignol degradation reaction mixture are proposed according to the exact mass as well as the fragmentation pattern (shown in Figure 4.12). Unfortunately, the lack of authentic compounds makes the identification and confirmation of these unknowns impossible.

125

D:\Data\...\GGG_Fenton_051716_LC_04 5/17/2016 5:19:56 PM

RT: 1.00 - 10.00 SM: 7G vanillic_acid_std_052516_neg 5/25/2016 5:06:30 PM 2ng/ul vanillic acid standard 90%H2O,5%ACN,5% THF 1.85 vanillic_acid_std_052516_neg #21 RT: 0.69 AV: 1 NL: 7.25E5 NL: 100 T: FTMS - p ESI Full ms2 [email protected] [50.00-200.00] 152.0112 1.44E8 C 7 H4 O4 6.0 RDBE -2.3205 ppm m/z= 90 HO 100 95 167.0329- 90 167.0363 80 85 80 167.0345 MS C 8 H7 O4 1.90 75 5.5 RDBE vanillic_acid OH -2.7936 ppm 70 70 _052516_01 O 65 123.0450 60 60 C 7 H7 O2 55 4.5 RDBE -1.0088 ppm 50 O 50 45

RelativeAbundance 40 108.0216 Chemical Formula: C H O 35 C 6 H4 O2 8 8 4 5.0 RDBE 40 30 -1.0836 ppm Exact Mass: 168.0423 25

RelativeAbundance 20

30 15 91.0185 10 C 6 H3 O 5.5 RDBE 20 5 83.0133 -4.2962 ppm 0 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 m/z 10 1.31 1.46 2.14 2.31 2.63 3.14 3.45 3.93 4.14 4.39 5.11GGG_Fenton_051216_MS2_025.49 5.73 6.08 6.42 5/13/20166.82 6:53:57 PM 7.27 7.59 8.03 8.26 8.63 8.81 9.18 10.00 GGG_Fenton_051216_MS2_02 #319 RT: 1.80 AV: 1 NL: 3.79E6 0 T: FTMS - p ESI d Full ms2 [email protected] [50.00-190.00] 1.73 123.0449 NL: 100 C 7 H7 O2 4.5 RDBE 4.20E6 -1.9388 ppm 100 m/z= 90 95 90 167.0329- 85 167.0363 80 80 75 167.0344 MS C H O 70 8 7 4 5.5 RDBE GGG_Fenton 70 65 -3.5244 ppm _051716_LC 60 55 _04 60 50

45 152.0110 C 7 H4 O4 RelativeAbundance 40 50 6.0 RDBE 35 -3.3243 ppm 108.0215 30 C 6 H4 O2 5.0 RDBE 25 40 -1.9311 ppm 20

15 91.0185 30 10 5 132.8675

0 20 2.15 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 3.60 m/z 8.71 10.00 3.64 8.66 8.75 9.53 3.36 3.39 7.90 8.19 10 1.04 1.52 2.31 2.83 3.80 4.33 4.75 4.97 5.49 6.07 6.31 6.58 7.35 7.57

0 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0 8.5 9.0 9.5 10.0 Time (min)

Figure 4.10. EIC and HCD/MS/MS spectrum of vanillic acid (upper panel), EIC and

HCD/MS/MS spectrum of m/z 167.0346 (lower panel)

126

D:\Data\...\GGG_Fenton_051916_LC_04 5/19/2016 4:16:25 PM

Vtips_GOH_TBAF_050416_SiO2_frac_41_Neg 5/9/2016 11:17:23 AM Vtips_GOH_TBAF_050416_SiO2_frac_41(combined) 5mM NH4OH/ACN=1:1 Vtips_GOH_TBAF_050416_SiO2_frac_41_Neg #34 RT: 0.97 AV: 1 NL: 2.21E7 RT: 6.00 - 18.00 SM: 7G T: FTMS - p ESI Full ms2 [email protected] [50.00-750.00]

327.1223 10.53 195.0653 NL: 100 C 19 H19 O5 100 10.5 RDBE 95 -4.7186 ppm 1.62E8

90 OH m/z= 90 85 80 375.1401- O O 75 375.1476 80 70 65 MS 60 gg_dimer_se 70 OH 55 50 mi_051316_ HO HO O 45 01 60 RelativeAbundance 40 35 179.0709 C 10 H11 O3 30 5.5 RDBE -2.4833 ppm 50 Chemical Formula: C20H24O7 25 Exact Mass: 376.1522 20 15 375.1434 C 20 H23 O7 10 9.5 RDBE 40 -4.0630 ppm 5

10.72 0 60 80 100 120 140 160 180 200 220 240 260 280 300 320 340 360 380 400 420 440 RelativeAbundance 30 m/z

20

10 7.28 9.03 9.85 10.40 10.84 11.58 12.18 12.51 13.07 13.67 14.24 15.15 15.76 16.74 17.69 0 GGG_Fenton_051916_mixed_target_MS2_05... 5/23/2016 3:50:20 PM 12.18 NL: GGG_Fenton_051916_mixed_target_MS2_052316 #681 RT: 12.15 AV: 1 NL: 7.92E6 100 T: FTMS - p ESI Full ms2 [email protected] [50.00-400.00] 327.1225 4.06E7 C 19 H19 O5 10.5 RDBE -3.9722 ppm m/z= 90 100 95 375.1397- 90 375.1473 80 85 80 MS 75 GGG_Fenton 70 70 65 _051916_LC 195.0656 60 C 10 H11 O4 5.5 RDBE _04 55 60 -3.4313 ppm 50

45 50 RelativeAbundance 40 35

30

25 40 12.27 165.0551 20 C 9 H9 O3 5.5 RDBE 15 -3.9290 ppm 375.1435 C 20 H23 O7 10 312.0993 C H O 9.5 RDBE 30 18 16 5 -3.7376 ppm 10.54 5 11.0 RDBE 149.0228 -3.1630 ppm 0 120 140 160 180 200 220 240 260 280 300 320 340 360 380 20 m/z 12.06 10 10.72 11.86 6.51 7.20 7.57 8.02 8.47 8.86 9.15 9.72 10.10 11.17 13.07 14.00 14.08 14.85 15.77 16.36 17.12 0 6 7 8 9 10 11 12 13 14 15 16 17 18 Time (min)

Figure 4.11. EIC and HCD/MS/MS spectrum of synthetic G(b-O-4)G (upper panel), EIC

and HCD/MS/MS spectrum of m/z 375.1439 (lower panel)

127

GGG_Fenton_051916_mixed_target_MS2_05... 5/23/2016 3:50:20 PM

GGG_Fenton_051916_mixed_target_MS2_052316 #69 RT: 0.65 AV: 1 NL: 4.19E6 T: FTMS - p ESI Full ms2 [email protected] [50.00-235.00] 213.0762 C 10 H13 O5 4.5 RDBE -3.0243 ppm 100 OH 95

90

85 OH

80 OH 75 HO 70 O 65

60 Chemical Formula: C10H14O5 55 Exact Mass: 214.0841 50 169.0502 C H O 45 8 9 4 4.5 RDBE

RelativeAbundance 40 -2.7280 ppm 125.0606 35 C 7 H9 O2 3.5 RDBE 30 -1.4979 ppm

25 137.0242 C 7 H5 O3 155.0347 20 109.0294 5.5 RDBE C 7 H7 O4 C 6 H5 O2 -1.9171 ppm 15 83.0497 4.5 RDBE 96.9598 4.5 RDBE -2.1240 ppm C 5 H7 O -0.9786 ppm 10 2.5 RDBE 57.0337 -6.4143 ppm 5 98.9556 59.0130 79.9567 0 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 GGG_Fenton_051916_mixed_target_MS2_05... 5/23/2016 3:50:20 PM m/z GGG_Fenton_051916_mixed_target_MS2_052316 #803 RT: 13.11 AV: 1 NL: 3.78E5 T: FTMS - p ESI Full ms2 [email protected] [50.00-230.00] 194.0578 C 10 H10 O4 6.0 RDBE -3.4632 ppm 100

95

90 O

85

80

75

70 HO OH 65 209.0811 O C 11 H13 O4 60 5.5 RDBE -3.8320 ppm 55 Chemical Formula: C11 H14O4 50 Exact Mass: 210.0892 45

RelativeAbundance 40

35

30

25

20

15 148.0526 C 9 H8 O2 165.1278 10 111.0812 123.0449 6.0 RDBE C 11 H17 O C H O C 7 H7 O2 7 11 -2.7781 ppm 3.5 RDBE 5 61.9875 4.5 RDBE 2.5 RDBE -4.0310 ppm 98.9715 -2.8429 ppm -2.0008 ppm 140.9601 180.9886 0 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 GGG_Fenton_051916_mixed_target_MS2_05... 5/23/2016 3:50:20 PM m/z GGG_Fenton_051916_mixed_target_MS2_052316 #108 RT: 0.95 AV: 1 NL: 3.23E5 T: FTMS - p ESI Full ms2 [email protected] [50.00-220.00] 109.0658 C 7 H9 O 3.5 RDBE 153.0554 -1.2666 ppm C 8 H9 O3 100 4.5 RDBE OH -2.1434 ppm 95

90 125.0606 85 OH C 7 H9 O2 3.5 RDBE 80 -1.6199 ppm 75 HO

70 O 65

60 Chemical Formula: C10H14O4 55 Exact Mass: 198.0892 197.0814 50 C 10 H13 O4 4.5 RDBE 45 -2.5169 ppm

RelativeAbundance 40 96.9598 35

30 61.9875 121.0293 25 169.0502 C 7 H5 O2 C 8 H9 O4 20 91.0549 5.5 RDBE 4.5 RDBE C 7 H7 -1.3858 ppm -2.3670 ppm 15 4.5 RDBE 135.0813 182.0215 -4.3048 ppm C 9 H11 O 154.9129 C H O 10 59.0130 79.9568 8 6 5 4.5 RDBE 6.0 RDBE -1.6601 ppm -3.0584 ppm 5 67.0181 103.9763 130.9922 0 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 m/z

Figure 4.12. HCD/MS/MS spectra of three unknown features with putative structures

128

4.4 Conclusion

Previous studies have shown that solution phase Fenton chemistry can play a positive role in increasing the accessibility of cellulose in biomass. However, the total lignin content in the biomass remain unchanged after solution phase Fenton pretreatment53. Monitoring the reactions of the solution phase Fenton chemistry on the synthetic trimeric lignin model compound provides insight into understanding the degradation process of Fenton reagent towards lignin compounds. The time course measurement of trilignol shows that the trimer is degraded or modified in the solution phase Fenton condition within a few hours. The study also shows that the degradation is not a first order reaction. The concentration of starting trimer drops rapidly in the first

5min then undergoes at a slower degradation rate. A list of features was generated by using MZmine2 for data preprocessing. With high-resolution mass spectrometry, the possible lignin-like features could be screened out from the other interference ions according to the accurate mass and tandem mass spectrometry. Upon comparison with three authentic compounds, the production of vanillin, vanillic acid and G(b-O-4)G dimeric compound in the Fenton pretreatment reaction are successfully confirmed. While no significant lignin content decrease upon the Fenton pretreatment biomass (switchgrass and miscanthus), the Fenton reaction upon lignin model compounds shows opposite results. One possible explanation is that the accessibility of lignin reactive site toward hydroxyl radical. The hydroxyl radical is a strong oxidative radical with short lifetime and tend to randomly attack reactive sites in close proximity49. During the study of the solution phase Fenton chemistry upon synthetic lignin model compound, the reaction was done in a homogeneous system. The trimeric compound was completely dissolved in the

129

presence of ethanol compared to the heterogeneous conditions found in ground biomass pretreatment. Moreover, due to the complexity of cell wall structure, lignin has complex cross-linkages with hemicellulose and other molecules. The complex lignin structure could largely reduce the available lignin reactive sites to free hydroxyl radicals. This study has successfully developed the LC-MS/MS analytical techniques for the trimeric lignin model compound degradation products. The study also provides an insightful understanding of the Fenton chemistry towards the lignin model compound which could be extended to natural lignin. This study raises up the potentials for improving reaction conditions, which is required for treating natural biomass and the possibility of complete removal of lignin with Fenton chemistry.

130

Chapter 5

Conclusion

The focus of this dissertation is to develop mass spectral analytical methods in characterization of lignin model compounds, design synthetic route for novel lignin model compounds as well as study biomass pretreatment using state of the art high- resolution mass spectrometry. Lignin has drawn increasing attention in the area of biomass-to-biofuel conversion as it negatively affects the utilization of enclosed cellulose in the biomass. Un-locking the aromatics in lignin provides a potential renewable replacement for the traditional chemical precursors derived from petro-chemistry. A better understanding of lignins’ physical and chemical properties will contribute in exploring potentials to upgrade the commercial value of lignin, which requires complete elucidation of lignin structure. However, lignin analysis remains quite challenging, as no analytical methods have been able to completely depict the primary structures of lignin today. As mass spectrometry and tandem mass spectrometry have been successfully in elucidation of primary structure of proteins and peptides, it becomes a very attractive research area for the application of mass spectrometry in lignin characterization and structural elucidation.

One of the most important factors when applying mass spectrometry in lignin chemistry is to evaluate the ionization response of lignin compounds. Electron ionization

(EI) coupled with gas chromatogram (GC)-mass spectrometry (MS) has been successfully applied in characterizing monomeric composition of biomass with the aid of prior chemical degradation. However, electron ionization (EI) is not compatible with non-volatile analytes. Interpretation of complex EI mass spectrum remains very

131

challenging, thus little structural information can be easily obtained for high molecular weight of lignin oligomers with traditional EI ionization technique. Soft-ionization techniques, such as electrospray ionization (ESI) and matrix assisted laser desorption ionization (MALDI), have shown successful application in large biopolymers. Efforts have been investigated in characterizing lignin compounds with several soft-ionization techniques coupled with mass spectrometry, such as ESI, MALDI, Atmospheric Pressure

Chemical Ionization (APCI) and Atmospheric Pressure Photoionization (APPI). Among these, electrospray ionization (ESI) is widely used since it requires less rigorous sample preparation and is easily compatible with separation techniques, such as high- performance liquid chromatography (HPLC). Electrospray ionization allows easy selection of positive ion or negative ion modes, which is mainly dependent on the chemical functional groups of the analytes. In an attempt to apply ESI in lignin study, negative ionization mode in ESI through simple deprotonation is commonly used because of the presence of weak acidic phenol functional group. Due to the structural diversity of lignin, negative ion mode with traditional deprotonation techniques may not be suitable for the lignin oligomers that don’t have free phenolic hydroxyl group. A few studies have observed that positive ion mode in ESI through adduct formation gives better ionization response than negative ion mode via simple deprotonation20-21, 32.

The second chapter of this dissertation focuses on systematic comparison of ionization response with known structures of lignin trimeric model compounds under different ESI conditions. The preliminary mass spectral characterization of dehydrogenation polymers (DHPs) synthesized in two different reaction conditions shows the significant structural difference in the product profile. The DHP synthesized in

132

the presence of CTAS (DHP-CT) contained high abundant of a, b-diaryl ether linkage.

The ESI ionization response of the DHP-CT showed distinct difference between negative ion ESI via simple deprotonation and positive ion mode ESI via the formation of ammonium cation adducts. Higher molecular weight oligomers, number of degree of polymerization to 10, were observed in positive ion mode of a, b-diaryl ether linkage rich DHP, while same high molecular weight ions were absent in the negative ion mode with simple deprotonation. Monomeric and dimeric ions were the most abundant in negative ion mode. In the sharp contrast to DHP-CT, DHPs, synthesized from continuous addition in aqueous buffer (DHP-ZT), gave similar molecular weight distribution in both negative ion mode and positive ion mode. This interesting observation motived a further study for a deeper understanding of the relationship between the ionization conditions and the structures of lignin. This ionization study might provide insights for selecting ionization conditions when encountering lignin compounds. The first hypothesis of this dissertation is that the ionization response of lignin is highly related to the specific chemical linkage types of lignin compounds as well as the solution chemistry of electrospray ionization. This hypothesis was tested by a systematic comparison of three trilignols containing unique chemical bonding type under different ESI conditions. The results of negative ion mode ESI with deprotonation [M-H]- and chloride adduction formation [M+Cl]- showed that the trimer containing a, b-diaryl ether linkage suffered significantly from in-source fragmentation. When forming [M-H]-, a, b-diaryl ether trimer resulted in poor ion response compared with other trimers containing typical b-O-4 linkage. Through detail comparison of the structure of each trimer, it was proposed that the trimer with a, b-diaryl ether linkage requires less fragmentation energy because it

133

contains phenolic group at a-position as a much better leaving group compared with hydroxyl group at a-position in the other two trimers. This in-source fragmentation is

- especially activated when [M-H] generated by deprotonation at the phenolic site. The deprotonation activates the charge-driven fragmentation pathway. The result of chlorine adduct formation [M+Cl]- has supported this hypothesis by obtaining negative ions without the removal of proton at phenolic site.

As the preliminary data showed, ammonium adduction formation gave overall good response across two different DHPs. Quantitative comparison of a series of different cation adduct formations was conducted with trimeric model compounds.

Results from these experiments showed that Li+ and Na+ were the two cations that gave best ion response across all three trimers. Compared with negative ion mode, no in- source fragment ions were observed in the positive ion mode. This result was consistent with previous observations 32. Interestingly, the trimer, containing a, b-diaryl ether linkage, gave an overall much better ion response compared with other two trimers across all different cation condition in the positive ion mode. It is proposed that a, b-diaryl ether linkage trimer contains no cyclic linkage type and has higher degree of freedom to rotate in order to coordinate with cations compared with other two trimers with cyclic linkage present.

Alkali metal adduct ions in collision induced dissociation (CID) are known to give little or no structural information as either alkali metal cation is fragmented off immediately or no fragment ions are generated during CID process32, 62. In contrast to traditional CID observation towards alkali metal adduct ion, useful fragmentation ions during the tandem mass spectrometric experiment using higher-energy collision

134

dissociation (HCD) techniques were successfully obtained. Fragmenting alkali metal adduct ions, such as [M+Li]+ and [M+Na]+, generally required higher collision energy

+ + than that of [M+NH4] or [M+H] . The HCD/MS/MS study allows the investigation of: 1) whether there is difference in fragmentation pattern of lignin compounds between different alkali metal cations; 2) whether useful and characteristic structural information could be obtained based on the fragmentation of alkali metal adduct cations. This study extended the understanding regarding the behavior of these alkali metal adduct ions in the higher-collision dissociation cells and has shown the promising application in studying the primary structure of lignin with this method.

The first project has deepened the knowledge of ESI ionization behavior of lignin model compounds under different condition as well as explored the potential application in sequencing lignin structures. This study provides a possible solution for natural lignin analysis in the future studies by adduct formation in positive ion mode. As the data shows good ionization response regardless of the structures of lignin compounds in the positive ion mode. As mentioned previously, the increasing interest in structural elucidation of higher molecular weight lignin compounds. One of the challenges in direct characterization of these lignin oligomers is the poor solubility of lignin compounds.

Generally, further derivatizations of lignin oligomers, such as acetylation, are used to improve the solubility. Almost all the derivatization methods targets the free alcoholic groups, including phenolic group. Such derivatization, e.g. acetylation, caps the protic alcohol groups, which closes the possibility to form the deprotonated anion when ionization of higher lignin oligomers. As a result, formation of alkali metal adduct

135

provides alternative ionization strategies in ionizing these lignin oligomers, which are not favored in protonation or deprotonation in the traditional ESI conditions.

Further application of this ionization method on the lignin model compounds containing other unique structures would extend the knowledge regarding the fragmentation pattern in existence of different chemical bonding types. It will be beneficial to generate a library which contains the useful information including the lignin structures and their fragmentation patterns under different ionization conditions. This kind of library will be beneficial for those who intend to study primary structures of unknown lignin compounds by using tandem mass spectrometry in the future.

In the early biomass pretreatment experiments with miscanthus and switch grass using Fenton chemistry, positive response in the cellulose accessibility is observed by comparing the biomass before and after pretreatment53. However, these studies of lignin show that no difference in either the total lignin content of biomass or the relative monomeric composition before and after pretreatment. This question motivated the continuous study of the actual chemical effect of Fenton chemistry during the pretreatment step. In order to simplify the complexity of starting pretreatment target, a synthesized lignin model compound with known structure was used to unambiguously study the role of Fenton chemistry by tracking the well-defined starting lignin model compound instead of using extracted native lignin. The object was to design a new lignin model compound that has not been obtained through chemical synthetic methods.

Because different functional groups in lignin have different reactivities towards pretreatment process, it is beneficial to synthesize such model compound that contains all these important functional groups, such as b-O-4 linkage, free phenolic alcoholic group

136

and a, b-unsaturated alcoholic side chain. The goal of this project is to synthesize such lignin model compound to study the pretreatment process. Aldol-type condensation reaction has previously been applied for synthesizing b-O-4 linkage44-45, 63. However, the model compound designed in this dissertation contains a, b-unsaturated alcoholic side chain, which cannot be obtained using the previous published synthesis methods. This motivated the need to come up with new synthetic strategies. The goal of the second project was to design a total synthetic route of trimer that contains the functionalities mentioned above. Different protecting groups and reagents were investigated in order to successfully obtain compound of interest. This project has not only provided a successful synthetic method but also generated a library of analytical data, such as NMR,

HCD/MS/MS and ESI ionization properties, to characterize the intermediate compounds, which will be valuable for structural elucidation for similar compounds. In the third chapter, the project not only successfully provided a route for synthesizing new model compound, but also provided the feasible synthetic strategy that was applicable for introducing structural varieties in the model compounds. Those structural diversities could be different monomeric sequence composition and the number of monomeric units within the oligomers in the future application. This method provided a potential extension in developing an automatic synthesis technique.

As the overall goal was to better understand the function of Fenton chemistry towards lignin based on the observation of the natural biomass discussed previously. The objective of the third project was to develop mass spectrometry-based analytical methods for studying Fenton chemistry on the synthetic trimeric model compound. The strong oxidant, hydroxyl radical, was the active reagent in the degradation of lignin, however,

137

due to the randomness and non-specificity of oxidation using hydroxyl radical, complex post pretreatment product profile was expected. In order to study the complex sample composition, mass spectrometry (MS) coupled with high-performance liquid chromatography (HPLC) was used to provide another dimension of separation along with mass spectrometric data of the complex pretreatment products. High-resolution accurate- mass mass data were obtained using Thermo Q Exactive Orbitrap mass spectrometer. The high resolution and accurate mass measurement allowed the assignment of elemental composition to the unknown ions and reduced the background interference ions during data processing. Due to the non-specificity of the Fenton pretreatment products, similar data mining strategy used in metabolic research was applied in chapter 4. Bioinformatics tools, such as XCMS and MZmine2, are capable to process the raw data and generate a list of features from raw LC-MS data. Each feature is a chromatographic peak containing information, such as mass-to-charge, retention time, peak area and etc. The features were detected according to the parameters set during data processing with MZmine2 (in this dissertation). This is a time-efficient way for untargeted analysis when no specific targeted information is available. By generating a list of features that meet the requirement, one can eliminate the numbers of ions, which are not likely to be the ions of interest. Because of the high-resolution accurate-mass mass spectrometric analysis in the study, twenty features were selected with a higher possibility to be related with lignin model compounds. The concentration change curve of starting trimer over 5hrs showed that the degradation/modification of trimer in the presence of Fenton reagent was not a linear chemical reaction. The trimer concentration was dropping rapidly in the first 5min and slowly went to completion in about 5hrs. The time course curves of the other features

138

were different from each other. For the trimeric unknown features, similar degradation trend with the starting trimer was observed. The time course measurements of a few monomeric and dimeric features showed that these possible pretreatment products were generated once the reaction began. The products were also subjected to further degradation in the Fenton reaction condition. Among all the unknown features, three features were confirmed by comparing the LC/MS as well as HCD/MS/MS spectra with authentic compounds. These three compounds were vanillin, vanillic acid and dimer G(b-

O-4)G (rt = 10.54 min). The high-resolution accurate-mass data allowed the assignment of elemental composition to the unknown features. The corresponding HCD/MS/MS fragment ions of unknown features provided valuable information in proposing the possible chemical structures based on the formula and fragmentation patterns. However, for unambiguous structural assignment of these post-pretreated compounds, further confirmation of those putative structures is required through comparing with authentic compounds when available. In contrast to the lignin study of Fenton pretreatment of biomass, the results of this project have shown that solution phase Fenton chemistry can degrade a lignin model compound very effectively. One possible explanation for this observation was that the physical and chemical complexity of lignin model compound was much simpler than that of native lignin structure. In the native lignin, the existence of various chemical linkages, such as cyclic C-C condense bonding types and intermolecular linkages to hemicellulose, brings different levels of the recalcitrance during Fenton pretreatment. Moreover, hydroxyl radical is very strong oxidant that oxidizing chemicals in close proximity due to its short lifetime, while complex cell wall structures in the biomass decrease the exposure of lignin to the active hydroxyl radicals. The hypothesis of

139

this study is that the Fenton chemistry caused partial breakdown of lignin across whole biomass resulting in the increase of cellulose accessibility. This project has developed

LC/MS/MS methods for analyzing lignin related compounds. This analytical method can be further applied to study other post-pretreatment products generated with different pretreatment methods.

My research has explored and developed mass spectrometry-based analytical methods in depicting the structures of lignin related compounds, especially for the ones that have molecular weight higher than trimers. The first project shows the potential ionization conditions that can provide good ionization response of a variety of lignin oligomers regardless of the primary structures using ESI-MS. The HCD/MS/MS experiments offered unique fragmentation properties in lignin structural elucidation method.

Continuing to synthesize various lignin model compounds with the new synthetic strategy can largely expand the library of lignin model compounds, which will be beneficial to systematically generate useful mass spectral database for structural elucidation of lignin. Moreover, it will be worthwhile to test whether the synthetic lignin model compounds have potential application in material science or chemical engineering area. This brings a promising strategy to increase the industrial value of lignin, as lignin is a green and sustainable bio-resource. The untargeted LC/MS method presented in this dissertation provided a new high-throughput analytical strategy in studying complex lignin pretreatment products. The overall objective of this dissertation is to provide deeper understanding and potential solutions to answer the challenging questions

140

regarding the structure of large lignin molecules or lignin oligomers with high-resolution mass spectrometry-based analytical methods.

141

References

1. EIA, Annual Energy Outlook. 2015. 2. M. H. Langholtz, B. J. S., L. M. Eaton 2016 Billion-Ton Report: Advancing Domestic Resources for a Thriving Bioeconomy; ORNL/TM-2016/160; 2016; p 448. 3. Carroll, A.; Somerville, C., Cellulosic Biofuels. Annual Review of Plant Biology 2009, 60 (1), 165-182. 4. Balan, V., Current Challenges in Commercially Producing Biofuels from Lignocellulosic Biomass. ISRN Biotechnology 2014, 2014, 31. 5. Office, B. T. BIOMASS FEEDSTOCKS. http://energy.gov/eere/bioenergy/feedstock-supply. 6. Haghighi Mood, S.; Hossein Golfeshan, A.; Tabatabaei, M.; Salehi Jouzani, G.; Najafi, G. H.; Gholami, M.; Ardjmand, M., Lignocellulosic biomass to bioethanol, a comprehensive review with a focus on pretreatment. Renewable and Sustainable Energy Reviews 2013, 27, 77-93. 7. Heitner, C.; Dimmel, D.; Schmidt, J. A., Lignin and advances in chemistry. 2010. 8. Energy, U. D. o. Biomass Conversion. 9. Gross, J., Mass Spectrometry. Springer Berlin Heidelberg: 2011; p 21-66. 10. Liuni, P.; Wilson, D. J., Understanding and optimizing electrospray ionization techniques for proteomic analysis. Expert Review of Proteomics 2011, 8 (2), 197-209. 11. Konermann, L.; Ahadi, E.; Rodriguez, A. D.; Vahidi, S., Unraveling the Mechanism of Electrospray Ionization. Analytical Chemistry 2012, 85 (1), 2-9. 12. Hu, Q.; Noll, R. J.; Li, H.; Makarov, A.; Hardman, M.; Graham Cooks, R., The Orbitrap: a new mass spectrometer. Journal of Mass Spectrometry 2005, 40 (4), 430-443. 13. Makarov, A., Electrostatic Axially Harmonic Orbital Trapping: A High- Performance Technique of Mass Analysis. Analytical Chemistry 2000, 72 (6), 1156-1162. 14. Rathahao-Paris, E.; Alves, S.; Junot, C.; Tabet, J.-C., High resolution mass spectrometry for structural identification of metabolites in metabolomics. Metabolomics 2015, 12 (1), 10. 15. Vanholme, R.; Demedts, B.; Morreel, K.; Ralph, J.; Boerjan, W., Lignin Biosynthesis and Structure. Plant Physiology 2010, 153 (3), 895-905. 16. Freudenberg, K., Constitution and biosynthesis of lignin. New York : Springer- Verlag New York: New York], 1968. 17. Freudenberg, K.; Nimz, H., Guajacylglycerin-β-pinoresinoläther, ein Dehydrierungsprodukt des Coniferylakohols. Chemische Berichte 1962, 95 (8), 2057- 2062. 18. Cathala, B.; Saake, B.; Faix, O.; Monties, B., Evaluation of the reproducibility of the synthesis of dehydrogenation polymer models of lignin. Polymer Degradation and Stability 1998, 59 (1–3), 65-69. 19. Saake, B.; Argyropoulos, D. S.; Beinhoff, O.; Faix, O., A comparison of lignin polymer models (DHPs) and lignins by 31P NMR spectroscopy. Phytochemistry 1996, 43 (2), 499-507. 20. De Angelis, F.; Nicoletti, R.; Spreti, N.; Verì, F., A New In Vitro Model of Lignin Biosynthesis. Angewandte Chemie International Edition 1999, 38 (9), 1283-1285.

142

21. Reale, S.; Attanasio, F.; Spreti, N.; De Angelis, F., Lignin Chemistry: Biosynthetic Study and Structural Characterisation of Coniferyl Alcohol Oligomers Formed In Vitro in a Micellar Environment. Chemistry – A European Journal 2010, 16 (20), 6077-6087. 22. Reale, S.; Di Tullio, A.; Spreti, N.; De Angelis, F., Mass spectrometry in the biosynthetic and structural investigation of lignins. Mass Spectrometry Reviews 2004, 23 (2), 87-126. 23. Metzger, J. O.; Bicke, C.; Faix, O.; Tuszynski, W.; Angermann, R.; Karas, M.; Strupat, K., Matrix-Assisted Laser Desorption Mass Spectrometry of Lignins**. Angewandte Chemie International Edition in English 1992, 31 (6), 762-764. 24. Bocchini, P.; Galletti, G. C.; Seraglia, R.; Traldi, P.; Camarero, S.; Martinez, A. T., Matrix-assisted Laser Desorption/Ionization Mass Spectrometry of Natural and Synthetic Lignin. Rapid Communications in Mass Spectrometry 1996, 10 (9), 1144-1147. 25. De Angelis, F.; Fregonese, P.; Verì, F., Structural Investigation of Synthetic Lignins by Matrix-assisted Laser Desorption/Ionization Time-of-flight Mass Spectrometry. Rapid Communications in Mass Spectrometry 1996, 10 (10), 1304-1308. 26. Richel, A.; Vanderghem, C.; Simon, M.; Wathelet, B.; Paquot, M., Evaluation of Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Second-Generation Lignin Analysis. Analytical Chemistry Insights 2012, 7, 79-89. 27. Yoshioka, K.; Ando, D.; Watanabe, T., A Comparative Study of Matrix- and Nano-assisted Laser Desorption/Ionisation Time-of-Flight Mass Spectrometry of Isolated and Synthetic Lignin. Analysis 2012, 23 (3), 248-253. 28. Evtuguin, D. V.; Domingues, P.; Amado, F. L.; Neto, C. P.; Correia, A. J. F., Electrospray Ionization Mass Spectrometry as a Tool for Lignins Molecular Weight and Structural Characterisation. In Holzforschung, 1999; Vol. 53, p 525. 29. önnerud, H.; Palmblad, M.; Gellerstedt, G., Investigation of Lignin Oligomers Using Electrospray Ionisation Mass Spectrometry. In Holzforschung, 2003; Vol. 57, p 37. 30. Morreel, K.; Dima, O.; Kim, H.; Lu, F.; Niculaes, C.; Vanholme, R.; Dauwe, R.; Goeminne, G.; Inzé, D.; Messens, E.; Ralph, J.; Boerjan, W., Mass Spectrometry-Based Sequencing of Lignin Oligomers. Plant Physiology 2010, 153 (4), 1464-1478. 31. Morreel, K.; Kim, H.; Lu, F.; Dima, O.; Akiyama, T.; Vanholme, R.; Niculaes, C.; Goeminne, G.; Inzé, D.; Messens, E.; Ralph, J.; Boerjan, W., Mass Spectrometry- Based Fragmentation as an Identification Tool in Lignomics. Analytical Chemistry 2010, 82 (19), 8095-8105. 32. Haupert, L. J.; Owen, B. C.; Marcum, C. L.; Jarrell, T. M.; Pulliam, C. J.; Amundson, L. M.; Narra, P.; Aqueel, M. S.; Parsell, T. H.; Abu-Omar, M. M.; Kenttämaa, H. I., Characterization of model compounds of processed lignin and the lignome by using atmospheric pressure ionization tandem mass spectrometry. Fuel 2012, 95 (0), 634-641. 33. Dean, J. C.; Burke, N. L.; Hopkins, J. R.; Redwine, J. G.; Ramachandran, P. V.; McLuckey, S. A.; Zwier, T. S., UV Photofragmentation and IR Spectroscopy of Cold, G- Type β-O-4 and β–β Dilignol–Alkali Metal Complexes: Structure and Linkage- Dependent Photofragmentation. The Journal of Physical Chemistry A 2015, 119 (10), 1917-1932.

143

34. DeBlase, A. F.; Dziekonski, E. T.; Hopkins, J. R.; Burke, N. L.; Sheng, H.; Kenttämaa, H. I.; McLuckey, S. A.; Zwier, T. S., Alkali Cation Chelation in Cold β-O-4 Tetralignol Complexes. The Journal of Physical Chemistry A 2016, 120 (36), 7152-7166. 35. Feitosa, E.; Savério Brazolin, M. R.; Zumstein Georgetto Naal, R. M.; Perpétua Freire de Morais Del Lama, M.; Lopes, J. R.; Loh, W.; Vasilescu, M., Structural organization of cetyltrimethylammonium sulfate in aqueous solution: The effect of Na2SO4. Journal of Colloid and Interface Science 2006, 299 (2), 883-889. 36. Ludley, F. H.; Ralph, J., Improved Preparation of Coniferyl and Sinapyl Alcohols. Journal of Agricultural and Food Chemistry 1996, 44 (10), 2942-2943. 37. Bunton, C. A.; Gan, L. H.; Moffatt, J. R.; Romsted, L. S.; Savelli, G., Reactions in micelles of cetyltrimethylammonium hydroxide. Test of the pseudophase model for kinetics. The Journal of Physical Chemistry 1981, 85 (26), 4118-4125. 38. Bunzel, M.; Heuermann, B.; Kim, H.; Ralph, J., Peroxidase-Catalyzed Oligomerization of Ferulic Acid Esters. Journal of Agricultural and Food Chemistry 2008, 56 (21), 10368-10375. 39. Ralph, J.; Landucci, L. L., NMR of Lignins. In Lignin and Lignans: Advances in Chemistry, 2010. 40. Ralph, S. R. L. L. J., NMR Database of Lignin and Cell Wall Model Compounds. 2004. 41. Kishimoto, T.; Uraki, Y.; Ubukata, M., Chemical synthesis of [small beta]-O-4 type artificial lignin. Organic & Biomolecular Chemistry 2006, 4 (7), 1343-1347. 42. Kishimoto, T.; Uraki, Y.; Ubukata, M., Easy synthesis of [small beta]-O-4 type lignin related polymers. Organic & Biomolecular Chemistry 2005, 3 (6), 1067-1073. 43. Kishimoto, T.; Uraki, Y.; Ubukata, M., Synthesis of [small beta]-O-4-type artificial lignin polymers and their analysis by NMR spectroscopy. Organic & Biomolecular Chemistry 2008, 6 (16), 2982-2987. 44. Ciofi-Baffoni, S.; Banci, L.; Brandi, A., Synthesis of oligomeric mimics of lignin. Journal of the Chemical Society, Perkin Transactions 1 1998, (19), 3207-3218. 45. Katahira, R.; Kamitakahara, H.; Takano, T.; Nakatsubo, F., Synthesis of β-O-4 type oligomeric lignin model compound by the nucleophilic addition of carbanion to the aldehyde group. J Wood Sci 2006, 52 (3), 255-260. 46. Smith, A. B.; Kanoh, N.; Ishiyama, H.; Minakawa, N.; Rainier, J. D.; Hartz, R. A.; Cho, Y. S.; Cui, H.; Moser, W. H., Tremorgenic Indole Alkaloids. The Total Synthesis of (−)-Penitrem D. Journal of the American Chemical Society 2003, 125 (27), 8228-8237. 47. Doi, H.; Sakai, T.; Iguchi, M.; Yamada, K.-i.; Tomioka, K., Chiral Ligand- Controlled Asymmetric Conjugate Addition of Lithium Amides to Enoates. Journal of the American Chemical Society 2003, 125 (10), 2886-2887. 48. Kumar, P.; Barrett, D. M.; Delwiche, M. J.; Stroeve, P., Methods for Pretreatment of Lignocellulosic Biomass for Efficient Hydrolysis and Biofuel Production. Industrial & Engineering Chemistry Research 2009, 48 (8), 3713-3729. 49. Arantes, V.; Jellison, J.; Goodell, B., Peculiarities of brown-rot fungi and biochemical Fenton reaction with regard to their potential as a model for bioprocessing biomass. Appl. Microbiol. Biotechnol. 2012, 94 (2), 323-338. 50. Arantes, V.; Milagres, A. F.; Filley, T.; Goodell, B., Lignocellulosic polysaccharides and lignin degradation by wood decay fungi: the relevance of

144

nonenzymatic Fenton-based reactions. Journal of Industrial Microbiology & Biotechnology 2011, 38 (4), 541-555. 51. Michalska, K.; Miazek, K.; Krzystek, L.; Ledakowicz, S., Influence of pretreatment with Fenton’s reagent on biogas production and methane yield from lignocellulosic biomass. Bioresour. Technol. 2012, 119 (0), 72-78. 52. Neyens, E.; Baeyens, J., A review of classic Fenton’s peroxidation as an advanced oxidation technique. Journal of Hazardous Materials 2003, 98 (1–3), 33-50. 53. Kato, D. M.; Elía, N.; Flythe, M.; Lynn, B. C., Pretreatment of lignocellulosic biomass using Fenton chemistry. Bioresour. Technol. 2014, 162, 273-278. 54. Palmer, J. M.; Evans, C. S., The Enzymic Degradation of Lignin by White-Rot Fungi. Philosophical Transactions of the Royal Society of London. B, Biological Sciences 1983, 300 (1100), 293-303. 55. Smith, C. A.; Want, E. J.; O'Maille, G.; Abagyan, R.; Siuzdak, G., XCMS: Processing Mass Spectrometry Data for Metabolite Profiling Using Nonlinear Peak Alignment, Matching, and Identification. Analytical Chemistry 2006, 78 (3), 779-787. 56. Katajamaa, M.; Orešič, M., Processing methods for differential analysis of LC/MS profile data. BMC Bioinformatics 2005, 6 (1), 1-12. 57. Katajamaa, M.; Orešič, M., Data processing for mass spectrometry-based metabolomics. Journal of Chromatography A 2007, 1158 (1–2), 318-328. 58. Pluskal, T.; Castillo, S.; Villar-Briones, A.; Orešič, M., MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinformatics 2010, 11 (1), 1-11. 59. Negreira, N.; Regueiro, J.; López de Alda, M.; Barceló, D., Degradation of the anticancer drug erlotinib during water chlorination: Non-targeted approach for the identification of transformation products. Water Research 2015, 85, 103-113. 60. Morreel, K.; Ralph, J.; Kim, H.; Lu, F.; Goeminne, G.; Ralph, S.; Messens, E.; Boerjan, W., Profiling of Oligolignols Reveals Monolignol Coupling Conditions in Lignifying Poplar Xylem. Plant Physiology 2004, 136 (3), 3537-3549. 61. Ruppert, G.; Bauer, R.; Heisler, G.; Novalic, S., Mineralization of cyclic organic water contaminants by the photo-fenton reaction — Influence of structure and substituents. Chemosphere 1993, 27 (8), 1339-1347. 62. Banoub, J.; Delmas, G.-H.; Joly, N.; Mackenzie, G.; Cachet, N.; Benjelloun- Mlayah, B.; Delmas, M., A critique on the structural analysis of lignins and application of novel tandem mass spectrometric strategies to determine lignin sequencing. Journal of Mass Spectrometry 2015, 50 (1), 5-48. 63. Kawai, S.; Okita, K.; Sugishita, K.; Tanaka, A.; Ohash, H., Simple method for synthesizing phenolicβ-O-4 dilignols. J Wood Sci 1999, 45 (5), 440-443.

145

VITA

Fan Huang was born in Pingnan, a small town in Fujian province, southeast of China on September 25, 1988. She stayed in Pingnan till graduated from middle school. She then moved to Fuzhou for high school in 2003. She was enrolled in undergraduate program in University of Science and Technology of China (USTC) in Hefei, Anhui, China, in 2006. Fan did her undergraduate research under the advising of Dr. Yangzhong Liu, where she got to know about mass spectrometry. After she graduated from USTC with a B.S. in Chemistry in 2010, she stayed in Dr. Yangzhong Liu’s group for another year. Fan was enrolled in Department of Chemistry, University of Kentucky in 2011. Fan was interested to study more about mass spectrometry and analytical chemistry, so she joined Dr. Bert C Lynn’s research group to pursue her Ph.D. in analytical chemistry in 2012. She is a member of the American Chemical Society (ACS) and the American Society for Mass Spectrometry (ASMS).

Publications:

1. F. Huang, B.C. Lynn, “Structural effects on the ionization response of lignin model compounds during electrospray ionization” (In process). 2. F. Huang, B.C. Lynn, “Synthesis of b-O-4 trimeric lignin model compound” (In process). 3. Ma, G.; Huang, F.; Pu, X.; Jia, L.; Jiang, T.; Li, L.; Liu, Y., “Identification of [PtCl2(phen)] Binding Modes in Amyloid-β Peptide and the Mechanism of Aggregation Inhibition”. Chemistry – A European Journal, 2011, 17 (41), 11657-11666. 4. Ma, G.; Min, Y.; Huang, F.; Jiang, T.; Liu, Y., “Thioether binding mediates monofunctional platinum antitumor reagents to trans configuration in DNA interactions”. Chemical Communications, 2010, 46 (37), 6938-6940.

Presentations: 1. Characterization of a β-O-4 trimeric lignin model compound and post Fenton chemistry products using HRAM mass spectrometry. Fan Huang; Bert C. Lynn, 64th ASMS Conference on Mass Spectrometry and Allied Topics, San Antonio, TX, June 2016. 2. Structural Effects on the Ionization Response of Lignin Model Compounds during Electrospray Ionization. Fan Huang; Bert C. Lynn, 63rd ASMS Conference on Mass Spectrometry and Allied Topics, St. Louis, MO. June 2015. 3. Elucidation of Synthetic Lignin Oligomers by Tandem Mass Spectrometry. Fan Huang; Bert C. Lynn, 62nd ASMS Conference on Mass Spectrometry and Allied Topics, Baltimore, MD. June 2014.

146

Appendices Nuclear Magnetic Resonance (NMR) spectra for the compounds that were discussed in this dissertation. The NMR spectra were acquired with a Varian Model NOVA400 (400 MHz) spectrometer, otherwise specified.

All the compounds were run in the Acetone-d6, acetone-d6 signal was used as reference (d 2.05ppm). Figure S1. 1D proton NMR spectrum of coniferyl alcohol

Figure S2. 1D proton NMR spectrum of G(4-O-a)G(b-O-4)G (Varian NOVA 600) Figure S3. 2D HSQC NMR spectrum of G(4-O-a)G(b-O-4)G (Varian NOVA 600) Figure S4. 1D proton NMR spectrum of G(b-O-4)G(b-5)G Figure S5. 1D proton NMR spectrum of G(b-O-4)G(b-b)G Figure S6. 2D HSQC NMR spectrum of mixture of G(b-O-4)G(b-5)G and G(b-O-4)G(b- b)G Figure S7. 1D proton NMR spectrum of Compound 4 Figure S8. 1D proton NMR spectrum of Compound 2 Figure S9. 1D proton NMR spectrum of Compound 3 Figure S10. 1D proton NMR spectrum of Compound 5 Figure S11. 1D proton NMR spectrum of Compound 6 Figure S12. 1D proton NMR spectrum of Compound 9 (Varian NOVA 600) Figure S13. 2D HSQC NMR spectrum of Compound 9 (Varian NOVA 600) Figure S14. 1D proton NMR spectrum of Compound 13 Figure S15. 2D HSQC NMR spectrum of Compound 13 Table S1. Deprotonated ion intensity comparison of three trimers Table S2. Ion intensity comparison of three trimers with chloride ion adduct, [M+Cl]- in negative ion mode ESI. Table S3. a) Relative ionization response of G(4-O-a)G(b-O-4)G in different salt solution. b) Cation selectivity of G(4-O-a)G(b-O-4)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl. Table S4. . a) Relative ionization response of G(b-O-4)G(b-5)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-5)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl.

147

Table S5. a) Relative ionization response of G(b-O-4)G(b-b)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-b)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl. Table S6. Positive ion mode ESI response comparison of three trilignols with the presence of different cations.

148

1H1D_GOH_053014.esp 3.85 1.0

0.9

0.8

0.7

0.6

0.5

0.4 Normalized Intensity Normalized

0.3 6.80 7.04 4.22 7.04 0.2 7.70 6.78 6.49 4.21 4.24 3.99 6.26 1.97 6.53 6.88 2.09 0.1 6.22

0 1.00 1.09 1.13 1.13 1.10 1.10 2.25 3.42

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 Chemical Shift (ppm) Figure S1. 1D proton NMR spectrum of coniferyl alcohol

149

1H1d_good_2June2014.esp 3.89 3.82 3.80

0.8

0.7

0.6

0.5

0.4 Normalized Intensity Normalized 0.3 7.06 6.78 7.05 7.19 6.77 4.20 4.17

0.2 6.99 6.75 6.76 5.47 7.01 5.48 3.79 6.80 6.95 6.45 6.50 3.78 6.52 4.56 3.92 6.48 4.57 6.30 6.25 6.75 3.93 6.27 6.22 0.1 3.94 5.44 5.45 5.44 6.26 5.43

0 0.76 1.62 0.89 0.94 0.87 2.54 1.91 1.91 1.15 0.81 3.951.50 2.73 1.38 5.59

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 Chemical Shift (ppm) Figure S2. 1D proton NMR spectrum of G(4-O-a)G(b-O-4)G

150

40

50

60

70

80

90

100

110 Shift F1 Chemical (ppm)

120

130

140

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 F2 Chemical Shift (ppm) Figure S3. 2D HSQC NMR spectrum of G(4-O-a)G(b-O-4)G

151

1H_GDHP_ZT_trimer_13min_021315.esp 3.81

0.45 3.86 0.40

0.35 3.88

0.30 3.80 0.25

0.20 Normalized Intensity Normalized

0.15 4.20 7.10 6.96 3.77 6.88 6.90 3.71 3.75 3.90 6.94 3.70

0.10 6.98 3.53 6.77 6.78 3.51 7.45 7.41 4.89 4.88 6.25 6.51 6.27 4.15 4.16 6.24 4.38 6.82 5.60 0.05

0 0.68 8.45 0.84 1.19 0.80 0.85 2.59 14.72 1.37

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 Chemical Shift (ppm) Figure S4. 1D proton NMR spectrum of G(b-O-4)G(b-5)G

152

1H_GDHP_ZT_trimer_15min_021315.esp

0.25 3.82 3.84

0.20

0.15

0.10 3.89 3.80 Normalized Intensity Normalized 6.99 6.80 4.20 6.78 0.05 7.51 6.83 4.19 7.05 6.75 3.70 4.67 7.09 3.08 7.45 7.10 7.49 3.96 4.64 4.88 4.87

0 1.84 8.41 0.88 1.81 2.96 11.00 1.15

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 Chemical Shift (ppm) Figure S5. 1D proton NMR spectrum of G(b-O-4)G(b-b)G

153

20

40

60

80

100 Shift F1 Chemical (ppm)

120

7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 F2 Chemical Shift (ppm) Figure S6. 2D HSQC NMR spectrum of mixture of G(b-O-4)G(b-5)G and G(b-O-4)G(b- b)G

154

1H1D_Vtips_column_083016.esp 1.12 1.0

0.9 1.10 0.8

0.7

0.6

0.5 3.92 1.04 0.4 Normalized Intensity Normalized

0.3

0.2 9.88 2.05 7.47 1.32 7.47 2.06 2.05 7.09 1.03 7.46

0.1 7.07 1.34 1.02

0 0.81 1.77 0.88 3.10 2.69 18.00

10 9 8 7 6 5 4 3 2 1 Chemical Shift (ppm) Figure S7. 2D HSQC NMR spectrum of mixture of G(b-O-4)G(b-5)G and G(b-O-4)G(b- b)G

155

1H1D_EV_090716.esp 3.92 1.0

0.9

0.8

0.7 4.87 0.6 1.25 0.5

0.4 Normalized Intensity Normalized

0.3 1.23 1.27 9.87 4.23

0.2 7.09 7.46 2.05 7.07 7.46 7.49 2.04 7.51 2.06 0.1

0 0.59 1.70 0.91 2.00 1.78 3.01 2.40

10 9 8 7 6 5 4 3 2 1 0 Chemical Shift (ppm) Figure S8. 1D proton NMR spectrum of Compound 2

156

1H1D_EVG_02012017.esp 3.84 1.0

0.9 1.24 0.8

0.7 4.71

0.6

0.5

0.4 1.22 1.26 Normalized Intensity Normalized 4.20 3.95 0.3 4.19 5.67 6.91 0.2 3.96 7.08 7.08 6.89 2.05 4.22 2.04 2.06 2.84 6.99

0.1 4.09 4.74

0 1.02 1.03 0.97 0.98 1.90 2.01 1.38 1.71 3.08 2.50

7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 Chemical Shift (ppm) Figure S9. 1D proton NMR spectrum of Compound 3

157

1H1D_EG_02012017.esp 3.92 1.0

0.9

0.8 1.27 0.7

0.6

0.5

0.4 4.20 Normalized Intensity Normalized 4.18 1.25 1.29 0.3 6.41 6.37 6.88

0.2 6.86 3.86 7.33 7.57 7.61 4.21 2.04 4.16 0.87 7.12 0.1 7.14 0.89 0.86 8.12

0 0.92 1.14 1.15 1.15 1.14 1.13 2.22 3.46 2.94

8.0 7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 Chemical Shift (ppm) Figure S10. 1D proton NMR spectrum of Compound 5

158

1H1D_EGE_02012017.esp 3.91 1.0

0.9 1.25 0.8

0.7 4.77

0.6 1.28

0.5 4.20 0.4 4.19 1.29 1.23 4.20 Normalized Intensity Normalized 4.22 6.46

0.3 6.42 3.73 6.94

0.2 6.92 2.05 3.90 7.35 7.36 7.58 7.62 4.18 4.23 2.04 7.16 2.06 0.1 7.15

0 1.10 1.11 1.131.10 1.09 2.19 4.19 3.31 6.00

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 Chemical Shift (ppm) Figure S11. 1D proton NMR spectrum of Compound 6

159

1H1D_VtEt_031416.esp 0.88 1.0

0.9 1.09 0.8

0.7

0.6

0.5

0.4 3.87 Normalized Intensity Normalized 3.85 0.10 -0.09

0.3 1.24

0.2 2.05 1.25 9.83 2.06 2.04 4.19 7.40 7.40 6.87 0.87 4.82 2.06 4.80 6.85 2.04

0.1 7.40 7.20 6.94 4.21 7.41 7.21 6.91 5.14 5.15

0 0.94 2.06 0.99 0.97 0.96 0.96 1.010.91 1.976.06 6.40 18.319.33 2.85 2.89

11 10 9 8 7 6 5 4 3 2 1 0 -1 Chemical Shift (ppm) Figure S12. 1D proton NMR spectrum of Compound 9

160

0

20

40

60

80

100

120 F1 Chemical (ppm) F1 Chemical Shift

140

160

180

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0 F2 Chemical Shift (ppm) Figure S13. 2D HSQC NMR spectrum of Compound 9

161

1H1D_GGG_OH_020617.esp 3.79 3.78 3.78

0.15 3.80 3.81 3.86

0.10 3.77 Normalized Intensity Normalized 6.86 6.75 6.87 7.07 6.84 6.88 0.05 4.16 6.85 4.15 4.15 6.73 3.86 4.87 7.10 4.85 7.02 7.10 4.88 4.89 3.77 3.69 7.06 6.26 6.45 4.28 4.28 4.27 6.45 4.91 7.00 6.22 4.32 4.34 6.73 6.49 4.92 6.49 3.88 6.27 6.83 6.25 3.66 6.23 3.89

0 3.28 4.36 0.98 0.93 0.93 1.95 1.85 1.94 10.432.11

7.0 6.5 6.0 5.5 5.0 4.5 4.0 Chemical Shift (ppm)

Figure S14. 1D proton NMR spectrum of Compound 13

162

40

60

80

100

120 (ppm) F1 Chemical Shift

140

7.5 7.0 6.5 6.0 5.5 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 F2 Chemical Shift (ppm) Figure S15. 2D HSQC NMR spectrum of Compound 13

163

Table S1. Deprotonated ion intensity comparison of three trimers

Trimer Average STDEV %RSD Intensity G(4-O-a)G(b-O-4)G 13908.39 5974.62 42.96 G(b-O-4)G(b-5)G 282557.90 21803.84 7.72 G(b-O-4)G(b-b)G 470547.78 60802.74 12.92

164

Table S2. Ion intensity comparison of three trimers with chloride ion adduct, [M+Cl]- in negative ion mode ESI.

Trimer Average STDEV %RSD Intensity G(4-O-a)G(b-O-4)G 1914943.12 55030.00 2.87 G(b-O-4)G(b-5)G 1147487.21 46184.23 4.02 G(b-O-4)G(b-b)G 2489207.77 566208.40 22.75

165

Table S3. a) Relative ionization response of G(4-O-a)G(b-O-4)G in different salt solution. b) Cation selectivity of G(4-O-a)G(b-O-4)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl. a) Different Average Ion STDEV %RSD Cations Response NH4 42.04 7.91 18.82 Li 93.04 6.79 7.30 Na 100.00 12.23 12.23 K 45.76 2.84 6.20 Rb 32.45 0.95 2.94 b) Different Average Ion STDEV %RSD Cations Response NH4 19.46 0.83 4.26 Li 100.00 4.50 4.50 Na 86.92 4.49 5.16 K 41.94 1.93 4.59 Rb 26.01 0.84 3.22

166

Table S4. . a) Relative ionization response of G(b-O-4)G(b-5)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-5)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl. a) Different Average Ion STDEV %RSD Cations Response NH4 94.00 2.35 2.50 Li 81.60 13.55 16.61 Na 100.00 5.27 5.27 K 89.00 4.61 5.18 Rb 63.27 1.28 2.03 b) Different Average Ion STDEV %RSD Cations Response NH4 43.72 5.98 13.68 Li 91.28 16.25 17.81 Na 100.00 15.68 15.68 K 80.93 10.11 12.49 Rb 59.70 7.03 11.77

167

Table S5. a) Relative ionization response of G(b-O-4)G(b-b)G in different salt solution. b) Cation selectivity of G(b-O-4)G(b-b)G in solution of equal molar of NH4Cl/LiCl/NaCl/KCl/RbCl. a) Different Average Ion STDEV %RSD Cations Response NH4 46.82 4.26 9.09 Li 93.46 4.09 4.37 Na 100.00 7.97 7.97 K 72.78 4.47 6.14 Rb 76.55 5.54 7.23 b) Different Average Ion STDEV %RSD Cations Response NH4 29.21 2.80 9.59 Li 100.00 9.31 9.31 Na 96.80 9.25 9.55 K 55.69 4.52 8.12 Rb 33.00 2.25 6.81

168

Table S6. Positive ion mode ESI response comparison of three trilignols with the presence of different cations.

Trimer Cations Average Ion STDEV %RSD response G(4-O-a)G(b-O-4)G NH4 42.04 7.91 18.82 Li 93.04 6.79 7.30 Na 100.00 12.23 12.23 K 45.76 2.84 6.20 Rb 32.45 0.95 2.94 G(b-O-4)G(b-5)G NH4 10.59 0.26 2.50 Li 9.20 1.53 16.61 Na 11.27 0.59 5.27 K 10.03 0.52 5.18 Rb 7.13 0.14 2.03 G(b-O-4)G(b-b)G NH4 13.28 1.21 9.09 Li 26.52 1.16 4.37 Na 28.37 2.26 7.97 K 20.65 1.27 6.14 Rb 21.72 1.57 7.23

169