Transition Metal and Lanthanide Complexes for Catalysis and Protein Structure Determination
Thesis submitted in partial fulfillment of the requirements of:
Doctor of Philosophy
By
Bradley Yat Wah Man
The University of New South Wales
School of Chemistry
Supervisors:
Professor Barbara Messerle
Professor Gottfried Otting
June 5, 2010
Dedicated to the memory of my grandmother Man Wen Fang, whose hopes has inspired me through the tough times.
Preface
This thesis is a report of original research undertaken by the author and is submitted for admission to the degree of Doctor of Philosophy at the University of New South Wales. This work was completed in the School of Chemistry at the University of New South Wales during the period of March 2006 to March 2010. The works and results presented in this thesis are those of the author, unless otherwise acknowledged.
Sections of this work that have been published:
3-Mercapto-2,6-pyridinedicarboxylic acid, A Small Lanthanide Binding Tag for Protein Studies by NMR Spectroscopy
Bradley Man, Xun-Cheng Su, Haobo Liang, Shane Simonsen, Thomas Huber, Barbara A. Messerle and Gottfried Otting
Chemistry: A European Journal, 2010, 16(12), 3827 – 3832
A Dipicolinic Acid Tag for Rigid Lanthanide Tagging of Proteins and Paramagnetic NMR Spectroscopy
Xun-Cheng Su, Bradley Man, Sophie Bereen, Haobo Liang, Shane Simonsen, Christophe Schmitz, Thomas Huber, Barbara A. Messerle and Gottfried Otting
Journal of the American Chemical Society, 2008, 130(2), 10486 – 10487
Sections of this work have been presented at scientific conferences:
Functionalized Rhodium(I) complexes: structure and catalysis
Bradley Man, Barbara Messerle.
i
Australian Organometallic Meeting 5, Sydney, New South Wales 2010. Oral Presentation.
A New Thiol Modified Dipicolinic Acid Tag for Protein Studies with Pseudocontact Shifts
Bradley Man, Barbara Messerle, Gottfried Otting, Shane Simonsen, Xun-Cheng Su
7th Biannual Meeting of the Australian and New Zealand Nuclear Magnetic Resonance Society (ANZMAG), Couran Cove, Queensland 2008. Poster presentation.
Small Molecular Tags for Protein Structure Determination
Bradley Man, Barbara Messerle, Gottfried Otting, Xun-Cheng Su
23rd International Conference on Organometallic Chemistry, Rennes, France 2008. Poster Presentation.
Paramagnetic Complexes for Protein Structure Determination
Bradley Man, Barbara Messerle, Gottfried Otting, Shane Simonsen, Xun-Cheng Su
21st International Congress for Heterocyclic Chemistry, Sydney, New South Wales 2007. Poster Presentation.
Paramagnetic Complexes for Protein Structure Determination
Bradley Man, Barbara Messerle, Gottfried Otting, Shane Simonsen, Xun-Cheng Su
Conference of the Inorganic Chemistry Division ,Hobart, Tasmania 2007. Poster Presentation.
ii
Abstract
“A short saying often contains much wisdom”
Sophocles
The aims of this thesis are to explore the application of transition metal and lanthanide complexes in the areas of structural biology and catalysis. This PhD thesis is therefore divided into two sections; the emphasis of the first section is on the synthesis and development of small organic molecules as paramagnetic probes with applications in protein structure refinement. The second section is on the synthesis and characterization of a series of rhodium(I) complexes bearing either functionalized tridentate pyrazolyl or tridentate imidazolyl donor ligands. The reactivity of this series of rhodium(I) complexes as catalysts for the intramolecular cyclization of alkynoic acids was investigated.
The first section of the thesis involved the synthesis of two thiol modified dipicolinic acid based tags, 4-mercaptomethyl-2,6-pyridinedicarboxylic acid (4MMDPA, 5) and 3-mercapto-2,6-pyridinedicarboxylic acid (3MDPA, 9) were described. The ligands 4MMDPA (5) and 3MDPA (9) were attached to the N-terminal domain of the arginine repressor from E. coli. (ArgN). Lanthanide complexes of the ArgN-4MMDPA adduct were synthesized in situ and the efficiency of the ligand (5) as a paramagnetic probe was assessed using 15N-HSQC spectra. The HSQC spectra showed that the 4MMDPA- Ln3+ complex was able to rigidly bind to the protein, the first ligand to be reported with rigid binding with only a single attachment point. Similar studies on the lanthanide and transition metal complexes of the ArgN-3MDPA adduct showed that the 3MDPA (9) tag was able to bind to both lanthanide and transition metal ions. Due to the reduced magnitude of the tensors observed for the 3MDPA tag (9), the rigidity of the binding could not be adequately assessed. The incorporation of unnatural amino acids in proteins via a modified protein expression system was proposed as an alternative method to introduce paramagnetic labels without the need of post translational
iii
modifications. Synthetic routes to a series of unnatural amino acids 11, 15 and 19 based on previously described lanthanide and transition metal binding motifs were developed.
The second half of the project describes the synthesis and the characterization of a series of rhodium(I) complexes bearing the tris(pyrazolyl)toluidine ligand (p-tpt, 24a; o-tpt, 24b) and the tris(N-methylimidazolyl)methanol ligand (tim, 23). The structures of the rhodium(I) complexes containing these ligands (27, 28 and 29b) in the solid state showed that the ligands were all bound to the rhodium centre in a bidentate (2) binding mode centre via two of the three available nitrogen donors. In solution, fluxional behavior was observed in the 1H NMR spectrum for the rhodium(I) complexes bearing the tris(pyrazolyl)toluidine ligands (27 – 29). Detailed analysis of the rhodium(I) complexes bearing the o-tris(pyrazolyl)toluidine ligand (27b, 28b and 29b) in the solution state at low temperatures showed that there is restricted rotation of the aryl ring about the C-C bond between the aryl ring and the bridging carbon, leading to observation of two species in the 1H NMR spectrum.
The application of rhodium(I) complexes bearing pyrazolyl donors (29a and 29b) and imidazolyl donors (37 and 40) as catalysts for the intramolecular cyclization of aliphatic and aromatic alkynoic acids was investigated. It was found that rhodium(I) complexes bearing imidazolyl donors 37 and 40 were highly active catalysts for the intramolecular cyclization of aliphatic alkynoic acids. Despite the structural similarities between the rhodium(I) imidazolyl complexes containing the bidentate and tridentate ligands, differences in catalytic activity were observed, attributed to the interference of the unbound imidazolyl donor with the substrate during the catalytic cycle. The rhodium(I) complex bearing the pyrazolyl donors (29a) was found to be a highly active catalyst for the intramolecular cyclization of terminal aromatic alkynoic acids, while rhodium(I) complexes bearing bidentate imidazolyl ligands were highly active catalysts for the intramolecular cyclization of non-terminal aromatic alkynoic acids. All rhodium(I) complexes (29a, 37 and 40) investigated as catalysts for the intramolecular cyclization of both non-terminal and terminal aromatic alkynoic acids displayed high level of regioselectivity in the formation of exo-dig vs. endo-dig product. A study of the dependence of the regioselectivity and the efficiency of cyclization on the electronic
iv
nature of the substrates indicated that electron withdrawing groups at the terminal alkyne substituent generally increased the rates of cyclization. The regioselectivity of the cyclization was also influenced by the relative difference in the electron density between the alkyne carbons.
v
Acknowledgements
“Dreams take long term commitment. From the first to the last we need focus, discipline, persistence, and the ability to keep sight of the vision of what we are slowly creating.”
Dennis Wholey
Each chapter starts off with a quote, and each describing the theme of the chapter. The quote for the acknowledgement more or less summarizes this entire volume of work. Along the way, this thesis felt like folly and many times along the way, the urge to throw it all in was high. Through the support of family, friends and a bit of positive thinking, it all came through in the end.
First I would like to thank my supervisors Prof. Barbara Messerle and Prof. Gottfried Otting for giving me such an opportunity to work on such a wonderful project. Without their tireless guidance and support this project could not have been possible. I would acknowledge the contributions the BAM and GO research group has offered during the PhD. In particular, I would like to thank Dr Jason Harper, Dr Jim Hook, Dr Donald Thomas, Dr Richard Hodgson, Dr Rebecca Wilson, Dr Shane Simonsen and Dr Laurent Poorters. These wonderful people always found the time to teach me everything they knew and without them, I could not have become the chemist I am today. They have also on numerous occasions provided many words of encouragement and wisdom, without which I could not have completed this PhD.
I would like to thank all of my friends, especially Debbie Mackay, Chris Haines, and Richard Parry, Murray Adams, Eric Majka, Matthew Stone, Alison Manion and Kate Odenthal. I would like to thank Richard for showing me the strength to get through some of the toughest times time I had in the last two years. I would like to thank Chris, for showing me that life should be viewed for all its potentials, not for what went
vi
wrong. Deb, Murray and Alison were great friends during my PhD, always offering advice, always happy to take care of a friend going through tough times, and most importantly, always reminding me what a bad car the McLaren was. I would like to especially thank Kate, Alison, Mark, Sandra, Jim and Deb who spent many tireless hours deciphering my ramblings, also known as proof reading.
No PhD could ever be completed without the support of the School of Chemistry, and its vast network of both technical and administrative staff. I would like to thank the wonderful help that the UNSW NMR Facility provided. I would like to thank Dr Ruhu Qi and the Mass Spectroscopy Facility at the Research School of Chemistry, ANU for their support and help.
Last but definitely not least, I would like to thank the efforts of my parents and family who without their wonderful support this work could not have been possible. On that note, I would like to finish this acknowledgement with….
“Its Murad’s fault…”
Bradley Man
Saturday, June 05, 2010
vii
Abbreviations
“Why is abbreviation such a long word?”
Unknown
3MDPA – 3-mercapto-2,6-pyridinedicarboxylic acid
4MMDPA – 4-mercaptomethyl-2,6-pyridinedicarboxylic acid
ArgN – N-terminal domain of the arginine repressor from E. Coli.
BArF – tetrakis(3,5-bis(trifluoromethylphenyl))borate bim – bis(N-methylimidazolyl)methane bpm – bis(pyrazolyl)methane
COD – 1,5 cyclooctadiene
COSY – correlation spectroscopy
DCM – dichloromethane
DPA – 2,6-pyridinedicarboxylic acid
DTNB – 5,5’-dithio-bis(2-nitrobenzoic acid)
DTPA – diethylenetriaminopentaacetic acid
FPLC – fast protein liquid chromatography
HMBC – heteronuclear multiple bond coherence spectroscopy
HSQC – heteronuclear single quantum coherence spectroscopy mRNA – messenger ribonucleic acid
NBD – 2,5 norbornadiene
NMR – nuclear magnetic resonance
viii
nOe – nuclear Overhauser effect
NOESY – nuclear Overhauser effect spectroscopy
PCS – pseudocontact shift
PRE – paramagnetic relaxation enhancement
RACS – residual anisotropic chemical shift
RDC – residual dipolar coupling
RDS – rate determining step tdmpm – tris(3,5-dimethypyrazolyl)methane
THF – tetrahydrofuran tim – tris(N-methylimidazolyl)methanol
TNB- - 5-thio-2-nitrobenzoate tpm – tris(pyrazolyl)methane tpt – tris(pyrazolyl)toluidine tRNA – transfer ribonucleic acid
UTR – universal tensor representation.
ix
Table of Contents
Preface ...... i
Abstract ...... iii
Acknowledgements ...... vi
Abbreviations ...... viii
Table of Contents ...... x
List of Figures ...... xviii
List of Tables ...... xxvi
List of Schemes ...... xxviii
Chapter 1: Introduction ...... 1
1.1 Introduction ...... 2
1.2 Functionalized Metal Complexes for Protein Structure Determination ...... 3
1.2.1 NMR spectroscopy, X-Ray Crystallography and Structural Biology ...... 4
1.2.2 Paramagnetic NMR Spectroscopy ...... 6
1.2.3 Pseudocontact Shifts and Protein Structure Refinement ...... 7
1.3 Functionalized Metal Complexes for Catalysis ...... 10
1.3.1 Catalytic Cycle ...... 10
1.3.2 Homogeneous vs. Heterogeneous Catalyst ...... 12
1.4 Project Aims ...... 13
1.5 General structure of the thesis ...... 13
1.6 References ...... 15
Chapter 2: Dipicolinic Acid As Paramagnetic Probe – 4MMDPA...... 18
x
2.1 Introduction ...... 19
2.2 Paramagnetic Probes ...... 19
2.2.1 Metal Substitution in Metalloproteins ...... 20
2.2.2 Lanthanide Binding Peptides ...... 21
2.2.3 Lanthanide Binding Ligands ...... 22
2.3 Dipicolinic Acid Based Paramagnetic Probes ...... 23
2.3.1 Retrosynthesis ...... 24
2.3.2 Attachment of 4MMDPA onto N-terminal domain of the arginine repressor ...... 26
2.3.3 NMR spectroscopy of ArgN-4MMDPA in the presence of Diamagnetic & Paramagnetic Lanthanide Ions...... 29
2.3.4 Solving the Magnetic Susceptibility Tensor and Metal Ion Locations ...... 33
2.3.5 Alignment Tensor ...... 36
2.4 Conclusion ...... 42
2.5 Reference ...... 44
Chapter 3: Dipicolinic Acid as Paramagnetic Probe – 3MDPA ...... 46
3.1 Introduction ...... 47
3.2 Synthesis ...... 47
3.3 Transition metals as paramagnetic shift reagents ...... 48
3.3.1 Metal Ion Location and Magnetic Susceptibility Tensor ...... 51
3.3.2 Alignment Tensor ...... 53
3.4 Paramagnetic lanthanide metals as paramagnetic shift reagents ...... 56
3.4.1 NMR Spectroscopy of ArgN-3HDPA ...... 57
3.4.2 NMR Spectroscopy of T4-lysozyme ...... 62
3.4.3 Comparison of ArgN-3MDPA and T4-lysozyme-3MDPA ...... 64
3.4.4 Residual Dipolar Coupling ...... 64
xi
3.5 Conclusion ...... 65
3.6 References ...... 67
Chapter 4: Unnatural Amino Acids as Paramagnetic Probes ...... 68
4.1 Introduction ...... 69
4.2 Incorporation of unnatural amino acids ...... 69
4.2.1 Residual specific labeling using unnatural amino acids ...... 70
4.2.2 Site specific labeling using unnatural amino acids ...... 71
4.3 Unnatural amino acids as paramagnetic probes ...... 73
4.3.1 Generalized synthesis of unnatural amino acids ...... 74
4.3.2 Synthesis of the dipicolinic acid modified amino acid, DPA-AA (11) ...... 75
4.3.3 Synthesis of iminodiacetic acid modified amino acid, IDA-AA (15) ...... 77
4.3.4 Synthesis of 8-hydroxyquinoline modified amino acid, HQ-AA (19) ...... 80
4.3.5 Attempted Incorporation of DPA-AA, IDA-AA and HQ-AA ...... 82
4.4 Conclusion ...... 82
4.5 References ...... 84
Chapter 5: Tridentate Rhodium(I) Complexes: Synthesis and Structure ...... 86
5.1 Introduction ...... 87
5.1.1 Synthesis of substituted tris(pyrazolyl)alkane ligands ...... 88
5.1.2 Coordination chemistry of tridentate N,N donor ligands ...... 89
5.2 Rhodium(I) complexes bearing p-tris(pyrazolyl)toludine ligand ...... 91
5.2.1 Synthesis of the tris(pyrazoyl)toluidine rhodium(I) complexes ...... 91
5.2.2 Solid state structure of the complexes [Rh(COD)(p-tpt)][BArF] (27a) and [Rh(NBD)(p-tpt)][BArF] (28a) ...... 92
xii
5.2.3 Solution state structure of the complexes [Rh(COD)(p-tpt)][BArF] (27a) and [Rh(NBD)(o-tpt)][BArF] (28a) ...... 96
5.2.4 Solution state structure of the complex [Rh(CO)2(p-tpt)][BArF] (29a) ...... 98
5.2.5 Synthesis and structure of the complex [Rh(COD)(p-tpt)][BPh4] (32) ...... 100
5.3 Rhodium(I) complexes bearing o-tris(pyrazoyl)toludine ligand ...... 102
5.3.1 Solid state studies of [Rh(COD)(o-tpt)][BArF] (27b) and [Rh(NBD)(o-tpt)][BArF] (28b) ...... 103
5.3.2 NMR studies of [Rh(COD)(o-tpt)][BArF] (27b) and [Rh(NBD)(o-tpt)][BArF] (28b) ...... 107
5.3.3 NMR and solid state structure of [Rh(CO)2(o-tpt)][BArF] (29b) ...... 111
5.3.4 Understanding the structural exchange processes in rhodium(I) complexes bearing o- 3 2 Exchange vs. Atropisomerism ...... 115
5.4 Rhodium(I) complexes bearing the tris(N-methylimidazolyl)methanol ligand ...... 117
5.4.1 Synthesis of [Rh(COD)(tim)][BArF] (36) and
[Rh(CO)2(tim)][BArF] (37) ...... 117
5.4.2 Solid state structures of [Rh(COD)(tim)][BArF] (36) and
[Rh(CO)2(tim)][BArF] (37) ...... 118
5.5 Conclusion ...... 121
5.6 References ...... 123
Chapter 6: Tridentate Rhodium(I) Complexes: Catalytic Activity ...... 126
6.1 Introduction ...... 127
6.1.1 Alkylidene lactones ...... 127
6.1.2 Metal catalyzed intramolecular hydroalkoxylation ...... 130
6.1.3 Gold / silver catalyzed hydroalkoxylation of alkynoic acids...... 132
xiii
6.1.4 Rhodium / Iridium catalyzed hydroalkoxylation of alkynoic acids...... 133
6.2 Catalyzed intramolecular hydroalkoxylation of aliphatic alkynoic acid ...... 135
6.2.1 Procedure ...... 135
6.2.2 Preliminary catalyst and solvent screening ...... 135
6.2.3 Intramolecular cyclization of 4-pentynoic acid using Rhodium(I) complexes bearing imidazolyl and pyrazolyl donors...... 137
6.2.4 Intramolecular cyclization of 5-hexynoic acid using Rhodium(I) complexes bearing imidazolyl donors...... 141
6.3 Intramolecular hydroalkoxylation of aromatic alkynoic acid ...... 143
6.3.1 Synthesis of aromatic alkynoic acid substrates ...... 144
6.3.2 Intramolecular cyclization of 2-ethynylbenzoic acid using Rhodium(I) complexes bearing imidazolyl and pyrazolyl donors...... 144
6.3.3 Intramolecular cyclization of non-terminal aromatic alkynoic acids using Rhodium(I) complexes bearing imidazolyl and pyrazolyl donors...... 149
6.3.4 Exo-dig : Endo-dig selectivity: Aliphatic vs. aromatic terminal alkyne substituents ...... 151
6.4 Exo-dig : Endo-dig selectivity: Electron donating vs. electron withdrawing terminal alkyne substituents ...... 153
6.4.1 Synthesis of non-terminal aromatic alkynoic acids with electron rich or electron deficient terminal substituents ...... 153
6.4.2 Electronic properties of electron donating and withdrawing substrates...... 154
6.5 Conclusion ...... 157
6.6 References ...... 159
Chapter 7: Summary, conclusions and future work ...... 161
7.1 Summary and conclusions ...... 162
7.2 Future work ...... 164
xiv
Chapter 8: Experimental ...... 168
8.1 General Experimental ...... 169
8.2 Synthesis of thiol modified lanthanide binding ligands and modification of proteins...... 171
8.2.1 Synthesis of 2,6-Dimethoxycarbonyl-4-bromomethylpyridine (4)...... 171
8.2.2 Synthesis of 4-Mercaptomethyl-2,6-pyridinedicarboxylic acid (5)...... 172
8.2.3 Synthesis of 3-mercapto-2,6-dimethoxycarbonylpyridine (9)...... 172
8.2.4 Ligation of thiol modified tags to N-terminal domain of the arginine repressor (ArgN)...... 173
8.3 Synthesis of unnatural amino acids bearing lanthanide and transition metal ion binding motifs...... 174
8.3.1 Synthesis of Dimethyl Acetamido(2,6- dimethoxycarbonylpyridyl)malonate (10)...... 174
8.3.2 Synthesis of 2-amino-3-(2,6-pyridinedicarbonyl)propanoic acid (11)...... 174
8.3.3 Synthesis of Diethyl Acetamido(N,N-bis(ethoxycarbonylmethyl)- aminobenzyl)malonate (14)...... 175
8.3.4 Synthesis of 2-amino-3-(N,N-bis(methylcarboxylate) aminobenzyl)propanoic acid (15)...... 176
8.3.5 Synthesis of Diethyl Acetamido-6-(9-hydroxylquinolyl)malonate diethyl ester (18)...... 176
8.3.6 Synthesis of 2-amino-6-(9-hydroxylquinoyl)propanoic acid dihydrochloride (19)...... 177
8.4 Synthesis of rhodium(I) complexes bearing pyrazolyl donor ligands...... 178
8.4.1 General synthesis of rhodium(I) olefin tris(pyrazolyl)toluidine complexes bearing the BArF counterion 27a-b and 28a-b...... 178
8.4.2 General synthesis of rhodium carbonyl tris(pyrazolyl)toluidine complexes 29 by the displacement of the COD co-ligand...... 184
xv
8.5 Synthesis of rhodium(I) complexes bearing imidazolyl donor ligands...... 186
8.5.1 Synthesis of [Rh(COD)(tim)][BArF] (36)...... 186
8.5.2 Synthesis of [Rh(CO)2(tim)][BArF] (37)...... 187
8.6 Catalyzed intramolecular cyclization of alkynoic acids – Synthesis of substrates and catalysis...... 187
8.6.1 Synthesis of 2-heptylmethylbenzoate (43c)...... 187
8.6.2 Generalized procedure for the synthesis of terminally substituted alkynoic esters 49a–d...... 188
8.6.3 General procedure for the hydrolysis of aromatic alkynoic acids 44 and 50...... 191
8.6.4 Generalised procedure for metal catalysed intramolecular cyclization of alkynoic acids...... 192
8.6.5 Cyclization of 4-pentynoic acid (38) to 5-methylenedihydrofuranone (39)...... 193
8.6.6 Cyclization of 1-hexynoic acid (41) to 5-methylenedihydropyranone (42)...... 194
8.6.7 Cyclization of 2-ethynylbenzoic acid (44a) to isochromenone (45a) and 3-methyleneisobenzofuranone (46a)...... 195
8.6.8 Cyclization of 2-(phenylethynyl)benzoic acid (44b) to 3- phenylisochromenone (45b) and (Z)-3-benylideneiso-benzofuranone (46b)...... 196
8.6.9 Intramolecular cyclization of 2-(heptynyl)benzoic acid (44c) to 3- pentyl-1H-isochromen-1-one (45c) and (Z)-3-hexylideneiso-benzofuranone (46c)...... 197
8.6.10 Cyclization of 2-(4-acetylphenylethynyl)benzoic acid (50a) to 3-(4- acetylphenyl)-1H-isochromen-1-one (51a) and (Z)-3-(4-acetylphenyl)isobenzofuran(3H)-one (52a) ...... 198
8.6.11 Cyclization of 2-(4-chlorophenylethynyl)benzoic acid (50b) to 3- (4-chlorophenyl)-1H-isochromen-1-one (51b) and (Z)-3-(4-chlorophenyl)isobenzo-furan(3H)-one (52b) ...... 200
xvi
8.6.12 Cyclization of 2-(4-methylphenylethynyl)benzoic acid (50c) to 3- (4-methylphenyl)-1H-isochromen-1-one (51c) and (Z)-3-(4-methylphenyl)-isobenzofuran(3H)-one (52c) ...... 201
8.6.13 Cyclization of 2-(4-methoxyphenylethynyl)benzoic acid (50d) to 3- (4-methoxyphenyl)-1H-isochromen-1-one (51d) and (Z)-3-(4-methoxyphenyl) isobenzofuran(3H)-one (52d) ...... 202
8.7 X-Ray Crystallographic data ...... 203
8.7.1 [Rh(COD)(p-tpt)][BArF] (27a) ...... 203
8.7.2 [Rh(NBD)(p-tpt)][BArF] (28a) ...... 204
8.7.3 [Rh(COD)(p-tpt)][BPh4] (32) ...... 205
8.7.4 [Rh(COD)(o-tpt)][BArF] (27b) ...... 206
8.7.5 [Rh(NBD)(o-tpt)][BArF] (28b) ...... 207
8.7.6 [Rh(CO)2(o-tpt)][BArF] (29b) ...... 208
8.7.7 [Rh(COD)(tim)][BArF] (36) ...... 209
8.7.8 [Rh (CO)2(tim)][BArF] (37) ...... 210
8.8 References ...... 211
xvii
List of Figures
Figure 1.1: The application of functionalized metal complexes: (a) as a homogeneous catalyst using the functional group to induce additional selectivities; (b) attached to a protein as a biological NMR probe or heterogeneous catalyst in aqueous conditions; and (c) attached to a macroscopic scaffold as a heterogeneous catalyst...... 2
Figure 1.2: The four levels of the protein structure (a) primary structure; (b) secondary structure; (c) tertiary structure; and (d) quaternary structure...... 4
Figure 1.3: (a) Selected resonances in a 1H-15N HSQC spectra recorded of the - ! ! ! " # connecting cross peaks of backbone amide recorded without a lanthanide ion (black), with Dy3+ (red) and Er3+ (magenta). (b) Isosurfaces showing the pseudocontact shifts corresponding to ±3, ±1.5 and ±0.5 ppm where positive values are denoted in blue and negative values are denoted in red. The three $ ! ! ribbon representation.15 ...... 9
Figure 1.4: General catalytic cycle involving the complex (MLn) and the reactants (A and B)...... 11
Figure 2.1: Structures of the CLaNP11, CLaNP-312 and CLaNP-5.213 lanthanide binding ligands attached to proteins...... 22
Figure 2.2: (a) The 1H-15N HSQC spectrum of ArgN protein (red) overlaid with the spectrum of ArgN-4MMDPA adduct (green) showing the change in the spectrum after derivatization with the 4MMPDA ligand. (b) A plot of the changes in 15N chemical shift for the amino acid residues in the protein...... 28
Figure 2.3: A three dimensional ribbon representation of the ArgN protein with the residues Glu20 – Ser25 and Gly54 – Ala 71 highlighted in red. The
xviii
three dimensional structure of the 4MMDPA probe (5) attached to the Cys68 is superimposed for visual guidance and does not represent the actual orientation of the ligand in the ArgN-4MMDPA adduct...... 29
Figure 2.4: Overlay of the 1H-15N HSQC spectra of the unpurified ArgN- 4MMDPA adduct in the presence of a diamagnetic lanthanide ion (red) and in the presence of a paramagnetic lanthanide ion (green)...... 30
Figure 2.5: Overlay of the 1H-15N HSQC spectra of the purified ArgN- 4MMDPA adduct in the presence of a diamagnetic lanthanide ion (cyan) and in the presence of a paramagnetic lanthanide ion (purple)...... 30
Figure 2.6: Overlay of the 1H-15N HSQC spectra of G27 and G54 residue in the ArgN-4MMDPA adduct in the presence of a diamagnetic lanthanide ion and in the presence of a paramagnetic lanthanide ion Tm3+ (green), Yb3+ (purple) and Tb3+ (orange). Paramagnetic peaks belonging to the same amino acid residue are connected using a dashed blue line...... 32
Figure 2.7: Plot of the PCS of the amide protons in the 1H dimension vs. the amino acid residue number for Tb3+ (J), Tm3+ (L) and Yb3+ ([)...... 32
Figure 2.8: The plot of the back calculated PCS vs. the observed PCS for ArgN-4MMDPA: (a) Tb3+; (b) Tm3+; and (c) Yb3+. The predicted PCS was calculated by solving all three sets of experimental data using all 23 structures simultaneously and with no additional boundary conditions using NUMBAT.19 ...... 34
Figure 2.9: PCS Isosurface for ±1 and ±0.2 ppm (blue = positive PCS, red = negative PCS) superimposed on the 3 dimensions structure of the ArgN for: (a) Tb3+; (b) Tm3+; and (c) Yb3+ visualized using PyMol. The PCS isosurfaces were calculated using all three sets of experimental data using all 23 structures simultaneously and with no additional boundary conditions using NUMBAT.19 ...... 35
Figure 2.10: The plot of the back-calculated RDC vs. the observed RDC for ArgN-4MMDPA: (a) Tb3+; (b) Tm3+; and (c) Yb3+ using only the first model of the ArgN protein structure. The predicted RDC was calculated using PALES20
xix
only including the RDCs from only the rigid element of the tertiary structure (i.e. residues 10 – 19, 26 – 36 and 43- 53)...... 38
Figure 2.11: The Sanson-Flamsteed equivalence plot for the magnetic susceptibility tensor (left) and the alignment tensors (right) for: (a) Tb3+; (b) Tm3+; and (c) Yb3+ determined independently for each of the 23 conformers in the NMR structure of the ArgN protein. The orientation of the alignment tensor was calculated by using the RDCs from only the rigid element of the tertiary structure (i.e. residues 10 – 19, 26 – 36 and 43- 53)...... 39
Figure 2.12: A ribbon representation of the three dimensional structure of the ArgN protein with the location of the metal ion (cyan sphere) and the G21 residues superimposed. The three dimensional structure of the 4MMDPA tag attached to the C68 residues is superimposed for visual guidance and does not represent the actual orientation of the ligand in the ArgN-4MMDPA complex in solution...... 41
Figure 3.1: (a) The 1H-15N HSQC spectrum of ArgN protein (red) overlaid with the spectrum of ArgN-3MDPA adduct (black) showing the change in the spectrum after derivatization with the 3MDPA ligand. (b) A plot of the changes in 15N chemical shift for the amino acid residues in the protein...... 49
Figure 3.2: Overlay of the 1H-15N HSQC spectra of the purified ArgN-3MDPA adduct in the presence of a diamagnetic lanthanide ion (black) and in the presence of a paramagnetic transition metal ion (red)...... 50
Figure 3.3: Plot of the PCS in the 1H dimension vs. the amino acid residue number for the ArgN-3MDPA in the presence of zinc and cobalt ion...... 51
Figure 3.4: The plot of the back calculated PCS vs. the experimentally observed PCS for ArgN-3MDPA in the presence of zinc and cobalt ions. The predicted PCS was calculated by solving all three sets of experimental data simultaneously using all 23 structures simultaneously and with no additional boundary conditions using NUMBAT.2 ...... 52
xx
Figure 3.5: PCS Isosurface for ±2 and ±0.2 ppm (blue = positive PCS, red = negative PCS) superimposed on the 3 dimensions structure of the ArgN for Co2+ visualized using PyMol. The PCS isosurface were calculated using all 23 structures simultaneously and with no additional boundary conditions using NUMBAT.2 ...... 52
Figure 3.6: The plot of the back calculated RDC vs. the experimentally observed RDC for ArgN-3MDPA-Co2+ using only the first model of the ArgN protein structure. The predicted RDC was calculated using PALES3 only including the RDCs from only the rigid element of the tertiary structure (i.e. residues 10 – 19, 26 – 36 and 43- 53)...... 53
Figure 3.7: The Sanson-Flamsteed equivalence plot for the magnetic susceptibility tensor: (a) calculated using NUMBAT2 and; (b) the corrected alignment tensors determined independently for each of the 23 conformers for the NMR structure of the ArgN protein. The orientation of the alignment tensor was calculated by using the RDCs from only the rigid element of the tertiary structure (i.e. residues 14 – 16, 28 – 34 and 45- 51) using PALES3...... 55
Figure 3.8: A ribbon representation of the three dimensional structure of the ArgN protein with the location of the metal ion (purple sphere) and the Cys68, Arg59 and Lys62 residues superimposed...... 56
Figure 3.9: 1H-15N-HSQC spectra of ArgN-3MDPA in complex with different metal ions. The spectra were recorded of 0.15 mM solutions of ArgN-3MDPA in a buffer of 20 mM MES, pH 6.5, using a Bruker 800 MHz NMR spectrometer at 25oC. Selected pairs of cross-peaks from the diamagnetic and paramagnetic molecules are connected by lines and labelled with their assignment. (a) Superimposition of 15N-HSQC spectra recorded with Y3+ (black) and a mixture of Tb3+/Y3+ (grey). (b) Same as (a), except that the grey spectrum was recorded with a mixture of Tm3+/Y3+...... 58
Figure 3.10: tensors of eight different metal ions bound to the ArgN-3DPA adduct. The x-, y- and z-axes of the tensors are distinguished by light grey, dark grey and black lines, respectively. The axes definitions follow the unique tensor representation (UTR) convention, where the x- and z-axes are the
xxi
shortest and longest axes of the tensor.2 The side chains of Glu12 and Cys68 and the N- and C-termini of the protein are labelled. The figure was generated using Molmol.8 ...... 60
Figure 3.11: 1H-15N-HSQC spectra of the C54T/C97A/Q69C triple mutant of T4 lysozyme derivatized with 3MDPA (9) and in the presence of with Tm3+, Lu3+ and Co2+. Selected pairs of cross-peaks from the diamagnetic and paramagnetic molecules are connected by lines and labelled with their assignment. (a) Superimposition of the spectrum with Co2+ (red) onto the spectrum recorded with a 1:0.8 mixture of Tm3+ and Lu3+ (black). (b) Superimposition of spectra recorded with 1:0.8 mixtures of Tm3+ : Lu3+ of the construct with 3MDPA (black) and the construct with 4MMDPA (red). The spectra were recorded at 25oC in a buffer of 20 mM MES (pH 6.5) on a Bruker 800 MHz NMR spectrometer, using a protein concentration of about 0.15 mM...... 63
Figure 4.1: Phenylalanine analogues incorporated using residue specific method...... 70
Figure 4.2: 1H NMR spectrum of 2-amino-3-(2,6- pyridinedicarbonyl)propanoic acid (DPA-AA, 11) in D2O...... 77
Figure 4.3: 1H NMR spectrum of Diethyl Acetamido(N,N- bis(ethoxycarbonylmethyl)- aminobenzyl)-aminomalonate diethyl ester (14) in
CDCl3...... 79
Figure 4.4: 1H NMR spectrum of 2-amino-3-
(N,N(methylcarboxylate)aminobenzyl)-propanoic acid (15) in D2O / NaOD...... 80
Figure 4.5: 1H NMR spectrum of 2-amino-3-(8-hydroxyquinoly)propanoic acid (HQ-AA, 19) in D2O...... 82
Figure 5.1: The two different binding modes for tris(pyrazolyl)alkane ligands: (a) the tridentate binding mode and 3; (b) the bidentate binding mode 2...... 89
Figure 5.2: The molecular structure of: (a) [Rh(COD)(tdmpm)]+; and (b) [Rh(NBD)(tdmpm)]+ synthesized by Hallett et al.32 ...... 91
xxii
Figure 5.3: ORTEP depictions of (a) [Rh(COD)(p-tpt)][BArF] (27a) and (b) [Rh(NBD)(p-tpt)][BArF] (28a) at 50% thermal ellipsoids for non-hydrogen atoms. Nitrogen, rhodium and carbon atoms are colored in blue, green and black respectively...... 94
Figure 5.4: 1H NMR spectrum of [Rh(COD)(p-tpt)][BArF] (27a) at: (a) 25oC; o and (b) -55 C in CD2Cl2...... 97
Figure 5.5: 1H NMR spectrum of [Rh(NBD)(p-tpt)][BArF] (28a) showing o only the aromatic region at -55 C in CDCl3...... 98
1 Figure 5.6: H NMR spectrum of [Rh(CO)2(p-tpt)][BArF] (29a) showing only o the aromatic region at -55 C in CD2Cl2 ...... 99
Figure 5.7: ORTEP depictions of [Rh(COD)(p-tpt)][BPh4] (32) at 50% thermal ellipsoids for non-hydrogen atoms and selected bond lengths and angles. Nitrogen, rhodium and carbon atoms are colored in blue, green and black respectively...... 102
Figure 5.8: ORTEP depiction of [Rh(COD)(o-tpt)][BArF] (27b) showing the two co-crystallized structures: (a) Structure A; and (b) Structure B at 50% thermal ellipsoids for non-hydrogen atoms. Nitrogen, Rhodium and carbon atoms are colored in blue, green and black respectively ...... 105
Figure 5.9: ORTEP depiction of [Rh(NBD)(o-tpt)][BArF] (28b) at 50% thermal ellipsoids for non-hydrogen atoms. Nitrogen, Rhodium and carbon atoms are colored in blue, green and black respectively ...... 106
Figure 5.10: 1H NMR spectrum of [Rh(COD)(o-tpt)][BArF] (27b) showing o o only the aromatic region at: (a) -55 C; and (b) 25 C in CDCl3 measured on Bruker Avance 600 MHz spectrometer...... 108
Figure 5.11: 1H NMR spectrum of [Rh(NBD)(o-tpt)][BArF] (28b) showing o only the aromatic region at -55 C in CD2Cl2 measured on Bruker Avance 600 MHz spectrometer...... 109
xxiii
Figure 5.12: 1H NOESY spectrum of [Rh(COD)(o-tpt)][BArF] (27b) measured on Bruker Avance 600 MHz spectrometer showing only the aromatic o region at -50 C in CDCl3. Highlighted are the exchange cross peaks between the resonances due to the pyrazole H3/5, the H2’ and H3’ of the aniline ring...... 110
Figure 5.13: ORTEP depiction of the cationic fragment of the complex
[Rh(CO)2(o-tpt)][BArF] (29b) at 50% thermal ellipsoid for non hydrogen atoms: (a) viewed along the Rh-C1 axis and (b) viewed from the top. Nitrogen atoms are colored in blue, rhodium atoms are colored in green and carbon atoms colored in black...... 112
1 Figure 5.14: H NMR spectrum of [Rh(CO)2(o-tpt)][BArF] (29b) showing o only the aromatic region at -55 C in CD2Cl2 measured on Bruker Avance 400 MHz spectrometer...... 114
Figure 5.15: ORTEP representation of the cationic fragment of [Rh(COD)(tim)][BArF] (36) at 50% thermal ellipsoid for non hydrogen atoms. Nitrogen atoms are colored in blue, rhodium atoms are colored in green and carbon atoms colored in black...... 118
Figure 5.16: ORTEP representation of the cationic fragment of
[Rh(CO)2(tim)][BArF] (37) at 50% ellipsoid for non hydrogen atoms viewed from (a) the top; (b) top. Nitrogen atoms are colored in blue, rhodium atoms are colored in brown and carbon atoms colored in black. The 1H NMR spectrum of the complex at 25°C of the complex (37) in d4-methanol measured on a Bruker Avance 300 MHz Spectrometer...... 119
Figure 6.1: Structures of biologically active or alkylidene lactones isolated from natural products.5 ...... 128
Figure 6.2: Plot of the percentage conversion vs. time for the intramolecular o cyclization of 4-pentynoic acid to the lactone 39 using in d6-benzene at 80 C using 2 mol% of [Rh(CO)2(p-tpt)][BArF] (29a) (L), [Rh(CO)2(bim)][BArF]
(40) ([) and [Rh(CO)2(tim)][BArF] (37) (J)...... 138
xxiv
Figure 6.3: 1H NMR spectrum for the intramolecular cyclization of 4- pentynoic acid (38) catalyzed by 2 mol% [Rh(CO)2(tim)][BArF] (37) in C6D6 at 80oC at: (a) t = 1 min; (b) t = 8 mins; (c) t = 21 mins; and (d) t = 34 mins...... 139
Figure 6.4: 1H NMR spectrum for the intramolecular cyclization of 4-hexynoic o acid (41) catalyzed by 1 mol% [Rh(bim)(CO)2][BArF] (40) in C6D6 at 70 C at: (a) t = 0.2 hrs; (b) t = 0.8 hrs; (c) t = 2.2 hrs; and (d) t = 14.4 hrs ...... 142
Figure 6.5: 1H NMR spectrum for the intramolecular cyclization of 2- ethynylbenzoic acid (44a) catalyzed by 1 mol% [Rh(CO)2(bim)][BArF] (40) in o C6D6 at 60 C at: (a) t = 3 mins; (b) t = 9 mins; (c) t = 20 mins; and (d) t = 25 mins...... 145
Figure 6.6: Plot of the percentage conversion vs. time for the intramolecular cyclization of 2-ethynylbenzoic acid (44a) to the lactones 45a and 46a using in o o d6-benzene using 1 mol% of [Rh(CO)2(p-tpt)][BArF] at 60 C and 25 C (L o o and ), [Rh(CO)2(bim)][BArF] at 60 C ([) and [Rh(CO)2(tim)][BArF] at 60 C (J)...... 146
Figure 6.7: Proposed mechanism for the intramolecular cyclization of aromatic alkynoic acid catalyzed by a rhodium(I) dicarbonyl complexes, showing the two alternative pathways leading to the formation of the exo-dig product (red cycle) and the endo-dig product (purple cycle)...... 151
Figure 7.1: Proposed methods for the immobilization of the complex
[Rh(CO)2(p-tpt)][BArF] (29a) onto: (a) a carbon surface using the electrochemical reduction of a aryl diazonium salt generated in situ; (b) a carboxylic acid modified carbon surface using EDC / NHS peptide coupling between the amine and carboxylic acid group; and (c) onto a protein though the conversion of the amine to a thiol, followed by DTNB assisted disulfide bond formation...... 167
xxv
List of Tables
Table 2.1: The axial and rhombic components of the alignment tensor and magnetic susceptibility tensor for the ArgN-4MMDPA adduct in the presence of paramagnetic lanthanide ion.d ...... 40
Table 3.1: The axial and rhombic components of the alignment tensor and the magnetic susceptibility tensors for the ArgN-3MDPA adduct...... 54
Table 3.2 0 $ $$ ex with ArgN-3MDPAa ...... 61
Table 5.1: Selected bond distances for the complexes [Rh(COD)(p-tpt)][BArF] (27a), [Rh(NBD)(p-tpt)][BArF] (28a) and [Rh(COD)(bpm)][BArF] (31).34 ...... 95
Table 5.2: Selected bond distances for the complex [Rh(COD)(o-tpt)][BArF] (27b), [Rh(NBD)(o-tpt)][BArF] (28b) and [Rh(COD)(bpm)][BArF] (31).34 ...... 104
Table 5.3: Selected bond distances for the complex [Rh(CO)2(o-tpt)][BArF] 34 (29b) and the analogous [Rh(CO)2(bpm)][BArF] (35)...... 113
Table 5.4: Selected bond distances and angles for the complexes
[Rh(COD)(tim)][BArF] (36) and [Rh(CO)2(tim)][BArF] (37)...... 120
Table 6.1: Catalytic data for the intramolecular cyclization of 4-pentynoic acid (38) acid using Rhodium(I) and Iridium(I) based complexes...... 134
Table 6.2: Catalytic data for the intramolecular cyclization of 4-pentynoic acid
(38) in the presence of 2 mol% [Rh(CO)2(p-tpt)][BArF] (29a) in a range of solvents and temperature...... 137
Table 6.3: Comparison of catalytic activity of rhodium complexes and literature complexes for the intramolecular cyclization of 4-pentynoic acid (38) to the lactone 39...... 140
xxvi
Table 6.4: Comparison of literature catalytic activity of rhodium complexes for the intramolecular cyclization of 5-hexynoic acid (41) to the lactones 42...... 143
Table 6.5: Comparison of literature catalytic activity of rhodium(I) and gold (I) catalysts for the intramolecular cyclization of 2-ethynylbeznoic acid (44a) to the cyclic lactones 45a and 46a.a ...... 148
Table 6.6: Comparison of the catalytic activity for the intramolecular cyclization of non-terminal alkynoic acids (44b-c) to the cyclic lactones 45b-c and 46b-c using rhodium(I) imidazolyl and pyrazolyl complexes...... 150
Table 6.7: Summary of the exo : endo selectivities for the intramolecular cyclization of terminal and non-terminal alkynoic acids using the complex 40 as catalyst...... 152
Table 6.8: The corresponding Hammett constant, carbonyl stretching frequency and the selected 13C chemical shifts of the substituted aromatic alkynoic acids...... 155
Table 6.9: Intramolecular cyclization of electron rich and electron deficient non-terminal aromatic alkynoic acids (50a – d) using the complex o [Rh(CO)2(bim)][BArF] (40) in C6D6 at 60 C ...... 157
xxvii
List of Schemes
Scheme 2.1: The four possible disconnections in the retrosynthesis of 4- mercaptomethyl-2,6-pyridinedicarboxylic acid (5) ...... 24
Scheme 2.2: Synthesis of 4-mercaptomethyl-2,6-pyridinedicarboxylic acid (5)...... 25
Scheme 2.3: Functionalization of the ArgN using the 4MMDPA (5) lanthanide binding tag...... 27
Scheme 3.1: Synthesis of 3-mercapto-2,6-pyridinedicarboxylic acid (9)...... 47
Scheme 4.1: Schematic representation showing the propagation of a peptide chain through the interaction of the mRNA and the aminoacyl tRNA molecules: (a) initiation of protein synthesis signaled by the AUG codon, followed by the selection of valine amino acid using the GUU codon; (b) formation of the peptide bonding linking the two adjacent amino acids; (c) movement of the ribosome relative to the mRNA sequence to select and incorporate the next amino acid as determined by the codons; and (d) the termination and release of the peptide chain as signaled by the UGA codon.15 ...... 72
Scheme 4.2: Generalized method of the synthesis of unnatural amino acids: (a) nucleophilic attack of -halocarboxylic acids; (b) the Strecker synthesis; (c) alkylation of an acetamidomalonate ester; and (d) enzyme catalyzed ring opening of substituted hydantoin based compounds...... 75
Scheme 4.3: The synthesis of the unnatural amino acid bearing the dipicolinic acid motif (DPA-AA, 11) ...... 76
Scheme 4.4: The synthesis of the unnatural amino acid bearing the iminodiacetic acid motif (IDA-AA, 15) ...... 78
Scheme 4.5: The synthesis of the unnatural amino acid bearing the hydroxyquinoline motif (HQ-AA, 19) ...... 81
xxviii
Scheme 5.1: The generalized synthesis of the diolefin complexes 27 and 28 and the conversion to the corresponding dicarbonyl complexes 29...... 92
Scheme 5.2: Synthesis of the rhodium(I) olefin complexes bearing the tetraphenylborate counterion...... 101
Scheme 5.3: The possible exchange processes occurring in solution: (a) the V \ ! 2 3 binding mode; and (b) atropisomerism...... 116
Scheme 6.1: ^ $ $ " { ! \ of enzymes (Enz-Nu) by binding irreversibly.5 ...... 128
Scheme 6.2: ^ | ! ! $ ! $ $ " alkylidene lactones using (a) coupling reactions involving preformed alkylidene lactones; (b) condensation reactions between functionalized furans and aldehydes; and (c) catalyzed intramolecular cyclization of alkenoic or alkynoic acids.5 ...... 129
Scheme 6.3: The two possible mechanisms for the intramolecular cyclization of an alkynoic acid for: (a) substituted alkyne; and (b) unsubstituted...... 131
Scheme 6.4: The synthetic scheme for the synthesis of aromatic alkynoic acids bearing unsubstituted (44a) and substituted (44b-c) terminal alkynes...... 144
Scheme 6.5: The synthetic scheme for the synthesis of aromatic alkynoic acids with electron withdrawing and electron donating substituents at the terminal alkyne position...... 154
xxix
Chapter 1: Introduction
“Shallow men believe in luck. Strong men believe in cause and effect”
Ralph Waldo Emerson (1803 – 1882)
1
1.1 Introduction
Transition metal and lanthanide complexes have been traditionally used as homogeneous catalysts in the synthesis of large complex organic molecules from readily available synthons such as alkynols1-5 and aminoalkynes.6-8 Metal complexes also have a number of applications beyond catalysis; these include applications such as luminescence labeling in biology9-12 and medicinal applications such as anti-cancer activity.13-14 The aims of this project are to explore the application of transition and lanthanide metal complexes in the area of structural biology (Figure 1.1b) and the immobilization of homogeneous catalysts to combine the benefits of homogeneous and heterogeneous catalysis (Figure 1.1c).
Figure 1.1: The application of functionalized metal complexes: (a) as a homogeneous catalyst using the functional group to induce additional selectivities; (b) N-N donor ligand attached to a protein as a biological NMR probe or heterogeneous catalyst in aqueous conditions; and (c) attached to a macroscopic scaffold as a heterogeneous catalyst.
Paramagnetic lanthanide complexes have been successfully used in a number of biomedical applications, including their use as contrast agents in magnetic resonance
2
imaging.9-10 Recently, paramagnetic lanthanide ions have been used to determine the binding mode of protein-protein complexes in solution.15-17 One of the aims of this project is the development of small functionalized metal complexes that can be site specifically attached onto a protein and induce paramagnetic shifts in the NMR spectrum of the protein. These paramagnetic shifts can be used to determine the relative orientation and binding mode of protein-protein complexes.
Many metal complexes are highly active as homogeneous catalysts with tunable reactivity and selectivity. Despite the advantages of metal complexes as homogeneous catalysts, they are not as widely implemented as heterogeneous catalysts in part due to difficulties with separation of the catalysts from the product. With the presence of reactive functional groups on metal complexes, the complexes can be covalently attached to macroscopic scaffolds such as proteins,18-19 silicon20-21 and polymers.22-23 This makes it possible to combine the advantages of both homogeneous (i.e. tunable selectivities and activities) and heterogeneous catalyst (i.e. ease of product separation from the catalyst) into one catalytic system. Before the surfaces can be modified with the functionalized catalyst, the reactivity and the structure of the functionalized metal complexes needs to be established, which will be the major focus of this body of work.
1.2 Functionalized Metal Complexes for Protein Structure Determination
Protein structure is divided into three levels (Figure 1.2) which include: (1) primary structure; (2) secondary structure; and (3) tertiary and quaternary structure. Typically the primary structure (Figure 1.2a) refers to the sequence of amino acids linked by the amide bonds. Secondary structure (Figure 1.2b) refers to the organization of the amino acid sequence via hydrogen bonding to form discrete structural units such as the alpha helices or beta sheets. The tertiary structure refers to the arrangement of discrete secondary structural units to form the three dimensional structure of the protein. For larger proteins or protein-protein complexes, the quaternary structure refers to the full
3
assembly of multiple protein sub units. The three-dimensional structures of proteins and protein-protein complexes have been traditionally solved using single crystal x-ray crystallography. Recently, advances in both the hardware and software of NMR spectrometers have allowed high resolution NMR spectroscopy to solve the three dimensional structures of proteins in solution.
(b) Secondary Amino acids Protein Structure
(a) Primary Protein Structure Alpha Beta helices sheets
(d) Quaternary Protein (c) Tertiary Structure Protein Structure
Figure 1.2: The four levels of the protein structure (a) primary structure; (b) secondary structure; (c) tertiary structure; and (d) quaternary structure.
1.2.1 NMR spectroscopy, X-Ray Crystallography and Structural Biology
Structural biology is concerned with three specific objectives: (1) the identification of the various proteins which assemble to form a particular cell, tissue or organism; (2) understanding the interaction between the various proteins which makes up these cells, tissues and organisms; and (3) the determination of the precise three dimensional
4
structures of the protein-protein complexes which make up these cells, tissues and organisms.25
The ability to determine the three dimensional structure of protein-protein complexes has been of great interest in the area of drug discovery. Structural biology has been able to provide a means to select, refine and optimize small organic molecules to enhance their activity and selectivity. The use of detailed protein structures in the development of effective drugs will reduce not only the development time but also the costs.26 There are currently two main methods by which protein structures are solved; (1) X-Ray crystallography; and (2) Nuclear Magnetic Resonance (NMR) spectroscopy.
X-ray crystallography is considered to be the main method for solving protein structures due to the accurate coordinates that can be obtained for the entire protein assembly. With the recent advancement of techniques such as automated crystallization, data collection and processing, large structures can be refined rapidly, allowing x-ray crystallography to be used as a high throughput technique.27-29 Despite the advantages that x-ray crystallography provides, it is limited by the need to produce high quality single crystals of the protein, which is not always possible for proteins, or in particular protein-protein complexes.
NMR spectroscopy has been used to complement x-ray crystallography in protein structure determination, in particular for proteins which cannot be easily crystallized.27 Despite the benefits offered by NMR spectroscopy a major limitation is the extensive sample preparation required to determine the three dimensional structure of a large protein. Due to the complex nature of a typical NMR spectrum, isotopic labelling (i.e. 13C, 2H and 15N) is required to reduce the complexity of the NMR spectra. Due to the inherent insensitivity of NMR spectroscopy and low concentrations of a typical sample, long data collection times are required to measure an acceptable spectrum. To make NMR spectroscopy a more viable technique, there has been a drive to reduce the acquisition time. This has been achieved through several advances such as reduced dimensionality experiments,30 development of cryogenic probes31 and various automated assignment programs.32-34 Another approach to increase the efficiency of
5
NMR spectroscopy in the refinement of protein structures is by increasing the volume of information from a given experiment. One such example is through the use of paramagnetic NMR spectroscopy.
1.2.2 Paramagnetic NMR Spectroscopy
Paramagnetism can be introduced into an NMR sample by the addition of a species with an unpaired electron. When paramagnetism is introduced there are two effects: (1) pseudocontact shifts (PCS), which leads to changes in the chemical shift of NMR active nuclei; and (2) paramagnetic relaxation enhancement (PRE), which leads to an increase in the rate of relaxation (T1 and T2) of the resonances in the NMR spectra.
Both PRE and PCS can offer long range structural information that can be used to refine the three dimensional structures of proteins by determining the relative orientation and binding mode of protein-protein complexes. In the case of PRE, information can be observed from as far as 20 – 25 Å away from the paramagnetic label, complementing traditional nOes which are restricted to much shorter internuclear distances.35 In the case of the PCS, changes in the chemical shift of NMR active nuclei can be observed at distances as far as 40 Å from the paramagnetic centre.36 While both the PCS and PRE can offer invaluable information, PCS has been used more extensively in solving the structures of paramagnetic metalloproteins37-39 due to the fact that it is mathematically well defined and easily measured with a high degree of accuracy, and as a consequence leads to precise structural information.
There are many different sources of paramagnetism, they include transition metal ions,40 long living radicals,4 oxygen42 and lanthanide ions.43-44 The lanthanide ions and their metal complexes are often the preferred source of paramagnetism used in the refinement of protein structures due to the fact that they form stable polyvalent ions which do not interfere with the structure of the protein. Lanthanide ions can be used to replace calcium ions, providing a convenient means to introduce a paramagnetic center. The
6
advantage of using lanthanide ions is that while the different lanthanide ions are chemically similar, they all possess different paramagnetic properties, as these differences can be used to obtain different structural information. The different lanthanide ions have different PCS shift distributions, but at the same time they have different relaxation properties.16 In addition, the unpaired electrons of the lanthanide ions exist in the inner orbitals, which reduces the complexity of analysis of the pseudocontact shifts.45 This simplification allows the PCS to be measured by taking the difference between the chemical shift of the nuclei in the presence of a diamagnetic reference and in the presence of the desired paramagnetic lanthanide ion.
1.2.3 Pseudocontact Shifts and Protein Structure Refinement
As described in the previous section the PCS can be defined as the difference of the chemical shift observed with and without a paramagnetic centre (eqn. 1.1).