<<

Biosynthesis – Inspiration for Discovery

Primary & Chemistry

Alan C. Spivey [email protected]

Oct/Nov 2008 Lessons in Synthesis - Azadirachtin

• Azadirachtin is a potent insect anti-feedant from the Indian neem tree: – exact biogenesis unknown but certainly via modification:

O MeO C OAc O 2 H O OH O H O OH 12 O O C 11 O 14 OH oxidative 8 O H 7 cleavage highly hindered C-C bond HO OH AcO OH AcO OH for synthesis! H H of C ring H MeO2C O AcO H tirucallol azadirachtanin A azadirachtin (cf. lanosterol) (a limanoid = tetra-nor-triterpenoid)

– Intense synhtetic efforts by the groups of Nicolaou, Watanabe, Ley and others since structural elucidation in 1987. – 1st total synthesis achieved in 2007 by Ley following 22 yrs of effort – ~40 researchers and over 100 man-years of research! – 64-step synthesis

– Veitch Angew. Chem. Int. Ed. 2007, 46, 7629 (DOI) & Veitch Angew. Chem. Int. Ed. 2007, 46, 7633 (DOI) – Review ‘The azadirachtin story’ see: Veitch Angew. Chem. Int. Ed. 2008, 47, 9402 (DOI) Rational Agrochemical Development – Shikimate Pathway Intervention

• The shikimate biosynthetic pathway is not found in animals/humans – only in plants – selective intervention in these pathways allows development of agrochemicals with minimal toxicity

• Glyphosate (‘Roundup’) – a Monsanto agrochemical is a potent inhibitor of the conversion of 3-phosphoshikimate (3-PS) → 5-enolpyruvylshikimate-3- (5-EPS-3P) – a non-selective herbicide

H NH3 CO2 CO2 CO2 CO PO 2 CO2 phosphoenol pyruvate CO (PEP) PEP 2 H (S)- PO OH PO O + PO O CO2 OH PO OH OP OH OH Pi 3-phosphoshikimate H NH3 HO (3-PS) O 5-enolpyruvylshikimate-3-phosphate CO OH (5-EPS-3P) 2 erythrose-4-phosphate glyphosate (Roundup®) (E-4-P) inhibits this step (S)-

PO N CO2 H2 Inspiration for Med Chem - Statins

• HMG CoA → MVA is the rate determining step in the biosynthetic pathway to cholesterol

• ‘Statins’ inhibit HMG CoA and are used clinically to treat hypercholesteraemia -a causative factor in heart disease – e.g. lipitor (Atorvastatin , Pfizer) is a competitive inhibitior of HMG-CoA reductase and the worlds biggest selling drug [first drug to reach $10 billion sales (2004: $10.8 bn] Format & Scope of Lectures

• What is ? – some definitions – phototrophs, chemotrophs; metabolism (catabolism/), 1° & 2° • Overview of primary metabolism → secondary metabolites – & → shikimate formation → shikimate metabolites – acetylCoA & the → α-amino acids → penicillins, cephalosporins, alkaloids – acetylCoA → malonylCoA → fatty acids, prostaglandins, polyketides, macrolide antibiotics – acetylCoA → mevalonate → isoprenoids, terpenoids, , carotenoids • Biological/biosynthetic reactions – enzyme & cofactor chemistry – free energy source –ATP – C-C & C-O bond formation – CoASH, SAM, DMAPP, – oxidation –NAD+, FAD/FMN, haem oxo monooxygenases – reduction – NADPH – C-N bond formation – pyridoxal Metabolism & Diversity

O Me Me HO C 2 Me N N NMe O H H H O N N H camphor caffeine HO N H MeO NH

lysergic acid CO2 H2O Pi N2 N hv quinine

H Me O O OH O N Me O O H CO2H N clavulanic acid O Me androstenedione N O OH nicotine patulin Phototrophs & Chemotrophs

• Living organisms are not at equilibrium. They require a continuous influx of free energy to perform mechanical work & for cellular growth/repair:

– Phototrophs (e.g. green plants, algae & photosynthetic ): derive free energy from the sun via

photosynthesis (‘CO2 fixation’): • 1015 kg/year by green plants, which constitute 99% of Earths biomass (i.e. 1012 tons of dry matter) • 1g of carbon processed = >6250 litres of air

hv CO2 + H2O (CHO) + O 2 PHOTOSYNTHESIS

– Chemotrophs (e.g. animals, fungi, most bacteria): derive free energy by oxidising (, lipids, ) obtained from other organisms, ultimately phototrophs • some bacteria & fungi require just D-glucose • mammals require sugars, essential amino acids (~half total used) & certain (enzyme co-factors or precursors)

• Degradation of the nutrients is coupled to the stoichiometric production of ‘high energy’ phosphate compounds, particularly triphosphate (ATP, see later). All metabolic function is underpinned by ATP energetic coupling. • By analogy with a money-based economy, the metabolic cost of production of a given from another can be quantified in terms of ‘ATP equivalents’ defined as the # of moles of ATP consumed/produced per of converted in the reaction or sequence Metabolism

• Metabolism is the term used for in vivo processes by which compounds are degraded, interconverted and synthesised: – Catabolic or degradative: primarily to release energy and provide building blocks • generally oxidative processes/sequences (glycolysis, Krebs cycle) – Anabolic or biosynthetic: primarily to create new cellular materials (1° & 2° metabolites) • generally reductive processes/sequences • These two types of process are coupled – one provides the driving force for the other:

Natural product CO2 + H2O secondary metabolites

'CO2 fixation' hv complex metabolites (photosynthesis)

Nutrients ATP Cell components energy energy & Growth storage release Catabolism ADP + P i O 2 Anabolism oxidative NAD(P)H reductive Energy & degredation biosynthesis Building blocks NAD(P) + H Building Blocks respiration '' simple products (by diazatrophs, lightening, the Haber process)

CO2 + H2O N2 Types of Metabolite & Biosynthesis

• Biosynthesis is the term for the in vivo synthesis of metabolites/natural products: – These are divided into two camps: • Primary metabolites: These are the universal and essential components for the survival of living organisms. e.g. sugars, amino acids, , ‘common’ fats and polymers such as proteins, DNA, RNA, lipids and polysaccharides • Secondary metabolites: Compounds produced by organisms which are not required for survival, many of which have no apparent utility to the host organism. Frequently a given metabolite will only be produced in a single organism or in a set of closely related organisms. Provide a rich source of pharmacologically active compounds. e.g. shikimate derivatives, alkaloids, fatty acids, polyketides, isoprenoids – Although the boundary is imprecise the term biosynthesis is most commonly applied, by organic chemists, to the in vivo synthesis of secondary metabolites:

“Now ever since Perkin, failing to make quinine, founded the dyestuffs industry, organic chemists have found the study of ‘natural products’ an inexhaustable source of exercises, which can be performed out of pure curiosity even when paid for in the hope of a more commercial reward. As a result the organic chemist’s view of nature is unbalanced, even lunatic but still in some ways more exciting than that of the biochemist. While the enzymologist’s garden is a dream of uniformity, a green meadow where the cycles of Calvin and Krebs tick round in disciplined order, the organic chemist walks in an untidy jungle of uncouthly named extractives, rainbow displays of pigments, where in every bush there lurks the mangled shapes of some alkaloid, the exotic perfume of some new terpene, or some shocking and explosive polyacetylene...’’ ... Since these intriguing derivatives AND e.g. lysine or ATP are ALL in a sense ‘natural products’ we may prefer the term ‘secondary metabolite’ for the former Bu’Lock Adv. Appl. Microbiol. 1961, 3, 293 Primary Metabolism - Overview

Primary metabolism Primary metabolites Secondary metabolites

CO2 +H2O 1) 'light reactions': hv -> ATP and NADPH PHOTOSYNTHESIS 2) 'dark reactions': CO2 -> sugars (Calvin cycle)

HO O oligosaccharides HOHO polysaccharides HO nucleic acids (RNA, DNA) OH glycolysis glucose & other 4,5,6 & 7 carbon sugars

CO2 SHIKIMATE METABOLITES PO cinnamic acid derivatives CO2 aromatic compounds + HO HO OH lignans, f lavinoids PO O OH OH phosphoenol pyruvate erythrose-4-phosphate shikimate

ALKALOIDS aromatic amino acids penicillins CO 2 proteins cephalosporins aliphatic amino acids cyclic peptides O pyruvate tetrapyrroles () Citric acid cycle (Krebs cycle) saturated fatty acids FATTY ACIDS & POLYKETIDES SCoA SCoA unsaturated fatty acids prostaglandins lipids CO2 polyacetylenes O O aromatic compounds, polyphenols acetyl malonyl coenzyme A macrolides

CoAS O HO ISOPRENOIDS terpenoids CO2 O HO steroids carotenoids acetoacetyl coenzyme A mevalonate

For interesting animations’ of e.g. photosynthesis see: http://www.johnkyrk.com/index.html Biological/Biosynthetic Reactions – Enzyme & Cofactors

• Most biosynthetic steps are catalysed by specific, individual . They generally perform familiar processes such as oxidation, reduction, alkylation, , acylation, hydroxylation, elimination etc. • Different enzymes carrying out related reactions often employ common co-factors: small organic functional fragments and/or . e.g.

– FREE ENERGY RELEASING COUPLE: (ATP)

– C-C & C-O BOND FORMATION: Coenzyme A (CoASH); S-adenosyl (SAM); dimethylallylpyrophosphate (DMAPP); biotin

+ – OXIDATION: NAD(P) ; FAD/FMN; Haem iron oxo species (e.g. P450)

– REDUCTION: NAD(P)H; (FADH2/FMNH2)

– C-N BOND FORMATION: Pyridoxal Free Energy Releasing Couple -ATP

• Adenosine triphosphate (ATP) – of an alcohol by (ADP) is highly exothermic (i.e. liberates energy):

NH NH2 2 N N N N N N N HO O O O N HO O O P P P enzyme P P O O RO P OH OOOOOO + ROH OOOO + -1 OO ΔG° = -31 kJmol Mg2+ or Mn2+ PPPO- OHOH PPO- OHOH -OP adenosine adenosine triphosphate (ATP) ADP

– The phosphorylated alcohol (ROP) is then activated towards nucleophilic displacement:

O OH Nu + ROP R-Nu + OP OP = Pi = orthophosphate = P OO

– So, overall the endothermic process ROH + Y- → RY + OH- has been achieved by ‘coupling’ the process to the ‘hydrolysis of ATP’ – The situation is analogous to the use of tosylate to achieve nucleophilic displacement of an alcohol – In general, the exothermicity associated with phosphorylation shifts the equilibria of ‘coupled’ process by a factor of ~108 Acylation & C-C Bond Formation α to C=O – CoASH • Coenzyme A (CoASH) – Coenzyme A acts as an acyl transfer/α-carbon activation reagent by forming reactive acyl thioesters:

– Acyl CoA derivatives can act as nucleophiles or electrophiles depending on the circumstances – These modes of reactivity reflect inherent properties of thioesters: - - • The good leaving group ability of RS (cf. RO ) reflects: pKa (RSH) ~10 cf. pKa (ROH) ~16 • The high electrophilic character of a thioester carbonyl carbon (cf. normal esters) reflects the poor orbital overlap

between the lone pairs on (nS) [cf. nO] and the carbonyl anti bonding molecular orbital π*C=O

• The enhanced acidity of protons α to the carbonyl of thioesters (cf. normal esters) reflects the same poor nS ↔ π*C=O resonance: H B: R H R nX-π*C=O resonance makes carbonyl less susceptible to enolisation π*C=O R H H Sulfur is in the 2nd period C O H O OH + BH so its lone pair has poor size/energy match with the π* orbital nOp X C=O X Hence: pK (RCH COSR’) ~20 cf. RCH COOR’ ~25 X R' R' a 2 2 R' i.e. α to a thioester is similar to α to a ketone Methylation/Dimethylallylation – SAM & DMAPP

• S-Adenosyl methionine (SAM) – SAM acts as a versatile O-, C-, N-& S- methylating reagent in vivo

e.g. Nu NH2 Me SR2 N N enzyme Me O OMe N O C S N 2 Nu-Me Me Me O Me SR2 NH3 O O OH methionine OHOH Me SR2 S-adenosyl methionine (SAM) R2N R2NMe

– Equivalent to performing an SN2 methylation using MeI in the laboratory

• Dimethylallyl (DMAPP) – DMAPP acts a dimethylallylating reagent – the pyrophosphate (+ Mg2+/Mn2+) is an excellent leaving group

Nu

O P O P O enzyme Nu OOOO Mg2+ or Mn2+ dimethylallyl pyrophosphate (DMAPP)

– Equivalent to performing an SN2 allylation using allyl bromide in the laboratory Carboxylation – Biotin

• Biotin – Biotin in the presence of bicarbonate, ATP and Mg2+ enables nucleophile carboxylation in vivo:

O Mg Nu O O NHHN enzyme O H H O N NH NHR H H HCO Nu O S 3 NHR O ATP S 2+ biotin (R = carrier ) Mg O Oxidation – NAD+

- dinucleotide (NAD+) [and its phosphorylated analogue (NADP+)] are mediators of biological oxidation (e.g. alcohol to ketone oxidation) – In general, the couple NAD+/NADH is used by enzymes in catabolic oxidation (degradation) – The reagent is a stereospecific hydride acceptor:

stereospecific: O NH enzyme 2 O HS HR NH N N + H H H H H 2 NH 2 H + H N N N ROH RO RO O P O P O N - H OOOO O O O H O OH OH NADH H OH OR NH2 NH2 N N nicotinamide-adenine dinucleotide (NAD+) R = H NB. H = H + 2e + - + NB. NADP : R = PO3H NAD NADH

– Different enzymes show different absolute specificities but are generally specific for the pro-R or pro-S both for removal and delivery Oxidation – Flavins (FAD & FMN) • Flavin adenine dinucleotide (FAD) and (FMN) are also mediators of biological oxidations (e.g. dehydrogenations – alkane to alkene) – Unlike NAD+, which readily diffuses from enzyme to enzyme, FAD/FMN is usually tightly bound to a given enzyme, sometimes covalently

+ – Re-oxidation of the FADH2 back to FAD is generally by molecular (although NAD is also sometimes used). The intermediate peroxyflavin can also mediate hydroxylation, epoxidation & other oxygen transfer reactions (see next slide):

H H H O NO O N O O N O O O O NH HO N N HN HN N + HO OH N N N peroxide

FADH2 FAD peroxyflavin Oxidation Reactions Mediated by Flavins • Dehydrogenation by flavins – e.g. dehydrogenation of succinate → fumarate:

• Baeyer-Villiger-type oxidation by peroxyflavins – e.g. ketone monooxygenase:

H H O O O O NO O O O N HN N H H H N OO N OO O NO O HO HO N N N HN HN N N N N

H2O peroxyflavin NADPH + H + O2 FAD

NADP Oxidation – Haem Iron oxo Species (P450)

• Haem iron oxo species e.g. in P450 (a ubiquitous heam monooxygenase) are also mediators of biological oxidation (e.g. phenolic coupling, epoxidation, hydroxylation):

H H H N O N OH N N N N HO H V IV Fe Fe FeIII + N N N N N N

SEnz HO2C CO H SEnz SEnz 2 HO2C CO2H HO2C CO2H haem Fe(V) oxo species haem Fe(III) species

O2

– The ring acts as a tetradentate for the octahedral iron. The two axial positions are occupied by an enzyme amino acid ligand (typically a histidine nitrogen) and hydroxy/hydroperoxy residue respectively Reduction - NADPH

• Dihydro-nicotinamide-adenine dinucleotide phosphate (NADPH) [and its de-phosphorylated analogue (NADH)] are mediators of biological reduction (e.g. ketone to alcohol reduction) – In general, the couple NAPH/NADP+ is used by enzymes in anabolic reduction (biosynthesis) – The reagent is a stereospecific hydride donor:

stereospecific: O NH HS HR 2 enzyme O N NH2 N - H H H NH2 H H H H N N N O P O P O N RO RO ROH + H OOOO O O H O O OH OH NADP+ H OH OR NH2 NH2 - NB. H = H + 2e N N Hydro-nicotinamide-adenine dinucleotide (NADPH) R = PO3H + NB. NADH: R = H NADPH NADP

– As for the reverse process, different enzymes show different absolute specificities but are generally specific for the pro-R or pro-S hydrogens both for removal and delivery Transamination - PLP

( B6) → pyridoxal-5’-phosphate (PLP) – PLP forms imines (Schiffs bases) with primary amines. This forms the basis of in vivo transamination of α-ketoacids to give α-amino acids (& also racemisation/decarboxylation processes, see ‘alkaloids’)

Enz-NH3 Enz-NH2 H Enz-NH2 RCO RCO OH RCO2 2 2 Enz

OH NH3 RCO2 H N N N N HO OP OP H OP H OP H OP H O imine O O O O exchange O N Pyridoxine (vitamin B ) 6 N N N N N H H2O H H H H H RCO2 pyridoxamine imine bound as imine phosphate formation NH3

– The α-carbon protonation is stereospecific and gives the (S) configured chiral centre Primary Metabolism - Overview

For interesting animations’ of e.g. photosynthesis see: http://www.johnkyrk.com/index.html