<<

1 : Importance, General Structure, and Biosynthesis

1.1 Term and Significance The term terpenes originates from turpentine (lat. balsamum terebinthinae). Tur- pentine, the so-called "resin of pine trees", is the viscous pleasantly smelling bal- sam which flows upon cutting or carving the bark and the new wood of several pine tree species (Pinaceae). Turpentine contains the "resin acids" and some hydrocar- bons, which were originally referred to as terpenes. Traditionally, all natural com- pounds built up from isoprene subunits and for the most part originating from plants are denoted as terpenes 1 (section 1.2). Conifer wood, balm trees, citrus fruits, coriander, eucalyptus, lavender, lemon grass, lilies, carnation, caraway, peppermint species, roses, rosemary, sage, thyme, violet and many other plants or parts of those (roots, rhizomes, stems, leaves, blos- soms, fruits, seed) are well known to smell pleasantly, to taste spicy, or to exhibit specific pharmacological activities. Terpenes predominantly shape these properties. In order to enrich terpenes, the plants are carved, e.g. for the production of incense or myrrh from balm trees; usually, however, terpenes are extracted or steam dis- tilled, e.g. for the recovery of the precious oil of the blossoms of specific fragrant roses. These extracts and steam distillates, known as ethereal or essential oils ("es- sence absolue") are used to create fine perfumes, to refine the flavor and the aroma of food and drinks and to produce medicines of plant origin (phytopharmaca). The biological and ecochemical functions of terpenes have not yet been fully inves- tigated. Many plants produce volatile terpenes in order to attract specific insects for pollination or otherwise to expel certain animals using these plants as food. Less volatile but strongly bitter-tasting or toxic terpenes also protect some plants from being eaten by animals (antifeedants). Last, but not least, terpenes play an impor- tant role as signal compounds and growth regulators (phytohormones) of plants, as shown by preliminary investigations. Many insects metabolize terpenes they have received with their plant food to growth hormones and pheromones. Pheromones are luring and signal compounds (sociohormones) that insects and other organisms excrete in order to communicate with others like them, e.g. to warn (alarm pheromones), to mark food resources and their location (trace pheromones), as well of assembly places (aggregation phero- mones) and to attract sexual partners for copulation (sexual pheromones). Harmless to the environment, pheromones may replace conventional insecticides to trap harmful and damaging insects such as bark beetles.

Terpenes. Eberhard Breitmaier. Copyright © 2006 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim ISBN: 3-527-31786-4 2 1 Terpenes: Importance, General Structure, and Biosynthesis

1.2 General Structure: The Isoprene Rule 2-7 About 30 000 terpenes are known at present in the literature . Their basic struc- ture follows a general principle: 2-Methylbutane residues, less precisely but usually also referred to as isoprene units, (C5)n , build up the carbon skeleton of terpenes; 1 this is the isoprene rule found by RUZICKA and WALLACH (Table 1). Therefore, terpenes are also denoted as isoprenoids. In nature, terpenes occur predominantly as , alcohols and their glycosides, ethers, aldehydes, ketones, carbox- ylic acids and esters.

Table 1. Parent hydrocarbons of terpenes (isoprenoids).

tail C5 head Hemi- 2-Methylbutane 2-Methyl-1,3-butadiene (Isoprene)

tail C10 head Mono- 2,6-Dimethyloctane

C15 Sesqui- 2,6,10-Trimethyldodecane (Farnesane)

C20 Di- 2,6,10,14-Tetramethylhexadecane ()

tail C25 head Sester- 2,6,10,14,18-Pentamethylicosane

tail C30 tail Tri- 2,6,10,15,19,23-Hexamethyltetracosane (Squalane)

tail C40 tail Tetra- ψ,ψ-

(C5)n n Poly- all-trans-Polyisoprene (Guttapercha) terpenes

1.3 Biosynthesis 3

Depending on the number of 2-methylbutane (isoprene) subunits one differentiates between hemi- (C5), mono- (C10), sesqui- (C15), di- (C20), sester- (C25), tri- (C30), tetraterpenes (C40) and polyterpenes (C5)n with n > 8 according to Table 1. The isopropyl part of 2-methylbutane is defined as the head, and the ethyl residue as the tail (Table 1). In mono-, sesqui-, di- and sesterterpenes the isoprene units are linked to each other from head-to-tail; tri- and tetraterpenes contain one tail-to-tail connection in the center.

1.3 Biosynthesis Acetyl-coenzyme A, also known as activated acetic acid, is the biogenetic precursor 9-11 of terpenes (Figure 1) . Similar to the CLAISEN condensation, two equivalents of acetyl-CoA couple to acetoacetyl-CoA, which represents a biological analogue of acetoacetate. Following the pattern of an aldol reaction, acetoacetyl-CoA reacts with another equivalent of acetyl-CoA as a carbon nucleophile to give β-hydroxy-β- methylglutaryl-CoA, followed by an enzymatic reduction with dihydronicotinamide adenine dinucleotide (NADPH + H+) in the presence of water, affording (R)- mevalonic acid. Phosphorylation of mevalonic acid by adenosine triphosphate (ATP) via the monophosphate provides the diphosphate of mevalonic acid which is decarboxylated and dehydrated to isopentenylpyrophosphate (isopentenyldiphos- phate, IPP). The latter isomerizes in the presence of an isomerase containing SH groups to γ,γ-dimethylallylpyrophosphate. The electrophilic allylic CH2 group of γ,γ-dimethylallylpyrophosphate and the nucleophilic methylene group of isopen- tenylpyrophosphate connect to geranylpyrophosphate as . Subsequent reaction of geranyldiphosphate with one equivalent of isopentenyldiphosphate yields farnesyldiphosphate as a (Fig. 1).

NH H NH2 2 H N N CONH2 N N O O N O O O N N O P O P O N HO P O P O P O N O O O OH OH OH OH OH O HO OH O HO O P OH HO O P OH OH OH Dihydro nicotinamide adenine dinucleotide phosphate Adenosine tri phosphate (ATP) + (NADPH + H ) 4 1 Terpenes: Importance, General Structure, and Biosynthesis

activated acetic acid biological CLAISEN- O CH3 O CH acetoacetyl-CoA condensation 3 H C (biological aceto- CoAS + CoAS O CoAS O acetic acid ester)

nucleophile electrophile O biological aldol reaction + H + H2O − HSCoA SCoA

HO CH3 O β-hydroxy-β-methyl-glutaryl-CoA HO C 2 SCoA

+ (NADPH + H ) − HSCoA

HO CH3 HO C (R)-mevalonic acid 2 OH

(ATP)

HO CH 3 O O Mevalonic acid diphosphate HO C 2 OPP PP = POPOH OH OH − CO2, − H2O

γ,γ-dimethylallyl- (Isomerase) isopentenyl- pyrophosphate OPP OPP pyrophosphate (activated isoprene)

− HOPP

geranylpyrophosphate OPP (monoterpene)

+ , − HOPP OPP

farnesylpyrophosphate OPP (sesquiterpene)

Figure 1. Scheme of the biogenesis of mono- and . 1.3 Biosynthesis 5

However, failing incoporations of 13C-labeled acetate and successful ones of 13C- labeled glycerol as well as pyruvate in hopanes and ubiquinones showed isopen- tenyldiphosphate (IPP) to originate not only from the acetate mevalonate pathway, but also from activated acetaldehyde (C2, by reaction of pyruvate and thiamine 12 diphosphate) and glyceraldehyde-3-phosphate (C3) . In this way, 1-deoxy- pentulose-5-phosphate is generated as the first unbranched C5 precursor of IPP.

OO HO N H C N CO2H N − CO2 C HO C H O C + S S S H3C CH3 CH3 activated pyruvic acid thiamine H O H acetaldehyde z diphosphate + z OH triosediphosphate (glyceraldehyde diphosphate) z OPP CH3 N N CH3 HO H O − O S z H O z S H O z zzOPP z OH z OH OOH z z OPP z OPP zzOPP 1-deoxypentulose-5-phosphate isopentenyldiphosphate (IPP)

()

2 F-PP , - HOPP, tail to tail linkage

farnesylpyrophosphate F-PP OPP (sesquiterpene)

+ − HOPP OPP

geranylgeranylpyrophosphate GG-PP OPP (diterpene)

2 GG-PP , - HOPP, tail to tail linkage

16-trans-phytoene (tetraterpene, a carotenoide) Figure 2. Scheme of the biogenesis of di-, tri- and tetraterpenes. 6 1 Terpenes: Importance, General Structure, and Biosynthesis

Geranylgeranylpyrophosphate as a diterpene (C20) emerges from the attachment of isopentenylpyrophosphate with its nucleophilic head to farnesylpyrophosphate with its electrophilic tail (Fig. 2). The formation of sesterterpenes (C25) involves an addi- tional head-to-tail linkage of isopentenylpyrophosphate (C5) with geranylgeranyl- pyrophosphate (C20). A tail-to-tail connection of two equivalents of farnesylpyro- phosphate leads to squalene as a triterpene (C30, Fig. 2). Similarly, tetraterpenes such as the 16-trans-phytoene originate from tail-to-tail dimerization of geranylgeranylpyrophosphate (Fig. 2). The biogeneses of cyclic and polycyclic terpenes 9,10 are usually assumed to involve intermediate carbenium ions, but evidence for this in vivo was given only in some specific cases. In the simple case of monocyclic such as the allylic cation remaining after separation of the pyrophosphate anion cyclizes to a cyclohexyl cation which is deprotonated to (R)- or (S)-limonene.

+ − OPP− − H

OPP geranylpyrophosphate (R+S)-limonene

The non-classical version of the intermediate carbenium ion (also referred to as a carbonium ion) resulting upon dissociation of the pyrophosphate anion from farne- sylpyrophosphate explains the cyclization to several cyclic carbenium ions 8, as demonstrated for some sesquiterpenes (Fig. 3). Additional diversity arises from 1,2- hydride and 1,2-alkyl shifts (WAGNER-MEERWEIN rearrangements) and sigmatropic reactions (COPE rearrangements) on the one hand, and on the other hand from the formation of diastereomers and enantiomers provided that the cyclizations generate new asymmetric carbon atoms (Fig. 3) 8-10. Thus, the non-classical carbenium ion arising from dissociation of the diphosphate anion from farnesylpyrophosphate permits formation of the monoyclic sesqui- terpenes humulatriene and germacratriene after deprotonation (Fig.3). A COPE re- arrangement of germacratriene leads to elematriene. Protonation of germacratriene following MARKOWNIKOW orientation initially provides the higher alkylated and therefore more stable carbenium ion which undergoes 1,2-hydride shifts resulting in bicyclic carbenium ions with an eudesmane or guaiane skeleton. Subsequent deprotonations yield diastereomeric eudesmadienes and guajadienes. Finally, eu- desmanes may rearrange to eremophilanes involving 1,2-methyl shifts (Fig. 3). 1.3 Biosynthesis 7

OPP farnesylpyrophosphate non-classical carbenium Ion 1,3,11-elematriene

COPE rearrangement

− H+ − H+

2,6,9-humulatriene 1(10),4,11-germacratriene

+ H+

*

* 1,2-hydride shift

1,2-methyl shift

1(10),11-guajadiene 1,11-eudesmadiene 1,11-eremophiladiene Figure 3. Biogenesis of some mono- and bicyclic sesquiterpenes from farnesylpyrophosphate.

A simliar cyclization generates the 14-membered skeleton of cembrane from which other polycyclic diterpenes are derived. 3,7,11,15-Cembratetraene, better known as cembrene A, emerges directly from geranylgeranylpyrophosphate (Fig. 2) invol- ving the 1,14-cyclization of the resulting allylic cation 9,10.

17 16 15 OPP 1 13 + 20 18 − H 11 3 19 9 5 7 geranylgeranyl- 3,7,11,15-cembratetraene pyrophosphate (cembrene A) 8 1 Terpenes: Importance, General Structure, and Biosynthesis

The biogenesis of pimarane, the parent compound of many polycyclic diterpenes, is assumed to arise from iso-geranylgeranylpyrophosphate 9,10. After dissociation of the pyrophosphate anion, the remaining acyclic allylic cation undergoes a 1,3- sigmatropic hydrogen shift and thereby cyclizes to a monocyclic carbenium ion which, itself, isomerizes to the ionic precursor of the pimarane skeleton.

OPP

1,3-sigmatropic − − OPP hydrogen shift

iso-geranylgeranyl- pyrophosphate

1,2-hydride shift

pimarane skeleton

Table 2. Isoprenoids

polyterpenes (polyprenes) [C5]n h : head t : tail (h - t) insertion into other natural compounds natural compounds containing hemiterpenes C5 hemiterpenes (e.g. ester alkaloids, lysergic acid) + C (h - t) 5 insertion into other natural compounds e.g. cannabinoids, indole alkaloids monoterpenes C10

(h - t) + C5 2x sesquiterpenes C C15 (t - t) 30 C27 C24 + C5 (h - t) C21 2x diterpenes C C20 (t - t) C40 19 C (h - t) tetraterpenes 18 + C5 () sesterterpenes C25

1.3 Biosynthesis 9

2,3-Epoxysqualene has been shown by isotope labeling to be the biogenetic precur- sor of tetracyclic triterpenes with perhydrocyclopenta[a] as the basic skeleton (also referred to as gonane or sterane). Steroids 13 are derived from these tetracyclic triterpenes. These include (C27), (C21), andro- stanes (C19) with trans fusion of the rings A and B (5α), (C18) with a ben- zenoid ring A (estra-1,3,5-triene; Fig. 4) 9,10 as well as cholic acid and its deriva- tives (C24) with cis fusion of the rings A and B (5β). The biogenetic origins of tetra- cyclic triterpenes and steroids are summarized in Table 2.

triterpenes with gonane skeleton

21 H 23

11 13 17 25 30 27 O 2,3-epoxysqualene 19 9 15 21 H (protonated) 1 18 23 3 5 H 7 18

HO 13 25 H 19 29 28 dammarane-3β-ol 9 H 14 27 10H 8 30 HO H

C30 : lanostane-3β-ol

21

18 23 21 13 27 19 9 H 14 18 25 10 8 13 18 5 H H 19 9 H 14 HO 10 8 13 H 5 H H 19 9 H 14 18 C27 : 5α-- HO H 10H 8 H 13 3β-ol 5 9 H 14 C : 5 -- 21 α HO 10 8 3β-ol H 5 H H C19 : 5α-- HO 21 3 -ol OH 24 β C18 : estra-1,3,5(10)-triene- 22 18 CO2H 23 3-ol 13 19 9 H 14 CD S t e r o i d s 10 8 5 H H A B HO OH H C24 : 3α,7α,12α-trihydroxy- basic skeleton of steroids: 5β-cholanic acid (cholic acid) gonane = perhydrocyclopenta[a]phenanthrene

Figure 4. Biogenetic origin of steroids.