<<

GENERAL ARTICLE

Retrosynthetic Analysis* Art of Planning

Bharati V Badami

Organic synthesis is a creative science involving the construc- tion of via chemical processes. Organic synthesis has the ability to produce new materials continuously, hence, bestow us with. The invention, discovery and development of new synthetic reactions, reagents and catalysts are collec- tively referred to as synthetic technology/methodology or meth- od-oriented synthesis. E J Corey formalized the concept of Bharati V Badami was a ‘’ for the of a large num- Professor of Organic ber of naturally occurring and bioactive molecules. Karnatak University, Dharwad. Her research interests are 1. Introduction synthesis, reactions and synthetic utility of sydnones. She is currently working on electrochemical and The social impact of organic synthesis can be understood when insecticidal/antifungal we recount its applications in everyday life – food, medicine, activities for some of these clothes, fuels, polymers, dyes, paints, cosmetics, perfumes, engi- compounds. neering and high technology materials and many more. The vir- tually unlimited kingdom of chemicals of unimaginable sizes and shapes is due to the creativity and brilliance of synthetic .

The most important frontiers of chemistry today are related to the synthesis and development of advanced materials with de- sired properties. Every few years, the discovery of a new mate- rial/ with unusual properties triggers chemical research in a big way. Hence, synthesis remains a major activity of chem- istry. Keywords Retrosynthesis, synthetic plan- ning, transformation, retrosyn- From a historical perspective, the science of organic synthesis thetic tree.

*DOI: https://doi.org/10.1007/s12045-019-0877-2

RESONANCE | October 2019 1071 GENERAL ARTICLE

saw its dawn in the 19th century, when its beginning was marked by the synthesis of urea by the German Friedrich Wöhler in 1828.

There is no record of the Landmark achievements in organic synthesis were made by R thought processes which B Woodward who is recognized as the greatest synthetic organic led to the realization of chemist and is known as the ‘Father of organic synthesis’. The un- successful synthesis of complex molecules. The precedented and brilliant contributions by R B Woodward sharply synthetic routes were defined the synthetic field and set the pace for what was to come developed by selecting during the 2nd half of the 20th century. The glorious Woodward an appropriate era (1950–1960) will remain a turning point for the art and sci- commercially available starting material, ence of . structurally resembling Milestones such as the total synthesis of quinine, cholesterol, cor- the desired molecule and building the other parts tisone, reserpine, chlorophyll, cephalosporin, vitamin B12 and of the molecule. strychnine were achieved by the brilliance of the synthetic chemists of that era. Woodward was awarded the Nobel Prize in 1965 for his ideas on the art of synthesis for the landmark synthesis of strychnine. The total synthesis of vitamin B12 completed in 1973, stands as a major achievement in organic synthesis. Along with these, the synthesis of penicillinV by Sheehan is historical. However, synthetic planning was not formulated on a systematic footing. The early design of synthetic routes for a vast number of simple and complex molecules was largely an intuitive operation. There is no record of the thought processes which led to the re- alization of successful synthesis of complex molecules. The syn- thetic routes were developed by selecting an appropriate com- mercially available starting material, structurally resembling the desired molecule and building the other parts of the molecule. But the difficulty in recognizing the available starting materials for many complex molecules called for a more systematic method for recognizing simpler molecules from which the required prod- uct could be obtained. The earlier strategy of Woodward et.al. was transformed by an- other great synthetic chemist E J Corey of Harvard University, who developed the theory and methodology of organic synthesis.

1072 RESONANCE | October 2019 GENERAL ARTICLE

According to Corey, the synthetic planning should start with the final product, and one could work backwards towards the simple starting materials. Corey coined the term ‘retrosynthetic anal- ysis’ for this methodology in 1957. It is a process of working backwards from the target molecule in order to devise a suitable synthetic route. These imaginary backward reactions are known as ‘antithetical reactions.’ One noteworthy example is the application of retrosynthetic strat- egy by Corey for the synthesis of the stereochemically compli- cated natural molecule – prostaglandin.

2. Retrosynthetic Analysis

The advent of retrosynthetic analysis constituted a major advance in the strategic planning of total synthesis of natural and complex compounds. Retrosynthetic analysis is a problem-solving technique for the synthesis of complex molecules. It is the art of planning organic synthesis by transforming the structure of the desired molecule to simple commercially available starting materials for its synthesis. E J Corey was awarded the Nobel Prize in 1990 for develop- ing newer synthetic methodologies leading to efficient synthetic routes, newer reactions and newer reagents which simplified the synthesis of prostaglandins and other complex molecules. Transformation of a molecule to its synthetic precursor is done by the imaginary disconnection of its bonds to progressively simple structures along a pathway which ultimately leads to simple or commercially available starting materials for the synthesis. At Retrosynthetic analysis each step, the availability of the intermediate is evaluated. is the exact reverse (antithetic) of a synthetic Retrosynthetic analysis is the exact reverse (antithetic) of a syn- reaction. However, the thetic reaction. However, the first notable example of a product first notable example of being transformed into its synthetic precursors was that of Robin- a product being transformed into its son’s tropinone synthesis. synthetic precursors was Tropinone was submitted to imaginary hydrolysis at the points that of Robinson’s tropinone synthesis.

RESONANCE | October 2019 1073 GENERAL ARTICLE

indicated by the dotted lines below and resolved into succinalde- hyde, methylamine and acetone.

The precursors were identified from the starting material, and then a suitable path was devised to convert these starting mate- rials into the target molecule using known reactions.

2.1 Terms and Definitions

1. Target molecule (TM): The molecule to be synthesized.

2. Retrosynthetic analysis: The process of imaginary break down of a molecule into progressively simpler starting materials. The reactions are viewed in the retrosynthetic direction i.e., starting with the product and going back to the reactants along a pathway that is reverse of a synthetic direction.

3. Disconnection: Imaginary bond cleavage corresponding to the reverse of a forward reaction leading to the immediate precursor. This is also known as transformation and is indicated by a wavy line.

4. Retrosynthetic arrow: Disconnection is represented by a dou- ble line closed arrow which indicates the transformation of the molecule into its immediate precursor.

1074 RESONANCE | October 2019 GENERAL ARTICLE

5. : Synthons are the imaginary fragments obtained by disconnection. The concept of bond polarity with the fragments is of prime importance during disconnection. Synthons are not real compounds but are idealized ionic or neutral fragments, and they are not reagents.

The following reaction shows a concerted cycloaddition reaction, where the synthons are neutral fragments.

6. Retron: Each reaction generates a characteristic structural ele- ment in the product, such as the enone resulting from aldol con- densation. This substructure, called the retron, must be present in a target molecule to be able to apply the corresponding trans- formation to that target.

7. Reagents: These are the actual source of the synthons.

+ ≡ + + ≡ + ≡ + NO2 HNO3 H2SO4;Br Br2;Cl Cl2 AlCl3

RESONANCE | October 2019 1075 GENERAL ARTICLE

B) Alkyl – a) Alkyl cation (carbenium )−

+ − H + H2O + i) R − OH −→ R − O H2 −→ R Alcohol Alcl3 + ii) R − X −→ R + XAlCl¯ 3 Alkyl halide

b) Alkyl anion ()−

+ R − M −→ R¯ + M organometallic compound carbanion

C) Acyl ions a) Acyl cations

+ H + (i) R − CO − CH2 − X −→ R − CHO C H2 + HX (X = Cl or Br) Acyl halide −X¯ Acyl cation + H + (ii) R − CO − CH = CH2 −→ RCOC H − CH3 Enone Acyl cation

b) Acyl anion

Removal of a proton from a methylene group adjacent to a electron- withdrawing group.

(i) − −→Base − ¯ R COCH3 + R COCH2 methyl keystones −H  Base  (ii) RCH2 − COOR −→ RCH¯ − COOR −H+ Base (iii) RCH2 − NO2 −→ RCH¯ − NO2 −H+

D) Ethoxy anion E) Aldehyde carbonyl as a cation and anion

The aldehyde carbonyl carbon is electron deficient and undergoes nucleophilic attack.

1076 RESONANCE | October 2019 GENERAL ARTICLE

The aldehyde carbonyl carbon can also undergo electrophilic at- tack when it is converted into an anion. The polarity of the elec- tron – deficient aldehyde carbonyl carbon can be reversed and this is known as ‘’.

8) Retrosynthetic tree

It is a complex pattern of several or all possible retrosynthesis of a Retrosynthetic analysis single compound. Retrosynthetic analysis of a molecule may lead of a molecule may lead to the possibility of identifying several different starting materials to the possibility of ff identifying several and several di erent routes to the synthesis of a molecule. Each different starting structure derived from a disconnection becomes a TM itself for materials and several further analysis. different routes to the synthesis of a molecule. The analysis can be repeated for each precursor, generating a sec- ond level of precursors. Each precursor generated is checked for its availability. Such repeated disconnections give an outline of the available routes for the TM, and this is known as the retrosyn- thetic tree (Figure 1). A retrosynthetic tree is a complex pattern with many branches which lead to different routes and different synthetic precursors. The synthetic tree is a graph of several synthetic routes to the TM which can be designed and evaluated, from which an efficient

RESONANCE | October 2019 1077 GENERAL ARTICLE

Figure 1. Schematic depic- tion of a retrosynthetic tree.

method is chosen. The synthetic plan is written by choosing an appropriate route according to the analysis and adding reagents and reaction conditions. The synthetic strategy was further sim- plified by the use of computers.

Corey was the first organic chemist to explore the possibility of use of computers in simplifying synthesis. Computer programs have been categorized into two main types: Passive program: It is the use of computerized libraries which identify the structural features of the target molecule i.e., an aro- matic ring, an acyclic ring, an alkyl group, etc. It is a collection of data from which we can locate all the compounds that contain a specified substructure corresponding to the TM. A huge collec- tion of organic reactions is also available for use. Active program: A menu is displayed with different strategies for retrosynthetic analysis.

2.2 Guidelines or Empirical Rules (Heuristics) to a Proper Dis- connection

1) Disconnection should correspond to a known and reliable reac- tion. So, a thorough knowledge of reactions is necessary (Figure 2). 2) Disconnect C–X bond.

1078 RESONANCE | October 2019 GENERAL ARTICLE

Figure 2. Route A is a good disconnection as acetylation of an amino group is a known reaction to get the N-acetyl group. Routes B and C route disconnections are improper as they do not lead to known reactions.

These are the known reactions where alkyl halides are obtained by nucleophilic attack by a halide whereas the halogen is an elec- trophile in the preparation of arylhalides. 3) For compounds comprising of two parts joined by a heteroatom, disconnection is carried out next to the heteroatom. The molecule is disconnected in the middle for greater simplification.

The aryloxyacetic acid can be readily obtained by reaction of a phenol and chloroacetic acid in presence of NaOH. So, the proper disconnection is at the ‘b’ site. The amide can be obtained from an acid chloride and an amine, so, the proper disconnection is at the a site. 4) In open-chain compounds, the disconnection at the heteroatom

RESONANCE | October 2019 1079 GENERAL ARTICLE

is also based on the stability of the synthons.

Both these disconnections are next to the heteroatom and also in the middle to give equal size synthons. But route ‘a’ is preferred as it gives a more stable secondary cation.

2.3 Group Oriented Strategy: Manipulation of the Functional Groups

Whenever disconnection does not lead to reliable reactions, then the following changes have to be carried out with the functional groups – FGI, FGA, FGR – and the unmasking of the latent func- tional groups by deprotection or other conversions. Manipulation of functional groups can lead to significant reductions in molecu- lar complexity. a) Interconversion (FGI):

FGI is substituting functional group for another when the discon- nection of the group does not lead to a proper precursor. In the following reaction, disconnection is not possible because there is no corresponding reaction to introduce group X directly on the benzene ring. Hence, it has to be converted to group Y by FGI as group Y can be disconnected to give the appropriate starting materials. Aniline on bromination gives 2, 4, 6-tribromoaniline so the group interconversions has to be carried. Amino group is readily ob- tained by the reduction of nitro group. So, in the above reaction, the first step is the functional group interconversion followed by disconnection.

1080 RESONANCE | October 2019 GENERAL ARTICLE

Preparation of p-bromoaniline from aniline is shown below.

The synthetic route for the target molecule is the exact reverse of these retro steps. b) Functional Group Addition (FGA):

Some functional groups need the addition of a group to the im- mediate precursor suitable for disconnection.

An example is the facile dehydration of β–hydroxy ketone that yields α, β–unsaturated ketone. c) Functional Group Removal (FGR): A functional group has

RESONANCE | October 2019 1081 GENERAL ARTICLE

to be removed to get a precursor required for the target molecule. In the following example, the precursor is a compound contain- ing an olefinic double bond so the two hydroxyl groups are to be removed. The target molecule caprolactam is obtained by Beckmann rear- rangement from the lactam. So, the first retro step is the addition of a keto group to obtain a lactam. This reaction involves both FGR and FGI.

Box 1 shows the list of some common functional groups which cannot be introduced directly on the benzene ring and have to undergo functional group inversion. However, simple compounds containing these groups which are commercially available can be used for the subsequent reactions.

2.4 Disconnection in Disubstituted Benzene Derivatives

Disconnection in dissubstituted benzene derivatives is based on the relative positions of the groups, their reactivity and orienta- tion. The order of events in the retrosynthetic analysis, based on the orientation and reactivity of the groups is illustrated by the following examples.

1082 RESONANCE | October 2019 GENERAL ARTICLE

Box 1. Groups that have to undergo functional group inversion.

X Y Reaction Reagent

-NH2 -NO2 Reduction Sn / HCl

-COOH2 -CH3 Oxidation kMnO4,CrO3,&H2SO4 -CN Hydrolysis NaOH

-OH -NH2 Diazotisation NaNO2 + HCl + & heating with H3O

-OH -NH2 Diazotisation NaNO2 + HCl & heating with CuCN

-CN -NH2 Diazotisation NaNO2 + HCl & coupling with CuCN

-COCl COOH Reaction with SOCl2 .

-CONH2 COCl Reaction with NH3

-CHO -CH2Cl Oxidation Hexamine -COOR -COOH Esterification ROH & H+

4–isopropyl acetophenone can be disconnected in two ways – A and B. Both, the acetyl and the alkyl groups can be introduced by Friedel– Crafts reaction. The disconnection depends on the reactivity of the isopropyl and acetyl groups and their orientation for the in- coming group. Disconnection B does not lead to proper precursors as the acetyl group deactivates the ring towards Friedel–Crafts alkylation re- action and it is a meta-orienting group. Disconnection A is a re-

RESONANCE | October 2019 1083 GENERAL ARTICLE

Alcohols can be treated liable reaction as the isopropyl group activates the ring towards as central functional Friedel–Crafts reaction and is para-orienting. The synthesis de- groups as they can be signed based on this retrosynthetic analysis is alkylation followed converted into various other functional groups. by acetylation. Alcohols are the starting materials for many functionalized aliphatic compounds.

2.5 Interconvertible Functional Groups

The functional groups are easy to inter convert if the carbon skele- ton remains unchanged. For e.g., the carboxylic group can be used to obtain various functional groups as follows:

Similarly, alcohols can be treated as central functional groups as they can be converted into various other functional groups. Alco- hols are the starting materials for many functionalized aliphatic compounds.

1084 RESONANCE | October 2019 GENERAL ARTICLE

3. Tips for Synthesis Based on Retrosynthetic Analysis

An efficient synthesis should be planned based on the following conditions:

• Advantages and disadvantages of the route.

• Availability of the starting materials for each step.

• Type of reactions: Less competent reactions should be involved in the synthesis. The reaction conditions should be simple and high yielding processes. They should work out on industrial scale. Yield at each step is to be considered.

• Safety: Dangerous chemicals are best avoided when planning a synthesis as they require more expensive equipment to ensure safe handling and containment. More important is they pose more danger to the person handling them. This is an important consideration for large scale preparations.

• Length and cost of each synthetic step: Minimum number of steps would save time. A is preferred over the linear method. A lengthy scheme can be selected when the starting materials are inexpensive and readily available and easy experimental conditions are involved. Expensive starting materi- als are used when the number of steps are less.

• Starting materials for industrial syntheses: Starting compounds of linear skeleton upto six carbon with one functional

group like – COOH, CHO, OH, X, NH2, etc., are commercially available. Small molecules like simple aromatic compounds, , monomers for plastics, etc., are commercially available.. Naturally occurring compounds like amino acids, simple sugars and other plant materials are preferred.

RESONANCE | October 2019 1085 GENERAL ARTICLE

4. Conclusion

The aim of retrosynthetic analysis to not only simplify but also aid the discovery of new synthetic routes and compare them for the development of an efficient synthetic strategy for a complex molecule.

Address for Correspondence Dr Bharati V Badami Suggested Reading H.No. 80, (Upstairs) 1st Main, 3rd Cross, [1] S Warren, Designing Organic Synthesis, Introduction to Approach, Narayanpur John Wiley & Sons, NY, 1978. Dharwad 580 008, India. [2] S Warren, Organic Synthesis, The Disconnection Approach, John Wiley & Email: Sons, NY, 1983. [email protected] [3] R Robinson, J.Chem.Soc., Vol.111, 762, 1917. [4] E J Corey and W T Wipke, Science, 166, 178, 1969.

1086 RESONANCE | October 2019