chapter
Carbohydrates and Glycobiology 9
Carbohydrates are the most abundant biomolecules on Earth. Each year, photosynthesis converts more than 100 billion metric tons of CO2 and H2O into cellulose and other plant products. Certain carbohydrates (sugar and starch) are a dietary staple in most parts of the world, and the oxidation of carbohydrates is the central energy-yielding pathway in most nonphoto- synthetic cells. Insoluble carbohydrate polymers serve as structural and protective elements in the cell walls of bacteria and plants and in the con- nective tissues of animals. Other carbohydrate polymers lubricate skeletal joints and participate in recognition and adhesion between cells. More com- plex carbohydrate polymers covalently attached to proteins or lipids act as signals that determine the intracellular location or metabolic fate of these hybrid molecules, called glycoconjugates. This chapter introduces the major classes of carbohydrates and glycoconjugates and provides a few ex- amples of their many structural and functional roles. Carbohydrates are predominantly cyclized polyhydroxy aldehydes or ketones, or substances that yield such compounds on hydrolysis. Many, but not all, carbohydrates have the empirical formula (CH2O)n; some also con- tain nitrogen, phosphorus, or sulfur. There are three major size classes of carbohydrates: monosaccharides, oligosaccharides, and polysaccharides (the word “saccharide” is derived from the Greek sakcharon, meaning “sugar”). Monosaccharides, or sim- ple sugars, consist of a single polyhydroxy aldehyde or ketone unit. The most abundant monosaccharide in nature is the six-carbon sugar D-glucose, sometimes referred to as dextrose. Oligosaccharides consist of short chains of monosaccharide units, or residues, joined by characteristic linkages called glycosidic bonds. The most abundant are the disaccharides, with two monosaccharide units. Typical is sucrose, or cane sugar, which consists of the six-carbon sugars D-glucose Cotton bolls ready for harvest. The cotton fiber consists of and D-fructose. All common monosaccharides and disaccharides have cellulose, a polysaccharide with physical and chemical names ending with the suffix “-ose.” In cells, most oligosaccharides having properties that give cotton textiles their desirable qualities. three or more units do not occur as free entities but are joined to nonsugar The cotton plant has been cultivated and used in textile production for thousands of years. molecules (lipids or proteins) in glycoconjugates. Sugar polymers occur in a continuous range of sizes. Those containing more than about 20 monosaccharide units are generally called polysac- charides. Polysaccharides may have hundreds or thousands of monosac- charide units. Some polysaccharide molecules, such as cellulose, are linear chains, whereas others, such as glycogen, are branched chains. The plant products starch and cellulose both consist of recurring units of D-glucose, but they differ in the type of glycosidic linkage, and consequently have strikingly different properties and biological roles.
293 294 Part II Structure and Catalysis
Monosaccharides and Disaccharides The simplest of the carbohydrates, the monosaccharides, are either alde- hydes or ketones with one or more hydroxyl groups; the six-carbon mono- saccharides glucose and fructose have five hydroxyl groups. The carbon atoms to which hydroxyl groups are attached are often chiral centers, which give rise to the many sugar stereoisomers found in nature. We begin by describing the families of monosaccharides with backbones of three to seven carbons—their structure and stereoisomeric forms, and the means of representing their three-dimensional structures on paper. We then discuss several chemical reactions of the carbonyl groups of monosaccharides. One such reaction, the addition of a hydroxyl group from within the same mole- cule, generates the cyclic forms of five- and six-carbon sugars (which pre- dominate in aqueous solution) and creates a new chiral center, adding further stereochemical complexity to this class of compounds. The nomen- clature for unambiguously specifying the configuration about each carbon atom in a cyclic form and the means of representing these structures on pa- per are therefore described in some detail; these will be useful later in our discussion of the metabolism of monosaccharides. We also introduce here some important monosaccharide derivatives that will be encountered in later chapters.
The Two Families of Monosaccharides Are Aldoses and Ketoses Monosaccharides are colorless, crystalline solids that are freely soluble in water but insoluble in nonpolar solvents. Most have a sweet taste. The back- bones of common monosaccharide molecules are unbranched carbon chains in which all the carbon atoms are linked by single bonds. In the open- chain form, one of the carbon atoms is double-bonded to an oxygen atom to form a carbonyl group; each of the other carbon atoms has a hydroxyl group. If the carbonyl group is at an end of the carbon chain (i.e., in an alde- hyde group) the monosaccharide is an aldose; if the carbonyl group is at any other position (in a ketone group) the monosaccharide is a ketose. The simplest monosaccharides are the two three-carbon trioses: glyceralde- hyde, an aldotriose, and dihydroxyacetone, a ketotriose (Fig. 9–1a). Monosaccharides with four, five, six, and seven carbon atoms in their backbones are called, respectively, tetroses, pentoses, hexoses, and hep- toses. There are aldoses and ketoses of each of these chain lengths: al- dotetroses and ketotetroses, aldopentoses and ketopentoses, and so on. figure 9–1 The hexoses, which include the aldohexose D-glucose and the ketohexose Representative monosaccharides. (a) Two trioses, an D-fructose (Fig. 9–1b), are the most common monosaccharides in nature. aldose and a ketose. The carbonyl group in each is The aldopentoses D-ribose and 2-deoxy-D-ribose (Fig. 9–1c) are compo- shaded. (b) Two common hexoses. (c) The pentose nents of nucleotides and nucleic acids (Chapter 10). components of nucleic acids. D-Ribose is a component of ribonucleic acid (RNA), and 2-deoxy-D-ribose is a component of deoxyribonucleic acid (DNA).
H O H C H C OH H O H O H C OH C O C C H O H C H C OH HO C H HO C H H C OH CH2 H C OH C O H C OH H C OH H C OH H C OH A A HCOH HCOH H C OH H C OH H C OH H C OH
H H CH2OH CH2OH CH2OH CH2OH Glyceraldehyde, Dihydroxyacetone, D-Glucose, D-Fructose, D-Ribose, 2-Deoxy-D-ribose, an aldotriose a ketotriose an aldohexose a ketohexose an aldopentose an aldopentose (a) (b) (c) Chapter 9 Carbohydrates and Glycobiology 295
Monosaccharides Have Asymmetric Centers All the monosaccharides except dihydroxyacetone contain one or more asymmetric (chiral) carbon atoms and thus occur in optically active iso- meric forms (pp. 58–61). The simplest aldose, glyceraldehyde, contains one chiral center (the middle carbon atom) and therefore has two different op- tical isomers, or enantiomers (Fig. 9–2). By convention, one of these two forms is designated the D isomer, the other the L isomer. As for other biomolecules with chiral centers, the absolute configurations of sugars are known from x-ray crystallography. To represent three-dimensional sugar structures on paper, we often use Fischer projection formulas (Fig. 9–2).
Mirror CHO CHO HCOH HO C H
CH2OH CH 2OH CHO CHO D-Glyceraldehyde L-Glyceraldehyde
Fischer projection formulas H OH H OH
CHO CHO H C OH HO C H CH2OH CH OH 2 CH 2OH CH2OH D-Glyceraldehyde L-Glyceraldehyde
Ball-and-stick models Perspective formulas
figure 9–2 Three ways to represent the two stereoisomers of glycer- aldehyde. The stereoisomers are mirror images of each other. Ball-and-stick models show the actual configuration n of molecules. By convention, in Fischer projection for- In general, a molecule with n chiral centers can have 2 stereoisomers. mulas, horizontal bonds project out of the plane of the 1 Glyceraldehyde has 2 2; the aldohexoses, with four chiral centers, have paper, toward the reader; vertical bonds project behind 24 16 stereoisomers. The stereoisomers of monosaccharides of each car- the plane of the paper, away from the reader. Recall (see bon chain length can be divided into two groups that differ in the configu- Fig. 3–7) that in perspective formulas, solid wedge- ration about the chiral center most distant from the carbonyl carbon. shaped bonds point toward the reader, dashed wedges point away. Those in which the configuration at this reference carbon is the same as that of D-glyceraldehyde are designated D isomers, and those with the same configuration as L-glyceraldehyde are L isomers. When the hydroxyl group on the reference carbon is on the right in the projection formula, the sugar is the D isomer; when on the left, it is the L isomer. Of the 16 possible aldo- hexoses, eight are D forms and eight are L. Most of the hexoses of living or- ganisms are D isomers. Figure 9–3 shows the structures of the D stereoisomers of all the al- doses and ketoses having three to six carbon atoms. The carbons of a sugar are numbered beginning at the end of the chain nearest the carbonyl group. Each of the eight D-aldohexoses, which differ in the stereochemistry at C-2, C-3, or C-4, has its own name: D-glucose, D-galactose, D-mannose, and so forth (Fig. 9–3a). The four- and five-carbon ketoses are designated by Three carbons Four carbons Five carbons
H O H O H O H O H O H O C C C C H O C C H C OH HO C H H C OH HO C H C H C OH HO C H HOHC HOHC HO C H HO C H H C OH H C OH H C OH H C OH H C OH H C OH H C OH
CH2OH CH2OH CH2OH CH2OH CH2OH CH2OH CH2OH
D-Glyceraldehyde D-Erythrose D-Threose D-Ribose D-Arabinose D-Xylose D-Lyxose
Six carbons
H O H O H O H O H O H O H O H O C C C C C C C C H C OH HO C H H C OH HO C H H C OH HO C H H C OH HO C H H C OH H C OH HO C H HO C H H C OH H C OH HO C H HO C H HOHC HOHC HOHC H C OH HO C H HO C H HOC H HO C H H C OH H C OH H C OH H C OH H C OH H C OH H C OH H C OH
CH2OH CH2OH CH2OH CH2OH CH2OH CH2OH CH2OH CH2OH
D-Allose D-Altrose D-Glucose D-Mannose D-Gulose D-Idose D-Galactose D-Talose
D-Aldoses (a) figure 9–3 The series of (a) D-aldoses and (b) D-ketoses having from Three carbons Four carbons Five carbons three to six carbon atoms, shown as projection formulas.
The carbon atoms in red are chiral centers. In all these CH2OH CH2OH D isomers, the chiral carbon most distant from the carbonyl carbon has the same configuration as the chiral carbon in C O C O CH2OH D-glyceraldehyde. The sugars named in boxes are the CH2OH H C OH HO C H most common in nature; we shall encounter these again C O in this and later chapters. C O H C OH H C OH H C OH CH2OH CH2OH CH2OH CH2OH D D Dihydroxyacetone D-Erythrulose -Ribulose -Xylulose
Six carbons
CH2OH CH2OH CH2OH CH2OH C O C O C O C O H C OH HO C H HHOC OH C H H C OH H C OH HO C H HO C H H C OH H C OH H C OH H C OH
CH2OH CH2OH CH2OH CH2OH
D-Psicose D-Fructose D-Sorbose D-Tagatose
D-Ketoses (b)
296 Chapter 9 Carbohydrates and Glycobiology 297
1 1 1 CHO CHO CHO figure 9–4 2 2 2 Epimers. D-Glucose and two of its epimers are shown as HO C H H C OH H C OH projection formulas. Each epimer differs from D-glucose in 3 3 3 HO C H HO C H HO C H the configuration at one chiral center (shaded red). 4 4 4 H C OH H C OH HO C H 5 5 5 HCOH HCOH HCOH 6 6 6 CH2OH CH2OH CH2OH D-Mannose D-Glucose D-Galactose (epimer at C-2) (epimer at C-4)
inserting “ul” into the name of a corresponding aldose; for example, D-ribu- lose is the ketopentose corresponding to the aldopentose D-ribose. The ke- tohexoses are named otherwise, such as fructose (from Latin fructus, meaning “fruit”; fruits are rich in this sugar) and sorbose (from Sorbus, the H O genus of mountain ash, which has berries rich in the related sugar alcohol GJ C sorbitol). Two sugars that differ only in the configuration around one car- A bon atom are called epimers; D-glucose and D-mannose, which differ only HOCOOH A in the stereochemistry at C-2, are epimers, as are D-glucose and D-galactose HOOCOH (which differ at C-4) (Fig. 9–4). A HOOCOH Some sugars occur naturally in their L form; examples are L-arabinose A and the L isomers of some sugar derivatives (discussed below) that are com- CH2OH mon components of glycoconjugates. L-Arabinose
The Common Monosaccharides Have Cyclic Structures For simplicity, we have thus far represented the structures of various al- doses and ketoses as straight-chain forms (Figs. 9–3, 9–4). In fact, in aque- ous solution, aldotetroses and all monosaccharides with five or more carbon atoms in the backbone occur predominantly as cyclic (ring) structures in which the carbonyl group has formed a covalent bond with the oxygen of a hydroxyl group along the chain. The formation of these ring structures is the result of a general reaction between aldehydes or ketones and alcohols to form derivatives called hemiacetals or hemiketals (Fig. 9–5), which contain an additional asymmetric carbon atom and thus can exist in two stereoisomeric forms. For example, D-glucose exists in solution as an intra- molecular hemiacetal in which the free hydroxyl group at C-5 has reacted
3 3 O OH HO R OR 1 2 1 2 1 2 R C HO R R C OR R C OR H2O H 3 H HO R H Aldehyde Alcohol Hemiacetal Acetal figure 9–5 Formation of hemiacetals and hemiketals. An aldehyde 4 4 or ketone can react with an alcohol in a 1 1 ratio to yield OH HO R OR a hemiacetal or hemiketal, respectively, creating a new 1 3 1 3 1 3 chiral center at the carbonyl carbon. Substitution of a R C O HO R R C OR R C OR H2O second alcohol molecule produces an acetal or ketal. 2 2 4 2 R R HO R R When the second alcohol is part of another sugar mole- Ketone Alcohol Hemiketal Ketal cule, the bond produced is a glycosidic bond (p. 301). 298 Part II Structure and Catalysis figure 9–6 H O Formation of the two cyclic forms of D-glucose. Reac- tion between the aldehyde group at C-1 and the hydroxyl 1 C group at C-5 forms a hemiacetal linkage, producing either 2 H C OH of two stereoisomers, the a and b anomers, which differ 3 only in the stereochemistry around the hemiacetal HO C H D-Glucose carbon. The interconversion of a and b anomers is called 4 mutarotation. H C OH 5 H C OH 6 CH2OH
6 CH2OH 5 C OH H H H 4 C C 1 OH H HO O C C 3 2 H OH
6 6 CH2OH CH2OH 5 C O 5 C O H H H OH H H 4 C 1C 4 C 1C OH H mutarotation OH H HO OH HO H C C C C 3 2 3 2 H OH H OH -D-Glucopyranose -D-Glucopyranose
with the aldehydic C-1, rendering the latter asymmetric and producing two stereoisomers, designated a and b (Fig. 9–6). These six-membered ring compounds are called pyranoses because they resemble the six-mem- bered ring compound pyran (Fig. 9–7). The systematic names for the two ring forms of D-glucose are a-D-glucopyranose and b-D-glucopyranose. Aldohexoses also exist in cyclic forms having five-membered rings, which, because they resemble the five-membered ring compound furan, are called furanoses. However, the six-membered aldopyranose ring is much more stable than the aldofuranose ring and predominates in aldohexose so- lutions. Only aldoses having five or more carbon atoms can form pyranose rings.
6 CH2OH CH2OH 5 O O HC O H H H H H OH 4 1 HC CH OH H OH H HO OH HO H 3 2 H2C CH H OH H OH figure 9–7 -D-Glucopyranose -D-Glucopyranose Pyran Pyranoses and furanoses. The pyranose forms of 6 D-glucose and the furanose forms of D-fructose are HOCH O 1CH OH HOCH O OH O shown here as Haworth perspective formulas. The edges 2 2 2 of the ring nearest the reader are represented by bold 5 H HO 2 H HO HC CH lines. Hydroxyl groups below the plane of the ring in H OH H CH2OH 4 3 C C these Haworth perspectives would appear at the right H H side of a Fischer projection (compare with Fig. 9–6). OH H OH H Pyran and furan are shown for comparison. -D-Fructofuranose -D-Fructofuranose Furan Chapter 9 Carbohydrates and Glycobiology 299
Axis Axis Axis ax ax ax H eq O eq H2COH O eq ax ax O eq HO H ax eq H eq eq eq HO H eq eq OH ax ax ax ax H OH
Two possible chair forms -D-Glucopyranose (a) (b) figure 9–8 Conformational formulas of pyranoses. (a) Two chair Isomeric forms of monosaccharides that differ only in their configura- forms of the pyranose ring. Substituents on the ring tion about the hemiacetal or hemiketal carbon atom are called anomers. carbons may be either axial (ax), projecting parallel with The hemiacetal or carbonyl carbon atom is called the anomeric carbon. the vertical axis through the ring, or equatorial (eq), pro- jecting roughly perpendicular to this axis. Generally, sub- The a and b anomers of D-glucose interconvert in aqueous solution by a stituents in the equatorial positions are less sterically hin- process called mutarotation. Thus a solution of a-D-glucose and a solution dered by neighboring substituents, and conformations of b-D-glucose eventually form identical equilibrium mixtures having identi- with their bulky substituents in equatorial positions are cal optical properties. This mixture consists of about one-third a-D-glucose, favored. Another conformation, the “boat” (not shown), is only seen in derivatives with very bulky substituents. two-thirds b-D-glucose, and very small amounts of the linear and five- (b) A chair conformation of a-D-glucopyranose. membered ring (glucofuranose) forms. Ketohexoses also occur in a and b anomeric forms. In these compounds the hydroxyl group on C-5 (or C-6) reacts with the keto group at C-2, form- ing a furanose (or pyranose) ring containing a hemiketal linkage (Fig. 9–5). D-Fructose forms predominantly the furanose ring (Fig. 9–7); the more common anomer in combined forms or derivatives is b-D-fructofuranose. Haworth perspective formulas like those in Figure 9–7 are com- monly used to show the ring forms of monosaccharides. However, the six- membered pyranose ring is not planar, as Haworth perspectives suggest, but tends to assume either of two “chair” conformations (Fig. 9–8). Recall from Chapter 3 that two conformations of a molecule are interconvertible without the breakage of covalent bonds, but two configurations can be in- terconverted only by breaking a covalent bond—for example, in the case of a and b configurations, the bond involving the ring oxygen atom. The spe- cific three-dimensional conformations of the monosaccharide units are im- portant in determining the biological properties and functions of some poly- saccharides, as we shall see.
Organisms Contain a Variety of Hexose Derivatives In addition to simple hexoses such as glucose, galactose, and mannose, there are a number of sugar derivatives in which a hydroxyl group in the parent compound is replaced with another substituent, or a carbon atom is oxidized to a carboxyl group (Fig. 9–9). In glucosamine, galactosamine, and mannosamine, the hydroxyl at C-2 of the parent compound is replaced with an amino group. The amino group is nearly always condensed with acetic acid, as in N-acetylglucosamine. This glucosamine derivative is part of many structural polymers, including those of the bacterial cell wall. Bacterial cell walls also contain a derivative of glucosamine called N-acetylmuramic acid, in which lactic acid (a three-carbon carboxylic acid) is ether-linked to the oxygen at C-3 of N-acetylglucosamine. The substitution of a hydrogen for the hydroxyl group at C-6 of L-galactose or L-mannose produces L-fucose or L-rhamnose, respectively; these deoxy sugars are found in plant polysac- charides and in the complex oligosaccharide components of glycoproteins and glycolipids described later. When the carbonyl (aldehyde) carbon of glucose is oxidized to the car- boxyl level, gluconic acid is produced; other aldoses yield other aldonic acids. Oxidation of the carbon at the other end of the carbon chain—C-6 300 Part II Structure and Catalysis
Glucose family
Amino sugars
CH2OH CH2OH CH2OH CH2OH CH2OH O O O O O H H OH H H OH H H OH HO H OH H H OH H2N OH H OH H OH H OH H OH HO H HO H HO H H H HO H
H OH H NH2 H NH H NH2 H H C O -D-Galactosamine -D-Mannosamine
CH3 -D-Glucose -D-Glucosamine N-Acetyl- -D-glucosamine Deoxy sugars
2 CH2 O PO3 CH2OH CH2OH H H O O O CH O O H OH H OH H OH 3 H H HO OH H H H CH3 CH3 R O C H OH H R H R H H OH H H OH H H HO H HO H HO H COO HO
H OH H NH2 H NH OH H OH OH C O
CH3 -D-Glucose 6-phosphate Muramic acid N-Acetylmuramic acid -L-Fucose -L-Rhamnose
Acidic sugars