<<

C ARBOHYDRATES AND G LYCOBIOLOGY REVIEW Chemical Carolyn R. Bertozzi1 and Laura L. Kiessling2

Chemical tools have proven indispensable for studies in glycobiology. provided substantial insight. For example, mice Synthetic oligosaccharides and provide materials for cor- deficient in ␣ mannosidase II expressed an relating structure with function. Synthetic mimics of the complex assem- altered portfolio of N-linked on their blies found on cell surfaces can modulate cellular interactions and are cell surface (2). The mice were under development as therapeutic agents. Small molecule inhibitors of prone to a systemic autoimmune response, sug- biosynthetic and processing enzymes can block the assembly gesting that abnormalities in N- in of specific oligosaccharide structures. Inhibitors of carbohydrate recogni- humans may be a factor in the pathogenesis of tion and biosynthesis can reveal the biological functions of the carbohy- autoimmunity. Still, cell surface presentation of drate epitope and its cognate receptors. Carbohydrate biosynthetic path- simple as well as complex glycans requires ways are often amenable to interception with synthetic unnatural sub- many to be expressed in concert, which strates. Such metabolic interference can block the expression of oligo- complicates the analysis of single “knock- saccharides or alter the structures of the presented on cells. outs” or “knockins.” Collectively, these chemical approaches are contributing great insight As outlined in this review, chemical ap- into the myriad biological functions of oligosaccharides. proaches are powerful allies to genetics and in the study of glycobiology. Oligosaccharides and glycoconjugates (gly- geneous and chemically defined glycoconju- Chemical tools have been used to probe gly- coproteins and ) have intrigued gates from biological sources. Without such cosylation at many levels. For example, cell biologists for decades as mediators of com- materials in hand, biological functions are surface carbohydrate-receptor binding events plex cellular events. With respect to structur- difficult to unravel. (Fig. 1) can be interrogated with synthetic al diversity, they have the capacity to far Genetic approaches have contributed con- oligosaccharides, glycoconjugates, and their exceed and nucleic acids. This struc- siderably to our appreciation of oligosaccharide analogs. The biosynthesis of oligosaccharide tural variance allows them to encode infor- function. The availability of entire genome se- structures can be disrupted or modulated by mation for specific molecular recognition and quences has revealed the multiplicity of en- synthetic enzyme inhibitors. Unnatural sug- to serve as determinants of folding, zymes that contribute to assem- ars that substitute for native monosaccharides stability, and pharmacokinetics. Given that bly. Their deletion in model organisms has or their downstream metabolic intermediates glycosylation is one of the most ubiquitious forms of posttranslational modification, the unexpectedly small number of genes identi- fied in the initial analyses of the human ge- nome sequence provides even more impetus for understanding the biological roles of oligosaccharides. Oligosaccharide functions are now being elucidated in molecular detail, but advances in glycobiology have been slow to arrive compared with the pace of revelations in protein or nucleic acid biochemistry. The same structural diversity that has captivated biologists has also frustrated efforts to define oligosaccharide expression patterns on pro- teins and cells and to correlate structure with function. Some technical challenges are ana- lytical in nature; determination of the oligo- saccharide sequence on a specific glyco- conjugate is still far from routine. Others originate from glycoconjugate biosynthesis, which is neither template-driven nor under direct transcriptional control. Oligosaccha- rides are assembled in step-wise fashion pri- marily in the endoplasmic reticulum and Golgi apparatus (Fig. 1), a process that af- Fig. 1. Glycoconjugate biosynthesis and cell surface recognition. Exogenously supplied monosac- fords significant product microheterogeneity charides are taken up by cells and converted to monosaccharide “building blocks” (typically (1). As a result, it is difficult to obtain homo- nucleoside sugars) inside the cell. Several steps of metabolic transformation might take place en route from an exogenous to a building block. The building blocks are imported into the secretory compartments where they are assembled by glycosyltransferases into oligosaccharides 1Departments of Chemistry and Molecular and Cell bound to a protein (or lipid) scaffold. In the case of N-linked glycoproteins, a core oligosaccharide Biology and Howard Hughes Medical Institute, Uni- is assembled in the cytosol, then transported into the ER where it is processed by glycosidases, and versity of California, Berkeley, CA 94720, USA. 2De- then further elaborated by glycosyltransferases. Once expressed in fully mature form on the cell partments of Chemistry and Biochemistry, University surface, the glycoconjugates can serve as ligands for receptors on other cells or pathogens. of Wisconsin, Madison, WI 53706, USA. Chemical tools can be used to inhibit or control any stage of this process.

www.sciencemag.org SCIENCE VOL 291 23 MARCH 2001 2357 C ARBOHYDRATES AND G LYCOBIOLOGY (Fig. 1) can intercept biosynthetic pathways, ate building blocks are produced and assem- metal activators are being replaced with mild- leading to changes in cell surface glycosyla- bled into oligosaccharides (Fig. 2B). In both er, more environmentally sound methods (9, tion. These chemical strategies allow one to approaches, the focus is on forming the crit- 10). Our understanding of how to obtain the perturb glycosylation and oligosaccharide-re- ical connection that links saccharide building desired configuration of a glycosidic bond is ceptor interactions in a cellular context. Fur- blocks: the glycosidic bond. also more sophisticated. Stereochemical con- thermore, chemically synthesized molecules Chemical synthesis and enzyme-based trol can be achieved by employing stereospe- that disrupt pathological carbohydrate-depen- routes are complementary. Enzymes can be cific activation methods, using protecting dent processes are emerging as important used to effect glycosylation with absolute groups that direct the orientation of the gly- therapeutic agents. regio- and stereo-control. If the necessary cosidic bond through intermolecular (neigh- enzyme is available, the desired bond can be boring group participation) or intramolecular Synthesis of Oligosaccharides and formed (Fig. 2A), often with high efficiency. (tethered aglycone delivery) participation, al- Glycoproteins In comparison, chemical synthesis offers ex- tering the steric environment around the ano- Access to structurally defined oligosaccha- ceptional flexibility. Natural and non-natural meric position to bias the desired outcome, or rides and glycoconjugates is a prerequisite for saccharide building blocks can be assembled exploiting the intrinsic stereoelectronic pref- unraveling their function. Chemical routes to with natural or non-natural linkages. Al- erences for reaction at the anomeric position. the production of oligosaccharides are, there- though some enzymes will act on alternative Two major advances are being applied to fore, essential. Advances on this front are substrates, chemical synthesis provides the streamline the chemical synthesis of oligo- providing materials for the assessment of gly- means to generate any oligosaccharide, oli- saccharides: one-pot reactions (11, 12) and can function, the establishment of the struc- gosaccharide analog, or glycoconjugate. polymer-supported synthesis (13, 14). In tural features important for function, the elu- The chemical synthesis of oligosaccha- one-pot processes, glycosylation reactions cidation of biosynthetic pathways, the cre- rides is formidable. It requires stereochemical occur sequentially in a single reaction vessel; ation of carbohydrate-based vaccines, the and regiochemical control in glycosidic link- the most reactive glycosyl donor is triggered production of non-natural glycosylated anti- age formation. The first viable method for first and the least is coerced to engage in the biotics, and the generation of inhibitors of controlled glycosidic bond formation, the final reaction. A key concept underlying glycoconjugate function. Koenigs-Knorr glycosylation, was reported progress on this front is that there are Two general strategies are used for in in 1901 (7). Although the search for alterna- “armed” (reactive) and “disarmed” (less re- vitro oligosaccharide production: enzymatic tive glycosylation reactions is ongoing (8), active) glycosyl donors (10, 15). A number of (including chemoenzymatic) synthesis and chemists have made remarkable strides in research groups have exploited this knowl- chemical synthesis. In enzymatic and che- carbohydrate synthesis. The problem of ob- edge to efficiently assemble oligosaccharides moenzymatic routes, saccharide intermedi- taining the proper regiochemistry of glyco- using one-pot reactions (12, 16–20). Solid- ates are elaborated with enzymes, typically sylation has been largely solved by the devel- phase synthesis of oligosaccharides similarly glycosyltransferases or glycosidases, to gen- opment of orthogonal hydroxyl group protec- offers powerful advantages for oligosaccha- erate oligosaccharides (Fig. 2A) (3–6). Che- tion strategies. Thus, groups can be installed ride synthesis in comparison to conventional moenzymatic synthesis is distinguished from to block reaction at one site and later re- methods because it circumvents multiple pu- enzymatic synthesis by its reliance on both moved to unmask specific hydroxyl groups rification steps needed in traditional solution chemical synthetic and enzymatic transfor- for glycosylation. To devise safer and less syntheses. Pioneering studies in solid-phase mations. In chemical synthesis, the appropri- toxic procedures, reactions that require heavy oligosaccharide synthesis were initiated in the early 1970s (21) after Merrifield’s suc- cessful demonstration of solid-phase peptide synthesis (22). Unfortunately, the harsh con- ditions needed for some of the early glyco- sylation reactions and the difficulty of mon- itoring the reaction on the solid support hin- dered progress for 20 years. Newly devel- oped glycosylation methods and advances in solid-phase organic chemistry rejuvenated in- terest in solid-phase oligosaccharide synthe- sis in the 1990s (13, 14). The productivity from these renewed efforts is evident from the diversity of and complexity of oligosac- charides that have been synthesized, includ- ing sequences containing up to 12 residues (23). Progress on this front also has facilitated the generation of oligosaccharide libraries containing up to 1300 compounds (23–25). Thus, complex oligosaccharides and oligo- saccharide libraries are becoming accessible to a large community of scientists. Most oligosaccharides are linked co- Fig. 2. Generic examples of glycosylation reactions. (A) Enzyme-catalyzed glycosylation. Glycosyl- valently to proteins. As oligosaccharides be- transferases catalyze the formation of glycosyl bonds from sugar donors. The specific come more accessible, the next challenge is regioisomer and stereoisomer produced depends on the enzyme employed. (B) Chemical glyco- to integrate them into glycoproteins. Several sylation. Most such involve the reaction of a glycosyl donor, equipped with a leaving group (X) at the anomeric position, with a glycosyl acceptor. To produce the desired regioisomer, laboratories have used protected glycosylated protecting groups (PЈ,P) are used to block other reactive sites. The desired stereoisomer can be amino acids as building blocks for automated generated by altering the nature of X or the protecting group. solid-phase peptide synthesis (SPPS) to gen-

2358 23 MARCH 2001 VOL 291 SCIENCE www.sciencemag.org C ARBOHYDRATES AND G LYCOBIOLOGY erate glycosylated peptide fragments reminis- saccharides. But, their non-native linkages vancing. These developments offer nonspe- cent of natural glycoproteins (26). Modern may affect their overall structures and perturb cialists access to oligosaccharides that can be FMOC-based peptide synthesis methods are their biological activities. used to address problems in biology. Al- sufficiently mild that the oligosaccharide re- The total chemical synthesis of native gly- though a standard method for the synthesis of mains intact throughout the synthesis. As an coproteins has recently been facilitated by oligosaccharides has yet to emerge, the diver- illustration, Danishefsky and co-workers breakthroughs in protein chemistry. In partic- sity of glycosidic linkages may demand mul- chemically synthesized glycopeptide 2, de- ular, the “native chemical ligation” method tiple synthetic approaches. Applications of rived from the mucin-like leukocyte antigen (29, 30) has enabled the convergent conden- these methods will afford diverse combinato- leukosialin (CD43), using trisaccharide–ami- sation of unprotected glycopeptide frag- rial libraries of oligosaccharides and more no acid 1 as a building block (Fig. 3A) (27). ments, each generated by automated SPPS, to complex glycoconjugates. This increased The oligosaccharide structures and their ar- form full-length, fully functional glycopro- repertoire of compounds will facilitate the rangement on the polypeptide backbone are teins (31) (Fig. 3B). The related “expressed identification of specific oligosaccharide and characteristic of tumor-associated glycopro- protein ligation” (32) permits the chemical ligands for proteins and provide teins and, hence, such synthetic assemblies union of recombinant protein fragments with new leads for inhibitor design. may serve as tumor vaccine components. synthetic glycopeptides, further relieving the The extension of these methods to full- burden of large protein synthesis. O-linked Inhibitors of Biosynthesis and length glycoproteins has proved more trou- (33) and N-linked (34) glycoproteins bearing Processing blesome, largely due to limitations inherent homogeneous and chemically defined gly- The discovery of diverse biological roles for to linear, step-wise SPPS. Peptides larger cans have been prepared in this fashion, sug- oligosaccharides and glycoconjugates is fueling than 50 to 60 residues are difficult to obtain gesting that any glycoprotein may now be interest in the development of chemical tools using conventional methods due to poor obtained in the quantities required for struc- that block their formation and/or function. Two yields and accumulating byproducts. Much ture determination and functional analysis. general types of inhibitors are being sought: larger polypeptides can be produced using These synthetic advances are very important those that block glycoconjugate biosynthesis recombinant DNA technology; thus, several with respect to development of glycoprotein and those that interfere with glycoconjugate groups have exploited recombinant proteins therapeutics. Glycosylated biotechnology prod- recognition. Effective inhibitors of various bio- as starting materials for the synthesis (i.e., ucts such as monoclonal antibodies, erythro- synthetic steps in glycoconjugate assembly “semi-synthesis”) of homogeneous glycopro- poietin, and tissue plasminogen activator may have the potential to transform our understand- teins. Whereas the glycosidic linkage is too benefit from methods for their semi-synthesis. ing of carbohydrate function. By blocking the difficult a bond to be made between an oli- What future breakthroughs can we expect production of specific glycoconjugates, gosaccharide and a large protein, analogs of in the synthesis of oligosaccharides and gly- their biological roles can be ascertained (36). glycoproteins bearing unnatural linkages are coconjugates? Success in the automated sol- Similarly, antagonists that prevent glycoconju- readily prepared (28). These “neoglycopro- id-phase synthesis of complex oligosaccha- gate recognition can illuminate the function of teins” lend themselves to a facile convergent rides appears imminent (35) and efforts to the natural interactions (37). Progress on all assembly from proteins and synthetic oligo- automate one-pot assembly methods are ad- of these fronts is accelerating.

Fig. 3. Chemical synthesis of glycopeptides and glycoproteins. (A) Glycopeptide synthesis using glycosylated building blocks. Synthesis of a mucin-like fragment of leukosialic (CD43) (2) by incorporation of glycosy- lated FMOC amino acids (1) into solid-phase peptide synthesis (SPPS). (B) Glycoprotein synthe- sis by convergent coupling of glycopeptide fragments. Assem- bly from synthetic glycopeptide fragments using the technique of native chemical ligation. One fragment is functionalized as a COOH-terminal thioester (␣thioester), and the other bears an NH2-terminal cysteine resi- due. A transthioesterification re- action between the two compo- nents produces an intermedi- ate thioester that rearranges to the peptide bond shown in the product.

www.sciencemag.org SCIENCE VOL 291 23 MARCH 2001 2359 C ARBOHYDRATES AND G LYCOBIOLOGY Efforts to generate antagonists of the though active against the enzyme, polar and tions: What functional groups are needed biosynthetic and processing enzymes have charged compounds of this type are unlikely for recognition by the selectins? Can the been successful (38–40). To produce oligo- to be effective in cells or organisms due to sLex tetrasaccharide be modified such that saccharides, eukaryotic organisms require their membrane impermeability. Thus, strat- it binds specific selectin family members? enzymes for both synthesis and remodel- egies for the design and discovery of glyco- Can compounds that bind more tightly be ing. The former is mediated by glycosyl- syltransferase inhibitors that can access their discovered? Access to sLex and a wide transferases, catalysts that mediate the for- targets inside cells are needed. The oligosac- array of analogs, conjugates, and mimics mation of glycosidic bonds, and the latter charyl transferase inhibitor 4 (Fig. 4) is capa- has been instrumental in addressing these by glycosidases, enzymes that hydrolyze ble of transport across the endoplasmic retic- questions. glycosidic bonds. The glycosidases have ulum membrane where it can act on its target Data from many synthetic sLex deriva- proven to be particularly vulnerable targets and abrogate N-linked glycosylation (45), tives have revealed the critical contacts for for inhibition. Natural product inhibitors and other glycosyltransferase inhibitors dem- selectin binding, information that has guided have been identified, and numerous antag- onstrate efficacy in whole cells (38). Library the generation of more potent inhibitors. The onists inspired by these have been synthe- screening approaches that are now routine in functional groups important for binding to sized (41, 42). In a notable example of pharmaceutical industry might be applied to each of the selectin family members (E-, P- computer-aided design, potent transition the glycosyltransferases (46). The availabili- and L-selectin) have been identified. For in- state inhibitors of influenza virus N-acetyl- ty of combinatorial methods for synthesizing stance, the hypothesis that sulfation of sLex neuraminidase were devised (40, 43). libraries of drug-like (and, hence, bioavail- would increase its affinity for L-selectin (52) These agents, which interfere with viral able) compounds has revolutionized the was confirmed with derivatives produced by infectivity, are currently on the market. search for pharmacological tools in academic chemical synthesis (53–55). Similarly, a che- Several inhibitors of another major class laboratories. Indeed, library screens have pro- moenzymatic route was used to generate of carbohydrate-modifying enzymes, the gly- duced cell-permeable, small molecule inhib- sLex-substituted glycopeptides to elucidate cosyltransferases, have been reported. Com- itors of a class of carbohydrate modifying the critical binding epitope of PSGL-1 (56), pound 3 (Fig. 4), for example, is a potent enzymes called sulfotransferases (5, Fig. 4), the physiological ligand for P-selectin. With inhibitor of an ␣-2,6-sialyltransferase that and these may be effective tools for delineat- knowledge of the sLex functional groups im- uses cytidine monophosphate (CMP)–sialic ing biological function (47). The prospects portant for complexation, mimics of the tet- acid as a glycosyl donor substrate (44). Al- for advances in glycosyltransferase inhibitor rasaccharide have been synthesized that are identification are, therefore, excellent. In ad- potent selectin antagonists (57). For example, dition, the rapid increase in structural infor- tetrasaccharide mimic 6 (Fig. 5) is Ͼ50-fold mation available for this class of proteins will more active at blocking E-selectin than sLex aid in inhibitor design and discovery (48, 49). (58). In a P-selectin inhibition assay, mac- rocyclic sLex analog 7 exhibits a dramatic Inhibitors of Glycan Recognition enhancement (about 103) relative to sLex The generation of compounds that block gly- (59). These data underscore the tremendous can recognition remains a major challenge. progress made in generating efficacious Many oligosaccharide binding sites are rela- glycomimetics. tively shallow and solvent exposed, and bind- ing interactions can take place over large Synthesis and Biological Activity of surface areas (50). Those confronted with the Glycoassemblies problem of inhibiting protein–saccharide The function of many is con- complexation encounter many of the same tingent on their multidentate presentation. challenges as those seeking to block protein– The binding of proteins to monovalent car- protein interactions. Moreover, in solution bohydrate determinants is often weak, yet many oligosaccharides bind their protein tar- the strength and specificity required for gets with relatively low affinities (e.g., with a recognition in physiological settings is Ϸ Ϫ3 Ϫ4 dissociation constant Kd 10 to 10 M); high. The simultaneous formation of mul- thus, initial lead compounds tend to require tiple protein–carbohydrate interactions is much optimization. Lastly, structural data is one binding mode that can be exploited to often lacking, so information on the impor- achieve the necessary avidity. In physiolog- tant binding contacts can be obtained only ical settings, saccharide epitopes and their through the synthesis and evaluation of ana- protein receptors are arranged such that logs. Given these barriers, the progress that multiple binding events can occur simulta- has been made in identifying inhibitors of one neously. Naturally occurring carbohydrate family of carbohydrate-binding proteins, the displays are widespread: examples include selectins, is impressive. highly glycosylated proteins (e.g., mucins), The selectins are a family of three car- the carbohydrate surfaces of bacteria and Fig. 4. Three inhibitors. A potent sialyltrans- bohydrate-binding proteins that have been other pathogens, and the outer membranes ϭ the object of numerous investigations be- of mammalian cells. Carbohydrate-binding pro- ferase inhibitor (3, inhibition constant Ki 40 nm); its polar, charged nature may preclude cause of their role in leukocyte recruitment teins also tend to be oligomeric or present in activity in cellular systems. An inhibitor of oli- to sites of inflammation (51). The discov- multiple copies at the surface of a cell. The gosaccharyl transferase (4) that permeates the ery of the selectins led to the initial iden- interaction of multivalent presentations can re- endoplasmic reticulum membrane. An inhibitor tification of a ligand, the tetrasaccharide sult in the formation of numerous simultaneous of a bacterial N-acetylglucosamine-6-sulfotrans- x ferase (NodH) derived from a combinatorial li- sialyl Lewis x (sLe ). The findings raised complexation events that proceed to afford a brary screen (5); this neutral small molecule questions illustrative of those generally rel- high observed affinity and a high functional can permeate cell membranes. evant for protein–carbohydrate interac- affinity (60).

2360 23 MARCH 2001 VOL 291 SCIENCE www.sciencemag.org C ARBOHYDRATES AND G LYCOBIOLOGY What are the physiological advantages receptors (Fig. 6C) (70, 71). Interestingly, five toxin B subunits. Kitov et al. pursued an conferred by multivalent binding? Synthetic ligands with this property may serve not only alternative strategy for inhibiting SLT-I (75). arrays have provided key answers to this as antagonists but also as agonists. Because Structural studies of SLT-I complexed with question. First, these interactions have been cell surface receptor clustering can facilitate the Gb3 revealed that each B sub- shown to be highly specific and versatile signaling transduction, carbohydrate displays unit possessed subsites adjacent to the prima- (61–63). For example, binding can be modu- that cluster receptors or lectins that cluster ry carbohydrate-binding site. They synthe- lated by changing the individual saccharide glycoproteins can elicit cellular responses. sized dimers of the appropriate trisaccharide residues or by altering their spacing. Second, Thus, multivalent ligands can exhibit a wide binding element that could, in principle, oc- the kinetics exhibited by such binding events range of different activities, depending on cupy both the primary and secondary site. are likely critical for biological systems. For their binding modes. Although their divalent ligands exhibited example, relative to monovalent binding, Two recent studies dramatically illustrate only modest increases in potency (40-fold), multivalent interactions exhibit greater re- the power of multivalent presentation in in- attachment of these units to a pentavalent versibility in the presence of competing li- hibitor design and the interplay of different scaffold afforded one of the most potent (107 gands (64). Thus, low affinity, multivalent binding mechanisms. Highly potent pentava- times more active than the trisaccharide interactions are less likely to entrap cells in lent inhibitors of bacterial toxin binding to alone) multivalent ligands (8, Fig. 7) yet de- unproductive binding events. In addition, host cells were synthesized. The toxins tar- scribed. Structural analysis of compound 8 binding events mediated by multiple weak geted, heat-labile enterotoxin (LT-1) and bound to SLT-1 did not reveal the 1:1 com- interactions are expected to be more resistant shiga-like toxin I (SLT-I), are members of the plex envisioned but rather a complex contain- to shear stress, such as that encountered when class of AB5 toxins. The AB5 class, which ing two equivalents of the pentameric pro- cells interact in the bloodstream (65). can be subdivided into the cholera, pertussis, tein. This ligand appears to act both through To understand the roles of multivalency in and shiga toxin families, possess a pentago- dimerizing the target receptor as well as carbohydrate recognition, platforms that dis- nal arrangement of five B subunits and a through the chelate effect. The excellent po- play multiple copies of recognition elements single A subunit (72). These toxins, which tencies of the toxin inhibitors highlight the have been generated (66–68). A number of are responsible for millions of human fatali- advantages of multivalent presentation, and diverse scaffolds have been used for multiva- ties each year (73), invade cells by multiva- the wide range of unique binding modes mul- lent presentation; these include low–molecu- lent binding of the B subunit to the carbohy- tivalent ligands can employ. lar weight displays (e.g., dimers and trimers), drate residues of gangliosides. Thus, a logical Multivalent presentation is also beneficial dendrimers, polymers, and liposomes. The strategy is to generate multivalent ligands for eliciting rather than inhibiting biological structure of the display determines what fea- that can occupy all five binding sites simul- responses. This role has long been appreciat- tures of selected of naturally occurring mul- taneously. Fan et al. applied this strategy to ed in the context of vaccine development. As tivalent ligands it mimics. For example, a produce candidate inhibitors of LT-1, which early as 1929 (76), it was recognized that low–molecular weight ligand or dendrimer were generated by appending galactose resi- displays of oligosaccharides conjugated to can resemble a branched oligosaccharide dues to a pentacyclen core (Fig. 7) (74). A proteins elicit the production of specific an- chain, such as those displayed by glycopro- potent antagonist of LT-1 was found, a com- tibodies to carbohydrate. In 1975, Lemieux teins. Alternatively, larger displays such as pound 105 times more active than the corre- and co-workers showed that defined synthetic polymers or liposomes can more effectively sponding monovalent galactose derivative. glycoconjugates could be synthesized and mimic a glycoprotein or a glycolipid array. This excellent potency appears to arise from used to elicit an immune response (77). There are several distinct mechanisms that its ability to interact simultaneously with the These investigations provided the foundation contribute to the high activities often ob- served for multivalent ligands. An under- standing of these is critical for optimizing ligand performance and for understanding how natural systems function. Relevant mechanisms include the chelate effect, occu- pation of adjacent subsites, and ligand-in- duced protein clustering. When the chelate effect operates (Fig. 6A), multiple interac- tions occur after formation of the first con- tact; these are facilitated because of the high effective concentration of the binding groups (69). This mode of binding can give rise to large enhancements in activity if the orienta- tion and display of binding groups are favor- ably disposed. Because multipoint binding typically involves ligand organization, it ex- acts a conformational entropy penalty that may offset the advantages of chelation. Al- ternatively, multivalent displays may be ef- fective ligands because they occupy subsites Fig. 5 (left). Examples of selectin antagonists (6 and 7). as well as the primary binding site on the Both were designed to mimic the tetrasaccharide x target protein. Many lectins possess second- sLe . Fig. 6 (right). Binding modes engaged in by ary sites adjacent the key binding cleft (Fig. multivalent ligands. (A) Multivalent ligands can bind oligomeric receptors by occupying multiple binding 6B) (50). Lastly, the activities of many small sites (chelate effect), (B) occupying primary and sec- and large multivalent carbohydrate deriva- ondary binding sites on a receptor, and (C) causing tives are due to their abilities to cluster their receptor clustering in the membrane.

www.sciencemag.org SCIENCE VOL 291 23 MARCH 2001 2361 C ARBOHYDRATES AND G LYCOBIOLOGY

Fig. 7 (left). Multivalent ligands can act as antagonists and agonists of nous scaffolds, reducing the expression of specific carbohydrate struc- biological processes. Compound 8 is a potent antagonist of shiga-like tures. (B) Unnatural substrates can be used in biosynthetic pathways and toxin I. Multivalent ligand 9 elicits the downregulation of L-selectin, incorporated into cell surface glycoconjugates. (C) If the unnatural thereby inhibiting L-selectin function. Fig. 8 (right). Modulating cell substrates possess unique functional groups, their metabolic products on surface glycosylation by metabolic interference. (A) Unnatural substrates the cell surface can be chemically elaborated with exogenous reagents fed to cells can divert oligosaccharide biosynthesis away from endoge- (D).

for current approaches to generating synthetic spacing, valency, backbone composition) af- teins on the endothelium (82). A soluble carbohydrate-based vaccines. One such effort fects its immunogenicity is needed. form of L-selectin can be released into is directed at exploring the features required Multivalent ligands can also act as ef- circulation by a membrane-associated pro- to generate the first vaccine against Shigella fectors of one response and inhibitors of tease (83). Given the role of cell surface dysenteriae (78). Synthetic anti-cancer vac- another. In one study, a multivalent saccha- L-selectin, ligands that induce its down- cines are also being pursued. Danishefsky ride derivative was designed to effect a regulation could serve as anti-inflammatory and co-workers have used state-of-the-art process that disables the receptor, L-selec- agents. To test this hypothesis, human neu- synthetic methods to generate complex gly- tin (80, 81). L-Selectin is a transmembrane trophils were treated with monovalent and coconjugates (79) with promising anti-cancer protein found on neutrophils and lympho- multivalent oligosaccharides that mimic activities. To optimize the potency of carbo- cytes that facilitates the transient attach- features of physiologic L-selectin ligands. hydrate-based vaccines, more data addressing ment and rolling of leukocytes through its The levels of L-selectin after addition of how the structure of the conjugate (epitope interaction with highly O-glycosylated pro- monovalent oligosaccharides were un- changed, but upon exposure to multivalent Fig. 9. Biosynthetic engineering display 9 (Fig. 7) L-selectin was lost from of unnatural sialic acids on cell the surface (80, 81). Multivalent ligand 9 surface glycoconjugates. Man- also inhibits L-selectin-mediated cell roll- NAc (10) is converted to sialic ing, suggesting that its ability to downregu- acid (11) by cellular metabo- late L-selectin makes it a highly effective lism. Unnatural N-acyl groups inhibitor (84). These results suggest a new are tolerated by the biosyn- thetic enzymes and trans- approach to manipulating the cell surface port proteins, enabling the display interactions. The design of multivalent li- of myriad unnatural sialosides on gands that selectively disarm receptors by cells (12 through 17). Sialic acids activating endogenous processes is an un- bearing ketones (15) or azides charted territory with high potential. (16) can be further elaborated by reaction with compounds Modulating Cell Surface Glycosylation bearing complementary func- tional groups. by Metabolic Interference The ability to alter glycoconjugate struc- tures expressed on cell surfaces is impor- tant for understanding their biological func- tions. An alternative to enzyme inhibitors is the use of unnatural metabolic substrates that can intercept carbohydrate biosynthetic pathways (85). Metabolic interference can

2362 23 MARCH 2001 VOL 291 SCIENCE www.sciencemag.org C ARBOHYDRATES AND G LYCOBIOLOGY produce several outcomes on the cell sur- technique has been used to construct non- 26. H. Herzner, T, Reipen, M. Schultz, H. Kunz, Chem. Rev. face (Fig. 8). An unnatural substrate might natural glycans on cell surfaces for lectin 100, 4495 (2000). 27. D. Sames, X.-T. Chen, S. J. Danishefsky, Nature 389, divert oligosaccharide elaboration away binding studies (95) and to target magnetic 587 (1997). from endogenous scaffolds destined for the resonance imaging contrast reagents to cells 28. L. Marcaurelle, C. R. Bertozzi, Chem. Eur. J. 5, 1384 cell surface (Fig. 8A). The result is a re- overexpressing sialic acid residues, a hall- (1999). duction in the amount of mature structures mark of many tumor types (96). Unnatural 29. P. E. Dawson, T. W. Muir, I. Clark-Lewis, S. B. H. Kent, expressed by the cell. Alternatively, unnat- sialic acids bearing ketone groups have also Science 266, 776 (1994). 30. G. J. Cotton,T. W. Muir, Chem. Biol. 6, R247 (1999). ural substrates might be designed to engage been used to facilitate viral-mediated gene 31. Y. Shin et al., J. Am. Chem. Soc. 121, 11684 (1999). a biosynthetic pathway, resulting in their delivery (97). Other sugars such as N-acetyl- 32. T. W. Muir, D. Sondhi, P. A. Cole, Proc. Natl. Acad. Sci. incorporation into cell surface glycoconju- galactosamine (GalNAc) can also be replaced U.S.A. 95, 6705 (1998). gates (Fig. 8B). The result is the presenta- by unnatural variants using metabolic pro- 33. D. Macmillan, C. R. Bertozzi, Tetrahedron 56, 9515 (2000). tion of an unnatural epitope that might cesses (98). If the approach is generalizable 34. T. J. Tolbert, C.-H. Wong, J. Am. Chem. Soc. 122, display different receptor binding proper- across carbohydrate biosynthetic pathways, 5421 (2000). ties than its native counterpart. Subtle mod- multiple modifications might be made to cell 35. O. J. Plante, E. R. Palmacci, P. H. Seeberger, Science 291, ifications to the fine structure of a surface glycans for structure/function studies 1523 (2001). 36. A. D. Elbein, Annu. Rev. Biochem. 56, 497 (1987). monosaccharide can be engendered in this in a cellular context. 37. P. Sears, C. H. Wong, Angew. Chem. Int. Ed. 38, 2300 fashion. Incorporation of unnatural sugars In summary, advances in oligosaccharide (1999). with reactive functional groups into cell and glycoprotein synthesis have provided the 38. F. M. Platt et al., Science 276, 428 (1997). 39. Y. J. Kim, M. Ichikawa, Y. Ichikawa, J. Am. Chem. Soc. surface glycoconjugates (Fig. 8C) provides critical material for biological investigations. 121, 5829 (1999). a scenario in which the glycan structure can Genome sequencing efforts have unveiled the 40. M. von Itzstein et al., Nature 363, 418 (1993). be further altered by chemical reactions at various enzymes that participate in glycocon- 41. A. Berecibar, C. Grandjean, A. Siriwardena, Chem. Rev. 99, 779 (1999). the cell surface (Fig. 8D). jugate biosynthesis and processing, and a de- 42. B. Ganem, Acc. Chem. Res. 29, 340 (1996). An example of cell surface glycan sup- tailed understanding of these pathways will 43. C. U. Kim et al., J. Am. Chem. Soc. 119, 681 (1997). pression using metabolic decoys is provid- allow the judicious choice of targets for in- 44. B. Mu¨ller, C. Schaub, R. R. Schmidt, Angew. Chem. Int. Ed. 37, 2893 (1998). ed by the “oligosaccharide primers” de- hibitor design and metabolic interference. 45. P. D. Eason, B. Imperiali, Biochemistry 38, 5430 (1999). scribed by Esko and co-workers. When fed The convergence of chemical tools with fron- 46. G. Jung, Ed., Combinatorial Chemistry: Synthesis, to cells, hydrophobic disaccharides inter- tier genetic and biochemical technologies has Analysis, Screening (Wiley-VCH, Weinheim, Germa- cepted the biosynthesis of sLex, and the created an exciting platform from which to ny, 1999). 47. J. I. Armstrong et al., Angew. Chem. Int. Ed., 39, 1303 cells displayed a concomitant reduction in tackle problems in glycobiology. (2000). cell surface E-selectin binding activity 48. U. M. Unligil, J. M. Rini, Curr. Opin. Struct. Biol. 10, (86). The roles of sLex in inflammation and References and Notes 510 (2000). 49. C. Breton, A. Imberty, Curr. Opin. Struct. Biol. 9, 563 tumor metastasis may be addressed using 1. N. Jenkins, R. B. Parekh, D. C. James, Nature Biotech- nol. 14, 975 (1996). (1999). 50. W. I. Weis, K. Drickamer, Annu. Rev. Biochem. 65, 441 this reversible process. Likewise, hydro- 2. D. Chui et al., Proc. Natl. Acad. Sci. U.S.A. 98, 1142 (1996). phobic glycosides of xylose can distract the (2001). 51. L. A. Lasky, Annu. Rev. Biochem. 64, 113 (1995). glycosaminoglycan biosynthetic machin- 3. S. L. Flitsch, Curr. Opin. Chem. Biol. 4, 619 (2000). 52. S. Hemmerich, H. Leffler, S. D. Rosen, J. Biol. Chem. ery, thereby temporarily reducing the den- 4. M. M. Palcic, Curr. Opin. Biotechnol. 10, 616 (1999) 270, 12035 (1995). 5. S. David, C. Auge, C. Gautheron, Adv. Carbohydr. sity of these chains on cell surfaces (87). 53. C. Galustrian et al., Biochem. Biophys. Res. Commun. Chem. Biochem. 49, 175 (1991). 240, 748 (1997). The pathway for sialic acid biosynthesis is 6. E. J. Toone, E. S. Simon, M. D. Bednarski, G. M. 54. C. R. Bertozzi, S. Fukuda, S. D. Rosen, Biochemistry 34, highly amenable to biosynthetic modulation Whitesides, Tetrahedron 45, 5365 (1989). 14271 (1995). using unnatural metabolic precursors. De- 7. W. Koenigs, E. Knorr, Berichte 34, 957 (1901). 55. W. J. Sanders, T. R. Katsumoto, C. R. Bertozzi, S. D. 8. S. H. Khan, R. A. O’Neill, Eds., Modern Methods in Rosen, L. L. Kiessling, Biochemistry 35, 14862 (1996). rived from metabolism of N-acetylman- Carbohydrate Synthesis; Frontiers in Natural Product 56. A. Leppanen, S. P. White, J. Helin, R. P. McEver, R. D. nosamine (ManNAc) (10, Fig. 9), sialic acids Research (Harwood Academic, Australia, 1996), vol. Cummings, J. Biol. Chem. 275, 39569 (2000). 1; Preparative (Dekker, New 57. E. E. Simanek, G. J. McGarvey, J. A. Jablonowski, C.-H. are known to participate in myriad cell sur- York, 1997). face recognition events, including selectin- Wong, Chem. Rev. 98, 833 (1998). 9. B. G. Davis, J. Chem. Soc. Perkin Trans. I, 2137 (2000). 58. R. Banteli et al., Helv. Chim. Acta 83, 2893 (2000). mediated leukocyte adhesion and influenza 10. H. Paulsen, Angew. Chem. Int. Ed. 21, 155 (1982). 59. C.-Y. Tsai, X. Huang, C.-H. Wong, Tetrahedron Lett. virus binding. Synthetic analogs of ManNAc 11. K. M. Koeller, C. H. Wong, Chem. Rev. 100, 4465 41, 9499 (2000). (2000). bearing unnatural N-acyl groups are sub- 60. M. Mammen, S.-K. Choi, G. M. Whitesides, Angew. 12. N. L. Douglas, S. V. Ley, U. Lu¨cking, S. L. Warriner, Chem. Int. Ed. 37, 2754 (1998). strates for the metabolic pathway, and when J. Chem. Soc., Perkin Trans. I, 51 (1998). 61. K. H. Mortell, R. V. Weatherman, L. L. Kiessling, J. Am. fed to cells they produce unnatural sialic 13. P. H. Seeberger, W.-C. Haase, Chem. Rev. 100, 4549 Chem. Soc. 118, 2297 (1996). acids on cell surface glycoconjugates (Fig. 9) (2000). 62. R. Liang et al., Proc. Natl. Acad. Sci. U.S.A. 94, 10554 14. D. Kahne, Curr. Opin. Chem. Biol. 1, 130 (1997). (1997). (88). Cellular expression of unnatural sialic 15. B. Fraser-Reid, D. R. Mootoo, P. Konradsson, U. Udo- 63. R. V. Weatherman, K. H. Mortell, M. Chervenak, L. L. acids can either inhibit or enhance viral in- dong, J. Am. Chem. Soc. 110, 5583 (1988). Kiessling, E. J. Toone, Biochemistry 35, 3619 (1996). fection, depending on the physical interaction 16. Z. Zhang et al., J. Am. Chem. Soc. 121, 734 (1999). 64. J. H. Rao, J. Lahiri, L. Isaacs, R. M. Weis, G. M. White- of the unnatural moiety with the viral recep- 17. J. Gildersleeve, A. Smith, K. Sakurai, S. Raghavan, D. sides, Science 280, 708 (1998). Kahne, J. Am. Chem. Soc. 121, 6176 (1999). 65. R. Alon, D. A. Hammer, T. A. Springer, Nature 374, 539 tor (89, 90). Unnatural sialosides can also 18. R. Geurtsen, D. S. Holmes, G. J. Boons, J. Org. Chem. (1995). disrupt contact inhibition of cell growth (91), 62, 8145 (1997). 66. Y. C. Lee, R. T. Lee, Acc. Chem. Res. 28, 321 (1995). and block the binding of myelin-associated 19. H. Yamada, T. Harada, T. Takahashi, J. Am. Chem. 67. R. Roy, Curr. Opin. Struct. Biol. 6, 692 (1996). Soc. 116, 7919 (1994). 68. L. L. Kiessling, N. L. Pohl, Chem. Biol. 3, 71 (1996). glycoprotein to neurons (92). The effects wit- 20. S. Raghavan, D. Kahne, J. Am. Chem. Soc. 115, 1580 69. M. I. Page, W. P. Jencks, Proc. Natl. Acad. Sci. U.S.A. nessed implicate sialic acid residues as key (1993). 68, 1678 (1971). determinants of the respective processes. If 21. J. M. Fre´chet, in Polymer-Supported Reactions in Or- 70. S. M. Dimick et al., J. Am. Chem. Soc. 121, 10286 ganic Synthesis P. Hodge, D. C. Sherrington, Eds. (1999). the unnatural N-acyl substituent comprises a (Wiley, Chichester, UK, 1980), pp. 407–434. 71. S. D. Burke, Q. Zhao, M. C. Schuster, L. L. Kiessling, unique chemical group such as a ketone (93) 22. R. B. Merrifield, J. Am. Chem. Soc. 85, 2149 (1963). J. Am. Chem. Soc. 122, 4518 (2000). or azide (94), chemoselective reactions can 23. K. C. Nicolaou, N. Winssinger, J. Pastor, F. DeRoose, 72. E. K. Fan, E. A. Merritt, C. L. M. J. Verlinde, W. G. J. Hol, J. Am. Chem. Soc. 119, 449 (1997). Curr. Opin. Struct. Biol. 10, 680 (2000). be performed on the cells that further aug- 24. R. Liang et al., Science 274, 1520 (1996). 73. J. Holmgren, A. M. Svennerholm, Gastroenterol. Clin. ment their cell surface composition. This 25. M. J. Sofia et al., J. Med. Chem. 42, 3193 (1999). North Am. 21, 283 (1992).

www.sciencemag.org SCIENCE VOL 291 23 MARCH 2001 2363 C ARBOHYDRATES AND G LYCOBIOLOGY

74. E. K. Fan et al., J. Am. Chem. Soc. 122, 2663 (2000). 84. W. J. Sanders, E. J. Gordon, P. J. Beck, R. Alon, L. L. 92. B. E. Collins, T. J. Fralich, S. Itonori, Y. Ichikawa, R. L. 75. P. I. Kitov et al., Nature 403, 669 (2000). Kiessling, J. Biol. Chem. 274, 5271 (1999). Schnaar, Glycobiology 10, 11 (2000). 76. W. F. Goebel, O. T. Avery, J. Exp. Med. 50, 521 (1929). 85. S. Goon, C. R. Bertozzi, “Metabolic substrate engi- 93. L. K. Mahal, K. J. Yarema, C. R. Bertozzi, Science 276, 77. R. U. Lemieux, D. R. Bundle, D. A. Baker, J. Am. Chem. neering as a tool for glycobiology,” in Glycochemis- 1125 (1997). Soc. 97, 4076 (1975). try: Principles, Synthesis and Applications, P. G. Wang, 94. E. Saxon, C. R. Bertozzi, Science 287, 2007 (2000). 78. V. Pozsgay, Angew. Chem. Int. Ed. 37, 138 (1998). C. R. Bertozzi, Eds. (Dekker, New York), in press. 95. K. J. Yarema, L. K. Mahal, R. Bruehl, E. C. Rodriguez, 79. S. J. Danishefsky, J. R. Allen, Angew. Chem. Int. Ed. 39, 86. A. K. Sarkar, T. A. Fritz, W. H. Taylor, J. D. Esko, Proc. C. R. Bertozzi, J. Biol. Chem. 273, 31168 (1998). 837 (2000). Natl. Acad. Sci. U.S.A. 92, 3323 (1995). 96. G. A. Lemieux, K. J. Yarema, C. L. Jacobs, C. R. Bertozzi, 80. E. J. Gordon, W. J. Sanders, L. L. Kiessling, Nature 392, 87. F. N. Lugemwa, J. D. Esko, J. Biol. Chem. 266, 6674 (1991). J. Am. Chem. Soc. 121, 4278 (1999). 30 (1998). 88. H. Kayser et al., J. Biol. Chem. 267, 16934 (1992). 97. J. H. Lee et al., J. Biol. Chem. 274, 21878 (1999). 81. E. J. Gordon, L. E. Strong, L. L. Kiessling, Bioorg. Med. 89. O. T. Keppler et al., J. Biol. Chem. 270, 1308 (1995). 98. H. C. Hang, C. R. Bertozzi, J. Am. Chem. Soc. 123, Chem. Lett. 6, 1293 (1998). 90. O. T. Keppler et al., Biochem. Biophys. Res. Commun. 1242 (2001). 82. S. D. Rosen, Am. J. Pathol. 144, 1013 (1999). 253, 437 (1998). 99. The authors thank J. Gestwicki, L. Strong, E. Saxon, and 83. T. K. Kishimoto, M. A. Jutila, E. L. Berg, E. C. Butcher, 91. J. R. Wieser, A. Heisner, P. Stehling, F. Oesch, W. K. Yarema for their comments on the manuscript, and Science 245, 1238 (1989). Reutter, FEBS Lett. 395, 170 (1996). the NIH and NSF for research support.

REVIEW Intracellular Functions of N-Linked Glycans Ari Helenius1* and Markus Aebi2

N-linked oligosaccharides arise when blocks of 14 sugars are added cotrans- the glycans have a common role in promoting lationally to newly synthesized polypeptides in the endoplasmic reticulum protein folding, quality control, and certain (ER). These glycans are then subjected to extensive modification as the sorting events. Later, Golgi enzymes prepare glycoproteins mature and move through the ER via the Golgi complex to their them for the spectrum of novel functions that final destinations inside and outside the cell. In the ER and in the early the sugars display in the mature proteins (3). secretory pathway, where the repertoire of oligosaccharide structures is still Here, we mainly address events in the early rather small, the glycans play a pivotal role in protein folding, oligomerization, secretory pathway. We focus on observations quality control, sorting, and transport. They are used as universal “tags” that that are starting to unmask the logic of the allow specific lectins and modifying enzymes to establish order among the various early trimming and modification diversity of maturing glycoproteins. In the Golgi complex, the glycans acquire events. We also discuss glycan structure and more complex structures and a new set of functions. The division of synthesis function in light of fundamental differences and processing between the ER and the Golgi complex represents an evolu- between the two biosynthetic organelles, the tionary adaptation that allows efficient exploitation of the potential of ER and the Golgi complex. oligosaccharides. N-Linked Glycan Synthesis and In mature glycoproteins, N-linked glycan moi- the ER to diversification in the Golgi com- Modification eties are structurally diverse. The sugar compo- plex coincides with a marked change in gly- During the synthesis of N-linked glycans in sition and the number and size of branches in can function. In the early secretory pathway, mammalian cells (Fig. 2), a 14-saccharide the sugar tree varies among glycans bound to a protein, among glycoproteins, and among cell types, tissues, and species (1, 2). However, when initially added in the ER to growing nascent polypeptides, the glycans do not dis- play such heterogeneity. The “core glycans” are homogeneous and relatively simple (Fig. 1). The trimming and processing that the gly- cans undergo when the glycoprotein is still in the ER introduce only limited additional di- versity, because the alterations are shared by all glycoproteins. Thus, the spectrum of gly- coforms remains rather uniform until the gly- coproteins reach the medial stacks of the Golgi apparatus, where structural diversifica- tion is introduced through a series of nonuni- form modifications. Particularly in vertebrate and plant cells, it is the terminal glycosyla- tion in the Golgi complex that gives rise to the tremendous diversity seen in glycoconju- gates that reach the cell surface. The switch from structural uniformity in

Fig. 1. The N-linked core oligosaccharide. N-linked glycans are added to proteins in the ER as “core 1Institute of Biochemistry, 2Institute of Microbiology, oligosaccharides” that have the structure shown. These are bound to the polypeptide chain through Eidgeno¨ssische Technische Hochschule Zu¨rich, Uni- an N-glycosidic bond with the side chain of an asparagine that is part of the Asn-X-Ser/Thr versita¨tstrasse 16, CH-8092 Zu¨rich, Switzerland. consensus sequence. Terminal glucose and mannose residues are removed in the ER by glucosidases *To whom correspondence should be addressed. and mannosidases. The symbols for the different sugars are used in the following figures.

2364 23 MARCH 2001 VOL 291 SCIENCE www.sciencemag.org