Elucidating Glycosaminoglycan–Protein–Protein Interactions Using Carbohydrate Microarray and Computational Approaches

Elucidating Glycosaminoglycan–Protein–Protein Interactions Using Carbohydrate Microarray and Computational Approaches

Elucidating glycosaminoglycan–protein–protein interactions using carbohydrate microarray and computational approaches Claude J. Rogersa,1, Peter M. Clarka,1, Sarah E. Tullya, Ravinder Abrolb, K. Christopher Garciac, William A. Goddard IIIb,d, and Linda C. Hsieh-Wilsona,2 aCalifornia Institute of Technology and Howard Hughes Medical Institute, Division of Chemistry and Chemical Engineering, 1200 East California Boulevard, Pasadena, CA 91125; bMaterials and Process Simulation Center, California Institute of Technology, Division of Chemistry and Chemical Engineering, 1200 East California Boulevard, Pasadena, CA 91125; cStanford University School of Medicine and Howard Hughes Medical Institute, Departments of Molecular and Cellular Physiology and Structural Biology, Stanford, CA 94305; and dGraduate School of EEWS (WCU), Korea Advanced Institute of Science Technology, Daejeon, 305-701, Korea Edited by Chi-Huey Wong, Academia Sinica, Taipei, Taiwan, and approved May 4, 2011 (received for review February 25, 2011) Glycosaminoglycan polysaccharides play critical roles in many Several methods have been developed to study GAG–protein cellular processes, ranging from viral invasion and angiogenesis to interactions, including affinity chromatography, analytical ultra- spinal cord injury. Their diverse biological activities are derived centrifugation, electrophoretic mobility shift assays, competition from an ability to regulate a remarkable number of proteins. How- experiments, mass spectrometry-based approaches, isothermal ever, few methods exist for the rapid identification of glycosami- titration calorimetry, and surface plasmon resonance (9–16). noglycan–protein interactions and for studying the potential of Although powerful, these approaches are low throughput, often glycosaminoglycans to assemble multimeric protein complexes. labor intensive, and require significant quantities of carbohydrate Here, we report a multidisciplinary approach that combines new and/or protein. Notably, no methods are available to rapidly carbohydrate microarray and computational modeling methodol- screen various GAGs for their ability to assemble multimeric pro- ogies to elucidate glycosaminoglycan–protein interactions. The tein complexes. In addition, existing methods often require oligo- CHEMISTRY approach was validated through the study of known protein part- saccharides or polysaccharides that are relatively homogeneous ners for heparan and chondroitin sulfate, including fibroblast in chain length and charge density, such as fractionated heparin growth factor 2 (FGF2) and its receptor FGFR1, the malarial protein or chemically modified HS (11, 12, 14). As such, it has been VAR2CSA, and tumor necrosis factor-α (TNF-α). We also applied difficult to study the interactions of proteins with other GAG the approach to identify previously undescribed interactions be- classes and physiologically relevant GAG preparations, which are tween a specific sulfated epitope on chondroitin sulfate, CS-E, more heterogeneous and structurally diverse. and the neurotrophins, a critical family of growth factors involved Similarly, structural studies of GAG–protein interactions have in the development, maintenance, and survival of the vertebrate been limited by the complexity and heterogeneity of naturally BIOCHEMISTRY nervous system. Our studies show for the first time that CS is occurring GAGs. For example, the majority of crystal structures capable of assembling multimeric signaling complexes and modu- contain fully sulfated heparin oligosaccharides instead of the lating neurotrophin signaling pathways. In addition, we identify physiological HS ligands of lower charge density (17). As an a contiguous CS-E-binding site by computational modeling that alternative strategy, recent advances in molecular modeling suggests a potential mechanism to explain how CS may promote have provided several approaches for understanding the interac- neurotrophin-tyrosine receptor kinase (Trk) complex formation tion of heparin or HS with proteins, including IL-8, TSG-6, and neurotrophin signaling. Together, our combined microarray and PECAM-1 (18–20). However, GAG–protein interactions and computational modeling methodologies provide a general, pose a unique set of challenges for computational modeling, facile means to identify new glycosaminoglycan–protein–protein such as highly flexible sugar ligands with many rotatable bonds, interactions, as well as a molecular-level understanding of those interaction energies dominated by electrostatics, and shallow, complexes. solvent-accessible binding pockets. Importantly, computational approaches have not yet been applied to other GAGs, such as lycosaminoglycans (GAGs) regulate a wide range of physio- the less highly charged CS class, nor have they been developed Glogical processes, including viral invasion, blood coagulation, to investigate large GAG–protein–protein complexes. cell growth, and spinal cord injury (1–4). Assembled from repeat- Here, we describe an integrated approach that combines ing disaccharide units, GAGs display diverse patterns of sulfation carbohydrate microarray methodologies with computational (SI Appendix, Fig. S1). These sulfation patterns are believed to modeling to provide unique insights into GAG interactions with have important functional consequences, enabling the polysac- proteins and multimeric protein complexes. We demonstrate that charides to interact with a wide variety of proteins (1, 2). How- carbohydrate microarrays can be used to rapidly screen proteins ever, the precise sulfation motifs involved in protein recognition and protein–protein complexes for binding to specific sulfation are understood in only a few cases (1, 4, 5). Moreover, studies of motifs and GAG classes. Such information can then be used heparan sulfate (HS) interactions with fibroblast growth factors suggest that GAGs can assist in the assembly of multimeric pro- Author contributions: C.J.R., P.M.C., R.A., W.A.G., and L.C.H.-W. designed research; C.J.R., tein complexes, thereby modulating signal transduction pathways P.M.C., and S.E.T. performed research; C.J.R., P.M.C., S.E.T., and K.C.G. contributed new (6–10). Yet, only a few such examples have been elucidated, and reagents/analytic tools; C.J.R., P.M.C., R.A., W.A.G., and L.C.H.-W. analyzed data; and the extent to which other GAGs such as chondroitin sulfate (CS) C.J.R., P.M.C., and L.C.H.-W. wrote the paper. engage in the formation of multimeric protein complexes remains The authors declare no conflict of interest. unknown. Elucidating the interactions of specific GAG substruc- This article is a PNAS Direct Submission. tures with proteins and large protein–protein complexes will be 1C.J.R. and P.M.C. contributed equally to this work. critical for understanding the structure-activity relationships of 2To whom correspondence should be addressed. E-mail: [email protected]. GAGs and the mechanisms underlying important biological This article contains supporting information online at www.pnas.org/lookup/suppl/ processes. doi:10.1073/pnas.1102962108/-/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.1102962108 PNAS Early Edition ∣ 1of6 Downloaded by guest on September 30, 2021 in conjunction with previously undescribed computational mod- formation of a CS-A tetrasaccharide by performing molecular eling approaches to predict GAG-binding sites within proteins dynamics simulations in explicit water (22) and used ScanBind- and to determine the potential for GAGs to assemble multimeric Site and GenMSCDock to determine the CS-A binding site on protein complexes. Using this combined approach, we identify a DBL3x or DBL6. CS-A was found to interact with both of the specific interaction between the sulfated chondroitin sulfate-E lysine residues predicted by mutagenesis to comprise the primary (CS-E, SI Appendix, Fig. S1) epitope and the neurotrophin CS-A binding site on DBL6 (Fig. 1A and SI Appendix, Table S1). (NT) family of growth factors and receptors. Our computational Moreover, five of the seven residues determined to be important modeling results suggest a contiguous CS-E-binding site that for CS-A binding to DBL3x were successfully identified, further spans the NT–tyrosine receptor kinase (Trk) complex, providing validating the computational approach (Fig. 1B and SI Appendix, a potential mechanism to explain how CS modulates complex Ta b l e S 2 ). The side chain of Lys1515, one of the residues not formation and NT signaling pathways. Together with cellular identified by computational modeling, is buried in the crystal data, we provide evidence that CS plays an active role in cellular structure and makes electrostatic contacts with two internal signaling by regulating the interactions between growth factors Glu residues (Glu1464 and Glu1518), suggesting that this residue and their receptors. may not engage directly in interactions with CS-A. Together, these results show that CS binding sites on proteins can be cor- Results and Discussion rectly identified using our computational approach. Furthermore, General Microarray Approach. The binding of proteins to GAGs we identify additional residues within DBL6 and DBL3x that may was examined using carbohydrate microarrays containing either participate in close van der Waals and other interactions with synthetic tetrasaccharides of defined sulfation sequence (21, 22) the sugar. or naturally occurring polysaccharides representing various GAG classes (23). Microarrays of synthetic tetrasaccharides displaying Fibroblast growth factor 2 (FGF2) and its

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    6 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us