Visualizing Molecular Interactions: Computational Investigations of Structure-Function Relationships and Dynamics in Biomolecular Systems Kristina E. Furse Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556 Email: [email protected]

Computer modeling has emerged as a powerful complement The de novo receptor models also yielded some interesting and to experimental and theoretical investigations. Visualization of testable hypotheses for the structural basis of !-AR subtype experimental data in the context of a three-dimensional, ligand selectivity. atomic-scale model can not only help to explain unexpected Four years after we completed our study, the first high- results but often raises new questions, driving future research. resolution X-ray structure of !-AR was released.3 Many of our Models of sufficient quality can be set in motion in molecular structural predictions were validated by the X-ray structure, dynamics (MD) simulations to move beyond a static picture including an unusual configuration of three active site serines and provide insight into the dynamics of important biological and the unique spatial arrangement of two extracellular processes. disulfide bonds that introduced a previously unknown loop The power of computer modeling and simulation is residue, Phe-193, into the binding site (Figure 1). Our de novo nowhere more evident than in the study of small-molecule models were included in a virtual screening study by an interactions with proteins and DNA. Detailed knowledge of independent lab.4 They found that our model outperformed these molecular-scale interactions is vital to rational drug rhodopsin-based homology models as well as other de novo design, understanding and harnessing the catalytic power of models. In fact, ours was the only model to rival the !-AR X- enzymes, and unraveling the mysteries of molecular ray structure itself in virtual screening of agonists and recognition, which is at the heart of innumerable physiological antagonists. This demonstrates the potential power of processes from signal transduction to DNA repair. Computer constraint-based modeling approaches for structure prediction, models and simulations have been used in concert with given a methodical approach and sufficient high quality experiment and theory to gain insight into these processes. experimental data to refine the model. The following describes three distinct biological problems requiring distinct computational approaches: (1) theoretical modeling of a structurally underdetermined member of the G- protein coupled receptor (GPCR) superfamily, (2) mechanistic investigations of the pharmacologically important (COX) enzymes, and (3) simulation of time- resolved fluorescence spectroscopy experiments to understand water dynamics at the DNA interface.

GPCR: De novo model building. G-protein coupled receptors (GPCR) are prime therapeutic targets since they play a key role in signal transduction in many cells and are implicated in control or regulation of a wide array of biological functions. There is great interest in exploiting the details of GPCR structure for rational drug design, however like many integral membrane proteins they are challenging molecules to study experimentally and high-resolution structural characterization is extremely difficult. As a result, great importance has been placed on indirect structural Figure 1. Extracellular view of the complex of photolabile antagonist evidence obtained from a variety of biophysical techniques, as aminoflisopolol with the de novo !2-AR model. The ligand is well as detailed sequence analysis and molecular modeling displayed in white with key active site residues shown in magenta. studies. De novo modeling is a flexible technique that utilizes Y199 in TM5 is the site of covalent attachment in photoaffinity this indirect structural evidence as constraint data to guide the labeling experiments. Other key contacts include a charge reinforced construction of theoretical three-dimensional (3D) models of a hydrogen bond between the protonated amine and D113 in TM3 and a target without using a related protein of known structure as a hydrogen bond between the ligand sidechain hydroxyl and N293 in template (the latter describes “homology modeling"). TM6, both indicated by dashed yellow lines. N312 in TM7, along with F289 and H296 in TM6, interact with the isopropyl N- In this work, we constructed 3D models for agonist and substituent. For this complex, the second extracellular loop is antagonist complexes with !-adrenergic receptors (!-AR) included, with the two disulfide bonds highlighted in yellow and loop using both standard homology modeling methods (template: residue F193 shown interacting with the fluorenone ring of the ligand. rhodopsin crystal structure),1 as well as de novo modeling Adapted from Figure 6 in Ref 2. techniques.2 While the de novo and homology-derived receptor models were generally quite similar, there were some COX Enzymes: Mechanistic studies. The two localized structural differences that impacted the putative cyclooxygenase enzymes, COX-1 and COX-2, are responsible ligand-binding site significantly. The de novo receptor models for the committed step in biosynthesis, and are appeared to provide much better agreement with experimental the targets of the non-steroidal anti-inflammatory drugs data, particularly for receptor complexes with agonist ligands. , and the COX-2 selective inhibitors,

1

CelebrexTM, VioxxTM and BextraTM. The enzymes are both COX-1 and COX-2 with the substrate-derived remarkable in that they catalyze two dioxygenations and two pentadienyl radical intermediate bound in the active site.7 The cyclizations of the native substrate, , with near enzymes’ influence on the conformation of the pentadienyl absolute regio- and stereoselectivity. In stark contrast, the non- radical was investigated, along with the accessible space enzymatic (solution) oxidation of AA or its esters yields many above and below the radical plane, and the width of several different products. Several theories have been advanced to channels to the active site that could function as access routes explain the nature of enzymatic control over this series of for molecular oxygen. Additional simulations demonstrated reactions. the extent of molecular oxygen mobility within the active site (Figure 2). The results suggested that spin localization (Scenario 3) is unlikely to play a role in enzymatic control of this reaction. Instead, a combination of oxygen channeling, steric shielding and selective radical trapping appear to be responsible. This work adds a dynamic perspective to the strong foundation of static structural data (X-ray structures) available for these enzymes.

Figure 2. Traces of oxygen position from explicit O2 simulations. Each panel is a sliced solid surface view of the cyclooxygenase active site of COX-1/AA-13-yl complexes in the snapshot used as a starting point for 2 different trajectories (rows 1-2). The AA-13-yl radical is shown in red, with C11 highlighted in yellow, and the surfaces of active site residues Arg-120 (top) and Ser-530 (bottom) highlighted in cyan. The position of O2 in the active site over time is indicated by a line that changes color from blue to red from 0-500 ps. While the Figure 3. Fluorescent probe Hoechst 33258 bound to the minor surface views were sliced with a vertical clipping plane (cut surface is groove of DNA. The solvent accessible surface of the DNA is white) to show the interior space, no clipping planes were used on the displayed along with water molecules in the first solvation shell of the oxygen traces in order to show them in their entirety. Adapted from probe. Figure 7 in Ref 7. DNA: Simulating solvation dynamics. Together, We identified four possible scenarios to explain how the spectroscopy combined with computational studies that relate enzyme guides the first addition of oxygen to the pentadienyl directly to the experimental measurements have the potential radical intermediate almost exclusively at the pro-R face of to provide unprecedented insight into the dynamics of C11: (1) molecular oxygen is directed to the substrate via a important biological processes. It is well known that hydration designated oxygen channel in the enzyme; (2) the substrate is is essential to the stability and function of biological bound in the active site in a configuration that sterically molecules. Recent time-resolved fluorescence experiments shields alternative positive spin-bearing positions along the 5,6 have shown that the time scales for collective reorganization at pentadienyl radical; (3) the enzyme localizes spin at the pro- the interface of proteins and DNA with water are more than an R face of C11 by forcing the radical intermediate to adopt a order of magnitude slower than in bulk aqueous solution. The higher energy, non-planar conformation; (4) simple molecular interpretation of this change in the collective competition kinetics governs the product distribution—oxygen response is somewhat controversial—some attribute the adds to all possible positions along the substrate radical but slower reorganization to dramatically retarded water motion, only oxygenation at the pro-R face of C11 yields an while others describe rapid water dynamics at the interface intermediate that can be efficiently trapped (via cyclization to combined with a slower biomolecular response. form the endoperoxide ring). In order to gain further insight into the behavior of water at In order to evaluate the four possible scenarios, we biological interfaces, we performed solvation dynamics performed explicit, 10 ns molecular dynamics simulations of calculations on models of the fluorescent probe Hoechst

33258 (H33258) free in aqueous solution8 and bound to the We found that the water response to the DNA-bound probe minor groove of DNA (Figure 3).9 These studies connect was not dramatically different from the response to the probe directly to the time-resolved fluorescence experiments of in solution, only slowing by a factor of 2-3. The long time Zewail and coworkers.10 The H33258/DNA system has been scale response (20 ps) was instead traced to collective motions extensively characterized, providing a basis for modeling and of the DNA. This is consistent with the idea that solvation in complex environments includes the motion of components validating our simulations. X-ray crystal structures have been 13,14 reported for the DNA duplex with11 and without12 H33258 other than water. In the present study, DNA effectively bound, and the photophysical properties of the probe have “solvates” the probe along with water and ions, and exhibits a been studied in bulk and when bound to DNA. We conducted distinct time dependence of reorganization that leads to the an extensive investigation using both equilibrium and long time component in the collective response. nonequilibrium approaches on multiple independent Acknowledgement. The author would like to thank Dr. Terry trajectories with a detailed model of the H33258 fluorophore. Lybrand, Dr. Ned Porter, Dr. Derek Pratt, Dr. Claus Schneider and The time scales we calculated for the solvation response of Dr. Steve Corcelli for support and helpful discussions during the H33258 free in solution (0.17 and 1.4 ps) and bound to DNA evolution of these projects. K.E.F. was supported by a GAANN (1.5 and 20 ps) were highly consistent with experiment (0.2 Fellowship, training grant T32 GM08320 from the National Institutes and 1.2 ps, 1.4 and 19 ps, respectively). of Health, and the Warren Research Fellowship.

References (1) Palczewski, K.; Kumasaka, T.; Hori, T.; Behnke, C. A.; Motoshima, H.; Fox, B. A.; Le Trong, I.; Teller, D. C.; Okada, T.; Stenkamp, R. E.; Yamamoto, M.; Miyano, M. Science 2000, 289, 739-745. (2) Furse, K. E.; Lybrand, T. P. Journal of Medicinal Chemistry 2003, 46, 4450-4462. (3) Cherezov, V.; Rosenbaum, D. M.; Hanson, M. A.; Rasmussen, S. G. F.; Thian, F. S.; Kobilka, T. S.; Choi, H. J.; Kuhn, P.; Weis, W. I.; Kobilka, B. K.; Stevens, R. C. Science 2007, 318, 1258-1265. (4) Tang, H.; Wang, X. S.; Tropsha, A. In The 236th ACS National Meeting Philadelphia, PA, 2008. (5) Kiefer, J. R.; Pawlitz, J. L.; Moreland, K. T.; Stegeman, R. A.; Hood, W. F.; Gierse, J. K.; Stevens, A. M.; Goodwin, D. C.; Rowlinson, S. W.; Marnett, L. J.; Stallings, W. C.; Kurumbail, R. G. Nature 2000, 405, 97-101. (6) Malkowski, M. G.; Ginell, S. L.; Smith, W. L.; Garavito, R. M. Science 2000, 289, 1933-1937. (7) Furse, K. E.; Pratt, D. A.; Schneider, C.; Brash, A. R.; Porter, N. A.; Lybrand, T. P. Biochemistry 2006, 45, 3206-3218. (8) Furse, K. E.; Lindquist, B. A.; Corcelli, S. A. Journal of Physical Chemistry B 2008, 112, 3231-3239. Figure 4. Decomposition of the solvation response (calculated as the (9) Furse, K. E.; Corcelli, S. A. J. Am. Chem. Soc. 2008, 130, 13103-13109. ground state equilibrium correlation function, C(t)) into individual (10) Pal, S. K.; Zhao, L. A.; Zewail, A. H. Proceedings of the National Academy of Sciences of the United States of America 2003, 100, 8113-8118. contributions from water, DNA, sodium counterions and solute (11) Vega, M. C.; Saez, I. G.; Aymami, J.; Eritja, R.; Vandermarel, G. A.; conformation. DNA is primarily responsible for the long time Vanboom, J. H.; Rich, A.; Coll, M. European Journal of Biochemistry 1994, components of the collective response, while the water response 222, 721-726. decays rapidly. The inset shows a semi-log plot of the first 20 ps. (12) Edwards, K. J.; Brown, D. G.; Spink, N.; Skelly, J. V.; Neidle, S. Journal Taken from Figure 3 in Ref 9. of Molecular Biology 1992, 226, 1161-1173. (13) Andreatta, D.; Lustres, J. L. P.; Kovalenko, S. A.; Ernsting, N. P.; While the experimental solvation response is a collective Murphy, C. J.; Coleman, R. S.; Berg, M. A. Journal of the American Chemical Society 2005, 127, 7270-7271. property, we are able to decompose the total calculated (14) Halle, B. In Hydration processes in biology (ed. M.-C. Bellisent-Funel) response into contributions from the individual components 1998, pp. 233-249. (water, DNA, counterions and solute conformation, Figure 4).