UC Irvine UC Irvine Electronic Theses and Dissertations
Title Development and Application of Computational Approaches in Drug Discovery
Permalink https://escholarship.org/uc/item/00m5c6qs
Author Zanette, Camila
Publication Date 2019
License https://creativecommons.org/licenses/by/4.0/ 4.0
Peer reviewed|Thesis/dissertation
eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, IRVINE
Development and Application of Computational Approaches in Drug Discovery
DISSERTATION
submitted in partial satisfaction of the requirements for the degree of
DOCTOR OF PHILOSOPHY
in Pharmacological Science
by
Camila Zanette
Dissertation Committee: Professor David L. Mobley, Chair Professor Andrej Luptak Professor Celia W. Goulding
2019 © 2019 Camila Zanette DEDICATION
For Colin
ii TABLE OF CONTENTS
Page
LIST OF FIGURES vi
LIST OF TABLES xiv
LIST OF ALGORITHMS xvi
ACKNOWLEDGMENTS xvii
CURRICULUM VITAE xviii
ABSTRACT OF THE DISSERTATION xxi
1 Introduction 1
2 Toward learned chemical perception of force field typing rules 4 2.1 Abstract ...... 4 2.2 Introduction ...... 5 2.3 Methods ...... 8 2.3.1 Monte Carlo sampling over chemical perception trees ...... 13 2.3.2 SMARTY is an algorithm for learning indirect chemical perception trees 19 2.3.3 SMIRKY is an algorithm for learning direct chemical perception trees 22 2.3.4 SMARTY and SMIRKY were evaluated using multiple molecule sets compared to reference force fields ...... 28 2.4 Results and Discussion ...... 32 2.4.1 SMARTY ...... 32 2.4.2 SMIRKY ...... 43 2.5 Conclusions ...... 54
3 Challenging molecular test cases for implicit solvation 57 3.1 Abstract ...... 57 3.2 Introduction ...... 58 3.3 Methods ...... 60 3.3.1 FreeSolv Database ...... 60 3.3.2 Implicit Hydration Free Energy Calculation using GROMACS . . . . 61 3.3.3 We separate the molecules with positive electrostatic term ...... 62
iii 3.3.4 Reprocessing of Implicit Solvent Hydration Free Energy using SEA - Hybrid model ...... 63 3.3.5 1,4-dimethylpiperazine study case ...... 64 3.4 Results and discussion ...... 66 3.4.1 Our hybrid method improved the overall accuracy of the hydration free energies ...... 67 3.4.2 A closer look at the 1,4-dimethylpiperazine implicit solvent model result 69 3.4.3 Comparing series of molecule types ...... 74 3.5 Conclusions ...... 77 3.6 Acknowledgement ...... 78
4 Predicting conformational poses of the lissoclimides for translation inhibition using in silico docking and in cerebro analysis 79 4.1 Abstract ...... 79 4.2 Introduction ...... 80 4.3 Methods ...... 83 4.3.1 We first prepared the structures for docking calculations ...... 84 4.3.2 Different docking methods were tested ...... 84 4.3.3 We calculated the shape-similarity of the ligands ...... 85 4.3.4 The new chlorolissoclimide structure influenced the later part of our study ...... 85 4.3.5 Analysis ...... 86 4.4 Results and Discussion ...... 86 4.4.1 Tanimoto score ...... 87 4.4.2 Structure-Activity Relationship ...... 88 4.4.3 Molecules without IC50...... 92 4.4.4 Docking versus Crystal Structure ...... 93 4.5 Conclusion ...... 94
5 Progress toward the comprehensive computational design of a macrocyclic peptide 96 5.1 Abstract ...... 96 5.2 Introduction ...... 97 5.3 Methods ...... 99 5.3.1 Initial structures preparation ...... 100 5.3.2 Molecular dynamics simulations ...... 101 5.3.3 Analysis ...... 101 5.4 Results and Discussion ...... 101 5.5 Conclusion ...... 108
Bibliography 110
iv A Appendix - Progress toward the comprehensive computational design of a macrocyclic peptide 129
v LIST OF FIGURES
Page
2.1 Describing a specific chemical group with SMARTS. An example of how a particular type of hydroxyl oxygen in a molecule (top) can be described using SMARTS strings. In this case, the SMARTS pattern (bottom) is for a hydroxyl connected to a carbon which is itself connected to another carbon. The alpha atoms (light blue) for the oxygen base type (“[#8]”) are one hydrogen (“[#1]”) and one carbon (“[#6]”), and they are connected to the base type via a single bond (“-”). The carbon (“[#6]”) atom is connected to another carbon (“[#6]”, which is considered a beta atom (pink)) via any bond (“∼”). The oxygen has a decorator (light green) (“X2”) which means it has two connected atoms. . 11 2.2 Illustration of the parm99/parm@Frosst hydrogen atom types for AlkEthOH set. This specific set has five different hydrogen types, OH (high- lighted in blue), HC (highlighted in yellow), H1 (highlighted in green), H2 (highlighted in lilac), and H3 (highlighted in pink) atom types. Reference atom types labels for each hydrogen atom are shown in italics next to each atom. SMARTS recovered by SMARTY for these atom types are shown in the corresponding color...... 12
vi 2.3 SMARTY and SMIRKY. Workflows for the SMARTY (left) and SMIRKY (right) tools are shown. There are three input data categories for SMARTY and SMIRKY (shown at top). In SMARTY (on the left), there are input files (such as base types, initial atom types, and decorators), molecules (in- put via a molecule set file), and reference data consisting of typed molecules (parm99/parm@Frosst atom typing was used in this work). SMIRKY has sim- ilar inputs, shown in the purple area on the right, and additionally allows the user to set the decorator odds and its reference data is fragment types (here, from smirnoff99Frosst). The algorithms are represented in the green area and are available on GitHub (https://github.com/openforcefield/smarty). Both tools begin by reading and processing the input files (top part of the green area). SMARTY (on the left) then conducts a series of moves (coral area) consisting of choosing one working atom type per iteration (icon with connected atoms), and deleting or modifying (pencil icon) this atom type, making choices made with equal probabilities (Section 2.3.2). SMIRKY (on the right) is similar, but samples over working fragment types (such as non- bonded types, bonds, angles, and proper and improper torsions; in the figure these are represented by two different icons with connected atoms) and uses weighted probabilities on their choices (Section 2.3.3). The acceptance crite- ria (diamond icon) for both tools are the same, using Equation 2.3 (Section 2.3.1). Both tools run a user-specified number of iterations and iterate over the steps in the coral area (repeat icon), then write out a final results file after all iterations are completed...... 14 2.4 Illustration of the bipartite graph used to calculate the score for a new proposed move. (a) A bipartite graph is created with a node for each working (light red) and reference (light blue) atom type. (b) Edges (represented by lines) are created for all possible connections between the nodes of the two sets. Initially each edge has a weight of zero. (c) For each atom, the weight is incremented by one on the edge connecting that atom’s working and reference atom types. In this figure, we illustrate the case for potential graph matches to hydrogen working atom type (“#1”) using the AlkEthOH molecule set (Section 2.3.4). However, in practice this process is applied to all working atom types simultaneously. (d) Each node is then restricted to have only a single edge, such that the total weight is maximized. 16
vii 2.5 Flowchart illustrating the decision tree used for SMARTY move proposals. The numbers attached to certain arrows correspond to the prob- ability of following that arrow; all moves are made with equal probability in SMARTY. The diamonds represent decisions and the rectangles are pro- cesses. We start the SMARTY move proposals with the working atom types and decide randomly with equal probability if we want to remove or add an atom type to the set. If we decide to remove, we randomly choose a working type to be removed and re-score the new set. If we decide to add, we pick a working atom type from the set and check if it has an alpha substituent. If the answer is no, we either add an alpha substituent or a decorator. If the answer is yes, we first check if the working atom type describes a hydrogen, and if yes, we add a beta substituent, if not, we either add an alpha or a beta substituent. The new atom type SMARTS created is added to the end of the list of its corresponding parent. The end result (black circle on the bottom) is a move proposal which is then evaluated via scoring function (Section 2.3.1). 21 2.6 Mapping between ChemicalEnvironment and corresponding SMIRKS pattern. This illustrates a SMIRKS pattern “[#6X3H2,#7X2H1;A+0:1]-[#1:2]” (middle) for a substructure (top) being partitioned and how it would be stored in a ChemicalEnvironment object (bottom). The SMIRKS pattern (middle) represents a bond parameter “-” (gray) between two labeled atoms — a carbon (light blue) or nitrogen (light blue) atom and a hydrogen atom (light red). In this figure, we show two matches for this SMIRKS pattern on the top structure. The matches are highlighted in light blue (atom 1) and light red (atom 2) cir- cles on the substructure (top). In the ChemicalEnvironment representation, at bottom, atom 1 (light blue rectangle), there are two ORtypes ‘’‘#6X3H2” with a carbon atom base “#6” and OR decorators “[‘X3, ‘H2’]”, correspond- ing to a connectivity of three and a total hydrogen count of one and “#7X2H1” with nitrogen atom base “#7” and OR decorators “[‘X2, ‘H1’]” (light green). The AND decorators for the atom 1 are “[‘A’, ‘+0’]” (light green) corre- sponding to an aliphatic atom with a zero charge. In SMIRKS strings, the high precedence “and” operator is given by a semi-colon “;”, so the bond described by this pattern is one between a carbon atom with connectivity three and hydrogen count two or a nitrogen atom with connectivity two and hydrogen count one that is also aliphatic with zero charge...... 25
viii 2.7 Flowchart to illustrate the decision tree used for SMIRKY move proposals. The numbers attached to certain arrows correspond to the prob- ability of following that arrow, as currently implemented in SMIRKY. The diamonds represent decisions and the rectangles are processes. We start the SMIRKY move proposals by choosing an initial fragment type SMIRKS from the set. SMIRKY then decides randomly with specified probabilities whether it will remove or add a fragment type SMIRKS. To add, the algorithm chooses between making a change to an atom or a bond. If the change is in a bond, SMIRKY picks OR or AND decorators to change, but if the change is in an atom, SMIRKY can pick OR decorator or OR base to change; and then choose between deleting, adding or swapping one of those decorators. The last step is to choose if a change to an outer atom should be symmetric, if so both outer atoms are changed. Also, if SMIRKY chooses to change an atom, it can delete or add a new atom to this atom. The new fragment type is added to the end of its parent’s list. The end result (black circle) is a move proposal which is then evaluated via the scoring algorithm described in Section 2.3.1. * - The probability of “yes” for this particular decision is 0.2, 0.6, and 0.15 for bond, angle, and torsion types respectively)...... 27 2.8 Total score as a function of iteration for SMARTYorig sampling AlkEthOH. The graph shows the total score (fraction of atoms matching the parm99/parm@Frosst reference atom types) as a function of iteration varies with the temperature for SMARTYorig sampling AlkEthOH. The lines in the graph represent the temperatures 0 (pink), 0.0001 (orange), 0.01 (green) and 0.1 (gray). We can see a high variation throughout the simulation and a pre- dominance of lower scores for extreme temperatures such as 0 and 1.0. At intermediate temperatures, such as 0.01 and 0.0001, SMARTY scored higher. The Supporting Information includes plots for simulations at all temperatures. 34 2.9 Frequency of success for SMARTYorig on the AlkEthOH and PhEthOH sets. Heat maps show the results of SMARTYorig for the AlkEthOH (A) and PhEthOH (B) molecule sets. We plot the fraction of simulation time that each reference atom type (x axis) is matched by a SMARTS pattern with a par- tial score of 100%. We tested eight different temperatures (y axis) and ran 10 simulations of 10,000 iterations each. Darker colors imply a higher rate of matching 100% of that reference atom type; see color bars at right. If we never found the reference atom type, the corresponding cell would be white, but this does not occur here...... 36
ix 2.10 Example of a SMARTY hierarchy and inheritance in AklEthOH. This figure shows the final hierarchy achieved in one SMARTY simulation with the AlkEthOH set at T = 0.0001 after 10,000 iterations. The top shows the hierarchy of SMARTS strings discovered where each color represents a different level. For example, the initial types are represented in light blue, and the yellow squares are the working atom types derived from the initial types; the same logic applies for the other colors. For hydrogen, we see SMARTS patterns that match the parm99/parm@Frosst reference types HC (yellow), H1 (light green), H2 (lilac), and H3 (light red). The bottom (large gray box) shows all final working types and their respective reference atom types as well as partial score for that pairing...... 37 2.11 Comparison of SMARTYelem (hydrogen and oxygen) versus SMARTYorig (all atoms) on AlkEthOH. Here we determine the first iteration at which each sampler achieves a 100% total score (numerical values, out of 10,000 in total), then average this number for all runs which were successful in achieving a score of 100% at some point (out of 10 trials in total). The y axis shows which sampler was used (SMARTYorig for all atoms versus SMARTYelem for hydrogen and oxygen) and the x axis shows the temperature employed. In general lighter colors indicate higher success rates, and lower numbers indicate success was achieved more rapidly. Results for SMARTYelem for carbon is not shown since AlkEthOH has only one carbon atom type (CT)...... 39 2.12 Frequency of success for SMARTYelem on the MiniDrugBank set. For SMARTYelem for MiniDrugBank, we plot the fraction of simulation time where the partial score is 100% for a given parm99/parm@Frosst atom type. The elements shown here are (a) hydrogen, (b) carbon, (c) nitrogen, and (d) oxygen. We used 10 simulations of 10,000 iterations each to construct the plots at different temperatures (y axis). At high temperatures, most of the reference atom types reach 100% partial score. However, at low temperatures, there are more well populated reference atom types but also more which never achieve 100% at any iteration...... 40 2.13 SMARTY scoring with the bipartite graph. We illustrate how SMARTY scores working atom types (red on the left) compared to reference atom types (blue on the right) using a bipartite graph. (a) When there is only one generic SMARTS string (“[#6]”), this pattern is assigned to both CT and CA atom types through the PATTY-adapted scheme, but it will be matched only to the more populated reference atom type (CT highlighted in yellow); (b) when SMARTY creates a new set of working types, containing both “[#6]” and “[#6X4]”, the generic atom type will not match CT anymore, since it chemically matches the new working atom type (“[#6X4]”) instead...... 42
x 2.14 Total score versus iteration for SMIRKY sampling torsion fragment types in AlkEthOH. At a temperature of zero (pink line), SMIRKY works solely as a stochastic optimizer and gets stuck in a local optimum. When temperatures are too high, such as 0.1 (gray line), the probability of accepting moves that cause a decrease in score is very high, leading to dramatic changes in score throughout the simulation. At more moderate temperatures, such as 0.001 (blue line), a total score of 100% is eventually achieved...... 46 2.15 Frequency of success for SMIRKY simulations with AlkEthOH and PhEthOH. For torsion fragments in both AlkEthOH (a) and PhEthOH (b), we plot the fraction of simulation time where each reference fragment type has a torsion SMIRKS pattern that receives a partial score of 100%. For AlkEthOH, at temperatures below 0.001, all of the reference fragments spend at least 50% of the simulation time with a 100% partial score. However, for PhEthOH, many of the reference fragments spend a significant amount of simulation time without a 100% partial score...... 48
3.1 Thermodynamic path for calculating the ∆GSEA for our hybrid method. Svac and SGB are the solvation energies using GROMACS in vacuum and GB, respectively. SSEA is the solvation energy from SEA using the 5000 trajectories from the implicit solvent hydration free energy using GROMACS. ∆GSEA was calculated using ∆∆GGB→SEA and ∆Ghyd (implicit solvent)...... 63 3.2 Implicit (yellow dots) and hybrid solvent (lilac dots) versus explicit solvent [1] hydration free energy for the FreeSolv database molecules. (a) Linear re- gression (red line) analysis of implicit solvent model (GBOBC ) values (on the horizontal axis) and explicit solvent model values (on the vertical axis). The molecules that presented positive value of the electrostatic energy are shown in black triangles. The free energies distributions for the implicit and explicit solvent methods are shown on the top and the right side of the graph, re- spectively. The R2 is equal to 0.8. (b) Linear regression (red line) analysis of hybrid SEA values (on the horizontal axis) and explicit solvent model values (on the vertical axis). The free energies distributions for the hybrid SEA and explicit solvent methods are shown on the top and the right side of the graph, respectively. The R2 is equal to 0.95...... 67 3.3 Hydration free energy distributions for explicit [1], implicit, and hybrid sol- vent hydration free energy calculations for the FreeSolv database. (a) Explicit (gray) and implicit (yellow) solvent hydration free energy distributions; (b) Explicit (gray) and hybrid SEA (lilac) solvent hydration free energy distri- butions; (c) Implicit (yellow) and hybrid SEA (lilac) solvent hydration free energy distributions; ...... 69 3.4 Molecules from the FreeSolv database that showed positive GB polarization values...... 70 3.5 GB polarization energy for 1,4-dimethylpiperazine. We changed the radius size of the nitrogen (wine line) and hydrogen (tan line), separately, and cal- culated their GB polarization energies. Errors are shown in silver lines. . . . 72
xi OBC 3.6 Hydration free energies for a series of molecules types. a) Gel from the GB in kcal/mol. b) Explicit solvent hydration (beige), implicit solvent (dark red), SEA(light lilac), and GBNSR6 (sand) hydration free energies for seven molecules. The electrostatic term for 1-methylpyrrolidine is 0.065 ± 0.005. No explicit solvent hydration free energy is shown for 1-methylpyrrolidine as it is not part of the FreeSolv database...... 76
4.1 Cycloheximide and Lactimidomycin, respectively, 2D structures...... 80 4.2 Cycloheximide (magenta) and Lactimidomycin (purple) crystal structures in the binding pocket of the 60S ribosome from S. cerevisiae [2]. Hydrogen bonds are represented in red and black dashed lines for CHX and LTM, respectively. 81 4.3 2D structures of the ligands tested in this study and their activities. The molecules that were tested their 50% inhibitory concentrations (IC50) against P388 leukemia cells. and (B) the ones that were not test their activity. 82 4.4 2D structures of the ligands tested in this study with no activities. The molecule structures that were not test in P388 leukemia cells to get their activity...... 82 4.5 Most likely binding mode for (A) chlorolissoclimide, (B) its analogue, (C) haterumaimide J, and (D) its analogue...... 87 4.6 Chlorolissoclimide crystal structure in the binding site of the S. cerevisiae 80S ribosome, hydrogen bonds are in black dashed lines...... 88 4.7 Most likely binding poses for compounds (A) 37, (B) 38, (C) 39, (D) 41, (E) 40, (F) hatN, (G) 3168, (H) hatQ, (J) 393, (K) 45, (L) dichlorolissoclimide, (M) hatJ, (N) anti1, (O) anti2,and (P) C3AC in the binding pocket of ribosome. Hydrogen bonds are represented in black dashed lines...... 89 4.8 Ligands bound to the 80S ribosome. (a) Co-crystal structure of CL, (b) docking-based structure of analogue 39, (c) docking-based structure of analogue 45, (d) docking-based structure of analogue 37...... 92 4.9 Compound 45 bound to the 80s ribosome. The two halogen-π interac- tions on the facing guanines (G2793/4) are in red dashed lines. The hydrogen bonds are in black dashed lines...... 93 4.10 Comparison of crystal structure and docking pose prediction. Co- crystal chlorolissoclimide (magenta) and the binding mode predictions of (a) chlorolissoclimide (green) and (b) analogue (orange), their RMSD are 1.6 Å and 1.2 Å, respectively. The hydrogen bonds between the co-crystal CL and ribosome are shown in black dashed lines...... 94
5.1 Crystal structure of CdiA-CT (green) and CdiI (cyan)...... 98 5.2 Crystal structure of macrocyclic peptide [3]...... 99 5.3 Hydrogen bonds average per residue for alanine mutations (mut1 to mut11) to Cdil. The asterisk (*) represents where the alanine mutation is for each mutated peptide. Some of alanine mutations drastically reduced the average number of hydrogen bonds between the peptide and Cdil protein...... 104
xii 5.4 RMSD of the backbone for the (a) ligand and (b) complex ligand-protein for mutation 1 to 11. All mutated peptide showed a constant RMSD and variations below 0.3 nm. The peptide alone shows more flexibility than when it is bound to the protein...... 106 5.5 Secondary structures analysis (DSSP). Secondary structures of the (A) origi- nal macrocyclic peptide, (B) mut12 and (C) mut2 over the time. The original macrocyclic peptide kept its β-sheet structure considerable stable along the time. The mut12, which has two cysteine mutations to form a bridge disulfide bridge, lost its β-sheet structure along the time and also the turn was com- promised. The mut2 is a more conservative mutation, where a serine residue was switched to an alanine, the β-sheet structure was kept along the time, but the peptide lost its bent shape...... 107 5.6 Hydrogen bonds average per residue for mutations (mut13 to mut20). The as- terisk (*) represents where the mutations is for each mutated peptide. Some of mutations drastically reduced the average number of hydrogen bonds between the peptide and Cdil protein...... 108
xiii LIST OF TABLES
Page
2.1 Decorators for elaborating SMIRKS and SMARTS strings. A selection of decorators that can be used for atoms (top section) and bonds (middle section) in SMIRKS or SMARTS strings. Decorators on atoms and bonds can be combined using Boolean operators (bottom section), where the high precedence and is applied before an or operator and the low precedence and is applied after. For a complete list of decorators and documentation for SMARTS and SMIRKS see the Daylight Theory Manual [4]...... 10 2.2 Average of average total scores of 10 SMARTY simulations with AlkEthOH and PhEthOH. We took the average score at each temperature for each of our 10 simulations (10,000 iterations each), then averaged this across all 10 simulations. * - uncertainties were estimated by the standard error over all 10 trials...... 33 2.3 Initial and maximum total scores for all SMARTY simulations with AlkEthOH, PhEthOH, and MiniDrugBank. Initial scores are the score from the initial types in each molecule set. Maximum scores are the highest total score measured during a single iteration in any simulation performed. SMARTYorig scores are shown in the ‘all’ row, and SMARTYelem scores are shown broken out in the ‘hydrogen’, ‘carbon’, ‘oxygen’, ‘nitrogen’, and ‘sulfur’ rows...... 35 2.4 Initial and maximum total scores achieved across all SMIRKY simu- lations with AlkEthOH, PhEthOH, and MiniDrugBank. Initial scores are the score from the initial types for each fragment type in each molecule set. Maximum scores are the highest total score measured during a single iteration in any simulation performed...... 44 2.5 Number of fragment types with 100% partial score in all SMIRKY simulations. This table summarizes results for all SMIRKY simulations. For each molecule set, we report the fraction of reference fragment types that get a 100% partial score during at least one simulation. It shows that we were successful in identifying all fragment types in the AlkEthOH and PhEthOH sets where all fractions are equal to one. As with SMARTY, we were still partially successful for MiniDrugBank given that we find the majority of fragment types...... 45
xiv 2.6 Average of average total scores of 10 SMIRKY simulations with AlkEthOH. We took the average score at each temperature for each of our 10 simulations (10,000 iterations each), then averaged this across all 10 sim- ulations. * - uncertainties were estimated by the standard error over all 10 trials...... 47 2.7 Average of average total scores of 10 SMIRKY simulations with PhEthOH. We took the average score at each temperature for each of our 10 simulations (10,000 iterations each), then averaged this across all 10 sim- ulations. * - uncertainties were estimated by the standard error over all 10 trials...... 49 2.8 Average of average total scores of 10 SMIRKY simulations with MiniDrugBank. We took the average score at each temperature for each of our 10 simulations (10,000 iterations each), then averaged this across all 10 simulations. * - uncertainties were estimated by the standard error over all 10 trials...... 51
3.1 Electrostatic (Gel) and hydration free energies (∆Gsolv) in kcal/mol for 1,4- dimethylpiperazine using AMBER in different solvation models and radii. . . 73 3.2 Comparison of results with experimental hydration free energies using the full free energy scheme (∆G) and several different implicit solvation models (kcal/mol)...... 74
4.1 Shape overlay results using ROCS Software ...... 90
5.1 List of mutations created from the original macrocyclic peptide. The amino acids mutated from the original peptide are in bold letters...... 103 5.2 RMSF of the backbone of mut1 to mut11 using GROMACS...... 105
xv List of Algorithms
Page 1 Macrocyclic peptide auto run ...... 129 2 Macrocyclic peptide analysis ...... 130
xvi ACKNOWLEDGMENTS
Over the course of my graduate career, I have learned a wealth of information from David Mobley. I would like to extend my deepest gratitude to my advisor for giving me the opportunity to be part of his lab and teach me computational chemistry. His patience and hard work are greatly admired by me. I also would like to thank the entire Mobley group for their friendship, advise and help along those years. The lab has such a positive atmosphere that I could not ask for better peers.
During my graduate journey, I was blessed with the opportunity to work with incredible scientists in different projects that provided me an exceptional experience. Thank you to Chris Vanderwal and Zef Könst for giving me the opportunity to learn and collaborate with you. I also would like to thank the brilliant scientists behind the Open Force Field consortium for their invaluable scientific insights, especially Chris Bayly, Michael Gilson, Michael Shirts, and John Chodera. And especial thank you to Chris Fennell for his help with SEA.
I had the privilege to meet incredible professors at UCI who shaped my scientific develop- ment. Andrej Luptak was a great mentor, his breadth of knowledge and attention to his students is something I aspire to emulate. Celia Goulding is the synonymous of girl power, I admire her incredible scientific knowledge, her positive energy, and how she finds time to surf (my husband hopes I will join you someday). Luckily, I could have Andrej and Celia as committee members — thank you for everything.
My transition from Brazil to the US was not easy and the friends that I made here in Irvine made everything much lighter — thanks to Bev, Tom, Riann, Bao, Mona, and Rachel. I could not forget to thank my Brazilian friends, Amanda and Adriana, who supported me since the beginning and helped me during the difficulty times. Also, thank you Kirsten and Alex for your great friendship along those years (and for giving me a “forced break” from grad school to go to Hawaii for your wedding).
I would like to thank my family for their unconditional love. My family has helped me throughout the process, especially my mother and father who have been a never ending source of support and guidance. In 2008, I met the love of my life and partner in crime, Guilherme, who was responsible for awakening my interest in science. His love, guidance, and support helped me to go through this long five years of grad school. This dissertation would not be done without you. With Gui, I got an incredible second family, which makes me feel hugely blessed due to all the support and caring from his parents and brother.
Colin, my love, you are so little and have taught me so much. Thank you for the opportunity of being your mother. Te amo! And Summer, you went through all this process with me, I could not forget to thank you!
My dissertation was supported by the Brazilian Science without Borders program (Capes) and LASPAU, National Science Foundation, National Heath Institute, and Department of Pharmaceutical Sciences at the University of California.
xvii CURRICULUM VITAE
Camila Zanette
xviii Camila Zanette PHARMACEUTICAL SCIENCES · COMPUTATIONAL CHEMISTRY
[email protected] | www.camilazanette.com | camizanette | czanette Education Ph.D. — University of California Irvine (UCI) Irvine, CA, USA PHARMACOLOGICAL SCIENCE - ADVISOR: DAVID L. MOBLEY Sep 2013 - PRESENT • Computational chemistry M.S. — University of California Irvine (UCI) Irvine, CA, USA PHARMACOLOGICAL SCIENCE - ADVISOR: DAVID L. MOBLEY Sep 2013 - Dec 2016 • Computational chemistry M.S. — University of Sao Paulo (USP) Sao Paulo, SP, Brazil NUCLEAR TECHNOLOGY - APPLICATION (RADIOPHARMACEUTICALS) - ADVISOR: ELAINE BORTOLETI DE ARAUJO Feb 2011 - May 2013 • Study of the production process of radiopharmaceutical FLT-18F in automatic system: process validation and analytical methodology B.S. — State University of Maringa (UEM) Maringa, PR, Brazil PHARMACY (SPECIALIZATION IN PHARMACEUTICAL INDUSTRY) - ADVISORS: JOSIANE M. DE MELLO AND REGINA A. C. Mar 2006 - Jan 2011 GONÇALVES
Skills Computational Chemistry Molecular Modeling, Molecular Dynamics, Docking, Virtual Screening, Free Energy Calculations, Chemoinformatics Bioinformatics Genetics, Sequence Alignment (Blast), Selection Analysis, Motif Searching, Protein-protein Interaction Programming Python, LINUX, LaTeX, C/C++, bash, R, Perl, Machine Learning Analytical Methods HPLC, CG, Chromatography Languages Portuguese (native), English (fluent), Spanish (basic)
Experience
Agendia Irvine, CA BIOINFORMATICIAN Feb 2018 - present • Data analysis in molecular diagnostics • Data handling and transfer to clinical trials • QC monitoring • Software update and validation University of California Irvine Irvine, CA GRADUATE STUDENT RESEARCHER - MOBLEY LAB [MOBLEYLAB.ORG] Sep 2013 - present • Performed solvation free energy calculations of 642 small molecules, with experience in 4 different software (GROMACS, OpenEye toolkit, OpenMM, Amber). • Used bioinformatics tools and structure-based search methods to find GTP aptamers in BLAST database (Luptak Lab.) • Worked on parallel molecular dynamics simulations of protein and macrocyclic peptide engineering using GROMACS. • Applied 3 different computational docking software (AutoDock Vina, OEDocking, and Schrödinger) and shape similarity technique to study structure-activity relationships of lissoclimide compounds (translation inhibitors) in ribosome in collaboration with Vanderwal Lab. • Worked collaboratively on code development of a chemical perception tool in the innovative Open Force Field Group [openforce- field.org] to sample atom types and fragment types using Monte Carlo optimization and statistical Bayesian Inference. Sequencial Technical School Sao Paulo, Brazil PROFESSOR Jul 2011 - Dec 2011 • Taught introductory concepts in pharmaceutical formulation development in the Pharmacy Technician Program. • Prepared and proctored quizzes and exams. • Taught laboratory practices in pharmaceutical formulation.
MARCH 20, 2019 CAMILA ZANETTE · RESUME 1
xix