International Journal of Molecular Sciences
Article Identification of 13 Guanidinobenzoyl- or Aminidinobenzoyl-Containing Drugs to Potentially Inhibit TMPRSS2 for COVID-19 Treatment
Xiaoqiang Huang 1 , Robin Pearce 1, Gilbert S. Omenn 1,2 and Yang Zhang 1,3,*
1 Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA; [email protected] (X.H.); [email protected] (R.P.); [email protected] (G.S.O.) 2 Departments of Internal Medicine and Human Genetics and School of Public Health, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA 3 Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA * Correspondence: [email protected]
Abstract: Positively charged groups that mimic arginine or lysine in a natural substrate of trypsin are necessary for drugs to inhibit the trypsin-like serine protease TMPRSS2 that is involved in the viral entry and spread of coronaviruses, including SARS-CoV-2. Based on this assumption, we identified a set of 13 approved or clinically investigational drugs with positively charged guanidinobenzoyl and/or aminidinobenzoyl groups, including the experimentally verified TMPRSS2 inhibitors Camo- stat and Nafamostat. Molecular docking using the C-I-TASSER-predicted TMPRSS2 catalytic domain model suggested that the guanidinobenzoyl or aminidinobenzoyl group in all the drugs could form putative salt bridge interactions with the side-chain carboxyl group of Asp435 located in the S1 pocket Citation: Huang, X.; Pearce, R.; Omenn, G.S.; Zhang, Y. Identification of TMPRSS2. Molecular dynamics simulations further revealed the high stability of the putative of 13 Guanidinobenzoyl- or salt bridge interactions over long-time (100 ns) simulations. The molecular mechanics/generalized Aminidinobenzoyl-Containing Drugs Born surface area-binding free energy assessment and per-residue energy decomposition analysis to Potentially Inhibit TMPRSS2 for also supported the strong binding interactions between TMPRSS2 and the proposed drugs. These COVID-19 Treatment. Int. J. Mol. Sci. results suggest that the proposed compounds, in addition to Camostat and Nafamostat, could be 2021, 22, 7060. https://doi.org/ effective TMPRSS2 inhibitors for COVID-19 treatment by occupying the S1 pocket with the hallmark 10.3390/ijms22137060 positively charged groups.
Academic Editor: Anindita Das Keywords: SARS-CoV-2; COVID-19; TMPRSS2; drug; docking; molecular dynamics
Received: 25 May 2021 Accepted: 28 June 2021 Published: 30 June 2021 1. Introduction
Publisher’s Note: MDPI stays neutral Although several mRNA or protein-based coronavirus disease 2019 (COVID-19) vaccines with regard to jurisdictional claims in (Pfizer-BioNTech, Moderna, Johnson & Johnson/Janssen, AstraZeneca, Chinese and Russian published maps and institutional affil- COVID-19 vaccines, etc.) have been authorized for emergency use in the United States (https: iations. //www.fda.gov/emergency-preparedness-and-response/coronavirus-disease-2019-covid- 19/covid-19-vaccines (accessed on 28 March 2021)) or introduced elsewhere, the devel- opment of anti-COVID-19 drugs is still of high necessity for COVID-19 disease treatment. Developing and bringing to market a brand new drug for treating a specific disease usually takes more than 10 years. Given that there are many approved or clinically investiga- Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland. tional drugs, efforts to repurpose available drugs to target critical proteins involved in the This article is an open access article SARS-CoV-2/host interaction pathway are desirable [1,2]. distributed under the terms and The pathogen that causes COVID-19, severe acute respiratory syndrome coronavirus conditions of the Creative Commons 2 (SARS-CoV-2), invades hosts by hijacking the angiotensin-converting enzyme 2 (ACE2) Attribution (CC BY) license (https:// using its spike protein, followed by employing a host enzyme, transmembrane protease 0 creativecommons.org/licenses/by/ serine 2 (TMPRSS2), to prime the spike at the S2 cleavage site to expose its hydrophobic 4.0/). fusion peptide for fusing with the membranes of host cells [3–5]. Normally, it would be
Int. J. Mol. Sci. 2021, 22, 7060. https://doi.org/10.3390/ijms22137060 https://www.mdpi.com/journal/ijms Int. J. Mol. Sci. 2021, 22, 7060 2 of 20
ideal to develop drugs to target the SARS-CoV-2 proteins (e.g., the spike protein) rather than the host proteins to reduce the side effects. However, SARS-CoV-2 is an RNA virus whose genome is prone to mutation [6]; the alteration of some amino acids on a target viral protein may make a drug ineffective. Thus, a desirable alternative would be anti-SARS-CoV-2 drugs that target key host proteins. TMPRSS2 is a viable anti-SARS-CoV-2 host protein target for the following four rea- sons. First, it is not mutation-prone. Second, TMPRSS2 is used by other coronaviruses (e.g., SARS-CoV and MERS-CoV) and by influenza A viruses for the activation of surface glyco- proteins; therefore, a specific TMPRSS2 inhibitor may treat a whole class of diseases caused by different pathogens [6], including SARS-CoV-2 variants, during this pandemic and in coming years. Third, TMPRSS2 does not appear to play an essential role in any organ, as other proteases may provide a degree of redundancy; thus, TMPRSS2 inhibition may have few on-target side effects. In TMPRSS2-knockout mice, TMPRSS2 appeared dispensable for normal development, growth, and organ function [7]. Fourth, since TMPRSS2 is a member of the serine protease family for which many inhibitors are available [6], finding a suitable drug to target it should be feasible. There have been multiple studies aimed at repurposing and screening available drugs to target TMPRSS2 [8–17]. Soon after the outbreak of COVID-19, Hoffmann et al. demonstrated that SARS-CoV-20s dependence on TMPRSS2 for cell entry can be blocked by a clinically proven protease inhibitor, Camostat [3]. A metabolite of Camostat, 4-(4- guanidinobenzoyloxy)phenylacetic acid (GBPA, known as FOY-251) also inhibited TM- PRSS2 but with reduced efficiency compared to Camostat [8]. Later, numerous research groups proved that Nafamostat has about 10-fold greater potency than Camostat for pre- venting SARS-CoV-2 infection through in vitro and in vivo studies [9,11,12,18]. Meanwhile, Shrimp et al. suggested that Gabexate was a potential TMPRSS2 inhibitor with an IC50 of about 130 nM [11], but other studies reported that Gabexate was not able to effectively inhibit viral infections even at a high concentration of 10 µM[9,12]. Some studies iden- tified the United States Food and Drug Administration (FDA)-approved bromhexine as an inhibitor of TMPRSS2 at a concentration of 750 nM [19], while other in vitro studies reported that bromhexine could not inhibit TMPRSS2 at all [11,20]. Hempel et al. carried out a systematic analysis to compare Nafamostat, Camostat, and GBPA to determine how these compounds could effectively inhibit TMPRSS2 [18]. Their computational studies suggested that the three compounds contain the positively charged guanidinobenzoyl and/or aminidinobenzoyl moiety, which can form stable salt bridge interactions with the negatively charged aspartic acid Asp435 in the S1 pocket of TMPRSS2, occupying the binding site and leading to the inhibition. Inspired by these studies, we hypothesized that some other guanidinobenzoyl- or aminidinobenzoyl-containing drugs may act as TMPRSS2 inhibitors. We identified from DrugBank [21] a narrowed list of 13 compounds (three FDA-approved drugs and 10 inves- tigational drugs) that contain guanidinobenzoyl or aminidinobenzoyl groups (Figure S1). We computationally evaluated their potency for inhibiting TMPRSS2 through molecular docking, molecular dynamics (MD) simulation, and post-MD analysis, such as molecular mechanics/generalized Born surface area (MM/GBSA) [22]-binding free energy calcula- tions. Figure1 outlines the workflow carried out in this study. Consistent with previous experiments showing that Camostat and Nafamostat are strong inhibitors of TMPRSS2, our computational data revealed that Camostat and Nafamostat can form stable binding interactions with TMPRSS2. Besides Camostat and Nafamostat, our computational find- ings suggest that the other 11 drugs may also function as potential TMPRSS2 inhibitors, as their guanidinobenzoyl or aminidinobenzoyl groups formed very stable salt bridges with Asp435 in the S1 pocket, occupying TMPRSS20s active site and preventing the spike from binding to TMPRSS2. This short list of promising drugs may be of great interest to biochemists and pharmacologists for further experimental tests. Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 3 of 20 Int. J. Mol. Sci. 2021, 22, 7060 3 of 20
FigureFigure 1. 1.The The workflow workflow carried carried out out in in this this study. study. The The database, database, software, software, and/or and/or web web server server used used in in each step are shown in parentheses. each step are shown in parentheses. 2.2. Results Results 2.1.2.1. TMPRSS2 TMPRSS2 Sequence Sequence and and Structural Structural Model Model TMPRSS2TMPRSS2 (UniProt (UniProt ID: ID: O15393) O15393) has has two two isoforms isoforms produced produced by by alternative alternative splicing. splicing. IsoformIsoform 1 1 (O15393-1) (O15393-1) has has been been chosen chosen as as the the canonical canonical sequence sequence and and was was used used in in this this study.study. TheThe full-length full-length TMPRSS2 TMPRSS2 isoformisoform 11 is is composed composed of of 492 492 amino amino acids, acids, containing containing twotwo topological topological domains domains (amino (amino acids acids 1-84, 1-84, cytoplasmic cytoplasmic domain domain and and 106-492, 106-492, extracellular extracellu- domain),lar domain), one transmembraneone transmembrane domain domain (amino (amino acids acids 85-105), 85-105), and oneand trypsin-likeone trypsin-like catalytic cata- domainlytic domain (amino (amino acids 256-492)acids 256-492) (Figure (Figure2a). Isoform 2a). Isoform 2 does 2 not does have not a have catalytic a catalytic domain domain and wasand excluded was excluded from thefrom analysis. the analysis. ThereThere is is no no experimental experimental structure structure available available for for TMPRSS2 TMPRSS2 and and its its domains. domains. We We used used a deep-learning contact-guided protein structure assembly approach, C-I-TASSER [23], to a deep-learning contact-guided protein structure assembly approach, C-I-TASSER [23], to model the structure of the catalytic domain of TMPRSS2 (Figure2b). The model had a model the structure of the catalytic domain of TMPRSS2 (Figure 2b). The model had a C- C-score [24] of 0.45, which corresponded to the estimated TM-score [25] of 0.89. Here, the score [24] of 0.45, which corresponded to the estimated TM-score [25] of 0.89. Here, the C- C-score was a confidence score for estimating the global quality of predicted models by C-I- score was a confidence score for estimating the global quality of predicted models by C-I- TASSER; based on large-scale benchmark tests, C-I-TASSER models with a C-score > −2.5 TASSER; based on large-scale benchmark tests, C-I-TASSER models with a C-score > −2.5 correspond to a correct fold with a TM-score > 0.5. The model also had high local structure correspond to a correct fold with a TM-score > 0.5. The model also had high local structure quality, with a MolProbity [26] score of 0.91 (Figure S2), which ranked at the 100th percentile. quality, with a MolProbity [26] score of 0.91 (Figure S2), which ranked at the 100th per- This puts the structure models amongst the best structures of a comparable solution centile. This puts the structure models amongst the best structures of a comparable solu- by comparison with a representative set of experimental structures collected from the tion by comparison with a representative set of experimental structures collected from the Protein Data Bank (PDB) [27]. Our model exhibited a very high structural similarity Protein Data Bank (PDB) [27]. Our model exhibited a very high structural similarity (e.g., (e.g., TM-score > 0.95) to the reported models [28,29] generated by homology modeling approaches.TM-score > Compared0.95) to the withreported the homology models [28,29] models generated built on by a single homology template, modeling our C-I- ap- TASSERproaches. model Compared was constructed with the byhomology considering mode thels consensus built on ofa single multiple template, templates our (PDB C-I- IDs:TASSER 7meq, model 3w94, was 4dgj, constructed and 6eso) by and, considering thus, avoided the consensus the modeling of multiple bias toward templates a single (PDB experimentalIDs: 7meq, 3w94, structure. 4dgj, and 6eso) and, thus, avoided the modeling bias toward a single experimental structure.
Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 4 of 20
Int. J. Mol. Sci. 2021, 22, 7060 4 of 20
FigureFigure 2. 2. Sequence,Sequence, topology, topology, structural structural mo model,del, and and function function of of TMPRSS2. TMPRSS2. ( (aa)) Sequence Sequence and and domain domain topology topology of of TMPRSS2 TMPRSS2 (amino(amino acids acids 1-84: 1-84: cytoplasmic cytoplasmic domain, domain, 85-105: 85-105: transmembrane transmembrane domain, domain, 106-492: 106-492: extracellular extracellular domain, domain, and 256-492: and 256-492: cata- lyticcatalytic domain). domain). (b) C-I-TASSER (b) C-I-TASSER model model of the TMPRSS2 of the TMPRSS2 catalytic catalytic domain domain (shown (shownin cyan cartoon). in cyan cartoon). The conserved, The conserved, catalytic triadcatalytic (Ser441, triad His296, (Ser441, and His296, Asp345); and oxyani Asp345);on holes oxyanion (mainchain holes (mainchain amide groups amide of Ser441 groups and of Ser441Gly439); and and Gly439); the conserved and the asparticconserved acid aspartic (Asp435) acid in (Asp435) the S1 pocket in the are S1 pocketshown arein yellow shown sticks. in yellow A tetrapepti sticks. Ade tetrapeptide (RSFI, shown (RSFI, in a shown magenta in aball-and- magenta stickball-and-stick model) extracted model) from extracted the SARS-CoV-2 from the SARS-CoV-2 S2′ cleavage S2 site0 cleavage is docked site into is docked the binding into the site binding using HPEPDOCK. site using HPEPDOCK. Hydrogen bonds and salt bridge interactions are illustrated in dashed green lines. The distance between the atom Oγ of Ser441 and Hydrogen bonds and salt bridge interactions are illustrated in dashed green lines. The distance between the atom Oγ of the carbonyl C atom of P1 arginine is illustrated in dashed black lines. The distances are shown around the lines (unit: Å). Ser441 and the carbonyl C atom of P1 arginine is illustrated in dashed black lines. The distances are shown around the lines (c) The S2′ cleavage sites of SARS-CoV and SARS-CoV-2. (d) An example of the trypsin inhibitor interaction in the S1 0 pocket.(unit: Å). (c) The S2 cleavage sites of SARS-CoV and SARS-CoV-2. (d) An example of the trypsin inhibitor interaction in the S1 pocket. AA typical typical feature feature of of trypsin trypsin or or a a trypsin-like trypsin-like protease protease is is the the deeply deeply buried buried negatively negatively chargedcharged aspartic acidacid in in the the S1 pocket,S1 pocket, which which specifically specifically recognizes recognizes the positively the positively charged chargedarginine arginine or lysine or at lysine the P1 at site the of P1 a proteinsite of a substrate. protein substrate. In TMPRSS2, In TMPRSS2, such an aspartic such an acidas- particresidue acid is residue Asp435 is (Figure Asp4352b); (Figure there 2b); is no there other is asparticno other acidaspartic residue acid inresidue the S1 in pocket. the S1 pocket.Besides, Besides, the catalytic the catalytic elements elements of TMPRSS2 of TMPRSS2 include include a well-established a well-established catalytic catalytic triad triad(Ser441–His296–Asp345) (Ser441–His296–Asp345) indicated indicated by the by hydrogen-bonding the hydrogen-bonding network network and twoand oxyaniontwo oxy- anionholes holes (i.e., (i.e., the main-chainthe main-chain amide amide groups groups of Ser441of Ser441 and and Gly439) Gly439) (Figure (Figure2b). 2b). The The idealideal configurationconfiguration of of these these catalytic catalytic elements elements al alsoso suggested suggested a a good good quality quality of of the the TMPRSS2 TMPRSS2 model.model. TMPRSS2 TMPRSS2 can can prime prime the the spike spike proteins proteins at the at theS2′ cleavage S20 cleavage site for site SARS-CoV for SARS-CoV and SARS-CoV-2and SARS-CoV-2 (Figure (Figure 2c). We2c). used We used a protein–peptide a protein–peptide docking docking tool HPEPDOCK tool HPEPDOCK [30] to [ 30 pre-] to 0 0 0 dictpredict the thebinding binding mode mode of the of the P1-P1 P1-P1′-P2-P2′-P3-P3′ tetrapeptidetetrapeptide (RSFI, (RSFI, Fi Figuregure 2c)2c) extracted extracted from SARS-CoV/SARS-CoV-2SARS-CoV/SARS-CoV-2 bound bound to thethe bindingbindingpocket pocket of of TMPRSS2. TMPRSS2. In In the the top top one one pose, pose, the theP1 arginineP1 arginine was was predicted predicted to form to bidentateform bident saltate bridge salt interactionsbridge interactions with Asp435, with whileAsp435, the whileOγ atom the O ofγ the atom catalytic of theSer441 catalytic is 3.4Ser441 Å to is the 3.4 carbonyl Å to the atom carbonyl of the atom P1 arginine of the P1(Figure arginine2b) (Figurewithin the2b) vanwithin der the Waals van contactder Waals distance contact (3.5 distance Å) that (3.5 is critical Å) that for is thecritical subsequent for the subse- bond- quentbreaking bond-breaking catalysis. The catalysis. predicted The binding predicted interactions binding interactions mimic those mimic made betweenthose made trypsin be- tweenand its trypsin natural and substrate, its natural in whichsubstrate, the in lysine which side-chain the lysine amino side-chain group amino interacts group with inter- the actsconserved with the aspartic conserved acid aspartic in the S1 acid pocket in the [31 S1] (Figure pocket2 d).[31] (Figure 2d). 2.2. Guanidinobenzoyl- or Aminidinobenzoyl-Containing Drugs 2.2. Guanidinobenzoyl- or Aminidinobenzoyl-Containing Drugs Previous studies revealed that a positively charged group that mimics arginine or Previous studies revealed that a positively charged group that mimics arginine or lysine in a natural substrate of trypsin was important for a drug acting as an inhibitor to lysine in a natural substrate of trypsin was important for a drug acting as an inhibitor to the trypsin-like TMPRSS2 [18,28,32]. Specifically, the guanidinobenzoyl and/or aminidi- the trypsin-like TMPRSS2 [18,28,32]. Specifically, the guanidinobenzoyl and/or aminidi- nobenzoyl group in Camostat or Nafamostat could form stable binding interactions with nobenzoyl group in Camostat or Nafamostat could form stable binding interactions with the conserved Asp435 in the S1 pocket in TMPRSS2 and lead to inhibition. Compared with the conserved Asp435 in the S1 pocket in TMPRSS2 and lead to inhibition. Compared with Camostat and Nafamostat, Gabexate, which contains an arginine-like side-chain, showed Camostat and Nafamostat, Gabexate, which contains an arginine-like side-chain, showed
Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 5 of 20 Int. J. Mol. Sci. 2021, 22, 7060 5 of 20
only a a weak weak inhibitory inhibitory potency potency [9,11]. [9,11 Considering]. Considering that that a drug’s a drug’s rigidity rigidity is crucial is crucial for high- for affinityhigh-affinity binding binding due dueto its to low its low conformational conformational entropy entropy effect effect [33–35], [33–35], we we preferentially preferentially considered drugs with guanidinobenzoyl or or aminidinobenzoyl rather rather than than an an arginine- arginine- or lysine-like side-chain group; the former two groups have far fewer degrees of freedom and are, hence, more rigid. We searched DrugBank for FDA-approved or investigational drugs that contain guanidinobenzoyl or guanidinobenzoyl and obtained a small library of 13 13 drugs drugs (Figure (Figure 3).3). The The drug drug name, name, DrugBank DrugBank ID, ID, regulatory regulatory status, status, and and primary primary in- dicationindication of ofthese these drugs drugs are are listed listed in inTable Table 1.1 .
Figure 3. Molecular formulas of the 1313 drugsdrugs investigatedinvestigated inin thisthis work.work. TheThe guanidinobenzoylguanidinobenzoyl andand aminidinobenzoylaminidinobenzoyl groups are highlighted in blue. Among the 13 drugs, Camostat is a serine protease inhibitor approved in Japan for Among the 13 drugs, Camostat is a serine protease inhibitor approved in Japan for the treatment of chronic pancreatitis and postoperative reflux esophagitis. Nafamostat is the treatment of chronic pancreatitis and postoperative reflux esophagitis. Nafamostat is a synthetic serine protease inhibitor approved as an anticoagulant therapy for patients a synthetic serine protease inhibitor approved as an anticoagulant therapy for patients undergoing continuous renal replacement therapy due to acute kidney injury and used for undergoing continuous renal replacement therapy due to acute kidney injury and used the treatment of acute pancreatitis in Japan. Camostat and Nafamostat were demonstrated for the treatment of acute pancreatitis in Japan. Camostat and Nafamostat were demon- to be effective TMPRSS2 inhibitors [8,9,12] and are in clinical trials for COVID-19 treatment strated(ClinicalTrials.gov to be effective (accessed TMPRSS2 on inhibitors 28 March [8,9,12] 2021) Identifier: and are in NCT04321096 clinical trials for for COVID-19 Camostat treatmentand NCT04352400 (ClinicalTrials.gov for Nafamostat). Identifier: It NCT04321096 was speculated for thatCamostat Camostat and NCT04352400 and Nafamostat for Nafamostat).are covalent TMPRSS2It was speculated inhibitors, that because Camostat their and ester Nafamostat bonds can are be covalent cleaved byTMPRSS2 serine pro- in- hibitors,teases [18 because,32,36]; their this speculation ester bonds wascan supportedbe cleaved byby theirserine low proteases nanomolar-level [18,32,36]; inhibitorythis spec- ulationbehaviors was [11 supported,18,28,32]. by In their contrast, low asnanomolar- shown inlevel Figure inhibitory3, the other behaviors drugs do [11,18,28,32]. not contain In a contrast,cleavable as ester shown bond in adjacentFigure 3, tothe the other guanidinobenzoyl drugs do not contain or aminidinobenzoyl a cleavable ester group bond andad- jacentmay only to the function guanidinobenzoyl as a noncovalent or aminidinobenzoyl TMPRSS2 inhibitor. group The and three may FDA-approved only function drugs as a
Int. J. Mol. Sci. 2021, 22, 7060 6 of 20
Pentamidine, Hexamidine, and Hydroxystilbamidine are primarily used for the treatment of pneumocystis pneumonia, acanthamoebiasis, and nonprogressive blastomycosis, respec- tively. Except for Camostat and Nafamostat, none of the other drugs have been clinically investigated for COVID-19 treatment.
Table 1. The 13 guanidinobenzoyl- or aminidinobenzoyl-containing drugs studied in this work.
DrugBank ID Drug Name Status with FDA Primary Indication DB00738 Pentamidine Approved For the treatment of pneumocystis pneumonia DB03808 Hexamidine Approved For the treatment of acanthamoebiasis DB05038 Anatibant Investigational For the treatment of traumatic brain injuries DB05476 WX-UK1 Investigational For the treatment in solid tumors DB06472 Fradafiban Investigational For the treatment in angina DB06635 Otamixaban Investigational For the treatment of thrombosis DB12120 Avoralstat Investigational For the prevention of hereditary angioedema Used as an anticoagulant in patients with DB12598 Nafamostat Investigational disseminative blood vessel coagulation, hemorrhagic lesions, and hemorrhagic tendencies For the treatment of pancreatic cancer, ductal DB13000 PCI-27483 Investigational adrenocarcinoma, and exocrine pancreatic cancer DB13296 Propamidine Investigational For the treatment of Acanthamoeba infection For the treatment of chronic pancreatitis and DB13729 Camostat Investigational drug-induced lung injury For the treatment and prevention of blood clots and DB14726 Dabigatran Investigational prevention of stroke in people with atrial fibrillation For the treatment of nonprogressive blastomycosis of DB14753 Hydroxystilbamidine Approved the skin and other mycoses Note: The table is ranked by DrugBank ID. Camostat and Nafamostat have been approved in Japan but have not been approved by the FDA.
2.3. Molecular Docking Suggests Salt Bridge Interactions between Guanidinobenzoyl or Aminidinobenzoyl and Asp435 Each of the 13 drugs was docked into the putative binding pocket of TMPRSS2 using LeDock [37], as a previous comprehensive evaluation of ten docking programs on a diverse set of protein–ligand complexes suggested that LeDock had the best sampling power [38], which is important for predicting the correct binding mode. The docked poses were clustered with an RMSD cutoff of 2 Å, and a maximum of 20 cluster center poses were saved for analysis. Different numbers of clustered poses were generated for distinct TMPRSS2 drug systems (Table2 and Table S1).
Table 2. The best LeDock-binding score and the best EvoEF2 reranking score for each drug.
Best Score DrugBank ID Drug Name Number of Poses by LeDock LeDock (kcal/mol) EvoEF2 (EEU) DB00738 Pentamidine 12 −8.73 −37.36 DB03808 Hexamidine 15 −8.47 −35.08 DB05038 Anatibant 18 −8.60 −34.21 DB05476 WX-UK1 17 −8.57 −39.29 DB06472 Fradafiban 8 −7.32 −30.48 DB06635 Otamixaban 17 −8.50 −40.43 DB12120 Avoralstat 7 −9.35 −46.30 DB12598 Nafamostat 6 −8.61 −34.70 DB13000 PCI-27483 16 −9.44 −36.84 DB13296 Propamidine 14 −7.93 −32.48 DB13729 Camostat 13 −8.02 −27.02 DB14726 Dabigatran 15 −9.51 −42.58 DB14753 Hydroxystilbamidine 4 −7.56 −32.56 Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 7 of 20
DB13296 Propamidine 14 −7.93 −32.48 DB13729 Camostat 13 −8.02 −27.02 DB14726 Dabigatran 15 −9.51 −42.58 Int. J. Mol. Sci. 2021, 22, 7060 7 of 20 DB14753 Hydroxystilbamidine 4 −7.56 −32.56
TheThe binding binding energy energy for forall the all theposes poses was was rescored rescored using using our our physics-based physics-based energy energy functionfunction EvoEF2 EvoEF2 [39], [39 because], because the thedefault default LeDock LeDock score score function function may may tolerate tolerate severe severe inter- inter- molecularmolecular steric steric clashes clashes in a in few a few top-ranked top-ranked poses, poses, which which achieved achieved high high EvoEF2-binding EvoEF2-binding scoresscores (Table (Table S1). S1). For For instance, instance, for for the the drug drug Anatibant, Anatibant, the the firstfirst andand 10th10th poses poses had had LeDock- Le- Dock-bindingbinding scores scores of of− 8.60−8.60 and and− −7.167.16 kcal/mol,kcal/mol, respectively; however, however, they they were were scored scored at at −−10.3410.34 and and −34.21−34.21 EEU EEU (EvoEF2 (EvoEF2 energy energy units) units) and and reranked reranked as the as the 17th 17th and and first first best best posesposes by the by theEvoEF2 EvoEF2 score. score. Consistent Consistent with with the theexperimental experimental data data that that Nafamostat Nafamostat acts acts as as a strongera stronger TMPRSS2 TMPRSS2 inhibitor inhibitor than than Camostat Camostat [9,11,12,18], [9,11,12,18 we], wefound found that that the thetop topposes poses of of NafamostatNafamostat obtained obtained more more favorable favorable (i.e., (i.e., lower/more lower/more negative) negative) binding binding scores scores than than those those of Camostatof Camostat by both by both LeDock LeDock and and EvoEF2 EvoEF2 (Tables (Table 2 2and and S1). Table The S1). LeDock The LeDockscores for scores both for drugsboth were drugs not were much not different: much different: −8.02 kcal/mol−8.02 kcal/mol for Camostat for Camostat and −8.61 and kcal/mol−8.61 kcal/molfor Nafa- for mostat.Nafamostat. However, However, their EvoEF2 their EvoEF2 scores were scores much-better were much-better distinguished: distinguished: −27.02 −EEU27.02 for EEU Camostatfor Camostat and −34.70 and EEU−34.70 for EEUNafamostat. for Nafamostat. Compared Compared with Nafamostat, with Nafamostat, some other some drugs, other suchdrugs, as Pentamidine, such as Pentamidine, Hexamidine, Hexamidine, WX-UK1, WX-UK1, Otamixaban, Otamixaban, Avoralstat, Avoralstat, PCI-27483, PCI-27483, and Dabigatran,and Dabigatran, had even had more even favorable more favorable best LeDock best LeDockor EvoEF2-binding or EvoEF2-binding scores (Tables scores 2 (Table and 2 S1),and indicating Table S1), that indicating they may that also they be potent may also inhibitors be potent of TMPRSS2. inhibitors of TMPRSS2. TheThe molecular molecular docking docking results results indicated indicated that all that the all drugs the drugshad at hadleast at one least pose one that pose couldthat form could salt form bridge salt interactions bridge interactions with the withnegatively the negatively charged Asp435 charged (Table Asp435 S1). (Table Note S1). thatNote Anatibant that Anatibant had only hadone onlypose one(i.e., pose pose (i.e., 10) involved pose 10) involvedin the putative in the salt putative bridge salt inter- bridge actions,interactions, which was which reranked was reranked as the asbest the pose best by pose EvoEF2, by EvoEF2, with withthe lowest the lowest energy energy score score (Table(Table S1). S1). The The top top one one pose pose with with the thelowest lowest EvoEF2 EvoEF2 score score is shown is shown in Figure in Figure 4 for4 for each each of theof thedrugs. drugs. All Allthe thedrugs drugs are arewell-docked well-docked into into the thebinding binding pocket pocket of TMPRSS2, of TMPRSS2, with with theirtheir guanidinobenzoyl guanidinobenzoyl or oraminidinobenzoyl aminidinobenzoyl gr groupsoups aligning aligning in in the the S1 S1 pocket pocket and and form- forming ing saltsalt bridgebridge interactionsinteractions withwith thethe side-chainside-chain carboxylcarboxyl groupgroup ofof Asp435Asp435 (see(see alsoalso TableTable S1), S1),supporting supporting favorable favorable binding binding scores. scores.
FigureFigure 4. Superposition 4. Superposition and andcomparison comparison of the of ligand the ligand poses poses with withthe lowest the lowest EvoEF2 EvoEF2 scores scores for all for 13 all drugs. 13 drugs. TMPRSS2 TMPRSS2 is shown in the green cartoon model, with residue Asp435 depicted in the yellow ball-and-stick model. The zoom-in inset is shown in the green cartoon model, with residue Asp435 depicted in the yellow ball-and-stick model. The zoom-in inset shows that guanidinobenzoyl or aminidinobenzoyl can form salt bridge interactions with the Asp435 carboxyl group (shown in the dashed box).
Among these putative TMPRSS2 inhibitors, the binding modes of Nafamostat and Camostat have been extensively studied. We found that Nafamostat could form salt bridge Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 8 of 20
shows that guanidinobenzoyl or aminidinobenzoyl can form salt bridge interactions with the Asp435 carboxyl group (shown in the dashed box). Int. J. Mol. Sci. 2021, 22, 7060 8 of 20 Among these putative TMPRSS2 inhibitors, the binding modes of Nafamostat and Camostat have been extensively studied. We found that Nafamostat could form salt bridgeinteractions interactions with Asp435 with Asp435 with either with itseither aminidinobenzoyl its aminidinobenzoyl or guanidinobenzoyl or guanidinobenzoyl group group(Figure (Figure5a,b), which 5a,b), waswhich consistent was consistent with the with “reverse” the “reverse” and “forward” and “forward” binding modes binding of modesNafamostat of Nafamostat described described by Hempel by etHempel al. [18]. et In al both. [18]. cases, In both the cases, other the positively other positively charged chargedgroup could group form could hydrogen-bonding form hydrogen-bonding interactions interactions with the with main-chain the main-chain carbonyl carbonyl groups groups(Figure 5(Figurea,b). The 5a,b). binding The binding mode mode of Camostat of Camostat was similarwas similar to the to “forward”the “forward” mode mode of ofNafamostat; Nafamostat; the the noncharged noncharged terminal terminal of Camostat of Camostat was alignedwas aligned into a into hydrophobic a hydrophobic pocket pocketenveloped enveloped by Val280, by Val280, Cys297, Cys297, and Pro301 and (Figure Pro3015 c).(Figure This binding5c). This pattern binding was pattern similar was to similarthose reported to those inreported previous in studiesprevious [18 studies,32]. [18,32]. AA previous previous study study reported reported a a few few weak weak TMPRSS2 TMPRSS2 inhibitors inhibitors without without a a guanidinoben- guanidinoben- zoylzoyl or or aminidinobenzoyl aminidinobenzoyl group, group, including including Bromohexine Bromohexine (PubChem (PubChem CID2442), CID2442), 0591-5329 0591-5329 (CID765269),(CID765269), 4401-00774401-0077 (CID2882138), (CID2882138), 4554-5138 4554-5138 (CID5395514), (CID5395514), and 8008-1235 and (CID693919),8008-1235 (CID693919),with an IC50 ofwith 0.75, an 0.93, IC50 2.68,of 0.75, 1.37, 0.93, and 2.68, 2.64 1.37,µM, respectively,and 2.64 µM, for respectively, inhibitingTMPRSS2 for inhibiting [19]. TMPRSS2Therefore, [19]. these Therefore, compounds these could compounds be used ascoul controld be used molecules as control to examine molecules the to 13 exam- target inedrugs. the 13 The target chemical drugs. formulas The chemical of these formulas inhibitors of arethese shown inhibitors in Figure are shown S3. These in Figure molecules S3. Thesewere alsomolecules docked were to the also TMPRSS2 docked model to the using TMPRSS2 LeDock model following using the LeDock same procedure.following the As sameshown procedure. in Table S1, As the shown best dockedin Table poses S1, the of be thesest docked control poses molecules of these in generalcontrol molecules had much inhigher general LeDock had much and EvoEF2 higher LeDock scores than and the EvoEF2 13 proposed scores than drugs. the 13 proposed drugs.
Figure 5. Putative binding modes of Nafamostat (a,b) and Camostat (c). Ligands, catalytic triad residues, binding residues Figure 5. Putative binding modes of Nafamostat (a,b) and Camostat (c). Ligands, catalytic triad residues, binding residues in the S1 pocket, and other important binding residues outside of the S1 pocket are shown in cyan, green, yellow, and in the S1 pocket, and other important binding residues outside of the S1 pocket are shown in cyan, green, yellow, and magenta sticks, respectively. Salt bridges and hydrogen bonds are shown in green dashed lines. Poses 1 and 3 of Nafa- mostatmagenta represent sticks, respectively. the “forward” Salt and bridges “reverse” and hydrogen binding bondsmodes, are respectively. shown in green dashed lines. Poses 1 and 3 of Nafamostat represent the “forward” and “reverse” binding modes, respectively.
Int. J. Mol. Sci. 2021, 22, 7060 9 of 20
2.4. MD Simulations Reveal High Stability of the Putative Salt Bridge Interactions Docking revealed potentially strong binding between TMPRSS2 and the investigated drugs, but docking cannot tell to what extent the binding interactions will be stable for a duration of time. Besides, since the ligands were forcefully docked into the binding pocket, a pose may adopt a highly constrained conformation that may not be stable in reality. To overcome the limitations of molecular docking, we carried out MD simulations to examine the binding stability between TMPRSS2 and the drugs. Before MD, the top ten poses (if they existed) ranked by EvoEF2 for each drug were parameterized using the ACPYPE [40] program with the AM1-BCC [41,42] charge model. Note that pose 4 for drug WX-UK1 and poses 6, 7, 10, 13, and 14 for drug PCI-27483 failed to be parameterized be- cause of severe intramolecular steric clashes. The number of poses that can be successfully applied to MD for the 13 drugs were 10 (DB00738), 10 (DB03808), 10 (DB05038), 9 (DB05476), 8 (DB06472), 10 (DB06635), 7 (DB12120), 6 (DB12598), 5 (DB13000), 10 (DB13296), 10 (DB13729), 10 (DB14726), and 4 (DB14753), respectively (Tables S2 and S3 and Figures S4–S7). TMPRSS2 in complex with each suitable drug pose was subjected to a long-time (100 ns) MD simulation using GROMACS v2020.4 [43]. We expected that a ligand pose that formed stable binding with TMPRSS2 should have very limited movement most of the time, e.g., within the vicinity of the original position. We observed that, for all the protein–ligand complexes and the apo-form TMPRSS2, the protein approached equilibrium very quickly, with an RMSD of about 2 to 3 Å (Figure S4), indicating that the TMPRSS2 catalytic domain is quite stable with or without a ligand. In most cases, ligand binding could induce a slightly lower protein RMSD (Figure S4), suggesting that protein–ligand interactions may further enhance the protein’s stability. The root mean square fluctuations (RMSFs) were, in general, not more than 3 Å for the nonterminal amino acids (Figure S5), suggesting a high rigidity of the protein. The RMSF profiles did not change much with or without ligand binding (Figure S5). Only a handful of residues were shown to have higher flexibility (e.g., > 3 Å), including Met320, Phe321, Phe357, Lys390, Asn450, and the C-terminal Gly492. All of the flexible residues were located on the loop regions that are distant from the binding pocket. Therefore, we reasoned that their flexibility would not have a great influence on ligand binding. Each of the 13 ligands starting from different poses had, in general, much larger RMSD fluctuations compared with that of the protein (Figure S6), indicating that the ligands are more mobile. To quantify the mobility of the ligands, we calculated the mean and median RMSD for each ligand pose across the whole MD process (Table S2). Most of the drugs had one or more poses with a relatively low mean and median RMSD, e.g., <3.5 Å in the 100-ns dynamics process (Table3). Pose 4 of Nafamostat had the lowest median RMSD of 2.1 Å, suggesting that the pose was bound to TMPRSS2 in an extremely stable manner for at least half of the simulation time. Pose 4 of Camostat also achieved a stable binding with a median RMSD of 2.9 Å. Most drugs exhibited a lower mobility than Camostat in terms of the mean and median RMSDs, suggesting a stabilized binding pose (Table3). Note that pose 3 of Otamixaban and pose 1 of Propamidine also achieved a low median RMSD of 2.1 Å, the same as pose 4 of Nafamostat. Only two drugs, PCI-27483 (DB13000) and Dabigatran (DB14726), exhibited a median RMSD of >3.5 Å for all the poses investigated (Table S2). Surprisingly, four control molecules (CID2442, CID2882138, CID5395514, and CID765269) also had at least one stable pose, with mean and median RMSDs not more than 3.5 Å (Table S2). Therefore, the ligand RMSD alone was not sufficiently reliable to distinguish the strong inhibitors (e.g., Nafamostat and Camostat) and weak binders (e.g., the control compounds). Int. J. Mol. Sci. 2021, 22, 7060 10 of 20
Table 3. The drug poses with both mean and median ligand RMSDs below 3.5 Å.
Ligand RMSD (Å) Drug Name DrugBank ID_pose Mean ± std Median Pentamidine DB00738_05 3.2 ± 1.0 3.1 DB00738_10 2.7 ± 1.6 2.2 Hexamidine DB03808_03 3.3 ± 1.4 2.7 Anatibant DB05038_10 3.3 ± 0.6 3.1 WX-UK1 DB05476_01 2.7 ± 0.3 2.6 DB05476_15 2.8 ± 0.3 2.7 Fradafiban DB06472_03 3.2 ± 1.4 2.9 Otamixaban DB06635_02 3.1 ± 0.4 3.1 DB06635_03 2.5 ± 1.6 2.1 DB06635_04 2.9 ± 0.7 2.7 DB06635_05 2.6 ± 0.5 2.7 Avoralstat DB12120_04 2.9 ± 0.6 2.8 DB12120_05 2.9 ± 0.7 2.7 Nafamostat DB12598_01 3.3 ± 0.7 3.2 DB12598_04 2.1 ± 0.5 2.1 DB12598_05 2.8 ± 0.7 2.6 DB12598_06 3.3 ± 0.7 3.4 Propamidine DB13296_01 2.3 ± 0.7 2.1 DB13296_06 2.3 ± 0.7 2.2 DB13296_09 2.5 ± 0.6 2.4 DB13296_13 3.5 ± 2.4 2.3 Camostat DB13729_04 3.1 ± 0.6 2.9 Hydroxystilbamidine DB14753_02 2.9 ± 1.0 3.0 DB14753_04 2.6 ± 0.7 2.4
According to docking models, the guanidinobenzoyl or aminidinobenzoyl groups were docked into the deep S1 pocket and formed salt bridge interactions with Asp435, while the other parts of the ligands were accessible to the bulk solvent. Therefore, the large ligand RMSDs could be partly due to the swing of the non-buried portion. We further examined the stability of the putative salt bridge interactions by measuring the minimum distance between the positively charged guanidinobenzoyl or aminidinobenzoyl group min and the negatively charged carboxyl group of Asp435 (denoted as dON ); only the distances between the nitrogen and oxygen atoms were calculated. We carried out this analysis, because a large RMSD of the whole ligand did not necessarily mean the salt bridges were min broken. All the drugs had at least one pose with a mean and median dON fluctuating around 2.8 Å, an ideal salt bridge distance, with small deviations (Figure6 and Figure S7 and Table S3). Therefore, a long-time MD simulation indicated the high stability of the putative salt bridge interactions between the guanidinobenzoyl or aminidinobenzoyl group and Asp435, which should be important for TMPRSS2 inhibition. We also calculated the minimum distance between their nonhydrogen atoms and the carboxyl group of Asp435 min (denoted as dOD1/OD2) for the control compounds (Table S3 and Figure S7). As shown in min Table S3, almost all the poses of these compounds had a long dOD1/OD2 distance, suggesting that these molecules might not be deeply docked in the S1 pocket, which may partly explain their weak inhibition of TMPRSS2. Only pose 1 of compound CID5395514 exhibited a short min mean and median dOD1/OD2 distance (~2.6 Å) to Asp435 (Figure S7), which was because the phenolic hydroxyl group of CID5395514 formed a hydrogen bond with the carboxyl group of Asp435. It should be noted that the poses with the lowest RMSD values and the smallest min deviations of the dON distance were not always those with the best LeDock and/or EvoEF2 scores. In fact, only pose 10 of DB05038 and pose 1 of DB05476 satisfied this description. However, most of the poses with the lowest RMSD values and the smallest deviations of min the dON distance were ranked in the top 3, and all such poses were always ranked in the Int. J. Mol. Sci. 2021, 22, x FOR PEER REVIEW 11 of 20
Int. J. Mol. Sci. 2021, 22, 7060 11 of 20