<<

SWISS CHEMISTS IN THE USA CHIMIA 2009, 63, No. 11 737

doi:10.2533/chimia.2009.737 Chimia 63 (2009) 737–744 © Schweizerische Chemische Gesellschaft Engineering to Phosphorylate Nucleoside Analogs for Antiviral and Cancer Therapy

Stefan Lutz*, Lingfeng Liu, and Yichen Liu

Abstract: engineering by directed evolution presents a powerful strategy for tailoring the function and physicochemical properties of biocatalysts to therapeutic and industrial applications. Our laboratory’s research focuses on developing novel molecular tools for protein engineering, as well as on utilizing these methods to cus- tomize and to study fundamental aspects of their structure and function. Specifically, we are interested in nucleoside and nucleotide kinases which are responsible for the intracellular phosphorylation of nucleoside analog (NA) prodrugs to their biologically active triphosphates. The high specificity of the cellular ki- nases often interferes with prodrug activation and consequently lowers the potency of NAs as antiviral and cancer therapeutics. A working solution to the problem is the co-adminstration of a promiscuous from viruses, bacteria, and other mammals. However, further therapeutic enhancements of NAs depend on the selective and efficient prodrug phosphorylation. In the absence of true NA kinases in nature, we are pursuing laboratory evolu- tion strategies to generate efficient phosphoryl-transfer catalysts. This review summarizes some of our recent work in the field and outlines future challenges. Keywords: Deoxynucleoside kinase · Directed evolution · Enzyme · Nucleoside analogs · Protein engineering

Stefan Lutz holds a Introduction blocks of RNA and DNA, yet they carry BSc degree from the distinct chemical substitutions in the ri- Zurich University Enzymes are remarkable catalysts, capable bose and nucleobase moieties. To exhibit of Applied Sciences of accelerating chemical reactions by up to their biological function, NAs depend on (Switzerland), and 13 orders of magnitude while exhibiting intracellular phosphorylation to their cor- a MSc degree from generally high selectivity and specificity responding triphosphates via the host’s the University of for their substrates under environmentally nucleoside salvage pathway (Fig. 2). At Teesside (UK). He benign reaction conditions. One of the first the core of the salvage pathway is a cas- obtained his PhD attempts to rationalize the observed func- cade of three kinases, deoxynucleoside from the University tional capabilities of enzymes was Emil kinases (dNK, e.g. ), of Florida and spent three years as a post- Fischer’s ‘lock-and-key’ model[1] which, deoxynucleoside monophosphate kinases doctoral researcher with Stephen Benkovic despite being overly simplistic, captures (dNMPK, e.g. thymidylate kinase), and at Pennsylvania State University under a some of the basic ideas of enzyme cataly- nucleoside diphosphate kinase (NDPK), fellowship of the Swiss National Science sis. It also helps to account for some of the to recycle scavenged 2’-deoxynucleo- Foundation. Since 2002 he has been a limitations encountered by scientists and sides and to activate intermediates of the chemistry professor at Emory University engineers as they try to capitalize on the de novo nucleotide biosynthetic pathway. in Atlanta, Georgia (USA). The research distinct performance of enzymes for ap- Following cellular uptake via membrane in the Lutz laboratory focuses on the plications in the laboratory and industry. transporters, the prodrugs have to ‘borrow’ structure-function relationship of proteins Many of the synthetic applications envi- these kinases to become activated. Once through combinatorial protein engineer- sioned by researchers involve modified and in their triphosphate state, the NA anabo- ing. unnatural substrates, as well as reaction lites then turn into competitive substrates conditions that significantly diverge from for low-fidelity and reverse the physiological environment in regards transcriptases found in cancer cells and to pH, temperature, and solvent composi- viruses, respectively. The incorporation of tion. In practice, these novel requirements NA-triphosphates results in termination of often translate into compromised catalyst the DNA replication process, preventing performance as the new conditions inter- further disease proliferation. The replica- fere with the complementarity of enzyme tion machinery of normal, healthy cells on and substrate. In terms of Fischer’s model, the contrary has higher fidelity, protecting the key no longer fits the lock. the host from the lethal effects of these sui- *Correspondence: Prof. Dr. S. Lutz A perfect example for such a lock-key cide substrates. Emory University Department of Chemistry mismatch is the application of nucleo- Problems with the concept arise as the 1515 Dickey Drive side analog (NA) prodrugs in antiviral intracellular activation of NAs is often Atlanta, GA 30322 and cancer therapy. As shown in Fig. 1, compromised by the high substrate speci- Tel.: +1 404 712 2170 NAs closely resemble the natural ribo and ficity of the host’s endogenous kinases. Fax: +1 404 727 6586 E-mail: [email protected] 2’-deoxyribonucleosides, the building The inefficient phosphorylation of NAs not 738 CHIMIA 2009, 63, No. 11 SWISS CHEMISTS IN THE USA

Fig. 1. Overview of natural 2’-deoxynucleosides and selected nucleoside analogs. Thymidine (T, 1), 2’-deoxycytidine (dC, 2), 2’-deoxyadenosine (dA, 3), 2’-deoxyguanosine (dG, 4), 3’-azido-3’-deoxythymidine (AZT, 5), 2’,3’-dideoxy-5-fluoro-3’-thiacytidine (FTC, 6), ganciclovir (7), 3-deoxy- thymidine (ddT, 8). Grey-shaded areas are marking structural and functional modifications in the prodrugs. only reduces the potency of existing pro- drugs but can result in the accumulation of cytotoxic reaction intermediates.[2–4] More importantly, it is responsible for the fail- ure of a large number of potential NA pro- drug candidates in vivo.[5] One promising solution to overcome the shortcomings of the phosphorylation cascade has been the co-administration of prodrugs with exog- enous, broad-specificity kinases.[6–9] While biochemical and pre-clinical experiments have demonstrated the effectiveness of the strategy in principle, the studies also un- Fig. 2. Nucleoside salvage pathway for the production of covered limitations arising from the dNKs’ 2’-deoxynucleoside triphosphates. Scavenged 2’-deoxynucleosides significantly lower activity for NAs com- are imported by designated membrane transporter proteins and pared to their performance with the native serve as substrate for dNK. Alternatively, cleavage of the nucleobase substrates. In addition, the broad substrate by phosphorylase feds into other pyrimidine and purine metabolism specificity of exogenous kinases raised pathways. Separately, thymidylate synthase and ribonucleotide concerns as it interferes with the tightly reg- reductase catalyze the fi nal reactions of the nucleotide de novo pathways, producing TMP and dNDPs which are converted to their ulated 2’-deoxynucleoside metabolism.[10] triphosphates by dNMPK and NDPK. dNK = 2’-deoxynucleoside More recent studies have hence focused on kinase, dNMPK = 2’-deoxynucleoside monophosphate kinase, NDPK = identifying orthogonal NA kinases for the nucleoside diphosphate kinase selective and efficient activation of these prodrugs. Besides searching for orthogonal NA kinases in nature, enzyme engineering by by selection or screening assays.[17] Prob- approach, applying the tools of molecular rational design and directed evolution pres- ably the most common selection strategy biology, biochemistry and molecular evo- ents a promising alternative. Directed evo- is genetic complementation of auxotrophic lution in combination with synthetic or- lution in particular has given scientists and host strains which can process libraries of ganic chemistry and computational protein engineers a powerful tool to explore the in- 106–107 members. Alternatively, screen- design. This strategy has formed the foun- ner workings of proteins and enzymes, as ing protocols are traditionally based on dation for the creation of unique kinase well as to tailor native biocatalysts to novel LB-agar or microtiter plate assays and libraries and the development of a novel substrates and function.[11–16] The process can typically be used to analyze up to 104 screening protocol to identify promising mimics Darwinian evolution, applying iter- library members. Extending the limit on kinase candidates. ative cycles of diversification and selection library size, newer screening methods uti- to stepwise modify an existing protein to- lizing fluorescence-activated cell sorting wards progeny with the desired properties. (FACS) and in vitro compartmentalization Cellular Kinases, Exogenous In practice, one or more ‘parental’ genes have been developed for protein libraries Kinases and Substrate Specificity that encode for target proteins are forced with up to 108 members.[18,19] Candidates to diversify by saturation and random mu- that excel under the chosen conditions are The enzyme engineering of kinases in tagenesis, as well as in vitro recombination isolated and can either serve as ‘parents’ the salvage pathway has largely focused methods, creating gene libraries of typi- for the next round of evolution or undergo on the first phosphorylation step catalyzed cally 103–1010 members. Following over- detailed characterization. by dNKs (Fig. 2). These enzymes exhibit expression in a suitable host organism or For the creation of effective NA kinas- generally high specificity towards natu- in vitro, the corresponding protein libraries es from natural nucleoside and nucleotide ral 2’-deoxyribonucleosides and possess are evaluated for the desired property (e.g. kinases by directed evolution, our labora- distinct yet complementary preferences substrate specificity, thermostability etc.) tory is pursuing a highly interdisciplinary for the individual nucleobases (Fig. 3). In SWISS CHEMISTS IN THE USA CHIMIA 2009, 63, No. 11 739

type-II type-I

human Drosophila melanogaster Thermotogamaritima human human human thymidine 2’-deoxynucleoside thymidine kinase 2’-deoxyguanosine kinase thymidine kinase 2 2’- kinase 1 kinase (PDB: 2qq0) (PDB: 1jag) (model) (PDB: 1p61) (PDB: 1xbt) (PDB: 1ot3)

TAGC TAGC TAGC TAGC TAGC TAGC

Fig. 3. Summary of structure and substrate specificity of the 2’-deoxynucleoside kinase family. Members are divided into type-I and type-II subfamilies based on their distinct structural features. Representative protein structures, shown as grey ribbons, are based on crystal structure coordinates except for the homology-based model of human TK2. Where available, substrate occupying the phosphoryl-acceptor is shown as black spheres while substrate binding in the phosphoryl-donor site is marked by white spheres. Substrate profiles are displayed beneath the structures, refl ecting each enzyme’s catalytic efficiency for the four natural 2’-deoxynucleosides (T, dA, dG, dC) by their font sizes.

humans, two of the dNKs (2’-deoxycyti- in the ‘close’ state is catalytically inactive. sively recognizes purine nucleosides as dine kinase (dCK) and thymidine kinase Although it remains unclear whether the phosphoryl acceptors while TK2 phos- 1(TK1)) are cytoplasmic kinases while conformational transition is forced by ATP phorylates only pyrimidine nucleosides. thymidine kinase 2 (TK2) and 2’-deoxy- binding or occurs spontaneously due to Even broader specificity can be found in guanosine kinase (dGK) are expressed in protein dynamics, hence creating a suitable dCK which activates 2’-deoxyribose car- the mitochondria. Although all four en- binding site for ATP, our activity measure- rying cytidine, adenine and guanine moi- zymes have distinct substrate preferences, ments as a function of temperature support eties while 2’-deoxynucleoside kinase the two enzyme pairs complement each the latter hypothesis. A discontinuity in from Drosophila melanogaster (DmdNK) other to ensure phosphorylation of the four the corresponding Arrhenius plot, as well effectively phosphorylates all four natural natural DNA building blocks. as characteristic changes in the turnover 2’-deoxynucleosides, although with a pref- Crystal structures of several dNKs rates are consistent with an ATP-indepen- erence for pyrimidines.[25] The substrate show these enzymes to belong to two dent conformational change. Furthermore, adaptability of type-I dNKs has made subfamilies; type-I dNKs with a parallel our results support the idea that the ob- members of this subfamily the target of β-sheet in the core, flanked by multiple served conformational changes in TmTK the majority of past and present enzyme helices on both sides and a short LID re- are physiologically relevant and could be engineering studies. gion over the , as well as the involved in regulating its enzyme activ- smaller type-II dNKs with a central par- ity. Based on TmTK’s considerable struc- allel β-sheet, flanked by three helices. tural similarity with other type-II dNKs, Probing Substrate Specificity in An extended lasso region, which serves we anticipate similar behavior across the Kinases by Mutagenesis as lid over the active site, is held in place entire subfamily including human TK1. by a ligated zinc atom. We have explored In regards to NA activation, type-II dNKs The diverse substrate profiles of type- some of the fundamental functional fea- play an important role in phosphorylating I kinase family members in respect to in- tures on the type-II thymidine kinase the two anti-HIV prodrugs 3’-azido-3’-de- dividual nucleobases presented an initial from the hyperthermophilic eubacterium oxythymidine (AZT; 5) and 2’,3’-dideoxy- focus for our enzyme engineering studies. Thermotoga maritima (TmTK).[20–22] 2’,3’-didehydro-thymidine (d4T), as well While crystal structures of multiple family The Thermotoga enzyme is a close struc- as various thymidine derivatives used for members with different pyrimidine and pu- tural and functional homolog of the hu- boron neutron-capture therapy.[23,24] Nev- rine nucleosides, as well as nucleoside ana- man TK1 and, due to its thermal stability, ertheless, the subfamily’s strict specificity logs have clearly established that all sub- serves as a (literally) robust model for the for thymine and uracil-nucleosides, as well strates bind with superimposable orienta- latter. Biochemical studies, as well as a as the substrate’s direct interactions with tion in the active site, an assessment of the series of crystal structures of TmTK with the protein backbone, has complicated individual enzyme–substrate interactions bound substrate(s) and products revealed a enzyme engineering and so far limited its has been complicated by the generally low large conformational change in the native potential in therapeutic applications. protein sequence identity in the subfamily homotetramer associated with ATP bind- Members of the type-I dNK subfamily (30–50%). Nevertheless, earlier random ing in the phosphoryl donor site. This 5Å- on the contrary show great versatility in re- mutagenesis experiments with DmdNK, as lateral expansion of the protein complex gards to substrate specificity, even though well as crystallographic studies of human from its ‘close’ to ‘open’ conformation is they all share a common protein scaffold[2] dCK had identified the same three residues critical to function as the tetramer locked (Fig. 3). For example, human dGK exclu- in both enzymes, namely Ala100/Arg104/ 740 CHIMIA 2009, 63, No. 11 SWISS CHEMISTS IN THE USA

A. B. vance of these two positions for thymidine kinase activity. Subsequent reverse engi- neering further probed the role of position 100, identifying it as a lesser contributor to activity gains, and determined the oth- er mutations to be irrelevant for catalytic Val Ala performance. Interestingly, data from the Arg 100 84 kinetic analysis of the reverse engineered Asp 128 Arg epTK6A indicated a much more balanced 133 Ala 105 110 substrate profile compared to previously Glu characterized rTK3, showing improved 53 Met thymidine phosphorylation by >400-fold Arg Glu 88 but wild type-like activity for the remain- 104 52 ing DNA precursors (Table 1). The broader specificity of epTK6A was shown to ex- tend beyond the natural substrates as re- Fig. 4. Active site comparison of A) human 2’-deoxycytidine kinase (PDB code: 1P6O)[27] and flected in substantially increased activity B) 2’-deoxynucleoside kinase from Drosophila melanogaster (PDB code: 1J90).[26] Both enzymes for several NAs. The ease to create such a carry 2’-deoxycytidine in the phosphoryl acceptor binding site. Across the type-I kinase family, broad-specificity dNK is intriguing, espe- Arg128 (Arg105) and Glu53 (Glu52) are highly conserved as they form critical hydrogen bonding cially if we consider recent hypotheses for interactions with the 5’-hydroxyl group of the substrate. In dCK, the hydrogen bonding network is enzyme evolution via ‘generalist’ interme- extended to include Arg104 and Asp133 which in turn interacts with the exocyclic amino group on diates.[30] the substrate’s nucleobase. In contrast, the same residues in DmdNK form a large binding pocket Given the strong evidence for the cen- which lacks specific contacts and could explain the broader substrate specificity of the fruitfly tral role of amino acids 104 and 133 in thy- enzyme. Figures adapted from ref. [28]; created in PyMol (DeLano, Palo Alto, CA). midine kinase activity, we next performed simultaneous site-saturation mutagenesis Table 1. Catalytic performance of dCK variants in the two positions in search of alternate amino acid pairs. While selection of the WT dCK rTK3 epTK6A ssTK1A ssTK2A library in our auxotrophic E. coli strain Ala100 Val100 ... Val100 Val100 yielded over a dozen functional variants, Arg104 Met104 Gln104 Met104 Met104 DNA sequence analysis showed that sub- Asp133 Ala133 Gly133 Ser133 Thr133 stitutions in position 104 were dominated by Met with Gln being the less frequent T 0.095 105 (+1105) 39 (+410) 342 (+3600) 406 (+4273) alternate. Mutations in position 133 were dC 63 777 (+12) 356 (+6) 400 (+6.3) 50 (–1.2) more diverse, showing Thr, Ser, and Asn as the most common substitutions. In dA 38 6 (–6) 8 (–5) 4 (–10) 2 (–20) combination with Ala100Val, the kinetic dG 21 0.23 (–100) 19 (1) 2 (–10) 2 (–10) analysis of the two most frequent mu- tants Arg104Met/Asp133Ser (ssTK1A) The k /K values of the parental enzyme, as well as individual variants are listed. The fold change cat M and Arg104Met/Asp133Thr (ssTK2A) re- relative to dCK is shown in parentheses. Data were adapted from ref. [28]. vealed a ~4000-fold enhanced thymidine kinase activity (Table 1). The additional gain in performance over rTK3 likely re- Asp133 in dCK (corresponding to Val84/ as reflected by a raise in catalytic efficien- sults from favorable hydrogen-bonding in- Met88/Ala110 in DmdNK), as being criti- cy (k /K ) by over 1000-fold (Table 1).[28] cat M teractions between the Ser/Thr side chains cal for thymidine discrimination in the The mutations also benefit the phosphory- with the carbonyl oxygen in the 4-position human kinase[26,27] (Fig. 4). The elaborate lation of 2’-deoxycytidine (dC) while the of thymine. More so, the 8-fold decline in hydrogen-bonding network in dCK allows activation of purine nucleosides shows a dC activity as a consequence of the extra for favorable binding interaction between substantial drop. These findings prompted methyl group in Thr133 (ssTK2A) was the exocyclic amino group of the substrate the question whether i) Ala100Val, Arg- rationalized by unfavorable steric interac- and Asp133, a contact that would be lost 104Met and Glu133Ala were the only mu- tions between the side chain and the sub- upon binding of thymidine. Furthermore, tations leading to thymidine kinase activi- strate’s exocyclic amino group. In sum- the network also preorients the Arg104 ty, and ii) there are other positions through- mary, we successfully applied a variety of side chain in a conformation which creates out the protein which, upon mutagenesis, enzyme engineering methods to identify a steric clash with the 5-methyl group of could result in a similar change in substrate and verify two key positions in human thymine. In contrast, Val84/Met88/Ala110 specificity. To answer these questions, we dCK, responsible for the phosphoryla- in DmdNK lack these hydrogen-bonding used site-saturation and whole-gene ran- tion of thymidine. Exploring a number of interactions and instead create a large, dom mutagenesis of dCK in combination amino acids pairs in these positions, the re- generic hydrophobic binding pocket, ca- with functional selection of library mem- sulting kinetic data further confirmed our pable of accommodating pyrimidines and bers in the thymidine kinase-deficient E. ability to exert control over the pyrimidine purines alike. coli auxotroph KY895.[29] Random muta- nucleobase preferences in dCK. Based on preliminary data by Lavie and genesis yielded two unique kinase variants, Nevertheless, we have also come to coworkers,[27] we built rTK3, a variant of epTK6 and epTK16, with five and three realize that there are limits to what extent dCK with the substitutions at Ala100Val/ mutations, respectively. More importantly, primary shell residues, amino acids that Arg104Met/Glu133Ala by site-directed both candidates carried an Arg104Gln sub- directly contact substrate in the enzyme mutagenesis, thereby eliminating the en- stitution and changes in Asp133 to either active site, can alter the overall perfor- zyme’s discrimination against thymidine Gly or Asn, supporting the functional rele- mance of an enzyme. In a separate study, SWISS CHEMISTS IN THE USA CHIMIA 2009, 63, No. 11 741

we analyzed and compared the active sites Table 2. Catalytic performance of chimeric kinases [31] of DmdNK and human TK2. While the WT TK2 WT DmdNK HD-16 HDHD-12 two enzymes share similar preferences for pyrimidine substrates, their catalytic per- TK: 1-175 DmdNK: TK: 1-11 DmdNK: formance differs by almost two orders of 176-236 12-88 TK: 89-172 magnitude. We found this difference to be DmdNK: 173-233 particularly interesting in light of the fact that the two enzymes carry identical amino T 2,700 91,000 (+33) 13,000 (+5) 14,000 (+5) acids in 27 out of 29 positions within 6 Å dC 1,200 90,000 (+75) 2,300 (+2) 9,100 (+8) of the phosphoryl acceptor binding site. While individual, as well as combined mu- dA nd 830 (1) 9.3 (–100) 22 (–33) tations of the two positions (L78F/L116M) dG nd 180 (1) 1.5 (–100) 4.5 (–33) in TK2 did raise enzyme activity in an ad- ditive fashion, the gains were comparably moderate. These findings strongly sug- gested that more distant portions of the en- ddC nd 39 (1) 3.4 (–11) 110 (+3) zyme also contribute to its overall catalytic d4T nd nd 11 19 performance and encouraged us to expand The k /K values (x 102 s–1 M–1) of the parental enzymes, as well as individual chimera are listed. our kinase engineering efforts beyond the cat M immediate vicinity of the active site. The fold change relative to TK2 (for pyrimidines) and DmdNK (for purines and nucleoside ana- logs) are shown in parentheses. nd = not detectable. Data were adapted from ref. [33].

Chimeragenesis in Kinase Engineering showed both chimeras to exhibit intermedi- ca plates, testing for cytotoxicity of NAs.[35] ary catalytic performances relative to their Replica plating, while directly evaluat- The functional role of protein regions parents (Table 2). However, experiments ing library members for NA activation, is distant to the active site can be explored by with 2’,3’-dideoxycytidine (ddC) and d4T very laborious and ultimately depends on random mutagenesis and in vitro recombi- showed increased rates of phosphorylation the toxicity of the phosphorylated NA to nation. The latter approach, better known for both NAs. More so, in the case of d4T the host, a criterion that in our tests in E. as DNA shuffling, exchanges entire pro- neither parent had measurable activity for coli could not be extended beyond AZT. tein fragments and domains between two the analog while both chimeras successfully In light of these biases and functional defi- or more parental structures, a strategy that phosphorylated the prodrug. Although the ciencies of existing assays, we sought the has been shown more efficient in generat- activities themselves are very low and offer design and implementation of a new, more ing progeny with improved properties than little immediate value for practical applica- adequate selection or screening technique mutagenesis.[32] However, traditional DNA tions, our experiments were able to identify for orthogonal NA kinases. shuffling is based on homologous gene re- another hotspot in the type-I dNK scaf- Inspired by new library screening combination, requiring the parental DNA fold, the C-terminal region of the protein. methods for directed evolution libraries sequences to share at least 70% identity. Ongoing experiments in our group and by based on FACS,[36,37] we developed a simi- For the shuffling of our type-I dNKs, this others in the field are now focusing on the lar strategy for identifying enzymes that method is not suitable as the sequence functional role of this region which, despite can phosphorylate NAs. Conceptionally, identities of various family members do being approximately 20 Å away from the our idea was based on the notion that i) not exceed 50%. To overcome this limita- phosphoryl acceptor binding site, appears nucleosides and NAs are efficiently trans- tion, we applied SCRATCHY, an alterna- to influence enzyme activity and substrate ported across the cell membrane while tive in vitro recombination protocol that specificity. Our working hypothesis argues their corresponding monophosphates, the does not depend on sequence identity of for communication between the two sites to of the kinase-catalyzed phospho- the parental genes.[33,34] be transmitted via conformational changes ryl transfer, become trapped in the host The application of SCRATCHY en- of the LID region, located between the two cytoplasm and ii) fluorescent nucleoside abled us to create extensive libraries of chi- segments in the kinase structure. analogs (fNAs) with an excitation maxima meric proteins between the human TK2 and of >300 nm are substrates for type-I dNKs. DmdNK. Subsequently, chimeras with thy- The spectral shift is necessary as proteins midine kinase activity were identified with New Methods for More Effective and other small-molecule metabolites such the help of our auxotrophic E. coli strain. Directed Evolution of Kinases as ATP possess intrinsicfluorescence prop- From among the functional hybrid kinases, erties under physiological conditions and candidates HD-16 and HDHD-12 were se- Over the course of our directed evolu- otherwise interfere with NA detection due lected for in-depth characterization. HD-16 tion experiments on dNKs, one reoccur- to the overlapping absorption maxima. is a single crossover chimera, carrying the ring issue has been the restrictions posed Fortunately, earlier DNA structure studies, N-terminus of TK2 (amino acids 1–175) by library analysis with the thymidine utilizing fluorescent 2’-deoxyribonucleo- and the C-terminus of DmdNK (amino ac- kinase-deficient E. coli strain KY895. In sides including etheno-adenine (11),[38] as ids 176–236). In contrast, HDHD-12 con- directed protein evolution, the golden rule well as furano (9) and pyrrolo-pyrimidines tains three crossovers, starting with the N- of ‘you get what you select for’ has been (10) and pterines (12,13)[39,40] (Fig. 5) terminal TK2 sequence (amino acids 1–11), upheld many times over the last decade. In have already demonstrated that relatively switching to DmdNK from residues 12–88 our particular case, library selection with small synthetic modifications of a nucleo- and back to TK2 from position 89–172, and the E. coli auxotroph would inadvertent- side’s pyrimidine or purine moiety could finally completing the chimera with the ly identify the thymidine kinases among red-shift its absorption spectrum, enabling C-terminal portion from DmdNK (amino our library members, not, as originally detection of these fNAs with high sensi- acids 173–233). When tested with natural planned, variants with NA kinase activity. tivity in complex mixtures such as the cy- 2’-deoxynucleosides, the kinetic analysis An alternative is in vivo screeningon repli- toplasm. We argued that the combination 742 CHIMIA 2009, 63, No. 11 SWISS CHEMISTS IN THE USA

Fig. 5. Structures and of these modified nucleobases with sugar spectral properties of derivativesfound in NA prodrugs would fl uorescent analogs create suitable substrates for dNK library of pyrimidines and screening. purines. Furano and For our specific purposes, we incu- pyrrolo-pyrimidines (8,9) can substitute bated bacteria that express individual dNK for thymine and library members with a target fNA, relying cytosine while etheno- on broad-specificity nucleoside transport- λ λ λ adenine (10) and ers to efficiently translocate the fluorophor λ λ λ pterines (11,12) are across the membrane and produce the cor- replacements for responding fNA-monophosphate in the purines. Fluorescent presence of a functional fNA kinase (Fig. nucleoside analogs 6). Upon accumulation of the fluorophor carry the appropriate in the cytoplasm, FACS was employed to sugar derivative identify and isolate hosts with the highest instead of the [37,41] 2’-deoxyribosyl moiety fluorescence intensity. as the R-group. The new FACS-based assay was first λ λ put to the test over four rounds of directed λ λ evolution of DmdNK to identify enzymes for the phosphorylation of 3’-deoxythy- midine (ddT; 8).[42] Even though the iso- lated kinases had as few as four mutations, their kinetic characterization showed a O nucleoside-specific O dramatic change in substrate specificity. fluorescent N kinase N For example in variant R4.V3 (Thr85Met/ nucleoside -O HO N O - O N O Glu172Val/Tyr179Phe/His193Tyr), the O P O O O catalytic efficiency for natural pyrimidine

OH OH 2’-deoxyribonucleosides was lowered by up to 48,000-fold while activity for the fddT and ddT was largely preserved 100 (Table 3). Reverse engineering studies re- vealed that these functional changes were 80 largely accomplished by two amino acid

count 60 substitutions in the phosphoryl-acceptor binding site, responsible for interacting FACS cell new round / 40 with the substrate’s ribose moiety (Fig. 7). library analysis ax. The substitution of Val172 back to wild inase 20 inase type Glu172 in R4.V3-[172] reestablished %m -k +k significant activity levels of the natural 0 0 1 2 3 2’-deoxynucleosides (Table 3), verifying 10 10 10 10 the residue’s major role in determining fluorescence intensity substrate specificity. The amino acid rever- sion at position 179 resulted in less dra-

Fig. 6. Schematic overview of our new kinase screening assay based on fl uorescence activated cell sorting (FACS). Accumulation Table 3. Catalytic performance of ddT-kinases of the fl uorescent nucleoside (or nucleoside WT DmdNK R4.V3 R4.V3-[85] R4.V3-[172] analog) monophosphate in bacteria that express a kinase with the desired substrate Thr85Met ... Glu172Val Thr85Met ... specificity raises the fl uorescence intensity Glu172Val Tyr179Phe Tyr179Phe of the host cell. FACS can rapidly detect Tyr179Phe His193Tyr His193Tyr and separate this subpopulation of the cell His193Tyr culture for subsequent characterization or additional rounds of directed evolution. T 4813 0.1 (–48,000) 1.4 (–3,400) 326 (–14) The assay was validated with E. coli cells, expressing either DmdNK with thymidine dC 5850 0.36 (–16,000) 0.39 (–15,000) 2925 (–2) kinase activity (+ kinase) or the thymidine dA 161 nd nd 57 (–2.5) kinase activity-deficient human dCK (- kinase). Upon incubation with furano-thymidine, the dG 28 nd nd 6 (–4.5) fl uorescence intensity increase 20 to 30-fold in host cells expressing DmdNK. Figure adapted from ref. [42]. ddT 4.6 3 (–1.5) 28 (+6) 2.6 (–1.8)

fddT 1.3 1.9 (+1.5) 0.4 (–3) 7.5 (+6)

The k /K values (x 103 s–1 M–1) of the parental enzyme, as well as evolved variants are listed. cat M The fold change relative to DmdNK are shown in parentheses. nd = not detectable. Data were adapted from ref. [42]. SWISS CHEMISTS IN THE USA CHIMIA 2009, 63, No. 11 743

A. B. its native Thr (R4.V3-[85]), we observed a Tyr Phe 10-fold increase in the catalytic efficiency 179 179 for ddT. In comparison with the parental Glu Val DmdNK, R4.V3-[85] showed only resid- 172 172 ual activity for the natural DNA building blocks but phosphorylated ddT with 6-fold greater efficiency, overall making it a truly orthogonal NA kinase. Although we anticipated a larg- er increase in the catalytic activity of Thr Met R4.V3-[85] for ddT, based on the selec- 85 85 tion pressure during FACS analysis, the moderate effect seen in vitro translated into more dramatic improvements in vivo. In bacterial cultures, as well as in more Fig. 7. Structure model of active sites in DmdNK (A) and R4.V3 (B). The location of the three recent mammalian cell culture work, our relevant mutations Glu172Val, Tyr179Phe, and Thr85Met are highlighted in DmdNK with ddT kinase clearly outperformed the pa- thymidine, bound in the phosphoryl acceptor site (PDB code: 1OT3).[43] A) Glu172 in the wild- rental DmdNK. We attribute this effect type enzyme forms a direct hydrogen-bonding interaction with the 3’-hydroxyl group of the to the ddT kinase’s ability to discriminate substrate while Tyr179 contacts the same position via hydrogen bonding to a water molecule (red against naive 2’-deoxynucleosides which sphere) in the active site. B) Directed evolution of DmdNK for ddT phosphorylation introduces are present at low micromolar concentra- two hydrophobic residues in position 172 and 179, better accommodating the binding of the tions inside the cell. In contrast to the engi- nucleoside analog. Separately, the mutation of Thr85 to Met is believed to slightly enlarge the binding pocket for the nucleobase portion by inducing a conformational shift of the helical region. neered kinase, the ability of DmdNK to ac- tivate the NA is compromised by substrate competition for the active site between ddT and natural 2’-deoxynucleosides. Follow- ing this initial demonstration of our new FACS-based kinase screening assay to A.) B.) identify NA kinases in directed evolution libraries, we are poised to expand our en- gineering efforts to find novel kinases with improved properties for existing, as well as new NA prodrugs.

Future Perspectives: Multi- functional Kinases

Another new and exciting research challenge presents itself as a direct conse- quence of the recent success in engineer- ing 2’-deoxynucleoside kinases. Early experiments suggest that the increased ef- ficiency of initial nucleoside analog phos- phorylation could shift the bottleneck of activation to the second step of the reac- tion cascade, the conversion of nucleoside monophosphate to its diphosphate. While Fig. 8. Structure comparison of dNK, dNMPK and dual-function kinase. A) Superposition of earlier studies have demonstrated that DmdNK (PDB code: 1OT3)[43] and thymidylate kinase from E. coli (PDB code: 4TMK).[46] Despite mutagenesis and in vitro recombination overall low sequence identity and strict specificity for either 2’-deoxynucleosides or thymidine can also broaden the substrate specificity monophosphate, the two enzymes have matching functional elements (LID region highlighted in of dNMPKs,[44,45] an attractive alternative yellow/orange, P-loop shown in green) and striking overall structural similarity. The bisubstrate would be to merge dNK and dNMPK activ- 1 5 analog Tp5A (P -(5’-adenosyl)P -(5’-thymidyl)pentaphosphate) bound in the active site of the ity by creating a dual-function biocatalyst. E. coli kinase is shown in sticks B) The thymidine kinase from Herpes Simplex virus (PDB code: 1P7C)[47] is primarly a thymidine kinase, yet can also perform the second phosphorylation step A single enzyme capable of performing from TMP to TDP. The enzyme carries N and C-terminal extensions but otherwise shares the both phosphoryl-transfer reactions would same core structure with dNKs and dNMPKs including the central β-sheet, the LID region not only simplify its clinical application (orange), and the P-loop (green). The binding conformation of the co-crystallized Tp5A analog but could also minimize the accumulation also closely resembles the orientation seen in the above E. coli structure. of cytotoxic intermediates and raise the ef- ficiency of the overall activation process. From a practical perspective, the idea matic changes but nevertheless confirmed ety (fddT) did bias the outcome of the ex- of creating a bis-phosphorylation kinase this residue’s contribution to the overall periment. In addition to the two active site for nucleoside analogs is supported by specificity switch. Further analysis also mutations, a third mutation at Thr85 was the structural similarities of enzymes in showed that the new assay was faithful to linked to enlarging the nucleobase binding the two subfamilies. As shown in Fig. the golden rule; using a nucleoside analog pocket to better accommodate the extra 8A, the crystal structures from represen- with modifications in the nucleobase moi- furnao moiety. Upon reversion of Met85 to tatives of both subfamilies, DmdNK and 744 CHIMIA 2009, 63, No. 11 SWISS CHEMISTS IN THE USA

the E. coli thymidylate kinase (EcTMPK), [1] E. Fischer, Ber. Dtsch. Chem. Ges. 1894, 27, [33] M. L. Gerth, S. Lutz, J. Mol. Biol. 2007, 370, show striking similarities between the two 2984. 742. [2] E. S. Arner, S. Eriksson, Pharmacol. Ther. [34] S. Lutz, M. Ostermeier, G. L. Moore, C. D. enzyme scaffolds. Furthermore, the same 1995, 67, 155. Maranas, S. J. Benkovic, Proc. Natl. Acad. Sci. structures show superimposable binding [3] S. Eriksson, B. Munch-Petersen, K. Johansson, USA. 2001, 98, 11248. sites for phosphoryl-donor and -acceptor, H. Eklund, Cell. Mol. Life Sci. 2002, 59, 1327. [35] F. C. Christians, L. Scapozza, A. Crameri, G. respectively, while results from mechanis- [4] G. A. Gentry, Pharmacol. Ther. 1992, 54, 319. Folkers, W. P. Stemmer, Nat. Biotechnol. 1999, tic studies are consistent with direct phos- [5] J. Shi, J. J. McAtee, S. Schlueter Wirtz, P. 17, 259. Tharnish, A. Juodawlkis, D. C. Liotta, R. F. [36] L. L. Lairson, A. G. Watts, W. W. Wakarchuk, S. phoryl-transfer in both cases (in contrast Schinazi, J. Med. Chem. 1999, 42, 859. G. Withers, Nat. Chem. Biol. 2006, 2, 724. to nucleoside diphosphate kinase, the third [6] F. L. Moolten, Cancer Res. 1986, 46, 5276. [37] M. J. Olsen, D. Stephens, D. Griffiths, P. enzyme in the salvage pathway, which pro- [7] K. W. Culver, Z. Ram, S. Wallbridge, H. Ishii, Daugherty, G. Georgiou, B. L. Iverson, Nat. ceeds via a phospho-histidine intermedi- E. H. Oldfield, R. M. Blaese, Science 1992, 256, Biotechnol. 2000, 18, 1071. [48] 1550. [38] J. R. Barrio, J. A. Secrist, 3rd, N. J. Leonard, ate ). Probably the strongest argument in [8] L. P. Jordheim, C. M. Galmarini, C. Dumontet, Biochem. Biophys. Res. Commun. 1972, 46, 597. favor of developing a dual-function kinase Cancer Chemother. Pharmacol. 2006, 58, 547. [39] D. A. Berry, K.Y. Jung, D. S. Wise, A. D. Sercel, is the existence of precedence in nature. [9] N. Solaroli, M. Johansson, J. Balzarini, A. W. H. Pearson, H. Mackie, J. B. Randolph, R. L. Generally, dNKs and dNMPKs are strictly Karlsson, Gene Ther. 2007, 14, 86. Somers, Tetrahedron Lett. 2004, 45, 2457. limited to their respective reactions, yet [10] S. Song, Z. F. Pursell, W. C. Copeland, M. J. [40] M. E. Hawkins, W. Pfleiderer, F. M. Balis, D. Longley, T. A. Kunkel, C. K. Mathews, Proc. Porter, J. R. Knutson, Anal. Biochem. 1997, two promiscuous enzymes have been re- Natl. Acad. Sci. USA 2005, 102, 4990. 244, 86. ported: the thymidine kinases from Herpes [11] C. A. Tracewell, F. H. Arnold, Curr. Opin. [41] G. Georgiou, Adv. Protein. Chem. 2000, 55, 293. Simplex and Equine herpes virus.[47,49,50] Chem. Biol. 2009, 13, 3. [42] L. Liu, Y. Li, D. Liotta, S. Lutz, Nucleic Acids In both cases, the homology of their core [12] J. A. Gerlt, P. C. Babbitt, Curr. Opin. Chem. Res. 2009, 37, 4472. Biol. 2009, 13, 10. [43] N. E. Mikkelsen, K. Johansson, A. Karlsson, structures clearly identifies them as mem- [13] P. Bernhardt, S. E. O’Connor, Curr. Opin. W. Knecht, G. Andersen, J. Piskur, B. Munch- bers of the 2’-deoxynucleoside kinase Chem. Biol. 2009, 13, 35. Petersen, H. Eklund, Biochemistry 2003, 42, family (Fig. 8B). Furthermore, phyloge- [14] J. Tao, J. H. Xu, Curr. Opin. Chem. Biol. 2009, 5706. netic analysis places them closer to the 13, 43. [44] C. L. Willmon, E. Krabbenhoft, M. E. Black, dNK subgroup, a finding that is consistent [15] N.J. Turner, Nat. Chem. Biol. 2009, 5, 567. Gene Ther. 2006, 13, 1309. [16] R.J. Kazlauskas, U.T. Bornscheuer, Nat. Chem. [45] R. Brundiers, A. Lavie, T. Veit, J. Reinstein, I. with kinetic experiments which confirm Biol. 2009, 5, 526. Schlichting, N. Ostermann, R. S. Goody, M. the herpes enzymes’ primary function as [17] S. Lutz, W. M. Patrick, Curr. Opin. Biotechnol. Konrad, J. Biol. Chem. 1999, 274, 35289. thymidine kinases with ~10% secondary 2004, 15, 291. [46] A. Lavie, N. Ostermann, R. Brundiers, R.S. activity as thymidylate kinases.[49,50] [18] E. T. Farinas, Comb. Chem. High Throughput. Goody, J. Reinstein, M. Konrad, I. Schichtling, Screen. 2006, 9, 321. Proc. Natl. Acad. Sci. USA 1998, 95, 14045. While there have been numerous en- [19] A. D. Griffiths, D. S. Tawfik, Trends Biotechnol. [47] A. Gardberg, L. Shuvalova, C. Monnerjahn, M. zyme engineering studies involving Her- 2006, 24, 395. Konrad, A. Lavie, Structure 2003, 11, 1265. pes Simplex virus thymidine kinase, a [20] S. Lutz, J. Lichter, L. Liu, J. Am. Chem. Soc. [48] I. Lascu, P. Gonin, J. Bioenerg. Biomembr. majority of these studies have focused on 2007, 129, 8714. 2000, 32, 237. the initial phosphorylation of ganciclovir [21] D. Segura-Pena, J. Lichter, M. Trani, M. Konrad, [49] M. S. Chen, J. Walker, W. H. Prusoff, J. Biol. A. Lavie, S. Lutz, Structure 2007, 15, 1555. Chem. 1979, 254, 10747. (7)and acyclovir, two nucleoside analogs [22] D. Segura-Pena, S. Lutz, C. Monnerjahn, M. [50] M. S. Chen, W. H. Prusoff, J. Biol. Chem. 1978, used in cancer treatment and antiherpal Konrad, A. Lavie, J. Mol. Biol. 2007, 369, 129. 253, 1325. therapy, respectively.[51–58] However, as [23] M. Welin, U. Kosinska, N. E. Mikkelsen, [51] D. K. Dube, J. D. Parker, D. C. French, D. S. neither prodrug relies on the herpes ki- C. Carnrot, C. Zhu, L. Wang, S. Eriksson, B. Cahill, S. Dube, M. S. Horwitz, K. M. Munir, L. Munch-Petersen, H. Eklund, Proc. Natl. Acad. A. Loeb, Biochemistry 1991, 30, 11760. nase for its conversion to the diphosphate, Sci. USA 2004, 101, 17970. [52] K. M. Munir, D. C. French, D. K. Dube, L. A. few of these investigations have assessed [24] A. S. Al-Madhoun, J. Johnsamuel, R. F. Barth, Loeb, Protein Eng. 1994, 7, 83. the impact of mutations on the secondary W. Tjarks, S. Eriksson, Cancer Res. 2004, 64, [53] K. M. Munir, D. C. French, D. K. Dube, L. A. activity, the phosphorylation of TMP to 6280. Loeb, J. Biol. Chem. 1992, 267, 6584. TDP.[59–61] In its current state, the dual [25] B. Munch-Petersen, W. Knecht, C. Lenz, L. [54] K. M. Munir, D. C. French, L. A. Loeb, Proc. Sondergaard, J. Piskur, J. Biol. Chem. 2000, Natl. Acad. Sci. USA 1993, 90, 4012. functionality of Herpes Simplex virus 275, 6673. [55] M. E. Black, D. E. Hruby, J. Biol. Chem. 1992, thymidine kinase is mostly a curiosity but [26] W. Knecht, M. P. Sandrini, K. Johansson, H. 267, 9743. one with potential to offer new insight in Eklund, B. Munch-Petersen, J. Piskur, EMBO J. [56] M. E. Black, L. A. Loeb, Methods Mol. Biol. the catalytic plasticity of enzymes and to 2002, 21, 1873. 1996, 57, 335. [27] E. Sabini, S. Ort, C. Monnerjahn, M. Konrad, A. [57] M. E. Black, L. A. Loeb, Biochemistry 1993, create more effective tools for nucleoside Lavie, Nat. Struct. Biol. 2003, 10, 513. 32, 11618. prodrug activation. [28] P. Iyidogan, S. Lutz, Biochemistry 2008, 47, [58] M. S. Kokoris, M. E. Black, Protein Sci. 2002, 4711. 11, 2267. Acknowledgements [29] K. Igarashi, S. Hiraga, T. Yura, Genetics 1967, [59] B. Degreve, R. Esnouf, E. De Clercq, J. This work was supported in part by the U.S. 57, 643. Balzarini, Mol. Pharmacol. 2001, 59, 285. National Institutes of Health [GM69958] and a [30] A. Aharoni, L. Gaidukov, O. Khersonsky, Q. G. [60] T. A. Hinds, C. Compadre, B. K. Hurlburt, R. R. grant to the Emory Center for AIDS Research S. Mc, C. Roodveldt, D. S. Tawfik, Nat. Genet. Drake, Biochemistry 2000, 39, 4105. 2005, 37, 73. [61] K. E. Mercer, C. E. Ahn, A. Coke, C. M. [AI050409] from the National Institutes of [31] M. L. Gerth, S. Lutz, Biochem. Biophys. Res. Compadre, R. R. Drake, Protein Eng. 2002, 15, Health and by institutional funding from the Commun. 2007, 354, 802. 903. Emory University Health Science Center. [32] M. Trani, S. Lutz, in ‘Protein Engineering Handbook’, Ed. S. Lutz, U. T. Bornscheuer, Received: October 1, 2009 Wiley VCH Verlag, Weinheim, 2008, p. 493.