The Specificity-Conferring Code of Adenylation Domains in Nonribosomal Peptide Synthetases Torsten Stachelhaus*, Henning D Mootz and Mohamed a Marahiel

Research Paper 493 The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases Torsten Stachelhaus*, Henning D Mootz and Mohamed A Marahiel Background: Many pharmacologically important peptides are synthesized Address: Biochemie/Fachbereich Chemie, nonribosomally by multimodular peptide synthetases (NRPSs). These enzyme Philipps-Universität Marburg, Hans-Meerwein- Straße, D-35032 Marburg, Germany. templates consist of iterated modules that, in their number and organization, determine the primary structure of the corresponding peptide products. At the *Present address: Department of Biological core of each module is an adenylation domain that recognizes the cognate Chemistry and Molecular Pharmacology, Harvard substrate and activates it as its aminoacyl adenylate. Recently, the crystal Medical School, 240 Longwood Avenue, Boston, structure of the phenylalanine-activating adenylation domain PheA was solved with MA 02115, USA. phenylalanine and AMP, illustrating the structural basis for substrate recognition. Correspondence: Mohamed A Marahiel E-mail: [email protected] Results: By comparing the residues that line the phenylalanine-binding pocket in PheA with the corresponding moieties in other adenylation domains, general rules Key words: adenylation domain, binding pocket, nonribosomal peptide synthetase, signature for deducing substrate specificity were developed. We tested these in silico sequence, substrate specificity ‘rules’ by mutating specificity-conferring residues within PheA. The substrate specificity of most mutants was altered or relaxed. Generalization of the selectivity Received: 11 March 1999 determinants also allowed the targeted specificity switch of an aspartate- Revisions requested: 13 April 1999 Revisions received: 27 April 1999 activating adenylation domain, the crystal structure of which has not yet been Accepted: 4 May 1999 solved, by introducing a single mutation. Published: 29 June 1999 Conclusions: In silico studies and structure–function mutagenesis have defined Chemistry & Biology August 1999, 6:493–505 general rules for the structural basis of substrate recognition in adenylation http://biomednet.com/elecref/1074552100600493 domains of NRPSs. These rules can be used to rationally alter the specificity of adenylation domains and to predict from the primary sequence the specificity of © Elsevier Science Ltd ISSN 1074-5521 biochemically uncharacterized adenylation domains. Such efforts could enhance the structural diversity of peptide antibiotics such as penicillins, cyclosporins and vancomycins by allowing synthesis of ‘unnatural’ natural products. Introduction pool of available substrates and generates the correspond- Nonribosomally synthesized peptides represent a large ing aminoacyl adenylate using ATP. The aminoacyl group of structurally complex metabolites that are manu- moiety is then covalently tethered to the sulfhydryl group factured from amino, hydroxy and carboxy acid monomers of a phosphopantetheinyl (Ppant) prosthetic group on an by large multifunctional enzymes, termed nonribosomal adjacent thiolation domain (also called peptidyl carrier peptide synthetases (NRPSs) [1–4]. Although their actual protein, PCP). Thiolation domains are post-translationally biological roles in the producing organisms (primarily modified by Ppant transferases, which add the Ppant group bacilli, actinomycetes and filamentous fungi) are often [7]. The nascent peptidyl chain grows unidirectionally obscure, a remarkable variety of pharmacological proper- under the catalytic control of a condensation (C) domain ties can be linked to such naturally occurring peptides. each time an upstream peptidyl-S-Ppant donor is attacked Important antibiotics synthesized by NRPSs include peni- by a monomeric aminoacyl-S-Ppant nucleophile [8]. Modi- cillins and cephalosporins, vancomycins, as well as the fying the incorporated monomers (e.g. by epimerization or immunosuppressant cyclosporin A [1–3]. Interestingly, the N-methylation) or the peptide backbone (e.g. by acylation, same class of NRPSs also assembles several virulence- glycosylation or heterocylization) can further functionalize conferring siderophoric peptides, such as mycobactin and the synthesized peptide product. These tailoring reactions yersiniabactin [5,6]. are catalyzed by specialized domains or by fusion to polyketide synthase (PKS) modules [1–3]. Primary structure, size and complexity of a peptide product are dictated by the number and organization of iterated Although several biochemical studies have shown that the modules, which constitute the NRPS template [1–4]. Each A domains are the specificity-mediating ‘gate-keeper’ module activates its cognate amino acid in a two-step reac- units of the repeated modules [9–11], the selectivity- tion using a pair of closely coupled domains. An adenyl- conferring elements have remained unclear. This has ation (A) domain selects the cognate amino acid from the changed recently, with the elucidation of the crystal 494 Chemistry & Biology 1999, Vol 6 No 8 Figure 1 Structural basis for phenylalanine activation by PheA (from Conti et al. [13]). (a) The ribbon diagram of PheA shows how the A domain is folded into a large amino-terminal and a smaller carboxy-terminal subdomain. The AMP (red) and phenylalanine (orange), bound at the interface between both subdomains, are shown in space-filling presentation. The locations of the highly conserved core motifs are indicated : A1, LTYxEL; A2, LKAGxAYVPID; A3, LAYxxYTSGTTGxPKG; A4, FDxS; A5, NxYGPTE; A6, GELxIxGxGLARGYW; A7, YKTGDQ; A8, GRxDxQVKIRGx- RVELEEVE; A9, LPxYMIP; and A10, NGKIDR (using single-letter amino-acid code where x is any amino acid). (b) The phenylalanine-specific binding pocket consists of ten residues. Asp235 and Lys517 mediate electrostatic interactions (dotted lines) with the α-amino and α-carboxylate groups of phenylalanine, Cys331, and on the other side by Ala322, bottom of the pocket. This architecture allows whereas the sidechain specificity pocket is Ala301, Ile299 and Thr278. Both sides are PheA to accommodate L-Phe (green) and D- surrounded on one side by Ala236, Ile330 and separated by the indole ring of Trp239 at the Phe with no significant change in conformation. structures of two members of this superfamily of adeny- The conclusion that shared sequence motifs play a key role late-forming enzymes. First, the structure of firefly in ATP binding is reasonable, because all A domains luciferase of Photinus pyralis was reported [12], which con- require ATP as a substrate. In contrast, the residues lining firmed the existence of a third scaffold for the ATP- the phenylalanine-binding pocket were found to be located dependent formation of aminoacyl adenylates, in addition within a 100 amino acid (aa) stretch that possesses a lower to those found in class I and class II aminoacyl tRNA syn- overall similarity between different A domains (Figure 2) thetases. Subsequently, the structure of the A domain of [13]. On the strength of sequence alignments and hybrid the peptide synthetase GrsA (termed PheA) was solved domain construction [19–23], this area was also proposed to complexed with phenylalanine and AMP (Figure 1a) [13]. confer substrate specificity on the A domains. As shown in This second structure provided, for the first time, funda- Figure 1b, the α-amino and the α-carboxylate groups of the mental insight into the structural basis of substrate recog- substrate amino acid are stabilized by electrostatic interac- nition and activation (Figure 1b). tions with Asp235 and Lys517, respectively. These residues are located in the highly conserved core motifs A4 Although the protein sequence similarity between PheA (Asp235) and A10 (Lys517) [1,4], with the latter being and luciferase is only 16%, both enzymes share a highly found in a loop within the carboxy-terminal subdomain conserved three-dimensional structure [12,13]. As shown (Figure 1a). The specificity pocket for the phenylalanine for PheA (Figure 1a), the enzyme folds into a large amino- sidechain is bordered on one side by Ala236, Ile330 and terminal and a smaller carboxy-terminal subdomain. The Cys331, and by Ala322, Ala301, Ile299 and Thr278 on the latter has been shown to be important for catalytic activity, other side. Both sides are appropriately separated by the and is rotated 94° with respect to the amino-terminal sub- indole ring of Trp239 at the bottom of the pocket. At one domain, when compared to the structure of luciferase. end, towards the viewer in Figure 1b, a water-filled channel Because luciferase was crystallized in the absence of sub- connects the pocket with the surrounding solvent. Inspec- strate, the rotation can be interpreted as an active conformation of the PheA binding pocket shows that both D- and tional change during the course of substrate recognition and L-Phe can be accommodated (Figure 1b) with no signifi- activation. Indeed, the active site is located at the interface cant change in conformation [13], confirming biochemical between the folding domains and rotation closes the cleft data which revealed that both stereoisomers are activated between them. Most of the highly conserved core motifs with the same catalytic efficiency [9]. (A1 to A10, Figure 1a) were found surrounding the active site where the substrates bind. This finding confirmed pre- We expected, because of the high similarity between vious results obtained using site-directed mutagenesis and peptide synthetases, that amino-acid residues that corre- photoaffinity

The Specificity-Conferring Code of Adenylation Domains in Nonribosomal Peptide Synthetases Torsten Stachelhaus*, Henning D Mootz and Mohamed a Marahiel

Review of Oxepine-Pyrimidinone-Ketopiperazine Type Nonribosomal Peptides

Amide Bond Formation in Nonribosomal Peptide Synthesis

Construction of Hybrid Peptide Synthetases for the Production of A-L-Aspartyl-L-Phenylalanine, a Precursor for the High-Intensity Sweetener Aspartame

Nonribosomal Peptides Synthetases and Their Applications in Industry Mario Alberto Martínez‑Núñez1,2* and Víctor Eric López Y López3

Biosynthesis and Engineering of Cyclomarin and Cyclomarazine: Prenylated, Non-Ribosomal Cyclic Peptides of Marine Actinobacterial Origin

Atlas of Nonribosomal Peptide and Polyketide Biosynthetic Pathways Reveals Common Occurrence of Nonmodular Enzymes

Recent Advances in Engineering Nonribosomal Peptide Assembly Lines Cite This: Nat

Non-Ribosomal Peptide Synthesis

Diversity of Linear Non-Ribosomal Peptide in Biocontrol Fungi

NIH Public Access Author Manuscript Proteins

Drosophila Melanogaster Nonribosomal Peptide Synthetase Ebony Encodes an Atypical Condensation Domain

Distribution and Evolution of Nonribosomal Peptide Synthetase Gene Clusters in the Ceratocystidaceae