ISSN 00062979, Biochemistry (Moscow), 2008, Vol. 73, No. 1, pp. 113. © Pleiades Publishing, Ltd., 2008. Original Russian Text © G. N. Rudenskaya, D. V. Pupov, 2008, published in Biokhimiya, 2008, Vol. 73, No. 1, pp. 317.

REVIEW Cysteine Proteinases of Microorganisms and Viruses

G. N. Rudenskaya1* and D. V. Pupov2

1Faculty of Chemistry and 2Faculty of Biology, Lomonosov Moscow State University, 119991 Moscow, Russia; fax: (495) 9393181; Email: [email protected] Received May 7, 2007 Revision received July 18, 2007

Abstract—This review considers properties of secreted cysteine proteinases of protozoa, , and viruses and presents information on the contemporary taxonomy of cysteine proteinases. Literature data on the structure and physicochemical and enzymatic properties of these are reviewed. High interest in cysteine proteinases is explained by the discovery of these enzymes mostly in pathogenic organisms. The role of the proteinases in pathogenesis of several severe diseases of human and animals is discussed.

DOI: 10.1134/S000629790801001X

Key words: cysteine proteinases, properties, protozoa, bacteria, viruses

Classification and Catalytic Mechanism and related peptidases showed that the catalytic of Cysteine Proteinases residues are arranged in the following order in the polypeptide chain: Cys, His, and Asn. Also, a glutamine Cysteine proteinases are peptidyl in residue preceding the catalytic cysteine is also important which the role of the nucleophilic group of the for catalysis. This residue is probably involved in the for is performed by the sulfhydryl group of a cysteine residue. mation of the oxyanion cavity of the . The cat Cysteine proteinases were first discovered and investigat alytic cysteine residue is usually followed by a residue of ed in tropic plants. They have now been found in all rep an aromatic hydrophobic amino acid, but sometimes this resentatives of the animal world. position is occupied by a glycine residue. The catalytic mechanism of cysteine proteinases is The representatives of clan CB contain the catalytic similar to that of serine peptidases, which implies the par dyad HisCys. Comparative analysis of tertiary structures ticipation of a nucleophilic group and a proton donor. of typical representatives of clan CB, proteinases of The role of the proton donor in all cysteine peptidases, as 3C and virus, with the structure in most serine peptidases, is played by a histidine residue. of chymotrypsin reveals a significant similarity in their Some cysteine proteinases need a third catalytic residue spatial structures. These data suggest that these represen that serves for the correct spatial orientation of the imida tatives of the cysteine and serine proteinases may origi zole ring of the histidine residue. nate from a common ancestor. The temporary classification of proteolytic enzymes In clan CC, the cysteine catalytic residue usually fol MEROPS (http://merops.sanger.ac.uk) based on the cat lows a hydrophobic residue, while in clan CB a glycine alytic mechanism of proteinases and their primary and residue follows the active site cysteine. The catalytic spatial structure includes nine main clans of cysteine pro mechanism of the clan CC proteinases is similar to that of teinases. papain, which involves the participation of four amino The first discovered cysteine proteinase, papain, acid residues, but in the polypeptide chain of clan CC belongs to clan CA. Analysis of the crystal structures of proteinases these residues are arranged in a different order: His, Glu, Gln (or Asp), Cys. of clan CC usually hydrolyze a single bond during the pro Abbreviations: DTT) dithiothreitol; FMDV) footandmouth disease virus; RDRP) RNAdependent RNA polymerase; cessing of the virus polyprotein, while the proteinases of TLCK) tosylLlysine chloromethyl ketone; TPCK) tosylL clan CB are capable of cleaving several bonds. phenylalanine chloromethyl ketone. The enzymes of clan CD contain the catalytic dyad * To whom correspondence should be addressed. HisCys. Typical representatives of this clan are the

1 2 RUDENSKAYA, PUPOV enzymes gingipains K and R, , , and animal diseases possess a number of proteinases, includ legumains. All enzymes of this clan are highly selective. ing cysteine proteinases that are important factors of their Their specificity is determined by the amino acid residue pathogenicity. in the P1 position, which is uncharacteristic for papain Cysteine proteinases of protozoa of the genus like cysteine proteinases. Proteinases of clan CD, unlike Plasmodium. Malaria is one of the most severe fever other cysteine proteinases, are not inhibited (or are inhib causing infectious diseases. There are one million lethal ited reversibly) by the inhibitor E64. Moreover, they all cases per year worldwide. Plasmodium falciparum is the have a similar structure of the active site, and gingipain R most pathogenic species of parasitic protozoa that causes and 1 exhibit similar spatial packing of polypep malaria in humans. The main proteinase secreted by this tide chains. parasite is falcipain; it belongs to the papain family of cys Currently, the following five clans of cysteine pro teine proteinases (clan CA, family C1). Falcipain was first teinases have been specified: CF, CH, CL, CM, and CN. isolated from the trophozoite form of the parasite [1]. The The serine proteinases of clan CF contain in the active enzyme exhibits maximal activity in an acid medium site the GluCysHis. A representative of (pH 5.06.5), this being connected with its localization in this clan is pyroglutamylpeptidase I from the bacterium the digestive vacuoles of trophozoites. The activity of fal Thermococcus litoralis. The tertiary structure of pyroglut cipain is significantly stimulated by reducing agents such amylpeptidase I differs from the structures of other as dithiothreitol (DTT), cysteine, and glutathione. The known cysteine proteinases and belongs to the new clan enzyme cleaves protein substrates: azocoll, gelatin, and CF. However, there is some similarity between the repre casein. Among synthetic substrates, falcipain hydrolyzes sentatives of clan CF and clans of metalloproteinases those containing in the P1 position Lys or Arg residues, MC, MF, and MH. preferring large neutral amino acid residues in the P2 A cysteine proteinase of clan CH (hedgehog protein) position. Falcipain is inactivated by the peptide inhibitors was found in Drosophila melanogaster. The active site of of cysteine proteinases leupeptin, E64, and chymostatin this enzyme is composed of the triad CysThrHis. Clan [2]. CH includes autolytic cysteine proteinases. The translation product of the falcipain gene is a pre Clan CL includes sortase A and B from proenzyme (569 amino acid residues, 66.8 kD) [3]. Staphylococcus aureus, whose active sites contain the dyad Comparison with papain and the relative proteinases sug HisCys. The spatial structures of sortases A and B were gests that the formation of the active enzyme form is pro determined, and sortase A was shown to be activated by vided by cleavage between Lys332 and Val333. The pro calcium ions. Clan CL includes bacterial proteinases with teolysis yields the mature proteinase of 26.8 kD with the high activity. amino acid sequence exhibiting 37 and 33% of homology Clan CM includes viral endopeptidases existing as with human L and papain, respectively. homodimers in the active state, which is of special inter It is supposed that the mature falcipain molecule est. Their active site contains the triad HisGluCys, but contains four disulfide bonds. The gene encoding this the first two residues belong to one monomer, and the protein was named the falcipain1 gene, since another cysteine residue belongs to the other monomer. The terti gene was found that was named the falcipain2 gene [2, ary structure of proteinase NS2 from hepatitis C virus was 4]. It is considered that all previously determined bio determined; it appears to be unique for clan CM. chemical properties of falcipain are related to the product The active sites of cysteine proteinases of clan CN of the falcipain2 gene. This gene encodes a proteinase contain the dyad CysHis. The tertiary structure for pro that is a typical member of papain family containing a teinase nsP2 from Venezuelan horse virus has been deter large prodomain. Without this prodomain, the enzyme mined. The domain structure of clan CN cysteine pro exhibits molecular weight of 27 kD. The predicted amino teinases is unique, but the Cterminal domain is similar to acid sequences of mature falcipain2 and falcipain1 that of SadenosineLmethioninedependent enzymes exhibit only 37% of homology. Falcipain2 lacks in the of the methyltransferase family. characteristic signal peptide, but has a hydrophobic sequence of 20 amino acid residues that is probably a transmembrane domain. The gene of falcipain3 has also Cysteine Proteinases of Protozoa been characterized [5]. The mature falcipain3 exhibits 65% of homology with falcipain2. Various protozoans differ significantly in their nutri The most important period of the life cycle of the tional metabolism. There are photosynthesizing parasite takes place in erythrocytes. Falcipains are hemo autotrophs and microorganisms that use carbohydrate globinases that are can digest hemoglobin in a nutrition substrates instead of proteins as the source of carbon vacuole in an acidreducing medium. The stability of the (Euglenales), so the secretion of proteolytic enzymes into erythrocyte membrane is maintained by a bidimensional the environment is not characteristic for them. However, protein system on the internal surface of the membrane. representatives of protozoa causing numerous human and Proteolytic cleavage of these proteins results in the lysis of

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 3 the erythrocytes. Falcipain2 is capable of cleaving skele proteinases are activated in the presence of reducing tal proteins of the erythrocyte membrane at neutral pH agents (DTT, cysteine, and βmercaptoethanol) and values. For example, falciapin2 hydrolyzes ankyrin and inactivated in the presence of inhibitors of cysteine pro band 4.1 protein in their Cterminal region. The activity teinases (iodoacetate, iodoacetamide, Nethyl of this enzyme allows the parasites to leave the erythro maleimide, E64), as well as tosylLlysine chloromethyl cyte. ketone (TLCK), leupeptin, and α1antitrypsin [12, 13]. Analogs of falcipain have been found in nine species At least seven genes of E. histolytica encoding differ of malarial parasites. The primary structures of the ent preproforms of cysteine proteinases have been identi mature enzyme are rather conservative. For example, fal fied and sequenced. About 90% of the total activity of cipain1 exhibits 71 and 56% homology with its analogs cysteine proteinases in E. histolytica is accounted for by from P. vivax and P. vinckei, respectively. All these pro EhCP1, EhCP2, and EhCP5 [14]. The product of the teinases have a nonconservative region of 30 amino acids gene EhCP5 is considered as the most important cysteine (499529 in falcipain), which is absent in most enzymes proteinase of E. histolytica and, consequently, the main of the papain family. All cysteine proteinases of malarial factor of pathogenicity of this microorganism. It was sup parasites have unusually large sequences. posed that the nonpathogenic species E. dispar lacks this Cysteine proteinases of P. vivax (vivapain2 and gene. However, it was demonstrated that the gene is pres vivapain3) have also been isolated and characterized [6]. ent but inactive [15]. The amino acid sequences of the Vivapains, like falcipains, exhibit maximal activity in acid products of seven genes of cysteine proteinases of E. his media. The character of inhibition of vivapains is typical tolytica are proteins exhibiting 3439 and 3646% homol for proteinases of the papain family. ogy with the amino acid sequences of papain and cathep Cysteine proteinases of the ameba Entamoeba his sin L, respectively, being absolutely conservative in the tolytica. The protozoan E. histolytica is a causative agent positions of the amino acid residues that are essential for of amebiasis. Infection with this microorganism results in the functioning of cysteine proteinases. hemorrhagic colitis and abscess of the liver, sometimes Cysteine proteinases of E. histolytica are synthesized being lethal. Based on biochemical, immunological, and as the inactive precursors containing a hydrophobic signal genetic data, E. histolytica has been subdivided into two peptide of 1214 amino acid residues, a prodomain of 78 species that are identical morphologically but different 82 amino acid residues, a catalytic domain of 216225 genetically: the pathogenic E. histolytica and nonpatho amino acid residues, and a Cterminal region [16]. A sur genic E. dispar. It is important to note that the most sig face cysteine proteinase EhCP12 containing a transmem nificant difference between these two species consists in brane region and an RGD domain for the binding to inte the expression level of cysteine proteinases. Entamoeba grin, which can indicate the participation of the enzyme dispar exhibits a much lower activity of cysteine pro in the adhesion of the parasite cells to the host cells, was teinases compared with E. histolytica. Such a difference is also found (Scheme 1). probably connected with the number of functioning genes Two main reasons for tissue damage during infection encoding cysteine proteinases [7]. Cysteine proteinases by E. histolytica have been determined: lysis of cells by with molecular weights in the range of 1696 kD have proteins that are capable of forming pores and proteolyt been found in different extracts of Entamoeba. The first ic activity of cysteine proteinases leading to the destruc isolated and characterized proteinases were amebapain tion of cells and extracellular matrix. Cysteine proteinas [8] and histolysin [9]. Later, two other cysteine proteinas es can adhere to host cells and destroy matrix proteins es of 27 kD differing in affinity to laminin were described: such as collagens, laminin, and fibronectin, but not amebapains EhCP1 and EhCP2 [10]. elastin [12]. Thus, cysteine proteinases provide not only The substrate specificity of cysteine proteinases of E. proteolytic cleavage of the extracellular matrix proteins, histolytica is typical for the like enzymes of but they also take part in the adhesion to host cells and the papain family [11]. Most of these cysteine proteinas their subsequent lysis. es exhibit the highest activity towards the substrates con Cysteine proteinases of trypanosomatids. Trypano taining arginine residues in the P1 and P2 positions. The soma cruzi is a causative agent of American trypanosomi

prepeptide propeptide mature enzyme

NH2 C H N COOH 1214 amino acid 7882 amino acid 216225 amino acid residues residues residues

Polypeptide chain of cysteine proteinase of E. histolytica. The scheme shows the cathepsinLlike motif ERFNIN in the prodomain and the catalytic cysteine, histidine, and asparagine residues in the region corresponding to the mature enzyme Scheme 1

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 4 RUDENSKAYA, PUPOV asis (Chagas’ disease). Humans are infected when bitten forms. The enzyme is localized in lysosomes, but presum by bloodsucking insects, which are the main carriers of ably it can be released to the surface of the cell. The role the disease. The main cysteine proteinase of the parasite of in the nutrition of the parasite is obvious. is cruzipain belonging to the papain family (clan CA, However, it also plays an important role in the penetra family C1). tion of the trypomastigotes into the mammalian cell [23, Cruzipain is an that cleaves casein, 24]. bovine albumin, and hemoglobin (optimal range of pH is Leishmania is a protozoan that is a causative agent of 35) and different synthetic chromogenic and fluorogenic different internal and skin diseases of mammals in the substrates (at pH 79). Cruzipain binds and hydrolyzes tropics and subtropics. The intermediate hosts of the par peptide substrates containing both arginine and phenyl asite and its carriers are bloodsucking Diptera. alanine in the P2 position, the efficiency of these process Investigations of L. mexicana showed that it possesses es differing 15fold at pH 6.0 [17]. The specificity of several isoforms of cysteine proteinases exhibiting similar cruzipain is higher at low pH values, this being connect biochemical properties [25, 26]. Type I proteinases ed with protonation of the amino acid residues of the (lmcpb) are cathepsin Llike proteinases characterized by pocket that is responsible for the substrate specificity of an unusual Cterminal domain. Type II proteinases the enzyme [18]. (lmcpa) are also cathepsin Llike proteinases, but their The enzyme is inhibited by organomercuric biochemical properties are poorly characterized. Type III reagents, E64, TLCK, leupeptin, and some peptidyl proteinases are cathepsin Blike enzymes. Type I pro derivatives of chloro and fluoromethane. Cystatins, teinases includes seven groups of enzymes (AH). Some stefins, and kininogens are also strong inhibitors of the of them exhibit differences in substrate specificity, partic proteinase [17, 19]. The enzyme is stable within the range ularly selectivity in P1 position [27]. of pH 4.59.5 [20]. For L. mexicana, it was shown that a high content of Cruzipain is encoded by several genes encoding a cysteine proteinases is characteristic for the amastigote signal peptide and a proprotein. The mature enzyme form of the parasite that is located in the phagolysosome (molecular weight 40 kD including two mannose polysac compartment of the mammalian macrophages. Thus, it charide chains) includes a catalytic part (215 amino acid can be supposed that cysteine proteinases are important residues) that is homologous to and a Cter for the survival of the parasite in the host organism. minal domain (130 amino acid residues) that is unique for Cysteine endopeptidases of trichomonads. Represen the cysteine proteinases of trypanosomatids. The Cter tatives of the protozoa Trichomonas, Tritrichomonas, and minal domain contains a core (76 amino acid residues) Giardia infest the urogenital and gastrointestinal tracts of stabilized by disulfide bridges, an Nterminal segment (27 human and other mammals. Cysteine proteinases of tri amino acid residues), and a highly hydrophilic Ctermi chomonads do not have trivial names and are usually nal tail (27 amino acid residues). The Nterminal seg identified by their molecular weight determined by SDS ment contains seven proline residues and presumably acts PAGE. The proteolytic activity of cysteine proteinases of as a hinge between the catalytic and Cterminal domains. these parasites is one of the highest among the cysteine It should be noted that the Cterminal domain is the main proteinases of the studied groups of protozoa. immunogenic part of the cruzipain molecule. So, after At least four genes of cysteine endopeptidases were infection the host antibodies bind to the Cterminal found in the genome of Trichomonas vaginalis [28], and domain of the proteinase, but the catalytic part of the nine genes encoding these enzymes were revealed in the cruzipain in the resulting complex is not inhibited. genome of Tritrichomonas foetus (a parasite of cattle) [29]. Consequently, the resulting immune complex retains The molecular weights of the proteinases encoded by enzymatic activity [21]. these genes are in the range of 1834 kD. Proteinases of The spatial structure of recombinant cruzipain in a Giardia are less well investigated. complex with the synthetic inhibitor ZPheAlaCH2F Proteinases of trichomonads can hydrolyze numer has been determined [22]. The catalytic part of the ous proteins including possible physiological substrates enzyme consists of one polypeptide chain that forms two such as hemoglobin, immunoglobulins [30], and proteins domains. The first domain exhibits predominantly α of extracellular matrix [31]. structure (L domain), and the second has antiparallel β Undoubtedly, cysteine proteinases play an important structure (R domain). The catalytic triad Cys25His159 role in the interactions of T. vaginalis with the host cell Asn175 is located in the active site situated in the cleft and contribute to the total pathogenicity of the parasite. between the two domains. The general model of the struc It was shown that the activity of 43kD cysteine pro ture and the arrangement of the amino acid residues in teinase located on the cell surface is necessary for the the active site are analogous to those of papain. adhesion of the parasite to the host cells [32]. Due to their The expression of cruzipain is different in the four ability to cleave immunoglobulins [33], cysteine pro main stages of the life cycle of the parasite: its activity is teinases of trichomonads can be involved in the suppres 10100fold higher in the epimastigotes than in other sion of the immune response of the host organism.

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 5 Cysteine Proteinases of Bacteria causative agent of myonecrosis (gaseous gangrene) com plicated by profuse necrosis of tissues. Clostripain is the Not all bacteria are capable of releasing cysteine pro main proteinase of C. histolyticum that belongs to clan CD teinases into the environment. For example, representa and forms its own family C11. The family includes the tives of the genus Bacillus do not possess such proteinas enzymes hydrolyzing selectively the peptide bonds after es, having only cysteine exopeptidases. In the genomes of arginine residues depending on the presence of calcium the most studied representatives of Bacillus used in the ions [37]. Analysis of the specificity of clostripain in microbiological industry (B. subtilis, B. cereus, and B. experiments on hydrolysis of parvalbumin containing one thuringiensis) the genes of cysteine proteinases have not Arg residue and 12 Lys residues demonstrated that the been revealed. polypeptide chain is cleaved only after the Arg residue. Cysteine proteinases of S. aureus—staphylopain and The enzyme can also hydrolyze the bonds formed by the staphopains. Staphylococci are causative agents of carboxyl group of Lys, but with much lower rate [38]. numerous diseases that are accompanied by extensive Clostripain is able to hydrolyze the peptide bonds with damage to connective tissues. Staphylopain is a cysteine proline in the P1′ position [39]. proteinase, one of three proteinases secreted by staphylo Clostripain is a heterodimeric protein composed of coccus. Staphylopain is a 13kD (by results of SDS heavy and light chains of 43 and 15.4 kD, respectively PAGE) basic protein of 175 amino acid residues with pI [38]. The Nterminal amino acid residues of the heavy of 9.4 [34]. This proteinase is characterized by a wide sub and light chains are alanine and asparagine, respectively. strate specificity that is independent of the nature of the The chains are connected by noncovalent interactions. amino acid residues adjacent to the cleaved peptide bond. The two polypeptide chains are encoded in a single gene. Reducing agents are required to maintain the enzyme in The translation product of the gene contains a signal pep the active state. The maximal activity determined by the tide (27 amino acid residues), a propeptide (23 amino hydrolysis of elastin was observed at pH 6.5. acid residues), the light chain (131 amino acid residues), Staphylopain is inhibited irreversibly by E64, heavy a linker peptide (nine amino acid residues), and the heavy metal ions, and oxidants. The proteolytic activity is also chain (336 amino acid residues) [40]. Clostripain is syn inhibited by Tkininogen and α2macroglobulin, but not thesized as the inactive preproenzyme. In the presence of by human cystatin C and kininogen. calcium ions, the core protein (residues 51526) catalyzes Two other cysteine proteinases, staphopains A and B the removal of the linker nonapeptide (residues 182190) secreted by S. aureus, exhibit 47% homology in their independently of its amino acid composition [41]. The amino acid sequences [35]. The spatial structure of catalytic residues are Cys231 and His176, and the sub staphopain A has been determined. Based on this struc strate specificity for the P1 position is determined by the ture, it can be concluded that staphopains are longdis presence of Asp229. tance members of the papain family, the closest structure Clostripain is inhibited by oxidants, thiolblocking being observed in cathepsin B. The spatial structure of agents, EDTA, Co2+, Cu2+, Cd2+, and heavy metal ions. staphopain B [36] is similar to that of staphopain A. Citrate, borate, and Tris also partially inhibit the protein Like most cysteine proteinases of the papain family, [42]. The strongest protein inhibitors are leupeptin and the staphopain molecule consists of two domains (L and antipain (both contain Arg in the P1 position). Protein Rdomains), the active site being located in the cleft inhibitors of some serine proteinases containing Lys (soy between these domains. The Ldomain of staphopains is bean Kunitz inhibitor, aprotinin, lima bean trypsin more compact than in most other papainlike proteinas inhibitor) and Arg (soybean Bowman–Birk inhibitor) es. The Rdomain is represented by an antiparallel residues in the P1 position are also strong inhibitors of pseudobarrel that differs significantly from the barrels of clostripain. In the case of the enzyme inactivation by other cysteine proteinases including cathepsin B. It these inhibitors, no preference was observed towards Lys should be noted that the binding of the protein inhibitor or Arg. In spite of the fact that the character of inhibition (staphostatin) by the staphopain molecule is similar to the of clostripain is similar to that of trypsin, the binding of binding of its protein substrate. A glycine residue is locat inhibitors by clostripain is not analogous to their binding ed in the P1 position of the inhibitor, and the replacement by trypsin [43]. It is of importance that histatin 5, a low of this residue transforms the inhibitor into a substrate. molecular weight protein (24 amino acid residues) from The connection between the pathogenicity of S. the family of the histidinerich saliva proteins, is a very aureus and the proteolytic activity of the excreted pro strong inhibitor of clostripain. Histatin 5 is not an analog teinases has not yet been determined. However, consider of the substrate, although it competes with the substrate ing that staphylopain possesses elastinolytic activity, it may for the active site of the enzyme [44]. Nothing is known be assumed that that the enzyme takes part in the destruc concerning the biological function of clostripain. tion of the host tissues during staphylococcal infection. Cysteine proteinases of Porphyromonas gingivalis. Cysteine proteinase of Clostridium histolyticum— Porphyromonas gingivalis is a gramnegative anaerobic clostripain. The anaerobic bacterium C. histolyticum is a bacterium that is considered as the main cause of the

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 6 RUDENSKAYA, PUPOV development of adult parodontitis. Porphyromonas gingi the proteolytic and amidolytic activities of purified RGP valis secrets numerous proteinases including aminopepti 2 and HRGP constitute 1% of their activity in the pres dases, carboxypeptidases, di and tripeptidylpeptidases, ence of 10 mM cysteine, which is the most efficient acti oligopeptidases, and several endopeptidases [45]. All vator of gingipains. The amidolytic activity also depends secreted proteinases (gingipains, periodontain, Prtpro on the presence of glycylglycine, which is used to reveal P. tease, and Tpr) belong to the class of cysteine gingivalis in parodontitis and to differentiate between peptidases and play the main role in the pathogenesis dur RGP1 and RGP2 that are stimulated by glycylglycine in ing parodontitis. Gingipains R and K are specific towards different ways. ArgXaa and LysXaa bonds, respectively, and constitute Gingipain K also exists in several forms encoded by 85% of the total proteolytic activity of the pathogen [46]. the single gene kgp [50]. The product of its expression is a Gingipain R is specific towards substrates containing proteinaseadhesive polypeptide of 186.8 kD that is sub Arg in the P1 position [47]. It has been determined that jected to posttranslational modification on arginine the argininespecific activity is due to two proteinases residues, yielding numerous forms of gingipain K encoded by two genes, rgpA and rgpB. The first proteinase (Scheme 2). The activity of gingipain K towards synthet is encoded by the single gene rgpA, but was isolated in two ic substrates is maximal at pH 8.0. The characters of inhi forms: a single chain of 50kD (RGP1) and a high bition of gingipains R and K are similar, but there are molecular weight protein of 95kD (HRGP). The latter some differences—gingipain K is not inhibited by zinc form consists of the catalytic domain (50 kD) fused with ions, E64, leupeptin, antipain, and EDTA, and is not the hemagglutinin adhesive domain [48]. The second activated by glycylglycine. Similarly to gingipain R, cys argininespecific proteinase (RGP2) is encoded by the teine is the most efficient activator of gingipain K. gene rgpB and isolated as four isoforms with different iso Gingipain K exhibits specificity towards the amino acid electric points [49]. All isoforms have identical Ntermi residue in the P1 position and does not hydrolyze the nal sequences and similar molecular weight (48 kD), bonds with lysine and arginine in the P2 position. which indicates that the differences must be connected The crystal structure of the complex of gingipain R with modifications of the side chains of amino acid (RgpB) with the inhibitor DPhePheArgchloromethyl sequences. This does not affect the enzymatic properties ketone has been determined [51]. The enzyme consists of of gingipains, since the isoforms do not differ in their sta two domains: the catalytic domain (351 amino acid bility, pH optima, and kinetic characteristics. residues) and the domain of the immunoglobulin family Gingipain R is inhibited by chelating agents (EDTA IgSF (84 amino acid residues). The catalytic domain is and EGTA), zinc ions, iodoacetic acid, iodoacetamide, subdivided into subdomains A and B. The surface Nethylmaleimide, E64, leupeptin, and antipain. between these subdomains is relatively hydrophilic and Cystatin C does not inhibit gingipain R. TLCK and tosyl contains a cluster of charged amino acid residues with Lphenylalanine chloromethyl ketone (TPCK) inhibit coordinated calcium ion providing the correct spatial ori gingipain due to their ability to modify cysteine residues. entation of the subdomains. The Bsubdomain contains It should be noted that SDS stimulates the gelatinolytic the catalytic dyad Cys244His211. The tertiary structure activity of this enzyme. In the absence of reducing agents, of the catalytic domain of gingipain R is similar to that of

M 227 R 1 R 492 R 909 K 1044 R 1202 K 1477

NH2 COOH

propeptide proteinase domain (RGP) adhesive domains (HGP)

M 228 R 1 R 509 R 927 K 1062 R 1220 K 1495

NH2 COOH

propeptide proteinase domain (KGP) adhesive domains (HGP)

Polypeptide chain of gingipains RGP and KGP: the prodomain is shown dark, the proteinase domain is clear, adhesive domains are shad ed. Some amino acid residues involved in the proteolytic cleavage of the peptide are indicated Scheme 2

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 7 caspase1 and caspase3. Also, caspases and gingipain [55]. The proteinase is inhibited by iodoacetate, iodoac contain identically situated histidine and cysteine etamide, Nethylmaleimide, pchloromercuribenzoate, 2+ 2+ + residues, equivalent oxyanion cavities, and S1pockets of heavy metal ions (Cu , Hg and Ag ), as well as atmos similar shape and location that determine the substrate pheric oxygen [56]. specificity of the enzymes. The spatial structure of the streptopain proenzyme Gingipains R are capable of activating prothrombin has been determined [57]. The characteristic features of [52]. Gingipains R (HRGP and RGP2) cannot cleave packing of the polypeptide chain suggest that streptopain hemoglobin, while gingipain K cleaves this protein yield belongs to the papain family. The molecule of this cys ing different low molecular weight fragments. Gingipain teine proteinase consists of two typical domains. The first K is capable of hydrolyzing transferrin, while gingipain R domain is formed by αhelixes, and the second one by hydrolyzes it very slowly. Gingipain R can hydrolyze the antiparallel βstructures. The enzyme molecule contains C3 component of the complement yielding C3b and the dyad CysHis, but not the triad CysHisAsn. The C3alike fragments, which are significant mediators of profragment of the immature enzyme shifts the catalytic inflammation [53]. His195 from the active site and prevents its interaction The other cysteine proteinase isolated from P. gingi with Cys47, thus maintaining the proteinase in the inac valis is peridontain [54]. The gene of peridontain encodes tive state. a protein of 93 kD (843 amino acid residues). The molec Streptopain is one of the most important factors of ular weight of peridontain determined by gel filtration pathogenicity of streptococci [58]. SpeB contains the constitutes 75 kD, while SDSPAGE yields two bands motif ArgGlyAsp (RGD) that is required for the bind corresponding to 55 and 20 kD. Thus, the mature enzyme ing to integrins of the host cells, allowing the microor is a heterodimer with the active site in the large subunit. ganism to adhere to the host cell surface. Streptopain is The pI value of peridontain is 5.3. The enzyme does not capable of hydrolyzing fibronectin and vitronectin, pro hydrolyze azocasein, casein, lysozyme, collagen, fibrin, teins of extracellular matrix, but does not cleave laminin plasminogen, and fibrinogen. However, when lysozyme [59]. SpeB hydrolyzes the hinge region of IgG and the was reduced and carboxymethylated, complete cleavage heavy chains of immunoglobulins of all types [60]. of the protein was observed after less than 10 min. Thus, Streptopain activates the 66kD matrix metalloproteinase it can be concluded that peridontain hydrolyses dena cleaving type IV collagen, and also activates some pro tured proteins or easily accessible polypeptide chains, but teins of S. pyogenes: M1 protein, C5a peptidase, and IgG cannot hydrolyze proteins with pronounced secondary binding protein. SpeB cleaves the inactive precursor of and tertiary structures. Peridontain is active within the human interleukin 1β yielding biologically active inter range of pH 6.09.0, exhibiting maximal activity at leukin 1β [61]. Thus, streptopain performs numerous pH 7.58.0. The enzyme is absolutely inactive in the pres functions and is an important factor of pathogenicity of S. ence of reducing agents such as cysteine, βmercap pyogenes. toethanol, and DTT. Ca2+ does not stabilize the molecule of peridontain, and glycylglycine does not activate the enzyme. Peridontain is inhibited by leupeptin, TLCK, Cysteine Proteinases of Viruses iodoacetamide, and E64. These results suggest that peri dontain is closer to the papain family than gingipains. Viruses reproduce and function using the genetic Porphyromonas gingivalis is asaccharolytic organism apparatus of the host cell. The extreme extent of para obtaining carbon and energy mostly by cleaving proteins. sitism has led to the significant simplification of the virus Evidently, gingipains possessing a high specificity are structure, reducing the number of proteins required for capable of cleaving large proteins yielding peptides, while their existence compared to other organisms. For exam several proteinases of wide specificity including peridon ple, the grippe virus does not produce proteolytic tain are required for the complete cleavage of these pep enzymes, and its genome lacks in the structures corre tides. sponding to proteolytic enzymes. Nevertheless, some Cysteine proteinase of Streptococcus pyogenes—strep virus families are capable of producing proteinases topain. Streptococcus pyogenes is one of the widespread including cysteine proteinases that are important factors bacterial pathogens of humans. Diseases caused by strep of their pathogenicity. tococcal infection vary from relatively mild, such as Proteinase L of footandmouth disease virus. Foot pharyngitis, to severe diseases that are dangerous for andmouth disease virus (FMDV) belongs to the genus of human life. Cysteine proteinase of S. pyogenes (SpeB) is autoviruses from the picornavirus family. The genome of the first cysteine proteinase isolated from prokaryotes. is represented by a plusstrand RNA. After Streptopain, also known as exotoxin B, is present in all penetrating into the host cell, the RNA is translated strains of S. pyogenes and constitutes 95% of all secreted yielding a long polyprotein. Then the polyprotein is proteins. Streptopain hydrolyzes the bonds in the pep processed yielding separate viral proteins. Viruses of two tides containing hydrophobic residues in the P1 position genera of this family (auto and cardioviruses), as well as

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 8 RUDENSKAYA, PUPOV some viruses of indefinite systematic position, for exam Peptidase Lb is a protein of 21 kD (173 amino acid ple Equine rhinovirus (ERV), encode in their genome a residues) exhibiting pI of 4.8. XRay analysis revealed in special leader (L) protein [62]. the active site of the enzyme two amino acid residues Protein L was named because of its position in the (Cys51 and His148) that are essential for its functioning polyprotein and due to the early emergence in the life [64]. cycle of the virus: it corresponds to the 5′region that pre Protein L is cleaved autocatalytically from the trans cedes the region of structural genes encoding the virus lated viral polyprotein (Scheme 3b). Protein L is also capsid proteins (Scheme 3a). involved in the cleavage of translation initiation factor Protein L of FMDV is a proteolytic enzyme that eIF4G, a cell protein also known as p220 or eIF4g that belongs to clan CA of the papainlike cysteine proteinas is required for the initiation of capdependent protein es [63]. FMDV is characterized by the presence of two synthesis [65]. The Lb form of peptidase L cleaves two dif start codons AUG within the single reading frame, so two ferent bonds in the eIF4G molecule: first, different L proteins can be formed, these being designat Gly479–Arg480, and then Lys318–Arg319. ed as Lab and Lb. Protein Lb, the smaller of these two pro The main function of peptidase Lb in cells infected teins, is the main synthesized proteinase L. by the virus is the cleavage of eIF4G protein that is

a

poly(A) b

poly(A)

ciscleavage transcleavage c mRNA of the host cell mRNA of the host cell

poly(A) poly(A)

viral RNA proteinase L poly(A)

Genome of FMDV and the main functions of the encoded proteinase L. a) Genome plusstrand RNA of FMDV. Names of the genes are designated. The 5′terminus of the genome RNA contains the covalently bound protein VPg, and the 3′terminus is polyadenylated. The IRES element responsible for the capindependent translation of the virus RNA is located in the nontranslated region at the 5′terminus. b) Translation of mRNA of FMDV: autocatalytic cleavage of proteinase L can be catalyzed by the intramolecular (cis) or intermolecular (trans) mechanisms. The ribosomal subunits (40S and 60S) are designated as ovals. Proteinase L is shown by dashed oval and rectangle; the site of the cleavage is shown by the arrow. c) Role of eIF4G factor in the capdependent initiation of protein synthesis and its proteolytic cleavage by proteinase L. The cell mRNA cannot be involved in the formation of the initiation complex during the translation. However, the viral RNA can interact directly with eIF3 without capbinding eIF4E and eIF4G proteins yielding the initiation complex Scheme 3

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 9 required for the efficient translation of the capdependent Endopeptidase PCP1 is supposed to play an impor mRNA of the host cell (Scheme 3c). The cleavage of this tant role in the processing of the polyprotein through both protein ceases the synthesis of most host proteins. cis and transmechanisms yielding nonstructural pro However, the genome mRNA of the virus is translated by teins connected with the replication of the viral genome. the capindependent mechanism and is not suppressed The existence of the regions of proteinase PCP1 was pre after the cleavage of eIF4G. The changed variant of dicted, but was not functionally demonstrated for three FMDV with a deletion in the region encoding peptidase other representatives of coronaviruses with the complete

Lb [66] is less productive compared to the wild type, sug ly sequenced gene of RDRP: avian infectious bronchitis gesting that peptidase Lb is necessary for the normal pro virus, human coronavirus 229E, and swine transmissible duction of the virus and for the development of the dra gastroenteritis virus. Other members of the superfamily matic cytopathic effects connected with FMDV infection that includes arteriviruses and coronaviruses, as previous [67]. ly described, also encode regions that are similar to those Papainlike endopeptidases of coronaviruses. Coro of PCP1 [72]. naviruses comprise a family of viruses containing plus Papainlike endopeptidase of Sindbis virus. Sindbis strand genome RNA of large size (2832 kb) encoding virus (Togaviridae family, Alphavirus genus) is a virus of their own RNAdependent RNA polymerase (RDRP). animals containing in the genome a single plusstrand During the translation of the viral genome, RDRP is RNA. Sindbis virus has a wide range of hosts and is wide translated as a protein precursor, a polyprotein that must spread in different tissues, also infecting numerous cell be subjected to the proteolytic cleavage for the correct types in culture including cells of mammals, birds, and replication of the viral genome. In the wellstudied repre mosquitoes. The translation of the genome RNA yields sentative of coronaviruses mouse hepatitis virus (MHV), four nonstructural proteins that are formed in the cyto RDRP encoded inside the genome region (22 kb) results plasm of the infected cells as two precursors: polyproteins in the translation of a protein precursor of 750 kD [68]. P123 (1896 amino acid residues) and P1234 (2513 amino The presence of different functional regions in this acid residues, formed due to readthrough of the stop polyprotein was predicted. There are three regions corre codon in the 1897 position). Further, the polyprotein sponding to proteinases: two of them are papainlike pro P1234 is processed by the viral nsP2 proteinase yielding teinases, and the third is picornainlike proteinase. The (through a number of intermediate products) four prod amino acid sequence of the first papainlike cysteine pro ucts: nsP1, nsP2, nsP3, and nsP4 [73] (Scheme 4). teinase (PCP1) exhibits a limited similarity with the Sindbis virus proteinase nsP2 is a polypeptide of 807 sequence of papain. The similarity mainly consists in the amino acid residues encoded by nucleotides 16804100 of presence of the conservative dipeptide CysTrp contain the viral genome. The Nterminal part of the protein is ing the catalytic cysteine residue and in the interval supposed to be helicase that is required for replication of between the catalytic cysteine and the histidine residue RNA, and the Cterminal region between the 460 and [69]. 807 residues corresponds to the proteinase [74]. The sup The second papainlike region (PCP2) exhibits lit posed catalytic pair Cys481 and His558 is accompanied tle similarity with papain, retaining only catalytic residues by the Trp559 residue that is required to stabilize the con Cys and His and the interval between them. Therefore, formation of the active site of the enzyme [75]. Currently, proteinases PCP2 of hepatitis C are now included into a the proteinase is referred to clan CN. All known separate family of clan CM. alphaviruses encode nsP2 proteinases exhibiting more It has been demonstrated that the PCP1 region is than 50% similarity in the amino acid sequences. necessary for the emergence of p28 protein. In vitro, p28 The substrates for proteinase nsP2 of Sindbis virus splits off from the polyprotein cotranslationally, this sup are only the nonstructural polyproteins P123 and P1234, porting the cismechanism of the cleavage. The proteo and the products of their processing (P12, P23, and P34). lytic activity of proteinase PCP1 is connected with the The site of the cleavage has a consensus sequence liberation from the polyprotein of another protein (p65) (Ala/Ile)Gly(Ala/Cys/Gly)(Ala/Tyr). that is adjacent to p28 [70]. The nsP2 proteinase and protein precursors contain Commercial inhibitors of proteinase PCP1 sup ing nsP2 are active proteolytic enzymes differing in their pressing its activity have been found. For example, leu substrate specificity. For example, P1234 can cleave itself peptin and zinc chloride block the proteolytic activity of autoproteolytically by the cismechanism yielding two PCP1 towards the cleavage of p28 in vitro, but none of separate proteins, P123 and nsP4, while P123 can be them showed an efficient blocking of the viral protease in processed only by the transmechanism. The cleavage of a vivo due to their high toxicity. The membranepenetrating P123 molecule by another molecule of P123 or P12 results derivative of irreversible inhibitor of cysteine proteinases in the formation of two proteins, nsP1 and P23, while the E64 (E64d) is capable of blocking efficiently the cleav cleavage of P123 by P23 yields P12 and nsP3. The proteins age of p65 from the polyprotein in vivo, exhibiting no that lost the nsP3 region during processing, namely nsP2 effect on the proteolytic cleavage of protein p28 [71]. or P12, are incapable of cleaving the site nsP3/nsP4. Such

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 10 RUDENSKAYA, PUPOV

plusstrand RNA replicase nsP2 proteinase

minusstrand RNA replicase nsP2 proteinase nonstructural polyprotein

stop codon nsP2 proteinase readthrough

5′ translation 26S mRNA 3′ ORF of nonstructural proteins

genome plusstrand RNA stop codon poly(A)

Organization and processing of nonstructural proteins formed during translation of the genome RNA of Sindbis virus Scheme 4 changes in the substrate specificity of proteinase nsP2 taining viruses (pico, very little, and rna stands for RNA). that depend on the original polyprotein allow fine regula They comprise one of the most numerous and important tion of RNA synthesis during the infection cycle. families of causative agents of diseases of humans and During the early stages of infection, proteins P123 farm animals. FMDV, poliovirus, and human hepatitis A and nsP4 together with some proteins of the host organ virus are picornaviruses. The main causative agents of ism form a replicase complex synthesizing the minus coldrelated diseases (rhinoviruses) are also representa strand RNA of the viral genome. Proteolytic decomposi tives of this family. tion of P123 within the replicase complex changes its The main strategy of protein synthesis used by picor specificity. As a result, the complex acquires the ability to naviruses is the following: the genome plusstrand RNA synthesize efficiently the plusstrand RNA, but cannot contains a single extended reading frame encoding a take part in the replication of the minusstrand RNA [76, polyprotein that is cleaved during translation, so the full 77]. Since during infection the concentration of the length protein is not formed. The cleavage cascade results transacting proteinases grows, the nonstructural in 11 or 12 products. polyprotein is cleaved at the nsP2/nsP3 site during its The first viral protein with proteolytic activity was synthesis (cotranslationally) yielding the processing 3C protein of poliovirus [78]. Later, this proteolytic activ products nsP1, nsP2, nsP3, and nsP34. At this stage, the ity was identified in all genera of the picornavirus family, synthesis of the minusstrand RNA is blocked, since the and the corresponding protein was named picornain 3C. processing products cannot form the replicase that is Picornain 3C is a protein containing from 180 (human capable of synthesizing the minusstrand RNA. The use rhinoviruses) to 220 (hepatitis A virus) amino acid of proteinase nsP2 for regulation of genome RNA synthe residues. The pI value of the protein is in the range of 8.0 sis is an interesting evolutionary adaptation of viruses that 10.0. The active site includes the residues Cys, His, and have to encode in their genome an additional protein pro Asp. There are extended regions of homology in picor viding the correct cleavage of the polyprotein in the cyto nain 3C and chymotrypsinlike serine proteinase (family plasm of the infected cells. C1). The structure of the active site of picornain 3C and Thus, besides the processing and liberation of pro the packing of the polypeptide chain that is similar to that teins required for the reproduction of the virus, pro of chymotrypsin suggest that picornain 3C is an example teinase nsP2 from Sindbis virus is important for the fine of evolutionary relationship of cysteine and serine pro regulation of the genome RNA replication, thus provid teinases [79]. ing high infectivity. Picornain 3C, as a typical representative of cysteine Cysteine proteinase of picornaviruses—picornain 3C. proteinases, is inhibited by Nethylmaleimide, iodoac Picornaviruses are the smallest of the known RNAcon etamide, and Hg2+, but is not inhibited by E64.

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 11 Picornain 3C cleaves sites containing Gln and Gly cleavage in the PV1/2A site of the viral polyprotein. The residues in the P1 and P1′ positions, respectively. cleavage is an intramolecular process, since the 2A pro However, these residues and their environment can differ tein is cut out before the translation of the viral polypro in different genera of viruses. For the successful process tein is finished. It should be noted that 2A proteinase is ing, not only the primary sequence of the site of cleavage also responsible for the cleavage of the eukaryotic transla is important, but also the character of the second struc tion initiation factor eIF4G (also known as p220 or eIF ture. It was demonstrated that the GlnGly pairs situated 4g) [86]. Picornain 2A catalyzes both intramolecular (cis) between two βbarrels are hydrolyzed more efficiently and intermolecular (trans) cleavage of the viral polypro than pairs situated inside ordered secondary structures tein. [80]. Picornain 2A is a protein with a single polypeptide Picornain 3C is the main enzyme involved in the chain (149 amino acid residues) exhibiting homology processing of the polyprotein in picornaviruses, providing with the amino acid sequence of serine proteinases, this from 8 to 10 disruptions depending on the genus of the suggesting the similarity of its spatial structure with that virus. Some differences between the enzymes from differ of representatives of the bacterial αlyticlike proteinase ent genera of picornaviruses concern the substrate speci family [87]. ficity and number of recognized sites of cleavage. Similarly to picornain 3C, picornain 2A is inhibited Besides the viral polyprotein, picornain 3C can by Nethylmaleimide, iodoacetamide, and Hg2+, but not cleave different proteins of the host cell. For example, by E64. The inhibitors of serine proteinases chymostatin proteinase 3C of poliovirus blocks the transcription of the and elastatin also inhibit efficiently the activity of picor host proteins by specific proteolysis of the transcription nain 2A [88]. factors TFIIIC [81] and TFIID [82]. One of the proteins Picornain 2A performs at least two main functions in associated with microtubules (MAP4) is also a substrate the replication cycle of polioviruses. It catalyzes the fist of the poliovirus picornain 3C [83]. The cleavage of this stage of proteolytic processing of the viral polyprotein, protein results in destruction of the cytoskeletal micro cleaving the site located between the Cterminus of PV1 tubules of the infected cell. The picornain of FMDV is and its own Nterminus. Besides, picornain 2A takes part involved in proteolysis of histone H3 [84]. Thus, picor in the cleavage of 3CD protein yielding the molecules 3C′ nain 3C is responsible for many severe cytopathic effects and 3D′ [89]. Picornain 2A from and rhi during viral infection. noviruses is also responsible for the cleavage of eIF4G, a Picornain of polioviruses binds specifically to hairpin cell protein involved in the initiation of protein syntheses. structures on the 5′ ends of the genome RNA [85]. The Picornain 2A from enteroviruses and rhinoviruses picornain precursor 3CD protein binds to the viral RNA differs from picornain 3C by smaller size and by substrate much more efficiently than the mature 3C protein. specificity. No structural similarity was observed between Presumably, such binding of the picornain precursor 3CD picornain 2A from enteroviruses and rhinoviruses and to the RNA provides correct spatial orientation of the proteinase L of FMDV or 2A protein of encephalomyo viral 3D polymerase for the initiation of genome RNA carditis virus. synthesis. Autoproteolysis of 3CD protein directly after its binding to the genome RNA releases the polymerase The most widespread and investigated proteinases 3D in the required site. The supposed RNA are cysteine proteinases from plants. As seen from this of 3C protein is on the opposite side of the active site side review, these proteinases are also found in pathogenic and is enriched with basic amino acid residues. protozoa and microorganisms, where these enzymes are Thus, picornain 3C is one of the important factors of important factors of pathogenicity. It is of interest that the pathogenicity of picornaviruses that determines not only closely related pathogenic and nonpathogenic species processing and maturation of the viral polyprotein, but differ mostly in the expression of cysteine proteinases, as also severe cytopathic effects such as inhibition of the demonstrated for E. histolytica and E. dispar, and also for transcription and breaking of the microtubular cytoskele T. cruzi and T. rangeli. ton. Besides, this proteinase recognizes specifically and Based on the homology of the primary and spatial binds to certain hairpin structures of the viral RNA in the structures, most cysteine proteinases of protozoa can be form of the protein precursor 3CD, this providing the referred to clan CA of papain. The bacterial cysteine pro correct spatial orientation of the viral polymerase 3D on teinases belong to clan CD of cysteine proteinases includ the molecule of genome RNA. ing caspases and legumins. The common feature of the Picornain 2A of poliomyelitis virus 1 (PV1). The enzymes of this clan is a high selectivity that is deter assumption that 2A protein of PV1 possesses proteolytic mined by the amino acid residue in the P1 position: argi activity was made based on the revelation of specific nine for clostripain, arginine and lysine for gingipains R picornain 3C motifs in the sequence encoding this pro and K, respectively, aspartate for caspases, and asparagine tein. Later it was confirmed that the disruption of the for legumains. Besides, they all are not inhibited by E64, sequence encoding 2A protein inhibited the proteolytic an irreversible inhibitor of most cysteine proteinases. The

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 12 RUDENSKAYA, PUPOV reversible inhibition of clostripain and gingipain R by this 24. Cazzulo, J. J., Stoka, V., and Turk, V. (1997) Biol. Chem., reagent can be explained by the fact that this compound 378, 110. is an analog of arginine. 25. Robertson, C. D., and Coombs, G. H. (1990) Mol. Biochem. Parasitol., 42, 269276. 26. Souza, A. E., Waugh, S., Coombs, G. H., and Mottram, J. This work was supported by the Russian Foundation C. (1992) FEBS Lett., 311, 124127. for Basic Research (050449087). 27. Mottram, J. C., Souza, A. E., Hutchison, J. E., Carter, R., Frame, M. J., and Coombs, G. H. (1996) Proc. Natl. Acad. Sci. USA, 93, 60086013. REFERENCES 28. Mallinson, D. J., Lockwood, B. C., Coombs, G. H., and North, M. J. (1994) Microbiology, 140, 27252735. 1. Rosenthal, P. J., McKerrow, J. H., Aikawa, M., Nagasawa, 29. Mallinson, D. J., Livingstone, J., Appleton, K. M., Lees, S. H., and Leech, J. H. (1988) J. Clin. Invest., 82, 1560 J., Coombs, G. H., and North, M. J. (1995) Microbiology, 1566. 141, 30773085. 2. Shenai, B. R., Sijwali, P. S., Singh, A., and Rosenthal, P. J. 30. Parenti, D. M. (1989) J. Infect. Dis., 160, 10761080. (2000) J. Biol. Chem., 275, 2900029010. 31. Williams, A. G., and Coombs, G. H. (1995) Int. J. 3. Rosenthal, P. J., and Nelson, R. G. (1992) Mol. Biochem. Parasitol., 25, 771778. Parasitol., 51, 143152. 32. Arroyo, R., and Alderete, J. F. (1989) Infect. Immun., 57, 4. Hanspal, M. (2000) Biochim. Biophys. Acta, 1493, 242245. 29912997. 5. Sijwali, P. S., Shenai, B. R., Gut, J., Singh, A., and 33. Provenzano, D., and Alderete, J. F. (1995) Infect. Immun., Rosenthal, P. J. (2001) Biochem. J., 360, 481489. 63, 33883395. 6. Na, B. K., Shenai, B. R., Sijwali, P. S., Choe, Y., Pandey, 34. Potempa, J., Dubin, A., Korzus, G., and Travis, J. (1988) J. K. C., Singh, A., Craik, C. S., and Rosenthal, P. J. (2004) Biol. Chem., 263, 26642667. Biochem. J., 378, 529538. 35. Rzychon, M., Sabat, A., Kosowska, K., Potempa, J., and 7. Tannich, E., Scholze, H., Nickel, R., and Horstmann, R. Dubin, A. (2003) Mol. Microbiol., 49, 10511066. D. (1991) J. Biol. Chem., 266, 47984803. 36. Filipek, R., Rzychon, M., Oleksy, A., Gruca, M., Dubin, 8. Scholze, H., and Schulte, W. (1988) Biomed. Biochim. Acta, A., Potempa, J., and Bochtler, M. (2003) J. Biol. Chem., 47, 115123. 278, 4095940966. 9. Luaces, A. L., and Barrett, A. J. (1988) Biochem. J., 250, 37. Kembhavi, A. A., Buttle, D. J., Rauber, P., and Barrett, A. 903909. J. (1991) FEBS Lett., 283, 277280. 10. Li, E., Yang, W. G., Zhang, T., and Stanley, S. L., Jr. (1995) 38. Gilles, A. M., Imhoff, J. M., and Keil, B. (1979) J. Biol. Infect. Immun., 63, 41504153. Chem., 254, 14621468. 11. Jacobs, T., Bruchhaus, I., Dandekar, T., Tannich, E., and 39. Ullmann, D., and Jakubke, H. D. (1994) Eur. J. Biochem., Leippe, M. (1998) Mol. Microbiol., 27, 269276. 223, 865872. 12. Keene, W. E., Petitt, M. G., Allen, S., and McKerrow, J. H. 40. Dargatz, H., Diefenthal, T., Witte, V., Reipen, G., and von (1986) J. Exp. Med., 163, 536549. Wettstein, D. (1993) Mol. Gen. Genet., 240, 140145. 13. Spinella, S., Levavasseur, E., Petek, F., and Rigothier, M. 41. Witte, V., Wolf, N., and Dargatz, H. (1996) Curr. C. (1999) Eur. J. Biochem., 266, 170180. Microbiol., 33, 281286. 14. Bruchhaus, I., Jacobs, T., Leippe, M., and Tannich, E. 42. Porter, W. H., Cunningham, L. W., and Mitchell, W. M. (1996) Mol. Microbiol., 22, 255263. (1971) J. Biol. Chem., 246, 76757682. 15. Willhoeft, U., Hamann, L., and Tannich, E. (1999) Infect. 43. Siffert, O., Emod, I., and Keil, B. (1976) FEBS Lett., 66, Immun., 67, 59255929. 114119. 16. Que, X., and Reed, S. L. (2000) Clin. Microbiol. Rev., 13, 44. Gusman, H., Grogan, J., Kagan, H. M., Troxler, R. F., and 196206. Oppenheim, F. G. (2001) FEBS Lett., 489, 97100. 17. Serveau, C., Lalmanach, G., Juliano, M. A., Scharfstein, 45. Potempa, J., Sroka, A., Imamura, T., and Travis, J. (2003) J., Juliano, L., and Gauthier, F. (1996) Biochem. J., 313, Curr. Protein Pept. Sci., 4, 397407. 951956. 46. Potempa, J., and Travis, J. (1996) Acta Biochim. Pol., 43, 18. Gillmor, S. A., Craik, C. S., and Fletterick, R. J. (1997) 455465. Protein Sci., 6, 16031611. 47. Kadowaki, T., Yoneda, M., Okamoto, K., Maeda, K., 19. Turk, B., Stoka, V., Turk, V., Johansson, G., Cazzulo, J. J., and Yamamoto, K. (1994) J. Biol. Chem., 269, 21371 and Bjork, I. (1996) FEBS Lett., 391, 109112. 21378. 20. Stoka, V., Nycander, M., Lenarcic, B., Labriola, C., 48. Pike, R. N., Potempa, J., McGraw, W., Coetzer, T. H., and Cazzulo, J. J., Bjork, I., and Turk, V. (1995) FEBS Lett., Travis, J. (1996) J. Bacteriol., 178, 28762882. 370, 101104. 49. Potempa, J., MikolajczykPawlinska, J., Brassell, D., 21. Martinez, J., Campetella, O., Frasch, A. C., and Cazzulo, Nelson, D., Thogersen, I. B., Enghild, J. J., and Travis, J. J. J. (1993) Immunol. Lett., 35, 191196. (1998) J. Biol. Chem., 273, 2164821657. 22. McGrath, M. E., Eakin, A. E., Engel, J. C., McKerrow, J. 50. Pavloff, N., Pemberton, P. A., Potempa, J., Chen, W. C., H., Craik, C. S., and Fletterick, R. J. (1995) J. Mol. Biol., Pike, R. N., Prochazka, V., Kiefer, M. C., Travis, J., and 247, 251259. Barr, P. J. (1997) J. Biol. Chem., 272, 15951600. 23. YokoyamaYasunaka, J. K., Pral, E. M., Oliveira, O. C., Jr., 51. Eichinger, A., Beisel, H. G., Jacob, U., Huber, R., Alfieri, S. C., and Stolf, A. M. (1994) Acta Trop., 57, 307 Medrano, F. J., Banbula, A., Potempa, J., Travis, J., and 315. Bode, W. (1999) EMBO J., 18, 54535462.

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008 CYSTEINE PROTEINASES OF MICROORGANISMS AND VIRUSES 13

52. Imamura, T., Banbula, A., Pereira, P. J., Travis, J., and 70. Bonilla, P. J., Hughes, S. A., Pinon, J. D., and Weiss, S. R. Potempa, J. (2001) J. Biol. Chem., 276, 1898418991. (1995) Virology, 209, 489497. 53. Wingrove, J. A., DiScipio, R. G., Chen, Z., Potempa, J., 71. Kim, J. C., Spence, R. A., Currier, P. F., Lu, X., and Travis, J., and Hugli, T. E. (1992) J. Biol. Chem., 267, Denison, M. R. (1995) Virology, 208, 18. 1890218907. 72. Den Boon, J. A., Faaberg, K. S., Meulenberg, J. J., 54. Nelson, D., Potempa, J., Kordula, T., and Travis, J. (1999) Wassenaar, A. L., Plagemann, P. G., Gorbalenya, A. E., J. Biol. Chem., 274, 1224512251. and Snijder, E. J. (1995) J. Virol., 69, 45004505. 55. Gerwin, B. I., Stein, W. H., and Moore, S. (1966) J. Biol. 73. De Groot, R. J., Hardy, W. R., Shirako, Y., and Strauss, J. Chem., 241, 33313339. H. (1990) EMBO J., 9, 26312638. 56. Liu, T. Y., and Elliott, S. D. (1965) Nature, 206, 3334. 74. Hardy, W. R., and Strauss, J. H. (1989) J. Virol., 63, 4653 57. Kagawa, T. F., Cooney, J. C., Baker, H. M., McSweeney, 4664. S., Liu, M., Gubba, S., Musser, J. M., and Baker, E. N. 75. Strauss, E. G., de Groot, R. J., Levinson, R., and Strauss, (2000) Proc. Natl. Acad. Sci. USA, 97, 22352240. J. H. (1992) Virology, 191, 932940. 58. Lukomski, S., Montgomery, C. A., Rurangirwa, J., Geske, 76. Lemm, J. A., Rumenapf, T., Strauss, E. G., Strauss, J. H., R. S., Barrish, J. P., Adams, G. J., and Musser, J. M. (1999) and Rice, C. M. (1994) EMBO J., 13, 29252934. Infect. Immun., 67, 17791788. 77. Shirako, Y., and Strauss, J. H. (1994) J. Virol., 68, 1874 59. Kapur, V., Topouzis, S., Majesky, M. W., Li, L. L., 1885. Hamrick, M. R., Hamill, R. J., Patti, J. M., and Musser, J. 78. Hanecak, R., Semler, B. L., Ariga, H., Anderson, C. W., M. (1993) Microb. Pathog., 15, 327346. and Wimmer, E. (1984) Cell, 37, 10631073. 60. Collin, M., and Olsen, A. (2001) Infect. Immun., 69, 7187 79. Brenner, S. (1988) Nature, 334, 528530. 7189. 80. YpmaWong, M. F., Filman, D. J., Hogle, J. M., and 61. Kapur, V., Majesky, M. W., Li, L. L., Black, R. A., and Semler, B. L. (1988) J. Biol. Chem., 263, 1784617856. Musser, J. M. (1993) Proc. Natl. Acad. Sci. USA, 90, 7676 81. Clark, M. E., Hammerle, T., Wimmer, E., and Dasgupta, 7680. A. (1991) EMBO J., 10, 29412947. 62. Li, F., Browning, G. F., Studdert, M. J., and Crabb, B. S. 82. Clark, M. E., Lieberman, P. M., Berk, A. J., and Dasgupta, (1996) Proc. Natl. Acad. Sci. USA, 93, 990995. A. (1993) Mol. Cell. Biol., 13, 12321237. 63. Roberts, P. J., and Belsham, G. J. (1995) Virology, 213, 83. Joachims, M., Harris, K. S., and Etchison, D. (1995) 140146. Virology, 211, 451461. 64. Guarne, A., Kirchweger, R., Verdaguer, N., Liebig, H. D., 84. Tesar, M., and Marquardt, O. (1990) Virology, 174, 364374. Blaas, D., Skern, T., and Fita, I. (1996) Protein Sci., 5, 85. Andino, R., Rieckhof, G. E., Achacoso, P. L., and 19311933. Baltimore, D. (1993) EMBO J., 12, 35873598. 65. Medina, M., Domingo, E., Brangwyn, J. K., and Belsham, 86. Krausslich, H. G., Nicklin, M. J., Toyoda, H., Etchison, G. J. (1993) Virology, 194, 355359. D., and Wimmer, E. (1987) J. Virol., 61, 27112718. 66. Piccone, M. E., Rieder, E., Mason, P. W., and Grubman, 87. Bazan, J. F., and Fletterick, R. J. (1988) Proc. Natl. Acad. M. J. (1995) J. Virol., 69, 53765382. Sci. USA, 85, 78727876. 67. Brown, C. C., Piccone, M. E., Mason, P. W., McKenna, T. 88. Sommergruber, W., Ahorn, H., Zophel, A., MaurerFogy, S., and Grubman, M. J. (1996) J. Virol., 70, 56385641. I., Fessl, F., Schnorrenberg, G., Liebig, H. D., Blaas, D., 68. Bredenbeek, P. J., Pachuk, C. J., Noten, A. F., Charite, J., Kuechler, E., and Skern, T. (1992) J. Biol. Chem., 267, Luytjes, W., Weiss, S. R., and Spaan, W. J. (1990) Nucleic 2263922644. Acids Res., 18, 18251832. 89. Sommergruber, W., Ahorn, H., Klump, H., Seipelt, J., 69. Gorbalenya, A. E., Koonin, E. V., and Lai, M. M. (1991) Zoephel, A., Fessl, F., Krystek, E., Blaas, D., Kuechler, E., FEBS Lett., 288, 201205. Liebig, H. D., et al. (1994) Virology, 198, 741745.

BIOCHEMISTRY (Moscow) Vol. 73 No. 1 2008