<<

Specificity of the medaka and its usefulness as a biotechnological tool for fusion- cleavage

Katsueki Ogiwara and Takayuki Takahashi*

Laboratory of Molecular and Cellular Interactions, Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Japan

Edited by Ryuzo Yanagimachi, University of Hawaii, Honolulu, HI, and approved March 12, 2007 (received for review November 24, 2006) We cloned two distinct cDNAs for enteropeptidase (EP) from the It is generally believed that EP (or EP-like ) is present in intestine of the medaka, Oryzias latipes, which is a small freshwa- all vertebrates. This belief comes from the finding that in almost all ter teleost. The mRNAs code for EP-1 (1,036 residues) and EP-2 vertebrate species a short sequence of Asp-Asp-Asp-Asp- (1,043 residues), both of which have a unique, conserved domain Lys (D4K) is found in the presumed activation site of trypsinogens structure of the N-terminal heavy chain and C-terminal catalytic (14). However, no information on EP in vertebrates other than light chain. When compared with mammalian EP mammals has been made available to date. Here, we report on the serine , the medaka enzyme exhibited extremely low isolation of cDNAs encoding EP of the medaka (Oryzias latipes), a amidolytic activity for small synthetic peptide substrates. Twelve freshwater teleost, and its expression in several tissues. The present mutated forms of the medaka EP protease were produced by study also describes some enzymatic properties of the catalytic site-directed mutagenesis. Among them, one mutant protease, serine protease domain. Surprisingly, the protease domain of E173A, was found to have considerably reduced nonspecific hy- medaka EP exhibited very limited amidolytic activity for any of the drolytic activities both for synthetic and protein substrates without synthetic peptide substrates tested, indicating that the medaka serious reduction of its Asp-Asp-Asp-Asp-Lys (D4K)-cleavage activ- protease itself is much more highly specific for the D4K sequence ity. For the cleavage of fusion containing a D4K-cleavage than its mammalian counterparts. We further generated various site, the medaka EP proteases were shown to have advantages mutant proteases of medaka EP by site-directed mutagenesis. Some over their mammalian counterparts. Based on our present data, we of the mutated proteases exhibited cleavage specificity that was propose that the E173A mutant is the most appropriate protease to stricter than that of the WT enzyme, and may prove to be more specifically cleave proteins containing the D4K cleavage sequence. effective tools for recombinant protein technology.

cleavage specificity ͉ medaka fish Results cDNA Cloning and Expression of Medaka EP. The present approach nteropeptidase (EP) (enterokinase, EC 3.4.21.9) is a two- gave two distinct medaka EP cDNA clones, EP-1 (3997-bp, Echain, membrane-bound serine protease of the duodenal AB272104) and EP-2 (4036-bp, AB272105). The full-length EP-1 mucosa that converts to active (1). Trypsin cDNA clone contained an ORF that codes a protein of 1,036 aa, thus produced then cleaves and activates other in whereas the EP-2 clone codes a protein of 1,043 aa residues pancreatic secretions, including chymotrypsinogen, proelastase, [supporting information (SI) Fig. 5]. The deduced procarboxypeptidases, and some prolipases. Therefore, EP has sequence of the medaka EP was homologous to those of its been recognized to play a critical role in regulating protein mammalian counterparts. As in mammalian EPs, unique domain within the lumen of the gut. The biological importance structures were found in the N-terminal heavy chain of the fish of EP is demonstrated by the malabsorption and malnutrition of protein (Fig. 1A). However, the extent of sequence identity between patients with primary EP deficiency, a genetic disorder resulting the medaka and mammalian EPs varies considerably from one in no or little EP activity in the (2). domain to another: the identity is 21% in the mucin-like domain, EP has been intensively studied in the last decade. To date, EP 45% in the low-density lipoprotein receptor domain 1, 41% in the has been molecularly cloned from several sources, including cattle Cl r/s domain 1, 49% in the MAM domain (named for the motifs found in Meprin, Xenopus laevis A5 protein, and protein (3, 4), (5), pigs (6), rats (7), and mice (8). These studies ␮ provided much information on the structural details and organi- phosphatase ), 57% in Cl r/s domain 2, 47% in low-density zation of EP, and opened a path to further investigation of the lipoprotein receptor domain 2, and 23% in the MSCR domain. The molecular properties of this protease. For example, it was demon- C-terminal serine protease domain of the fish EP exhibited 53% BIOCHEMISTRY strated that the N-terminal heavy chain is required for efficient identity to mammalian EP serine proteases (Fig. 1B). activation of trypsinogen by the serine protease domain of the In RT-PCR analyses, using primer sets specific for the two C-terminal light chain (9, 10). In addition, a recent study by Lu et medaka EPs, we observed that the band intensities of amplified al. (11) established the tertiary structure of the bovine EP catalytic

domain. The study demonstrated the involvement of Lys-99, which Author contributions: K.O. performed research; K.O. and T.T. analyzed data; K.O. and T.T. is situated in a unique exosite on the enzyme surface, in the specific designed research; and K.O. and T.T. wrote the paper. cleavage of trypsinogen and similar peptidyl substrates. More The authors declare no conflict of interest. recently, a mucin-like domain found in the heavy chain of EP has This article is a PNAS Direct Submission. been shown to be a possible targeting signal for apical sorting of the Abbreviations: ␤NA, ␤-naphthyamide; EP, enteropeptidase; MCA, 4-methylcoumaryl-7-amide; protein (12). sctPA, single-chain tissue-type ; tPA, tissue-type plasminogen activator. EP is also of biotechnological interest because of the unique Data deposition: The sequences reported in this paper have been deposited in the DNA Data substrate specificity of the serine protease domain. The high Bank of Japan (accession nos. AB272104, AB272105, and AB272106). degree of specificity exhibited by EP makes it a suitable reagent *To whom correspondence should be addressed. E-mail: [email protected]. for cleaving bacterially produced proteins. Indeed, the catalytic This article contains supporting information online at www.pnas.org/cgi/content/full/ domain of bovine EP is now widely used as a valuable tool for 0610447104/DC1. this purpose (13). © 2007 by The National Academy of Sciences of the USA

www.pnas.org͞cgi͞doi͞10.1073͞pnas.0610447104 PNAS ͉ April 24, 2007 ͉ vol. 104 ͉ no. 17 ͉ 7021–7026 Downloaded by guest on October 4, 2021 Fig. 1. Cloning, expression, and localization of medaka EP. (A) Schematic representation of the medaka EP do- main structures. Medaka EP consists of a putative signal anchor (SA), a mucin-like domain, two low-density li- poprotein receptor domains, two complement-compo- nent C1r or C1s (C1r/s) domains, a MAM domain, a mac- rophage scavenger receptor (MSCR) domain, and a serine protease domain with active-site residues of (H), aspartate (D), and serine (S). The disulfide bond connect- ing the heavy and light chains is shown. (B) Amino acid sequence alignment of the EP serine protease domain. Amino acid residues are numbered based on the se- quence of medaka EP (top numbers). For comparison, the bovine chymotrypsinogen (Chymo) residue numbers are included in parentheses at the bottom of each block. The arrow indicates a putative activation site between the heavy and light chains. The active-site residues (H, D, and S) are boxed. The positions of mutations are indicated by asterisks. (C) Expression of medaka EP mRNA in various tissues, analyzed by Northern blot. The sizes of the de- tected mRNAs are shown at the left. (Lower) Results for medaka cytoplasmic actin mRNA as a control. (D) Expres- sion of EP mRNA in the , analyzed by RT-PCR. The medaka gastrointestinal tract was divided into eight pieces, from the stomach (lane 1) to the anus (lane 8), and the PCR products from each piece were electrophoresed. (E) In situ hybridization of EP mRNA in the medaka intestine. Neighboring sections of medaka intestine were hybridized with an EP antisense (Left)or sense RNA probe (Right). Scale bars: 100 ␮m. (F) Expres- sion of the medaka EP protein, analyzed by Western blot, using the medaka anti-EP antibody. Extracts of the intes- tine, testis, and ovary (Left) and of nuclear, membrane, and cytosolic fractions of the medaka intestine (Right) were analyzed. The size of the EP protein detected is shown at the left. (G) Immunohistochemical analysis of EP in the medaka intestine, performed with the medaka anti-EP antibody (Left). The control section was stained with the primary antibody previously treated with the antigen (Right). (Scale bars: 200 ␮m.)

products were greater for EP-1 than for EP-2 at every PCR cycle 1,090 bp (corresponding to 2908–3997 in AB272104) and 1,241 bp (SI Fig. 5B). We also performed RT-PCR, using primers common (corresponding to 2757–3997 in AB272104). Both transcripts were to the two EP transcripts. Amplified products (1,235 bp for EP-1 found not to code for any functional protein. In situ hybridization and 1,246 bp for EP-2) were gel-purified and cloned into pBlue- analysis indicated that EP mRNA was localized in the cytoplasm of script II KS(ϩ), and the recombinant plasmids were transformed small, growing follicles in the ovary of mature female medaka (SI into strain JM109. Forty-four clones were randomly Fig. 6). Neither Western blotting nor immunohistochemical anal- picked for nucleotide sequencing; 26 clones were picked for EP-1, ysis, using antibodies specific for the medaka EP protease, detected and 18 clones were picked for EP-2. The results indicated that EP-1 corresponding proteins. Therefore, no further study was conducted is the dominant EP species expressed in the medaka intestine. The on the ovarian EP transcripts. We tentatively speculate that EP results of a Southern blot analysis support the presence of at least transcripts expressed in the fish ovary might play a role as noncod- two distinct copies of the EP in the medaka (SI Fig. 5C). ing RNAs in the oocytes of growing follicles (15). Northern blot analysis of EP, using various fish tissues, revealed In RT-PCR, using primers common to the two species of medaka that the intestine expresses an Ϸ4-kb transcript, and this size is EP, transcripts were detected in the intestinal segments proximal to consistent with that of the full-length cDNA (Fig. 1C). Surprisingly, the stomach (Fig. 1D). In situ hybridization analysis localized EP very strong signals at 1.3 kb and 1.5 kb were detected in the ovary expression to the intestinal epithelium (Fig. 1E). Western blot and testis. Further analyses indicated that these were transcripts of analysis (under reducing conditions) of the extract of medaka

7022 ͉ www.pnas.org͞cgi͞doi͞10.1073͞pnas.0610447104 Ogiwara and Takahashi Downloaded by guest on October 4, 2021 intestine but not ovary and testis extract, using specific anti-EP antibodies against the catalytic domain, detected a 36-kDa immu- noreactive band (Fig. 1F Left). A polypeptide band of the same molecular mass was detected in both cytosolic and membrane fractions of the intestine (Fig. 1F Right). Western blotting of the intestine extract under nonreducing conditions gave no clear band (data not shown). Immunohistochemical analysis, using the anti- body, demonstrated the epithelial localization of EP in the intestine (Fig. 1G). The extract of medaka intestine exhibited enzyme activity for the synthetic EP substrate Gly-Asp-Asp-Asp-Asp-Lys-␤- naphthylamide [GD4K-␤-naphthyamide (␤NA)]. Using this activity as a marker, the apparent molecular mass of intact EP was estimated by gel filtration to be 440 kDa (SI Fig. 7A). The fraction having GD4K-␤NA-hydrolyzing activity showed a 36-kDa polypep- tide in Western blotting under reducing conditions (SI Fig. 7B Left). Again, the same fraction did not show any clear band with the current antibody when analyzed under nonreducing conditions (SI Fig. 7B Right). Taken together, these results indicate that the fish intestine contains active, membrane-bound EP. Part of the mole- cule exists in the intestine in a soluble form that is probably detached from the epithelial cell membrane. The adult medaka Fig. 2. Specificity of medaka EP-1 protease for peptide and protein substrates. intestinal epithelium contains most of the cell types (, (A) Active recombinant EP proteases were assayed by using GD4K-␤NA as a goblet cells, and enteroendocrine cells) observed in the small substrate, including bovine EP proteases obtained from Novagen (Nvg) and New intestine of other vertebrates, but lacks crypts containing Paneth England Biolabs (Neb). (B) Active recombinant EP proteases were assayed by cells and intestinal stem cells (16). Our current data suggest that using various synthetic peptide substrates. (C) Active EP proteases were analyzed medaka EP is localized in the enterocytes in the proximal intestinal by gelatin zymography. (D) Fibronectin (4 ␮g) was incubated with active EPs (100 epithelium. ng) at 37°C for 12 h. (E) High-molecular-weight kininogen (10 ␮g) was incubated with active recombinant EPs (100 ng) at 37°C for 12 h. (F) Two ␮g of the control protein containing the D4K site were incubated with active recombinant EPs (100 The Medaka EP Catalytic Serine Protease Domain Has Distinct Prop- ng) at 37°C for 1 h. erties as Compared with Its Mammalian Counterparts. The active 32-kDa C-terminal serine protease domains of both EP-1 and EP-2 were prepared to characterize their enzymatic properties (SI Fig. Clearly, the medaka protease specifically cleaved the fusion protein ␤ 8A). Both showed maximal activities for GD4K- NA at to generate two polypeptides with expected molecular masses of 16 pH 8, but EP-1 was approximately three times more active than and 32 kDa (Fig. 2F). In contrast, the mammalian enzymes not only EP-2 (SI Fig. 8B). To examine the effects of EP-1 and EP-2 on the produced the two expected polypeptides but also further degraded physiological substrate trypsinogen, a 866-bp medaka trypsinogen the products, presumably because of extensive nonspecific proteo- cDNA (AB272106), which codes for a protein of 242 aa (SI Fig. 9), lytic activities. These results demonstrate that the medaka EP-1 was obtained from the intestine. Using the sequence, a recombinant protease intrinsically has much more strict cleavage specificity than fusion protein of medaka trypsinogen was prepared. The trypsino- its mammalian counterparts. gen was converted to active trypsin by EP-1 faster than by EP-2 (SI Active recombinant medaka EP-1 was stable at Ϫ20°C and 4°C; Fig. 8C). The behavior of the two proteases in response to various the initial enzyme activity was retained at both temperatures for at protease inhibitors was undistinguishable (SI Table 2). From these least six months with no detectable change in the electrophoretic results, together with the finding that EP-1 is the dominantly pattern (data not shown). When medaka EP-1 alone was kept at expressed form in the intestine, we selected EP-1 for the following 37°C at neutral pH, Ϸ30% loss of enzyme activity was observed experiments. after 4 days of incubation (SI Fig. 10). In a parallel experiment, The serine protease domain of medaka EP-1 cleaved GD K- 4 using bovine EP protease, a sharp decline in enzyme activity was ␤NA at a rate comparable to those of the porcine and bovine seen after just a few hours of incubation at 37°C. enzymes (Fig. 2A). Because, in addition to D4K-cleavage activity, mammalian EP protease is known to have amidolytic activity for some 4-methylcoumaryl-7-amide (MCA)-containing peptide sub- Some Medaka Mutant EP Proteases Exhibit Cleavage Specificity Stricter than That of the Wild-Type Enzyme. We further examined

strates (11), enzyme activities of medaka EP-1 protease were BIOCHEMISTRY determined with these substrates. Surprisingly, the activities of the whether site-directed mutagenesis of the medaka EP-1 catalytic medaka protease for t-butyloxycarbonyl-Glu[benzyloxycarbonyl- domain could produce a mutant protease with superior character- (butanoil)]-Ala-Arg-MCA, Z-Phe-Arg-MCA and Pro-Phe-Arg- istics in terms of biotechnological interest. Because strict specificity MCA were much lower than those of the EP proteases of mam- of the medaka protease compared with its mammalian counterparts malian origin (Fig. 2B). The kinetic parameters of the proteases for might be due to substitutions of residues conserved in the mam- malian enzymes, we focused primarily on five amino acid positions these substrates were determined (Table 1). Generally, the kcat/Km values of the medaka enzyme were 1–2 orders of magnitude smaller where the residues differed from those of mammalian EP proteases. than those of the mammalian proteases for all MCA-containing These five residues are all distant from the medaka EP-1 protease synthetic substrates. in a structural model (Fig. 3) based on a model of the The proteolytic activity of the medaka protease was examined by bovine EP serine protease domain, which was crystallized in using gelatin (Fig. 2C), fibronectin (Fig. 2D), and laminin (Fig. 2E). complex with an inhibitor analog of the activation peptide, Val- For comparison, the mammalian proteases were also tested under Asp-Asp-Asp-Asp-Lys-chloromethane (11). The residues were mu- the same conditions. Little or no hydrolysis of the proteins was tated to those conserved in the mammalian protease (K63R, observed with the fish enzyme, whereas these substrates were T105E, F144S, E173K, and P193E). In addition, the residues were detectably hydrolyzed by the mammalian proteases. Finally, the replaced by those with a small side chain (K63A, T105A, F144A, fusion protein (48 kDa) containing an EP-cleavage site (available E173A, and P193A) or those with a different charge (K63E and from Novagen, Madison, WI) was tested with various EP proteases. T105E) to examine the effects of side chains (Fig. 1B).

Ogiwara and Takahashi PNAS ͉ April 24, 2007 ͉ vol. 104 ͉ no. 17 ͉ 7023 Downloaded by guest on October 4, 2021 Table 1. Kinetic parameters of various EP proteases as measured on GD4K-␤NA and MCA-containing substrates GD4K-␤NA Boc-E(OBzl)-AR-MCA Z-FR-MCA PFR-MCA

Km, kcat, kcat/Km, Km, kcat, kcat/Km, Km, kcat, kcat/Km, Km, kcat, kcat/Km, Protease mM minϪ1 mMϪ1 ϫ minϪ1 mM minϪ1 mMϪ1 ϫ minϪ1 mM minϪ1 mMϪ1 ϫ minϪ1 mM minϪ1 mMϪ1 ϫ minϪ1

WT (EP-1) 0.7 940 1,300 0.2 6.7 34 0.1 2.9 29 10 140 14 K63R 0.2 210 1,100 1.2 12 10 0.1 1.4 14 1.1 11 10 T105E 0.4 260 650 1.3 11 9 0.1 2.0 20 1.3 16 12 E173A 0.3 320 1,100 1.0 10 10 0.4 2.3 6 1.0 9.2 9 P193E 0.4 290 730 0.2 2.3 12 0.2 1.7 9 1.0 27 27 Porcine 0.4 530 1,300 0.3 110 370 0.2 55 280 3.9 300 77 Bovine (Nvg) 0.8 770 960 0.5 740 1,500 0.5 720 1,400 4.0 790 200 Bovine (Neb) 0.5 1,500 3,000 0.4 570 1,400 0.4 600 1,500 2.9 1,300 450

AR, Ala-Arg; Boc, t-butyloxycarbonyl; FR, Phe-Arg; OBzl, benzyloxycarbonyl(butanoil); PFR, Pro-Phe-Arg; Z, benzyloxycarbonyl.

All 12 mutants could convert the recombinant medaka trypsino- These results indicate that the substitution of by gen to its active enzyme (data not shown). K63R, T105E, F144S, at position 173 caused a significant reduction in unwanted, E173K, and P193E showed reduced enzyme activity for GD4K- nonspecific enzyme activities for both the synthetic and protein ␤NA (Fig. 4A). The same held true for all of the other mutants substrates without seriously deteriorating the mutant’s cleavage except F144A, which hydrolyzed both GD4K-␤NA and the three specificity for the GD4K sequence. MCA-containing substrates at an elevated rate compared with the WT enzyme (SI Fig. 11). As enzymes all exhibiting significantly Medaka EP Protease Can Be Used as an Efficient Biotechnological Tool reduced activity for the MCA substrates, but retaining GD4K-␤NA- for Cleavage of Recombinant Fusion Proteins. To gain insight into the hydrolyzing activity to a reasonable extent, four mutants (K63R, biotechnological usefulness of medaka EP serine protease, its effect T105E, E173A, and P193E) were chosen for further characteriza- tion. K63R converted recombinant medaka trypsinogen to trypsin as fast as the WT enzyme, whereas the other mutants activated trypsinogen at a reduced rate (SI Fig. 12). The mutant proteases were characterized by kinetic studies. Interestingly, E173A retained a kcat/Km value comparable to the WT enzyme for GD4K-␤NA. However, the kcat/Km values for the MCA-containing substrates were lower (Table 1). Compared with mammalian EP proteases, the mutant proteases showed lower nonspecific proteolytic activity for fibrinogen (SI Fig. 13A). Nonspecific hydrolysis of human high-molecular- weight kininogen occurred at a much slower rate with the mutant proteases than with the mammalian proteases, although some mutations slightly increased nonspecific cleavage (SI Fig. 13B). Neither human fibronectin nor laminin was hydrolyzed by the mutants (data not shown).

Fig. 4. Specificity of mutant medaka EP proteases for peptide and protein substrates containing the D4K-cleavage sequence. (A) The specific activities of WT (EP-1) and mutant EP proteases were determined by using GD4K-␤NA. (B)A recombinant fusion protein of medaka gelatinase A (5 ␮g) was incubated sepa- rately with active EP proteases (100 ng) at 37°C for 1 h and analyzed by SDS/PAGE. (C) A recombinant fusion protein of human 8 (hK8) (5 ␮g) was incu- bated with active EP proteases (100 ng) at 37°C for 2 h. After incubation, the Fig. 3. Computer model of the putative three-dimentional structure of the samples were analyzed by SDS/PAGE (Upper), or assayed for activity with Pro- serine protease domain of medaka EP. The model was created by using Swiss Phe-Arg-MCA (Lower). (D) Culture media collected from two culture dishes Model, starting from the crystallographically derived structure of bovine EP. The (10-cm diameter) of CHO cells transfected with the pCMV tag4 vector containing side chains of the active-site residues (Asp, His, and Ser) of the enzyme are shown the human sctPA sequence were affinity-purified by using Ni2ϩ-Sepharose, and in stick-model form in red; the substrate Asp-Asp-Asp-Lys in blue; the substrate the resulting eluate was treated separately with active EP proteases (100 ng) at residue in the P2 position in dark blue; and the side chains at the positions of point 37°C for 1 h. The samples were then analyzed by SDS/PAGE and Western blot, mutations in green. The black and purple dashed boxes indicate the putative S1 using anti-human tPA antibody (Upper) or anti-His probe antibody (Lower). The and S2 subsite regions, respectively. positions of 58- and 61-kDa sctPA detected with the antibodies are shown.

7024 ͉ www.pnas.org͞cgi͞doi͞10.1073͞pnas.0610447104 Ogiwara and Takahashi Downloaded by guest on October 4, 2021 on various fusion proteins containing a D4K-cleavage site was whether site-directed mutagenesis could produce a mutant enzyme examined. Medaka gelatinase A (17) was synthesized as a fusion with further reduced amidolytic activity for the synthetic substrates. protein containing a His tag and D4K sequence at the N terminus Twelve mutant medaka EP proteases were generated by mu- in the E. coli expression system, using the pET30 expression vector. tagenesis at positions Lys-63 [corresponding to Arg60f in bovine EP Both the WT and mutant proteases converted the 60-kDa fusion protease (11)], Thr-105 (bovine Arg-98), Phe-144 (bovine Ser-137), protein to a 55-kDa protein (Fig. 4B). At the same substrate/ Glu-173 (bovine Lys-167), and Pro-193 (bovine Glu-185). These protease ratio, mammalian EP serine proteases extensively digested positions all occur on the surface of the medaka EP serine protease the fusion protein. model (Fig. 3), and, with the exception of medaka EP protease Next, we synthesized a 35.5-kDa protein of human position 105 (bovine 98), the residues are located a considerable (hK8) in the same E. coli expression system. Digestion with all EP distance from the enzyme active site. A previous crystalline struc- proteases commonly generated 31.5-kDa active hK8 by cleaving the tural study (11) and other biochemical studies (10, 11) on mam- D4K sequence (Fig. 4C Upper). The EP protease-treated samples malian EP protease indicate that the minimal recognition sequence were directly assayed for hK8 activity by using Pro-Phe-Arg-MCA, for EP consists of Lys at P1 and Asp at P2 of the D4K sequence. a good synthetic peptide substrate for hK8 (18). All of the samples Therefore, the five residues we targeted are probably not essential treated with the medaka or mammalian EP proteases exhibited for substrate recognition. As is clear from the medaka EP protease Pro-Phe-Arg-MCA-hydrolyzing activity (Fig. 4C Lower). As ex- model, the structure around the S1 site highly resembles that of pected, none of the medaka EP proteases (WT EP-1, K63R, T105E, bovine EP protease, indicating that the catalytic mechanism and E173A, or E193A) showed any significant enzyme activity. In contacts with Lys at P1 are conserved with mammalian EP pro- contrast, considerable enzyme activities were detected with porcine teases. The interaction of Asp at P2 with the side chain of Lys-106 and bovine (Neb) EP proteases. Fusion protein that had been (bovine Lys-99) at S2 is also conserved, because GD4K-␤NA is digested with the bovine (Nvg) protease had very low activity, hydrolyzed at a comparable rate by the medaka and bovine presumably because of inactivation of the EP protease itself during enzymes. However, the microenvironment around the S2 site is incubation. These results demonstrate that the medaka EP protease presumed to be different between medaka and mammalian EP used for cleaving the fusion protein has no serious effect on hK8 proteases (Fig. 3). A slight difference in the structure around the S2 activity. site might underlie the detectable difference in catalytic activities Finally, a human single-chain tissue-type plasminogen activator for MCA-containing peptide substrates. We indeed observed that (sctPA) fusion protein containing an 11-residue sequence of a the T105E mutant displayed significantly lower catalytic activity for His-tag/EP-susceptible site at the N terminus of mature tPA was GD4K-␤NA. A plausible explanation for this activity decrease is generated by CHO cells and used as a substrate for medaka and that replacement of Thr by Glu at position 105, which is next to mammalian EP proteases. The protein samples treated with the Lys-106 at the S2 site, could affect interaction of Asp at P2 with medaka WT or mutant EP proteases, but not with mammalian Lys-106 (bovine Lys-99) at S2. ones, showed two polypeptides (58 and 61 kDa) detectable with To examine the effects of side chains of the target residues on anti-human tPA antibodies (Fig. 4D Upper). The specific antibody catalytic activity, we replaced these by Ala. Mutagenesis had for the His-tag sequence did not recognize the polypeptides (Fig. different effects on each of the enzyme activities. One of the 4D Lower). These results indicate that the medaka proteases mutants, E173A, was interesting in that it showed significantly lower properly cleave the fusion protein at the EP-cleavage site to activities than the WT enzyme toward all synthetic substrates produce sctPA, and also demonstrate that mammalian EP pro- tested. In addition, this mutant enzyme retained a low nonspecific teases extensively degrade the sctPA fusion protein. proteolytic activity for protein substrates (high-molecular-weight Taken together, our results suggest that the medaka proteases are kininogen and fibrinogen), with no serious reduction in D4K- more effective than their mammalian counterparts as fusion- cleaving activity for fusion proteins (gelatinase A, hK8, and tPA). protein cleavage enzymes for the preparation of desired recombi- Because none of the targeted residues is essential for the EP nant proteins containing the D4K cleavage sequence. protease catalytic activity, changes in the activity by mutagenesis are presumably because of conformational change brought about by Discussion substitution of residues at regions distant from the active site of the Almost all trypsinogen sequences known to date contain a highly enzyme. conserved tetra-aspartate sequence preceding the - We note here that, as demonstrated with bovine EP (9), activity isoleucine scissile . Although EP is widely consid- of the enzyme for D4K-containing substrates depends the heavy ered to play a role in trypsinogen activation in all vertebrate chain. The presence of the heavy chain had almost no effect on species, there has been no report on EP from nonmammalian recognition of the small peptide substrate GD4K-␤NA, whereas this species. The present study is thus the first to report on the chain had profound effects on recognition of the macromolecular

molecular and biochemical characterizations of EP from a lower substrate trypsinogen (9). Furthermore, bovine EP is able to BIOCHEMISTRY vertebrate, the medaka. hydrolyze specifically several biologically active in vitro Medaka EP mRNA exists in two distinct forms, EP-1 and EP-2, (19), and substrate length greatly affects the catalytic efficiency of in the intestine. A comparison of the entire amino acid sequences this hydrolysis. Activity dependency on the heavy chain and peptide of EP-1 (1,036 residues) and EP-2 (1,043 residues) revealed a substrate length remain to be determined for medaka EP. difference of only 22 aa, including an insertion of seven residues in Because EP serine proteases preferably hydrolyze the D4K EP-2. Our data suggest that EP-1 and EP-2 mRNA are expressed sequence, this motif has been intentionally introduced for the at a ratio of Ϸ3:2 in the intestine. It remains to be determined specific cleavage of fusion proteins. Bovine EP serine protease is whether they are indeed translated at this ratio. Moreover, it is not now widely used for this purpose. The current system, using the known at present whether they have a discrete role in vivo. bovine enzyme, works reasonably well in many cases but requires To our surprise, medaka EP serine protease per se showed very handling with great care. We often encounter the following diffi- low activity for three synthetic peptide substrates, t-butyloxycar- culties. (i) Bovine EP protease primarily cleaves at the EP-cleavage bonyl-Glu[benzyloxycarbonyl(butanoil)]-Ala-Arg-MCA, Z-Phe- site of recombinant fusion proteins; however, other peptide bonds Arg-MCA, and Pro-Phe-Arg-MCA, that are rapidly hydrolyzed by are also hydrolyzed to a considerable degree by nonspecific pro- the mammalian enzymes. A comparison of the kinetic parameters teolytic activity. This results in a low yield of the protein in question. between medaka and mammalian EP proteases revealed that the (ii) For preparing active recombinant proteases, the bovine EP remarkably low activities of the fish enzyme for these substrates protease used for cleavage of the inactive fusion protein could be were due to the reduced kcat value. This finding led us to examine an obstacle. This is particularly serious when the proteases to be

Ogiwara and Takahashi PNAS ͉ April 24, 2007 ͉ vol. 104 ͉ no. 17 ͉ 7025 Downloaded by guest on October 4, 2021 examined are ones with very low activity for synthetic and protein Purifier (GE Healthcare Biosciences) to remove inactive enzyme. substrates. Significant nonspecific activities of bovine EP protease A trace amount of trypsin often contained in the sample thus often make it difficult to determine whether the target recombinant prepared was removed by passage through an -Sepharose proteases have been successfully activated. 4B column (Sigma, St. Louis, MO). To address this latter difficulty, we examined the enzyme activ- Active recombinant enzyme of the porcine EP serine protease ities of medaka EP protease for synthetic MCA-containing sub- domain (Ile-800 to His-1034) (6) was prepared basically according strates, which are known to be nonspecifically hydrolyzed by the to the method described above. Bovine EP serine protease was mammalian counterparts, including bovine EP (11), although the obtained from Novagen and New England Biolabs (Schwalbach, synthetic substrates themselves are not physiologically relevant. As Germany). demonstrated in the present study, the serine protease domain of medaka EP has a stricter specificity for almost all of the substrates Site-Directed Mutagenesis. Site-directed mutagenesis of medaka tested compared with the mammalian EP proteases. The disadvan- EP-1 was carried out to produce various mutant proteases. For each tages mentioned above associated with bovine EP could be over- mutant, two PCR products were first amplified with medaka EP-1 come by using the medaka protease. Medaka WT EP protease cDNA as a template, using the following two primer combinations: would be adequate for the preparation of recombinant nonproteo- the upper primer described above with the respective antisense lytic enzymes. However, in view of the efficient cleavage at the D4K primer (SI Table 3) and the lower primer described above with the site and the minimum nonspecific hydrolysis at peptide and amide sense primer (SI Table 3). Using a mixture of these amplified DNAs bonds, we recommend use of the E173A mutant enzyme. Medaka as the template, a second PCR was performed with the upper and WT EP protease and its mutant can be prepared in large quantity lower primers. The PCR products were digested with BamHI and in the E. coli expression system. We believe that, using the medaka HindIII, gel-purified, and ligated into the pET30a expression EP serine proteases as cleavage enzymes for fusion proteins con- vector. All mutants were confirmed by DNA sequencing. The subsequent procedures for preparation of mutant proteases were taining the D4K cleavage sequence, the desired recombinant pro- teins can be easily and effectively produced. the same as for the WT protein described above. Materials and Methods Hydrolysis of Proteins by the EP Catalytic Serine Protease Domain. Preparation of the Recombinant EP Serine Protease Domain. Two Human plasma fibronectin (Chemicon, Temecula, CA), human distinct medaka EP cDNA clones, EP-1 (3,997 bp, AB272104) and fibrinogen (Merk Biosciences, Tokyo, Japan), human high- EP-2 (4,036 bp, AB272105), were obtained as described in SI molecular-weight kininogen (Calbiochem, La Jolla, CA), mouse Materials and Methods. laminin (Biomedical Technologies, Stoughton, MA), D4K cleavage A DNA fragment, including the coding sequence for the medaka site-containing control protein (Novagen), medaka gelatinase A (17), trypsinogen (this study), human kallikrein 8 (hK8) (18), and EP-1 or EP-2 catalytic domain, was amplified by PCR, using a ⅐ pBluescript II plasmid containing cDNA of the catalytic domain as human tPA were incubated at 37°C in 20 mM Tris HCl buffer (pH the template. The upper and lower primers were 5Ј-CGCGGATC- 7.4) containing 50 mM NaCl and 2 mM CaCl2 with various EP CCAAGCTGGTGTGGTGGGTGG-3Ј and 5Ј-CCCAAGCTT- serine proteases at ratios (wt/wt) ranging from 20:1 to 100:1. After TCAGTCTAGATCTGAGAA-3Ј, respectively, which have incubation, samples were subjected to SDS/PAGE, followed by BamHI and HindIII sites at the respective 5Ј termini. The product Coomassie brilliant blue staining. Gelatin zymography was con- ducted as described in ref. 17, except that gels were incubated in 20 was ligated into the cloning site of a pET30a expression vector mM Tris⅐HCl buffer (pH 7.4) containing 50 mM NaCl and 2 mM (Novagen). Expression of the recombinant medaka EP catalytic CaCl . domain in the E. coli expression system was carried out as described 2 in ref. 17. The medaka EP catalytic domain was produced as a Computer Modeling of the Serine Protease Domain of Medaka EP. fusion protein with an extra amino acid sequence of 50 residues at Three-dimensional structure prediction was carried out by using its N terminus; the vector-derived N-terminal stretch contained a Swiss Model. The serine protease domain of medaka EP was His tag and an S-protein sequence. Harvested cells were lysed, and modeled in the ‘‘optimize’’ mode after manual alignment of the the insoluble materials were dissolved in a solubilization buffer sequence with bovine EP. containing 6 M urea, 50 mM Tris⅐HCl (pH 7.6), and 0.5 M NaCl. The methods used for the cloning of medaka EP-1 and EP-2 and Solubilized proteins were subjected to affinity chromatography on 2ϩ trypsinogen, preparations of recombinant proteins other than EP, Ni -Sepharose (GE Healthcare Biosciences, Piscataway, NJ), and RT-PCR, Northern blotting, Southern blotting, in situ hybridiza- eluted with the same buffer, containing 50 mM histidine. Eluted tion, antibody production, Western blotting, immunohistochemis- recombinant proteins were renatured by dialysis against 50 mM ⅐ try, gel-filtration chromatography, and other enzymic methods are Tris HCl (pH 8.0). The fusion protein was then incubated in 50 mM described in SI Materials and Methods. Tris⅐HCl (pH 8.0) containing 0.5 M NaCl with trypsin immobilized on Sepharose 4B at room temperature for 1 h. The immobilized We thank Dr. M. Yao and I. Tanaka (Hokkaido University) for valuable trypsin was then removed by filtration. The resulting sample, which comments on the structure of EP. This work was supported in part by contained not only active EP protease but also inactive enzyme Ministry of Education, Culture, Sports, Science and Technology of Japan protein, was fractionated on a column of Resource Q in AKTA Grant-in-Aid for Scientific Research 17370021 (to T.T.).

1. Light A, Janska H (1989) Trends Biochem Sci 14:110–112. 10. Mikhailova AG, Rumsh LD (1999) FEBS Lett 442:226–230. 2. Grishan FK, Lee PC, Lebenthal E, Johnson P, Bradley CA, Greene HL (1983) Gastroenterology 11. Lu D, Fu¨tterer K, Korolev S, Xinglong Z, Tan K, Waksman G, Sadler JE (1999) J Mol Biol 85:727–731. 292:361–373. 3. LaVallie ER, Rehemtulla A, Racie LA, DiBlasio EA, Ferenz C, Grant KL, Light A, McCoy JM 12. Zheng X, Sadler JE (2002) JBiolChem277:6858–6863. (1993) JBiolChem268:23311–23317. 13. Collins-Racie LA, McColgan JM, Grant KL, DiBlasio-Smith EA, McCoy JM, LaVallie ER 4. Kitamoto Y, Yuan X, Wu Q, McCourt DW, Sadler JE (1994) Proc Natl Acad Sci USA (1995) Biotechnology 13:982–987. 91:7588–7592. 14. Bricteux-Gregoire S, Schyns R, Florkin M (1972) Comp Biochem Physiol 42B:23–39. 5. Kitamoto Y, Veile RA, Donis-Keller H, Sadler JE (1995) Biochemistry 34:4562–4568. 15. Costa FF (2005) Gene 357:83–94. 6. Matsushima M, Ichinose M, Yahagi N, Kakei K, Tsukada S, Miki K, Kurokawa K, Tashiro K, 16. Rombout JH, Stroband HW, Taverne-Thiele JJ (1984) Cell Tissue Res 236:207–216. Shiokawa K, Shinomiya K, et al. (1994) JBiolChem269:19976–19982. 17. Ogiwara K, Takano N, Shinohara M, Murakami M, Takahashi T (2005) Proc Natl Acad Sci USA 7. Yahagi N, Ichinose M, Matsushima M, Matsubara Y, Miki K, Kurokawa K, Fukamachi H, 102:8442–8447. Tashiro K, Shiokawa K, Kageyama T, et al. (1996) Biochem Biophys Res Commun 219:806–812. 18. Rajapakse S, Ogiwara K, Takano N, Moriyama A, Takahashi T (2005) FEBS Lett 579:6879–6884. 8. Yuan X, Zheng X, Lu D, Rubin DC, Pung CY, Sadler JE (1998) Am J Physiol 274:342–349. 19. Likhareva VV, Mikhailova AG, Vaskovsky BV, Garanin SK, Rumsh LD (2002) Lett Peptide Sci 9. Lu D, Yuan, X., Zheng. X. & Sadler JE (1997) JBiolChem272:31293–31300. 9:71–76.

7026 ͉ www.pnas.org͞cgi͞doi͞10.1073͞pnas.0610447104 Ogiwara and Takahashi Downloaded by guest on October 4, 2021