Amino Acid Sequence of Porcine Spleen Cathepsin D (Primary Structure/Aspartic Protease/Lysosomal Enzyme/Homology with Pepsin) JAIPRAKASH G
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Nati. Acad. Sci. USA Vol. 81, pp. 3703-3707, June 1984 Biochemistry Amino acid sequence of porcine spleen cathepsin D (primary structure/aspartic protease/lysosomal enzyme/homology with pepsin) JAIPRAKASH G. SHEWALE* AND JORDAN TANG*t *Laboratory of Protein Studies, Oklahoma Medical Research Foundation, and tThe Department of Biochemistry and Molecular Biology, The University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104 Communicated by Stuart Kornfeld, March 12, 1984 ABSTRACT The amino acid sequence of porcine spleen V8 and endoproteinase Lys C were obtained from Millipore, cathepsin D heavy chain has been determined and, hence, the Miles, and Boehringer Mannheim, respectively. Sequencer complete structure of this enzyme is now known. The sequence reagents were from Beckman and from Burdick and Jackson of heavy chain was constructed by aligning the structures of (Muskegon, MI). All other chemicals used in this study were peptides generated by cyanogen bromide, trypsin, and endo- of analytical grade. proteinase Lys C cleavages. The structure of the light chain Separation of Peptides. Peptides from CNBr cleavage, has been published previously. The cathepsin D molecule con- tryptic and endoproteinase Lys C digests were initially frac- tains 339 amino acid residues in two polypeptide chains: a 97- tionated by gel filtration on Sephadex G-50 (Pharmacia) in residue light chain and a 242-residue heavy chain, with a com- either 10% formic acid or 0.1 M ammonium bicarbonate. In- bined Mr of 36,779 (without carbohydrate). There are two car- dividual peptides were further purified using ion-exchange bohydrate units linked to asparagine residues 70 and 192. The chromatography with DEAE-cellulose (DE 52, Whatman) disulfide bond arrangement in cathepsin D is probably similar and HPLC (C8 or C18) individually or in combination. S. to that of pepsin, because the positions of six half-cystine resi- aureus protease digests were directly fractionated on HPLC. dues are conserved. The active site aspartyl residues, corre- The details of the isolation of peptides will be discussed else- sponding to aspartic acid-32 and -215 of pepsin, are located at where. residues 33 and 224 in the cathepsin D molecule. The amino Reaction of the seven methionines in reduced carboxy- acid sequence around these aspartyl residues is strongly con- methylated heavy chain with CNBr (70% HCOOH, 24 hr, served. Cathepsin D shows a strong homology with other acid room temperature) yielded eight fragments, CN1-CN8 (Fig. proteases. When the sequence of cathepsin D, renin, and pep- 1). The dipeptide Gly-Hse [cathepsin D residue number sin are aligned, 32.7% of the residues are identical. The ho- (CDN) 301-302; pepsin residue number (PN) 289-290] was mology is observed throughout the length of the molecules, in- not isolated. Instead, an eighth fragment (CN8) was generat- dicating that three-dimensional structures of all three mole- ed because of oxidative cleavage of a Trp-Ile (CDN 312-313; cules are similar. PN 300-301) bond. Fragments CN3 and CN5 were further cleaved with the S. aureus protease (0.1 M ammonium bicar- Cathepsin D (EC 3.4.23.5), is a lysosomal aspartic protease bonate; 37°C; 4 or 20 hr; protease/protein ratio, 5:100), and present in all mammalian cells. The physiological function of peptides CN3 S1 (CDN 149-159; PN 142-152), CN3 S2 this enzyme is the breakdown of tissue proteins under nor- (CDN 160-173; PN 153-164), and CN5 S1 (CDN 240-253; mal and pathological conditions (1-4). The structural studies PN 232-245) were isolated. on cathepsin D were initiated in this laboratory because, in Cleavage at arginine residues was accomplished by modi- contrast to other well studied aspartic proteases, cathepsin fication of the E-amino groups of lysine residues with maleic D is an intracellular enzyme. A structure-function compari- anhydride and subsequent proteolytic digestion with trypsin son of cathepsin D with other aspartic proteases may gener- (0.1 M ammonium bicarbonate; 37°C; 2 hr; protease/protein ate some insight into the function and regulation of this ratio, 5:100). Peptides A1-A9 were isolated as described enzyme. Also, cathepsin D can serve as a model for the lyso- above. COOH-terminal peptide (CDN 333-339; PN 321-327) somal enzymes, whose synthesis and targeting have generat- was not isolated. Peptide A7 (CDN 300-321; PN 288-309) ed considerable interest recently (5, 6). For the structural was generated because of secondary cleavage at a Phe-Met studies, cathepsin D from porcine spleen was purified (CDN 299-300; PN 287-288) bond. Peptides A2 S1 (CDN to homogeneity in our laboratory (7). This enzyme com- 106-110; PN 99-103) and A2 S2 (CDN 111-134; PN 104-127) prises two chains, each of which contains an asparagine- were isolated by digestion of the peptide A2 with S. aureus linked oligosaccharide. Previously, we have published struc- protease (0.1 M ammonium bicarbonate; 37°C; 4 hr; prote- tural studies on the light chain (8) and on the oligosaccha- ase/peptide ratio, 5:100). rides (9) of cathepsin D. Peptide KC1 (CDN 287-339; PN 279-327), required for se- We report here the evidence for the structure of heavy quence overlap between three CNBr and two tryptic pep- chain and, hence, the complete amino acid sequence of the tides, was produced by endoproteinase Lys C digestion (0.1 cathepsin D. The sequence is compared with the structure of M ammonium bicarbonate; 37°C; 6 hr, protease/protein ra- pepsin (10) and renin (11, 12). The location and possible tio, 5:100) of heavy chain. functional significance of conserved residues is discussed. Other Methods. The amino acid analyses of acid hydroly- sates were performed on a Durrum (Palo Alto, CA) D500 MATERIALS AND METHODS amino acid analyzer as described by Spackman et al. (13). was with a Beck- Materials. Cathepsin D was purified from porcine spleen Automatic sequence analysis performed as described (7). Proteases: trypsin, Staphylococcus aureus Abbreviations: CDN, cathepsin D residue numbers; PN, pepsin res- idue numbers; CN1-CN8, peptides generated by cleavage with The publication costs of this article were defrayed in part by page charge CNBr; A1-A9, tryptic (arginine) cleavage peptides; KC1, endopro- payment. This article must therefore be hereby marked "advertisement" teinase Lys C cleavage peptide; S1 and S2, peptides generated by in accordance with 18 U.S.C. §1734 solely to indicate this fact. cleavage with Staphylococcus aureus protease. 3703 3704 Biochemistry: Shewale and Tang Proc. NatL Acad Sci. USA 81 (1984) RESIDUE 98 100 150 200 250 310 339 NUMBER II HC CNI CN3 CN5 CN7 CN CN2 CN4 CN6 CN8 A I A 3 A5 A 7 A9 A .. _ _~~~~~~~~. .. A 2 A4 A6 A8 A2 S1 CN3 S1 CN5 Si S A2 S2 CN3 S2 KC 1 KC FIG. 1. Schematic presentation of the evidence for sequence assembly of cathepsin D heavy chain (HC). Bars represent isolated peptides; shaded areas indicate sequenced regions. Residue numbers indicate the position in cathepsin D molecule (see Fig. 2). Peptides were generated by cleavage with cyanogen bromide (CN), arginine cleavage by trypsin after lysine modification (A), and protease digestion with S. aureus protease (S) and endoproteinase Lys C (KC). man sequencer, model 890C, equipped with a Sequemat sylation site with a sequence of Asn-Tyr-Thr from residues model P-6 autoconverter using a dilute quadrol program in 282-284 (PN 274-276). However, no carbohydrate was ob- combination with polybrene (14). The phenylthiohydantoin served at this position. Since the positions of six out of seven amino acids were identified by HPLC (15) and occasionally cysteines are the same as in pepsin and renin (Fig. 3), it is by thin-layer chromatography (16). probable that in the cathepsin D molecule three disulfide bridges are located in the equivalent pairing as in pepsin (10, RESULTS AND DISCUSSION 17)-i.e., between cysteine-45 and cysteine-50, cysteine-206 and cysteine-210, and cysteine-250 and cysteine-283. The The complete amino acid sequence of porcine spleen cathep- free sulfhydryl nature of cysteine-27 (PN 26) in cathepsin D sin D is given in Fig. 2. A detailed study of the structure of has been established previously (8). In the entire sequence of light chain was published earlier by Takahashi and Tang (8). cathepsin D, we observed structural microheterogeneity at The discussion here, therefore, is restricted to the assembly two positions. Lysine and serine were both found at position of heavy-chain structure. The sequence is constructed from 228 (PN 219) and glycine and glutamine were at position 241 the structures of the peptides generated from the reduced (PN 233). It is interesting to note that the microheterogenei- carboxymethylated heavy chain by CNBr, trypsin, and en- ties of cathepsin D, as well as those in the pepsin molecule at doproteinase Lys C cleavages. The structural evidence is residues 230 and 255 (17) and in the chymosin molecule at summarized in Fig. 1. Sequence analysis of CNBr and tryp- residue PN 244 (18), all occur in a region between residues sin peptides placed and overlapped most of the heavy-chain PN 219 and PN 255. sequence except for residues 288-299, which was obtained Cathepsin D shows a strong homology when compared by sequence analysis of peptide KC1. There are only two with the other aspartic proteases, pepsin and renin (Fig. 3). single residue overlaps. One is located at residue 287. The Among all three proteases, 32.7% of the residues are identi- connectivity at this position is substantiated by identification cal. The homology is observed throughout the length of the of alanine (CDN 290) in the sequence analysis of peptide molecules, indicating that their three-dimensional structures CN6.