<<

USOO5861477A United States Patent (19) 11 Patent Number: 5,861,477 Atassi (45) Date of Patent: *Jan. 19, 1999

54 CYCLIC PEPTIDE CATALYSTS MODELED Cunningham et al. (1997) Curr. Opin. Struct. Biol, 7(4), ON ACTIVE SITES “Minimized Proteins”, pp. 457-462. Walse et al. (1996) J. Comp.-Aided Mol. Des., 10(1), 76 Inventor: M. Zouhair Atassi, 11743 Cawdor “Structure of a Cyclic Peptide with a , Deter Way, Houston, Tex. 77024 mined by Computer Simulation and NMR Spectroscopy', pp. 11-22. * Notice: The term of this patent shall not extend Gutte et al. (1979) Nature, “Design, Synthesis and Charac beyond the expiration date of Pat. No. terization of a 34-Residue Polypeptide That Interacts with 5,759,834. Nucleic Acids”, pp. 650-655. 21 Appl. No.: 535,298 Atassi et al. (1993) Proc. Nat. Acad. Sci., USA, 90, “Design of Peptide (Pepzymes): Surface-Simulation Syn 22 PCT Filed: May 18, 1994 thetic Peptides That Mimic the and 86 PCT No.: PCT/US94/05569 Active Sites Exhibit the Activity and Specificity of the Respective Enzyme”, pp. 8282-8286. S371 Date: Jun. 19, 1996 Atassi (1985) Biochem. J., 226, "Surface-Simulation Syn thesis of the Substrate- of an Enzyme”, pp. S 102(e) Date: Jun. 19, 1996 477-485. 87 PCT Pub. No.: WO94/28018 Matthews et al. (1994) Proc. Bat. Acad. Sci., USA, 91, “Can Small Cyclic Peptides Have the Activity and Specificity of PCT Pub. Date: Dec. 8, 1994 Proteolytic Enzymes?", pp. 4103–4105. Related U.S. Application Data (List continued on next page.) 63 Continuation-in-part of Ser. No. 63,640, May 18, 1993, abandoned, and a continuation-in-part of Ser. No. 461,597, Primary Examiner-Jon P. Weber Jun. 5, 1995, Pat. No. 5,759.834. Attorney, Agent, or Firm-Conley, Rose & Tayon, P.C. 51 Int. Cl...... C07K 7/52; CO7K 7/50; 57 ABSTRACT C12N 9/22; C12N 9/36; C12N 9/46; C12N 9/48 PEPZYMESTM, chemically synthesized cyclic peptides, 52 U.S. Cl...... 530/317; 530/324; 530/334; modeled on the active Sites of naturally-occurring enzymes 435/199; 435/206; 435/213; 435/212 represented by chymotrypsin, trypsin, lysozyme, 58 Field of Search ...... 435/183, 188.5, ribonuclease, , tissue and 435/195, 199, 206, 213,91.31; 536/23.2; their analogs are disclosed. The conformational constraints 530/317, 324, 334 imposed on the peptide residues cause the amino acids of the peptide to assume a three-dimensional Spatial relationship 56) References Cited relative to the Substrate that is essentially equivalent to that of the corresponding amino acids of the natural U.S. PATENT DOCUMENTS enzyme in its catalytically active State. The new cyclic 4,751,180 6/1988 Cousens et al...... 435/69.7 peptides catalyze the same reaction as the native enzyme being modeled, but have amino acid Sequences that are OTHER PUBLICATIONS Substantially shorter than the naturally-occurring enzymes, Buono et al. (1996) J. Comp.-Aided Mol. Des., 10(3), and do not occur in the same linear relationship in the “Synthesis and Conformationa Analaysis by H-NMR and naturally-occurring enzymes. Methods of producing PEP Restrained Molecular Dynamics Simulations of the Cyclic ZYMESTM are described. Decapeptide Ser-Tyr-Ser-Met-Glu-His-Phe-Arg-Tr p-Gly', pp. 213-232. 12 Claims, 40 Drawing Sheets

848 6.1 g \le 8.01

7.75

1316 6.13

6.38 as- 7.18 110) 5.48 5,861,477 Page 2

OTHER PUBLICATIONS Leatherbarrow et al. (1991) Biochemistry, 30, “Design of a Small Peptide-Based Proteinase Inhibitor by Modeling the Corey et al. (1994) Proc. Nat. Acad. Sci. USA, 91, “Cyclic Active-Site Region of Barley Chymotrysin Inhibitor 2, pp. Peptides as Proteases: A Revaluation”, pp. 4106-4109. 10717-10721. Wells et al. (1994) Proc. Nat. Acad. Sci., USA, 91, “A Reinvestigation of a Synthetic Peptide (TrPepz) Designed to Marrone et al. (1994) J. Am. Chem. Soc., 116(15), “Pep Mimic Trypsin”, pp. 4110-4114. zyme Dynamics and Conformation: A Molecular Dynamics Rizo et al. (1992) Constrained Peptides: Models of Bioacv Study in Water”, pp. 6987–6988. tive Peptides and protein Substructures, pp. 387-418. Stryer (1975) “Biochemistry (2nd Ed.)”, Freeman and Com Kirby (1996) Angew. Chem. Int. Engl., 35(7), “Enzyme pany, San Francisco, pp. I-XXi. Mechanisms, Models, and Mimics', pp. 707-724. Sakar (1981) “Cyclopeptide Macrocycles”, pp. 251-313, in Corey et al. (1996) Proc. Natl. Acad. Sci. USA, 93(21), “On Progress in Macrocyclic Chemistry, Ed. Izatt et al., John the Failure of De Novo-Designed Peptides as Biocatalysts”, Wiley & Sons, New York. pp. 11428-11434. U.S. Patent Jan. 19, 1999 Sheet 1 of 40 5,861,477

WN s

S.

O O O n

O (N

O

O s M C O O O O O ABSORBANCE AT 256 NM U.S. Patent Jan. 19, 1999 Sheet 2 of 40 5,861,477

1/V (MIN/mm.OL) U.S. Patent Jan. 19, 1999 Sheet 3 of 40 5,861,477

ABSORBANCE AT 247 NM U.S. Patent Jan. 19, 1999 Sheet 4 of 40 5,861,477

F. &. U.S. Patent Jan. 19, 1999 Sheet S of 40 5,861,477

G. 3 U.S. Patent Jan. 19, 1999 Sheet 6 of 40 5,861,477

68||

|7 U.S. Patent Jan. 19, 1999 Sheet 7 of 40 5,861,477

S n N-da CN S O

O) n

OO LO s r U.S. Patent Jan. 19, 1999 Sheet 8 of 40 5,861,477

1 O.25 5.18

5.5O

15. 16

1 O.27

FIG 7B U.S. Patent Jan. 19, 1999 Sheet 9 of 40 5,861,477

s e

9

o co N co L? r M C t O O O O O O O O O O ABSORBANCE AT 45O NM U.S. Patent Jan. 19, 1999 Sheet 10 of 40 5,861,477

CN

g

C)

3.

O CO S g C g

S n 6.S C) N - s h

H CD d N.(?. S.

n

O

Ya O N/ O O O O O O O O CO O s n CN s 1/W (MIN/OD) U.S. Patent Jan. 19, 1999 Sheet 11 of 40 5,861,477

9

S.

G. O WN Li O D . - CD Lu N

aH

O r

O O

O O

S d 3 ABSORBANCE AT 3OO NM U.S. Patent Jan. 19, 1999 Sheet 12 of 40 5,861,477

N s

O O O O O O O q- CO CN O) CO S. CN v- ur- -o- 1/V (MIN/OD) U.S. Patent Jan. 19, 1999 Sheet 13 of 40 5,861,477

S 4. ns C N 1- N r

Y

O

aC

n r CO es up N N LO - O ve n O) O CC ser C)

n CO CO O) st O 2. th 2. U.S. Patent Jan. 19, 1999 Sheet 14 of 40 5,861,477

8.48

- 6. 19S 8.01

1316

Šs,Pival Ai) Gly (69°y ing FIG 13B U.S. Patent Jan. 19, 1999 Sheet 15 0f 40 5,861,477

f U.S. Patent Jan. 19, 1999 Sheet 16 of 40 5,861,477

8.66

U.S. Patent Jan. 19, 1999 Sheet 17 of 40 5,861,477

U.S. Patent Jan. 19, 1999 Sheet 18 of 40 5,861,477

U.S. Patent Jan. 19, 1999 Sheet 19 of 40 5,861,477

n

N

U.S. Patent Jan. 19, 1999 Sheet 20 of 40 5,861,477

U.S. Patent Jan. 19, 1999 Sheet 21 of 40 5,861,477 FIG 16B1 IDENTIFICATION OF SEQUENCES COMPARED TO CHPEPZ CHYMOTRYPSINOGEN B (EC 3.4.21.1) CHYMOTRYPSINOGEN (EC 3.4.21.1) B PRECURSOR-BOVINE CHYMOTRYPSINOGEN B PRECURSOR (EC 3.4.21.1) CHYMOTRYPSINOGEN B PRECURSOR (EC 3.4.21, 1) CHYMOTRYPSINOGEN (EC 3:23 B PRECURSOR-HUMAN CHYMOTRYPSINOGEN (EC 3.4.21.1) B PRECURSOR-RAT CHYMOTRYPSINOGEN A (EC 3.4.21.1) CHYMOTRYPSINOGEN (EC 3.4.21.1) A PRECURSOR-BOVINE CHYMOTRYPSINOGEN 2 PRECURSOR (EC 3.4.21.1) CHYMOTRYPSINOGEN (EC 3.4.21.1) 2 PRECURSOR-DOG (EC 3.4.21.7) - PIG . PLASMINOGEN (EC 3.4.21.7)3:22. SSMENB(FRAGMENT . PLASMIN (EC 3.4.21.7) PRECURSOR-HUMAN . PLASMINOGEN PRECURSOR (EC 3.4.21.7) . PLASMIN (EC 3.4.21.7) PRECURSOR-HUMAN . PLASMIN (EC 3.4.21.7) PRECURSOR-HUMAN PLASMIN (EC 3.4.21.7) - SHEEP . PLASMINOGEN (EC 3.4.21.7) . PLASMIN (EC 3.4.21.7) - PIG (FRAGMENT) PLASMINOGEN PRECURSOR (EC 3.4.21.7) . PLASMIN (EC 3:23 -RHESUS MACAOUE i APOLPOPLASMIN PROTEN(EC 3.4.21.7) 3. -RHESUS PRECURSOR-RHESUS MACAQUE (FRAGMENT MACAQUE APOLIPOPROTEIN (o) -RHESUS MACAQUE (FRAGMENT 2 APOLIPOPROTEIN (a) - (EC 3.4.21.-) (APO(A)) (LP(A)) TRYPSINOGEN II, ANONIC PRECURSOR (EC 3.4.21.4) 2 É. PLASMINOGEN (EC 3.4.21.7) PLASMINOGEN PRECURSOR-MOUSE TRYPSN-RELATED PROTEIN - AFRICAN MALARIA MOSOUTO PLASMNTRYPSIN 88(EC 3:233.4.21.7) -| BOVINEPRECURSOR-RAT . PLASMINOGEN PRECURSOR (EC 3.4.21.7) . TRYPSIN (EC 3.4.21.7) PRECURSOR - AFRICAN MALARIA . TRYPSIN 1 PRECURSOR (EC 3.4.21.4) . TRYPSIN 3A1 PRECURSOR (EC 3.4.21.4) . TRYPSIN-LIKE PROTEINASE (EC 3.4.21.-) 3A 1 PRECURSOR . TRYPSIN (EC 3.4.21.4) PRECURSOR-MOUSE . TRYPSINOGEN PRECURSOR (EC 3.4.21.4) HEPATOCYTE GROWTH FACTOR HOMOLOG PRECURSOR-MOUSE . TRYPSIN 4 PRECURSOR (EC 3.4.21.4) . TRYPSIN (EC 3.4.21.4) PRECURSOR, CATIONIC-RAT . TRYPSINOGEN, ANONIC PRECURSOR (EC 3.4.21.4) HEPATOCYTE GROWTH FACTOR-LIKE PROTEIN PRECURSOR HEPATOCYTE GROWTH FACTOR HOMOLOG PRECURSOR-HUMAN . TRYPSIN 7 PRECURSOR (EC 3.4.21.4) . TRYPSIN PRECURSOR (EC 3.4.21.4) . TRYPSINOGEN III, CATIONIC PRECURSOR (EC 3.4.21.4) . HEPATOCYTE GROWTH FACTOR-LIKE PROTEIN PRECURSOR : TRYPSIN (EC 3.4.21.4) PRECURSOR, ANONIC-DOG U.S. Patent Jan. 19, 1999 Sheet 22 of 40 5,861,477 FIG 16 B2 51. TRYPSIN (EC 3.4.21.4) PRECURSOR, PANCREATIC 52. TRYPSIN-LIKE PROTEINASE SE 3.23.3 PRECURSOR 53. TRYPSIN-LIKE PROTEINASE (EC 3.4.21 - ) 5G PRECURSOR 54. TRYPSIN ALPHA PRECURSOR (EC 3.4.21.4) 55. PROTEINASE - SILKWORM 56. TRYPSIN 5G 1 PRECURSOR (EC 3.4.21.4) (FRAGMENT) 57. FACTOR Xia (EC 3.4.21.27) PRECURSOR 58. TRYPSIN 3 PRECURSOR (EC 3.4.21.4) 59. COAGULATION PRECURSOR (EC 3.4.21.27) 6O. PLASMA KALLKREN PRECURS OR SS 3.223 61. PLASMA KALLKREN PRECURSOR (EC 3.4.21.34 62. PLASMA (EC 3.4.21.34) PRECURSOR- MOUSE 63. COAGULATION FACTOR Xo (EC 3.4.21.6) PRECURSOR - HUMAN 64. CHYMOTRYPSIN 1 - PENAEID SHRMP (PENAEUS VANAMEII) 65. PLASMA KALLIKREIN PRECURSOR (EC 3.4.21.34) 66. CHYMOTRYPSN 1 PRECURSOR - PENAED SHRMP 67. CHYMOTRYPSIN 1 PRECURSOR (EC 3.4.21.1) 68. PLASMA KALLIKREN (EC 3.4.21.34) PRECURSOR - RAT 69. TRYPSINOGEN IV PRECURSOR (EC 3.4.21.4) (PRETRYPSIN) 70. TRYPSIN (EC 3.4.21.4) IV PRECURSOR - RAT 71. PLASMA KALLIKREN (EC 3.4.21.34) PRECURSOR - HUMAN 72. TRYPSIN (EC 3.4.21.4) PRECURSOR - AFRICAN MALARIA 73. EBNA-1 NUCLEAR PROTEIN 74. TRYPSIN-RELATED PROTEN - AFRICAN MALARIA MOSQUITO 75. TRYPSIN 2 PRECURSOR (EC 3.4.21.4) 76. PROBABLE NUCLEAR ANTIGEN - HUMAN HERPESVIRUS 4 77. TRYPSIN BETA PRECURSOR (EC 3.4.21.4) 78. G PRECURSOR (EC 3.4.21.-) 79. TRYPSN 6 PRECURSOR (EC 3.4.21.4) 8O. CYTOTOXC T-LYMPHOCYTE PROTEINASE (EC 3.4.21.-) 81. ALPHA- PRECURSOR (EC 3.4.21.59) 82. TRYPSINOGEN IVA PRECURSOR (EC 3.4.21.4) 83. TRYPSINOGEN | PRECURSOR (EC 3.4.21.4) 84. TRYPSN 8. 3.4214) | PRECURSOR - RAT 85. TRYPSN (EC 3.42 1.4) TRYPSINOGEN IV o-FORM-HUMAN 86. TRYPTASE (EC 3.4.21.4 ) PRECURSOR - HUMAN 87. TRYPSIN PRECURSOR (EC 3.4.21.4) 88. TRYPTASE (EC 3.4.21.4) ALPHA PRECURSOR - HUMAN 89. TRYPSINOGEN I, ANONIC PRECURSOR (EC 3.4.21.4) 90. TRYPSIN (EC 3.4.21.4) PRECURSOR, CATIONIC - DOG 91. MUCIN, SUBMAXILLARY - PG (FRAGMENT) 92. GRANZYME A PRECURSOR (EC 3.4.21.-) 93. TRYPSINOGEN, CATIONIC PRECURSOR (EC 3.4.21.4) 94. PERIOD CLOCK PROTEIN - MOUSE (RAGMENT 95. PERIOD CLOCK PROTEIN (FRAGMENT 96. TRYPSIN (EC 3.4.21.4) IV.b PRECURSOR - HUMAN 97. (EC 3.4.21.10), SPERM - PIG 98. TRYPSINOGEN || PRECURSOR (EC 3.4.21.4) 99. TRYPSIN-LIKE SERINE PROTEINASE (EC 3.4.21.4) 1 OO. TRYPSIN (EC 3.4.21.4) PRECURSOR - PG U.S. Patent Jan. 19, 1999 Sheet 23 of 40 5,861,477 retrictr O | | | | | | | | | | | | | | | | | || >3O) || | | | | | | | | | | | | | | | | St 9SI ITT III CC

uuuuuuuuuuuuuuula aaa. ontoocoon loooooooo | | | | | | | | | | | | | | >>>> WN B-B-B-B- S s

----|--|--|--|--|--|--|--|--

-B-B-B-B-B-B-B-B-B-B-B-B-B-B-B-B-B-B- Z S. | | | | | | | | | | | | | | | | | | | soonoooooooooooooooo. : nnSI ITT III ugu?) || | | | | | | | | | | | | | | | | ussesssssssssssssssss is -aoloton cooles aspesigg U.S. Patent Jan. 19, 1999 Sheet 24 of 40 5,861,477

a w colorer colorer O | | | | | | | | | | | | | | | | | ||

sg|OO || | | | | | | | | | | | | | | | | N >>>>>>>>>>>> st | | | | | | | | | | | | | | is a B-B-B-B-B-B-B-B-B-B-B-B-B-B-

- Sulu...... asCN z CO | | | | | | | | | | | | | | | c ------

usO

SS O v CN n?) ru?) (ON | OOO) O v CNN) r u?) (ON | OO cN CN can CN C C, CN C C CN M n) n n n n no M) no U.S. Patent Jan. 19, 1999 Sheet 25 of 40 5,861,477

cer O | | | | | | | | | | | | | | | | | | | O) D C) s3CO

as || | | | | | | | | | | | | || || -g->>>>>>| | | | | | | | | | | | is a | | | | | | | | | | | | | | | U.S. Patent Jan. 19, 1999 Sheet 26 of 40 5,861,477

U.S. Patent Jan. 19, 1999 Sheet 27 of 40 5,861,477 carry...loo

s s

coolozzi- on zSr | | | | | | | |o U.S. Patent Jan. 19, 1999 Sheet 28 of 40 5,861,477 FIG 17B 1 IDENTIFICATION OF SEQUENCES COMPARED TO LYPEPZ LYSOZYME SES 3.2.1.17) c2-BOVINE LYSOZYME EC 3.2.1. 17) 2b PRECURSOR-BOVINE LYSOZYME (EC 3.2.1.17) 1 PRECURSOR-BOVINE LYSOZYME 88 3.2.1.17) 2C PRECURSOR-BOVINE LYSOZYME EC 3.2.1.17) 2C PRECURSOR-BOVINE LYSOZYME C 1o TO 4b (EC 3.2.1.17) LYSOZYME (EC 3.2.1.17) 2 - CHITAL LYSOZYME EC 3.2.1.17) 3 - SHEEP LYSOZYME EC 3.2.1.17) 2 - SHEEP LYSOZYME EC 3.2.1.17) 1 - SHEEP LYSOZYME C PRECURSOR (EC 3.2.1.17) LYSOZYME (EC 3.2.1.17) - 3 PRECURSOR - BOVINE LYSOZYME C 1 AND 2 (EC 3.2.1.17) LYSOZYME 8E. 3.2.1.17) - RED DEER LYSOZYME YSOZYME (ECEC 323:32,3.2.1.17) - HANUMANCHTAL LANGUR LYSOZYME c PRECURSOR (EC 3.2.1.17) LYSOZYME c (EC 3.2.1.17) LYSOZYME (EC 3.2.1.17) - RABBIT LYSOZYME c (EC 3.2.1.17) LYSOZYME LYSOZYME c, KIDNEY (EC 32:3:32,3.2.1.17 LYSOZYME 88 3.2.1.17) - SHEEP : LYSOZYME EC 3.2.1. 17) - BOVINE LYSOZYME (EC 3.2.1.17) - BABOON LYSOZYME LYSOZYME c PRECURSOR (EC 3.2.1.1733:23 LYSOZYME SE 3.2.1.17) PRECURSOR - HUMAN LYSOZYME EC 3.2.1. 17) - HUMAN LYSOZYME c PRECURSOR (EC 3.2.1.17) | LYSOZYME (EC 3.2.1.17) - HUMAN LYSOZYME C-3 (EC 3.2.1.17) LYSOZYME EC 3.2.1.17) - RED DEER LYSOZYME EC 3.2.1.17) PRECURSOR - HUMAN LYSOZYME (EC 3.2.1.17) PRECURSOR - HUMAN LYSOZYME C, TYPE P PRECURSOR (EC 3.2.1.17) LYSOZYME C, TYPE M PRECURSOR (EC 3.2.1.17) LYSOZYME (EC 3:23 M PRECURSOR - MOUSE LYSOZYME (EC 3.2.1.17) P PRECURSOR - MOUSE LYSOZYME C-1 (EC 3.2.1.17) LYSOZYME 88 3.2.1. 17) - RAT LYSOZYME EC 3.2.1.17) - RED DEER LYSOZYME C-2 (EC 3.2.1.17) LYSOZYME C (EC 3.2.1.17) LYSOZYME C PRECURSOR (EC 3.2.1.17) LYSOZYME LYSOZYME 88EC 32:283.2.1.17) C- -LADY COMMON AMHERST'S BOB WHITE PHEASANT LYSOZYME C (EC 3.2.1.17 LYSOZYME C SES 3.2.1. 1 7 : : LYSOZYME C (EC 3.2.1.17 U.S. Patent Jan. 19, 1999 Sheet 29 of 40 5,861,477 FIG. 17B2 51. LYSOZYME C (EC 3.2.1.17) 52. LYSOZYME (EC 3.2.1.17) c || - DUCK 53. LYSOZYME C (EC 3.2.1.17) 54. LYSOZYME C-3 (EC 3.2.1.17) 55. LYSOZYME (EC 3.2.1.17) c - KALIJ PHEASANT 56. LYSOZYME EC 3.2.1.17) - REEVE'S PHEASANT 57. LYSOZYME EC 3.2.1.17) - GOLDEN PHEASANT 58. LYSOZYME EC 3.2.1. 1 7 59. LYSOZYME EC 3.2.1.17) C - INDIAN PEAFOWL 6O. LYSOZYME EC 3.2.1.17) C - PLAN CHACHALACA 61. LYSOZYME EC 3.2.1.17) C PRECURSOR 62. LYSOZYME C PRECURSOR (EC 3.2.1.17) 63. LYSOZYME (EC 3.2.1.17) - GREEN PHEASANT 64. LYSOZYME C PRECURSOR (EC 3.2.1.17) 65. LYSOZYME C-1 PRECURSOR (EC 3.2.1.17) 66. LYSOZYME (EC 3.2.1.17) c - PRECURSOR - DUCK 67. LYSOZYME C PRECURSOR (EC 3.2.1.17) 68. LYSOZYME (EC 3.2.1.17) PRECURSOR - CHICKEN 69. LYSOZYME C (EC 3.2.1.17) 7O. LYSOZYME (EC 3.2.1.17) c - CALFORNIA QUAIL 71. LYSOZYME EC 3.2.1.17) c. PRECURSOR - CHICKEN 72. LYSOZYME EC 3.2.1.17) C PRECURSOR - JAPANESE OUAL 73. LYSOZYME 74. LYSOZYME C 88(EC 33:233.2.1.17 - CHCKEN 75. LYSOZYME (EC 3.2.1.17) c - HELMETED GUINEAFOWL 76. LYSOZYME C (EC 3.2.1.17) 77. LYSOZYME EC 32:3:2 C - DONKEY 78. LYSOZYME EC 3.2.1.17) - HORSE 79. LYSOZYME C SS 3.2.1.17 8O. LYSOZYME C (EC 3.2.1. 17 81. LYSOZYME (EC 3.2.1.17) c - PIGEON 82. LYSOAYME SES 3.2.1.17) 2d PRECURSOR - BOVINE 85. LYSOZYME EC 3.2.1.17) - AUSTRALIAN ECHDNA 84. LYSOZYME (EC 3.2.1.17) c PRECURSOR - JAPANESE QUAIL 85. LYSOZYME SS 3.2.1.17) c PRECURSOR - TURKEY 86. LYSOZYME EC 3.2.1. 1 7 87. ALPHA-LACT ALBUMN PRECURSOR - GOAT 88. APHA-LACT ALBUMN PRECURSOR 89. ALPHA-LACTALBUMN PRECURSOR - SHEEP 9 O. ALPHA-LACT ALBUMIN (LACTOSE SYNTHASE B PROTEIN) 91. ALPHA-LACT ALBUMN - DUCKBLL PLATYPUS 92. ALPHA-LACT ALBUMN PRECURSOR - GOAT 93. ALPHA-LACTALBUMIN PRECURSOR - GUNEA PG 94. ALPHA-LACTALBUMIN PRECURSOR - (LACTOSE SYNTHASE B) 95. ALPHA-LACTALBUMN PRECURSOR - GUNEA PG 96. ALPHA-LACTALBUMIN PRECURSOR 97. LYSOZYME (EC 3.2.1.17) P PRECURSOR - FRUIT FLY 98. LYSOZYME P PRECURSOR (EC 3.2.1.17) 99. ALPHA-LACTALBUMIN PRECURSOR (VERSION 1) - RAT 1 OO. ALPHA-LACTALBUMN PRECURSOR U.S. Patent Jan. 19, 1999 Sheet 30 of 40

s s JC2

l S.g C) 3.2 U.S. Patent Jan. 19, 1999 Sheet 31 of 40 5,861,477

S. s

UE

O) 3.2 U.S. Patent Jan. 19, 1999 Sheet 32 of 40 5,861,477 FIG. 18B1 IDENTIFICATION OF SEOUENCES COMPARED TO RNPEPZ RBONUCLEASE PANCREATIC PRECURSOR PANCREATIC RBONUCLEASE RBONUCLEASE, BRAIN - BOVINE RBONUCLEASE, SEMINAL PRECURSOR (EC 3.1.27. 5) SEMINAL RIBONUCLEASE (EC 3.1.27.5) PRECURSOR - BOVINE PANCREATIC RBONUCLEASE (EC 3.1.27.5) PRECURSOR RBONUCLEASE PANCREATC PRECURSOR (EC 3.1.27.5) PANCREATIC RBONUCLEASE (EC 3.1.27.5) - MOUSE PANCREATIC RBONUCLEASE (EC 3.1.27.5) PRECURSOR RBONUCLEASE PANCREATC PRECURSOR (EC 3.1.27.5) PANCREATIC RBONUCLEASE SES 3.127.5) - BOVINE PANCREATIC RBONUCLEASE (EC 3. 1275) - BOVINE PANCREATIC RIBONUCLEASE (EC 3.1.27.5 PANCREATIC RBONUCLEASE (EC 3.1.27.5) A - BOVINE PANCREATIC RIBONUCLEASE (EC 3.1.27.5) - IMPALA PANCREATIC RIBONUCLEASE (EC 3.1.27.5) - SHEEP PANCREATIC RIBONUCLEAS (EC 3.1.27.5) RBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNose 1) RIBONUCLEASE - DOMESTC WATER BUFFALO RBONUCLEASERIBONUCLEASE PANCREATIC SES(EC 3:3733.1.27.5) (RNose(RNase 1) . PANCREATIC RIBONUCLEASE (EC 3.1.27.5) - GIRAFFE PANCREATIC RBONUCLEASE (EC 3.1.27.5 RBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNose 1) RBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNase 1) RBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNose 1) RBONUCLEASS - DOMESTC WATER BUFFALO PANCREATIC RIBONUCLEASE (EC 3.1.27.5) - PRONGHORN RBONUCLEASE PANCREATC PRECURSOR (EC 3.1.27.5) PANCREATIC RBONUCLEASE (EC 3.1.27.5) - REINDEER RBONUCLEASE PANCREATC PRECURSOR (EC 3.1.27.5) RBONUCLEASE PANCREATC RBONUCLEASE PANCREATIC (EC 3.1.27.53.223 (RNase 1) RBONUCLEASE - THOMSON'S GAZELLE PANCREATC RBONUCLEASE (EC 3.1.27.5) RBONUCLEASE PANCREATC SE: 3.127.5 (RNase 1) PANCREATIC RBONUCLEASE EC 3.1275 - FALLOW DEER RBONUCLEASE PANCREATIC (EC 3.1.27.5 (RNose 1 RIBONUCLEASE PANCREATC E. 3.127.5 (RNose 1 RBONUCLEASE PANCREATC EC 3127.5 RBONUCLEASE PANCREATC EC 3127.5 SNERNose 1 PANCREATIC RBONUCLEASE EC 3.1.27.5) - ROE DEER SEMINAL RIBONUCLEASE (EC 3.1.27.5) - SYNTHETIC PANCREATIC RBONUCLEASE (EC 3.1.27.5) A - GUINEA PG RBONUCLEASE PANCREATC A (EC 3.1.27.5) (RNase 1A) RBONUCLEASE PANCREATC 88 3.1.27.5) (RNasse 1) PANCREATIC RBONUCLEASE EC 3.127.5 - MINKE WHALE PANCREATIC RBONUCLEASE EC 3. 1275 - HPPOPOTAMUS RBONUCLEASE PANCREATC SS 3.1.27.5) (RNose 1) PANCREATIC-TYPE RIBONUCLEASE (EC 3.1.27.5) BRb, br U.S. Patent Jan. 19, 1999 Sheet 33 of 40 5,861,477 FIG. 18B2 5 1. PANCREATIC RBONUCLEASE (EC 3.1275) PRECURSOR 52. RIBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNase 1) 53. PANCREATIC RIBONUCLEASE PRECURSOR (EC 3.1.27.5) 54.55. PANCREATICRBONUCLEASE RBONUCLEASE PANCREATC 88(EC 3.1.27.5373 (RNose 1) 56. PANCREATC RBONUCLEASE & 275) - GOLDEN HAMSTER 57. PANCREATIC RIBONUCLEASE (EC 3.1.27.5), MINOR FORM 58. PANCREATIC RBONUCLEASE SES 27.5 59. PANCREATIC RBONUCLEASE (EC 3.127.5) - RED DEER 60. RIBONUCLEASE PANCREATIC (EC 3.1.27.5) (RNase 1) 61. PANCREATIC RBONUCLEASE (EC 3. 127.5) - HUMAN 62. RBONUCLEASE PANCREATC (EC .27.5) (RNose 1 63. RBONUCLEASE PANCREATC (EC .27.5) (RNase 1 64. RIBONUCLEASE PANCREATC (EC 27.5) (RNose 1) 65. PANCREATIC RBONUCLEASE (EC 27.5 HUMAN 66. PANCREATIC RBONUCLEASE (EC 27.5) - CAPYBARA 67. PANCREATIC RBONUCLEASE (EC 27.5 68. PANCREATIC RBONUCLEASE (EC 27.5) - CASRAGUA 69. PANCREATIC RBONUCLEASE (EC 27.5) - HORSE 7O. RBONUCLEASE PANCREATC (EC .27.5) (RNase 1) 71. PANCREATC RBONUCLEASE (EC 275) - PG 72. RIBONUCLEASE PANCREATIC B (E 3.1.27.5) (RNose B) 73. PANCREATIC RIBONUCLEASE SES 27.5). B - GUINEA PG 74. PANCREATIC RBONUCLEASE (EC 27.5) - NUTRA 75. RIBONUCLEASE PANCREATIC 76. RIBONUCLEASE PANCREATIC (EC 27.527.5) SN:(RNose 31 77. PANCREATIC RBONUCLEASE (EC 27.5) - CHINCHILLA 78 RBONUCLEASE PANCREATC (EC .27.5) (RNase 1) 79. PANCREATC RBONUCLEASE (EC 27.5) - MUSKRAT 8O. PANCREATIC RBONUCLEASE (EC 275) - MOUSE 81 RBONUCLEASE PANCREATIC (EC .27.5) (RNose 3 82. RIBONUCLEASE PANCREATIC (EC .27.5) (RNose 1 83. PANCREATIC RBONUCLEASE (EC 27.5) - CUIS 84. PANCREATIC RBONUCLEASE (EC 27.5) - GOAT 85. RIBONUCLEASE PANCREATC (EC .27.5) (RNase 1) 86. PANCREATIC RIBONUCLEASE (EC 27.5 87. RIBONUCLEASE PANCREATIC (EC 27.5) (RNose 1) 88. PANCREATC RBONUCLEASE (EC 3.127.5) - HANUMAN LANGUR 89. (EC 3.4.21.62) BPN PRECURSOR 90. SUBTILISIN BPN' PRECURSOR (EC 3.4.21.62) 91 RBONUCLEASE PANCREATC 8 3.223 (RNose 1) 92. PANCREATIC RBONUCLEASE (EC 3.1.27.5 93. SPORE COAT PROTEIN SP96 PRECURSOR - SLIME MOLD 94. SPORE COAT PROTEN SP96 95. HYPOTHETICAL PROTEIN 4 (phoC2 3' REGION) 96. TRANSCRIPTIONAL REGULATORY PROTEIN ALGP 97. REGULATORY PROTEIN Agr3 - PSEUDOMONAS AERUGINOSA 98. HYPOTHETICAL 34.4K PROTEIN - PSEUDOMONAS AERUGNOS 99. RIBONUCLEASE PANCREATIC (EC 3.223 (RNose 1) 100. PANCREATIC RIBONUCLEASE (EC 3.1.27.5) - RED KANGAR U.S. Patent Jan. 19, 1999 Sheet 34 of 40 5,861,477

s

s s U.S. Patent Jan. 19, 1999 Sheet 35 of 40 5,861,477

s

s S-Polo an one annon an

N

O U.S. Patent Jan. 19, 1999 Sheet 36 of 40 5,861,477

OO Set - CN N (/) - CN CO D v- H - I - - - - H - H - H. H. CN

s s U.S. Patent Jan. 19, 1999 Sheet 37 of 40 5,861,477

s s U.S. Patent Jan. 19, 1999 Sheet 38 0f 40 5,861,477 FIG. 19B 1 IDENTIFICATION OF SEQUENCES COMPARED TO UKPEPZ UROKNASE-TYPE PLASMINOGEN ACTIVATOR PRECURSOR U-PASMINOGENU-PASMNOGEN ACTIVATOR (ECSE: 3.22.733.42 1.73 PRECURSOR UROKNASE-TYPE PLASMINOGEN ACTIVATOR PRECURSOR U- PLASMNOGEN ACTIVATOR SE: 22:23 PRECURSOR U- PLASMINOGEN ACTIVATOR (EC 3.42 1.73 PRECURS OR UROKINASE-TYPE PLASMINOGEN ACTIVATOR PRECURS OR UROKNASE-TYPE PLASMINOGEN ACTIVATOR PRECURSOR U-PLASMINOGEN ACTIVATOR (EC 3.23.73 - RAT U-PLASMINOGEN ACTIVATOR (EC 3.4.21.73 - RAT 1 1 UROK NASE-TYPE PLASMNOGEN ACTIVATOR PRECURSOR U-PLASMINOGEN ACTIVATOR (EC 3.4.21.73) PRECURSOR PRECURSOR U-PLASMNOGENU- PLASMINOGEN ACTIVATOR SEEC 3:2:233.42 1.73 PRECURSOR UROKNASE-TYPE PLASMINOGEN ACTIVATOR PRECURSOR UROK NASE-TYPE PLASMINOGEN ACTIVATOR PRECURSOR . U-PLASMINOGEN ACTIVATOR (EC 3.4.21.73) PRECURS OR HEPATOCYTE GROWTH FACTOR ACTIVATOR - HGF ACTIVATOR ... COAGULATION FACTOR XLI PRECURSOR (EC 3.4.21.38) . COAGULATION FACTOR XIlo (EC 3:23 PRECURSOR COAGULATION FACTOR XIlo (EC 3.4.21.38 - GUINEA PG ; TSSUE PLASMNOGEN ACTIVATOR PRECURS OR T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) PRECURSOR . T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) PRECURSOR PERIOD CLOCK PROTEIN - MOUSE (FRAGMENT) ; : PROTEIN C (ACTIVATED PRECURSOR PROTEIN C (ACTIVATED (EC 3.4.21.693:23 PRECURSOR 2 s PREOTEIN C PRECURSOR (EC 3.4.21.69) (AUTOPROTHROMBIN) PERIOD CLOCK PROTEIN (FRAGMENT) . T-PLASMINOGEN ACTIVATOR (EC 3:23 PRECURSOR T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68 - RAT TSSUE PLASMNOGEN ACTIVATOR PRECURSOR . T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) PRECURSOR TISSUE PLASMINOGEN ACTIVATOR PRECURSOR APOLIPROPROTEIN(a) - RHESUS MACAQUE (FRAGMENT) COAGULATION FACTOR VII PRECURSOR - HUMAN PROTEIN C - HUMAN COAGULATION FACTOR VII PRECURSOR (EC 3.4.21.21) APOLIPOPROTEIN (A) (EC 3.4.21.-) (APO (A)) (LP(A)) ELASTIN - CHICKEN (FRAGMENT) ELASTIN PRECURSOR - CHICKEN (FRAGMENT) ELASTIN PRECURSOR (FRAGMENT) PARACRYSTALLINE SURFACE LAYER PROTEIN, RsoA . PROTEIN C PRECURSOR (EC 3.4.21.69) (AUTOPROTHROMBIN) PROTEIN C (ACTIVATED) EC 3.4.21.69) PRECURSOR PROCOLLAGEN ALPHA 2(1 CHAIN PRECURSOR . PRO ALPHA 2CI) COLLAGEN - MOUSE 2 s - PROTEIN C PRECURSOR (EC 3.4.21.69) (AUTOPROTHROMBIN) . SPIDROIN 2, DRAGLINE SILK FIBRON - ORB SPIDER 5 . PROTEIN C (ACTIVATED) (EC 3.4.21.69) PRECURSOR U.S. Patent Jan. 19, 1999 Sheet 39 0f 40 5,861,477 FIG. 19B2 51. PROTEIN C (EC 3.4.21.-) PRECURSOR - RAT 52. SLK FIBROIN, DRAGLINE - ORB SPIDER 53. PLASMA KALLIKREN (EC 3.4.21.34) PRECURSOR - HUMAN 54. PLASMA KALLKREN PRECURSOR SS 3.4.21.34) 55. PLASMA KALLKREN PRECURSOR (EC 3.4.21.34) 56. FLAMENTO US HEMAGGLUTINN - BORDETELLA PERTUSSS 57. PLASMA KALLKREIN (EC 3.4.21.34) PRECURSOR - MOUSE 58. T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) alpha 2 PRECURSOR 59. MSB2 PROTEIN 6O. MSB2 PROTEIN - YEAST (SACCHAROMYCES CEREVISAE) 61. PLASMA KALLKREIN PRECURSOR (EC 3.4.21.34) 62. PLASMINOGEN PRECURSOR (EC 3.4.21.7) 63. PLASMINOGEN PRECURSOR - MOUSE 64. T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) 65. SALIVARY PLASMINOGEN ACTIVATOR PRECURSOR 66. PLASMA KALLKREN (EC 3.4.21.34) PRECURSOR - RAT 67. T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) beto PRECURSOR 68. HMW 1 = HIGH MOLECULAR-WEIGHT SURFACE-EXPOSED PROTEIN 69. HEPSIN (EC 3.4.21.-) 7O. SERINE PROTEASE HEPSIN (EC 3.4.21.-) 71. HEPSN - RAT 72. HEPSN - RAT 73. HEPSIN (EC 3.4.21.-) - HUMAN 74. PLASMINOGEN (EC 3.4.21.7) 75. T-PLASMINOGEN ACTIVATOR (EC 3.4.21.68) alpha-1 PRECURSOR 76. NAD(P)+ TRANSHYDROGENASE (B-SPECIFIC) (EC 1.6.1.1) 77. COLLAGEN ALPHA 2(V) CHAIN - HUMAN (FRAGMENT) 78. NAD(P) TRANSHYDROGENASE, MITOCHONDRIAL PRECURSOR 79. RNA 1 POLY PROTEIN (253 KD PROTEIN) 8O. GENOME POLY PROTEN - GRAPEVNE FANLEAF VIRUS 81. PROTEIN-TYROSINE KNASE (EC 2.7.1.112)p 150-HUMAN 82. PLASMIN (EC 3.4.21.7) PRECURSOR - RHESUS MACAQUE 85. PLASMIN (EC 3.4.21.7) PRECURSOR - RiESUS MACAQUE 84. COAGULATION FACTOR Xlo (EC 3.4.21.27) PRECURSOR 85. PROCOLLAGEN alpha 2 (IV) CHAIN PRECURSOR 86. PLASMINOGEN PRECURSOR (EC 5ök" 87. COAGULATION FACTOR X| PRECURSOR (EC 3.4.21.27) 88. COLLAGEN olpho 2(V) CHAIN PRECURSOR - MOUSE 89. MAJOR AMPULLATE FIBRON PROTEN - ORB SPIDER 9 O. SILK FIBROIN (FRAGMENT) 91. COAGULATION FACTOR X PRECURSOR (EC 3.4.21.6) 92. PLASMIN (EC 3.4.21.7) PRECURSOR - HUMAN 93. COAGULATION FACTOR Xo PRECURSOR (EC 3.4.21.6) PRECURSOR 94. PLASMIN (EC 3.4.21.7) PRECURSOR - HUMAN 95. PLASMINOGEN PRECURSOR (EC 3.4.21.7) 96. PLASMINOGEN (EC 3.42 1.7 97. PLASMIN (EC 3.4.21.7) PRECURSOR - HUMAN 98. HCF, C1, VCAF, CFF = VP16 ACCESSORY PROTEIN HOST CELL 99. PROTEOGLY CAN CORE PROTEIN PRECURSOR, CARTILAGE 1 OO. PLASMIN (EC 3.4.21.7) - BOVINE U.S. Patent Jan. 19, 1999 Sheet 40 of 40 5,861,477

5.18 10.25/s

5.5O

13.16 1 O.27

FIG 2 OB 5,861,477 1 2 CYCLIC PEPTIDE CATALYSTS MODELED would have a desired chemical activity. K. W. Hahn et al., ON ENZYME ACTIVE SITES “Design and Synthesis of a Peptide Having Chymotrypsin Like Esterase Activity,” Science, Vol. 248, pp. 1544-1547 CROSS-REFERENCE TO RELATED (1990), describe the construction of a catalytic molecule in APPLICATIONS which four helical peptides were assembled in a bundle which contained at its amino ends, Serine, histidine and This is the U.S. National Phase of PCT/US94/05569 filed aspartic acid in a Spatial arrangement Similar to that present May 18, 1994, which is a continuation-in-part of Ser. No. in chymotrypsin (CCT). This assembled-helical design mol 08/063,640 filed May 18, 1993 (abandoned). This applica ecule contained 73 residues representing a major portion of tion is also a continuation-in-part of Ser. No. 08/461,597 and derived from the whole enzyme and was capable of filed Jun. 5, 1995 now U.S. Pat. No. 5,759,834. binding to ester Substrates of C.CT. The assembled-helical BACKGROUND OF THE INVENTION design molecule was limited to hydrolyzing an acetylty rosine ethyl ester for about 100 turnovers and was limited to A. Field of the Invention a rate of catalysis which was only about 0.02% that of the The present invention relates to chemically Synthesized 15 native CCT. Importantly, the Sequences in the marginally catalysts which are capable of catalyzing chemical reactions catalytic Synthetic structure derived from the whole enzyme heretofore only efficiently catalyzed using naturally were Substantially the same as the equivalent Sequences occurring molecules. In particular, the invention pertains to found in the native enzyme. catalytic compositions of matter and to catalyst chemical Quite different approaches to attaining the chemical Structures, to methods of producing the kinds of catalysts advantages of enzymes without having to rely on the native described, and to chemical reactions in which the catalytic molecule have involved a unique use of immunological compositions can be applied. molecules. Monoclonal antibodies with catalytic activity B. Description of the Related Art have been prepared by using transition-State intermediates of For almost a century, catalytic macromolecules have the Substrate as the immunizing antigen. Again, these mol occupied center-Stage in life Sciences and medicine because 25 ecules demonstrate binding of Substrate but possesses only they catalyze many of the important chemical processes marginal activity. carried out in living organisms. In fact, one of the principal However, no synthetic catalytic Structure representing achievements of the modern pharmaceutical industry is the Significantly less than, but designed to Structurally imitate, production of large quantities of Such molecules for use in an enzyme has functioned like a true enzyme. Specifically, health care. More recently, the application of these catalysts no chemically Synthesized, low molecular weight catalyst in non-biological chemical processes has received the comprising a peptide has been produced which is capable of increasing interest of industrial chemists. converting a Substrate to a product in a specific manner at Entire enzyme molecules have been Synthesized as a pre-Selected bonds, at a useful rate of catalysis and for a useful period of time. result of major advances in the knowledge of protein Struc 35 ture and Solid-phase peptide chemistry. Some of these whole molecules retained the catalytic nature of the naturally SUMMARY OF THE INVENTION occurring enzyme. However, the molecules upon which the In accordance with the present invention, Applicant has Synthetic molecules were modeled were relatively Small and constructed relatively low molecular weight, Synthetic cata the yields of the correct Structures were extremely low due lysts having the functional characteristics of naturally occur to the inherent limitations of Solid-phase peptide Synthesis. 40 ring enzymes. The Synthetic catalysts of the invention rep These failures made it evident that synthesizing whole resent molecules which are significantly Smaller than, but enzymes merely to achieve the catalytic capability of the designed to Structurally imitate, an enzyme to the degree that naturally occurring molecule was not a viable approach for they function like the native enzyme. These catalysts are the construction of commercial quantities of catalysts. Nei 45 capable of converting a Substrate to a product as does the ther is this approach at all useful for larger enzymes which native enzyme upon which they are modeled in a highly may comprise Several hundreds of amino acid residues in Specific manner, at pre-Selected chemical bonds, at commer their chains. Clearly, it would be highly desirable if it were cially useful rates of catalysis and for commercially useful possible to obtain the desired chemical activity without periods of time. In Some instances, the Synthetic catalysts Synthesizing the entire enzyme structure. 50 actually improve upon the characteristics of the native A short peptide was previously designed by the Applicant molecule when comparisons of Specificity, turnover rates, using Surface-simulation Synthesis to mimic the Substrate temperature Sensitivity and the like are carried out. binding Site of trypsin. This peptide was shown to possess The Synthetic catalysts of the invention generally com the expected binding activities of the native enzyme with prise a sterically-constrained peptide comprising amino substrates and inhibitors (Atassi, M. Z., “Surface-Simulation 55 acids. Herein, the term amino acid referS generally to Synthesis of the Substrate-Binding Site of an Enzyme.” L-amino and imino acids routinely encountered in native Biochem. J. Vol. 226, pp. 477-485 (1985). However, enzymes as well as to analogs of these naturally occurring although the Synthetically produced peptide had the ability compounds including D-amino acids and the like. to bind to substrates and competitive inhibitors, it exhibited Further, the Sterically-constrained catalyst comprises, in no significant catalytic activity. 60 part, those amino acids which are in the active site of the Artificial linear peptide was Synthesized consisting of 34 naturally occurring enzyme. For the purposes of this residues the Selection of which was based upon Secondary invention, an active site amino acid is one which interacts Structure prediction rules. A dimer of this peptide showed with a Substrate, Substrate analog, transition-State Substrate/ limited nuclease activity (Gutte, B., et al., Nature 281: product or competitive inhibitor of the native enzyme. The 650-655 (1979). More recently and representative of this 65 interaction may be one which achieves the goal of binding type of approach to Synthetic catalyst design, an attempt was the Substrate to the catalytic Surface of the enzyme. The made to Synthesize a portion of an enzyme molecule which interaction may also be one which actually causes the 5,861,477 3 4 conversion of the Substrate to product, Such as by temporary the active site amino acids is known for a large number of donation of electrons to cause the cleavage of a bond in the enzymes. AS more enzymes are Subjected to 3-dimensional Substrate. Not uncommonly, the nature of the interaction analysis and included in the relevant databases, Synthetic may be a hybrid interaction where both binding and con catalysts modeled after Such enzymes may be constructed in version of Substrate to product occurs. a Straightforward manner using the methods of the inven tion. This interaction may comprise a very tight interaction A number of Sources are available for both the Such as through one or more covalent bonds shared, however 3-dimensional Structural information as well as the amino briefly, between the Substrate and the active site amino acid. acid Sequence information which may be used in designing The interaction between the Substrate and the active site the catalysts of the invention. In the United States, a useful amino acid may be of a weaker nature Such as through Source is the Protein Data Bank, Chemistry Department, hydrogen bonding, hydrophobic bonding or other weak Building 555, Brookhaven National Laboratory, Upton, N.Y. inter-molecular interactions. Regardless of the actual 11973. Protein Data Bank Service ASSociation member interaction, be it mere binding, conversion or a combination centers are located worldwide and include Canadian Scien of the two, or be it of a strong or weak character, for the tific Numeric Data Base Service, Dutch National Facility for purposes of the invention, Such interactions with the Sub 15 Computer Assisted Chemistry, NE Italy Interuniversity Strate cause an amino acid residue to fall within the defini Computing Center, European Molecular Biology tion of an active Site amino acid. Laboratory, Japan ASSociation for International Chemical Identification of active Site amino acids is routinely car Information, National Center for Supercomputing ried out on enzymes for which 3-dimensional Structural Applications, Osaka University, Pittsburgh Supercomputing information is known. In certain cases, Substrate analogs or Center, Prophet, San Diego Supercomputing Center, and transition-State Substrate analogs/product are utilized which SEQNET. Addresses, phone numbers, and access to mag allow the protein chemist to determine which of the amino netic media at the parent facility as well as the member acids exposed to the catalytic Surface (active site) of the centers may be obtained through the Protein Data Bank. enzyme interact with the Substrate. Typically these interac 25 Commercial Search firms also have access to protein tions are detected using analytical techniqueS Such as X-ray Sequence databaseS Such as Intelligenetics of Mountain crystallography, maSS Spectroscopy, nuclear magnetic reso View, Calif. nance and the like. Moreover, in many cases, enzymes fall into classes which In certain instances, having applied one or more of these allow certain Structural generalizations to be made in the techniques, it will be known precisely which one of the Selection of the active site amino acids. Thus, enzymes may amino acid residues exposed to the catalytic Surface actually be classified into classes according to the Substrates they cause the Substrate to be bound to or otherwise interact with convert-proteases digest proteins, lipases attack lipids, and the catalytic Surface of the enzyme. Similarly, it may be So on. Subclasses of proteins also exist-serine proteases known precisely which one of the amino acid residues possess a Serine in their active sites useful in the cleaving of exposed to the catalytic Surface actually cause the Substrate 35 peptide bonds in proteins. Even more useful is the fact that to be converted to product. More typically, however, the many proteins, especially those falling into the same classes actual Site of interaction of the Substrate with the catalytic and Subclasses, exhibit a great deal of amino acid Sequence Surface residues will be narrowed down to a defined region homology. Thus, trypsin and chymotrypsin, both of which comprising Sequences and/or amino acid residues that are in are Serine proteases, Share many of the same or chemically close proximity to one another in 3-dimensional Structure 40 Similar amino acid residues at or near the same positions in but distant in primary Structure among which are the actually each enzyme chain. Chemically similar Substrates (for interacting residues. For the purposes of the invention, each example, protein Substrates of proteases) impose very simi of the Sequences and amino acid residues in Such a region lar structural requirements on the catalytic Surfaces in the among which are the actually interacting residues are con active Sites of the enzymes which convert the Substrates to sidered to be active site amino acid residues. This is par 45 products (i.e., serine proteases cleaving peptide bonds of ticularly the case Since it is not unlikely that each residue in protein Substrates, albeit at different positions). Such a region may coordinately participate in the binding or Therefore, the catalyst designer having designed a Syn conversion of the Substrate, leaving no one residue among thetic catalyst based on the known 3-dimensional Structure them to be Singled out as the actually interacting residue. of one enzyme in a given class, can justifiably rely on Such Additionally, it will be understood by those of skill in the 50 homology in Selecting the active site amino acids of another art that certain enzymatic reactions are catalyzed indirectly enzyme of the same class for which there is Substantial by ligands bound to the enzyme active site, which ligands homology. This approach has been Successfully utilized by themselves are bound by interactions with the amino acid the Applicant to design a Synthetic trypsin-like catalyst residues in the enzyme chain. In Such instances, the amino based upon the homology of trypsin with chymotrypsin. acids which bind the ligand which in turn interacts with the 55 Thus, active site amino acids may also be defined by their substrate are expressly included within the definition of the homology with the same or Similar residues in an enzyme for active Site amino acids of the invention. which the 3-dimensional structure is known. If an active site Selection of the active site amino acids to be included in residue has been defined in the enzyme for which the the Synthetic catalysts of the invention may actually involve 3-dimensional Structure is known, and an homologous the isolation, purification and 3-dimensional analysis to a 60 region of a related, closely homologous enzyme with unde Sufficiently detailed degree of resolution of a protein for termined 3-dimensional Structure is known, then by analogy which the Substrate interactions with the catalytic Surface are those residues failing within the homologous region may be unknown. The techniques for doing So are known well by justifiably considered to be active Site residues. Such resi those of skill in the art. Even So, the obtaining of Such data dues are, therefore, included within the definition of active routinely involves Substantial investments of research time 65 Site amino acid for purposes of the invention. and effort. Fortunately, however, the structural information By design methods as detailed herein, after the identifi of the degree of resolution necessary to allow Selection of cation of the active Site amino acids, these residues are 5,861,477 S 6 arranged to have a 3-dimensional spacial relationship which the linear catalyst into a cyclic Structure. In certain instances, is essentially equivalent to that of the naturally occurring incorporation of a cysteine residue in the Synthetic catalyst enzyme. Chiefly, this entails measurements of the distances may mimic a similar residue in the naturally-occurring between the active site residues from C-carbon to C-carbon enzyme. In others, the added cysteine will be without in the peptide bond. Thus, if the C-carbon of active site homology to the naturally-occurring enzyme. If a cysteine residue A is spatially Separated from the C-carbon of active residue is necessarily included as a residue in an active site site residue B by 10 A, it will be understood that the region, and the inclusion cannot be made in Such a way as resulting design must provide a similar distance to be to make the cysteine residue an N- or C-terminus, it will be achieved in the Synthetic catalyst. Surprisingly, the Appli advisable to convert it to a non-disulfide linkage-forming cant has found that at least Some flexibility in design is residue where cyclization is to be achieved by terminal allowable indicating that leSS exacting distances between cysteines. Where a cysteine residue is required for activity C-carbons may be tolerated. For instance, in the chymot (e.g. cysteine proteases), Steric constraint must be achieved rypsin and trypsin designs which follow, the C-carbon of by means other than a disulfide bond. Such a constraint invariant residue aspartic acid -102 is actually Separated by should be to stable to reduction. The pepzyme could then be 10.25 A from the C-carbon of alanine-55 and from the made with the required cysteine residue in the design and, C-carbon of isoleucine/leucine-99 (chymotrypsin/trypsin) 15 even though Such a pepzyme could then dimerize, its activity by 5.18 A. By Simple error, in constructing certain versions would be re-established in the presence of a reducing agent. of the Synthetic catalysts, the C-carbon of aspartic acid-102 Other intramolecular disulfide linkages work as well. Alter was positioned 10.25 A from the C-carbon of isoleucine/ natively and preferably where a bond such as the disulfide leucine-99 and 5.18 A from the C-carbon of alanine-55. linkage may be Susceptible to being reduced, intramolecular Even So, the Applicant obtained activities similar to those cyclization may be achieved via peptide bonding between reported below. This would indicate that even lower levels any two terminal amino acids So long as the proper inter of resolution in the 3-dimensional Structure of an enzyme chain distances are maintained. Other interchain and intra may be useful in creating the Synthetic catalysts of the chain bonding, including bonding or crosslinking to and invention. 25 between non-amino acid Substituents of the catalyst may be In the first instance, the 3-dimensional relationship may used to Sterically constrain the catalyst So long as the means be promoted using SpacerS Such as catalytically inactive achieve the two criteria detailed Supra. amino acids between the active site amino acids. The bond Thus, the catalysts of the invention may achieve Steric lengths of representative Spacers are detailed in the constraint by constructing a cyclized catalyst containing an Examples to follow. By way of continuing example, end-to-end joint between N- and C-termini of the catalyst. In however, if active site residues A and B above were to be certain embodiments, the end-to-end joint further comprises linked by Spacers, the linear distance of 10 A would deter a disulfide bond, a peptide bond or a linking molecule which mine the number and the nature of the spacers to be used. AS is not an amino acid (e.g., Sugars, lipids, carbon-carbon used herein, a catalytically inactive amino acid is any amino bonds, and the like). acid which does not fall within the definition of an active site 35 Since there is no a priori reason for closing the Synthetic amino acid as defined Supra. Other spacers which would not peptide catalyst chain at a particular site along its linear fall within the definition of amino acids as used herein but Structure, it is possible to construct the linear catalyst using which would function Structurally and chemically in a Standard peptide Synthesis chemistry or through recombi similar fashion will find usefulness, as well. nant DNA techniques as any of a number of possible linear Steric constraint of the catalyst may take Several forms. 40 configurations. Some of these linear configurations may be Certainly, the Applicant has found that cyclization of the preferable to others for purposes of Synthesis, especially catalyst achieves the goal of Steric constraint admirably. where it is desired to use disulfide linkages to cyclize the However, the goal of Steric constraint may be achieved by catalyst and the interchain distances limit the placement of other means as well. Thus, co-terminal attachment of the N the disulfide linkage to one or only a few potential Sites and C-termini of the linear peptide to a Surface in close 45 within the linear catalyst. Where cyclization is affected by proximity to one another is Such a means of Sterically means other than disulfide linkages, the order of Synthesis is constraining the catalyst. For the purposes of the present more flexible. invention, any means which Sterically, torsionally or con Thus, claimed herein are Synthetic catalysts capable of formationally constrains the catalyst in a manner which catalyzing reactions of Substrates in a manner Similar to that allows the requisite flexibility of the chain to conform to the 50 of naturally-occurring enzymes. The catalysts generally Substrate yet which eliminates a large portion of the non comprise Sterically-constrained peptides, which peptides are productive (non-catalytic) conformations which would themselves made up of, at least in part, amino acids. The result from a strictly linear, freely mobile chain in Solution amino acids which are used to Synthesize the peptide further are expressly included in this definition. comprise at least Some of the active Site amino acids known The means for Steric constraint must meet at least two 55 to exist in the naturally-occurring enzyme upon which the criteria. First, the means for Sterically constraining the catalyst is modeled. The catalyst is also made up of one or catalyst must maintain the interchain distances necessary for more Spacers variously interspersed during Synthesis of the attaining the proper orientation of active site residues around linear peptide between the active site amino acids. the Substrate in a fashion Similar to that occurring in the The active site amino acids are arranged during the native enzyme. Second, the means for constraining the 60 Synthesis of the linear peptide to achieve a linear relation catalyst must be at least Stable enough under the conditions ship along the length of the catalyst Such that, when con of the reaction with the substrate to maintain the sterically torted properly in 3-dimensional Space, the catalyst is constrained, functional nature of the catalyst. Certainly, the capable of assuming a 3-dimensional spacial relationship Applicant has found that a simple yet effective means for amongst these active site amino acids. The 3-dimensional constraining the catalyst is to include at its N- and C-termini, 65 spacial relationship is essentially equivalent to that present a cysteine residue in order to allow the formation of a in the naturally-occurring enzyme in its catalytically active disulfide bond between the terminal residues thereby closing State. Unlike this essential equivalency of 3-dimensional 5,861,477 7 8 Spatial intermolecular distances, the linear relationship the constructed design. The size of the catalysts is therefore between the active site amino acids along the length of the dictated to a large degree by the size of the active Site of the synthetic catalyst is substantially different from that of the modeled enzyme. In any case, the numbers of amino acid naturally-occurring enzyme. In many cases, the linear rela residues of certain preferred Synthetic catalysts will rarely tionships established in the Synthetic catalysts of the inven exceed approximately 30% of that of the naturally-occurring tion will be substantially altered and even reversed from that enzyme. observed in the native molecule. Since the Synthetic catalyst Of course, the skilled artisan will recognize that there may incorporates in the design only the essential residues of the be additions to the minimal Synthetic catalyst made for other active Site which are linked with proper spacing and reasons which would add to the mass of the molecule. Thus, directionality, large amounts of the linear Sequence found in for instance, it may be possible to add allosteric regions to the naturally-occurring enzyme interspersed between the the basic catalytic molecule in order to regulate the catalysis. active Site regions/residues will be missing from the Syn Additionally, it is likely that multimers of the synthetic thetic catalyst. Thus, the catalysts have molecular weights peptide catalysts may exhibit preferred characteristics and Substantially less than that of the naturally-occurring would thereby add additional mass to such catalysts. For enzyme upon which they are modeled. 15 instance, it is readily apparent that mixed function Synthetic The catalysts of the invention display reaction kinetics catalysts may be constructed using the methods and com that are enzyme-like for Substrates, Substrate analogs and positions of the present invention. A trypsin-like Synthetic competitive inhibitors of the correlate enzymes. The sub catalyst may be coupled to a chymotrypsin-like catalyst to Strates are converted in an essentially equivalent manner achieve a combined catalyst which will act on either protein (i.e., chemically equivalent) to that of the correlate enzyme. Substrate by covalently bonding two distinct Synthetic cata Herein, Substrates of a Synthetic catalyst are understood to lyst molecules Via, for instance, the Side chains present in include at least the substrates of the correlate enzyme. While either molecule's amino acid residues. In certain instances, certain Synthetic catalysts may have unique Substrates not it may be desirable to increase the solubility or serum life of shared with the correlate enzyme, those Substrates will a catalyst of the invention by attaching it to a Solubilizing typically be determined empirically based on Similarity to 25 agent or Stabilizing agent, Such as lysozyme, polyethylene the correlate enzyme Substrates. Herein, binding specificity glycol, polyvinyl alcohol or the like, adding mass to the is meant to refer to Substrate preference. AS Seen herein, a catalyst in the process. Also, where a ligand is bound and is mere four alterations in the active site residues of the involved in catalysis, maSS increases are possible. Thus, chymotrypsin-like Synthetic catalyst causes the resulting while it is anticipated that the basic Synthetic peptide cata trypsin-like catalyst to no longer bind to or cleave proteins lyst will be Substantially reduced in molecular weight com at chymotrypsin cleavage sites, rather it then only binds and pared to the native enzyme, Synthetic constructions which cleaves proteins at trypsin cleavage sites. Binding constant add mass to this basic configuration are expressly antici is meant to refer to the values used in determining the pated. proportional Michaelis-Menten constant (K). In certain The Spacers which may further comprise catalytically embodiments, the catalysts of the invention have binding 35 inactive amino acids will be recognized by one of skill in the constants for their respective Substrates which are approxi art to meet certain criteria. Chiefly, Such amino acid Spacers mately of the same order of magnitude or at most 100 times will be amino acids which have a minimal Steric hindrance that of or better than that of the naturally-occurring enzyme. capacity as it relates to movement of the peptide in all AS evidenced in the examples to follow, it is known that dimensions. Additionally, the Spacer amino acids will typi naturally occurring homologs of a given enzyme display 40 cally lack Such side groups as might interfere in catalysis different affinities for the same Substrate and convert that Such as reactive, hydrophilic, hydrophobic groups or other substrate at different rates. In preferred embodiments, the Such moieties. Typically, Such amino acid Spacers will be binding constant of the Synthetic catalyst will be essentially glycyl, Seryl or alanyl residues. However, in certain equivalent to that of the native molecule. Similarly, the instances, it may be desirable to use one or more Spacers preferred catalysts have rates of catalysis (k) which are 45 Selectively introduced into the catalyst design to impart a approximately 1% of or better than that of the naturally desired conformational constraint on the design, Such as by occurring enzyme. In preferred embodiments, the rate of using proline residues to introduce a kink. Such chain kinkS catalysis of the Synthetic catalyst will be essentially equiva may help to further Sterically constrain the Synthetic catalyst lent to that of the native molecule. In highly preferred to additionally limit the non-catalytic conformations which embodiments, the rates of catalysis of the Synthetic catalysts 50 the cyclic molecule takes in Solution. will be Superior to that of the native molecule, especially In certain preferred embodiments of the invention, the where the catalysis takes place in a reaction environment catalysts of the invention were modeled upon naturally unfavorable for the native enzyme such as with elevated and occurring enzymes Selected from the group of enzymes depressed temperatures or other Situations where the native consisting of chymotrypsin, trypsin, lysozyme and ribonu Structure is Susceptible to denaturing. In certain instances, as 55 clease. These were the enzymes which the Applicant chose shown in the examples to follow, the turnover number (the to be representative of the wide utility of the present cycles of catalysis in a given amount of time) of the invention in constructing Synthetic catalysts. This Selection Synthetic catalysts of the invention exceed that of the native was made in order to demonstrate in the one instance that enzyme to Such a degree that it remains undetermined how while a 3dimensional Structure is advantageous and ulti many turnovers are theoretically possible with the Synthetic 60 mately necessary to construct the Synthetic catalysts of the molecules. Due to their size and lack of Steric constraints, present invention, where amino acid Sequences exist which the Synthetic catalysts of the invention may be capable of are highly homologous to a known 3-dimensional Structure, approaching rates close to that controlled only by diffusion. Substitution of the relevant catalytic residues was carried out The catalysts of the invention, as noted previously, have to alter the catalytic Specificity of the resulting catalyst. molecular weights Substantially less than that of the native 65 Thus, trypsin residues were shown in the examples to follow enzymes. This is a function of the need to incorporate and to be replaced for chymotrypsin residues in a 3-dimensional link together only the active site residues of the enzyme into catalyst based only upon the chymotrypsin X-ray coordi 5,861,477 9 10 nates. In the Second instance, these exemplary enzymes were It will be recognized, however, by those of skill in the art diversely Selected to demonstrate that the compositions of that the catalysts described above and those claimed in matter and methods of the invention were not limited to general may contain functionally equivalent amino acid enzymes catalyzing only certain classes of Substrate. While substitutions. The importance of the hydropathic index of both chymotrypsin and trypsin are proteolytic enzymes, amino acids in conferring biological function on a peptide is lysozyme is a polysaccharide digesting enzyme and ribonu generally known by those of skill in the art. It has been found clease hydrolyzes the bonds of RNA. These selections were only exemplary. The approaches and designs detailed herein by many researchers that certain amino acids may be Sub allow any one of skill in the art with the requisite Stituted for other amino acids having a similar hydropathic 3-dimensional data necessary (or homologous to) to design indeX or Score and Still retain Similar if not identical bio a peptide capable of assuming that 3-dimensional Structure. logical activity. AS displayed below, amino acids are Molecular modeling computer programs known to those of assigned a hydropathic indeX on the basis of their hydro skill in the art may also provide useful information when phobicity and charge characteristics. It is believed that the comparing Substrate induced-fit of the Synthetic catalysts of relative hydropathic character of the amino acid determines the invention. the Secondary Structure of the resultant peptide, which in More specifically, representative catalysts are disclosed 15 turn defines the interaction of the peptide with the substrate which indicate the utility and Scope of the present invention molecule. It is proposed that biological functional equiva acroSS a wide range of natural enzyme models. One pre lence may typically be maintained where amino acids are ferred catalyst is modeled on chymotrypsin and consists eXchanged having no more than a t1 to 2 difference in the essentially of the cyclic amino acid catalyst: index value, and more preferably within a t1 difference. cyclo(-S-cystinyl-glycyl-phenylalanyl-histidyl phenylalanyl-glycyl-glycyl-Seryl-aspartyl-glycyl methionyl-glycyl-seryl-seryl-glycyl-glycyl-Valyl AMINO ACID HYDROPATHIC INDEX Seryl-tryptophanyl-glycyl-isoleucyl-glycyl-glycyl Isoleucine 4.5 aspartyl-glycyl-alanyl-alanyl-histidyl-cystinyl-S) Valine 4.2 (SEQ ID NO:1); wherein 25 Leucine 3.8 Phenylalanine 2.8 “cyclo” refers to the cyclic nature of the peptide and the Cysteine/Cystine 2.5 residues “-S-cystinyl' and “-cystinyl-S” indicate that the Methionine 1.9 peptide was cyclized through the use of a disulfide bond Alanine 18 between the two cysteine residues. Another preferred cata Glycine -0.4 lyst is modeled on trypsin and consists essentially of the Threonine -O.7 cyclic amino acid catalyst: Tryptophan -0.9 Serine -0.8 cyclo(-S-cystinyl-glycyl-tyrosyl-histidyl-phenylalanyl Tyrosine -1.3 glycyl-glycyl-Seryl-aspartyl-glycyl-glutamyl-glycyl Proline -1.6 Seryl-aspartyl-glycyl-glycyl-Valyl-seryl-tryptophanyl Histidine -3.2 glycyl-leucyl-glycyl-glycyl-aspartyl-glycyl-alanyl Glutamic Acid -3.5 35 Glutamine -3.5 alanyl-histidyl-cystinyl-S) (SEQ ID NO:2). Aspartic Acid -3.5 Another preferred catalyst is modeled on lysozyme and Asparagine -3.5 consists essentially of: Lysine -3.9 cyclo(-S-cystinyl-threonyl-asparagyl-arginyl-asparagyl Arginine -4.5 glycyl-glycyl-aspartyl-glycyl-glycyl-leucyl-glutamyl 40 isoleucyl-asparagyl-glycyl-tryptophanyl-tryptophanyl Thus, for example, isoleucine, which has a hydropathic glycyl-glycyl-isoleucyl-glycyl-aspartyl-glycyl index of +4.5, can be substituted for valine (+4.2) or leucine aspartyl-glycyl-alanyl-tryptophanyl-Valyl-alanyl (+3.8), and might still obtain a peptide having similar glycyl-arginyl-glycyl-phenylalanyl-glutamyl-seryl biological activity. Alternatively, at the other end of the asparagyl-cystinyl-S) (SEQ ID NO:3). 45 Scale, lysine (-3.9) can be Substituted for arginine (-4.5), Another preferred catalyst is modeled on ribonuclease and and So on. consists essentially of: Accordingly, these amino acid Substitutions are generally cyclo(-S-cystinyl-glutamyl-glycyl-Valyl-histidyl based on the relative Similarity of R-group Substituents, for phenylalanyl-aspartyl-alanyl-Seryl-glycyl-glycyl example, in terms of size, electrophilic character, charge, threonyl-asparagyl-Valyl-prolyl-lysyl-glycyl-glycyl 50 and the like. In general, although these are not the only Such glutamyl-histidyl-glycyl-phenylalanyl-lysyl Substitutions, the preferred Substitutions which take various cystinyl-S) (SEQ ID NO:4). of the foregoing characteristics into consideration include Another preferred catalyst is modeled on urokinase and the following: consists essentially of: cyclo(-S-cystinyl-glycyl-threonyl-tyrosyl-Valyl-glycyl 55 glycyl-seryl-aspartyl-glycyl-glutaminyl-glycyl-Seryl ORIGINAL RESIDUE EXEMPLARY SUBSTITUTIONS aspartyl-glycyl-glycyl-Valyl-seryl-tryptophanyl alanine glycine; serine glycyl-leucyl-glycyl-glycyl-aspartyl-glycyl-alanyl arginine lysine threonyl-histidyl-cystinyl-S) (SEQ ID NO:5). asparagine glutamine; histidine aspartic acid glutamic acid Another preferred catalyst is modeled on tissue plasminogen 60 cysteine serine activator and consists essentially of glutamine asparagine cyclo(-S-cystinyl-glutamyl-arginyl-phenylalanyl-leucyl glycine alanine glycyl-glycyl-seryl-aspartyl-glycyl-glutaminyl-glycyl histidine asparagine; glutamine isoleucine leucine; valine alanyl-aspartyl-glycyl-glycyl-isoleucyl-Seryl leucine isoleucine; valine tryptophanyl-glycyl-tyrosyl-glycyl-glycyl-aspartyl 65 lysine arginine; glutamine; glutamic acid glycyl-alanyl-alanyl-histidyl-cystinyl-S) (SEQ ID methionine leucine; tyrosine NO:6). 5,861,477 11 12 -continued spacial relationship is essentially equivalent to that present in the naturally-occurring enzyme in its catalytically active ORIGINAL RESIDUE EXEMPLARY SUBSTITUTIONS State. Conversely, the linear relationship is Substantially serine threonine different from that of the naturally-occurring enzyme. As a threonine serine result of the unique linear bridging of active site residues, tryptophan tyrosine there is no need to include vast amounts of the amino acids tyrosine tryptophan; phenylalanine typically found along the native polypeptide chain nor is valine isoleucine; leucine there a need to maintain the order of Sequence found along the native polypeptide chain. The result of this cutting out of The invention also relates to methods of Synthesizing a most of the non-catalytic residues found in the native catalyst capable of catalyzing a reaction of a Substrate in a polypeptide is that the catalyst has a molecular weight manner Similar to that of a naturally-occurring enzyme. Substantially less than that of the naturally-occurring These methods generally comprise first Selecting certain enzyme. As a final Step, the linear peptide is Sterically amino acids to be included in the catalyst from at least Some of the active Site amino acids in the naturally-occurring constrained by one or another of the techniques discussed enzyme. As noted above, the Protein Data Bank readily 15 above. provides an excellent Source of Such information in the form A process using the Synthetic catalysts of the invention to of both 3-dimensional structure, probable active site catalyze a reaction of a Substrate in a manner Similar to that residues, Sequences and Sequence homologies between of a naturally-occurring enzyme is also provided. Typically, related enzymes. Where the designer chooses to base the the Synthetic peptide catalysts of the invention are used in design of the catalyst on a known 3-dimensional Structure of exactly the same manner and under identical conditions for an enzyme, reference is first made to Such a Source for the which the native enzyme is known to function best. Thus, distances Separating and the identity of the active site buffered Solutions at given temperatures and given concen residues on the catalytic Surface of the enzyme. If the design trations which conform to equivalent molar ratioS of catalyst is to be based upon the Sequence homology of an enzyme for to Substrate are preferred. However, Since the catalysts of the which the 3-dimensional structure is unknown but which 25 invention may exhibit enhanced ability to function under unknown enzyme is related to an enzyme of known conditions which would denature or otherwise reduce the 3-dimensional Structure, then a first Step includes design of catalytic ability of the native enzyme, in Some instances the a first catalyst modeled after the known 3-dimensional Synthetic catalysts of the invention may be used under structure enzyme. This step is followed by Substitutions of conditions different from those that are maximal for the active Site amino acids where the unknown 3-dimensional native molecules. In any case, no special conditions over Structure enzyme differs with that of the known enzyme as those routinely utilized by those of skill in the art of determined by Sequence homologies. enzymology are necessary when using the catalysts of the The elements of the design first require the identification invention. Products produced using the catalytic processes of the active site residues and the C-carbon to C-carbon of the invention are also provided. distances Separating them. This is based on the Structural 35 In addition to the terms and definitions discussed above, homology with the known reference enzyme. Next, a Selec the following abbreviations are used herein: BTEE, tion is made of one or more Spacers to be variously inter N-benzoyl-L-tyrosine ethyl ester; BPTI, bovine pancreatic spersed between the active site amino acids Selected in the trypsin inhibitor; ChPepz, peptide designed by Surface first step. The distances determined in the first step between Stimulation Synthesis to mimic the active site of the active site residues guide the Selection of a properatomic 40 C-chymotrypsin; DIFP, diisopropyl fluorophosphate; PMSF, bridging Structure in the form of a Spacer to be used. phenylmethylsulfonyl; TAME, N-tosyl-L-arginine methyl Reference should be made to the detailed examples herein ester; TPCK, L-1-p-tosylamino-2-phenylethyl chloromethyl for representative Spacers and their atomic dimensions. Of ketone; and TrPepz, Surface-stimulation Synthetic peptide course, the Spacers used herein are but exemplary and may designed to mimic the active site of trypsin; DTT, dithio be readily Substituted by any spacer of Suitable dimensions 45 threitol; LyPepz, peptide designed by Surface-simulation and chemical compatibility. Synthesis to mimic the active site of hen egg lysozyme, LYZ, Having Selected the Substituents of the Synthetic catalyst, hen egg lysozyme, RNASE, bovine ribonuclease A, a neXt Step will typically include Synthesizing a linear RnBepz, Surface-Simulation Synthetic peptide designed to peptide by Stepwise addition of either the active site amino mimic the active site of bovine ribonuclease A. acids or the Spacers to the growing peptide to achieve the 50 proper linear arrangement and distances. While the Appli BRIEF DESCRIPTION OF THE DRAWINGS cant has demonstrated the utility and ease by which this may FIG. 1. Time-course hydrolysis of BTEE by CCT and be accomplished using Synthetic peptide chemistry, it is ChPepz. Hydrolyses were carried out, as described in the certainly possible and, in Some cases, advisable to achieve test, on 200 ug (0.638 umole) of BTEE, and monitored on the synthesis of the peptide using recombinant DNA tech 55 a recording spectrophotometer by increase in absorbance at nology. Either approach is permissible. In either case, the 256 nm, using (a) 1.5ug (6.05x10-5 umole) CCT; (b) 7.06 preferred order of synthesis will typically be dictated by the ug (2.65x10 umole) ChPepz; and (c) 4.71 ug (1.77x10 position deemed best for cyclization. Thus, where a disulfide umole) ChPepZ. Curve (d) represents the following linkage is to be used to cyclize the catalyst, Synthesis will reactions, all of which had Zero activity and Superimposed typically begin with a cystinyl residue and end with a 60 on the x axis: ChPepz (7.06 ug) inhibited with TPCK or with cystinyl residue. Alternatively, where a peptide bond is used DIFP; ChPepz (7.06 ug) made acyclic by reduction of the to cyclize the peptide, leSS concern need be placed on the disulfide bond; Trypsin or TrPepz action on BTEE; O.CT or Specific Starting point Synthesis. ChPepz, action on TAME, and action of unrelated peptide Having SO constructed the linear peptide, this establishes controls on BTEE. a linear relationship along the peptide Such that the peptide 65 FIG. 2. Examples of Lineweaver-Burk plots for BTEE is capable of assuming a 3-dimensional spacial relationship hydrolysis by ChPepZ. (O), and CCT (A). Assays were amongst the active Site amino acids. The 3-dimensional carried out at pH 7.8 and 25 C., in 306 replicates at six 5,861,477 13 14 different Substrate concentrations as described in the text. FIG. 9. Lineweaver-Burk plots for M. lysodeikticus The K, and k c values obtained from these plots are given hydrolysis by LyPepZ. (A), and LYZ (A). Assays were in Table 1 together with the kinetic constants for the action carried out at 25 C. in 0.125M NaCl using nine different of trypsin and TrPepz on TAME. Substrate concentrations in 3-6 replicates as described in the FIG. 3. Hydrolysis of TAME by trypsin and TrPepZ. text. The K, and kc values obtained from these plots are Reactions were done on 240 ug (0.634 umole) of TAME at given in Table 2. 25 C., pH 8.0, in a total reaction of 1 ml and monitored at 247 nm on a recording spectrophotometer, as described in FIG. 10. Time-course hydrolysis of yeast RNA by RNase the text. Hydrolyses were done with: (a) trypsin (2.1x10 and RnBepz. Reactions were done at 25 C. on 700 tug of umoles); (b) 6.84 ug of TrPepz (2.53x10 umoles), both in yeast RNA. The decrease in absorbance at 300 nm was the presence and absence of 10 molar excess of TPCK (the monitored continuously on a recording spectrophotometer. two curves Superimposed) (c) 4.55 ug of TrPepz (1.69x10 The curves show: (a) hydrolysis by 1.5 lug (1.095x10" almoles); (d) Several reactions Superimposed on this Zero umole) of RNase; (b) hydrolysis by 5.8 ug (2.42x10 activity curve. These are: action of ChPepz (7.0 ug) on umole) of RnPepz; (c) RnPepz inhibited by 250 ug of TAME, effect of control cyclic and linear peptides (unrelated denatured DNA, or 10 molar excess of AgNO, and RnBepz to TrPepz or ChPepz) on TAME, TrPepz (6.84 ug) inacti 15 rendered acyclic by reduction of the disulfide bond. Also vated by reduction with DTT or inhibited by 10 molar excess Superimposed on curve (c) is the action of unrelated peptide of PMSF, DIFP, BPTI or human C.-antitrypsin. controls (LyPepz and other cyclic peptides) on yeast RNA. FIG. 4. Maps of the peptides obtained from (1) myoglobin FIG. 11. Lineweaver-Burk plots for yeast RNA hydrolysis and (2) casein by hydrolysis with OCT or ChPepZ. (1a) by RnPepz (()), and RNase (O,603). Assays were carried myoglobin peptides obtained with CCT; (1b) myoglobin out in 0.1M sodium acetate at pH 5.0 and 25 C. in 3-6 peptides obtained with ChPepz; (2a) Casein peptides replicate analyses of different Substrate concentrations, as obtained with CCT; (2b) casein peptides obtained with described in the text. The kinetic constants derived from ChPepZ. Peptide maps were done, as described in the text, these plots are given in Table 2. by chromatography in the descending dimensions followed FIG. 12. A diagram of the spatial disposition of the active by high voltage paper electrophoresis (from left to right). 25 site residues of LYZ. FIG. 5. Fingerprints of the myoglobin and casein peptides FIG. 13. Design of the surface-simulation peptide mim obtained by hydrolysis with trypsin or TrPepZ. (1) myoglo icking the active site of LYZ. (a) Shows the active site bin peptides obtained by hydrolysis with: (1a) trypsin, or residues in the shaded areas and the C-to-C' distances (in (1b) TrPepz; (2) casein peptides obtained with (2a) trypsin, A) Separating the appropriate residues. (b) Gives the Surface or (2b) TrPepZ. Peptide maps were done by paper chroma Simulation Synthetic peptide designed to mimic the active tography in the ascending dimension, followed by high Site of LYZ. The shaded areas, representing the actual active Voltage paper electrophoresis (from left to right). For details, Site residues of LYZ, were linked by glycine Spacers which See the text. were introduced to achieve appropriate spacing between the FIG. 6. Stereo-diagram showing the active site residues of 35 regions and the residues of the active site. (Cter) denotes the (A) bovine C.CT and (B) bovine trypsin. Four residues in the C-terminal Cys and (Nter) is the N-terminal Cys of the active site of CCT are substituted in trypsin (Phe-39 to Tyr, peptide. A disulfide bridge between these half cystine resi Ile-99 to Leu, Ser-189 to Asp, Met-192 to Gln) and shown dues is used to cyclize the peptide. in heavy lines in (B). FIG. 14. A diagram of the spatial disposition of the FIG. 7. Design of the surface-simulation synthetic pep 40 essential residues of the active site of RNase. tides mimicking the active sites of C.-chymotrypsin FIG. 15. Design of the surface-simulation peptide mim (ChPepz) and trypsin (TrPepz). (a) shown the contact resi icking the active site of RNase. (a) The active site residues dues of the active site in the shaded areas and the distances shown in the shaded areas and the C-to-C' distances (C-to-C, in A) Separating the appropriate residues. Resi Separating them are given in A. (b) Structure of the surface due numbers are based on bovine chymotrypsinogen 45 Simulation Synthetic peptide designed to mimic the active Sequence. (b) indicates the Surface-simulation Synthetic pep site of RNase. The shaded areas are the actual site residues tides designed to mimic the active sites. The residues in the and the glycine residues in between are spacers. A disulfide shaded areas are the actual active site residues of bovine bridge between the Cys residues at the C-terminal (Cter) and CCT while those linking these Shaded areas are glycine the N-terminal (Nter) is used to cyclize the peptide. Spacers introduced to achieve appropriate distances of Sepa 50 FIG. 16. A. Tabulated results of computer search of PIR ration between the respective residues and regions of the and SWISS-PROT databases utilizing Chymotrypsin B pre site. (Cter) is the C-terminal Cys and (Nter) denotes the cursor as the query Sequence. Active Site residues which N-terminal Cys of the peptides which are cyclized by the were previously utilized to construct a Synthetic catalytic disulfide between the two Cys residues. The outer sequence chymotrypsin peptide are indicated across the top of the differs from ChPepz in four positions only: Phe-39 to Tyr, 55 Figure along with the Sequence number based upon the Ile-99 to Leu, Ser-189 to Asp, Met-192 to Gln. chymotrypsin linear Sequence. To the left of the Figure, FIG. 8. Time-course hydrolysis of M. lysodeikticus by numbered references are indicated which refer to the LYZ and LyPepz. Hydrolyses were carried out, as described Sequences listed in B below. For each Such Sequence, in the text, on 140 ug of Substrate and monitored on a divergences from the active site residue utilized to obtain recording Spectrophotometer by decrease in absorbance at 60 chymotrypsin activity are shown. B. The first 100 sequences 450 nm using (a) 1.5 lug (1.04x10" umole) of LYZ and (b) demonstrating varying degrees of Sequence homology with 5.6 ug (1.51x10 umole) of LyPepZ. Curves (c) represent the query Sequence. Homology Searching was accomplished the following reactions all of which had near Zero hydrolytic using FastDB-Fast Pairwise Comparison of Sequences activity: LyPepz (5.6 ug) inhibited by 10 molar excess of (Release 5.4) by Intelligenetics, Inc., 700 East El imidazole, histamine or tryptamine; LyPepZ after reduction 65 Real, Mountain View, Calif. 94040. of the intramolecular disulfide bond; and action of unrelated FIG. 17. A. Tabulated results of computer search of PIR peptide controls (RnBepz, and other cyclic peptides). and SWISS-PROT databases utilizing Lysozyme C2 Bovine 5,861,477 15 16 as the query Sequence. Active site residues which were various enzymes. The ability to construct at will fully previously utilized to construct a Synthetic catalytic functional peptide catalysts having the activity and Speci lysozyme peptide are indicated acroSS the top of the Figure ficity chosen by the investigator should find vast applica along with the Sequence number based upon the lysozyme tions. linear sequence. To the left of the Figure numbered refer The disclosure describes two catalyst designs that mimic the activities and specificities of CCT and trypsin, both ences are indicated which refer to the Sequences listed in B enzymes being proteases. Two 29-residue peptides were below. For each Such Sequence, divergences from the active prepared, one of which (ChPepz) was designed by Surface Site residue utilized to obtain lysozyme activity are shown. Simulation Synthesis to mimic the active Site of B. The first 100 Sequences demonstrating varying degrees of C-chymotrypsin (C-CT), while the other (TrPepz), which Sequence homology with the query Sequence. Homology contained four substitutions relative to ChPepz, was fash Searching was accomplished using FastDB-Fast Pairwise ioned after the active site of trypsin. The peptides were each Comparison of Sequences (Release 5.4) by Intelligenetics, cyclized by a disulfide bond. The ChPepz, monomer effected Inc., 700 East El Camino Real, Mountain View, Calif. hydrolysis of the ester group in N-benzoyl-L-tyrosine ethyl 94040. ester (BTEE), an CCT Substrate, with K, and k values that FIG. 18. A. Tabulated results of computer search of PIR were comparable to those of CCT. ChPepz was completely and SWISS-PROT databases utilizing Ribonuclease A 15 inactivated by diisopropyl fluorophosphate (DIFP), L-1-p- Bovine as the query Sequence. Active Site residues which tosylamino-2-phenylethyl chloromethyl ketone (TPCK) or were previously utilized to construct a Synthetic catalytic reduction of the disulfide bond. It had no catalytic activity on ribonuclease peptide are indicated across the top of the N-tosyl-L-arginine methyl ester (TAME), a trypsin Sub Figure along with the Sequence number based upon the strate. On the other hand, TrPepz, which had no effect on ribonuclease linear Sequence. To the left of the Figure BTEE, hydrolyzed TAME with a K value that was essen numbered references are indicated which refer to the tially identical to that of trypsin, while its k value was Sequences listed in B below. For each Such Sequence, almost half that of the enzyme. TrPepz was fully inactivated divergences from the active site residue utilized to obtain by reduction of the disulfide bond, by DIFP or by PMSF but ribonuclease activity are shown. B. The first 100 sequences not by TPCK. It was also completely inhibited by soybean demonstrating varying degrees of Sequence homology with 25 trypsin inhibitor, bovine pancreatic trypsin inhibitor and the query Sequence. Homology Searching was accomplished human C.-antitrypsin. ChPepz and TrPepz hydrolyzed pro using FastDB-Fast Pairwise Comparison of Sequences teins (myoglobin and casein) to give panels of peptides that (Release 5.4) by Intelligenetics, Inc., 700 East El Camino were similar to those of the same protein obtained with the Real, Mountain View, Calif. 94040. respective enzyme. However, TrPepz was more efficient FIG. 19. A. Tabulated results of computer search of PIR than trypsin at hydrolysing the C-bonds of two or more and SWISS-PROT databases utilizing Urokinase-Human consecutive lysine and/or arginine residues. Finally, like the as the query Sequence. Active site residues which were esteratic activity, the proteolytic activity of ChPepz was previously utilized to construct a synthetic catalytic uroki inhibited by DIFP or TPCK while that of TrPepz was nase peptide are indicated acroSS the top of the Figure along inhibited by DIFP or PMSD but not by TPCK. with the Sequence number based upon the urokinase linear To ensure that the ability to construct peptide enzymes Sequence. To the left of the Figure numbered references are 35 was not restricted to the two aforementioned proteolytic indicated which refer to the sequences listed in B below. For enzymes, Applicant Synthesized two catalysts which were each Such Sequence, divergences from the active site residue designed by Surface-Simulation to mimic the active Sites of utilized to obtain urokinase activity are shown. B. The first hen egg lysozyme (LYZ) and bovine ribonuclease A 100 Sequences demonstrating varying degrees of Sequence (RNase). The former (LyPepz), a 37-residue peptide 40 cyclized by an intramolecular disulfide bond, exhibited the homology with the query Sequence. Homology Searching muramidase activity and Specificity typical of LYZ. LyPepZ. was accomplished using FastDB-Fast Pairwise Comparison was able to effect complete hydrolysis of the cell wall of of Sequences (Release 5.4) by Intelligenetics, Inc., 700 East MicrococcuS lySOdeikticus with a K value that compared El Camino Real, Mountain View, Calif. 94040. well with that of LYZ, while its k value was 6.7 times FIG. 20. Design of the surface-simulation peptides mim lower than that of the whole enzyme. Like LYZ, LyPepz was icking the active sites of urokinase (UkPepz) and tissue 45 inhibited by imidazole, tryptamine and histamine. LyPepZ. plasminogen activator (Tppepz). (a) Shown are the contact became completely inactive when rendered acyclic by residues of the active site in the shaded areas and the reduction of the disulfide bond. The 24-residue cyclic (by a distances (CC-to-CO, in A) separating the appropriate resi disulfide bond) peptide, RnBepz, designed to mimic the dues. Residue numbers are based on urokinase sequence. (b) active site of RNase was able to completely hydrolyze yeast indicates the Surface-Simulation Synthetic peptides designed 50 RNA with a K value that was essentially identical to that to mimic the active sites. The residues in the shaded areas of RNase, while its k value was considerably lower than are the actual active site residues of urokinase while those that of the enzyme. The cyclic structure of Rn Pepz was linking these shaded areas are glycine Spacers introduced to important for its activity because the latter was completely achieve appropriate distances of Separation between the lost upon reduction of the disulfide bond. Like RNase, respective residues and regions of the site. (Cter) is the 55 RnBepz was inhibited by heavy metals and by denatured C-terminal Cys and (Nter) denotes the N-terminal Cys of the DNA. Finally, LyPepz had no activity on RNA and con peptides which are cyclized by the disulfide between the two versely Rn Pepz had no muramidase activity. CyS residues. The Outer Sequence which shows the Substi CHYMOTRYPSIN AND TRYPSIN tutions made to construct TpPepz, differs from UkPepz in 8 General Materials and Methods positions: Glu-27, Arg-28, Phe-29, Leu-30, Ala-45, Tyr-92, 60 Materials. Myoglobin was the major chromatographic com Ala-193, Ile-216. ponent (No. 10) isolated from crystallized sperm-whale DESCRIPTION OF PREFERRED myoglobin as described (Atassi, M. (1964) Nature (London) EMBODIMENTS 202, 496-498; to the extent that such references provide INTRODUCTION disclosure which would enhance the ability of the skilled The Applicant has now constructed by way of example a 65 practitioner to practice the invention described herein, all number of catalytically active Synthetic peptide catalysts references provided herein are specifically incorporated by that duplicate the catalytic activities and Specificities of reference). BTEE and TAME were obtained from Aldrich 5,861,477 17 18 Chemical Company. CCT, TPCK-trypsin, BPTI and soybean with 25 mM acetic acid. It is recommended that Small trypsin inhibitor were from Worthington Biochemical Cor molecular weight compounds be removed from the crude poration. Bovine milk f-casein (which contained about 10% cleaved peptide prior to the cyclization Step. The tubes C-casein), DIFP, PMSF, and human C.1-antitrypsin were containing the peptide were combined and freeze-dried. from Sigma Chemical Co. Reagents for peptide Synthesis Cyclization of the Peptide and Purification of the Monomer. and NCI.-Fmoc amino acids were obtained from Vega Bio Generally-A portion (50mg) of the synthetic product was technologies. dissolved in 8.0M urea containing 2-mercaptoethanol, pre Peptide Synthesis and Cleavage. The rationale for the design adjusted to pH 8.5 with triethylamine. The solution was of the peptides is given in the Examples. Certain versions of Stirred gently on a magnetic Stirrer for at least 3 hrs at room the peptides, ChPepz and TrPepz, were prepared by Solid temperature, after which it was applied on a column of phase Synthesis on a benzyloxybenzyl alcohol resin to which Sephadex G15, which was eluted with 0.025M acetic acid, 9-fluorenylmethylcarbonyl (Fmoc)-S-tert-butylcysteine had to remove the urea and mercaptoethanol. The fractions been coupled. The Side-chain protecting groups were: containing the peptide were pooled and diluted with at least aspartic, B-tert-butyl ester, cysteine, S-tert-butyl, histidine, 3 liters of 0.025M acetic acid and the pH was adjusted to im-trityl; lysine, e-tert-butoxycarbonyl, Serine and tyrosine, 15 approximately 8.0 on the pH meter by the addition of O-tert-butyl. The peptides were synthesized and cleaved triethylamine. The Solution was stirred magnetically at room from the resin by the procedures described elsewhere in temperature for at least 4 days and then freeze-dried. detail Atassi, M. Z., Manshouri, T. and Sakata, S. (1991) Specifically: ChPepz-ChPepz (35 mg) was dissolved in Proc. Natl. Acad. Sci. USA 88, 3613-3617. Synthesis on 0.5 ml of 8.0 m urea/10% 2-mercaptoethanol. The solution alkoxybenzyl alcohol resin (Wang resin) proceeds more was allowed to Stir at room temperature for 16 hours, after Smoothly (i.e. complete coupling of consecutive amino acids which it was desalted on a column (2.5x95 cm) of Sephadex is achieved more rapidly) than on methoxyalkoxybenzyl G-15 which had been pre-equilibrated and then eluted with alcohol resin. 0.025M CHCOOH. Fractions containing the peptide were Fully Automatic Synthesis has routinely given Applicant combined and then diluted up to 4 liters having a final acetic inferior products, even with double-coupling of each resi 25 acid concentration of 2 mM. After cooling the Solution down due. The product obtained by fully automatic synthesis is to 4 C., its pH was adjusted to 7.8-8.0 with triethylamine very heterogeneous with many components that have dele and allowed to stir for 9-10 days with full exposure to Air tion and/or additions. Machine Synthesis can yield better (covered with a Kimwipe TM which is secured around the quality products if Synthesis is checked for completeneSS neck of the flask with a rubber band). The pH was then after each coupling by the ninhydrin reaction on a few beads adjusted down to 5.0 with acetic acid and the solution was of the resin and recoupling done and repeated as necessary either flash-evaporated under vacuum (but not allowed to until the ninhydrin reaction becomes completely negative. warm up above 30° C.) or freeze-dried. Manual synthesis has proven to routinely give a better Reduction was done in 8.0 urea to ensure complete product in the Applicant's hands. In either case, completion unfolding of polymeric Species and thus make the disulfide of coupling of each residue was monitored by ninhydrin and 35 bonds more accessible to reduction. Concentration of the recycled as needed until the ninhydrin reaction became 2-mercaptoethanol and the duration of the reduction were negative (certain positions have required 3 or 4 coupling increased to ensure completeness of the latter. cycles using 3 molar excess of Fmoc-amino acid derivative Cyclization was done under very dilute conditions (6-8 at each cycle). Aug/ml) to minimize inter-molecular disulfide bond forma Specifically, for cleavage of peptides from the resins, 10 40 tion. Cyclization was done at 2-4° C. (in the cold room) ml of DCM (CHCl) was added to peptide-resin (1g) in a because chymotryptic activity of ChPepz (and for that Sintered-disc reaction vessel. The resin was allowed to Swell matter of chymotrypsin itself) is very negligible at 4°C. This for about 1 hour. DCM was then removed by N. pressure technique avoided auto-hydrolysis of ChPepz. Cyclization and 10 ml of Reagent R (TFA-thioanisole-1,2-ethanedithiol was done at pH 7.8-8.0 and never allowed to exceed pH 8.5. anisole,9: 5:3:2 vol/vol) was added. The resin was shaken 45 Before concentrating the cyclized pepzyme, the pH was slowly at room temperature for 3 hours after which it was necessarily adjusted down to 5.0. This prevented the pep filtered and rinsed twice (7 ml each) with Reagent R. The Zyme from digesting itself when the Solution is concentrated three filtrates were combined and rotary evaporated under on the flash evaporator. Air oxidation was done slowly. Vacuum down to 2-3 ml. The peptide Solution thus concen Bubbled oxygen and Vigorous agitation was avoided. trated was added dropwise to 30 ml cold (0°C) ether upon 50 Specifically: TrPepz-Procedures were the same as which it precipitated. It was shaken gently after addition of described for ChPepz except that cyclization of TrPepz was 20 ml more of ether. The precipitate was washed with cold done at room temperature. Additionally, at the completion of ether on the centrifuge five times (20 ml each time), dis the cyclization reaction, the pH was adjusted down to 6.0 solved (or suspended) in water or 25 mM CHCOOH and with acetic acid before the Solution was concentrated or freeze-dried (yield of crude product approximately 0.52g). 55 freeze-dried. Care was taken to ensure that tryptophan was not modified The dry peptide (approx. 30 mg) was dissolved in 1 ml of during cleavage because thioanisole can cause N-alkylation 0.25M acetic acid, centrifuged to remove insoluble material, of tryptophan. Inadvertent alkylation of tryptophan was and Subjected to ascending chromatography on two columns checked by amino acid analysis of p-toluenesulfonic acid (90x2.5 cm each) of G50-fine, connected in series and eluted hydrolysates (Liu, T. Y. and Chang, Y. H., J. Biol. Chem. 60 with 0.25M acetic acid. The fractions (2-3 ml) were moni 246: 2842 (1971). tored by absorbance at 230 and 280 nm. The peak of the To the cleavage product (0.2g) was then added 2 ml of 25 monomer, eluting at 750 ml, was well resolved from the mM acetic acid. At this concentration, the peptide is not oligomers which eluted earlier as a Single peak. The oligo completely soluble. Therefore, it was centrifuged (2500 mers were Saved for reduction and re-cyclization to obtain rpm, 30 min; the residue is saved for another cyclization) 65 more monomer. The tubes containing the monomeric Spe and the Supernatant was applied to a column (2.5x95 cm) of cies were pooled and freeze-dried (yields: ChPepz, 26.7%; Sephadex G-15 which had been pre-equilibrated and eluted TrPepz, 33.2%). 5,861,477 19 20 Alternatively, the cyclized product (4-5 mg) was frac (1) Chromatography No. 1: Monomer prepared above was tionated by HPLC on a Phenomenex Polysep 2000 molecu subjected to reverse-phase HPLC. lar sizing column (7.8x600 mm). The column was eluted at Solvent A: 0.3% acetic acid in water, adjusted to pH 5.5 the rate of 0.5 ml/min (0.5 ml per fraction) with 5 mM acetic with TEA adjusted to pH 5.5 with triethylamine. The chromatographic Solvent B: Acetonitrile-solvent A (9:1), vol/vol) operation was repeated about 12 times, with 4-5 mg being Elution gradient: applied on the column each time and the effluent of each run 0-15% B, 5 min. was collected in the tubes of the respective elution time. The 15-35% B, 25 min. fractions belonging to each peak were combined, freeze 35-60% B, 5 min. dried and tested for activity with BTEE. The fraction eluting 60-80% B, 5 min. at 50 minutes, which corresponds to the cyclic monomer, At least 19 peaks are obtained. Peak 18 possessed hydro possessed Some activity and was further purified by reverse lytic activity towards tosyl-L-arginine methyl ester (TAME) phase HPLC. Yield of monomer using this method was Peak 18 was reapplied on the Same column approximately 18-33%. (2) Chromatography No. 2 Generally, the monomer was further purified by HPLC on Solvent A: 0.1% acetic acid in water, adjusted to pH 5.5 a 5 um Cs column (10 mm IDx25 cm) using a gradient of 15 with triethylamine 0.05% acetic acid-triethylamine pH 5.5, and acetonitrile in Solvent B: Acetonitrile-Solvent A (9:1, vol/vol) 0.05% acetic acid (9:1 vol/vol). The fractions were moni Elution gradient: tored by absorbance at 256 nm and by their hydrolytic 0-35% B, 10 min. activity toward BTEE (for ChPepz) or TAME (for TrPepz). 35-60% B, 25 min. The active fraction was freeze-dried and reapplied on the 60% B, 5 min. Same column using a gradient of 0.1% acetic acid and Thirteen peaks were obtained (of which peaks 1, 5, 6 and acetonitrile in 0.1% acetic acid (9:1 vol/vol). The 7 were major). Most of the activity towards TAME coin catalytically-active fractions (yields: ChPepz, 12.1%; cided with peak 7. TrPepz, 11.5%, of the monomer) were homogeneous by high Peak 7 was reapplied on the same column. Voltage paper electrophoresis and by analytical HPLC and 25 (3) Chromatography No. 3 their amino acid compositions were in excellent agreement Solvent A: 0.1% acetic acid in water, adjusted to pH 5.5 with those expected from their covalent Structures. with triethylamine Specifically, the further purification by reverse-phase Solvent B: Acetonitrile in solvent A (9:1, vol/vol) HPLC of ChPepz, monomer was accomplished as follows: Elution gradient: The monomeric fraction was further purified by reverse 0–15% B, 8 min. phase HPLC as follows: 15-27% B, 30 min. Column: C18, semi-preparative, 10 mmx25 mm (Rainin) 27–30% B, 5 min. (1) Chromatography No.1: Monomer prepared above was Peak 7 from chromatography no. 2 was found to be further purified by reverse phase HLPC essentially pure giving four very minor peaks and one major Solvent A: 0.05M acetic acid, adjusted to pH 5.5 with 35 peak (peak no. 5). Peak 5 accounted for 89.9% of the total. triethylamine. Further purification of peak 5 was unnecessary. Yield of Solvent B: 90% acetonitrile+10% solvent A peak 5 was between 5.4 to 12.0% of the monomer, depend Gradient: ing on the Synthesis. Molecular weight calculations and 0-15% B, 5 min purity determinations may also be accomplished by electro 15-35% B, 25 min 40 Spray and/fast atom bombardment mass spectroScopy. 35-60% B, 5 min Measurement of Catalytic Activity 60-80% B, 5 min Chymotryptic activity. The catalytic activity of CCT and Nine peaks were obtained, peak 9 exhibited hydrolytic CHPepz was determined by hydrolysis of BTEE in 0.08M Tris buffer, pH 7.8 containing 0.01M CaCl as described activity on BTEE Peak 9 was reapplied on the same column. (Hummel, B. C. W. (1959) Can. J. Biochem. Physiol. 37, (2) Chromatography No.2 45 1393-1399). Assays were done at 25° C. using 8.06x10" Solvents A and B, Same as in chromatography no. 1 umoles of CCT or 7.49x10" umoles of ChPepz and differ 0-35% B, 10 min ent substrate concentrations (from 3 to 6 mM) in a total 35-60% B, 25 min. reaction volume of 1.0 ml. The change of absorbance at 256 60% B, 5 min nm was monitored on a recording spectrophotometer against Peak 9 (from chromatography #1) was resolved into 11 50 peaks. Activity towards BTEE resided in peak 5. a reference cuvette containing 1 ml of the same concentra Peak 5 was reapplied on the Same column tion of BTEE, but without C.CT or ChPepZ. Controls (chromatography no. 3) included trypsin, TrPepz (which are inactive against BTEE) (3) Chromatography No. 3 and Several linear and cyclic peptides from Applicant's Solvents A and B, Same as in chromatography no. 1 55 peptide library although any Such controls would work 100% A (0% B). 1 min equally as well. 0-15% B, 8 min Specifically, reagents and procedures used to measure 15-27% B, 30 min chymotryptic activity were as follows: 27–30% B, 5 min Reagents: Activity resided in peak 4 60 a. 0.08M Tris-HCl, containing 0.01M CaCl, pH 7.8 Recovery peak 4-5.18%–12.06% of monomer, depend b. N-Benzoyl-L-tyrosine ethyl ester (BTEE) dissolved in ing on the Synthesis. 50% (w/w) Methanol (63 ml CH-OH+50 ml water). Specifically, the further purification by reverse-phase ASSay points in the range 90-180 ug per assay were HPLC of TrPepz monomer was accomplished as follows: used. The monomeric fraction was further purified by reverse 65 c. ChPepz in 0.001N HCl. 1-10 ug was used. phase HPLC Column: C-18, semi-preparative 10 mmx25 d. Negative controls: Several peptides that are unrelated to mm (Rainin) ChPepz were used. Solutions of these control peptides 5,861,477 21 22 were made in the same solvent (10 NHCl) and at the g. Total reaction volume 1.01 ml. Same concentrations as ChPepZ and test Samples. To TAME solutions (1 ml containing different amounts of e. C-chymotrypsin (CCT)(positive control). 1-5 ug was Substrate), an aliquot (10 ul) of TrPepz, or negative control used. Importantly, it was critical to always dedicate a peptide Solutions (containing 1-10 ug/10 ul) was added, Separate cuvette for this positive control to ensure no 5 mixed immediately and the increase of absorbance at 247 cross-contamination. nm was measured at 37 C. (or at room temperature). f. Total reaction volume 1.01 ml. TrPepz solutions were always made immediately before 1.0 ml of substrate (BTEE) was mixed with solutions (10 use. Although TrPepZ has no potential tryptic cleavage All aliquots) of ChPepz, or negative control peptides points, it lost activity on Standing in Solution at 4 C. (containing 1-10 ug/10 ul) or CCT and activity was moni- 10 (activity was completely lost in two days). Activity was lost tored by increase in absorbance at 256 nm at 37° C. (or at room temperature). by freezing and thawing (completely lost after 5 Such ChPepz solution was made in 10 NHCl, since it will not cycles). TrPepZ will not act on chromogenic Substrates—it digest itself in this solvent. Solutions of ChPepz were always was inhibited by p-nitro analine (and probably is inhibited made immediately before use. Activity was typically lost on by other aromatic amines). After mixing TrPepz, with Storage in Solution (completely lost in 2 days at 4 C.). 15 Substrate, there was a lag of 2-5 minutes (depending on Activity was typically lost by freezing and thawing pepzyme/Substrate ratio) before any change in A, was (completely lost after 5 such cycles). ChPepz will not act on observed. In the early Stages of the purification, the activity chromogenic Substrates-it was inhibited by p-nitroaniline, was always monitored over a period of Several hours. analine and benzamidine (and is probably inhibited by other Negative controls (containing unrelated peptides in amounts aromatic compounds). After mixing ChPepz, with substrate 20 similar to those of TrPepz in the test samples) were also (BTEE), there was typically a lag of 2-5 min (depending on monitored at the same time because the background absor pepzyme/Substrate ratio) before any change in As was bance will increase and the test Samples had to be corrected observed. However, in the early Stages of the purification, for this background. AS the Sample's purity improved, the activity needed to be monitored over a period of several maximum A-7 was achieved more quickly. hours. It was important to ensure that negative controls 25 Inhibition of Enzymatic Activity. The effects of inhibitors or (containing unrelated peptides in amounts similar to those of disulfide bond reduction on enzymatic activities were done ChPepz, in the test Samples) were also monitored at the same as described above except that the enzymes were pre-mixed time because the background absorbance will increase and (3 hrs, 25°C.) with 10 molar excess of inhibitor (or DTT) the test Samples had to be corrected for this background. AS prior to addition to the substrate (140-150 molar excess the Sample's purity improves, maximum A256 was achieved so relative to the catalyst). Activities were monitored spectro more quickly. See, e.g., Atassi, M. Z. in Immunochemistry of photometrically as above and were compared to uninhibited Proteins, ed. M. Z. Atassi, Vol. 2, pp. 77-176, Plenum, New York, N.Y. (1977); King, D. S., et al. Int. J. Pept. Prot. Res. controls. 36:255-266 (1990). Hydrolysis of Proteins by Enzymes or Peptide Enzyme Tryptic activity. Measurement of tryptic activity was done in Catalysts. Hydrolyses were done at 37° C. on aliquots (200 0.046M Tris-HCl buffer, pH 8.0, containing 0.0115M CaCl 35 All) containing 1 mg of myoglobin (5.6x10-2 umoles) or as described (Hummel, B. C. W. (1959), id) using TAME as casein (4.2x10f umoles) in 0.1M triethylamine-acetic acid the substrate. For the assays, 8.4x10" umoles of CCT or 1.6 buffer, pH 8.0 with 51 lug of C.CT (2.06x10 umoles) and c 10 umoles of TrPepz were allowed to hydrolyze different 2.9 tug of ChPepz (1.1x10 umoles) for 3% hrs or with 49 amounts of substrate at 25 C. in a total reaction volume of tug trypsin (2.06x10 umoles) and 5.0 ug TrPepz (1.87x 1.0 ml. Hydrolysis was monitored on a recording spectro 40 10 umoles) for 8 hrs. The samples were then acidified (to photometer by change in absorbance at 247 nm against a pH 3.0) with 0.1M HCl freeze-dried and redissolved in 100 reference cuvette containing the same concentration of All of HO at pH 3.0, and the entire Sample was applied as a TAME, but without trypsin or TrPepZ. C.CT, ChPepz (which single spot to Whatman No. 3MM paper and subjected, in do not hydrolyze TAME) and several linear and cyclic the first dimension, to ascending chromatography in peptides from Applicants library were used as controls 45 n-butanol-acetic acid-water (4:1:5, vol/vol) followed by although others would work as well. Kinetic constants for high voltage electrophoresis (3000 volts, 55 min), in the hydrolysis of BTEE and TAME were determined from the Second dimension, in pyridine-acetic acid-water (1:10:289, linear plots of 1/initial velocity (Vi in tumoles/min) versus vol/vol), pH 3.65, as described (Atassi, M.Z. and Saplin, B. 1/Substrate concentration as described (Line weaver, H. and J. (1968) Biochemistry 7, 688–698). The papers were then Burk, D. (1934) J. Amer: Chem. Soc. 56,658). 50 dried, steamed and stained with 0.2% ninhydrin in ethanol Specifically, reagents and procedures used to measure and the color was allowed to develop at room temperature. tryptic activity were as follows: The papers were photographed 48 hrs after Staining. Reagents: Chymotrypsin and Trypsin Examples Example I: Hydrolysis of Ester Substrates by C.CT and a. 0.046M Tris-HCl, containing 0.0115M CaCl, pH 8.0. 55 ChPepz b. Tosyl-L-arginine methyl ester (TAME) Like CCT, the action of ChPepz on BTEE caused hydroly c. Solutions of TAME were made in the Tris-HCl buffer sis of the ester bond (FIG. 1). Lineweaver-Burk plots (FIG. noted in a. ASSay points in the range 100-200 lig 2) of experiments at different Substrate concentrations TAME/ml were used. showed that the kinetic constants of the hydrolysis of BTEE d. TrPepz in the Tris-HCl buffer noted in a 1-10 ug was go by CCT and by ChPepz were comparable (Table 1). The used in each assay. values of K, (a measure of Substrate affinity) for ChPepz e. Negative controls: Several peptides unrelated to TrPepZ. and CCT were almost identical, while the k value for Solutions in the Tris-HCl buffer were made and at the ChPepz was only slightly lower than that of CCT. The Same concentrations as the TrPepZ test Samples. specificity constants ki/K) for BTEE with CCT and f. Trypsin-TPCK (positive control). 1-5 lug were used in 65 ChPepz were also quite comparable (Table 1). The activity each assay. Applicant always dedicated a separate of ChPepz was completely inhibited by the CCT inhibitors, cuvette for this. TPCK and DIFP, and also completely lost when rendered 5,861,477 23 24 acyclic by reduction of the disulfide bond. ChPepz had no Lys-Arg-His-Gly-Leu-Asp-Asn (SEQ ID NO:8), Asp-Asn hydrolytic activity on TAME (which is a trypsin Substrate) Tyr-Arg-Gly-Tyr-Ser-Leu-Gly (SEQ ID NO:9), Ala-Lys and, as mentioned below, TEE was not hydrolyzed by Lys-Ile-Val-Ser-Asp-Gly (SEQID NO:10) were completely TrPepz or by control cyclic and linear peptides that are not cleaved by TrPepz at the C-bond of the lysine and arginine related to C.CT. residues indicated. No other products were obtained. The proteolytic activity of TrPepz was investigated on two TABLE 1. proteins, myoglobin and casein. In each case, the peptide pattern of the TrPepz hydrolysate was essentially the same Kinetic constants for hydrolysis of BTEE by CCT and ChPepz and of TAME by trypsin and TrPepz' as the pattern of the respective protein obtained by tryptic hydrolysis (FIG. 5). However, TrPepz was in fact more Kn kcal kcal/Kin efficient at hydrolyzing Lys-LyS, Lys-Lys-LyS, Arg-LyS and (M x 10) (sec.) (M'Sec") Lys-Arg bonds as evident from the higher yields of lysine Hydrolysis of and arginine in hydrolyses by TrpepZ, as compared to those BTEE by: by trypsin. Finally, the proteolytic action of TrPepz, like that 15 ChPepz 1.11 + 0.15 1478.5 1.32 x 10 of trypsin, was completely inhibited by DIFP or PMSF while CCT 1.07 - O16 185 - 10.3 1.72 x 10 TPCK had no effect. Hydrolysis of Design of the Synthetic Peptide Catalysts TAME by: AS Stated above, a Surface-Simulation Synthetic peptide was constructed (Atassi, M. Z., Biochem. J. 226: 477-485 TrPepz 2.42 0.09 85 - 2.6 3.5 x 10 (1985)) that possessed the expected binding activities of Trypsin 2.56 - 0.16 221 - 9.7 8.6 x 10 trypsin with Substrates and inhibitors but had no significant 'Values of the constants for ChPepz and CCT were obtained at pH 7.8 and 25 catalytic activity. Nevertheless, the ability to produce Sub C. while those for TrPepz and trypsin were derived from measurements at pH Strate binding encouraged Applicant to improve the designs 8.0 and 25 C. For details, see the text. k. and kCat values for whole enzymes in general agreement with reported values. See, e.g., Martin, C. J., et al., J. to achieve an enzymically active peptide. Applicant made 10 Biol. Chem. 234: 1718-1725 (1959); Ronwin, E., Biochem. Biophys. Acta 25 design changes. The following cyclic peptide designs were 33: 326–332 (1959); Hummel, B. C. W. Can. J. Biochem. Physiol. 37: active (shown here for the trypsin active site): design 1, 1393-1399 (1959); Hartley, B.S., in Structure and Activity of Enzymes, pp. Cys-Asp-Ser-Gly-Gly-Val-Ser-Trp. Gly-Gly-Leu-Gly-Asp 47-60, Academic Press, New York, NY (1964); Hahn, K. W. et al., Science 248: 1544-1547. Gly-Ala-Ala-His-Gly-Gly-Phe-His-Tyr-Cys (SEQ ID *Note that BTEE is not hydrolysed by trypsin or TrPepz and TAME is not NO:11); design 2, Asp-Ser-Gly-Gln-Cys-Asp-Ser-Gly-Gly hydrolysed by CCT or ChPepz. Val-Ser-Trp-Gly-Gly-Leu-Gly-Asp-Gly-Ala-Ala-His-Gly Example II: Activity of TrPepz on Ester Substrates Gly-Phe-His-Tyr-Cys (SEQ ID NO:12); design 3, Cys-Phe In its action on TAME, TrPepz exhibited an activity which Gly-Gly-Ser-Asp-Gly-Gln-Gly-Ser-Asp-Gly-Gly-Val-Ser was very much like that of trypsin. Lineweaver-Burk plots Trp-Gly-Leu-Gly-Gly-Asp-Gly-Ala-Ala-His-Cys (SEQ ID of reactions at different Substrate concentrations showed that NO:13) (active site residues are underlined; others are TrPepz had an affinity for the substrate (K) which was 35 Spacers). Under the conditions described above, the rate of similar to that of trypsin. It hydrolyzed TAME at a rate hydrolysis of TAME by the final TrPepz design (FIG. 7) was which was about 40% relative to the rate obtained with the better than designs 1-3 by 250, 123, and 72 times, respec enzyme itself (Table 1). The hydrolytic activity of TrPepz on tively. The peptide maps of 144-hr Mb or casein hydroly TAME was completely lost by reduction of the disulfide Sates by designs 2 and 3 compared well to the respective bond. The activity of TrPepz on TAME was completely lost 40 tryptic hydrolysates. The cyclic Structures of designs 1-3 by reduction of the disulfide bond. The activity was also were essential for activity. Similar results were obtained inhibited entirely by DIFP, PMSF, soybean trypsin inhibitor, with the corresponding analogs of the CCT active site. BPTI and human C.-antitrypsin (FIG.3). On the other hand, Example V: Surface Simulation of Chymotrypsin Active the activity was not affected by TPCK. TrPepz had no effect Site on BTEE and, an mentioned above, TAME was not hydro 45 The essential residues of the active sites of OCT and lyzed by ChPepz or by control unrelated (to trypsin) cyclic trypsin are shown in FIG. 6. These residues have been or linear peptides. Thus, the replacement of residues con implicated as essential parts of the active site by chemical verted the catalyst activity from chymotryptic to tryptic. and crystallographic evidence which has been reviewed Example III: Proteolytic Activity of ChPepz (Atassi, M. Z. (1985) Biochem. J. 226, 477-485). FIG. 7 To confirm that the ability of the two synthetic catalysts 50 shows that position of the contact residues in the Sequence to hydrolyze, in a specific manner, the correct amino acid (using bovine chymotrypsinogen sequence numbers) and the ester Substrate was a true proteolytic activity, it was neces distances (in A) Separating them. The Sequence was obtained Sary to examine their action on peptide and protein Sub from reference (Meloun, B., Kluh, I., Kosta, V., Moravek, strates. The action of ChPepz on Mb and casein resulted in L., Prusik, Z., Vanecek, J., Keil, B. and Sorm, F. (1966) the hydrolysis of the respective protein by CCT (FIG. 4). 55 Biochim. Biophys. Acta 130,543–546) and the X-ray coor The hydrolysis of the proteins by ChPepz was quite efficient dinates are known to 1.68 A resolution (Tsukada, H. and and was achieved in a time-frame that was enzyme-like. The Blow, D. M. (1985) J. Mol. Biol. 184703 (Protein Data activity of ChPepz on protein substrates was completely Bank) entry code 4CHA). In order to determine the appro inhibited by TPCK. priate length of Spacers to be used for linking the active site Example IV: Proteolytic Activity of TrPepz 60 residues, we calculated the average (of 15) Co-to-C' dis In order to further ascertain that TrPepz, behaved like tances in Single peptide bonds and two and three consecutive trypsin in activity and possessed its specificity, it was tested peptides bonds. These were: one peptide bond (C-to-C), on model peptides and proteins. Its action (3hrs; molar ratio 3.80+0.20 A; two peptide bonds (distance between the first of Substrate/TrPepz, 1:60) on the peptide Gln-Leu-Glu-Pro and the third C. carbons in C-C-C), 6.20+1.00 A; three Ser-Thr-Ser-Ser-Ala-Val-Pro-Leu-Ile-Gly-Lys-Gly (SEQ ID 65 peptide bonds (distance between the first and the fourth C. NO:7) resulted in almost complete (over 95%) hydrolysis of carbons in C-C-C-C), 7.94+2.07 A. Therefore, the the Lys-Gly bond. The lysozyme sequences: Ala-Ala-Met distances separating the contact residues (FIG. 7A) could be 5,861,477 25 26 well accommodated by the glycine Spacers shown in FIG. lyze proteins producing, from a given protein, peptides that 7B. In Surface-Simulation Synthesis, glycine residues have were essentially the same as those produced by CCT. been found (Atassi, M. Z. (1986) in Protein Engineering, In addition, test were performed which compared the Applications in Science, Medicine and Industry (Inouye, M. activity of the two proteases under differing temperature and Sarma, R., Eds.) pp. 125-153, Academic Press, Orlando, parameters. If temperatures were changed to a lower 10 C. Fla.) to be most Suited for use as Spacers, probably because or a higher 48 C., and all other parameters being equal, each of their flexibility and the fact that they provide no inter of the two synthetic catalysts were compared to their fering Side chains. The cyclic design, which is crucial for enzyme parent molecule, an interesting pattern is demon activity, requires closure. The Applicant measured Several strated as shown below. disulfide bonds in proteins and found that the C*-to-C distance in Cys-SS-Cys is 5.5+0.4 A (range 5.11-5.93 A). Closure of the peptide (to obtain a cyclic structure) could be % Activity of Native Enzyme at 35 C. achieved anywhere an appropriate Space occurs in the Struc Temp. Chymotrypsin ChPepz Trypsin TrPepz ture provided the bond angles in the disulfide bridge do not 48 O 3.5 67 approx. 50 interfere or induce undue distortion in the orientation of 15 35 1OO 1OO 1OO approx. 40 essential active site residues. The best design was that 1O 6.8 12.6 4.5 7.4 obtained by closure between Cys-58 and Phe-39 (or Tyr in TrPepz) which are separated by 13.16 A (C-to-C). This is Thus, not only do the synthetic catalysts exhibit activity at satisfied by a Gly-Cys spacer which, with the S-S bond, temperatures at which the native enzyme fails to do So, but would give an effective separation of 11.7+1.4 A. The inner also the Synthetic catalysts may actually exhibit improved Sequence in FIG. 7b was designed to mimic the active site activity over that Seen for the Synthetic catalyst at Standard of CCT, while the outer sequence (which differs from the temperatures. inner sequence by four residues: Phe-39->Tyr, Ile-99->Leu, Example VII: Surface Simulation of Trypsin Active Site Ser-189->Asp and Met-192->Gln) mimicked the active site To further confirm that an enzymically-active peptide of trypsin. The amino acid Sequence of bovine trypsin and 25 design had been achieved, an analog was Synthesized in the X-ray coordinates for its active site were from references which four residues were Substituted to obtain a peptide (Mikes, O., Holeysovsky, V., Tomasek, V. and Sorm, F. (TrPepz) that would then mimic the active site of trypsin. (1966) Biochem. Biophys, Res. Commun. 24, 346-352) and Trypsin does not hydrolyze BTEE but hydrolyzes TAME. (Marquart, M., Walter, J., Deisenhofer, J., Bode, W. and TrPepZ behaved precisely like trypsin, exhibiting an almost Huber, R. (1983) Acta Crystallogr., Section B 39, 480 identical Substrate dissociation constant (K, and its k and (Protein Data Bank entry 2PTN)), respectively. k/K values were about half the corresponding values of Example VI: Functional Behavior of the Synthetic Catalyst the enzyme (Table 1). The values of trypsin K (2.56x10 The Surface-simulation synthetic peptide, ChPepz, which 3M) and k (221 sec') found here were in agreement with was designed to mimic the active site of OCT behaved the reported values of 2.76x10M (Lorand, L. et al., 1961) functionally very much like the enzyme itself. The kinetic 35 and 187sec (Martin, C.J., Golubow, J. and Axelrod, A. E. constants for hydrolysis of BTEE by ChPepz and by CCT (1959) J. Biol. Chem. 234, 1718–1725), respectively, at pH were comparable (Table 1). The affinity of ChPepz for the 8.0 and 25 C. The activity of TrPepz was completely Substrate, as evidenced from the K value, was very similar inhibited by DIFP, PMSF, soybean trypsin inhibitor and to that of the whole enzyme. The k values indicated that human C-1-antitrypsin, all known to be inhibitors of trypsin. ChPepz effected hydrolysis at a rate which was in the same 40 Like ChPepz, the cyclic structure of TrPepz was essential for order of magnitude as, and only slightly lower than, that of activity. The most Striking finding was the exquisite speci the whole enzyme. Our k value (185 sec') for BTEE ficity of TrPepz for cleavage of the C-peptide bonds of hydrolysis by CCT is similar to the value of 193 sec' arginine and lysine residues in peptides and proteins. Its reported in the literature (Hartley, B. S., 1964) at pH 7.9 and action on myoglobin or casein resulted in hydrolysis of each 25 C. The values of the specificity constant (k/K) of 45 protein into peptide fragments that were Similar to those ChPepz and CCT were also comparable indicating that obtained by hydrolysis with trypsin itself. In fact, TrPepz BTEE functioned equally well as a substrate for both was more efficient than trypsin at hydrolyzing Lys-LyS, ChPepz and C.CT. The similarity of the kinetic constants of Lys-Arg, Arg-LyS and Lys-Lys-LyS, bonds. Thus, the Substi OCT and ChPepz, and the inhibition of the ChPepz, catalytic tution of four amino acid residues in the ChPepZ design activity by DIFP (a serine esterase inhibitor) and by TPCK 50 caused an unequivocal functional conversion from a chy (an CCT inhibitor) Suggest that the catalytic process by motryptic to a tryptic activity. ChPepz must employ the same mechanism as C.CT. The LYSOZYME AND RIBONUCLEASE decrease in the rate of catalysis of ChPepz is probably General Materials and Methods caused by the flexibility of the peptide as it Searches, through Materials. Hen egg lysozyme, MicrococcuS lySOdeikticus, an equilibrium of conformational States and induced fit, for 55 bovine pancreatic ribonuclease A and yeast RNA were a catalytically-productive conformation. The virtual loSS of obtained from Worthington Biochemical Corp. Imidazole catalytic activity when the peptide is rendered acyclic (by was purchased from Eastman Organic Chemicals. reduction of the disulfide bond) is most probably due to the Tryptamine, histamine and dithiothreitol were from Aldrich inability of the open-chain Structure to achieve Such a Chemical Co. Silver nitrate, analytical grade, was from conformation. This would explain why the first generation of 60 Fisher Scientific and herring testes DNA from Sigma Applicant's Synthetic active Sites, which employed an open Chemical Co. Reagents for peptide synthesis and N'-Fmoc chain design (Atassi, M. Z. (1985) Biochem. J. 226, amino acids were obtained from Vega Biotechnologies. 477-485) exhibited binding but did not possess measurable Synthesis and Purification. The rationale for the design of catalytic activity. The inability of ChPepz to hydrolyze the peptides and their Structures are given in the examples TAME, which is a trypsin substrate, further confirmed that 65 below. The peptides, LyPepz and Rn Pepz, were prepared by this peptide had an CCT-like specificity. But the most Solid phase Synthesis on a benzyloxybenzyl alcohol resin to compelling performance of ChPepz was its ability to hydro which 9-fluorenylmethyl-carbonyl (Fmoc)-S-ter-butyl cys 5,861,477 27 28 teine had been coupled. The methods for Synthesis, cycliza 40–60% B, 10 min. tion and isolation of the monomer have been described The material was resolved into 2-6 peaks. Peak No. 8 was above. The oligomeric Species was Saved for reduction and found to possess the most activity. Peak 8 was reapplied on re-cyclization. The monomeric Species was further purified the same column. by HPLC on a 5 um C18 column (10 mm IDx25 cm) using (2) Chromatography No. 2 the following gradients: LyPepz, solvent A, 0.1% acetic Solvent A: 0.1% acetic acid, adjusted to pH 5.5 with TEA acid-triethylamine, pH 5.5, and solvent B acetonitrile-0.1% Solvent B: Acetonitrile-Solvent A (9:1, vol/vol) acetic acid (9:1 vol/vol); RnFepz, solvent A, 0.1% trifluo Elution gradient: roacetic acid (4:1, vol/vol). The fractions were monitored by 0–20% B, 3 min. absorbance at 256 nm and by their hydrolytic activity toward 1O 20–40% B, 40 min. M. lysodeikticus (for LyPepz) or yeast RNA (for RnPepz). 40–60% B, 5 min. The active fraction was freeze-dried and applied on the same Catalytic activity was found in peak #2. Peak 2 corre column using the same respective Solvents but employing sponded to 45% of the total material of peak 8. Peak 2 was gradients that were leSS Steep. The active Site fractions pure and no further purification was needed. (yields: LyPepz, 12.9%; RnFepz 12.2% of the monomer) 15 Specifically, the following reagents and methods were were homogeneous by high Voltage paper electrophoresis applied to cyclization, isolation and purification of the and by analytical HPLC, and their amino acid compositions RnBepz product: were in excellent agreement with those expected from their Reduction of Rn Pepz, crude product was done in covalent Structures. Cyclic peptides, which are unrelated to mercaptoethanol-8M urea (1:9, vol/vol) as described for LYZ or RNase and which were used as negative controls in ChPepz and TrPepz. The desalting of the reduced product the enzymic assays, were from Applicant's library of Syn was done on Sephadex G-15, as described for ChPepz. thetic peptides as described above. Isolation of the Monomer Specifically, the following reagents and methods were The Rn PepZ was isolated by Size exclusion chromatog applied to cyclization, isolation and purification of the raphy. LyPepz product: 25 Column: Protein Pack 60, 7.8 mmx300 mm (Waters) Cyclization Solvent: 0.05M NHHCO Peptide (25 mg) was dissolved in 0.5 ml of trifluoroacetic Rate: 0.7 ml/min acid (TFA) containing 2-mercaptoethanol (ME) (9:1, vol/ Six peaks were obtained. Peaks 1 and 2 contained oligo vol) and stirred at 4 C. overnight. The peptide was then mers. Peak 3 contained the active fraction (as monitored by precipitated with acidified ether trifluoroacetic acid (TFA) hydrolysis of yeast RNA) and corresponded to the monomer. -ether 0.5: 95.5, vol/voll, centrifuged and the precipitate Reverse-Phase HPLC washed 3 times with acidified ether. The peptide was dis The monomeric fraction (peak 3 from Protein Pack 60) solved in 0.5 ml of TFA and added drop by drop to 3 liters obtained above was further purified by reverse-phase HPLC of 0.6M TEA which was being stirred well by magnetic as follows: stirrer. The solution was stirred for 10 days in air (covered 35 Column: Reverse-phase C-18, semi-preparative 10 with a Kimwipe TM secured around the neck of the bottle by mmx25 mm (Rainin) a rubber band), after which it was rotary-evaporated in (1) Chromatography No. 1 vacuum (not exceeding 30° C) down to one ml. (The Solvent A: 0.05M acetic acid preparation became oily and showed Some precipitation.) Solvent B: solvent A+acetonitrile (1:4), vol/vol) Isolation of the Monomer 40 Elution gradient: The cyclized product was mixed with 2 ml of 0.1M 0–40% B, 10 min. NHHCO, centrifuged and the Supernatant was applied 40–85% B, 30 min. onto two columns containing Sephadex G-50 fine (1.6x90 Eleven peaks were obtained. The major one (peak 10) cm each) connected in Series and eluted in the ascending contained the catalytic activity. direction with 0.1M NHHCO. The second peak (eluting 45 (2) Chromatography No. 2 between 320-490 ml) contained the monomeric species Peak 10 from chromatography no. 1 was reapplied on the while the oligomeric species emerges earlier (eluting Same column between 193-318 ml). The fractions containing the mono Solvent A: 0.1% TFA in water mer were combined and immediately freeze-dried. The Solvent B: Solvent A+acetonitrile (1:4, vol/vol) material was dissolved in 2 ml of 0.025M acetic acid and 50 Elution gradient: reapplied on two columns (each 1.6x90 cm) of G-50 fine, 0–2% B, 1 min. connected in Series and eluted in the ascending direction 20-50% B, 10 min. with 0.025M acetic acid. The fraction eluting between 50-60% B, 25 min. 400-490 ml contained hydrolytic activity towards Micro 60-80% B, 5 min. COccus lySOdeikticus. 55 Nine peaks were obtained. Peaks 7, 8 and 9 were not well Reverse-Phase HPLC resolved. They were combined and reapplied on the same The monomeric fraction obtained above was further puri column. fied by reverse-phase HPLC as follows: (3) Chromatography No. 3 Column: Reverse-phase C-18, semi-preparative 10 Peaks 7-9 from chromatography no. 2 were reapplied on mmx25 mm (Rainin) 60 the same column (1) Chromatography No. 1 Solvent A: 0.1% TFA in water Solvent A: 0.1% acetic acid, adjusted to pH 5.5 with TEA Solvent B: Solvent A+acetonitrile (1:4, vol/vol) Solvent B: Acetonitrile-solvent A (9:1), vol/vol) Flow Rate: 1.5 ml/min Flow Rate: 1.5 ml/min Elution gradient: Elution gradient: 65 0–25% B, 1 min. 0-20% B, 3 min. 25–36% B, 30 min. 20-40% B, 40 min. 36-80% B, 10 min. 5,861,477 29 30 The material was resolved into 19 peaks. The highest with different amounts of yeast RNA (from 500 to 180 catalytic activity coincided with peak 17. ug/ml) in a total reaction volume of 1 ml of 0.1M sodium (4) Chromatography No. 4 acetate at pH 5. The decrease in absorbance at 300 nm with Same conditions as in chromatography no. 2 were time was measured on the recording spectrophotometer. repeated. Peak No. 10 was most active. No further purifi Reaction mixtures that contained no catalyst as well as cation of peak 10 was needed. mixtures containing LyPepZ or other cyclic peptides unre Measurement of Catalytic Activity lated to RNase were used as controls. The kinetic parameters Measurement of Lysozyme Activity. The kinetics of the of the hydrolytic activities were determined by Lineweaver catalytic activity of LYZ and LyPepz were determined by Burk plots (Lineweaver, H. and Burk, D. (1934) J. Amer: hydrolysis of M. lysodeikticus as described (Neville, W. M. Chem. Soc. 56,658-666) of 1/absorbance change per minute and Eyring, H. (1972) Proc. Natl. Acad. Sci. USA 88, against 1/Substrate concentration in mg per ml. 3613-3617). Assays were done at 25° C. in 0.125M NaCl Specifically, the following reagents and methods were using 1.5 lug (1.04x10" umole) of LYZ and 4.25 ug (1.13x used to measure RnPepz, activity: 10 umole) of LyPepz and different amounts of cell sus Reagents pension (in the range 330 to 100 ug/ml) in a total reaction 15 a. 0.1M sodium acetate buffer, pH 5.0 volume of 1.10 ml. The decrease in turbidity was measured at 450 nm on a recording spectrophotometer. Reaction b. Substrate (yeast RNA)-several dilutions in the range mixtures that contained no catalyst or had RnBepz or other 200 lug-600 ug/ml of buffer (a) cyclic peptides that are unrelated to LYZ (instead of LyPepz) c. RinPepz-several dilutions from 1 tug to 40 tug/100 ul of were used as controls. Kinetic constants were determined buffer (a). from the linear plots of 1/initial Velocity (in change of d. Ribonuclease A (positive control)—several dilutions absorbance/min) against 1/Substrate concentration in mg/ml from 1 tug to 4 ug per 100 ul of buffer (a) as described (Lineweaver, H. and Burk, D. (1934) J. Amer: e. Peptides unrelated to RnPepz (negative controls)- Chem. Soc. 56,658-666). To study the effect of reduction of several dilutions from 1 to 40 ug/100 ul of buffer (a). the disulfide bond on the catalytic activity, LyPepz (5.6 25 To perform the assay, 1 ml of a given Substrate Solution ug=1.51x10 umole) or LYZ (1.5 lug=1.04x10" umole) was mixed with 100 ul of RnPepz, or unrelated peptide were mixed with 10 molar excess of dithiothreitol. After solution. The activity (hydrolysis of the RNA) was moni reaction at room temperature for 3 hr, the reduced cyclic tored by change (decrease) in absorbance at 300 nm. The Synthetic peptide catalyst or LYZ were each added to 140 ug change in absorbance with the test samples (RnBepz) was of M. lysodeikticus (total reaction volume, 1 ml of 0.125M correlated for the Small change obtained with the same NaCl) and the activity was measured by the decrease of amount of unrelated peptides (negative controls) acting on absorbance at 450 nm as described above. The effects of the the corresponding concentration of Substrate. Precautions as lysozyme inhibitors, imidazole, tryptamine and histamine to assaying and Stability were the same as described for were also determined by pre-mixing the Synthetic catalyst or ChPepz and TrPepZ. LYZ for 3 hr at room temperature with a 10 molar excess of 35 The effects of disulfide-bond reduction on RNase and each inhibitor before addition to the cells, using the same RnBepZ activities were determined by premixing each cata amounts of catalysts and cells, as described for the effect of lyst (RNase, 1.095x10" umole; RnFepz, 2.42x10 umole) dithiothreitol. with 10 molar excess of DTT for 3 hr at room temperature, Specifically, the following reagents and methods were followed by addition to 600 ug of yeast RNA (total reaction used ot assay lysozyme activity: 40 volume, 1 ml. of 0.1M sodium acetate pH 5.0). The reactions Reagents: were monitored by the decrease in absorbance at 300 nm as a. O.125M NaCl above. The effects of inhibitors (denatured DNA or Ag") on b. LyPepZ Solution-prepared in Solvent (a), containing activities were also done by pre-mixing RNase or RnPepz 1-40 tug/200 ul with denatured DNA (250 lug) or with 10 molar excess of 45 AgNO prior to addition to RNA, using the same amounts c. Negative controls-peptides unrelated to LyPepZ in of catalyst and Substrate described for DTT, Denaturation of solvent (a) from 1-40 tug/200 ul DNA was done by boiling a solution in water (1 mg/ml) for d. Micrococcus lysodeikticus solution in solvent (a)- 10 min, then chilling immediately in ice. from 100 ug to 300 tug/800 ul. Lysozyme and Ribonuclease Examples e. Positive control-hen lysozyme in Solvent (a) at 1, 2, 50 Example VIII: Hydrolysis of M. lysodeikticus by LYZ and and 3 ug/200 ul. LyPepz Applicant always dedicated a separate spectrophotometer In its action on M. lysodeikticus, LyPepz exhibited a cuvette for this positive control. To perform the assay, 200 lysozyme-like activity, causing complete hydrolysis of the All of the LyPepZ. negative control or positive control Solution cell wall of the organism (FIG. 8). The activity was specific was added to 800 ul of the substrate solution. The decrease 55 since RnPepz had no hydrolytic activity on the cell wall. The in turbidity was monitored at room temperature at 450 nm. kinetic constants were determined from experiments at The decrease obtained with the LyPepz, sample had to be different substrate concentrations (FIG. 9). Because it is not corrected for the non-Specific decrease obtained with the possible to express Substrate concentrations in molar Same concentration of the negative control peptide acting on quantities, the constants were based on mg/ml and initial the same amount of Substrate. Precautions as to assaying and 60 velocities in OD change/min. These results showed that the stability were the same as described for ChPepz and TrPepz. K values of LyPepz and LYZ were quite comparable. The Measurement of Ribonuclease Activity. The catalytic activ turn-over rate of LYZ, however, was about 7 times higher ity of RNase and RnBepz were determined by their hydro than that of LyPepz. The synthetic catalyst, like LYZ, was lytic effects on yeast RNA (Kunitz, M. (1946).J. Biol. Chem. completely inhibited by imidazole, tryptamine or histamine 164,563-569; Gutte, B. and Merrifield, R. B. (1971).J. Biol. 65 (FIG. 8). The hydrolytic activity of LyPepz was completely Chem. 246, 1922–1941). RNase (1.5ug=1.095x10' umole) lost by reduction of the disulfide bond with DTT (FIG. 8). and RnFepz (2.9 ug=1.21x10 umole) were each mixed It should be noted that Rn Pepz and other control peptides 5,861,477 31 32 had no hydrolytic activity on M. lysodeikticus. Both LyPepz RNase are given in FIG. 15. The appropriate numbers of and LYZ showed an optimum temperature for activity glycine Spacers used in the design of each Synthetic catalyst between 30 C-35 C. and there was no significant change were based on the calculation that the average C*-to-C in their activities to one another in the range 10 C-45 C. distances were: one peptide bond (C-to-C), 3.80+0.20 A; Example IX: Hydrolysis of yeast RNA by RNase and 5 two peptide bonds (distances between C", and C in C RnBepz C-C), 6.20+1.00 A; three peptide bonds (distance Like Rnase, RnBepz was able to hydrolyze yeast RNA between C, and C, in C-C-C-C), 7.942.07 A. Glycine residues were found (Atassi, M. A. (1986) in completely (FIG. 9), while LyPepz (used as a control) had no Protein Engineering Applications in Science, Medicine and effect on this substrate. Because it is difficult to express Industry, eds. Inouye, M. and Sarma, R. (Academic Press, Substrate concentrations in molar quantities, the kinetic Orlando, Fla.), pp. 125-153) to be most suitable for use as constants were calculated using Substrate concentration Val Spacers in Surface-Simulation Synthesis, probably because ues in mg/ml. Also, Velocities of hydrolysis were measured they are flexible and do not have a side chain that might in decrease in OD/min. Lineweaver-Burk plots of the hydro potentially interfere in the activity of the peptide. To prepare lytic reactions (FIG. 4) showed that the K value for RnPepz the cyclic Structure which is essential for activity, the was almost identical to that of RNase. RinPepz, however, 15 peptides were cyclized by an intramolecular disulfide bond. exhibited a k value which was considerably lower than The latter was placed in each Synthetic catalyst in a position that of the whole enzyme. The activity of Rn Pepz was that would closely maintain the appropriate Spacing and completely destroyed by reduction of the disulfide bond. The cause no interference or undue distortion in the orientation temperature optimum for the hydrolytic activity of RnPepz of the essential site residues. The C-to-C' distance in was around 25 C., while the optimum for the enzyme was Cys-SS-Cys was found (Atassi, M. Z. et al. (1993), id) to be closer to 35° C. In the range 17.5 C-45° C., RnBepz and 5.52+0.41A. The best design, which fulfilled these require RNase showed little change in their relative activities. At 10 ments was obtained by a disulfide bridge between Asn-37 C. neither RnPepz, nor RNase had any measurable hydrolytic and Thr-43 in LyPepz and between Lys-7 and Glu-11 in activity on yeast RNA. Like RNase, RnBepz was completely RnPepz (see FIGS. 12 and 15). inhibited by Ag" or denatured DNA. It should be noted that 25 The peptide LyPepZ possessed a catalytic activity which LyPepz, lysozyme and other Synthetic cyclic peptides that was similar to that of the protein itself. The affinity of are unrelated to RNase had no effect on yeast RNA. LyPepz for the substrate was quite similar to that of LYZ as Design of the Synthetic Peptide Catalysts Based on Ribo evidenced by their comparable K values. The K value nuclease and Lysozyme obtained here for LYZ (121 mg/liter) in 0.125M NaCl is Lysozyme (muramidase) catalyzes cleavage of the similar to the value of 129 mg/liter reported (Neville, W. M. N-acetylmuramic acid-N-acetylglucosamine B-1,4-linkages et al. (1972), id) in the same solvent. The k values showed that occur in the cell-wall polysaccharide of Some organisms that LyPepz hydrolyzed the cell wall of M. lysodeikticus in (e.g. M. lySOdeikticus). The LYZ amino acid sequence a time-frame that was very much enzyme-like. The finding (Canfield, R. E. (1963) J. Biol. Chem. 238,2869; Jollés, J., that LyPepz was almost completely inhibited by imidazole, Jauregui-Adell, J., Bernier, I. and Jollé, P. (1963) Biochim. 35 histamine or tryptamine, which are known to inhibit LYZ by Biophys. Acta 78,668–698) and three-dimensional structure forming charge-transfer complexes with the tryptophan resi (Blake, C. C. F., Mair, G. A., North, A. C. T., Phillips, D. C. dues in the enzyme (Shinitzky, M., Katchalski, E., Grisaro, & Sharma, V. R. (1967) Proc. Roy. Soc. B167, 365; as well V. and Sharon, N. (1966) Arch. Biochem. BiophyS. 116, as its interactions with Substrate analogs and inhibitors 332–343; Swan, I. D. A. (1972) J. Mol. Biol. 65, 59–62), (Blake, C. C. F., Mair, G. A., North, A. C. T., Phillips, D. C. 40 would indicate that LyPepz employs the same mechanism of and Sarma, V. R. (1967) Proc. Roy. Soc. B167, 365-377; catalysis as that used by LYZ. The inability of Rn Pepz and Imoto, T., Johnson, L. N., North, A. C. T., Phillips, D.C. and other control peptides to hydrolyze M. lysodeikticus cell Rupley, J. A. (1972) in The Enzymes (Boyer, P. D., ed) Vol. wall demonstrated that the action of LyPepz on the cell wall 7, pp. 665–868) have been determined. For the present work, is a true muramidase activity. the X-ray coordinates were from Diamond, R. and Phillips, 45 Example XI: Ribonuclease D. (1975) Protein Data Bank, Entry Identification Code In their hydrolysis of yeast RNA, RnBepz and RNase, 6LYZ; Kelly, J. and James, M. (1979) Protein Data Bank, exhibited near-identical K values (Table 2), indicating that Entry Identification Code 9LYZ. Ribonuclease A effects the two catalysts possessed comparable affinities for the hydrolysis of the phosphodiester bond between the 5'-ribose substrate. The K value found here for RNase (1.25 mg/ml) of a nucleotide linked to the 3'ribose of a pyrimidine 50 in 0.1M sodium acetate buffer, pH 5.0, is in agreement with nucleotide. Its amino acid sequence (Smyth, D. G., Stein, W. the reported values of 1.20 mg/ml (Gutte, B. et al. (1971), id) H. and Moore, S. (1963).J. Biol. Chem. 238, 227–234.) and and 1.25 mg/ml (Edelhock, H. and Coleman, J. (1956) J. three-dimensional structure (Kartha, G., Bello, J. and Biol. Chem. 219, 351-363). The k values of RnPepz and Harker, D. (1967) Nature (London) 213, 862) are known. RNase showed that the synthetic catalyst effected hydrolysis The RNase X-ray coordinates employed in the present 55 at a rate which was considerably lower than that of the studies were from Nachman, J. and Wlodawer, A. (1989) enzyme. Nevertheless, Rinpepz was quite efficient and was Protein Data Bank, Entry codes 8RSA and 9RSA. in fact able to effect complete hydrolysis of RNA in an Example X: Lysozyme enzyme-like time frame. The specificity of the hydrolytic The essential residues of the active site of LYZ are shown action of Rn Pepz on RNA was evident from its inability to in FIG. 12. The elements of the design of LyPepz are 60 hydrolyze the cell wall of M. lysodeikticus and conversely outlined in FIG. 13, which shows (in FIG.13a) the sequence from the lack of action of LyPepz and other control peptides positions of the LYZ active site residues together with the on RNA. The complete inhibition of Rn Pepz by heavy C-to-C" distances (in A separating them and (in FIG.11b) metals and by DNA, which are known to inhibit RNase the Structure of the peptide designed to mimic the active site. (Sekine, H., Nakano, E. and Sakaguchi, K. (1969) Biochim. FIG. 14 shows the essential residues of the active site of 65 Biophys. Acta 174, 202-210), would suggest that the syn RNase. The distances separating these residues and the thetic catalyst employs a similar catalytic mechanism to that Structure of the peptide designed to mimic the active Site of of the enzyme. 5,861,477 33 34 Example XII: Design Problem Areas Sequence of each enzyme upon which Successful Synthetic The lower rate of catalysis by the Synthetic catalysts, catalysts had been modeled. Thus, the following query relative to their respective enzymes, is most likely caused by Sequences were used: the existence of each peptide in an equilibrium of confor ChPep Z/TrPepz-Chymotrypsin (EC 3.4.21.1) B mational states influenced by substrate-induced fit. The Precursor-bovine peptide will display activity when a catalytically-productive LyPepz-Lysozyme (EC 3.2.1.17) c2-bovine conformation is achieved. The finding that integrity of the RnBepz-Ribonuclease A-bovine In another eXtension, using the urokinase active site as the cyclic structure was crucial for the activity of both LyPepz template, the Structures of designed Synthetic catalysts were and RnFepz (as evidenced from the total loss of activity on deduced for urokinase-related enzymes. reduction of the disulfide bond) is probably due to the The parameters for each Search were virtually the same inability of the acyclic form to assume Such a conformation. and were as follows: This is similar to the findings with the two proteolytic Similarity matrix PAM-250 Synthetic catalysts which mimicked the activities of trypsin Threshold level of Sim. 13% and C-chymotrypsin (Atassi, et al. (1993), id). The presence Mismatch Penalty 1 in RnPepz of two segments (residues 7-8 and 41-45), 15 Gap Penalty 4.00 possessing the retro-Sequence to their counterparts in native Gap Size Penalty 0.05 RNase may have also contributed to the decrease in the rate Cutoff score 2-4 of hydrolysis by RNPepZ. A synthetic analog in which these Randomization Group O two Segments were made by D-amino acids did not show K-tuple 1 any significant changes in the rate of hydrolysis (data not Joining Penalty 20 shown). Window size 7-10 Initial scores to save 100 TABLE 2 Optimized scores to save 100 Alignments to save 100 Kinetic constants for hydrolysis of M. lysodeikticus by 25 Display context 5 LYZ and LVPepz and of yeast RNA by RNase and RnFepz The Scores were Sorted by initial Score and homologies Kn kcat were displayed comparing linear homology of each query (mg/liter) (OD units/umole/sec) Sequence to each found Sequence. Each Such pairwise com Hydrolysis of parison was visually Scanned for the active Site residues M. lysodeikticus by: previously found to be Successfully integrated into the correlate Synthetic catalyst. LyPepz 133.4 2.6 1.38 O.O1 LYZ 120.6 - 4.O 9.26 O.12 The results of each such search are displayed in FIGS. Hydrolysis of 16-18A. Where homology was not significant in the active yeast RNA by: Site regions or in certain cases where virtually identical 35 homologies were located, the tabulated results are not nec RnFepz 1238 228 O.559 O.O1 essarily shown. All of the alignments Saved are displayed in RNase 1254 - 460 14.05 - 6.85 FIGS. 16-18B. In each instance the number of the sequence identified in FIGS. 16-18A correspond to the named and Example XIII: Homologous Peptide-Sequences to Active similarly numbered sequence in FIGS. 16-18B. Synthetic Catalysts 40 The Success of the method was marked. Very high degrees Having successfully utilized the methods of the invention of homology were found in each Such Search. In many to generate Synthetic, catalytically active peptides modeled instances, only minor modifications/Substitutions were on enzymes with known 3-dimensional Structures, Applicant uncovered. Whereas, as described herein, certain Substitu began the process of identifying other Such peptides by tions are likely to generate virtually identical activity in each homology as described in the Specification. In the first Such 45 homologous Synthetic peptide, it is also possible that Such instance, as already described, Applicant Successfully pre modifications will result in a Synthetic catalyst in which the dicted by homology that an active tryptic peptide could be activity will be modulated in certain regards (i.e., catalytic constructed where Sequence homology of chymotrypsin to rate, Substrate affinity, Solubility, temperature Sensitivity, trypsin allowed substitution of trypsin-derived residues for etc.) those in the chymotrypsin modeled Synthetic catalyst. 50 In any case, as in the case of use of ChPepZ to Successfully Employing the same rationale, Applicant has designed Syn predict Structure of the TrPepz, catalyst, amino acids Substi thetic catalysts mimicking the activity of urokinase and tutions can be readily introduced and the resulting catalysts tissue-plasminogen activator (and whose structures were monitored for activity. For example, using the chymotrypsin constructed on the basic construct of the chymotryptic search results and referring now to FIG.16A and B, a trypsin Synthetic catalyst) based on the homology of the active site 55 homolog was predicted. Referring first to the active site residues in the respective enzyme with chymotrypsin. residues listed in the heading of FIG. 16, it can be seen that Expanding this Successful approach further, Applicant the ChPepz, contained residues Fo, Iloo, Sso and Mo. initiated a computer Search for Sequences homologous to the Referring now to Sequence No. 31 in FIG. 16A, it can be active Synthetic catalysts disclosed herein. The Searches Seen that the method predicts that an identical Sequence, were accomplished using the FAST-DB (Fast Pairwise Com 60 wherein Fo is Substituted by a tyrosine residue, Ioo is parison of Sequences-Release 5.4) sequence homology Substituted by a leucine residue, Sso, is Substituted by an Searching program as provided by Intelligenetics, Inc., 700 aspartic acid residue and Mo is Substituted by a glutamine East El Camino Real, Mountain View, Calif. 94040. The residue, will be active. Referring now to FIG.16B, it can be Searches were conducted in the publicly-available Sequence Seen that Sequence 31 is, in fact, a trypsin (EC 3.4.21.4) databases-PIR39 and Swiss-Prot 28. Each search was 65 molecule. This approach, as noted above, resulted in the accomplished by choosing a query Sequence. The query Successful prediction of an entirely tryptic activity Synthetic Sequence was chosen to represent the most complete catalyst. 5,861,477 35 36 Extending this Successful analogy only Slightly, it can be The present invention has been described in terms of seen in FIG.16A and B that Sequence No. 27 varies from the particular embodiments found or proposed to comprise trypsin homolog by only the Substitution of V (in preferred modes for the practice of the invention. It will be chymotrypsin) with a threonine. By analogy, it is very likely appreciated by those of Skill in the art that, in light of the that the trypsinogen II (EC 3.4.21.7) synthetic catalyst will present disclosure, numerous modifications and changes can be similarly active. Most, if not all, of the other homologs be made in the particular embodiments exemplified without uncovered by the Searches are predicted to result in Similarly departing from the intended Scope of the invention. Thus, it active catalysts. Similarly, using active Site homology of the Serine pro will be understood by those of skill in the art that any teases chymotrypsin and trypsin with urokinase and tissue enzyme Sequence and 3-dimensional information which is plasminogen activator. Referring now to FIG. 19A and B, it available will serve admirably well as a model. In certain can be seen that the active Site residues for urokinase listed instances, where the Sequence is known but the in the heading of the figure follow a highly similar pattern 3-dimensional Structure of the chosen enzyme is not known, to those for chymotrypsin as follows: a homologous protein whose 3-dimensional Structure and Chymotrypsin Active Site Residues: Phe-His-Phe Ala 15 Sequence are known can be used as a model. For example, Ala-His-Cys Ile Asp Ser-Ser-Gly-Met-Gly-Asp-Ser Val-Ser the Structure of trypsin can be used to model peptide Trp catalysts designed to possess the activity of urokinase and Urokinase Active Site Residues: Thr-Tyr-Val Ala-Thr tissue plasminogen activator. In addition, as demonstrated His-Cys Leu Asp Asp-Ser-Gly-Gln-Gly-Asp-Ser Val-Ser by the substitution of the trypsin residues for those of Trp chymotrypsinherein, Substitutions of active site residues can Extending the analogy to predict an equivalent Structure for be used to modulate, alter or to improve catalytic function. a tissue plasminogen activator (TPA) (last listed Sequence), Additionally, it is anticipated that certain designs will be the pattern Similarity can be seen to be as follows: useful even where there is not a particularly impressive Urokinase Active Site Residues: Thr-Tyr-Val Ala-Thr correlation with the native at maximal His-Cys Leu Asp Asp-Ser-Gly-Gln-Gly-Asp-Ser Val-Ser 25 conditions since the Synthetic catalysts may exhibit desirable Trp characteristics under conditions where the native enzyme TPA Active Site Residues: Arg-Phe-Leu- Ala-Ala-His fails to function. All Such modifications are intended to be Cys Tyr Asp Asp-Ala-Gly-Gln-Gly-Asp-Ser Ile-Ser-Trp included within the Scope of the appended claims.

SEQUENCE LISTING

( 1) GENERAL INFORMATION: ( i i i ) NUMBER OF SEQUENCES: 13

( 2) INFORMATION FOR SEQ ID NO: 1: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 29 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( i i i ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 1: Cy s G 1 y P h e H is P he G 1 y G 1 y Ser A s p G 1 y Me t G 1 y Ser Ser G 1 y G 1 y 1. 5 1 O 1 5

V a 1 Ser T r p G 1 y I le G 1 y G 1 y As p G 1 y A 1 a. A 1 a His Cy s 2 O 25

( 2) INFORMATION FOR SEQ ID NO: 2: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 29 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( v ) FRAGMENT TYPE: not applicable 5,861,477 37 38 -continued

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 2: Cy s G 1 y Ty r H is Phe G 1 y G 1 y Ser A s p G 1 y G 1 u G 1 y Ser A s p G 1 y G 1 y 1. 5 1 O 1 5

V a 1 Se r T r p G 1 y Le u G 1 y G 1 y A s p G 1 y A 1 a. A 1 a His Cy s 2 O 25

( 2) INFORMATION FOR SEQ ID NO:3: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH:37 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:3: Cys Thr As n A rig As n G 1 y G 1 y A s p G 1 y G 1 y Le u G 1 u I le. As in G 1 y Tr p 1. 5 1 O 1 5

T r p G 1 y G 1 y I le G 1 y A s p G 1 y A s p G 1 y A la T r p V a 1 A a G 1 y Arg G 1 y 2 O 25 3 O

P he G 1 u Ser As n Cys 3 5

( 2) INFORMATION FOR SEQ ID NO: 4: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 24 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 4: Cy s G 1 u G 1 y V a H is Ph e A s p A a Ser G 1 y G 1 y Thr As n V a 1 Pro Lys 1. 5 1 O 1 5

G 1 y G 1 y G 1 u H is G 1 y P he L y s Cy s 2 O

( 2) INFORMATION FOR SEQ ID NO: 5: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 29 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 5: Cy s G 1 y Thr Ty r V a 1 G 1 y G 1 y Ser A s p G 1 y G. l n G 1 y Ser A s p G 1 y G 1 y 1. 5 1 O 1 5

V a 1 Se r T r p G 1 y Le u G 1 y G 1 y A s p G 1 y Al a T her H is Cy s 2 O 25

( 2) INFORMATION FOR SEQ ID NO: 6: 5,861,477 39 40 -continued

( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 29 amino acids (B) TYPE: amino acid (D) TOPOLOGY: circular ( i i ) MOLECULETYPE: protein ( i i i ) HYPOTHETICAL: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 6: Cy s G 1 u A rig Ph e Le u G 1 y G 1 y Ser A s p G 1 y G. l n G 1 y A la A s p G 1 y G 1 y 1. 5 1 O 1 5

I le S e r T r p G 1 y Ty r G 1 y G 1 y A s p G 1 y A 1 a. A 1 a His Cy s 2 O 25

( 2) INFORMATION FOR SEQ ID NO:7: ( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 7: G l n Le u G 1 u Pro Ser Th r S e r S e r A 1 a V a l Pro Le u I le G 1 y Lly s G 1 y 1. 5 1 O 1 5

( 2) INFORMATION FOR SEQ ID NO:8: ( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:8: A 1 a. A 1 a Me t Lys Arg His G 1 y Le u A s p As n 1. 5 1 O

( 2) INFORMATION FOR SEQ ID NO:9: ( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 9 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable 5,861,477 41 42 -continued

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:9: A s p As in Ty r A rig G 1 y Ty r S e r Le u G 1 y 1. 5

( 2) INFORMATION FOR SEQ ID NO:10: ( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 8 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:10: A 1 a Lys Lys I le V a 1 Se r A s p G 1 y 1. 5

( 2) INFORMATION FOR SEQ ID NO:11: ( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:11: Cy s A s p Ser G 1 y G 1 y V a 1 Ser T r p G 1 y G 1 y Le u G 1 y A s p G 1 y A la A 1 a 1. 5 1 O 1 5

H is G 1 y G 1 y P h e H is Ty r Cy s 2 O

( 2) INFORMATION FOR SEQ ID NO:12: ( i) SEQUENCE CHARACTERISTICS: ( A ) LENGTH: 27 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:12: A s p Ser G 1 y G 1 u Cy s A s p Ser G 1 y Gil y V a 1 Ser T r p G 1 y G 1 y Le u G 1 y 1. 5 1 O 1 5

A s p G 1 y A 1 a. A 1 a His G 1 y G 1 y Ph e H is Ty r Cy s 2 O 25

( 2) INFORMATION FOR SEQ ID NO:13: 5,861,477

-continued

( i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear ( i i ) MOLECULETYPE: peptide ( i i i ) HYPOTHETICAL: NO ( i v ). ANTI-SENSE: NO ( v ) FRAGMENT TYPE: not applicable ( xi ) SEQUENCE DESCRIPTION: SEQ ID NO:13: Cy s P he G 1 y G 1 y S e r A s p G 1 y G 1 u G 1 y Se r A s p G 1 y G 1 y V a 1 Ser T r p 1. 5 1 O 1 5

G 1 y Le u G 1 y G 1 y A s p G 1 y A la A 1 a His Cy s 2 O 25

What is claimed is: 7. A peptide modeled on an active Site of a naturally 1. A peptide modeled on an active Site of a naturally occurring enzyme and having the Structure: occurring enzyme and having the Structure: 25 cyclo(-S-cystinyl-glutamyl-glycyl-Valyl-histidyl cyclo(-S-cystinyl-glycyl-phenylalanyl-histidyl phenylalanyl-aspartyl-alanyl-Serylglycyl-glycyl phenylalanyl-glycyl-glycyl-Seryl-aspartyl-glycyl threonyl-asparagyl-Valyl-prolyl-lysyl-glycyl-glycyl methionyl-glycyl-seryl-seryl-glycyl-glycyl-Valyl glutamyl-histidylglycyl-phenylalanyl-lysyl-cystinyl-S) Seryl-tryptophanylglycyl-isoleucyl-glycyl-glycyl (SEQ ID NO:4). aspartyl-glycyl-alanyl-alanyl-histidyl-cystinyl-S) 8. The peptide of claim 7, wherein said enzyme is (SEQ ID NO:1). ribonuclease. 2. The peptide of claim 1, wherein Said enzyme is 9. A peptide modeled on an active Site of a naturally chymotrypsin. occurring enzyme and having the Structure: 3. A peptide modeled on an active Site of a naturally cyclo(-S-cystinyl-glycyl-threonyl-tyrosyl-Valyl-glycyl occurring enzyme and having the Structure: 35 glycyl-seryl-aspartyl-glycyl-glutaminyl-glycyl-Seryl cyclo(-S-cystinyl-glycyl-tyrosyl-histidyl-phenylalanyl aspartyl-glycyl-glycyl-Valyl-Seryl-tryptophanyl glycyl-glycyl-Seryl-aspartyl-glycyl-glutamyl-glycyl glycyl-leucyl-glycyl-glycyl-aspartyl-glycyl-alanyl Seryl-aspartyl-glycyl-glycyl-Valyl-seryl-tryptophanyl threonyl-histidyl-cystinyl-S) (SEQ ID NO:5). glycyl-leucyl-glycyl-glycyl-aspartyl-glycyl-alanyl 10. The peptide of claim 9, wherein said enzyme is alanyl-histidyl-cystinyl-S) (SEQ ID NO:2). 40 urokinase. 4. The peptide of claim 3, wherein Said enzyme is trypsin. 11. A peptide modeled on an active site of a naturally 5. A peptide modeled on an active Site of a naturally occurring enzyme and having the Structure: occurring enzyme and having the Structure: cyclo(-S-cystinyl-glutamyl-arginyl-phenylalanyl-leucyl cyclo(-S-cystinyl-threonyl-asparagyl-arginyl-asparagyl glycyl-glycyl-seryl-aspartyl-glycyl-glutaminyl-glycyl glycyl-glycyl-aspartyl-glycyl-glycyl-leucyl-glutamyl 45 alanyl-aspartyl-glycyl-glycyl-isoleucyl-Seryl isoleucyl-asparagyl-glycyl-tryptophanyl-tryptophanyl tryptophanyl-glycyl-tyrosyl-glycyl-glycyl-aspartyl glycylglycyl-isoleucyl-glycyl-aspartyl-glycyl-aspartyl glycyl-alanyl-alanyl-histidyl-cystinyl-S) (SEQ ID glycyl-alanyl-tryptophanyl-Valyl-alanylglycyl-arginyl NO:6). glycyl-phenylalanyl-glutamyl-Seryl-asparagyl 12. The peptide of claim 11, wherein Said enzyme is tissue cystinyl-S) (SEQ ID NO:3). 50 plasminogen activator. 6. The peptide of claim 5, wherein said enzyme is lysozyme. k k k k k