Cloning and Sequencing of the Pertussis Toxin Genes
Total Page:16
File Type:pdf, Size:1020Kb
Proc. Nati. Acad. Sci. USA Vol. 83, pp. 4631-4635, July 1986 Biochemistry Cloning and sequencing of the pertussis toxin genes: Operon structure and gene duplication (bacterial toxins/Bordetella peflussis/ADP-ribosylation/trnshlaonal signals/secreted proteins) ALFREDO NICOSIA*, MARIA PERUGINI*, CARLO FRANZINI*, MARIA CRISTINA CASAGLI*, MARIA GIUSEPPINA BORRI*, GUIDO ANTONI*, MARCO ALMONIt, PAOLO NERIt, GIULIO RATTI*, AND RINO RAPPUOLI*t *Sclavo Research Center, Via Fiorentina, 1, 53100 Siena, and tCRISMA, University of Siena "Le Scotte," 53100 Siena, Italy Communicated by A. M. Pappenheimer, Jr., February 26, 1986 ABSTRACT Pertussis toxin, a protein composed of five A B different subunits (S1, S2, S3, S4, and S5), is the major virulence factor of Bordetela pertussis. We have cloned and * sequenced a DNA fragment of 4.7 kilobases that contains the SiP.~ genes coding for the 'five subunits. The genes are clustered S3C. , within 3.2 kilobases in the following order: S1, S2, SA, S5, and S3. A sequence closely resembling Escherichia coli promoters is found only before the S1 gene, and a possible termination S5S 0 S1~~~~~~~~~~~~ S4 signal is present at the end of the S3 gene, which suggests that S4 -- - S5 the pertussis toxin genes are organized in a single operon. A possible Shine-Dalgarno sequence is present before the S1 gene but not before the other four genes that 8-12 nucleotides FIG. 1. NaDodSO4/polyacrylamide gel electrophoresis of affin- upstream from the ATG codon show a new consensus sequence, ity-purified PT. The toxin in lane A has been treated with a reducing 5'TCC(T)GG3', possibly involved in the regulation'of trans- agent before loading the gel. The toxin in lane B has not been reduced. Note that although S2 and S3 have the same deduced lation. We have also found sequence homology between the S2 molecular weight (Fig. 2), they have a different mobility in and S3 genes and their protein products indicating that gene NaDodSO4/PAGE. Note also that S5 is stained rather poorly and duplication played a major role in the evolution of pertussis although its deduced molecular weight is smaller than that of S4 toxin. (Fig. 2) under reducing conditions, it migrates more slowly than S4. Pertussis toxin (PT), a protein of 105 kDa that is released into the extracellular medium by virulent (phase 1) Bordetella products of phospholipids. The variety of names given to PT pertussis (1-3), plays a major role in whooping cough (4, 5). reflects this multiplicity of effects: islet-activating protein, Purified PT is composed offive different subunits (S1, S2, S3, histamine-sensitizing factor, and lymphocytosis-promoting S4, and S5) (Fig. 1). Like other bacterial protein toxins, such factor. In addition, PT is also a mitogen and an adjuvant; it as cholera toxin, Escherichia coli heat-labile enterotoxin, releases fatty acids from fat cells, interferes with chemotactic diphtheria toxin, and pseudomonas exotoxin A (6, 7), PT can migration, changes the peripheral vascular permeability, and be divided into the following two functionally different modifies the growth pattern of CHO cells in vitro (18-20). moieties: A, which is the toxic subunit S1, and B (comprising PT is the major virulence factor of B. pertussis (4, 5), and S2, S3, S4, and S5), which is involved in binding the receptors it has been shown to induce protective levels of antibodies in on the surface of eukaryotic cells and in the translocation of animal models (20). A monoclonal antibody against PT was the toxic subunit S1 across the cell membrane. S2, S3, and S4 shown to protect mice from the challenge with virulent B. are arranged in the following two dimers: D1 (S2 and S4) and pertussis (21). Detoxified PT is a good candidate for a new D2 (S3 and S4), which are held together by S5 (3). Thus, PT acellular vaccine against pertussis (22). Manipulation of the contains a single copy of subunits S1, S2, S3, and S5 and two toxin gene by genetic engineering could be a way to produce copies of subunit S4; it is in fact the most complex bacterial large amounts of detoxified protein. As nothing was known toxin described so far. about the nature, structure, and expression ofthe PT gene(s), As described for diphtheria, cholera, and pseudomonas we have cloned and characterized it. toxins, the toxic subunit A of PT is an NAD-dependent ADP-ribosyltransferase (6, 8). The enzyme substrates are the GTP-binding protein Gi, which is involved in the receptor- MATERIALS AND METHODS mediated inhibition of adenylate cyclase (9, 10), and Strains. B. pertussis strain BP165 was obtained from the transducin, which is involved in the translocation of signals Office of Biologics, Bethesda, MD (2). The TnS mutant from the light receptor rhodopsin to a cyclic GMP phospho- BP356 (23) was obtained from Stanley Falkow. E. coli strains diesterase (11, 12). There is evidence that other GTP-binding JM101, Q359, and LE392 have been described (24, 25). proteins, also involved in the translocation of signals from Amino-Terminal Sequencing of PT Subunits. PT from 50 membrane receptors to phospholipase C, are affected by PT liters of BP165 culture supernatant was affinity purified by (13-17). PT action causes irreversible uncoupling of the Affi-Gel Blue and fetuin-Sepharose columns, as described by regulatory GTP-binding proteins from their membrane re- Sekura et al. (2). The five subunits were separated by ceptors and interferes with a variety of metabolic pathways preparative NaDodSO4/polyacrylamide gel electrophoresis having as secondary messengers cAMP and the breakdown (Fig. 1) and electroeluted (26). The amino-terminal sequences The publication costs of this article were defrayed in part by page charge Abbreviations: PT, pertussis toxin; bp, base pair(s); ORF, open payment. This article must therefore be hereby marked "advertisement" reading frame. in accordance with 18 U.S.C. §1734 solely to indicate this fact. tTo whom reprint requests should be addressed. 4631 Downloaded by guest on October 2, 2021 432 Biochemistry: Nicosia et al. Proc. Nati. Acad. Sci. USA 83 (1986) 10 30 50 70 90 110 ::9a130 150 170 190 210 230 250 270 290 310 330 350 370 390 410 430 ____450 470 490 _____ 510 530 550 -IS 570 -10 MetArgCysThrArgAlal leArgGlnThrAlaArgThr~lyTrpLeuThrTrpLeuAlaIleLeuAlaValThrAlaProValThrSerProA 610 630 650 670 690 710 8-SPS 50 8 70 8 90 9 10 93390 95 70 9 90 10 10 10 30 100507 1970 1110 1130 1150 1170 1107 129 110 1230 1250 1270 1201310 hr~h~luyr~rA~~la~g~y~aler~nMlThrorglaAsn~rqoysThroeuyshrS~erurgeSerVal~euaroer~lealeuGly~hrieuValArg~etAlajSrT~l 1450 1470 1490 1510 1530 1550 lely~la~ys~euhr~lalhArgqnlGluerpSer~luleaeAla~leTyrsp~GlyuhrgyrleGlyGly~lautyralyLyVal Ieyr~yrGlu~er~le~layrlyerPhe~spL 1690 1710 13730 1750 1770 1790 eu~y~hrhr~h~ys le~t~h~hr~gMethrolyln~p~rogLyThr~spuiSyrsyrsOFAuleuVal~eu~ro~huAgLeuLeuS lySer~HiAsfalS&ArgLUCYalaVr 1810 1830 1850 1870 1890 1910 -i hrPro.Valrgeraly~lenPro~alnI~GluAln~yThr~l~slyerProT~rsCyrSly~rgrpCys~aet~yTyrArg~Leu~hr9LYSMatlCuTLeuI1TrVclyer1AspG1YI1 2050 20752902601130 21350 21570 qphlu~r~ouhArg~HiraThrAlarogllGly~r~rleyAla~rgurgSrAspgVy1ArgAaLUyarPLeulOl~uA1rSeyGlyA~al~et~hs~isplerProlyyAla~heAspL~V 2290 2310 2330 2350 2370 2390 2410 2430 2450 2470 2490 2510 al~e~a~rgarly~n~r~allely~a~y~hrerro~r~s~lyysyr~p~eMeTyrn~gGeAArgyLeuPFOLLysAetAeulyr~eule~~TyralCAl&SlyIleS 2530 2550 2570 2590 2610 2630 CysaleurA~alHis~etSer~yGlurgCrlCula1ArgyAspayCYlCuSAspAlaG1hr iGlu~hriyAlspThrG~rplhe~s~r~et~eu~lyPro~ly erSer~Leuys~lane 275 2702790 2810 2830 2850280 LysherohArgI~AaThrVCl~rGlySPClPnOTyrlylrOg1Yrg~rOrg1Y~g~aeASrULG1laUGrfLeueula~SeroleuAln~yTyr~is~Lu~ernd l~'~lis~~ 287 290292102930 2950 2970 299 aLeuo~yr~ly~eutarqThrAlasn~etalVAlahro~yerVal laet~ysPro~yr~lualeurPeroThrlnglntLyuyala~ysGly~lAla~ysarosneGly~hAlAla~laeu 313 210 23170 23190 23210 32390 rCAGCCCGlueACGCGCysACGGCGluLCTCuGCCAGCTCChGTCCLGUCAGClfTCCCAhGGFOGlrPeIeTyCAyeTyr=CGGCCTCThrTGCAAAACGCyCGClTyGlyGCATGTTT rer~roAspAla~risralyrolalysPhel ly~ysArpLeu~h~eyslrgroly~er~h~eyr~rysthGly~aln~tLeArAla~asPhe etyrnyrnergysraLeAlarg~ergLe 3370 3390 3410 3430 3450 3470 u~eula~rmrsn~e~rgeu~s~l~al~e~a~rgsp~l~lner~llMetClyAra~lnAla~errlyeuroGeuG~ysAlg~srgoMet~iThr~leAla~eurgle uClroySslneuhPheou~yiyarAla~e~ue~e~rglt~leCys~ernd'tslnbpWsro 3730 3750 3770 3790 381230 3850 3870 3897310 3930 3950 3970 3990 28__410 4030 4050 4070 4090 4110 413250 4170 4190 4210 4230 4250 4270 4290 43110 4330 4350 43170 43190 4410 4430 I3250 32~~470 4490 4510 4530 4550 4570 4590 34610 34630 34650 34670 4690ACCACGACTCCGTTCTAGAGGATGTACGACTCCACCTTAGCGGAAAAAGAGCCCGGC uATeuGACGerhAsneAgeTsl~lh~l~gs~yl~ra~el~a~sl~rr~rl~yr~rCgs~ty~pl~ur~ FIG. 2. Nucleotide sequence ofthe EcoRI fragment containing the five genes coding for PT. The deduced amino acid sequence ofthe five subunits of PT is reported. The arrows below the amino acid sequences indicate the beginning of the mature subunits, as identified by comparison with the amino-terminal sequences. In the case ofS5, the arrow indicates the putative beginning ofthe mature subunit. Before the sequences ofeach subunit, we report the amino acid sequence ofthe putative signal peptides. Upstream from the gene coding for S1, the proposed promoter sequence and the putative Shine-Dalgarno sequences are boxed. The sequence TCC(T)GG before the S2, S3, S4, and S5 genes is also boxed. At the end of the gene codingfor subunit S3, the arrows above the nucleotide sequence indicate the inverted repeat followed by the stretch ofthymidines (underlined), which is a possible transcription termination site. The beginning of four ORFs having the same codon usage of the PT genes are indicated. The molecular weights deduced for the five subunits are the following: S1, 26,220; S2, 21,920; S3, 21,860; S4, 12,060; S5, 10,940. Downloaded by guest on October 2, 2021 Biochemistry: Nicosia et al. Proc. Natl. Acad. Sci. USA 83 (1986) 4633 were determined using the 470-A gas-phase microsequencer amino acid composition of the deduced polypeptides are (Applied Biosystems, Foster City, CA).