Structure and Expression of the Bovine Coronavirus Hemagglutinin Protein
Total Page:16
File Type:pdf, Size:1020Kb
STRUCTURE AND EXPRESSION OF THE BOVINE CORONAVIRUS HEMAGGLUTININ PROTEIN Thomas E. Kienzle, Sushma Abraham, Brenda G. Hogue 1 and David A. Brian Department of Microbiology The University of Tennessee Knoxville, Tennessee 37996-0845 INTRODUCTION cDNA clones prepared from genomic RNA of the Mebus strain of bovine coronavirus (BCV) were sequenced to reveal the hemagglutinin (H) gene of 1,272 bases that predicts a 47,700 mol. wt. apoprotein of 424 amino acids. The H gene mapped on the immediate 5' side of the peplomer gene. The H protein sequence revealed a putative N-terminal signal peptide of 18 amino acids, 9 potential glycosylation sites, 14 cysteine residues, and a potential C-terminal anchor region of 26 amino acids. When transcripts of the gene were translated in vitro in the presence of microsomes, signal cleavage, glycosylation, and membrane anchorage were observed, but not disulfide-linked dimerization. Translation of a truncated mRNA having no sequence for the C-terminal anchor resulted in a nonanchored, intraluminal (intramicrosomal) protein. When the H protein was expressed in cells in the absence of other coronaviral proteins, it became glycosylated, dimerized, and transported to the cell surface. The BCV hemagglutinin protein, therefore, is a type 1 glycoprotein that contains all the information it needs for signal cleavage, glycosylation, disulfide-linked dimerization, and transport to the cell surface. MATERIALS AND METHODS cDNA Cloning of BCV Genomic RNA The Mebus strain of BCV was grown on human rectal tumor (HRT) cells and purified as previously described (Lapps et. al., 1987). cDNA cloning was done essentially as described (Lapps et. al., 1987), except that random 5-mer oligodeoxynucleotides (Pharmacia) were used as primers for first strand synthesis to generate several clones, one of which was II (Fig. lA). A synthetically made primer (5' ATTATGACCGCACACC 3') was used to extend genomic sequence 5'-ward from clone II, and this was used to generate several clones one of which was LA6 (Fig. lA). Clones were selected by colony hybridization to randomly primed cDNA prepared from lCurrent address: Department of Microbiology and Immunology, UCLA School of Medicine, Center for Health Sciences, Los Angeles, CA 90024 COTOnoviruse3 and Their Diseases Edited by D. Cavanagh and T.D.K. Brown 95 Plenum Press, New York, 1990 genomic RNA, and clones were mapped relative to one another and to the 3' end of the genome using a matrix spot hybridization technique. DNA Sequencing and Sequence Analysis Dideoxy sequencing was used throughout. The ends of clones II and LA6 were sequenced using universal primers for pUC vectors, and the clones were ligated at the Hae 3 site (Fig. IA) to form a continuous sequence mapping in the region of the H gene. The reconstructed sequence was subcloned into pGEM3Z (Promega) and a 3' and 5' nested set of subclones was generated using exonuclease III and SI nuclease (Henikoff, 1984). Subclones were sequenced using universal primers for the pGEM vector, and by synthetic primers that were made complementary to regions within the H gene. Sequences were analyzed with the aid of the Microgenie program (Beckman). Expression Analyses A construct of the H gene beginning 15 bases upstream (i.e., beginning with the CTAAAC intergenic sequence) from the putative ATG start codon (an exonuclease III/SI-generated subclone of the LA6/I1 construct in pGEM4Z) and under the control of the Sp6 polymerase promoter, was made and named pHSp6 (Fig. IB). The same insert was also subcloned into pGEM3Z (using the Eco RI and Hind III sites within the multiple cloning region) such that the H gene was under the control of the T7 polymerase promoter. This was named pHT7 (Fig. IC). For in vitro expression analyses, pHSp6 was linearized with Hind III to yield full-length transcripts, or with Eco R5 to yield transcripts lacking the coding region for the C-terminal anchor, and transcribed with A ki1clb8ses 20 9 8 7 6 5 4 J 2 4' , , , , , , , r::::K::::::::::::+!::::::::::::::::::::::::::::::::::::2p~::::::::::::::::===4"~"8 ~-H::::t:=?=~=Fil=::::::::H """S"""l,i- LH~6 ' }JI-J{In-L_ -----.!.! " ~ 1 11 ~5 LP6L9___ ~HP~A2~~= ______G~6_MA_7 ______ __ Eco R5 H•• 3 '\ ( B c Fig 1. A. Gene map of the BCV genome, cDNA clone positions, and strategy for sequencing the H gene. B. Plasmid construct for generating H transcripts with Sp6 polymerase, C. Plasmid construct for generating H transcripts with T7 polymerase. 96 SP6 polymerase (Promega protocol for yielding capped transcripts). Translation was done in wheat germ extract (Promega) in the presence or absence of canine pancreatic microsomes (Promega). Protein was labeled with 35S-methionine (lCN). lmmunoprecipitates were made using polyclonal rabbit anti gp65 (H subunit) (Hogue et. a7., 1984) by the method of Anderson and Blobel (1983). Carbonate extractions at pH 11 were done by the method of Fujiki et. a7., (1982). Competitive inhibition of N-Linked glycosylation was accomplished with 30~M octanoyl-asparagine-leucine threonine (a gift from Dr. F. Naider, City University of New York), in the microsomal-containing translation mixture (Lau et. a7., 1983). Polyacrylamide gel electrophoresis analyses were done as previously described (Hogue et. a7., 1984). For in vivo expression and analysis by immunofluorescence, HRT cells were infected with a vaccinia virus recombinant (vTF7-3, moi of 30) that expresses T7 polymerase (Fuerst et. a7., 1987), transfected with pHT7 plasmid (1 ~g per 1 cm2 ) using Lipofectin Reagent (BRL) to mediate transfection, and prepared at 32h for internal or surface immunofluorescence using the method of Kaariainen et. a7., (1983). For • . 30 . • 60 • • 90 • • 120 CTAAACTCAGTGAAAATG'!TI'T'I'GCTTCTTAGATT:'G'ITCTAGTTAGCTGCATAATTG:3T>"GCC'!'AGGTTl"l'GATAACCCTCC'!'ACCAA'!'G'I'TG'I"!'TCGCATTTAAATGGAGATTGG'l":::'T = M F L L L R F v L V SCI I G S L G F 0 N P P T N V V S H L NM GJ:: Dr WG FF H- .0 IORF1- • . 150 . • 180 • • 210 • . 240 'l"l'A'I'TrGGTGACAG':'CG'I"!'CAGAT'I'GTAATCATGTTG'rl'AATACCAACCCCCGTAATTATl'CTTATATGGACCTTAATCCTGCCCTGTGTGAT!'CTGGTAAAATATCATCTAAAGCTGGC L F G 0 S R soc N H V V N T N P R ~ Y S Y MOL N PAL cOS G K ISS K A G Y L V T V v Q I V I M L LIP T P V I I L I W T L I L P C V I L V K Y H L K L A • • 270 • • 300 • • 330 • • 360 AACTCCATTT'I'TAGGAGTrTTCACTl'TACCGA'l"l"'ITrATAATTACACAGGCGAAGGTCAACAAATTATTI"I"I"l'ATGAGGGTGTTAA'l'T'I"l'ACGCCTTA.TCATGCCTTTAAATGCACCACT N S I FRS F H F T 0 F Y ~ Y T G E G Q Q I I F Y E G V@F T P Y H A F K C T T T P F L G V F T L P I F I I T Q A K V N K L F F M R V L I L R LIM P L NAP L • • 390 • • 4020 • • 450 • • 480 TCTGGTAGTAATGATA'I"rl'GGATGCAGAATAAAGGC'l"l'G"rlTl'ACACTCAGGT'l'TATAAGMTATGGCTGTGTATCGCAGCCTTAcrrrrG'I'TAATGTACCATATGttrATAATGGCTCT S G S N D I W M Q N K G L F Y T Q v Y K N M A V Y R S L T F V N V P Y v Y ~ G S L V V M I $ G C R I K A C F T L R FIR I W L C I A ALL L L M Y H M F I MAL • • 510 • • 540 • • 570 • • 600 GCACAATCTACAGCTCTrI'GTAAATCTGGTAGTTTAG'l"l'CTTAATAACCCTGCATATATAGCTCGTGAAGCTAA"rr1"l'GGGGATTATTATTATAAGG'l'TGAAGCTGACT"rrI'A'lTl'GTCA A Q S TAL C K S G S L V L K N PAY I ARE A N F G D Y Y Y K V E A D F Y L S H N L 0 L F V N L V V • • 630 • . 660 • • 690 • • no GG'rI'GTGACGAGTATATCGTACCACTrI'GTA'l"I"I'Tl'AACGGCAAG'l"ITl'TGTCGAATACAAAGTATTATGATGATAGTCAATA'l'TATrTTAATAAAGACAC'rGGTG'rl'A'l'TTATGGTC',!'C G C DEY I v P L C I F N G K F L S N T K Y Y DDS Q Y Y F N K D T G V I Y G L • • 750 , • 780 • • 810 • • 840 AA'l"l'CTAC'l'GAAACCA'I"l'ACCACTGG'lTl'TGA'rl"rl'AAttGTCA'rl'A'lTl'AGT"rrI'ACCCTCTGGTAATTA'l'TTAGCCA'l"rl'CAAATGAGCTA'I"l'G'I"l'AACTG'l'TCCTACGAAAGCAATC ~ S T BTl T T G F D F N C 8 Y L V L PSG N Y L A I S N ! L L LTV P T K A r • • 870 • • 900 • • 930 • • 960 TGTCTTAACAAGCGTAAGGA'l"I'Tl'ACGCCTGTACAGG"rl'G'l"l'GA'rl'CACGGTGGAACAATGCCAGGCAGTCTGATAACATGACGGCGG'rI'GCTTGTCAACCCCCGTACTGTTATrrI'CGT C L N K R K D r T P v Q v V D S R W N N A R Q S D ~ M T A V A C Q P P Y C Y F R • • 990 • • 1020 • • 1050 • • 1080 MTTCTACTACCAACTATGT'l'GGTGTTTATGATATCAATCATGGQGATGCTGGTrrl'ACThGCATACTCAGTGGTT'l'G"rl'ATATGATTCACCTTG'lTl"'l"'l'CGCAGCAAGGTG'lTl"'l"'l'AGG 00 S T T K Y V G V Y DIN H G D A G F T S I L S G L L Y D S P C F S Q Q G V F R M L 11 V MIS 1M G M L v L LAY S v v C Y M 1.8 L V FRS K V F L Q IORF2- • . 1110 • • 1140 • • 1170 • • 1200 TATGATAATG'l'TAGCAGTGTCTGGCCTCTCTATTCCTATGGCAGATGCCCTACTGCTGCTGATATTAATACCCCTGATGTACCTA'l'T'l'GTGTGTATGATCCGCTACCAC'l'TArrrrGC'l'T Y D 00 v s S v W PLY S Y G R CPT A A DIN T P D V PIC V Y D P L P L ILL M I M L A V S G LSI P MAD ALL L L I LIP L M Y L F V C M I R Y H L t c t • • 1230 • • 1260 • • GGCATCCTl"l"l'GGGTG'l"'l'GCGGTCATAATTAT'I'GTAG'l"ITl'G'l"l'G'rl'ATA'l'T'l"TATGGTGGATAATGGTACTAGGCTGCATGATGCTTAG GA IS LV LW Gv VL AR Vs I I r v v L L L Y F M V D [HI G T R L H 0 A Fig.