Protein Expression and PuriWcation 36 (2004) 207–216 www.elsevier.com/locate/yprep Protein aggregation during overexpression limited by peptide extensions with large net negative charge Yian-Biao Zhang, Jason Howitt,1 Sean McCorkle, Paul Lawrence, Karen Springer, and Paul Freimuth¤ Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA Received 8 January 2004, and in revised form 20 April 2004 Available online 10 June 2004 Abstract Folding of the human coxsackie and adenovirus receptor immunoglobulin (Ig) variable-type domain (CAR D1) during overex- pression in the Escherichia coli cytoplasm was shown previously to be partially rescued by fusion to a 22-residue C-terminal peptide. Here, peptide sequence features required for solubilization and folding of CAR D1 and similar Ig variable-type domains from two other human membrane proteins were investigated. Peptide extensions with net negative charge 1¡6 fully solubilized CAR D1, and approximately half of the peptide-solubilized protein was correctly folded. The Ig variable-type domains from human A33 antigen and myelin P-zero proteins were only partially solubilized by peptide extensions with net charge of ¡12, however, and only the solu- bilized P-zero domain appeared to fold correctly whereas the A33 domain formed soluble microaggregates of misfolded protein. Our results suggest a model where the large net charge of peptide extensions increases electrostatic repulsion between nascent polypep- tides. The resulting decrease in aggregation rate can enable some polypeptides to fold spontaneously into their native protein confor- mations. Analysis of the solubility and folding status of sets of structurally homologous proteins, such as the Ig variable-type domains described here, during overexpression could provide insights into how amino acid and gene sequences inXuence the eYciency of spontaneous protein folding. © 2004 Elsevier Inc. All rights reserved. Keywords: Fusion protein; Aggregation; Protein folding; Electrostatic repulsion Protein production and characterization has been accumulation of high concentrations of folding inter- greatly facilitated by the development of systems for mediates in spontaneous folding pathways or from expression of proteins to high levels in homologous or ineYcient recognition or processing of polypeptide sub- heterologous cells [1,2], although many proteins are strates by molecular chaperones. Culturing cells at unable to fold correctly during overexpression and reduced temperature or increasing cell chaperone capac- instead form insoluble aggregates. Under normal expres- ity can in some cases increase the yields of correctly sion conditions, a subset of endogenous polypeptides folded overexpressed proteins, but these approaches are can fold spontaneously [3], while folding of aggregation- not universally eVective. Development of alternative prone polypeptides requires assistance from molecular strategies to minimize aggregation of nascent polypep- chaperones, trans-acting factors that associate reversibly tides therefore would enhance the utility of systems for with nascent polypeptides to prevent aggregation during protein overexpression. the protein folding process [4]. Polypeptide aggregation Fusion of aggregation-prone polypeptides to carrier during overexpression therefore could result either from proteins frequently increases the solubility and in some ¤ cases promotes correct folding of polypeptides during Corresponding author. Fax: 1-631-344-3407. overexpression in bacteria or other host cells. Soluble E-mail address: [email protected] (P. Freimuth). 1 Present address: Department of Biological Sciences, Biophysics fusion partners commonly used for this purpose include Section, Blackett Laboratory, Imperial College London, Prince the thioredoxin [5], NusA [6] and maltose-binding Consort Road, London SW7 2BW, UK. proteins [7] from Escherichia coli, and the glutathione 1046-5928/$ - see front matter © 2004 Elsevier Inc. All rights reserved. doi:10.1016/j.pep.2004.04.020 208 Y.-B. Zhang et al. / Protein Expression and PuriWcation 36 (2004) 207–216 S-transferase (GST) protein from Schistosoma japoni- DNA fragments encoding the D1 domains of CAR and cum [8]. The mechanism by which these carrier proteins A33, and the single extracellular domain of P0 (P0ex) promote folding of target polypeptides is not fully were ampliWed from cDNA templates by PCR using Pfu understood and may not be universal. It was suggested, Turbo polymerase (Stratagene, La Jolla, CA) and the for example, that a hydrophobic cleft on the surface of primer sets shown in Table 1. Products were cloned into maltose-binding protein might bind reversibly to expression vector pET15b (Novagen, Madison, WI), or exposed hydrophobic regions of nascent passenger poly- into derivatives of pET15b created to allow fusion of peptides, steering the polypeptides toward their native proteins to the solubility-enhancing tags shown in Table conformation by a chaperone-like mechanism [9]. 2. Ligated DNAs were initially transformed into E. coli Another study suggested that decreased translation rates strain DH5 for characterization, and subsequently into resulting from NusA-mediated transcriptional pausing strain BL21(DE3) for protein expression. The predicted might enable proper nucleation of passenger polypeptide nucleotide sequences of all expression constructs were folding during translation [6]. conWrmed by DNA sequence analysis. Bacterial strains Here, we report that aggregation-prone Ig variable- were grown in LB medium supplemented with 150 mg/L type domains derived from three human membrane gly- penicillinG. Protein expression was induced in midlog coproteins can be solubilized during overexpression in phase cultures by addition of isopropyl--D-galactopy- the E. coli cytoplasm by extension of the domain N- or ranoside (IPTG) to 1 mM. C-terminus with highly acidic peptides. In two cases, DNA fragments encoding the peptide extensions substantial fractions of the peptide-solubilized domains listed in Table 2 were produced by PCR ampliWcation of were correctly folded whereas the third protein predomi- bacteriophage T7 genomic DNA or a subcloned region nantly formed soluble microaggregates. Although much of the bacteriophage T3 DNA genome [11], using Pfu smaller than maltose-binding protein, GST, or the other polymerase and primer sets (sequences not shown) to fusion partners noted above, these peptides nonetheless adapt fragment ends for ligation between the XhoI/BlpI appear to have similar enhancing eVects on protein solu- or BamHI/BlpI sites of pET15b. The amino acid bility and folding. Furthermore, potential folding mech- sequence changes listed in Table 2 were introduced via anisms that would require activities associated with mutagenic primers (sequences not shown), following speciWc protein conformations, such as those proposed established methods for PCR mutagenesis [12]. The pre- for maltose-binding protein and NusA, do not likely dicted DNA sequences encoding all variant peptides apply to these peptide extensions because peptides of were conWrmed by DNA sequence analysis. DNA this length (040 residues) generally are unable to adopt encoding the T7B9 peptide (Table 2) also was ampliWed stable tertiary conformations. Our results suggest that with primers (not shown) which adapted it for cloning the peptide extensions described here may indirectly pro- mote folding by increasing electrostatic repulsion Table 1 between nascent polypeptides. The resulting delay in PCR primers for Ig domain protein expression polypeptide aggregation would provide more time for Proteina Primersb proteins to fold spontaneously by chaperone-indepen- F-CTAGTGCATATGGGTATCACTACTCCT dent mechanisms. Amino acid and gene sequence char- CAR D1 CAA68868.1 R1-TCTGACTCGAGTTAACCTGAAGGCTTAACA acteristics of these three Ig-type domains that might S21Gc-A144 inXuence their eYciencies of spontaneous, chaperone- R2-TGACTTCTCGAGCGCACCT independent folding are discussed. A33 D1 F-CAGTCATATGATCTCTGTGGAAACTCC Q99795 R1-CTAGCTCGAGTCATTTGGAGGGTGGCACGAGG ACCAACAGGCG Materials and methods I22-K143 R2-CTAGCTCGAGTTTGGAGGGTGGCACGAGGACC AACAGGCG Protein expression P0ex F-GGAATTCCATATGATCGTGGTTTACACCG AAH06491 R1-CAGACTCGAGTCACCTAGTTGGCACTTTTTC cDNA clones for human A33 antigen and human I40-R163 R2-CAGACTCGAGCCTAGTTGGCACTTTTTC myelin P protein were obtained from the Image Con- 0 a Protein database accession numbers are indicated under the pro- sortium [10] collection (Clone Nos. 2710753 and tein name, and residue numbers that deWne the endpoints of 3926008, respectively). Clone 2710753 is a partial A33 the expressed protein fragments are indicated under the accession cDNA clone, in which correctly spliced exons 1–3 are numbers. b fused to intron 3. This clone thus encodes the complete Forward primers are denoted F-. Primers denoted R1- are reverse A33 D1 domain and N-terminal signal peptide, and a primers that contain stop codons to terminate protein translation at the Ig domain C-terminal boundary. Stop codons were omitted 10-residue C-terminal extension of the D1 domain speci- from reverse sense primers denoted R2-, to allow fusion of Ig domain Wed by intron 3. Human CAR cDNA was obtained by C-termini to T7-derived solubility tags. reverse transcriptase-PCR of HeLa cell total RNA. c Amino acid substitution designed into PCR primer. Y.-B. Zhang et al. / Protein Expression and PuriWcation 36 (2004) 207–216 209 Table 2 Sequence and net charge of peptides fused to the C-terminus of CAR D1 Peptide Sequencea Net
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages10 Page
-
File Size-