Protein Expression and Purification 85 (2012) 9–17

Contents lists available at SciVerse ScienceDirect

Protein Expression and Purification

journal homepage: www.elsevier.com/locate/yprep

Review Simplifying protein expression with ligation-free, traceless and tag-switching plasmids ⇑ Venuka Durani a, Brandon J. Sullivan b, Thomas J. Magliery a,c, a Department of Chemistry, The Ohio State University, Columbus, OH 43210, USA b Ohio State Biochemistry Program, The Ohio State University, Columbus, OH 43210, USA c Department of Biochemistry, The Ohio State University, Columbus, OH 43210, USA article info abstract

Article history: Synthetic biology and genome-scale protein work both require rapid and efficient cloning, expression and Received 17 March 2012 purification. Tools for co-expression of multiple proteins and production of fusion proteins with purifica- and in revised form 1 June 2012 tion and solubility tags are often desirable. Here we present a survey of plasmid vectors that provide for Available online 21 June 2012 some of these features with a focus on tools for rapid cloning and traceless tagging – a setup that facil- itates removal of fusion tags post-purification leaving behind no ‘scar’ on the final construct. Key features Keywords: are reviewed, including plasmid replication origins and resistance markers, transcriptional promoters, Plasmid cloning methods, and fusion tags and their removal by . We describe a vector system called Protein expression pHLIC, which assembles features for simple cloning, overexpression, facile purification, and traceless Co-expression Protease cleavage cleavage, as well as flexibility in modifying the vector to exchange fusion tags. Ligation independent cloning Ó 2012 Elsevier Inc. All rights reserved. Traceless tagging

Contents

Introduction...... 9 Origin of replication ...... 10 Antibiotic resistance ...... 10 Cloning regions ...... 10 Transcription promoters ...... 12 Fusion tags...... 13 Protease cleavage sites ...... 13 Seamless cloning and traceless tagging ...... 13 Vectors for traceless tagging – some examples...... 13 pHLIC vectors ...... 14 Conclusion ...... 15 Appendix A. Supplementary data ...... 15 References ...... 15

Introduction often expensive and time-consuming prerequisites, particularly for beginning researchers. Solutions to these problems have High-throughput approaches in modern protein science – emerged [6–9]: recombinogenic (e.g., Gateway [10]) cloning and including structural genomics and – have necessitated ligation-independent cloning (LIC)1 [11,12]; vectors with very the development of high-throughput methods for cloning, protein expression, and purification [1–5]. Even for laboratories studying a 1 Abbreviations used: MBP, maltose binding protein; GST, glutathione S-transferase; single protein target, such as designed proteins, these steps are CBP, calmodulin binding protein; Trx, thioredoxin; PSI–MR, protein structure initiative–materials repository; LIC, ligation independent cloning; SLiCE, seamless ⇑ Corresponding author at: Department of Chemistry, The Ohio State University, ligation cloning extract; IPTG, isopropyl b-D-1-thiogalactopyranoside; IMAC, immo- Columbus, OH 43210, USA. Fax: +1 614 292 1685. bilized metal-ion affinity chromatography; SUMO, small ubiquitin-like modifier; Ub, E-mail addresses: [email protected] (V. Durani), bsulliva@ ubiquitin; DUBs, deubiquitylating enzymes; TEV, tobacco etch virus; TIM, triose- chemistry.ohio-state.edu (B.J. Sullivan), [email protected] (T.J. Magliery). phosphate isomerase.

1046-5928/$ - see front matter Ó 2012 Elsevier Inc. All rights reserved. http://dx.doi.org/10.1016/j.pep.2012.06.007 10 V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17

that includes the origin of replication determines its copy number. Usually for large scale protein purification we turn to high copy number plasmids that can maintain tens to hundreds of copies of plasmid per cell. Production of recombinant proteins in often results in degradation, aggregation or misfolding of pro- teins, and co-expression [22] of molecular chaperones can provide a solution to this problem [23–25]. Co-expression of proteins is also desirable for purification of multi-subunit complexes and studying interacting proteins [26–28]. However, plasmid incom- patibility, a mechanism that prevents the stable co-existence of two similar plasmids in the same bacterial cell, can cause problems for these kinds of applications, and having access to plasmids with different origins of replication that are compatible with each other is important. The best developed class of plasmid vectors for many manipulations including protein expression is based on the ColE1/ pMB1 [29] replicons including the pBR322 [30,31], pUC [32,33] and pET (Novagen) vector systems. Vectors containing ColE1/pMB1 de- rived origins of replication fall into the same incompatibility group [34,35]. On the other hand, plasmids like pACYC177 and pACYC184 [36,37] with origins of replication derived from p15A [38], although incompatible with other vectors with p15A derived ori- gins, are compatible with ColE1/pMB1 derived plasmids. These are two compatible vector groups that are commonly used for co-expression of proteins. Since ColE1 based plasmids are very common for production of target proteins, several chaperone- encoding plasmids are available on vectors with p15A derived rep- licons [23]. For experiments in which the population of three or more different plasmids needs to be maintained in a cell, other Fig. 1. Schematic plasmid map (not drawn to scale) showing various important plasmids with pSC101 [39], CloDF13 [40,41], ColA [42,43], regions of plasmid vectors that are discussed in this review. RF1030 [44] and pEC [45] replicons can be used [22]. strong promoters such as T7 and cspA; and the use of fusion tags Antibiotic resistance such as hexahistidine (6His) [13], maltose binding protein (MBP) [14,15], glutathione S-transferase (GST) [16,17], calmodu- In order to ensure that a plasmid vector is taken up and main- lin-binding peptide (CBP) [18,19] and thioredoxin (Trx) [20] for tained in a cell, genes encoding resistance to antibiotics are ex- solubility and/or purification. Many useful vectors are available pressed from plasmids. Moreover, if populations of multiple through the Protein Structure Initiative Material Repository (PSI– vectors are to be maintained in a cell, then not only do their repli- MR), and information about these vectors is available in a search- cator regions need to be compatible, their antibiotic resistance able database called DNASU [7]. markers also need to be orthogonal to ensure co-transformation. While affinity tagging results in convenient purification, many The most common antibiotic resistance markers are ampicillin, applications demand removal of the fused tag post-purification, kanamycin, chloramphenicol and tetracycline. Table 1 has exam- typically by proteolysis. However, the recognition sequences of ples of some vectors with spectinomycin, zeocin and erythromycin restriction enzymes for cloning and of proteases for freeing the fi- resistance. Ampicillin is especially susceptible to degradation nal protein typically put constraints on one or more amino acids in when present the bacterial culture and this effect can be observed the protein, leaving a ‘scar’ that may not be desirable. This problem as ‘satellite’ colonies on agar plates. This effect can be alleviated to has initiated work on seamless cloning and traceless tagging strat- some extent by using carbenicillin [46], a less degradation- egies, and plasmid vectors that simplify such manipulations are susceptible analog of ampicillin. desirable. Plasmid vectors require various components to carry out their functions (Fig. 1). The origin of replication determines Cloning regions the copy number of a plasmid; antibiotic resistance provides a selection marker; promoter regions facilitate gene transcription; Most plasmid vectors that are commercially available have and careful engineering of cloning sites can provide for features multiple unique restriction enzyme sites arranged in tandem in a like ligation-independent cloning and fusion tags to facilitate solu- polylinker cloning region. These can be used for traditional liga- bility and purification of the expressed protein. Vectors with un- tion-dependent cloning methods (Fig. 2a) where a target gene is ique combinations of these functionalities are especially suited amplified by PCR using primers containing unique restriction sites, for specific applications in structural genomics and synthetic biol- followed by restriction enzyme digest and ligation into a vector ogy. Here we present a brief survey of plasmid vectors useful for containing compatible cohesive ends. These traditional methods protein expression with a focus on vectors that facilitate quick often produce ‘background’ due to incomplete digestion and cloning and traceless tagging of proteins. re-ligation of the digested vector and require unique restriction en- zyme sites in the vector that do not occur in the DNA to be cloned. Origin of replication These limitations and others such as difficulty to manipulate very large inserts have led to research in recombinogenic cloning meth- The origin of replication is the smallest region of the plasmid ods especially for applications in structural genomics. Homologous that is able to replicate on its own and is also the site where plas- recombination occurs through stretches of DNA shared by the two mid replication is initiated [21]. The replicator region of a plasmid molecules. Because the sequence of the homology regions can be V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17 11

Table 1 List of plasmid vectors.

Vector name Fusion tagsa Cleavage sites ‘Scar’ residues post-cleavageb Cloning method Promoter region Ori Res. Ref. Origins of replication and antibiotic resistance markers: pET11a T7 None 13 amino acids Ligation T7 ColE1 Amp [99] pURI2-Ery 6His EK protease M LIC lpp ColE1 Ery [100] pACYCDuet-1 6His None 13 amino acids Ligation T7 p15A Cmp [101] pT7LICK 6His (Ct) None DHHHHHH (Ct) LIC T7 pSC101 Kan [12]

pEC None N.A. N.A. Ligation PBAD pEC Kan [45] pMCSG21 6His TEV protease SNA LIC T7 Clo-DF13 Spec [102] pCOLADuet-1 6His None 14 amino acids Ligation T7 ColA Kan [43] Cloning methods: pET32a Trx-6His EK protease AMA Ligation T7 ColE1 Amp [99] pVP16 8His-MBP TEV protease S Gateway T5 ColE1 Amp [8] pNIC28-Bsa4 6His TEV protease SM LIC T7 ColE1 Kan [4] pBL None N.A. N.A. SLiCE T7 ColE1 Amp [52] pCR-BluntII- None N.A. N.A. TOPO T7, SP6 ColE1 Kan, [55,56] TOPO Zeo Transcription promoters: pVP33A 8His-MBP 3C and TEV AIA Ligation T5 ColE1 Amp [8] pURI2 6His EK protease M LIC lpp ColE1 Amp [13] pURI2-TEV 6His TEV protease GM LIC lpp ColE1 Amp [100] pURI3 6His EK protease M LIC T7 ColE1 Amp [13]

pBAD/His 6His EK protease DRWGSELE Ligation PBAD ColE1 Amp [103] pCKSP6 None N.A. N.A. Ligation SP6 ColE1 Amp [104] PinPoint Xa1 Biotin Factor Xa None Ligation tac ColE1 Amp [105] pCOLD I 6His Factor Xa HM Ligation cspA ColE1 Amp [72] Tag-switching plasmids: pHLIC 6His TEV protease None LIC T7 ColE1 Amp [98] pHLIC-MBP 6His-MBP TEV protease None LIC T7 ColE1 Amp pMCSG7 6His TEV protease SNA LIC T7 ColE1 Amp [11] pMCSG9 6His-MBP TEV protease SNA LIC T7 ColE1 Amp [95] pMCSG10 6His-GST TEV protease SNA LIC T7 ColE1 Amp [102] pMCSG11 6His TEV protease SNA LIC T7 p15A Cmp [102] pMCSG13 6His-MBP TEV protease SNA LIC T7 p15A Cmp [97] pMCSG14 6His-GST TEV protease SNA LIC T7 p15A Cmp [97] pLIC-His 6His TEV protease GAAS LIC T7 ColE1 Amp [2] pLIC-MBP 6His-MBP TEV protease GAAS LIC T7 ColE1 Amp [2] pLIC-GST 6His-GST TEV protease GAAS LIC T7 ColE1 Amp [2] pOPINF 6His 3C protease GP LIC T7 ColE1 Amp [1] pOPINM 6His-MBP 3C protease GP LIC T7 ColE1 Amp [1] pOPINJ 6His-GST 3C protease GP LIC T7 ColE1 Amp [1] Protease cleavage sites: pMCSG19 MBP-6His TVMV and TEV SNA LIC T7 ColE1 Amp [95] pVP33K 8His-MBP 3C and TEV AIA Ligation T5 ColE1 Kan [8] pCAL-n-EK CBP EK protease None LIC T7 ColE1 Amp [19] pHUE 6His-Ub Usp2-cc None Ligation T7 ColE1 Amp [77] pHLIC-smt3 6His-SUMO SUMO protease None LIC T7 ColE1 Amp C-terminal His-tags: pT7LICB 6His (Ct) None DHHHHHH (Ct) LIC T7 ColE1 Amp [12] pT7LICC 6His (Ct) None DHHHHHH (Ct) LIC T7 p15A Cmp [12] pT7LICK 6His (Ct) None DHHHHHH (Ct) LIC T7 pSC101 Kan [12] pURI2-Cter 6His (Ct) None HHHHHH (Ct) LIC lpp ColE1 Amp [100]

Ori, origin of replication; Res, antibiotic resistance marker; Ref, reference; Amp, ampicillin; His, histidine tag; EK, enterokinase; LIC, ligation independent coning; Ery, erythromycin; Cmp, chloramphenicol; Ct, C-terminal; Kan, kanamycin; N.A., not applicable; TEV, Tobacco Etch Virus; Spec, spectinomycin; Trx, thioredoxin; MBP, maltose binding protein; Zeo, zeocin; 3C, human rhinovirus 3C protease site; GST, glutathione S-transferase; SUMO, Small Ubiquitin-like Modifier; TVMV, tobacco vein mottling virus protease; CBP, calmodulin binding protein. a All the fusion tags are on the N-terminus unless otherwise specified as Ct (C-terminus). b Extra N-terminal (unless otherwise specified) residues that remain after protease cleavage due to the design of the cloning site are listed in one letter amino acid codes. In some cases more residues may be required based on efficiency of protease cleavage. For vectors that have a tag but no cleavage site, the length of the tag is mentioned. chosen freely, any position on a target molecule can be specifically drawbacks of traditional cloning methods [51]. By taking advan- altered (for reviews on recombinogenic cloning see references tage of the controlled exonuclease activity of T4 DNA polymerase, [47–49]). Gateway cloning (Fig. 2b) vectors have several features the LIC procedure creates long single-stranded overhangs of DNA required for high-efficiency and accurate cloning using recombino- in the vector and the insert. These overhangs hybridize on mixing genic methods [10]. Once an insert is cloned into an entry vector the vector and insert together. When transformed into E. coli, the for Gateway cloning, it can be easily swapped into other Gateway nicks are sealed by the cellular machinery. Hence no use of ligase vectors using clonase activity. This makes Gateway cloning very is required, and since the vector has two fairly large overhangs useful for applications where the target construct is required in which can be designed to be completely non-complementary to various contexts like different solubility tags or different origins each other, the chances of re-sealing the vector are very small. of replication. Moreover, since this scheme does not require digestion of the in- Ligation-independent cloning (Fig. 2c) [50] adds flanking se- sert with any restriction enzymes, there are no complications quence extensions to the target gene that are shorter than those regarding multiple occurrences of the preferred cloning enzymes used in recombinogenic methods while still avoiding some of the within the sequence to be cloned. A new cloning method called 12 V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17

Fig. 2. Cloning schemes for (a) ligation-based cloning; (b) ligation-independent cloning; (c) recombinogenic cloning (Gateway); (d) SLiCE cloning; and (e) TOPO cloning.

SLiCE (Seamless Ligation Cloning Extract) relies on in vitro recom- to degradation of the host chromosome [57,58]. Ligation of an in- bination in bacterial cell extract (Fig. 2d) [52]. Extracts from the sert into the vector disrupts the ccdB gene and enables the recom- two recA strains DH10B and JM109 yielded the highest cloning binant colonies to grow. This method eliminates background from efficiencies indicating that SLiCE is facilitated by a RecA-indepen- self-ligation of the vector. dent mechanism. SLiCE facilitates seamless cloning by recombining short end homologies (>15 bp) with or without flanking heterolo- gous sequences. Seamless cloning using SLiCE has been demon- Transcription promoters strated using plasmid vector pBL [52]. Other recently-reported methods of cloning that fall into this category include Gibson clon- Promoter regions affect the level of gene expression in E. coli, and ing (enzymatic assembly of DNA molecules using T5 exonuclease, the suitability of a promoter for protein expression depends on Phusion polymerase and Taq ligase) [53] and CPEC (Circular Poly- three main factors. First, the promoter must be strong and have merase Extension Cloning that involves extension of the overlap the capacity to over-produce proteins to the extent that they be- between single stranded vector and insert to form a complete plas- come a significant percentage of the total expressed protein in the mid) [54]. cell. Second, it is desirable to have a highly tunable (or at least highly TOPO cloning (Fig. 2e) is a method of cloning where a vector and repressible) promoter particularly for proteins that have adverse ef- insert with single nucleotide overhangs are ligated together by tak- fects on the growth of the host cell. Third, the promoter should be ing advantage of topoisomerase activity [55]. The single nucleotide easily inducible typically by chemical or thermal means. Several overhang in the insert is created upon amplification by Taq poly- promoter systems used for protein production in E. coli, their regu- merase and this technique, like LIC, negates the use of restriction lation mechanisms, and methods of induction can be found in refer- enzyme digest for the insert. The vector pCR-BluntII-TOPO (Invitro- ences [59,60]. Vectors with SP6 (phage promoter, mainly used for gen) uses TOPO cloning in conjunction with the ccdB (control of cell cell free expression [61,62]), T5 (phage promoter, induced by IPTG, death) gene for zero background cloning [56]. The CcdB protein very high level of expression, tight regulation possible [63]), hybrid causes cell death by poisoning bacterial DNA gyrase, which leads lpp (E. coli promoter, induced by IPTG, very high level of expression V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17 13

[64]), PBAD (E. coli promoter, induced by L-arabinose, variable level of after purification. Common proteases for this purpose include expression from low to high, tight regulation possible [65]), tac TEV protease [79–81] and related 3C protease [82], enterokinase (E. coli hybrid promoter, induced by IPTG, moderately high level of [83–85], Factor Xa [86], [87,88] and SUMO protease expression, high basal level of expression [66]), cspA (E. coli pro- [76]. TEV protease, the capsid protease of the tobacco etch virus, moter, cold shock induction at temperatures below 20 °C, low level is among the most robust of the proteases in common use [89]. of expression [67]) and T7 (phage promoter, induced by IPTG, very It is more specific than enterokinase [85] or Factor Xa [90], and it high level of expression, tight regulation possible [68]) promoters is active under a wide range of conditions. It is commercially avail- are listed in Table 1. The pET vector system that has the promoter able and can also be overexpressed and purified from E. coli using a system from bacteriophage T7 has been very widely used due to standard protocol [91]. The general form of its recognition se- the orthogonality of this system to E. coli promoters, leading to ease quence is taken to be E-X-X-Y-X-Q-(G/S), with cleavage after the of cloning toxic genes and also its capacity to overproduce proteins Gln residue; the most commonly used sequence is ENLYFQG. Re- at very high levels constituting up to 50% of the total cell protein cently, Kapust et al. demonstrated that the TEV protease will cleave [69–71]. The T7 promoter, along with some of the other promoters with greater than 90% efficiency with 12 different amino acids at listed above, is indirectly induced by IPTG in that T7 RNA polymer- the P10 site (C-terminal to Gln) and greater than 50% efficiency ase is expressed from the lac promoter-operator. One of the for all other amino acids except proline [92]. Effectively, TEV pro- limitations of the T7 promoter system, however, is that rapid tease can be used to quantitatively produce mature proteins with over-expression of relatively ill-behaved proteins can lead to pro- any N-terminal amino acid except Pro under reasonable reaction tein misfolding, aggregation and segregation into inclusion bodies. conditions. However, proteins with structured N-terminal regions Although using a cold shock promoter like cspA [67] (pCold vectors sometimes do not display very efficient cleavage with TEV protease by Takara Bio Inc. [72]) is sometimes a solution to this problem, a presumably due to inaccessibility of the protease recognition and major disadvantage of that system is that it becomes repressed 1– cleavage site. For such cases, having an alternative option is desir- 2 h after lowering the temperature, and this time frame might not able, such as Factor Xa, which cleaves I-(E/D)-G-R-X between R and be sufficient for a high level accumulation of all proteins. X, where X can be any amino acid except proline [93]. Another op- tion is the SUMO protein fusion. In this case, the 100 residue SUMO protein serves as the recognition site for SUMO protease, which Fusion tags cleaves after the C-terminal Gly of SUMO. It has been shown that recombinant SUMO–GFP fusions are efficiently cleaved when any Besides cold shock, there are several approaches to the expres- amino acid except proline is in P1´ position of the cleavage site sion of proteins that end up in the insoluble fraction, including [76]. Similar to SUMO, the ubiquitin tag also serves as a recognition the co-overexpression of chaperones [23–25], periplasmic expres- site for deubiquitylating enzymes (DUBs), however most DUBs that sion of proteins [73] and the use of solubilizing fusion tags. Protein have been isolated from various species are relatively large and dif- fusions are efficient tools for protein purification, detection and ficult to express and purify. No stable DUB known with general immobilization. One of the most commonly used fusion tags for activity against a range of fusion proteins was known. Recently, a protein purification is the hexahistidine (6His) [13] tag that facil- mouse DUB, Usp2, was engineered to provide a minimal catalyti- itates purification by immobilized metal-ion affinity chromatogra- cally active deubiquitylating domain (Usp2-cc) [77]. It was ex- phy (IMAC) purification with Ni–NTA agarose. This method is pressed and purified with a polyhistidine tag that allowed the inexpensive; the resin has very high capacity and can be reused in vitro cleavage of Ub from the desired protein as well as its selec- many times. It is less expensive than other resins such as streptavi- tive removal from the cleavage reaction. din agarose used for purification of biotin-tagged proteins, and it has a higher capacity than the less-expensive amylose resin used to purify proteins with an MBP tag [59,74]. Although fusion tags Seamless cloning and traceless tagging were originally developed for purification purposes, it was found that many tags lead to better solubilization of proteins fused at Generation of the overhang for cloning into a vector requires their C-termini. It is believed that the general mechanism for this restriction enzyme scission, which places a limitation on the se- effect involves the fast and efficient folding of the N-terminal fusion quence of the overhang and usually results in a particular protein tag upon exiting the ribosome immediately after translation, thus sequence at the site of fusion. A way to get around this problem pulling the C-terminally fused protein into solution. The most com- and accomplish ‘seamless’ cloning [94] is to digest with a type IIs monly used fusion tags for solubility enhancement are maltose restriction enzyme which cuts outside of its recognition sequence binding protein (MBP) [14,15], calmodulin-binding peptide (CBP) and therefore in any sequence. (GeneArt by Invitrogen and CloneEZ [18,19], glutathione S-transferase (GST) [16,17], and thioredoxin by GenScript are kits that can be used for seamless cloning.) Even (Trx) [20]. More recently it has been shown that the N-terminal fu- for routine protein expression and purification, it would be useful sion of SUMO protein [75,76], ubiquitin [77], and HaloTag7 [78] also to have a very flexible plasmid vector for high-level E. coli overex- enhance expression levels and solubility of ill-behaved proteins. pression of proteins using an N-terminal 6His tag that could be Several fusion tags can have multiple functionalities; for instance, removed tracelessly – which is to say, with no residues left behind MBP and HaloTag7 can be used for both solubility enhancement and and few or no constraints on the amino acid next to the scissile purification. Similarly, SUMO and ubiquitin (Ub) protein fusions can bond. To circumvent some issues with ligation-dependent cloning, be used as both solubility tags and also protease cleavage tags. Some- a ligation-independent method is desirable, but with minimal ex- times finding the optimum tag for an application requires some trial pense, and with the flexibility to use ligation-dependent methods, and error and also some applications may require the same protein as well. Several vectors which nearly fit these criteria are available, with different tags. Hence, tag switching vectors where the same in- perhaps most notably the pMCSG7 vector [11] and the pHLIC vec- sert can be used to clone with various tags are desirable. tor system (described here).

Protease cleavage sites Vectors for traceless tagging – some examples

Even though fusion tags often solubilize proteins and provide Today several plasmid vectors are available that have cloning options for easy purification, it is often desirable to remove them sites designed to attach fusion tags and also provide for protease 14 V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17

optionally C-terminal) 6His tag to a target protein cloned by LIC under the control of the T7 promoter. Variants of this vector that allow for TEV cleavable MBP and GST fusions are also available and this series has some ColE1 derived vectors and others with p15A derived replicons for co-expression [97]. However, the LIC cloning scheme designed for these vectors requires that the cleaved N-terminal sequence begin with Ser-Asn-Ala. We have constructed a related vector in which there are no constraints on the N-terminal sequence, and in which the protease recognition se- quence can be removed or changed by a simple standard cloning step. We call the resulting vector pHLIC (plasmid for 6His tagging and Ligation Independent Cloning, Fig. 3 and Supplementary Fig. 3. Schematic maps of the pHLIC and pHLIC-MBP plasmids. The AlwNI site is not material). unique in the pHLIC-MBP vector. To achieve high-level expression and simple, inexpensive purifi- cleavage sites to remove the fusion tags later. These vectors have cation, we selected a pET vector system with a T7 promoter for combinations of different methods of cloning, different fusion tags overexpression in DE3-lysogenized cells, and we selected an N-ter- and different protease cleavage sites (several examples are listed in minal 6His-tag. It would be difficult to achieve ‘‘traceless’’ scission Table 1). For instance, vectors pURI [13] and pCAL-n-EK [19] can be with a C-terminal tag, since virtually all useful (highly specific and used for ligation-independent cloning, and use enterokinase, which active) proteases have their recognition elements upstream of or is not as specific as TEV and also more expensive. Vectors like pOPINJ overlapping the scissile bond. To truly make this traceless, we se- [1], pLIC [2], pNIC28 [4], pVp [8] allow for TEV cleavage of 6His tag, lected the robust and easy-to-prepare TEV protease, which cleaves but due to the design of the vector they leave behind 2–4 residues on before every amino acid except Pro. To circumvent ligation-depen- the N-terminus after protease cleavage. The vector pT7LIC [12] dent cloning, we designed a sequence for ligation-independent leaves behind an unmodified N-terminus, but uses a C-terminal cloning, which is to say, a sequence that encodes the TEV cleavage 6His-tag that cannot be removed. pMCSG7 [11] and pMCSG9 sequence but that is devoid of a single nucleotide (dG in the vector; [95,96] are tag-switching vectors where the same insert can be used see Fig. 4) to generate the LIC overhang with T4 polymerase editing. to clone in the context of a 6His–TEV fusion and also a 6His–MBP– To remove any restriction on the first amino acid at the cloning TEV fusion. However pMCSG7 and pMCSG9 leave three extra N-ter- junction, we embedded BsaI sites in opposite orientations between minal residues after TEV cleavage and do not use a type IIs restriction the sequences for the TEV site and the terminator. BsaI is a type IIs enzyme for seamless cloning. On the other hand, the use of BsaI en- restriction enzyme that cuts outside of its recognition sequence. zyme for seamless cloning has been used with pX-LacZ vector [3] but Therefore, cleavage with BsaI removes the cassette without leaving not in the context of a His-tag purification scheme. a trace. Finally, we placed an NcoI site between the 6His tag and TEV site, making it possible to clone without the TEV site (between NcoI and BamHI) by standard means or replace the 6His tag (be- pHLIC vectors tween NdeI and NcoI). The pHLIC vector can also be used for cloning using enzymes NdeI and BamHI. The result would be equivalent to The pMCSG7 vector of the Midwest Center for Structural using the pET11a vector (Novagen) between those sites and would Genomics appends a TEV protease-cleavable N-terminal (and

Fig. 4. LIC scheme for pHLIC. BsaI digested pHLIC vector (a) is treated with T4 DNA polymerase in the presence of dGTP to create large overhangs (b). The PCR product for the insert (c) is treated with T4 DNA polymerase in the presence of dCTP to create complementary overhangs (d). Then (b) and (d) are mixed together leading to the formation of the desired clone by hybridization of the overhangs of the vector and insert. This product has nicks, at positions indicated by arrows in (e), which are resolved in the cell upon transformation. V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17 15

protein sequence desired, with no ‘scar’ left behind from fusions designed for expression, solubilization and purification. Many fu- sion proteins and proteases are useful in different contexts, and so it is helpful to use systems that allow for simple tag switching. Vectors from the Gateway system, as well as many from structural genomics centers, assemble many of these useful features. The pHLIC system and similar vectors combine these features in a sim- ple plasmid that is easy to construct and modify for the research- er’s specific needs. Fig. 5. Coomassie Blue stained 12% SDS–PAGE gel showing protein purification and TEV cleavage of 6His–cTIM and 6His–MBP–cTIM. Lane 1, Marker; lane 2, soluble fraction of the cell lysate for 6His-cTIM; lane 3, eluate; lane 4, TEV reaction; lane 5, flow-through from Ni–NTA column after TEV cleavage; lane 6, soluble fraction of Appendix A. Supplementary data the cell lysate for 6His-MBP-cTIM; lane 7, eluate; lane 8, TEV cleavage reaction; lane 9, flow-through from Ni–NTA column after TEV cleavage. Supplementary data associated with this article can be found, in the online version, at http://dx.doi.org/10.1016/j.pep.2012.06.007. attach no sequence for a 6His-Tag or TEV cleavage site to the 50 end of the insert. Also, pHLIC can be easily modified to make other References recombinant forms of proteins using LIC by changing the affinity tag to MBP, GST, Trx or other such fusion tags. For example, we made a [1] N.S. Berrow, D. Alderton, S. Sainsbury, J. Nettleship, R. Assenberg, N. Rahman, pHLIC-MBP vector (Fig. 3) and successfully used it for LIC and pro- D.I. Stuart, R.J. Owens, A versatile ligation-independent cloning method suitable for high-throughput expression screening applications, Nucleic Acids tein expression. Since the insert required for cloning into both Res. 35 (2007) e45. pHLIC and pHLIC-MBP is exactly the same, switching from a [2] L.D. Cabrita, W.W. Dai, S.P. Bottomley, A family of E-coli expression vectors for 6His-tag to a 6His-MBP tag is very easy. Similarly, we have con- laboratory scale and high throughput soluble protein production, BMC Biotechnol. 6 (2006) 12. structed a pHLIC-smt3 (Supplemental Fig. S1) vector, which allows [3] C. Engler, R. Kandzia, S. Marillonnet, A one pot, one step, precision cloning for seamless cloning with an N-terminal SUMO-like protein tag method with high throughput capability, PLoS ONE 3 (2008) e3647. using ligation-independent cloning. It also provides an option for [4] O. Gileadi, N.A. Burgess-Brown, S.M. Colebrook, G. Berridge, P. Savitsky, S.C.E. ´ A, P. Loppnau, C. Johanss, E. Salah, N.H. Pantic, High throughput production of ligation-dependent cloning using DraIII (engineered in the 3 end recombinant human proteins for crystallography, in: B. Kobe, M. Guss, T. of the smt3 gene using silent mutations) and BamHI cloning sites. Huber (Eds.), Structural Proteomics: High Throughput Methods, Humana This basic model vector can therefore be used to generate a library Press, 2008, pp. 221–246. of vectors useful for expressing various forms of recombinant pro- [5] S. Graslund, P. Nordlund, J. Weigelt, J. Bray, B.M. Hallberg, O. Gileadi, S. Knapp, U. Oppermann, C. Arrowsmith, R. Hui, J. Ming, S. Dhe-Paganon, H.W. Park, A. teins. Detailed information on the cloning and use of pHLIC and Savchenko, A. Yee, A. Edwards, R. Vincentelli, C. Cambillau, R. Kim, S.H. Kim, Z. its derivatives can be found in the Supplementary material. Rao, Y. Shi, T.C. Terwilliger, C.Y. Kim, L.W. Hung, G.S. Waldo, Y. Peleg, S. The pHLIC vector was tested to ensure that the ligation-inde- Albeck, T. Unger, O. Dym, J. Prilusky, J.L. Sussman, R.C. Stevens, S.A. Lesley, I.A. Wilson, A. Joachimiak, F. Collart, I. Dementieva, M.I. Donnelly, W.H. pendent cloning scheme (Fig. 4 and Supplementary material) Eschenfeldt, Y. Kim, L. Stols, R. Wu, M. Zhou, S.K. Burley, J.S. Emtage, J.M. worked as intended and that it could be used to overexpress pro- Sauder, D. Thompson, K. Bain, J. Luz, T. Gheyi, F. Zhang, S. Atwell, S.C. Almo, J.B. teins. To this end, the gene for a designed variant of triosephos- Bonanno, A. Fiser, S. Swaminathan, F.W. Studier, M.R. Chance, A. Sali, T.B. Acton, R. Xiao, L. Zhao, L.C. Ma, J.F. Hunt, L. Tong, K. Cunningham, M. Inouye, S. phate isomerase enzyme (cTIM) [98] was cloned into it with a Anderson, H. Janjua, R. Shastry, C.K. Ho, D.Y. Wang, H. Wang, M. Jiang, G.T. high success rate and a low background. This vector was then used Montelione, D.I. Stuart, R.J. Owens, S. Daenke, A. Schutz, U. Heinemann, S. to express cTIM in high yield. This test was also done for the pHLIC- Yokoyama, K. Bussow, K.C. Gunsalus, Protein production and purification, Nat. Methods 5 (2008) 135–146. MBP vector. The protein was purified by affinity chromatography [6] C. Scheich, D. Kummel, D. Soumailakakis, U. Heinemann, K. Bussow, Vectors using Ni–NTA resin (Fig. 5). Over 5 mg of protein was purified from for co-expression of an unrestricted number of proteins, Nucleic Acids Res. 35 100 mL culture, corresponding to a yield of over 50 mg L1. After (2007) e43. [7] C.Y. Cormier, S.E. Mohr, D. Zuo, Y. Hu, A. Rolfs, J. Kramer, E. Taycher, F. Kelley, purification, the 6His-tag was cleaved from the protein using M. Fiacco, G. Turnbull, J. LaBaer, Protein structure initiative material TEV protease. The first amino acid of cTIM is Ala, which is a small repository: an open shared public resource of structural genomics plasmids amino acid known to be acceptable at the P10 site of the TEV pro- for the biological community, Nucleic Acids Res. 38 (2010) D743–D749. tease cleavage site. Hence, after TEV cleavage we obtain the native [8] P.G. Blommel, P.A. Martin, R.L. Wrobel, E. SteVen, B.G. Fox, High effciency single step production of expression plasmids from cDNA clones using the cTIM protein without any extra N-terminal residues. This protein Flexi Vector cloning system, Protein Expr. Purif. 47 (2006) 562–570. was also expressed and purified as an MBP-fusion (Fig. 5) using [9] M.Z. Li, S.J. Elledge, Harnessing homologous recombination in vitro to the pHLIC-MBP vector. Since the MBP gene in this vector also has generate recombinant DNA via SLIC, Nat. Methods 4 (2007) 251–256. [10] A.J.M. Walhout, G.F. Temple, M.A. Brasch, J.L. Hartley, M.A. Lorson, S. van den an N-terminal 6His-tag encoded in it, the same method of purifi- Heuvel, M. Vidal, GATEWAY recombinational cloning: Application to the cation as above could be used to purify it. After purification, the cloning of large numbers of open reading frames or ORFeomes, Applications 6His-MBP tag was cleaved using TEV protease leaving behind of Chimeric Genes and Hybrid Proteins, Pt C, 2000, pp. 575–592. [11] L. Stols, M.Y. Gu, L. Dieckman, R. Raffen, F.R. Collart, M.I. Donnelly, A new the native cTIM protein as was the case with the 6His-tagged vector for high-throughput, ligation-independent cloning encoding a tobacco construct. Mutants of TIM, superoxide dismutase, fructose etch virus protease cleavage site, Protein Expr. Purif. 25 (2002) 8–15. repressor DNA binding domain and paraoxonase-1 have been [12] P.K. Chanda, W.A. Edris, J.D. Kennedy, A set of ligation-independent expression vectors for co-expression of proteins in Escherichia coli, Protein successfully cloned into pHLIC in our lab using both ligation-inde- Expr. Purif. 47 (2006) 217–224. pendent and ligation-based cloning. [13] B. de las Rivas, J.A. Curiel, J.M. Mancheno, R. Munoz, Expression vectors for enzyme restriction- and ligation-independent cloning for producing recombinant his-fusion proteins, Biotechnol. Prog. 23 (2007) 680–686. Conclusion [14] R.B. Kapust, D.S. Waugh, Escherichia coli maltose-binding protein is uncommonly effective at promoting the solubility of polypeptides to which it is fused, Protein Sci. 8 (1999) 1668–1674. Recombinant protein expression, particularly for genome-scale [15] C. Guan, P. Li, P.D. Riggs, H. Inouye, Vectors that facilitate the expression and approaches and synthetic biology, benefits greatly from recent purification of foreign peptides in Escherichia coli by fusion to maltose- and emerging plasmid vectors that enable simple cloning and binding protein, Gene 67 (1988) 21–30. [16] D.B. Smith, K.S. Johnson, Single-step purification of polypeptides expressed in traceless tagging. Not only do ligation-independent methods save Escherichia coli as fusions with glutathione S-transferase, Gene 67 (1988) 31– time and reduce problems, they also facilitate getting the exact 40. 16 V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17

[17] K.L. Guan, J.E. Dixon, Eukaryotic protein expression in Escherichia coli: An [49] D. Nakajima, K. Saito, H. Yamakawa, R.F. Kikuno, M. Nakayama, R. Ohara, N. improved thrombin cleavage and purification procedure of fusion proteins Okazaki, H. Koga, T. Nagase, O. Ohara, Preparation of a set of expression-ready with glutathione Stransferase, Anal. Biochem. 192 (1991) 262–267. clones of mammalian long cDNAs encoding large proteins by the ORF trap [18] R.E. Stofko-Hahn, D.W. Carr, J.D. Scott, A single step purification for cloning method, DNA Res. 12 (2005) 257–267. recombinant proteins Characterization of a microtubule associated protein [50] C. Aslanidis, P.J. de Jong, Ligation-independent cloning of PCR products (LIC– (MAP 2) fragment which associates with the type II cAMP-dependent protein PCR), Nucleic Acids Res. 18 (1990) 6069–6074. kinase, FEBS Lett. 302 (1992) 274–278. [51] C. Aslanidis, P.J. de Jong, G. Schmitz, Minimal length requirement of the [19] D.L. Wyborski, J.C. Bauer, C.F. Zheng, K. Felts, P. Vaillancourt, An Escherichia single-stranded tails for ligation-independent cloning (LIC) of PCR products, coli that allows recovery of proteins with native N-termini Genome Res. 4 (1994) 172–177. from purified calmodulin-binding peptide fusions, Protein Expr. Purif. 16 [52] Y. Zhang, U. Werling, W. Edelmann, SLiCE: a novel bacterial cell extract-based (1999) 1–10. DNA cloning method, Nucleic Acids Res. 40 (2012) e55. [20] E.R. Lavallie, E.A. Diblasio, S. Kovacic, K.L. Grant, P.F. Schendel, J.M. McCoy, A [53] D.G. Gibson, L. Young, R.-Y. Chuang, J.C. Venter, C.A. Hutchison III, H.O. Smith, thioredoxin gene fusion expression system that circumvents inclusion body Enzymatic assembly of DNA molecules up to several hundred kilobases, Nat. formation in the E. coli cytoplasm, Bio-Technology 11 (1993) 187–193. Methods 6 (2009) U341–U343. [21] G. del Solar, R. Giraldo, M.J. Ruiz-Echevarría, M. Espinosa, R. Díaz-Orejas, [54] J. Quan, J. Tian, Circular polymerase extension cloning of complex gene Replication and control of circular bacterial plasmids, Microbiol. Mol. Biol. libraries and pathways, PLoS ONE 4 (2009) e6441. Rev. 62 (1998) 434–464. [55] S. Shuman, Novel approach to molecular cloning and polynucleotide [22] N.H. Tolia, L. Joshua-Tor, Strategies for protein coexpression in Escherichia synthesis using vaccinia DNA topoisomerase, J. Biol. Chem. 269 (1994) coli, Nat. Methods 3 (2006) 55–64. 32678–32684. [23] K. Nishihara, M. Kanemori, M. Kitagawa, H. Yanagi, T. Yura, Chaperone [56] P. Bernard, P. Gabarit, E.M. Bahassi, M. Couturier, Positive-selection vectors coexpression plasmids: differential and synergistic roles of DnaK-DnaJ-GrpE using the F plasmid ccdB killer gene, Gene 148 (1994) 71–74. and GroEL-GroES in Assisting folding of an allergen of Japanese cedar pollen, [57] P. Bernard, M. Couturier, Cell killing by the F plasmid CcdB protein involves Cryj2, in Escherichia coli, Appl. Environ. Microbiol. 64 (1998) 1694–1699. poisoning of DNA–topoisomerase II complexes, J. Mol. Biol. 226 (1992) 735–745. [24] R. Levy, R. Weiss, G. Chen, B.L. Iverson, G. Georgiou, Production of correctly [58] P. Bernard, K.E. Kézdy, L. Van Melderen, J. Steyaert, L. Wyns, M.L. Pato, P.N. folded Fab fragment in the cytoplasm of Escherichia coli trxB gor Higgins, M. Couturier, The F plasmid CcdB protein induces efficient ATP- mutants via the coexpression of molecular chaperones, Protein Expr. Purif. 23 dependent DNA cleavage by Gyrase, J. Mol. Biol. 234 (1993) 534–541. (2001) 338–347. [59] G. Hannig, S.C. Makrides, Strategies for optimizing heterologous protein [25] G. Georgiou, P. Valax, Expression of correctly folded proteins in Escherichia expression in Escherichia coli, Trends Biotechnol. 16 (1998) 54–60. coli, Curr. Opin. Biotechnol. 7 (1996) 190–197. [60] K. Terpe, Overview of bacterial expression systems for heterologous protein [26] N. Kholod, T. Mustelin, Novel vectors for co-expression of two proteins in production: from molecular and biochemical fundamentals to commercial E. coli, Biotechniques 31 (2001) 322–323. 326-328. systems, Appl. Microbiol. Biotechnol. 72 (2006) 211–222. [27] M. Wakamori, T. Umehara, S. Yokoyama, A series of bacterial co-expression [61] D.A. Melton, P.A. Krieg, M.R. Rebagliati, T. Maniatis, K. Zinn, M.R. Green, vectors with rare-cutter recognition sequences, Protein Expr. Purif. 74 (2010) Efficient in vitro synthesis of biologically active RNA and RNA hybridization 88–98. probes from plasmids containing a bacteriophage SP6 promoter, Nucleic [28] J. Zeng, L. Zhang, Y. Li, Y. Wang, M. Wang, X. Duan, Z.-G. He, Over-producing Acids Res. 12 (1984) 7035–7056. soluble protein complex and validating protein–protein interaction through a [62] T. Sawasaki, T. Ogasawara, R. Morishita, Y. Endo, A cell-free protein synthesis new bacterial co-expression system, Protein Expr. Purif. 69 (2010) 47–53. system for high-throughput proteomics, Proc. Natl. Acad. Sci. USA 99 (2002) [29] D.J. Donoghue, P.A. Sharp, Replication of colicin E1 plasmid DNA in vivo 14652–14657. requires no plasmid-encoded proteins, J. Bacteriol. 133 (1978) 1287–1294. [63] R. Gentz, H. Bujard, Promoters recognized by Escherichia coli RNA polymerase [30] F. Bolivar, R.L. Rodriguez, P.J. Greene, M.C. Betlach, H.L. Heyneker, H.W. Boyer, selected by function: highly efficient promoters from bacteriophage T5, J. J.H. Crosa, S. Falkow, Construction and characterization of new cloning Bacteriol. 164 (1985) 70–77. vehicles.2. Multipurpose cloning system, Gene 2 (1977) 95–113. [64] S. Inouye, M. Inouye, Up-promoter mutations in the lpp gene of Escherichia [31] N. Watson, A new revision of the sequence of plasmid pBR322, Gene 70 coli, Nucleic Acids Res. 13 (1985) 3101–3110. (1988) 399–403. [65] L.M. Guzman, D. Belin, M.J. Carson, J. Beckwith, Tight regulation, modulation, [32] C. Yanischperron, J. Vieira, J. Messing, Improved M13 phage cloning vectors and high-level expression by vectors containing the arabinose PBAD and host strains - nucleotide sequences of the M13mp18 and pUC19 vectors, promoter, J. Bacteriol. 177 (1995) 4121–4130. Gene 33 (1985) 103–119. [66] J. Brosius, M. Erfle, J. Storella, Spacing of the 10 and 35 regions in the tac [33] J. Messing, New M13 vectors for cloning, Methods Enzymol. 101 (1983) 20– promoter. Effect on its in vivo activity, J. Biol. Chem. 260 (1985) 3539–3541. 78. [67] J.A. Vasina, F. Baneyx, Recombinant protein expression at low temperatures [34] T. Hashimotogotoh, K.N. Timmis, Incompatibility properties of Col E1 and under the transcriptional control of the major Escherichia coli cold shock pMB1 derivative plasmids: Random replication of multicopy replicons, Cell promoter cspA, Appl. Environ. Microbiol. 62 (1996) 1444–1447. 23 (1981) 229–238. [68] F.W. Studier, B.A. Moffatt, Use of bacteriophage T7 RNA polymerase to direct [35] J. Davison, Mechanism of control of DNA replication and incompatibility in selective high-level expression of cloned genes, J. Mol. Biol. 189 (1986) 113–130. ColE1-type plasmids – a review, Gene 28 (1984) 1–15. [69] S. Tabor, C.C. Richardson, A bacteriophage T7 RNA polymerase/promoter [36] Y. Nakano, Y. Yoshida, Y. Yamashita, T. Koga, Construction of a series of pACYC system for controlled exclusive expression of specific genes, Proc. Natl. Acad. derived plasmid vectors, Gene 162 (1995) 157–158. Sci. USA 82 (1985) 1074–1078. [37] B. Bartolome, Y. Jubete, E. Martinez, F. Delacruz, Construction and properties [70] J.W. Dubendorf, F.W. Studier, Controlling basal expression in an inducible T7 of a family of pACYC184 derived cloning vectors compatible with pBR322 and expression system by blocking the target T7 promoter with lac repressor, J. its derivatives, Gene 102 (1991) 75–78. Mol. Biol. 219 (1991) 45–59. [38] G. Selzer, T. Som, T. Itoh, J. Tomizawa, The origin of replication of plasmid [71] S.F. William, Use of bacteriophage T7 lysozyme to improve an inducible T7 p15A and comparative studies on the nucleotide sequences around the origin expression system, J. Mol. Biol. 219 (1991) 37–44. of related plasmids, Cell 32 (1983) 119–129. [72] G. Qing, L.-C. Ma, A. Khorchid, G.V.T. Swapna, T.K. Mal, M.M. Takayama, B. Xia, [39] S.N. Cohen, A.C.Y. Chang, Revised interpretation of the origin of the psc101 S. Phadtare, H. Ke, T. Acton, G.T. Montelione, M. Ikura, M. Inouye, Cold-shock plasmid, J. Bacteriol. 132 (1977) 734–737. induced high-yield protein production in Escherichia coli, Nat. Biotechnol. 22 [40] A.R. Stuitje, E. Veltkamp, H.J.J. Nijkamp, P.J. Weijers, Origin and direction of (2004) 877–882. replication of the bacteriocinogenic plasmid Clo DF13, Nucleic Acids Res. 6 [73] G. Georgiou, L. Segatori, Preparative expression of secreted proteins in (1979) 71–80. bacteria: status report and future prospects, Curr. Opin. Biotechnol. 16 (2005) [41] E. Veltkamp, A.R. Stuitje, Replication and structure of the bacteriocinogenic 538–545. plasmids Clo DF13 and Col E1, Plasmid 5 (1981) 76–99. [74] J.J. Lichty, J.L. Malecki, H.D. Agnew, D.J. Michelson-Horowitz, S. Tan, [42] J. Morlon, D. Sherratt, C. Lazdunski, Identification of functional regions of the Comparison of affinity tags for protein purification, Protein Expr. Purif. 41 colicinogenic plasmid ColA, Mol. Gen. Genet. MGG 211 (1988) 223–230. (2005) 98–105. [43] V.V. Zverev, I.A. Khmel, The nucleotide sequences of the replication origins of [75] T.R. Butt, S.C. Edavettal, J.P. Hall, M.R. Mattern, SUMO fusion technology for plasmids ColA and ColD, Plasmid 14 (1985) 192–199. difficult-to-express proteins, Protein Expr. Purif. 43 (2005) 1–9. [44] T. Som, J.-I. Tomizawa, Origin of replication of Escherichia coli plasmid RSF [76] M.P. Malakhov, M.R. Mattern, O.A. Malakhova, M. Drinker, S.D. Weeks, T.R. 1030, Mol. Gen. Genet. MGG 187 (1982) 375–383. Butt, SUMO fusions and SUMO-specific protease for efficient expression and [45] H. Zhang, H. Wu, H. Zhang, A novel high-copy plasmid, pEC, compatible with purification of proteins, J. Struct. Funct. Genomics 5 (2004) 75–86. commonly used Escherichia coli cloning and expression vectors, Biotechnol. [77] A.-M. Catanzariti, T.A. Soboleva, D.A. Jans, P.G. Board, R.T. Baker, An efficient Lett. 29 (2007) 431–437. system for high-level expression and easy purification of authentic [46] K. Butler, A.R. English, V.A. Ray, A.E. Timreck, Carbenicillin: chemistry and recombinant proteins, Protein Sci. 13 (2004) 1331–1339. mode of action, J. Infect. Dis. 122 (1970) S1–S8. [78] R.F. Ohana, L.P. Encell, K. Zhao, D. Simpson, M.R. Slater, M. Urh, K.V. Wood, [47] J.P.P. Muyrers, Y. Zhang, A.F. Stewart, Techniques: recombinogenic HaloTag7: A genetically engineered tag that enhances bacterial expression of engineering new options for cloning and manipulating DNA, Trends soluble proteins and improves protein purification, Protein Expr. Purif. 68 Biochem. Sci. 26 (2001) 325–331. (2009) 110–120. [48] C.S. Shashikant, J.L. Carr, J. Bhargava, K.L. Bentley, F.H. Ruddle, [79] W.G. Dougherty, T.D. Parks, S.M. Cary, J.F. Bazan, R.J. Fletterick, Recombinogenic targeting: a new approach to genomic analysis – a review, Characterization of the catalytic residues of the tobacco etch virus 49-kDa Gene 223 (1998) 9–20. proteinase, Virology 172 (1989) 302–310. V. Durani et al. / Protein Expression and Purification 85 (2012) 9–17 17

[80] J.C. Carrington, W.G. Dougherty, A viral cleavage site cassette: Identification [93] K. Nagai, H.C. Thogersen, Generation of beta-globin by sequence specific of amino acid sequences required for tobacco etch virus polyprotein proteolysis of a hybrid protein produced in Escherichia coli, Nature 309 (1984) processing, Proc. Natl. Acad. Sci. USA 85 (1988) 3391–3395. 810–812. [81] W.G. Dougherty, J.C. Carrington, S.M. Cary, T.D. Parks, Biochemical and [94] L. Quinn, Seamless cloning and gene fusion, Trends Biotechnol. 23 (2005) mutational analysis of a plant virus polyprotein cleavage site, EMBO J. 7 199–207. (1988) 1281–1287. [95] M.I. Donnelly, M. Zhou, C.S. Millard, S. Clancy, L. Stols, W.H. Eschenfeldt, F.R. [82] M.G. Cordingley, P.L. Callahan, V.V. Sardana, V.M. Garsky, R.J. Colonno, Collart, A. Joachimiak, An expression vector tailored for large-scale, high- Substrate requirements of human rhinovirus 3C protease for peptide cleavage throughput purification of recombinant proteins, Protein Expr. Purif. 47 in vitro, J. Biol. Chem. 265 (1990) 9062–9065. (2006) 446–454. [83] S.I. Choi, H.W. Song, J.W. Moon, B.L. Seong, Recombinant enterokinase light [96] D.A. Vinarov, B.L. Lytle, F.C. Peterson, E.M. Tyler, B.F. Volkman, J.L. Markley, chain with affinity tag: Expression from Saccharomyces cerevisiae and its Cell-free protein production and labeling protocol for NMR-based structural utilities in fusion protein technology, Biotechnol. Bioeng. 75 (2001) 718–724. proteomics, Nat. Methods 1 (2004) 149–153. [84] T. Hosfield, Q. Lu, Influence of the amino acid residue downstream of [97] W. Eschenfeldt, S. Lucy, C.S. Millard, A. Joachimiak, D. Mark, A family of LIC (Asp)(4)Lys on enterokinase cleavage of a fusion protein, Anal. Biochem. 269 vectors for high-throughput cloning and purification of proteins, Methods (1999) 10–16. Mol. Biol. (Clifton, N.J.) 498 (2009) 105–115. [85] O.W. Liew, J.P.C. Chong, T.G. Yandle, S.O. Brennan, Preparation of recombinant [98] B.J. Sullivan, V. Durani, T.J. Magliery, Triosephosphate isomerase by consensus thioredoxin fused N-terminal proCNP: analysis of enterokinase cleavage design: dramatic differences in physical properties and activity of related products reveals new enterokinase cleavage sites, Protein Expr. Purif. 41 variants, J. Mol. Biol. 413 (2011) 195–208. (2005) 332–340. [99] A.H. Rosenberg, B.N. Lade, C. Dao-shan, S.W. Lin, J.J. Dunn, F. Studier, Vectors [86] D.B. Smith, K.S. Johnson, Single-step purification of polypeptides expressed in for selective expression of cloned DNAs by T7 RNA polymerase, Gene 56 Escherechia coli as fusions with glutathione S-transferase, Gene 67 (1988) 31– (1987) 125–135. 40. [100] J.A. Curiel, B. de las Rivas, J.M. Mancheño, R. Muñoz, The pURI family of [87] K.R.K. Sticha, C.A. Sieg, C.P. Bergstrom, P.E. Hanna, C.R. Wagner, expression vectors: A versatile set of ligation independent cloning plasmids Overexpression and large-scale purification of recombinant hamster for producing recombinant His-fusion proteins, Protein Expr. Purif. 76 (2011) polymorphic arylamine N-acetyltransferase as a dihydrofolate reductase 44–53. fusion protein, Protein Expr. Purif. 10 (1997) 141–153. [101] A.C. Chang, S.N. Cohen, Construction and characterization of amplifiable [88] U.C. Vothknecht, C.G. Kannangara, D. von Wettstein, Expression of multicopy DNA cloning vehicles derived from the P15A cryptic miniplasmid, catalytically active barley glutamyl tRNA(Glu) reductase in Escherichia coli J. Bacteriol. 134 (1978) 1141–1156. as a fusion protein with glutathione S-transferase, Proc. Natl. Acad. Sci. USA [102] L. Stols, M. Zhou, W.H. Eschenfeldt, C.S. Millard, J. Abdullah, F.R. Collart, Y. 93 (1996) 9287–9291. Kim, M.I. Donnelly, New vectors for co-expression of proteins: structure of [89] R.C. Stevens, Design of high-throughput methods of protein production for Bacillus subtilis ScoAB obtained by high-throughput protocols, Protein Expr. structural biology, Structure 8 (2000) R177–R185. Purif. 53 (2007) 396–403. [90] R.J. Jenny, K.G. Mann, R.L. Lundblad, A critical review of the methods for [103] N. Lee, C. Francklyn, E.P. Hamilton, Arabinose-induced binding of AraC cleavage of fusion proteins with thrombin and factor Xa, Protein Expr. Purif. protein to araI2 activates the araBAD operon promoter, Proc. Natl. Acad. Sci. 31 (2003) 1–11. USA 84 (1987) 8814–8818. [91] L.J. Lucast, R.T. Batey, J.A. Doudna, Large-scale purification of a stable form [104] C. Kang, C.-W. Wu, Studies on SP6 promoter using a new plasmid vector that of recombinant tobacco etch virus protease, Biotechniques 30 (2001) 544– allow gene insertion at the transcription initiation site, Nucleic Acids Res. 15 550. (1987) 2279–2294. [92] R.B. Kapust, J. Tozser, T.D. Copeland, D.S. Waugh, The P1 specificity of tobacco [105] J.E. Cronan, Biotination of proteins in vivo. A post-translational modification etch virus protease, Biochem. Biophys. Res. Commun. 294 (2002) 949–955. to label, purify, and study proteins, J. Biol. Chem. 265 (1990) 10327–10333.