Ryecellwall βββglucosidase:subcloning,expressionand purificationofrecombinantproteinfrom E.coli NicolasRochereau FrançoisRabelaisUniversity,Tours,France January2007–June2007 SödertörnUniversityCollege,Huddinge,Sweden

Examiner:AssocProfMagnusJohansson Supervisors: MSc Therese Gradin, Assoc Prof Gabriele Delp and Prof Lisbeth Jonsson,Dept.ofLifeSciences,SödertörnUniversityCollege.

1 Abstract Severalplantdefensesystemsconsistofenzymesthatactonglucosidesandproduce atoxiccompound.Intheintactplanttissuethesubstrateandenzymearekeptapart. The system studied here consists of the substrate 2OβDglucopyranosyl4 dihydroxy1,4benzoxazin3one and the enzyme glucan 1,3βglucosidase in rye. Theaimwastodeterminethepropertiesofacellwall βglucosidase. Twodifferentsystemsforexpressionandpurificationof βglucosidasefusedtoatag wereused:a6xHistidinetagsystemandathioredoxintagsystem. Thesequenceof the βglucosidase had previously been determined so now the gene was subcloned into E.coli . A direct PCR on colonies, a test expression, a restriction digestion of and sequencing was made to analyze the transformation, which all turned outsuccessful.Thenthe βglucosidasesolubilitywasdetermined.Finallyapurification of the βglucosidase from E.coli under native conditions and a pNPG assay was carriedout. Forthe(His) 6taggedprotein, therecombinantβglucosidasetendedtoendupinthe insolublepelletedfractionwhichindicatedformationofinclusionbodies. The cell wall 1,3βglucosidase was soluble with the thioredoxin system, but the percentage of soluble protein fraction was around 5% only of the total protein. In eluatesfromanickelnitrilotriaceticacidcolumnthepresenceofrecombinantprotein wasconfirmedwithWesternblot,butcontaminatingbandswerealsopresent.Purified elautedfractionsdidnotexhibitdetectable βglucosidaseactivity.Itwasnotpossible to purify active enzyme. From a BLAST search it was clear that the most similar enzymes all had putative glycosylation sites and lack of glycosylation could be a reasonfortheproteinnottofoldproperly.

2 Introduction Since plants cannot move they have come up with different ways of defending themselves,otherthanrunningaway.Secondarymetabolitesareofteninvolvedinthe chemicaldefensemechanismsofplants. Plantsmakealotofsecondarymetabolites. Many of these are highly dangerous to various attackers and are often stored in specificvesiclesorinthevacuole(Poulton1990).Hydroxamicacids(Hx),whichare the most considerably studied secondary metabolites, play a role in the defence of cereals, in the detoxification of herbicides and in allelopathic effects of the crops (Niemeyer 1988). Several systems consist of enzymes that act on glucosides and produce a toxic compound. The system studied here consists of the substrate DIBOAGlc (2OβDglucopyranosyl4dihydroxy1,4benzoxazin3one) which is an Hxandtheenzymeglucan1,3βglucosidase. Thesystemispresentinrye( Secalecereale ),maize( Zeamays )andwheat( Triticum aestivum ).Intheintactplanttissuethesubstrateandenzymearekeptapart. The β glucosidasecomestogetherwiththesubstrate(DIBOAGlc)whenthetissueisbroken, forexampleataninsectattack.The βglucosidaseactsontheglucosidesandbreaks the glycosidic bond. Then DIBOA (2,4dihydroxyl,4benzoxazin3one) is formed whichspontaneouslyproducesthetoxiccompoundBOA(2benzoxazolinone).

Figure 1. The reaction catalysed by rye βglucosidase and the subsequent transformationofDIBOAtothemoretoxicbenzoxalinone,BOA. Alotofthingsareknownaboutthe βglucosidases.Forexample βglucosidasescut glycosidiclinkages,theyhavegotanacidicpHoptima(pH56),theyareubiquitousin the living world (mammal, plant, insect, fungus, and bacterium) (Esen 1993). β glucosidases have been implicated in numerous different plant functions, such as growth regulation by the release of active plant hormones (Dietz el al. 1999), lignificationandplantdefensebythereleaseofplanttoxins(Dharmawardhanaetal. 1999). Thisworkwillfocusona βglucosidaseinrye.Atthesubcellularlevel, βglucosidases werefoundbothinthecellwallandintheplastid(Nikusetal.2001).Thesequenceof the plastidic βglucosidase was determined, the gene was cloned, purified and the biochemichal properties of the enzyme was defined. The native cell wall 1,3β glucosidasewaspartlypurifiedandhadanapparentmolecularweightofabout6869 kDa (Nikus and Jonsson 2003).Thecellwall βglucosidasebelongstothefamily3, whereastheplastidic βglucosidasebelongstofamily1(HenrissatandDavies1997).

3 Theaimofthisprojectistotrytoconfirmtheresultsconcerningthepropertiesofthe partly purified cell wall βglucosidase by and purification of recombinant protein.Therecombinantproteinwillbecharacterizedwithregardtoitsmolecularand kineticproperties. Based on the peptide sequences of the isolated protein primers could be designed and a fulllengthcDNAclonewasisolatedusingGeneRacerKit(Invitrogen,Paisley, UK)(F.Michoux,personalcommunication).Thegene,referredtoas“CWR”(CellWall Recombinant), was cloned into pCR ®4TOPO ®, referred to as “D4”. The construct was sequenced and the result was submitted to GenBank (Accession Number AY586531). In a previous attempt to subclone and then purify the recombinantproteinusingthe IMPACT(InteinMediatedPurificationwithanAffinityChitinbindingTag)system(New EnglandBiolabs,BeverlyMA,USA)theproteinendedupininclusionbodiesandwas notsoluble.Thepurificationfailed(Roos2006). Materialsandmethods Twoapproachesweremade,(His) 6 tag(Qiagen,Solna,Sweden)andThioredoxintag (Invitrogen). The (His) 6affinity tag facilitates bindingtonickelnitrilotriaceticacid(Ni NTA).Indeed,(His) 6taggedproteinareboundtometalchelatingsurfaces:NiNTAor cobalt resins. So the protein could be purified with a Nicolumn or Cocolumn. Thioredoxin is able to accumulate up to 40% of the total cellular protein and still remainsoluble.Thethioredoxinproteinhasbeenmutatedtocontainametalbinding domain. The mutated histidines interact with a native histidine to form a patch. His Patchthioredoxinproteinscantereforebepurifiedonmetalchelatingresins. SubcloningoftheCWRgene CWR -Qifull.R  5´ -AAT TAG AGT GGT ACC CTA CTT GTT GGC CTC GGT GGT- 3´ CWR-QiMat.F  5´-GAT CAT CTC AAG TAC AAG GAT CCG AAG CAG CCT ATT- 3´ GF1  5´-CCT TAC GCT TTT GCT CCG TGT- 3´ GR1  5´-GCT GTC AGG ATT CTC TGA GA- 3´ CWRpET102M.F  5´-CAC CGA TCA TCT CAA GTA CAA GGA TC- 3´ CWRpET102.R  5´-CTA CTT GTT GGC CTC GGT GGT GAG- 3´ Table1showingallprimersusedduringcloningandsequencing.

4 Thetargetgene,CWR,wasamplifiedfromthesequencingvector,D4,byPCR(98°C 30 sec,(98°C10sec,71°C30sec,72°C1min)x31,72°C5min)usingthePhusion HotStartHighFidelityDNAPolymerasekit(Finnzymes,Espoo,Finland).Theprimers usedforthe(His) 6systemwereCWRQiMat.F,whichcontainedtherestrictionsitefor restriction enzyme Kpn I to facilitate directional cloning, and CWRQiFull.R. The primersusedforthethioredoxinsystemwereCWRpET102M.FandCWRpET102.R. ThePCRreactioncontained:Sterilewater12,4L,buffer4L,dNTP0,4L,template (D4) 1 L, primers 1 L, polymerase 0,2L. Both the HFandGCPCRbufferwere tested.ThePCRproductforCWRQiMat.FandCWRQiFull.Rwascleanedonaspin column using GeneClean (Qbiogene, Irvine, United States). The PCRproduct for CWRpET102M.F and CWRpET102.R was run on a 1% agarose gel and cleaned usingGeneClean(Qbiogene). ThePCRproductfortheHis 6systemwasdigestedwithKpnI(NewEnglandBiolabs, Ipswich, England). The pQE1 expression vector (Qiagen) was double digested with Kpn I and Pvu II that produces a blunt end. The PCRproduct and vector were gel cleanedandpurifiedwithGeneClean(Qbiogene). TherestrictiondigestedpQE1vectorwascombinedwitha3molarexcessofinsert usingQuickLigationKit(NewEnglandBiolabs). The PCRproduct for the thioredoxin system was ligated into the pET102D/TOPO vectorusingcloningprotocolaccordingtothesuppliedmanual(Invitrogen). Transformation Forthe(His) 6tagsystem, E.coli strainM15[pREP4]werepreparedaccordingtothe QIAexpressionistmanual(Qiagen). AnaliquotofM15[pREP4]cellsweremixedcarefullywith10Lofligationmixand kept on ice for 20 min, transferred to a 42°C heating block for 90 s. Psi broth was addedtothemixandafterincubationfor90minutesat37°C.100and200Laliquots wereplatedoutonLuriaBertani(LB)agarplatescontaining25g/mlkanamycinand 100 g/ml ampicillin. The plates were incubated at 37°C overnight. Positive and negative controlplateswereprepared.Thepositivecontrolwasmadewithacontrol includedinthekit,andthenegativecontrolwasmadewithoutplasmid. Forthethioredoxinsystem, E.coli strainBL21Star TM (DE3)OneShot ®wereprepared according to supplied manual (Invitrogen). An aliquot of BL21 cells were mixed carefully with 2 L of ligation mix and kept on ice for 30 min, transferred to a 42°C heatingblockfor30s.S.O.Cmediumwasaddedtothemixandafterincubationfor30 minutes at 37°C. 100 and 200 L aliquots were plated out on LBagar plates containing100g/mlampicillin.Theplateswereincubatedat37°Covernight. DirectPCRoncolonies Five colonies, a1a5 for the (his) 6system and 15 for the thioredoxin system respectively, were picked from the LBplate, diluted in 200 L H 2O and boiled 5 minutes.5LwereusedinthePCRreaction.

5 These 2x5 colonies were analyzed by PCR with CWRQifull.R (10 M), and CWR QiMat(10M).ForthethioredoxinsystemGF1(10M)andGR1(10M)wereused. PCR reaction (94°C 30sec, (94°C 30sec, 55°C 30sec, 72°C 1min) x 30) was performedusingTaqPolymerase(Fermentas,Burlington,Canada).ThePCRreaction contained:Sterilewater8,4L,10XPCRbufferKCl(with2mMMgCl 2)4L,dNTP 0,4L,colony(diluteandboil)5L,primers1L. Testexpression LBmedia(25g/mlkanamicynand100g/mlampicillin)wasinoculatedwiththea1, a2, a3, a5 colonies. At OD 600 ~0,6 the cultures was split in two and one half was induced with 0,5 mM IPTG. The other one was kept as control. All cultures were incubatedfor4hoursat37°C.1mLofbothcultureswerespundownfor15minutesat 5000 rpm. The cellpellet was boiled directly in 200 L 1xSDS Sample Buffer for 4 minutes. The mix was centrifuged for 15 minutes at 10000 rpm. Finally 20 L was mixed with 4xSDS Sample buffer (Tris/HCl pH 6, 252 mM, glycerol 40%, SDS 8%, bromophenol blue 0,04%) and loaded onto a 420% TrisGlycine Duramide gel (Cambrex,Rockland,USA). Restrictiondigestionofplasmids ExtractionofpQE1withCWRinsertwasperformedusingPlasmidmidikit (Qiagen). KpnI and EcoRI (New England Biolabs) were used for the restriction digestion. The expectedsizesoffragmentswere3381bp,1398bp,and469bp. Sequencing The plasmids pQE1CWRa1 and pET102D/TOPOCWRa were sequenced (Cybergene). Expression Determinationofβglucosidaseproteinsolubility LB media (25 g/ml kanamycin and 100 g/ml ampicillin) was inoculated with an overnightcultureofcolonya1forthe(His) 6systemand1forthethioredoxinsystem. AtOD 600 of~0,6theculturewasinducedwith1mMIPTG.A1mLsample(uninduced control) was removed before induction, pelleted and resuspended in 50 uL 1xSDS Sample Buffer. The culture was incubated for 4 hours at 37°C. A 1 mL sample (induction control) was removed after induction, pelleted and resuspended in 50 L 1xSDS Sample Buffer. The culture was centrifuged at 4000xg for 20 minutes, resuspendedin5mLlysisbuffer(NaH 2PO 4 50mM,NaCl300mM,imidazole10mM, PMSF 20 M, pH 8). Cells were lysed by repeated freezethawing in liquid nitrogen and cold water. 5 L respectively of the soluble protein fraction and the insoluble proteinfractionwasmixedwith1xSDSSamplebuffer.Allsampleswereloadedontoa 420%TrisGlycineDuramidegel(Cambrex).Thelysatewascentrifugedat10000xg

6 at4°Cfor30minutes.Thesupernatantwasdecanted(solubleprotein),andthepellet was resuspended in 400 L 1xSDS Sample Buffer (insoluble protein). Other media, incubationtemperatures,lengthsofinductionandconcentrationsofIPTGweretested; 2xYeastTryptone(2xYT)andTerrificBroth(TB)wereothermediatested,25°Cand 30°Cforincubation,2,5hoursofinduction,0,5mMofIPTGconcentration. Westernblot A Western blot was performed with samples from the solubility test of the pET102 D/TOPOCWR1.APVDFmembrane(AmershamBiosciences,Amershamplace,UK) wasputin100%MeOHfor1minute,MilliqH 2Ofor5minutesandintransferbuffer (96mMGlycine,12mMTrisBase,10%MeOH)for5minutes.Thefilterpapersand the SDSPAGE gel were soaked in transfer buffer for a few minutes. Three filter papers, the PVDFmembrane, the gel, and another three filter papers on top were placedintheSemiPhor(HoeferScientificInstruments,SanFrancisco,USA). The membrane was placed with TBST (20 mM TrisHCl, 500 mM NaCl, pH 7,5, Tween20,w/v)torinseanygelresiduesofffor2x5minutes.Themembranewasthen putinblockingsolution(25mLTBST+3%BSA,w/v)for1houratroomtemperature. ThemembranewaswashedwithTBSTfor2x5minutesandincubatedwithprimary antibody, AntiThio TM Antibody (Invitrogen) in dilution solution (25 mL TBST + 1% BSA,w/v)for1houratroomtemperature.Theprimaryantibodywassupposedtobind to the thioredoxin fusion protein. The membrane was washed with TBST for 2x5 minutesandincubatedwithsecondaryantibody,theaffinitypurifiedHRPconjugated AntimouseIgGsecondaryantibody(Sigma,Steinheim,Germany)for1houratroom temperature.AfterfinalwashstepsofTBSTfor2x5minutesandTBSfor20minutes themembranewasincubatedinHRPsubstrate(5,5mL100%MeOH+16,5mg4CN, 21mLTBS,40 L30%H 2O2)untilcolordeveloped. Largescaleexpression 1 L of LB media (100 g/ml ampicillin) was inoculated with an overnight culture of colony1forthethioredoxinsystem.AtOD 600 of~1,3theculturewasinducedwith0,5 mMIPTG.Theculturewasincubatedfor2,5hoursat37°C,andthenthecellswere harvestedbycentrifugationat3000xgat4 οCfor10min.Thecellswereresuspended in5mLoflysisbuffer.1mg/mloflysozymewasaddedandincubatedonicefor30 min. After 10 x 6 sec of sonication, DNase I was added and incubated on ice for1 hour.Thelysatewascentrifugedat10000xgfor30minat4°C,andthesupernatant wassavedforthepurification. Purificationoftheβglucosidasefrom E.coli undernativeconditions 1mLofthe50%NiNTAwasaddedwith4mLclearedlysateandmixedat4°Cfor1 hour.Themixturewasaddedontoacolumn.Firsttheflowthroughwascollected,then the column was washed twice with wash buffer (NaH 2PO 4 50 mM, NaCL 300 mM, imidazole 20 mM, pH8).Samplesfromthewashstepswerealsocollected.Finally,

7 theproteinwaseluted4timeswith0,5mLofelutionbuffer(NaH 2PO 450mM,NaCl 300mM,imidazole250mM,pH8). Characterization pNPGassay 70 L of substrate (5 mM pNPG ( pnitrophenyl glucopyranoside) in 100 mM citrate phosphate buffer) was mixed with 70 L of enzyme. The mixture was incubated at 37°Cfor15to45min.Thereactionwasstoppedwith70Lof0,4MNa 2CO 3.pNP( p nitrophenol) concentration was determined using a spectrophotometer at 405 nm. A negative control was made with H 2Oinstead ofpNPG.Apositivecontrolwasmade withpurifiednativeplastidic βglucosidaseinsteadofcellwall βglucosidase.Protein concentrationwasdeterminedusingtheBradfordmethod.5Lofsamplewasmixed with 155 L dH 2O and 40 Lof Bradford reagent. A standard curve wasmadewith definedconcentrationsofBSA(BovineSerumAlbumine).Blanksweremadewithout sampleanddH2Oinstead.Aftera10minutesincubationtheresultswereanalyzedin aBIORADModel550MicroplateReader. Bioinformatics Theaminoacidsequenceofthecellwall βglucosidasewassubmittedtotheBLAST sequencealignment program (http://www.ncbi.nlm.nih.gov/BLAST/ ). Clustal Watthe EBIserver(www.ebi.ac.uk/clustalw/ )wasusedtoalignthesequencesandtocalculate the sequence alignment scores. In order to investigate putative glycosylation the aminoacidsequencesoftheplastidicandthecellwall β− glucosidasesweresubmitted toNetNGlyc1.0Server.(http://www.cbs.dtu.dk/services/NetNGlyc/ ). Results SubcloningoftheCWRgene The CWR gene was amplified from the sequencing vector D4 using PCR. The expectedsizeofthetargetproductwas1818bp.Twodifferentbuffers,GCandHF, included in the PCRkit (Finnzymes) were tested. Figure 1 showsthe productsfrom the amplification, with different dilutions, 1x, 5x and 10x. Both buffers produced products of expected size. The same result was obtained for the thioredoxin CWR (resultsnotshown).UpscaleforproductionofmuchPCRproductwasdifficultusing theHFbuffer,thereforeGCbufferwaschosenforthesmallupscaleofvolume. Thesameresultwasobtainedforthe“thio”CWR(resultsnotshown).

8 Figure1: AgarosegelshowingPCRproductsfromtheamplificationoftheCWRgene in D4 vector. Expected size of target product is 1818 bp. PCRbuffers GC and HF, provided in the kit, were tested. The PCRproducts were diluted 5 and 10 times for easiervisualization.Ladder:MassRuler TM DNALaddermix(Fermentas). Transformationofcompetent E.coli strainsM15[pREP4]andBL21(DE3) Competent E.coli strainM15[pREP4]wastransformedwiththeligationmixofpQE1 vectorandPCRproduct.PositivecontrolwaspreparedwiththeuncutpQE1vector, andthenegativecontrolwaspreparedwithoutanyplasmid.200Land50Lofthe transformationmixwasaccomplishedwiththeligationmixofpQE1vectorandPCR product.Thefigure2showsthatforthepositivecontroltherewassomanycolonies that they were unable to be counted. The negative control did not contain any colonies.Theplateswiththetransformedhadcolonies.Therewasa~5xratio ofcolonynumberbetweentheplateswith200Land50Ltransformationmix.Soit was possible to continue with these bacteria. There were similar results for the thioredoxinsystem.Thepositivecontrolandtheplateswiththetransformedbacteria hadcolonies. Colonies Positivecontrol ~10000 Negativecontrol 0 Ligation(200l) ~4000 Ligation(50l) ~1000

Figure 2: Table showing bacteria growing on the LB agar containing kanamycin (25 g/ml) and ampicillin (100 g/ml). The uncut pQE1 vector was used as a positive control and the negative control transformation was accomplished without any

9 plasmid. Two different volumes (200 L and 50 L) of competent E.coli strain M15 transformed with the ligation mix of pQE1 vector and PCRproduct were plated. Similarresultswereobservedforthethioredoxinsystem. DirectPCRoncolonies In order to find out if the CWR insert is present in the pQE1 plasmid, 5 differents colonies were picked and used as templates in PCR. The expected size of PCR productsis1818bp.Thesameprocedurewascarriedoutforthethioredoxinsystem. The expected size of PCRproducts (CWR insert + pET102D/TOPO plasmid) was 984bp. The figures 3a and 3b show respectively colonies a1, a2, a3, a4, and a5 of pQE1CWR,andcolonies1,2,3,4and5ofpET102D/TOPO. Therewerebandsofexpectedsizesincoloniesa1,a2,a3,a5,and1,2,3,4,5. Figure 3a: Agarose gel showing if the insert, CWR, is present in the plasmid, pQE1.5differentcolonies(a1,a2,a3,a4, anda5)werepickedandusedastemplates in PCR. The expected size of PCR products is 1818 bp. There is a band of expectedsizeina1,a2,a3anda5. Ladder: MassRuler TM DNA Ladder mix (Fermentas) Figure3b: Agarosegelshowingifthe insert,CWR,ispresentintheplasmid, pET102D/TOPO.5differentscolonies (1,2,3,4,and5)werepickedand usedastemplatesinPCR.The expectedsizeofPCRproductswas 984bp.Thereisabandofexpected sizeincolony1,2,3,4and5. Apositivcontrol(D4)wasmadewith D4plasmid.Ladder:MassRuler TM DNA Ladder,mix(Fermentas) Testexpression In order to find out if transformed colonies could produce the βglucosidase, a test expression was made. Same colonies as shown in figures 3a and 3b were picked. Figure 4 shows the boiled pellet from induced E.coli strain M15, transformed with pQE1+CWRinsert,withoutIPTG(“c”)orwith0,5mMIPTG(“i”).Theexpectedsize oftheproteinwas~70kDasincethesizeoftheβglucosidaseis6869kDaandthe

10 tagis~1kDa.Thereisastrongbandforeachinducedculturedcolony,closelyto70 kDa. ForthetestexpressionofpET102D/TOPOCWR1seesolubilitytest,Figure6.There isastrongbandintheinducedculturedcolony(IC),closelyto80kDa.Thesizeofthe thioredoxintagisabout13kDa. Figure4: SDSPAGEshowinginducedculturesofcoloniesa1,a2,a3,anda5forthe (his) 6system. This figure shows the boiled pellet from induced E.coli strain M15, transformedwithpQE1+CWRinsert,withoutIPTG(“c”)orwith0,5mMIPTG(“i”). The expected size of the proteinis ~70 kDa. There is a clear band attheexpected sizeforalltheinducedcolonies. Ladderontheleft:PageRuler TM PrestainedProteinLadder(NewEnglandbiolabs) Ladderontheright:PageRuler TM ProteinLadder(NewEnglandbiolabs) RestrictiondigestionofpQE1CWRa1 In order to further confirm that the plasmid (pQE1CWRa1) contained the insert, a double digestion with EcoRI and Kpn1 was made. This figure shows the plasmid digestionofthecolonya1fromthe(His) 6system.Theexpectedsizeswere3381bp, 1398bp,and469bpandthereareclearbandsat3,4kb,1,4kbandjustbelow500 bp.

11 Figure5: Agarosegelshowingiftheplasmidhas theCWRinsert.Inthisfigure,onecanseethe restrictiondigestionofplasmidsextractedfrom colonya1withEcoRIandKpnI.Theexpectedsizes are3381bp,1398bp,and469bpandthereare clearbandsat3,4kb,1,4kbandjustbelow500bp. Ladder:MassRuler TM DNALaddermix(Fermentas) Sequencing Theresultsfromthesequencingshowedthatthegenewassuccessfullyclonedwith no mismatches. The clones pQE1CWRa1 and pET102D/TOPOCWR1 were chosen to be continuedwith. However, forthe pET102D/TOPOCWR1 there wasa ~150bppartofthegenethathadnotbeensequenced. Expression Determinationof βglucosidasesolubility The determinationofthe βglucosidasesolubilityisanimportantsteptocharacterizea protein.Sotoseeifaproteinissolubleornot,itcouldbeinducedbyIPTG,andrunon aSDSPAGEseparatingthesupernatant,containingsolubleproteins,andthepellet, containinginsolubleproteins. Thesolubilityofthetworecombinantfusionproteins,the(his) 6tagged βglucosidase andthethioredoxintagged βglucosidase,weretested.Coloniesa1and1wereused respectively.Theexpectedsizeoftheproteinwas~70kDaand~80kDarespectively. Figure 6 shows the negative control (“C”) which is an uninduced sample and the positivecontrol(“IC”)whichistheinducedsamplewith0,5mMIPTG.Thesupernatant is the lysate after lysis of the cells and the pellet is the insoluble material after centrifugation.Fora1itwasdifficulttodetectabandinthesupernatant,becauseof

12 thedilutionbutit´sclearthatthereisabigbandinthepelletedinsolublematerial.It means βglucosidase is not soluble. There was not any success with the other induction times and temperatures. Consequently, the Histagged system was not a goodwaytogetsoluble βglucosidase. One the other hand, the supernatant from the colony 1 for the thioredoxin system containedabandattheexpectedsize.Theinducedcontrolindicatesthattheamount ofinducedproteinmakesupapproximately50%ofthetotalproteins.Forthesoluble protein fraction the part of induced protein is much smaller, approximately 5%. This culturehadbeeninducedat37 οCfor2,5h.Otherinductiontimesandtemperatures didnotgiveahigheramountofsolublerecombinantprotein. Figure 6 : SDSPAGE showing the solubility of the (his) 6−tagged and the thioredoxin tagged β glucosidase, from a colony a1 and 1 respectively. The expected size of the protein was ~70 and ~80 kDa respectively. The negative control (“C”) is noninduced sample and the positive control (“IC”) is the induced samplewith0,5mMIPTGfor 4hoursforcolonya1and2,5 hoursforcolony1.Thefigure shows the supernatant and the pelleted material from induced E.coli with 0,5 mM IPTG. Ladder: PageRuler TM Protein Ladder (NewEnglandBiolabs). To confirm the results of a soluble protein fraction for the thiofusion protein the supernatant fraction from Figure 6 was subjected to Western Blot using anti thioredoxinantibodies.Theexpectedsizeoftheproteinwas~80kDa.Thefigure7 shows the supernatant from induced E.coli strain BL21, transformed with pET102D/TOPOCWR1, with 0,5 mM IPTG. The culture was induced for 2,5 h at 37 οC.Thesupernatantfromthecolony1containedabandattheexpectedsize.

13 Figure 7 : Western blot showing the βglucosidase solubility. The expected size of the protein is ~80 kDa. The positive control (“boiled pellet”) is the induced sample with 0,5 mM IPTG for 2,5 hours. This figure shows the supernatant from induced E.coli strainBL21,transformedwithpET102D/TOPO CWRinsert,with0,5mMIPTG. Ladder: PageRuler TM Prestained Protein Ladder (New Englandbiolabs). To confirm that it actually was not just a contamination from the pellet in the supernatant another Western Blot was carried out. At the same time, a timecourse wasmadetoseethatitwasagradualinduction.Theexpectedsizeoftheproteinis ~80 kDa. The figure 8 shows the supernatant and the pellet protein from induced E.coli strain BL21, transformedwith pET102D/TOPOCWR1.Theuninducedsample (“nonIC”) is the negative control and the induced samples (“boiled pellet”) are the positivecontrols.An80kDabandispresentintheinducedsampleandinthepellet. Moreover, the same band is present in the supernatant samples. As the time of inductionincreases,thesizeofthebandinthesupernatantincreases. Figure8 :Westernblot,withantithioredoxintagantibody,showingthesolubilityofthe βglucosidaseat differenttimes of induction. As the time of induction increases, the sizeofthebandinthesupernatantincreases.

14 From left to right: PageRuler TM Prestained Protein Ladder, uninduced control “c”, supernatant30minutesofinduction,supernatant60minutesofinduction,supernatant 90minutesofinduction,pelletproteinfrominduced E.coli strainBL21,boiledpellet30 minutesofinduction,boiledpellet60minutesofinduction,boiledpellet90minutesof induction. Purificationofthe βglucosidase Afterthelargescaleexpression,the βglucosidasewaspurifiedwithaNicolumn.The expected size was 80 kDa. The figures 9a and 9b showpellet protein from induced E.coli strainBL21,transformedwithpET102D/TOPOCWR1,theflowthroughwithall proteins,thetwowashingfractions,andthe4elutions(E1,E2,E3,andE4).Alotof bands were present in eluate 1 and2. There was one band inthesesamplesat80 kDa. The 4xE2 sample is an eluate from the column loaded with four times more samplethantheotherE2.Therewasan80kDabandwithaquitesamethicknessin the4xE2sample. Figure 9a: SDS PAGE showing the βglucosidase purification withNicolumn.Fromlefttoright: PageRuler TM Prestained Protein Ladder, pelleted protein from induced E.coli strainBL21,Flow through (FT), twice wash (W1 & W2), 4 elutions (E1, E2, E3 & E4). Figure 9b: Western blot, with antithioredoxin tag antibody, showing the β glucosidase purification withNicolumn. From left to right: PageRuler TM Prestained Protein Ladder, pellet protein from induced E.coli strain BL21, Flowthrough (FT), twice wash (W1 & W2), 4 elutions (E1, E2, E3 & E4), one elution with 4 timesmoreofsupernatant.

15 pNPGassay The eluates E1 and E2 from the 250 mL and 1 L culture were subjected to pNPG assay.TherewasnodetectedactivityinE1andE2samples(resultsnotshown).The activitywasalsomeasuredincrudelysatesfromuninducedandinducedbacteria.The uninduced sample showed an activity of 3 nmol/(mg, min) and the induced sample showedanactivityof9nmol/(mg,min).Thepositivecontrolshowedahighactivity. Bioinformatics Inordertoknowwhichenzymes thatweresimilartothecellwall βglucosidase,the NCBI Blast software was used. This table shows the top hits of the amino acid similarity of the cell wall βglucosidase. The sequence alignment scores come from alignmentwiththecellwall βglucosidaseusingClustalW. Nameprotein reference species score βDglucanexohydrolase AAM13694 Wheat 97 βDglucanexohydrolase AAC49170 Barley 96 isoenzymeExoII Os03g0749300 NP_001051275 Rice 86 ExoβDglucanase AAF79936 Maize 85 βDglucosidase AAQ17461 CottonSeeds 77 βDglucosidase CAA07070 IndianCress 75 βDglucanexohydrolase AAD23382 Barley 70 isoenzymeExoI Table 2 showing the most similar enzymes to the cell wall βglucosidase based on aminoacidsequences. Inordertoinvestigateputativeglycosylationtheaminoacidsequencesoftheplastidic and the cell wall β− glucosidases were submitted to NetNGlyc 1.0 Server. This predictionprogramfoundthattherewerefourputativeglycosylationsitesforthecell wall βglucosidase.Fortheplastidictheprobabilityofanyglycosylationwasverylow sincetheproteindidnotcontainasignalpeptide,eventhoughitcontainedsites. Discussion TherewereproblemswhentryingtogetalotofPCRproductforcloningwiththeHF buffer.TheerrorrateofPhusionHighFidelityDNAPolymeraseinHFBufferislower thanthatinGCBuffer.Therefore,theHFbuffershouldbeusedasadefaultforhigh fidelity amplification. However, GC Buffer can improve the performance of Phusion DNAPolymerase.Therefore,GCbufferwaschosenforthesmallupscaleofvolume.

16 TherewasnoproblemwithusingtheHFbufferforthethioredoxinprimers.Theband sizes which were observed on the gels all had the expected sizes. Though, when thereisalotofproductloadedontoagelittendstogivetheimpressionofalower molecularweight. After transformation of E.coli strains with the two plasmids, the analysis was made withdifferentmethods. PCR on colonies directly (figure 3a and 3b), test expression (figure 4 and 6), the doubledigestionwithEcoR1andKpnI(figure5)andthesequencingshowedthatthe transformationhadworked.Thesmallpart,~150bpinthepET102D/TOPOCWR1, thatwasnotsequencedwillbesequencediftheenzymeshowstobeactive. TheCWRhadbeennicelyligatedintoeachvector.Onlythecolonya4didnotwork becausesometimestheplasmidcloseswithoutthegene.Thatiswhythebacteriahad grownonthemediumcontainingantibioticseventhoughtherewasnoinsertpresent. Itisstillstrangethough,becausetheopenplasmidhadastickyandabluntendso theoretically it would not close. All this experience was an indication also that the preparationofcompetent E.coli wassuccessful. The previous attempt to express the βglucosidase using the IMPACTCN (Intein Mediated Purification with an Affinity Chitinbinding Tag) had not been successful. One possible explanation was that this was maybe due to a too big tag, which interferedormadeitdifficultforthebacteriatofoldtheproteinproperlyandtherefore producedinclusionbodies.Forthisreasonaconstructwithasmallertag,a(his) 6tag waschosen.Thefigure6showsthatthe βglucosidasewasnotsolublewhenfusedto the(his) 6tag.So,itseemsthattherewasnotamatterofsize. Thesolubilityofthethioredoxintaggedproteinwasalsotested.Severalotherproteins thathavehadproblemswithsolubilityhadbecomesolublewhenfusedtothioredoxin (Yasukawa, 1995). Indeed the thioredoxin tag helped the protein become soluble. Nevertheless, this was a very small fraction. The induced control indicates that the amountofinducedproteinmakesupapproximately50%ofthetotalprotein.Forthe solubleproteinfractionthepartofinducedproteinismuchsmaller,approximately5%. Thefactthatsomuchproteintendtoendupintheinsolublepelletedfractionindicates formation of inclusion bodies. This is not a very uncommon result when trying to expressarecombinantproteinin E.coli .Indeed, E. colihasafrequentdepositionofthe expressed protein product into insoluble inclusion bodies (Cabrita and Bottomley 2004). Forinstance,amaize βglucosidasewasoverexpressedin E.coli ,resultingin accumulation of most of the protein in insoluble inclusion bodies (Jan Zouhar et al. 1999). The figure 7 confirms this conclusion. Indeed, the supernatant from the colony 1 containedabandat80kDa.Therearetwootherbandsat~55kDaand~40kDa.It means the primary antibody, which was used in this Western Blot, was not really specific for the thioredoxin tag. One possibility could also be that there were degradationproductsformthe βglucosidasefusionprotein.

17 Inthefigure8,contrarytotheHistaggedsystem,one80kDabandispresentinthe supernatantsamples.Thisfigureshowsusasthetimeofinductionincreases,thesize ofthebandinthesupernatantincreases.Itmeansasthetimeofinductionincrease, thecellsproduceaprotein. Thefigure9ashowstherewerealotofbandspresentineluates1and2.Thefigure 9b shows that there were four other bands, though a lot weaker. It means the Ni columnwasnotsospecifictobindthethioredoxintag. Therewasa80kDabandwith quitethesamethickness,onjudgedbytheeyesinthe4xE2sample.Itmeanswhen the eluate quantity increase, the βglucosidase quantity after purification did not increase.Thiswasnotexpected. The resultsof the pNPG assay did notshowanysuccess.Itwasprobablyduetoa weakconcentrationofthe βglucosidaseineluatesamples.Forsomeeluatesitwasa bit unclear whether there was activity or notsince it wassolow.Ontopofthis,the eluateswerenotpuresotherecouldbeariskthatitwerecopurified E.coli proteins thatwereresponsiblefortheactivity.Onewaytosolvethiscouldbetomakeanative gelandmakeanactivitystaindirectlyonit.Theresultsshowsaverysmalldifference between uninduced and induced sample (supernatant): 3 nmol/(mg, min) and 9 nmol/(mg,min)respectively.Ofcourse,therewasactivity,whichcamefromthe E.coli enzymesbutitistoodiluted.Anotherpossibilityfortheundetectableactivitywasthat theproteinwasnotactivewhenitwasproducedinE.coli atall.Itwasnotsurprising thatthepositivecontrolgavesuchahugeactivitysincethesamplewaspure. ABLASTsearchwasperformedtoinvestigatethemostsimilarproteinstothecellwall βglucosidasewithrespecttopossibleglycosylation(table2).Thepartlypurifiedcell wall βglucosidase glycosylation had been analysed and there was not any proof of glycosylation(NikusandJonsson2003).AsNikusandJonssonargues,thiswasvery surprising since it is very common for secreted proteins, like the cell wall β glucosidase, to be glycosylated (Cairns et al. 2000, Varghese et al.1999). The βD glucosidasefromIndianCress(table2)hadbeensequenced,andtheNglycosylation sites determinated. The primary amino acid sequence contains four putative N glycosylation sites (Crombie et al. 1998). As well, exoβDglucanase also called ExGase (table 2) which was purified from the cell walls of developing maize ( Zea mays ) shares five putative glycosylation sites with the cell wall βglucosidase in rye (Kim et al. 2000). The same observation had been observed with the βDglucan exohydrolase exo II (table 2). This enzyme had four Nglycosylation sites and there wasalsosomeevidenceforactualglycosylation(Hrmovaetal.1995). βDGlucanExohydrolase,ExoI,inbarley(table2)whichalsobelongstofamily3and sharethecatalyticsiteswiththecellwallβglucosidaseisglycosylatedatthreesites according to the threedimensional structure (Varghese et al. 1999). Two of the correspondingsitesarefoundinthecellwallβglucosidase.So,iftheenzymesthat showabigsequencesimilaritytothecellwall βglucosidaseareglycosylated,thereis abigprobabilitythatthecellwall βglucosidaseisalsoglycosylated.

18 The Nglycosylation sometimes has a role in the biological activity of the protein. Indeed the Nglycosylation influence the glycoprotein conformation, stability and biological activity for some proteins (Rayon et al. 1998). For the cell wall β– glucosidase there is not enough information from the literature whether the actual glycosylation is important for the activity or not. βDGlucan Exohydrolase, Exo I, in barleyhasthreeglycosylationsiteswhereoneisdiscussedtohaveapotentialrolein guiding of the substrate (Varghese et al 1999). It is not very similar to our β glucosidasebutissimilarinregionsoftheactivesiteandtheglycosylationsites. Theimportanceofglycosylationfortheactivitymaynotbesobig,itismorelikelythat itplaysabiggerroleinfoldingtheproteincorrectly.Thismightexplaintheproduction of inclusion bodies. In this project, three systems for cloning, expression and purificationofthecellwall βglucosidasehavebeentried.Allofthemwerebasedon expression in E.coli . The plastidic βglucosidase was successfully cloned and expressedin E.coli (Nikusetal.2003).TheNetNGlycpredictionprogramsuggested thattherewasnoglycosylationsincetheproteindoesnotcontainasignalpeptideand thereforeitisnotverylikelythatitisexposedtoNglycosylation. E.coli cannotmake many of the modifications that eukaryotic proteins, like plant proteins, need. Using yeast for expression could be used if protein needs posttranslational modifications, likeglycosylation. Three E.coli basedsystemshavebeentriedforcloning,expressionandpurificationof thecellwall βglucosidase.Inthisexjobononehand E.coli producedaweakfraction of soluble βglucosidase when coexpressed with a thioredoxin tag. One the other hand, E.coli probablydidnotproducea βglucosidasewhichwasactive.Productionof asoluble andactive βglucosidase with E.coli was maybe not a good way. Agood idea is to move into a higher system, like yeast or insect cells, since they have a higherpossibilitytomakeposttranslationalmodificationsifneeded. Acknowledgments ThisworkwasmadepossiblebySödertörnsHögskola,Flemingsberg,SwedenandI amthankfulforbeingacceptedasastudentthere. IwouldliketothankProfessorLisbethJonssonforacceptingmeforthisproject. IwouldliketothankMScThereseGradinforherpatientlyhelpwiththelabwork,for thehelpwiththereport,foransweringmyquestionsaboutdetailsoflabworkandalso improvemyEnglishskills.

19 References Picturesonthefrontpage:http://www.cdl.umn.edu/introduction/crops_images/rye.gif AsimE(1993) βglucosidase,biochemistryandMolecularBiology.(Manuscript) CabritaLDandBottomleySP(2004). Proteinexpressionandrefolding–apractical guidetogettingthemostoutofinclusionbodies.BiotechnolAnnuRev,10:3150. Cairns K, ChampattanachaiV,SrisomsapC,WittmannLieboldB,ThiedB,and Svati J (2000). Sequence and expression of thai rosewood βglucosidase/ β fucosidase,afamily1glycosylhydrolaseglycoprotein.J.Biochem.128:9991008 CrombieHJ,ChengappaS,HellyerA,ReidJSG(1998). Axyloglucan oligosaccharide active,transglycosylating D glucosidasefromthecotyledonsof nasturtium(TropaeolummajusL)seedlings–purification,propertiesand characterizationofacDNAclone.ThePlantJ15(1),27–38. DharmawardhanaDP,EllisBE,CarlsonJE(1999). cDNAcloningandheterologous expressionofconiferin βglucosidase.PlantMolBiol40:365372 Dietz KJ, Sauter A, Wichert K, Messdaghi, Hartung W (2000). Extracellular β glucosidaseactivityinBarleyinvolvedintheHydrolysisofABAglucoseconjugatein leaves.JExpBot51:937944. Gupta R, Jung E, Brunak S (2004). Prediction of Nglycosylation sites in human proteins.Inpreparation HenrissatB,DaviesGJ(1997) Structuralandsequencebasedclassificationof glycosidehydrolases.Curr.Op.Struct.Biol.7:637644[PMID: 9345621 ]. (http://www.cazy.org/) Hrmova M, Harvey AJ, Wang J, Shirley NJ, Jones GP, Stone BA, Hoj PB, and FincherGB.(1996). Barley βDGlucanexohydrolasewith βDglucosidaseactivity.J BiolChem,271:52775286 KimJB,OlekAT,andCarpitaNC(2000). CellWallandMembraneAssociatedExo βdGlucanasesfromDevelopingMaizeSeedlings.PlantPhysiol.123(2):471–486. Niemeyer H. (1988) Hydroxamic acids (4hydroxy1,4benzoxazin3ones), defence chemicalsinthegramineae.Phytochemistry,27:33581988 NikusJ,DanielG,JonssonLMV(2001). Subcellularlocalizationof βglucosidasein rye,maizeandwheatseedlings.PhysiolPlant111:466472

20 NikusJ,EsenA,JonssonLMV(2003) Cloningofaplastidicrye( Secalecereale )β glucosidasecDNAanditsexpressionin E.coli .(Manuscript) NikusJ,JonssonLMV(2003) .Cellwallglucan1,3β glucosidasefromrye(Secale cereale)activeonhydroxamicacidglucosides.(Manuscript) PoultonJE(1990). CyanogenesisinPlants.PlantPhysiol94:401405 RayonC,LerougePandFayeL(1998). TheproteinNglycosylationinplants.JExp bot,49:14631472 RoosA(2006) Ryecellwall βglucosidase:purificationofrecombinantproteinfrom E.coli andmolecularandkineticcharacterizationoftheenzyme.(Studentreport) Varghese JN, Hrmova M, Fincher GB (1999). Threedimensional structure of a barley βDglucanexohydrolase,afamily3glycosylhydrolase.Structure15;7(2):179 90 YasukawaT,KaneiishiiS,MaekawaT,FujimotoJ,YamamotoT,IshiiS(1995). Increase of solubility of foreign protreins in E.Coli by copropduction of the bacterial Thioredoxin.JBiolChem.270:2532825331. ZouharJ,NanakEandBrzobohatyB(1999). Expression,singlesteppurification, andmatrixassistedrefoldingofamaizecytokininglucosidespecific βglucosidase. ProteinExprPurif17:153162.

21