(FIGURE S3)

A) Query 1 MSTEILFVFLFSLAFLFVARKVAKRIGLVDKPNYRKRHQGLIPLVGGIAVYAGLCFAFFI 60 MSTEILFVFLFSLAFLFVARK AKRIGLVDKPNYRKRHQGLIPLVGGI+VYAGLCFAF I Sbjct 1 MSTEILFVFLFSLAFLFVARKAAKRIGLVDKPNYRKRHQGLIPLVGGISVYAGLCFAFLI 60

Query 61 SEQKIAHSELYLTCAGILVFIGALDDRFDISVKIRALVQALVGIAMMVFAGLYLRSFGHV 120 S+Q IAH +LYL CAGILVF+GALDDRFDISVKIRALVQALVGIAMMVFAGLYLRS GHV Sbjct 61 SDQTIAHGKLYLACAGILVFVGALDDRFDISVKIRALVQALVGIAMMVFAGLYLRSLGHV 120

Query 121 LGNWEVLLGPFGYLVTLFAVWAAINAFNMVDGIDGLLGGLSCVSFGALGLLLYLSGHNEL 180 LG+WE+LLGPFGYLVTLFAVWAAINAFNMVDGIDGLLGGLSCVSFGALG+LLYLSGH++L Sbjct 121 LGDWEMLLGPFGYLVTLFAVWAAINAFNMVDGIDGLLGGLSCVSFGALGVLLYLSGHSDL 180

Query 181 AFWCFAMIAAIVPYILLNLGILGRRYKVFMGDAGSTLIGFTAIWLLLQSSQGATRSINPV 240 AFWCFAMIAAIVPYILLNLGILGRRYKVFMGDAGSTLIGFTAIWLL+QSSQG SINPV Sbjct 181 AFWCFAMIAAIVPYILLNLGILGRRYKVFMGDAGSTLIGFTAIWLLVQSSQGKAHSINPV 240

Query 241 TALWIIAIPLMDMIAIMYRRLRKGMSPFSPDRQHIHHLIMRAGFTPRQAFVLITLAAVLL 300 TALWIIAIPLMDMIAIMYRRLRKGMSPFSPDRQHIHHLIMRAGFTPRQAFVLITLAA LL Sbjct 241 TALWIIAIPLMDMIAIMYRRLRKGMSPFSPDRQHIHHLIMRAGFTPRQAFVLITLAAALL 300

Query 301 AAVGVIGERLTFIPEWVMLALFLLAFFLYGYCIKRAWRVARYIKRIKRRLRRSSDNKQVS 360 AA+GV+GERLTFIPEWVMLALFLLAFFLYGYCIKRAWRVARYIKRIKRRLRRS+ NKQVS Sbjct 301 AAIGVLGERLTFIPEWVMLALFLLAFFLYGYCIKRAWRVARYIKRIKRRLRRSNGNKQVS 360

B) Query 1 MSKAIMGAAALSIGDILGKVIGFIILPYLTAHLGASGYGALTLYLSVIQILIIFISFSGQ 60 MS+ I+GAAALSIGDILGKVIGFIILPYLTAHLGASGYGALTLYLSVIQILIIFISFSGQ Sbjct 1 MSRVIIGAAALSIGDILGKVIGFIILPYLTAHLGASGYGALTLYLSVIQILIIFISFSGQ 60

Query 61 GLLPVKYMQEGEGSSLVFRRDNITLAVASSVLLVAIFYIVTLVTKISVSFSDGFLVVLAS 120 GLLPVKYMQEGEGSSLVFRRDNITLAVASSVLLVAI+++ T V I VSF+DG LV+LAS Sbjct 61 GLLPVKYMQEGEGSSLVFRRDNITLAVASSVLLVAIYFVATRVASIPVSFTDGSLVILAS 120

Query 121 LAQALNFINLSHLRISQTYKVAAIGQFLLSAFNVLFTIALFEMIAATPGQRLIAIAASFF 180 LAQALNFINLSHLRISQTYKVAAIGQFLLSAFNVLFTIALFEMIAATPGQRLIAIAASFF Sbjct 121 LAQALNFINLSHLRISQTYKVAAIGQFLLSAFNVLFTIALFEMIAATPGQRLIAIAASFF 180

Query 181 SIQLAYEFFVYRKRAVKRSELEPSRKRYKEIISYGVSLLPHHGSYWIKSSVDRFFIAHYM 240 IQLAYEF +YRKR+ P KRYKEI+SYG+SLLPHHGSYWIKSS+DRFFIAHYM Sbjct 181 CIQLAYEFIIYRKRSKDIKSDAPLGKRYKEIVSYGISLLPHHGSYWIKSSIDRFFIAHYM 240

Query 241 STAVVGVYGLAFQLTSIVMLFFGVINQAFQPFIYRKLKANDFRGVELIQYGYTALVIVSC 300 STAVVG+YGLAFQLTSIVMLFF VINQAFQPFIYRKLK DF+GVELIQYGYT LV+V+C Sbjct 241 STAVVGIYGLAFQLTSIVMLFFSVINQAFQPFIYRKLKDKDFKGVELIQYGYTGLVVVAC 300

Query 301 VIYFFILPFAFPYLFNAEFNSAIYYFNILLPGTAFLSIYYIFTHSLFYYRKNKLISIITM 360 +IYF I+PF FPYLF EFN AIYYFNILLPGTAF+SIYYIFTHSLFYYRKNK+ISIITM Sbjct 301 LIYFLIMPFVFPYLFKPEFNRAIYYFNILLPGTAFMSIYYIFTHSLFYYRKNKMISIITM 360

Query 361 GSMITHLIGIFVITMTEIKVEYFCLVYAISSFYACAVTFYCGKRQIGIERKR 412 GSM+THL+GI +ITMT IKVEYFCLVYAISS ++C VTF GKRQI IERKR Sbjct 361 GSMLTHLLGIVIITMTSIKVEYFCLVYAISSLFSCVVTFIYGKRQIRIERKR 412

C) Query 1 MLGDAVFNRVKRSDHKAISEISAFLRSNDLNIDTTVEIFITVTQHDKLVACGGIADNIIK 60 MLGDA+F+RVKRSDHK I++I+AFLRSNDLNIDTTVEIFITVTQ+DKLVACGGIADNIIK Sbjct 1 MLGDAIFDRVKRSDHKNITQINAFLRSNDLNIDTTVEIFITVTQNDKLVACGGIADNIIK 60

Query 61 CVAISPLMRGEGLALALATELVNLAYERHHTQLFIYTKVQNEPLFRQCGFYPIATVPGIV 120 CVAISP MRGEGLAL LATELVNLAYERHHTQLFIYTK+QNEPLFRQCGFYPIA VPGIV Sbjct 61 CVAISPQMRGEGLALRLATELVNLAYERHHTQLFIYTKIQNEPLFRQCGFYPIAQVPGIV 120

Query 121 VLMENSPCRLKRYAAQLASQRRPGETIGSIVMNANPFTRGHQYLVRQAAKRCDWLHLFLV 180 VLMENSPCRLKRYAAQLA QR+PG+ IGSIVMNANPFTRGHQYLVRQAA++CDWLHLFLV Sbjct 121 VLMENSPCRLKRYAAQLAGQRQPGKRIGSIVMNANPFTRGHQYLVRQAAEKCDWLHLFLV 180

Query 181 KENTSRFSYEDRRRLVLAGTADIPNLTVHEGSQYVISRATFPCYFIKDQGVADDCYTEID 240 KENTSRFSYEDRRRLVLAGTADI NLTVHEGSQYVISRATFPCYFIKDQG+ADDCYTEID Sbjct 181 KENTSRFSYEDRRRLVLAGTADIANLTVHEGSQYVISRATFPCYFIKDQGIADDCYTEID 240

Query 241 LKIFRQYLAPALGITHRFVGNEPFCAVTAKYNRDMRYWLETPALPSPPIALVEIERLQYQ 300 LKIFRQYLAPALGITHRFVGNEPFCAVTAKYNRDMRYWLETPAL SPPIALVEIERLQYQ Sbjct 241 LKIFRQYLAPALGITHRFVGNEPFCAVTAKYNRDMRYWLETPALSSPPIALVEIERLQYQ 300

Query 301 GTAISASWVRKLLAAGDFHAAAPLVPPDTLYYLQDLQTQRRAKAAPHPFESAQSGE 356 GTAISASWVRKLLAAGDFHAAAPLVPPDTLYYLQDLQTQRRAK A PFESAQSGE Sbjct 301 GTAISASWVRKLLAAGDFHAAAPLVPPDTLYYLQDLQTQRRAKPAATPFESAQSGE 356

D) Query 6 SKIQVVGQFWPEIEVLITREGKEVDISDDIWRLPYSTRDNSTLNFTKIINSEIRESFKGH 65 SK Q V Q WPEIE LITREGKEVDIS D WRLPYS RDNSTLNFTKI NSEIRE+FK H Sbjct 6 SKPQAVEQSWPEIEKLITREGKEVDISGDTWRLPYSARDNSTLNFTKITNSEIREAFKDH 65

Query 66 VADRLKRISTHAGYAVYQDVWREVLRHWGNPAPQVDTESHLIDLFEIAINRARSQKRLWA 125 VADRLKRISTHAGYA YQDVWREVLRHWG+P QVDTESHLI LFE AINRARS+KRLWA Sbjct 66 VADRLKRISTHAGYAAYQDVWREVLRHWGSPVSQVDTESHLIGLFETAINRARSRKRLWA 125

Query 126 MYRPIQWYIWSADNNPEYGFSEIYAQELEVLELPGNPKGEAVRMEDPEGGPLNKSLELPL 185 MYRPIQWYIWSADN PE GFSEIYAQELE LELPGNPKGEAVRMEDPE GPL+KSLELPL Sbjct 126 MYRPIQWYIWSADNKPECGFSEIYAQELEALELPGNPKGEAVRMEDPESGPLHKSLELPL 185

Query 186 LINALKSDEGRSLEHLHQRVVVALSIAFGRNPANLIFLRESDFERLAPGGEDPCYIIRMP 245 LINALKSDEGRSL +L QRVVVALSIAFGRNPANL FLRESDFE L PGGEDPCYIIRMP Sbjct 186 LINALKSDEGRSLGYLQQRVVVALSIAFGRNPANLTFLRESDFECLVPGGEDPCYIIRMP 245

Query 246 RIKKRFVNPRDDLLDEYLDPHFGAMIEQLIEISKLVPLSFADRAFVNPEERPLLINRNGN 305 RIKKRFVNPRDDLLDEYLDPHFGAMIEQLIEISKLVPLSFADRAFV PEERPLLINRNGN Sbjct 246 RIKKRFVNPRDDLLDEYLDPHFGAMIEQLIEISKLVPLSFADRAFVIPEERPLLINRNGN 305

Query 306 KAAILSKDLDNAFNLTSSDISRLLSAFVKRHNIISPLTGELMRVTPRRLRYTLATGLAAE 365 KAAILSKD+DN FNLTSSDI+RLLSAFVKRHNIISPLTGEL+ VTPRRLRYTLATGLAAE Sbjct 306 KAAILSKDIDNVFNLTSSDIARLLSAFVKRHNIISPLTGELLHVTPRRLRYTLATGLAAE 365

Query 366 GISKHELARILDHTDIQHVNVYFEMAGRIVQHLDKATAKGFSTYLNFFRGRLINSDENAV 425 GISK ELARILDHTD QHVNVYFEMAG+IV+HLDKATAKGFS YL+FF+GRLI++DE+AV Sbjct 366 GISKRELARILDHTDTQHVNVYFEMAGKIVEHLDKATAKGFSKYLSFFKGRLIDNDEDAV 425

Query 426 NGERDDKHLTFFDEQNPTDQVGIGVCGESSVCHLDPPYSCYLCPKFQPYRHANHEQILEC 485 NGERDDKHLTF DEQNP DQ IGVCGESSVCHLDPPYSCYLCPKFQPYRHA+HE +LEC Sbjct 426 NGERDDKHLTFVDEQNPNDQADIGVCGESSVCHLDPPYSCYLCPKFQPYRHADHEHVLEC 485

Query 486 LLAGREERLKKYENARLGIQLDEVIAAVAQVAKLCEEGGDNV 527 LLAGREERLKKYENARLGIQLDEVIAAVAQVAKLCE+G NV Sbjct 486 LLAGREERLKKYENARLGIQLDEVIAAVAQVAKLCEKGVHNV 527

E) Query 1 MNLVKVDWTPENMGGRKTVPPAGKYYAVSRFPEDKEWQNNSWSVFFELEESKSENGKTFS 60 M+LVK+ W + GGR P G+YYAVSRFPED WQNN+WSV F++ +KSEN + S Sbjct 1 MHLVKIKWVQKKDGGRTRPPDVGRYYAVSRFPEDTTWQNNAWSVVFDIHSTKSENNEFIS 60

Query 61 LGTVDFLMESAPKERMVQHDKFEIYEGPKKVADVFL 96 +VDFLM++APKERM QH F+IYEGPKKVA VFL Sbjct 61 EASVDFLMDNAPKERMEQHSTFDIYEGPKKVATVFL 96

F) Query 4 RKDILLQVLEYMEKNIVEGLSVEKVSIISGYSKWHLQRLFKHYFGITLGTYIRNRKLSRS 63 + +I+ +L +M+ N+ LS++ V+ +GYSKWHLQR+FK G +G+YIR R+LS++ Sbjct 3 QTNIIRDLLHWMDNNLDRPLSLDNVAAKAGYSKWHLQRMFKDVTGQAIGSYIRARRLSKA 62

Query 64 AILLKQHQGNILDVALASGFASQQCYTRAFKRFFGETPNSFRNSRGWDFSTQIPPYGNDK 123 A+ L+ ILD+AL F SQQ +TRAFK+ F TP S+R + W + PP D Sbjct 63 AVALRLTSRPILDIALQYRFDSQQTFTRAFKKQFNRTPASYRRNEEWCATGICPPIMLDN 122

Query 124 KPYFYHTVMPDDIEMLKHYKSLIFHSIRKFRDAGDIAQTNEKWLSKQRGACNVEKKRSNK 183 KP H + + ++L + +++ ++ A + W++ + A + + Sbjct 123 KPLPAHKYVTLEDQLLIGTEHTCSYTLEQWSSAC-TEMRHGFWVNYLQHAETLPPRLYGL 181

Query 184 WHSNSKG-----QEVFHS------IFN------FYIPMGKYIVIPFTGELSEYIDFFD 224 HS+ Q VF++ FN +P G YI + G + +F Sbjct 182 HHSDLSAEHEDEQTVFYTTAIEPEYATFNTQDNRPVVLPGGDYISFSYFGGKEQLKEFLF 241

Query 225 VMYDAYLPAINVKMRESFFIELYRQEDLNGK------VVNVDILIPV 265 +Y LP +N+ R+ + +E Y +L K VV + LIP+ Sbjct 242 TVYGICLPKLNITRRKGYDVERYYLTNLEWKNFEDESQTHVVEFEYLIPI 291

G) Query 1 MFLIILFKSIIIGGLVGVGVGAGAARMFHAPTTQGMGAFRTLGELNSCEGDPASHFSFGL 60 MFLIILFKSIIIGGLVGVGVGAGAARMFHAPTTQGMGAFRTLGELNSCEGDPASHFSFGL Sbjct 1 MFLIILFKSIIIGGLVGVGVGAGAARMFHAPTTQGMGAFRTLGELNSCEGDPASHFSFGL 60

Query 61 GFFFNAWASSVAAGSFTQDVDHRIIPHWGAAALMVKNRNLAQTLHDPKKMAIACGIIGML 120 GFFFNAWASSVAAG+FTQDVDHRIIP+WGAAALM+KNRN+AQTLHDPKKMAIACGIIGML Sbjct 61 GFFFNAWASSVAAGAFTQDVDHRIIPNWGAAALMIKNRNVAQTLHDPKKMAIACGIIGML 120

Query 121 VVAFLNTTASAVPAALQVTAIKVLVPAANLLVNTVMPVIFWLAAIDAGRRSGFWGTIFGG 180 VVAFLN+TASAVPAALQVTAIKVLVPAANLLVNTVMPVIFWLAAIDAG+RSGFWGTIFGG Sbjct 121 VVAFLNSTASAVPAALQVTAIKVLVPAANLLVNTVMPVIFWLAAIDAGKRSGFWGTIFGG 180

Query 181 LAQLIMGNAVPGLVLGILIGKGVEESGWNKITKIMMAVIVLLFVLSGFFRGFDMKVLESF 240 LAQLIMGNAVPGLVLGILIGKGVEESGWNK+TKIMMAVIVLLFVLSGFFRGFDMK+LESF Sbjct 181 LAQLIMGNAVPGLVLGILIGKGVEESGWNKVTKIMMAVIVLLFVLSGFFRGFDMKLLESF 240

Query 241 SLGVPGWLDAIHNTLSGK 258 SLG+PGWLDAIHNTLSGK Sbjct 241 SLGIPGWLDAIHNTLSGK 258

H) Query 16 KPLIYAGSLSDEELIDFLEGVILKIKSRRRLIGRPESLREQAGEVEERLKSANEFINSIV 75 KP IYAGS +DEELI+FLE V+ ++K R RL R + LR + + + NEFINS+V Sbjct 132 KPYIYAGSYTDEELINFLETVVAELKCRNRLPLRQQQLRHELEAYQAEIDRNNEFINSVV 191

Query 76 CRQVQSPTLNP 86 RQ ++P NP Sbjct 192 YRQDKNPKQNP 202

I) Query 6 SDPTIAAALVSALVTVSLFLIKGFCAPLWSKHFHSYKIKVEHNHEQKKKIKEAISKYKMP 65 SDP I AA+VS VT+ +F++KG P W KHFH +KI+ EH +EQKKKIKEAISKYK+P Sbjct 2 SDPKIIAAVVSGTVTLLMFILKGLTKPFWEKHFHHFKIRTEHKYEQKKKIKEAISKYKVP 61

Query 66 LLDSAESLNHRLWNFSVNCNKGWHNLGSNDLLSEKYYLQNFCYRFLALYAWCLKFERELI 125 L+D+AESLNHRLWNFS NC+K W + + +KYYLQ+FCYR+L +AWC K E+EL+ Sbjct 62 LIDAAESLNHRLWNFSGNCSKDWLTFKPKEKIKDKYYLQSFCYRYLVFFAWCRKIEKELV 121

Query 126 YLDVTLSDEGDLCFVKYIKVMKNIFCDASMLSNTGYDNEHAIDHFFKDDLISMAEKMLTS 185 YLD TLSD+ DL FVKY+K M+NIFCD S+ YD+EHA+DHFFKD L+SMA+ ++T Sbjct 122 YLDSTLSDKDDLYFVKYLKTMQNIFCDVSLFDARNYDSEHAVDHFFKDQLLSMADSLITE 181

Query 186 TGVITFSEFKTKNEVDYAQVCKYISTIMQDKSCNKWHLFNCFHFTLMAFLSRYGYDYQRT 245 GV++FSEF+T N Y +V Y S I +++ CNKW + FHF LMAFLS+YGYDYQ+T Sbjct 182 NGVVSFSEFQTWNISKYKKVSDYFSAISKNQDCNKWFALHGFHFVLMAFLSKYGYDYQKT 241

Query 246 GLFKLYKLRCREPKNKLMSNFNLLILNAKLNKCKEIKKAIDIL 288 KL KLR PKN + SN L+ + L+KCK +K + +L Sbjct 242 SKCKLKKLRDETPKNLVASNLFQLVKKSHLDKCKNMKLTMKVL 284

J) Query 1 VLAFDSAQLTVWLSHYFWPLLRILALIGTAPIFSEKQISKKVKIGLGGLIVILIAPTLPA 60 +L DSAQ VWLSHYFWPLLRILALI TAP+FSEKQI KKVKIGLGGLIVILIAP LP Sbjct 1 MLTVDSAQFGVWLSHYFWPLLRILALISTAPVFSEKQIGKKVKIGLGGLIVILIAPGLPT 60

Query 61 SNIPIFSAAGLWLAIQQILIGVALGLTMQFAFAAVRLAGEVIGMQMGLSFATFFDPSGGP 120 S +PIFSAAGLWLA QQILIGVALGLTMQFAFAA+RLAGEVIGMQMGLSFATFFDPSGGP Sbjct 61 SAVPIFSAAGLWLAAQQILIGVALGLTMQFAFAAIRLAGEVIGMQMGLSFATFFDPSGGP 120

Query 121 NMPVLARLLNLLAMLLFLSFDGHLWLISLLADSFHTLPIQTQPLNGNGFLVLTQVGSLIF 180 N PVLARLLNLLAMLLFLSFDGHLWLISLLADSFHTLPIQ PLNGNGFL LTQVGSLIF Sbjct 121 NTPVLARLLNLLAMLLFLSFDGHLWLISLLADSFHTLPIQPAPLNGNGFLALTQVGSLIF 180

Query 181 INGMMLALPLICLLLTLNMALGLLNRMTPQLSVFVIGFPVTMTFGIMTLGMMMPMLAPFC 240 INGMMLALPLICLLLTLNMALGLLNRMTPQLSVFVIGFPVTMT GIMT+GMMMPMLAPFC Sbjct 181 INGMMLALPLICLLLTLNMALGLLNRMTPQLSVFVIGFPVTMTIGIMTIGMMMPMLAPFC 240

Query 241 EHLFGEVFDRLAAVIGGMTF 260 EHLFGE FD+LA VIGGM F Sbjct 241 EHLFGEFFDQLAGVIGGMAF 260

K) Query 4 RIDYTKTSPAGVKALGGVYTYVAQCGLENTLVELVNLRVSQINGCAYCLDMHTRDLLKKG 63 R+DYTK +P GVKA G VY YV Q GL++ LVELV LRVSQINGCAYCLDMHTRDL+K+G Sbjct 2 RLDYTKAAPGGVKAFGSVYGYVMQSGLDDVLVELVYLRVSQINGCAYCLDMHTRDLVKRG 61

Query 64 LSPVKLALVQVWEEAGEVFTAREKAALSWAETVTRVSETHVPDEAFAAASAVFEDKELAD 123 +S KLALVQVW EAG +F+ REKAAL+WAETVTRVSETHVPD+AF AASAVF +KELAD Sbjct 62 VSVGKLALVQVWHEAGALFSDREKAALAWAETVTRVSETHVPDDAFGAASAVFSEKELAD 121

Query 124 LTMAIGLINAYNRLAISFRNVPQ 146 LTMAI L+NA+NRLAISFR PQ Sbjct 122 LTMAISLMNAFNRLAISFRRAPQ 144

L) Query 1 MAILPRQGQRWLAGMVMLTLSGCAYIPHKPLVDGATTAQPAPASAPMPNGSIFQTVQPMN 60 MA P+QG+RWLA M MLTLSGCAYIPHKPLVDGATTAQPAPA AP+PNGSIFQTVQPMN Sbjct 1 MANAPQQGKRWLAAMAMLTLSGCAYIPHKPLVDGATTAQPAPAGAPVPNGSIFQTVQPMN 60

Query 61 YGYQPLFEDRRPRNVGDTLTIVLQENVSASKSSSANASRNGASKFGVATSPRYLDGLLGN 120 YGYQPLFEDRRPRN+GDTLTIVLQENVSASKSSSANASRNGASKFGVATSPRYLDGLLGN Sbjct 61 YGYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRNGASKFGVATSPRYLDGLLGN 120

Query 121 ARADMDISGDSTFGGKGGANANNTFNGTITVTVNQVLANGNLHVVGEKQIAINQGTEFIR 180 ARADMDISGDSTFGGKGGANANNTF+GTITVTVNQVL NGNLHVVGEKQIAINQGTEFIR Sbjct 121 ARADMDISGDSTFGGKGGANANNTFSGTITVTVNQVLVNGNLHVVGEKQIAINQGTEFIR 180

Query 181 FSGVVNPRTISGNNSVTSTQVADARIEYVGNGYINEAQTMGWLQRFFLNVSPF 233 FSGVVNPRTISG+NSVTSTQVADARIEYVGNGYINEAQTMGWLQRFFLNVSPF Sbjct 181 FSGVVNPRTISGSNSVTSTQVADARIEYVGNGYINEAQTMGWLQRFFLNVSPF 233

M) Query 1 MKIVTGIILTSVVAFSGAAYAADAQPTTGNAEVTLEHVHAVMENGSPAPQHDAACKKELS 60 MKIVTGIILTS+ AFSGAAYAADAQPTTG+AEVTLEHVHAVMENGSPAPQHDAACKKELS Sbjct 1 MKIVTGIILTSMAAFSGAAYAADAQPTTGSAEVTLEHVHAVMENGSPAPQHDAACKKELS 60

Query 61 MPESKYMGMKVKTDYTINSSSMMMSAKSMFPSPDSMKPMELTVDLSALGLADVYAFGAFK 120 MPESKYMGMKVKTDYTINSS+MMMSAKSMFPSP+ +PMELTVDLSALGLADVYAFGAFK Sbjct 61 MPESKYMGMKVKTDYTINSSTMMMSAKSMFPSPNDKQPMELTVDLSALGLADVYAFGAFK 120

Query 121 PAALPQAYIYFTIEKDFKNPVSTFMIINQGKQYNCVISSSNKMMSEEMRGKMMKKQ 176 PAALPQAYIYFTI+KDFKNPVSTFMIINQGKQYNCVISSSNKMMSEEMRGKMMKKQ Sbjct 121 PAALPQAYIYFTIDKDFKNPVSTFMIINQGKQYNCVISSSNKMMSEEMRGKMMKKQ 176

N) Query 1 MVKRFRRMSNSDINTIVADLDRWALGELGSKLTWAVLEERFGFSRQSLQAKSEIKAAYDN 60 MVKRFRRMS+SDINTIVADLDRWALGELGSKLTWAVLEERFGFSRQSLQAK+EIKAAYDN Sbjct 1 MVKRFRRMSDSDINTIVADLDRWALGELGSKLTWAVLEERFGFSRQSLQAKTEIKAAYDN 60

Query 61 AKRALSGGSVKTKEQATKEAEELQVEVERLKSELEAYKRKEKIWLRRWQQIAFHVRQKGI 120 AK+ALSGG VKTKEQATKEAEELQVEV+RLK+EL+AYKRKE+ WLRRWQQIAFHVRQKGI Sbjct 61 AKQALSGGLVKTKEQATKEAEELQVEVDRLKAELDAYKRKEEQWLRRWQQIAFHVRQKGI 120

Query 121 QMDSVDKAPPEESCLPSNTETAKILWPFDKEIPPSGRV 158 QM SVDK PPE + LPSNTETA+IL PFDKE+PPSGRV Sbjct 121 QMASVDKVPPEGADLPSNTETAQILRPFDKEVPPSGRV 158

O) Query 1 MGNMGTSELLKHIYDINLSYLLLAQRLINDEKASAMFRLGIDETMADALAQLTLPQMVKL 60 MGNMGTSELLKHIYDINLSYLLLAQRLINDEKASAMFRLGIDETMADALAQLTLPQMVKL Sbjct 1 MGNMGTSELLKHIYDINLSYLLLAQRLINDEKASAMFRLGIDETMADALAQLTLPQMVKL 60

Query 61 AETNQLVCHFRFNESQTIERLTKESRVDDLQQIHTGILLSSHLLQELSSKDASPTKKRA 119 AETNQLVCHFRFNESQTIERLTKESRVDDLQQIHTGILLSSHLLQELSSKDASPTKKRA Sbjct 61 AETNQLVCHFRFNESQTIERLTKESRVDDLQQIHTGILLSSHLLQELSSKDASPTKKRA 119

FIGURE S3. Alignment of S. marcescens 2170 ORFs identified in the transposon screening with homologous genes of other species.

Nucleotide sequences flanking the transposon in each STM mutant were analyzed with BLAST and Artemis (Wellcome Trust Sanger Institute; www.sanger.ac.uk/resources/software/artemis/) in the S. marcescens 2170 genome contigs to identify nearby ORFs. Each S. marcescens 2170 ORF (Query) was then searched with NCBI BLAST for homologous genes of other species (Sbjct). A, In STM91, the transposon was inserted between nucleotides 273 and 274 within a 1083-bp (360 aa) ORF. This ORF showed 93% identity with undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase (wecA) (360 aa) of Serratia proteamaculans 568. B, In STM162, the transposon was inserted between nucleotides 111 and 112 within a 1242-bp (413 aa) ORF. This ORF showed 86% identity with PST family polysaccharide transporter (412 aa) of Serratia odorifera DSM 4582. C, In STM261, the transposon was inserted between nucleotides 334 and 335 within a 1071-bp (356 aa) ORF. This ORF showed 94% identity with citrate lyase ligase (357 aa) of Serratia proteamaculans 568. D, In STM314, the transposon was inserted between nucleotides 1,363 and 1,364 within a 1608-bp (535 aa) ORF. This ORF showed 89% identity with phage integrase (527 aa) of gamma proteobacterium HdN1. E, In STM315, the transposon was inserted 112 bp upstream of a 300-bp (99 aa) ORF. This ORF showed 61% identity with hypothetical protein PROSTU_00814 (97 aa) of Providencia stuartii ATCC 25827. F, In STM316, the transposon was inserted 598 bp upstream of an 801-bp (266 aa) ORF. This ORF showed 29% identity with transcriptional activator for resistance to antibiotics, organic solvents and heavy metals (AraC/XylS family) (293 aa) of Xenorhabdus bovienii SS-2004. G, In STM394, the transposon was inserted between nucleotides 53 and 54 within a 777-bp (258 aa) ORF. This ORF showed 97% identity with permease (258 aa) of Serratia odorifera DSM 4582. H, In STM396, the transposon was inserted between nucleotides 36 and 37 within a 291-bp (96 aa) ORF. This ORF showed 49% identity with hypothetical protein ESCAB7627_2151 (202 aa) of Escherichia albertii TW07627. I, In STM417, the transposon was inserted 58 bp upstream of a 924-bp (307 aa) ORF. This ORF showed 57% identity with hypothetical protein Sden_2787 (286 aa) of Shewanella denitrificans OS217. J, In STM447, the transposon was inserted between nucleotides 228 and 229 within a 783-bp (260 aa) ORF. This ORF showed 91% identity with flagellar biosynthetic protein FliR (260 aa) of Serratia odorifera 4Rx13. K, In STM639, the transposon was inserted between nucleotides 84 and 85 within a 456-bp (151 aa) ORF. This ORF showed 77% identity with hypothetical protein H16_A1750 (151 aa) of Ralstonia eutropha H16. L, In STM673, the transposon was inserted 19 bp upstream of a 702-bp (233 aa) ORF. This ORF showed 95% identity with flagellar L-ring protein FlgH (233 aa) of Serratia odorifera 4Rx13. M, In STM854, the transposon was inserted between nucleotides 16 and 17 within a 531-bp (176 aa) ORF. This ORF showed 95% identity with BsmB (176 aa) of Serratia liquefaciens. N, In STM855, the transposon was inserted between nucleotides 245 and 246 within a 477-bp (158 aa) ORF. This ORF showed 89% identity with Protein kinase (HDNI_03960) (158 aa) of gamma proteobacterium HdN1. O, In STM898, the transposon was inserted 138 bp upstream of a 360-bp (119 aa) ORF. This ORF showed 100% identity with flagellar transcriptional regulator FlhD (119 aa) of Serratia marcescens.
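
The two kinds of numbers quoted in the legend (ORF length in bp versus aa, and percent identity) can be checked with a short sketch. This is only an illustration, not the procedure used by the authors or by BLAST itself. It assumes that each quoted bp length includes the stop codon, which is consistent with every bp/aa pair in the legend, and that identity is counted over the aligned columns of one Query/Sbjct pair. The function names `orf_protein_length` and `percent_identity` are hypothetical.

```python
def orf_protein_length(orf_bp: int) -> int:
    """Protein length in aa for an ORF whose bp length includes the stop codon (assumption)."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1  # subtract the stop codon


def percent_identity(query: str, sbjct: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    # Gap characters ('-') never count as identical positions.
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Panel A: a 1083-bp ORF corresponds to 360 aa.
print(orf_protein_length(1083))

# Last alignment block of panel H (Query 76-86 versus Sbjct 192-202).
print(round(percent_identity("CRQVQSPTLNP", "YRQDKNPKQNP"), 1))
```

The second call covers a single 11-residue block only, so its value differs from the 49% reported for the whole panel H alignment; the full figure would need all blocks of that panel concatenated.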