Supplemental Data. Santelia et al. (2011). Plant Cell 10.1105/tpc.111.092155


DSP domain protein sequences used for phylogenetic analyses

>at3g52180_SEX4 MNCLQNLPRCSVSPLLGFGCIQRDHSSSSSSLKMLISPPIKANDPKSRLVLHAVSESKSSSEMSGVAKDEEKSDE YSQDMTQAMGAVLTYRHELGMNYNFIRPDLIVGSCLQTPEDVDKLRKIGVKTIFCLQQDPDLEYFGVDISSIQAY AKKYSDIQHIRCEIRDFDAFDLRMRLPAVVGTLYKAVKRNGGVTYVHCTAGMGRAPAVALTYMFWVQGYKLMEAH KLLMSKRSCFPKLDAIRNATIDILTGLKRKTVTLTLKDKGFSRVEISGLDIGWGQVNIFYLKPIYLL*

>Arabidly3_opsislyratasubsp.lyrata297819928 MNCLQNLPRCLVSPLLGFGCIQRDPSPSSLKMLVSPPIKANDPKARLFLLAVSESKSSSEMSGVSKEEEKSDEYS QDMTQAMGAVLTYRHELGMNYNFIRPDLIVGSCLQTPEDVDKLRKIGVKTIFCLQQDPDLEYFGVXDIRSIQAYA KKHSDIQHIRCEIRDFDAFDLRMRLPAVVSTLYKAVKRNGGVTYVHCTAGMGRAPAVALTYMFWVQGYKLMEAHK LLMSKRSCFPKLDAIRNATIDILTGLKKKTVTLTLKDKGFSAVEISGLDIGWGQRIPLTLDKGTGFWTLKRELPE GQFEYKYIIDGEWTHNEAEPFIGPNKDGHTNNYTKVVGDPTSVDGATRERLSSEDPELLEEERSKLIQFLETCSE AEIXX*

>Populust1_richocarpa224113173 MNCLRHLPRSSSALPLQGFNFHHRRPSSPPLFSFNMAGTMDYHDLQLRVAVKAAPGSPSSAEMSGTDVEEEEEEK SEIYSHNMTEAMGAVLTYRHELGMNYNFIRPDLIVGSCLQTPEDVDKLRKIGVKTVFCLQQDPDLEYFGVDISAI RDYAKACGDIQHLRAQIRDFDAFDLRIQLPAVVSKLRKAINQNGGVTYIHCTAGMGRAPAVALAYMFWVQGHKLN EAHDLLMSKRSSFPKLNAIKSATADILTGLRKKLVTLKWEDNNYSTVEISGLDIGWGQRIPLELDEERKFWILKR ELMEGVYEYKYIVDGEWIVNKNELVTTVNRDGHINNYVQVLDDDADSANAVLRKRLTCEDDPVLTREERLKIRMC LETLPGDDE*

>Orysasat1_iva108705759 MNCLQNLLKEPPIVGSRSMRRPSPLNLTMVRGGSRRSNTVKTASGASTSSAESGAVEAGTEKSDTYSTNMTQAMG AVLTYRHELGMNYNFIRPDLIVGSCLQSPLDVDKLRDIGVKTVFCLQQDPDLEYFGVDICAIQEYCLQCKDIEHC RAEIRDFDAFDLRLRLPAVISKLHKLVNHNGGVTYIHCTAGLGRAPAVTLAYMFWILGYSLNEGHQLLQSKRACF PKLEAIKLATADILTGLSKNSITLKWESDSCSSVEISGLDVGWGQIIPLTYNKEKRAWYLERELPEGRYEYKYIV DGKWVCNDNEKKTKANADGHVNNYVQVSRDGTSDEERELRERLTGQNPDLTKEERLMIREYLEQYVER*

>Zeamays01_219362899 MNCLQNLLKEPPIVGSRSMRRPSPLNLAMVRGGSRRSNTVKTLQAPGASTSGAESSAVEMGTEKSEVYSTNMTQA MGAALTYRHELGMNYNFIRPDLIVGSCLQSPLDVDKLRKIGVKTVFCLQQDSDLEYFGVDIRAIQDYSLQFKDIV HCRAEIRDFDAFDLRLRLPAVVSKLHKLINCNGGVTYIHCTAGLGRAPAVALAYMFWILGYSLNEGHRLLQSKRA CFPKLEAIKLATADILTGLSKNTITLKWEADGSSSVEISGLDIGWGQRIPLTYDEEKGAWFLEKELPEGRYEYKY VVDGKWLCNEHELITKPNADGHVNNYVQVSRDGTSDEEKELRERLTGPDPDLTDQERLMIREYLEQYADAAER*

>Zeamays02_108705763 MNCLQNLLKEPPIVGSRSMRRPSPLNLTMVRGGSRRSNTVKTASGASTSSAESGAVEAGTEKSDTYSTNMTQAMG AVLTYRHELGMNYNFIRPDLIVGSCLQSPLDVDKLRDIGVKTVFCLQQDPDLEYFGVDICAIQEYCLQCKDIEHC RAEIRDFDAFDLRLRLPAVISKLHKLVNHNGGVTYIHCTAGLGRAPAVTLAYMFWILGYSLNEGHQLLQSKRACF PKLEAIKLATADILTGLSKNSITLKWESDSCSSVEISGLDVGWGQIIPLTYNKEKRAWYLERELPVSIKISIILS ISRLYFLSLIILVLV*

>Ricinusc1_ommunis255584370 MNCLHYLPRSSALPFQGFKCHHQRKLSSSSSCSLSFNLMAISGSASSAETSGADVKEEEEKSEIYSHNMTEAMGA VLTYRHELGMNYNFILPDLIVGSCLQTPEDVDKLRRIGVKTIFCLQQDPDLEYFGVDITAIREYAKKCGDIQHLR AEIRDFDAFDLRIRLPAVVSKLYRAINQNGGVTYIHCTAGLGRAPGVAMAYMFWVQGYKLSDAHDLLLSKRSCFP KLDAIKSATADILTGLRGRLVTLTWKDSKCTTVEISGLDIGWGQRIPLKLDEERASWILKRELLEGCYEYKYIID GEWTYNKHEPVTSPNKDGHVNNYVQVLNDDTNSISAAIRKRLTGHDPDLMRDERLKIRQFLENCPKDEE*

>Glycinem1_ax255647912 MNCLRNLSRFSVLPFETPVTRHRKNLPLSLGFVNNSRQYPTMALKAASGSIPSADTSSADKEEEKSETYSHNMTE AMGAVLTYRHELGMNYNFIRPDLIVGSCLQTPEDVDKLRRIGVKTIFCLQQDSDLEYFGIDINAIREYAKTCNDI QHLRAEIRDFDAFDLRRRLPAVVSKLYKAINSNGGVTYIHCTAGLGRAPAVALAYMFWVLGYKLNEAHTLLQSKR SCFPKLDAIKSATADILTGLSKKSVTLSWEDKNCSTVEISGLDIGWGQRIPLNFDDKEGLWFLKRELPEGLYEYK YIVDGEWTCNSDELITSPNKDGHVNNFIQVLDDTSSVRAFLRKRLTADDPDLTTDEQLRIKEFLEACPDED*

>Vitisvin1_ifera147820654 MNCLQNLPRSSALPLQSFNCHGRKAICSVHLGVMNKTADLHRSIAAKAISGSISDTKMSDPEFKEEKSEIYSNNM TEAMGAVLTYRHELGMNYNFIRPDLIVGSCLQSPEDVDKLRSIGVKTIFCLQQDSDLEYFGVDINAIIEYANTFD DIQHLRAEIRDFDAFDLRLQLPAVVSKLYKAINRNGGVTYIHCTAGLGRAPAVALAYMFWVQGYKLNEAHSLLMS KRSSFPKLDAIKSATADILTDLKKQPVTLKWKDNSCTTVEISGLDIGWGQRMPLRFDKEQDLWILERELPEGHYE YKYIVDGEWTCNEHEHVRAPNKDGHVNNYVHVFDSDPHGVSAVLRKRLTGDDDPTLSRDERLKIRQILEACSDDD A*

>Selagine1_llamoellendorffii302803580XP002983543 VLTHIWFQAKQSSEEEGKEETSKKADDYSAVFQKITQSDLTYRHELGMNYNRVLPNLIVGSCLQNPADVDRLKKD ENVTTVCNLQQDPDMAYFNVDISEIRDHAKEVGDFNHLRLPIRDMDGFDLRMRLPSVIASLYQELKDREGTLYVH CTAGLGRAPAVALGYMFWVLGYDLHEAYLLLQSKRKCVPSMENIRAATCDLLTGMTRSPIGLLYRRGTCEHVEVA GLDIGWHSRLPFNFISRDGHWTLEHELPVGRYEYKYVIDKERWTYNPHAPITNPDRKGNYNNYIEVVDSDPENWD LRQYWRKEDAKLTDEQRRMILEKLEAMGSEL*

>Physcomi5_trellapatenssubsp.patens MASTIGVLGCKAPLLQDRVCISRTIECLSVRENRVCFAKRFCAPTPLARISTRSVHRDQLHPLQTTQINAMGENA EKKTEEKTETEAKTAEYSASMQQAMGTTLEYRHELGMNYAHVLTDLIVGSCLQTPADADKLKDAGVGVIFCLQQD PDLAYFGVDLPAIQAHVKELDGIDHYRCQIRDFDPYDLRMRLPVAVAQLHNAIEAHKGKTAYVHCTAGLGRAPGV ALAYMYWLRGLSLKEAND*

>Volvoxca1_rteri302836967XP002950043 MQKKMGTTLSYRHELGINYNRILPDLIVGSCLQTVEDVDLLAEKEGVRTVFCLQEDSDMAYFNLDVKPIQARCEE RGDIKHVRFPIRDFDPFDLRRKLPKAVTRLARDHNPANGTVYIHCTAGLGRAPATALAYMFWLRGYQLDAAYELL RGKRMCSPRIEAIRSATVDLLVGTEPVLATIGVSRIGTARDFKIAGLDVGWHQQLPLEREDGTGRMVLRRLLQPG KYAYKFIIDGNWTYSADRK*

>Chlamydo1_monasreinhardtii159475114XP001695668 MQSLALQNPGLQAPCRRLGLRRIAPVPVRLATSVAWSSSQSTASRRRSWVAAAAATGTADDAAATTSTSTAATSA ATEPTTDEEPVDADAAAEAEAKKSQAYSEDMQKKMGTTLTYRHEDGLNYNRILPDLIVGSCLQTVADVDHLYNKE NVRTIFCLQEDPDMAYFSLDIIPIQERCAELGLKHVRFPIRDFDGFDLRRKLPKAVARLARDHDPTAGTVYIHCT AGMGRAPATALAYMFWLRDFQLDAAYELLRGKRMCSPRIEAIRSATVDLLVGSEPVPVTIAVFRTGTATDFKIAG LDVGWHQQLPLEREPGTGRMVLNRVLQPGKYAYKFVVDGHWTYSADHPTLQDGNNTNNYVEVLGREVPEHLQLAQ QRLLQPGGDLTEEERAELRMMLCPWASHAALYGEPTGAAQPDMAAIDFDDRLP*

>Chlorell1_avariabilis307109692EFN57929 MQSTHREPEVVKTGDEDTEDLSDKYTDVMQERMGSAVLTYRHEDGTNFSRILEDLIVGSCLQQPADVDRRVVADG EDVRTVLCLQEDSDMAYFDLDLTPILERIGERGDVRHVRHRIRDFDPFSLRMELPGAVAALAQNAAANGGTAYVH CTAGLGRAPATALAYMWWFKGWHLEDAYQHLTGIRTCKPNAQAIRNAAADVLYGEPPTPVAGLDVGWGQQLDMAP DPQFQHRFLLQRCLPPGTYQFKFIINGRWGYAPNHPTRLDGDNLNNYVAVPYSNTDPAVQAARERLLAPGGRLTE EEAARIQQILLALPLNGGGTTGNLSATSSSSEE*

>Ostreoco3_ccustauri308800912 [Ostreococcus tauri] MFTTRSRACVPSSARERASERAKRSQATTTTRRGLADDDAYNAAMRAYSASPYEYKHDIGLYYHRIKPFLIVGTQ PQTPADVDRLRETEGVTCVFNTQQEKDWKYWNVDYDSVRARAIETGMRHVRYPFEDFSADSLREGLPSAAAMLDA EIERGETVYLHCTAGMGRSPGLAIAYMYWFLDAYNTLDGAYEGLTSIRPCGPKKESIRGATCDLLAAIESDGKVE PLKRELEENEGVVLSIVSKERIQRAVRAMLLSTSDETVAKPWWKVW*

>Ostreoco4_ccuslucimarinusCCE9901__145341998XP001416085 MTASMTHAIPSRAAGRPSRPSRARGGGGGAARSLSRKRTRQRAAADDDAYNAAMKAYSATPYEYKHELGLYYHRI RPNLIVGTQPTTAADIDRLRDVEGVTCVFNTQQDKDMEYWKVDFASVKRQIEKRGMKLERYPFVDFSADSLREGL PAAAAALDAAARRGETVYLHCTAGMGRSPGLAIAYMYWFLDDSLDGAYEALTSIRPCGPKKESIRAATCDILAAH SDEWPIKLLERELNEYEGARLSFGAKRMIRRKLRR*

>Micromon2_assp.RCC299_255077615XP002502442 YNKAMAEYSKTPFEYRHELGLYYHFILPNLIVGTQPQTREDIDRLKDVEGVTCMFDTQQDKDKDYWKVDAGAIRD QMNKRGVLHVRQPFVDFNADSLRVGLPKAVAQMDKVLREGHVVYCHCTAGMGRSPGVAIGYLYWCLNDSLDQAYD FLTSKRPCGPKKESIRLATVDMLR*

>Micromon1_aspusillaCCMP1545303275400XP_003056994_predictedprotein MNAHVAISRATTVATSSARVSSASARRRNPGKITSAASVASRCEKRVAFAGGARRRGDLTLTRVSPEEQQEAYNR AMAEYSKTPFEYRHDLGLYYHFILPNLVVGTQPTTPEDINRLRDVEGVTCMFDTQQDKDKEHWGVDAHAIREQMR ARDVLHVREPFLDFDADSLRVGLPKAVASMDKALREGHVVYCHCTAGMGRSPGVAIAYLYWCLNFESLDQAYDFL TSKRPCGPKKESIRLATCDMMWGHGDALPENMVTSDDPQGTRINEEERWNIVRRLRRSVGDDADALDAAQHAADA EECELNLGGTVKRALGFETTMEDCKEA*

>Arabidly1_opsislyrata297828590XP002882177 MQDLSSTVSYSPSFSYYASGEFAAAAVKVSRENQSFNMTDHQSVKVDSFNDEDADFEFETSPLHEETFVPFPTFK TDVVSIINKKDKETAPENISDRFSGNCLWSPLRSPAASKHSESSTSTPKQCRLKKFLTRSHSDGGVLAPSASKRC YIFKDLMRRSHSDGGSDKKSLLDRFMAFLQQISGLGALERSSPSILIGSSFRSGNGRVFDGRGIAYLGSREKVGF NRRSRLVLRVVAMSSSSSPFKMNLNEYMVTLEKPLGIRFALSADGKIFVHAIKKGSNAEKARIIMVGDTLKKASD SSGGSLVEIKDFGDTEKMRVEKTGSFSLVLERPFSPFPIQYLLHLSDLDMLYNRGRVSFVTWNRNLLSSNLRASQ GSGNSGYAAFSSKFFTSQGWKLLTRHSNSFQSGTQKNILSPPISPLVSVFSEEVPGDGEWGYGNFPLEEYIKALD RSKGELSYNHALGMRYSKITEQIYVGSCIQTEEDVENLSEAGITAILNFQGGTEAQNWGINSQKINDACQKSEVL MINYPIKDADSFDLRKKLPLCVGLLLRLLKKNHRVFVTCTTGFDRSSACVIAYLHWMTDTSLHAAYSFVTGLHAC KPDRPAIAWATWDLIAMVDDGKHDGTPTHSVTFVWNGHEGEDVLLVGDFTGNWKEPIKATHKGGPRFETEVRLSQ GKYYYKYIINGDWRHSTTSPTERDDRGNTNNIIVVGDVANVKPTIQQPRKDANIIKVIERVLTESERFRLAKAAR CIAFSVCPIRLCPKS*

>Oryzasat4_125561375EAZ06823hypotheticalp MALHLTAAPIIAPSAAAACRSLAPPLPAVSCSTSRRGWRRRRRCVSVVAMAAAADGEMRHGHAAEAGGGGAGTGT GRMNLNEYMVAVDRPLGVRFALAVDGRVFVHSLKKGGNAEKSRIIMVGDTLKKAGSREGVGLVDIRDLGDTEMVL KETSGPCDLVLERPFAPFPIHQLHQNEDYHLLFNKGRVPLTSWNGALLSSKLNESSEGNGNPGFAIFSPRLLNSH GWAVLSSEQDGLNQRSTSLANRISEIVGLYSDEDDADTEWAHGSFPLEEYIKALDRAKGELYYNHSLGMQYSKIT EQIFVGSCLQTERDVKMLSETMGITAVLNFQSESERTNWGINSEAINNSCRENNILMVNYPIREVDSMDLRKKLP FCVGLLLRLIRKNYRIYVTCTTGYDRSPACVIAYLHWVQDTPLHIAHKFITGLHSCRPDRAAIVWATWDLIALVE NGRHDGTPTHSVCFVWNSGREGEDVELVGDFTSNWKDKVKCNHKGGSRYEAEIRLRHGKYYYKFIAGGQWRHSTS LPTETDEHGNVNNVIRVGDIARIRPAPSQLQIRDPTVVKVIERALTEDERFLLAFAARRMAFAICPIRLSPKQ*

>Zeamays04_EU945336zea_mays RIIMVGDTLKKAGADGEGLVTIKDLGDTETALRDKSGPCSLVLERPFAPFPIHQLHQNEDYHILFNRGRAAVASW NSAVWSTKMNESSTGDGKLGFAVFSPRLLSSQGWALLSNEKGGLNQSSTNLANRVSEIVGLYSDEDDDNAEWAHR SFPLEEYIKALDRAKGELYYNHSLGMQYSKITEQIFVGSCIQTEKDVKMLSETMGITAVLNFQSESERINWGINS EIINSSCRENNILMVNYPIREVDSLDLRKKLPFCVGLLLRLIRKNYRIYVTCTTGYDRSPACVISYLHWVQDTPL HIAHKFITGLHSCRPDRAAIVWATWDLIALVENGRHDGSPTHSVCFIWNSGREGEDVELVGDFTSNWKDKIRCSH KGGSRYEAEVRLRHGKYYYKFIVGGQWRHSTSLPTETDEHGNVNNVIRVGDIARIRPAPSQLHIKDPSVVKVIER ALT*

>Sorghumb1_icolor242078999XP002444268 MAPHLLTPPALAITGAAAPGRLGGAASKADPRAPRPRAPSRTFFCSSPGTGRRGMMMMRRKGLSVAAAAAAQGAE PGGPAGPMRLNEYMVAVDRPLGVRFALGVDGRVFVHSLRKGGNAEKSRIIMVGDTLKKAGGDGEGLVTIKDLGDT ESALRDKSGPCSLVLERPFAPFPIHQLHQNEDYHILFNKGRAAVASWNSAVLSTKLNGSSTGDGKSGFAVFSPRL LSSQGWALLSNEKGGLNQSSTNLANRISEIVGLYSDEDDANAEWAHGSFPLEEYIKALDRAKGELYYNHSLGMQY SKITEQIFVGSCIQTEKDVKMLSETMGITAVLNFQSESERINWGINSETINSSCRENNILMVNYPIREVDSVDLR KKLPFCVGLLLRLIRKNYRIYVTCTTGYDRSPACVISYLHWVQDTPLHIAHKFITGLHSCRPDRAAIVWATWDLI ALVENGRHDGSPTHSVCFVWNSGREGEDVELVGDFTSNWKDKIRCNHKGGSRYEAEVRLRHGKYYYKFIVGGQWR HSTSLPTETDEHGNVNNVIRVGDIAWIRPAPSQLHIKDPTVVKVIERALTEDERFSLAFAARRMAFAICPIRLSP KQ*

>Ricinusc2_ommunis_255567329XP002524644 MSMSSLQLSSCSSAAHHHHNHNHNHNLFSQTSLRGRDLWCLLHGVLPWKKHNKVLSLSKRTLKVHATSDNTNNNS SLKMNLNEYMVNLDKPFGIRFALSVDGKIIVHALRKGGNAERSRIIMVGDTLNRAIDSSSGRLVPIIDFASEGSG NSGFITFSSKFLTSHGWKLLNDDPKGYVDLPSQKTLRSPVSQFVCVFSERESGDGEWAYGNFPVEEYIKALERSE GELYYNHGLGMRFRKITEQIYVGSCIQTEADVKNLSSVGITAVINFQSVAEAENWGINSNSINESCQRSNILMIN YPIRDADSFDMRKKLPFCVGLLLRLLKKNHRVFVTCTTGFDRSPASIIAYLHWITDTSLHAAYNFVTGLHFCKPD RPAVAWATWDLIGMVESGGHDGPATHAVTFVWNGQEGEDVSLVGDFTGNWKEPMKASHMGGPRYEVEVRLPQGKY YYKYIINGQWRHSTASPIERDERGNVNNIIVVGDIANVRPSIQQKKKDVNIVKVIERPLTENERFVLAKAARCVA FSVCPIRLAPK*

>Vitisvin2_ifera225452450XP002274298 MALYPLQLSSSRALHQSSFLGPFSASSWGRDFLCFPNATAAVTSSKKAIARVYAMSSDTSSSFKMNLNEYMVTLE KPLGIRFALSADGKVFVHALKKGGNAEKSRIIMVGDTLKKASDSPDGGLVEIKDYGDTQKMLEQKTGSFSLVLER PFSPFPIQQLHLMSDLDILFNRGRVPVATWNKTILASNLQTCSDGGGNSGFVTFSPKFITSQGWKFLMGQNGDVN SKMQRNILSPPISQLVCIFSEEESGDVEWAHGSFPLDEYIKALDRSKGELYYNHSLGMRYSKITEQIYVGSCIQT EADVETLSNAGITAILNFQSGIEAENWGINSRSINESCQKFNILMINYPIREVDSYGMRKKLPFCVGLLLRLLKK NHRVFVTCTTGFDRSPACVVAYLHWMTDTSLHAAYNFVTGLHSCRPDRPAIAWATWDLIAMVEKGKHDGPATHAV TFVWNGHEGEEVFLVGDFTANWKEPIKAVHKGGSRYEVEVRLTQGKYYYKFITNGQWRHSTASPTERDERANVNN VIVVGDIASVRPSIQQQKKDSNVVKVIERQLTENERFMLAKAARCIAFSVCPIRLAPK*

>Selagine3_llamoellendorffii__302788746XP002976142 MNLNEYLVTIHKPLGIRFAQTLSGKVFVDALAKQGNAENSRLIMVGDVLKKVLFGDAVWNVDDFGRVLNTMKTRS GEVTLVLERSLFPSHIQALILESKLENTFNYGAVGSATWNNSGFASPNQPESADEYGGGNVGFVTYAWRFLLPKG IRQLAIFFTGVDDEDDDDDENERSERAELDIWHNLEASFEVIARYSTDEDLDGQVEWSHGNFQLQEFTSAQLRAS KDLTYSHRFGLRYTKMTEHIVVGSCLQNGAEMQQLKQMGITAILNLQSESEQLNWGIDKSSITEAAQANGMLSVF LRFRDVDTVDLRRKLPLAVGILYRLLRAGHHVYVTCTSGMDRAPACVIAYLHWIQDVPLQSAVDFVTNLHLCGPD RPALVWATWDLIAMVEKRNHDGPPTHAVQFVWNHGCKEGEEVLIVGDFKGGGWNEPIKATHASGPKYIVDLRVPQ GKYQYKFIVGGQWRHSNSLPTEMDRWGNVNNVLFVGGRASTTPEGVIHSPMPQDLITVKVIERLLTEEERFTLAF AARRLAFSVCPSKFAPKSKGQI*

>Physcomi1_trella_patens168061634XP001782792 MACSKETGGGRVSSTGLWVKKEEAGKRSLVIRAASPNNVATGVVKNNKFKINLDEYMVTLEKPIGIRFAQTLSGK VYVEALAKNGNAESTMMIMVGDVLKKTSAVFGDGMWDVEDFSRSMQAIRSRSGPVSLIFERSPPPLQSDLKKKAD IPFNGGRVAFATWNGNILAYPYQGGTVGFLTFSSKYIMPKGLTRLANATAVQEPYETFSGRRPLGDFEADDDFEV IASASDEDEDEDTGTEWSHGSFNNDEYQAALARAQNDLTYKPYRGNTYTKLTDYIYVGSCIQSAEDISHLADNFG ITAVLNLQRKSEQVNWGINGQEIDNMARQKGIIVVDAPIRDVDTVDLRRKLPYAVGVLHRLLRRCHRVYVTCTTG LDRAPSCVIGYLHWIQDVSLPQAYDFVTSLHRSGPDRPALVWATWDLIAMVEEGKHEGLPTHSVQFVWNHGCNPG EEVLVVGEFTSDWTKPIKANHVSGTKFAVNLRLPQGRYMYKFIVGGHWRHAHNLPTDMDQWGNINNVIQIGDVAT SNFNNRSGPRMKDPTNIKVIERPLTEDERFTLAFAARRMAFSISPITFQSKR*

>At3g10940_LSF2 MSVIGSKSCIFSVARYTRENEKSSCFTSINKKSSLDLRFPRNLAGVSCKFSGENPGTNGVSLSSKNKMEDYNTAM KRLMRSPYEYHHDLGMNYTLIRDELIVGSQPQKPEDIDHLKQEQNVAYILNLQQDKDIEYWGIDLDSIVRRCKEL GIRHMRRPAKDFDPLSLRSQLPKAVSSLEWAVSEGKGRVYVHCSAGLGRAPGVSIAYMYWFCDMNLNTAYDTLVS KRPCGPNKGAIRGATYDLAKNDPWKEPFESLPENAFEDIADWERKLIQERVRALRGT*

>Arabidly2_opsislyrata297833882 MSVIGTKSCIFSVTIYSRGNEISSCFTSINKKSSIDLRFPRNLAGVSCKISGENPRTNGVSLSSKNKMEDYNTAM KRLMRSPYEYHHDLGMNYTLIRDELIVGSQPQKPEDIDHLKQEQNVAYILNLQQDKDIDYWGIDLDSIVRRSKEL GIRHMRRPAKDFDPLSLRSQLPKAVSSLEWAVSEGKGRVYVHCSAGLGRAPGVSIAYMYWFCDMNLNTAYDNLVS KRPCGPNKGAIRGATYDLAKNDPWKEPFESLPENAFEDIADWERKLIQERVRALRGT*

>Populust2_richocarpa224144694XP_002325379 MGRTGIHCKLSGVEDNPTGKNLSLSSTNRMEEYNIAMKRMMRNPYEYHHDLGMNYTLITDNVIVGSQPQKPEDIE HLRHEENVAYILNLQQDKDIEYWGIDLQSIKQRCQQLGIRHMRRPATDFDPDSLRSALPKAVSSLEWATSEGKGR VYLHCTAGLGRAPAVAIAYMFWFCCMNLNTAYDTLTSKRPCGPSKRSIRAATYDLAKNDPWKEPFESLPEYAFED IADWERHLIQDRVRSLRGT*

>Oryzasat7_222616506EEE52638 MAAMATAPCFPATPGLPARGAVAARSRMAAGGSRSQRRRSSSGVFLCRSSTTGSTRMEDYNTAMKRMMRNPYEYH HDLGMNYAIISDSLIVGSQPQKPEDIDHLKDEEKVAFILCLQQDKDIEYWGIDFQTVVNRCKELGIKHIRRPAVD FDPDSLRTQLPKAVASLEWAISEGKGRVYVHCTAGLGRAPAVAIAYMFWFENMNLKTAYEKLTSKRPCGPNKRAI RAATYDLAKNDPHKESFDSLPEHAFEGIADSERRKPREQIQQEPEESVDQETGTRESTGGLGCPAAIINQRINNI SVFRQKGINLIEFENDESMSASEGTSPARRSTRSPGVGVSASAPVRNDAHVVSPASDVTQSVGDEPPPPLPGMPG IKVPGGVVCDDLPHGGGSEPPAGGDSGRAGTDDDDGGGDGFGGAARGDGLAPGNPGKRWRLPGKPGKRRTRASDE TKAAAAFLLGLIASDGVAIITEATTTSKTTAARLDAGDAIIFFFA*

>at3g01510_LSF1 MAFLQQISGLGALERSCPSIMIGSSFRSGNGRVFDGRGIAYLGSREKFGFNRRRRVVLRVVAMSSSSTPFKMNLN EYMVTLEKPLGIRFALSADGKIFVHAIKKGSNAEKARIIMVGDTLKKASDSSGGTLVEIKDFGDTKKMLVEKTGS FSLVLERPFSPFPIQYLLHLSDLDLLYNRGRVSFVTWNKNLLSSNLRASSQGSGNSGYAAFSSKFFTPQGWKLLN RQSNSFQSGTKKNILSPPISPLVSVFSEDVPGDGEWGYGNFPLEEYIKALDRSKGELSYNHALGMRYSKITEQIY VGSCIQTEEDVENLSEAGITAILNFQGGTEAQNWGIDSQSINDACQKSEVLMINYPIKDADSFDLRKKLPLCVGL LLRLLKKNHRVFVTCTTGFDRSSACVIAYLHWMTDTSLHAAYSFVTGLHACKPDRPAIAWATWDLIAMVDDGKHD GTPTHSVTFVWNGHEGEEVLLVGDFTGNWKEPIKATHKGGPRFETEVRLTQGKYYYKYIINGDWRHSATSPTERD DRGNTNNIIVVGDVANVRPTIQQPRKDANIIKVIERVLTESERFRLAKAARCIAFSVCPIRLCPKS*

>Oryzasat9_115483819NP001065571 MAAMATAPCFPATPGLPARGAVAARSRMAAGGSRSQRRRSSSGVFLCRSSTTGSSRMEDYNTAMKRMMRNPYEYH HDLGMNYAIISDSLIVGSQPQKPEDIDHLKDEEKVAFILCLQQDKDIEYWGIDFQTVVNRCKELGIKHIRRPAVD FDPDSLRTQLPKAVSSLEWAISEGKGRVYVHCTAGLGRAPAVAIAYMFWFENMDLRTAYEKLTSKRPCGPNKRAI RAATYDLAKNDPHKESFDSLPEHAFEGIAGSERSLIQERVRALREA*

>Zeamays03_223944219ACN26193 MAAMANTSRLPTPCSLPTISIGAKSRRLAVAAVRCGPGGSRSHRRSLGVLLCRSSSTAGAQGSTRMEDYNTAMKR MMRNPYEYHHDLGMNYAVISESLIVGSQPQKPEDIDRLKNEERVAYILCLQQDKDIEYWGIDFQSIVNRCKELGI QHIRRPAVDFDPDSLRSQLPKAVSALEWATSQRKGRVYVHCTAGLGRAPAVAIAYMFWFENMDLNTAYQKLTSIR PCGPSKRAIRAATYDLAKNDPSKEPFENLPEHAFEGVADWERKLIHDRVRALHEA*

>Sorghumb2_icolor242082782XP002441816 MAAMANTSRLPTPYSLPTVNIGGKCRRLAMAAVRCGPGGSRSHRRSLGVFLCRSSSTAGAQGSTRMEDYNTAMKR MMRNPYEYHHDLGMNYAVISESLIVGSQPQKPEDIDHLKDEERVAYILCLQQDKDIEYWGIDFQSILNRCKELGI QHIRKPAVDFDPDSLRSQLPKAVSALEWAISQRKGRVYIHCTAGLGRAPAVAIAYMFWFENMDLNTAYKKLTTIR PCGPSKRAIRAATYDLAKNDPSKEPFENLPEHAFEGIADWERKLIQDRVRALREA*

>Ricinusc3_ommunis__255559653XP002520846 MAAATNCISLSSLLFTYPHGKEVFLIRKKSTCKFMVSKNCYKMGRINCKLTDSGVEENPTRKHFSLSSNNRMDDY NIAMKRMMRNPYEYHHDLGMNYTLITNNLIVGSQPQKSEDIDHLKHEENVAYILNLQQDSDIEYWGIDLQSIRER CQELGIRHMRRPAKDFDPDSLRSILPKAVSSLEWAISEGKGRVYVHCTAGLGRAPAVTIAYMFWFCDMNLNAAYD ELTSQRPCGPNKRSIRGATYDLAKNDPWKEPFENLPEHAFEDIADWERSLIQDRVRALRGT*

>Vitisvin3_ifera225448641 MGAIGNSCFHLAFKNPIENGVVLMKNKSSYYNTVMKGMMRNPYEYHHDLGMNYTLITDHLIVGSQPQKPEDVDHL KQEENVAYILNLQQDKDVEYWEVDLPSIIKRCKELEIRHMRRPARDFDPDSLRSGLPKAVSSLEWAISEGKGKVY VHCTAGLGRAPAVAIAYMFWFCGMDLNTAYDTLTSKRPCGPSKQAIRGATYDLAKNDPWKEPLESLPERAFEDVA DWERNLIQDRVRSLRGT*

>Selagine6_llamoellendorffii__302814724 AAKSDDYNRAMQRQMRNPYEYHHDLGMHFTVIEKNLIVGSQPQCKEDITRLYEEEGVRAILNLQQDKDVEYWGID LPAIMKQSASHGIAYFRIPARDFDPNSLRNELPRAVAALESAISSGSVYVHCTAGLGRSPAVAIAYLYWFCDMDM DTAYSLLTSKRPCGPKKEAIRGATYDLANDDPFKLPLEKLPEDAFTDVSEEERRLIQQRVRKL*

>Physcomi2_trella_patens168010761XP001758072 MASSVVSVSQMSSSAAFNSSTRSVYQIVNVGGSCGNSSWRTCPAPVNVSSSMSTSSFQVLNVESSAKSFGLLAVG NSASVRASGGASSKSICPGGGIVPGTRREMICHAKIEDAQSEEVPPMTVEEQAGWEQRQKEAAEIKDPADPFQWR WTLNWNQITPNIIVGSCPRSPGDIDRMVNEAGIDAILNLQCDLCFDALKIPFDAIRTRAVERGVRLERVAIRDFD HADQSLMLPVAIRVLNSLVGRGMKVYVHCTAGINRATLTTVGHLTFVQQMDLEDAVASVKSSRPVAHPYIDCWSE ARRRLLDGRKDEVTRASVELYETRLQNGIKGTKETDWFDAEKLVISRTFQRYLETDLAMIDMETAWLKRKFELDH QAATGGNGVPTKDSII*

>Physcomi3_trella_patens168036620XP001770804 MASSTASVSQMSSSAAFNSSTRSLYKTVKVEGSRVNSSWRTCSAFVNISSLLSVSSFQSLNVENSAKSLSLRAVD NVSNIRASGATRLKPVSSSAGNVSLSSDRETTANEQTEMAPMSAAERMGWEQRQKEAAETKDAADPFQWRWTLNW DQITPNIIVGSCPRSPGDIDRMVDEAGIDAVLNLQSDLCFDALKIPYDSIRKRALERGIRLERVAIRDFDHADQS LMLPVAVRLLNSLIGRGMKVYVHCTAGINRATLTTVGHLTFVQQMDLEDAVALVKSCRPVAHPYIDCWIEVRRRL LDGRKDEVTRTSEQLYQARLQSGTEGTEDQDWFKAEKVVVSQTFQRYLETDVALIDMETAWLKRRFELEHKSNLG GNGSVHYL*

>Osteroco1_ccustaurii308810913XP003082765 MSLTVATRRARAVTPSPATRARETRGIFVPSARWSATARAFDGRRRARWSSSHGRGCARRCGATANVDQDAMKDE QSGYKKRAEESKEAGVKEKRPPRGGQAPAPGSWSWTLNWDYITIDPQTNELIEEPSAKDCARARMLIGSCPRTAE DVDRLVDEAGVEAIVCLQCAMCHSAMEIDWQAVRRRALEREVMIVQVSVRDFDRLDQAKMLPEAVRKLAAFQAMG KRTYVHCTAGINRASLTVVGYLTFVKQFNLEDALRVVRTCRPQANPYVVSWQIARARLLANRVEDTYLYTQVTAG GNSSKEGGDWIQRDLERAEKGVIAEIFKRAIDTDLSMYGALLDFPRKK*

>Ostreoco2_ccuslucimarinus145353419XP001421011 MRDEASGYEKRAAESAAAGVEEKRPPRGGDAPAPGSWSWTLNWDHVMFDTRGRVIEEPTKEDMARARVLIGSCPR NAEDVDRLVDEAGVEAIVCLQCSLCHAAMEIDWQSVRRRAIERGVMIVQVNVRDFDRLDQAKMLPEAVRKLAAFQ AMGKRTYVHCTAGINRASLTVVGYLTFVKMFDLEAALHAVRTSRPQANPYVVSWEIARARLLAHRLEDIYLYSQV DAGGNTIDDGGDWIKRDLERAEKGVIAEVFKRAIDTDLSMYGALIEGDYQQQRH*

>Micromon3_assp.RCC299255089857 MASQCFASTLAAAPVARLARRSRAPAAPRPARSLTIAAASATEKDDKALESESAGAKKRDEESKAAGIQERRPPR GGEAPPPGKWEWTLNWDPVVYDKVNAAPLTAPTEAELNAAGCVIGSCPRTPSDVDRLIDEGGVEAIICLQCELCH GALMIDWEPIRARALERGVPIVRVSVRDFDRLDQAKMLPEMVRKLALFRAMGKRTYVHCTAGINRASLTVLGYLT FVEGMTYDQALAIVRESRPQANPYAVSWEIARERLLAGRTEDVYLYTQVELGGNSIEEGGDWIKRDLESASIGVI QSIFNRGVEADLSFTAALTDMCAAAIKAGGK*

>Micromon4_aspusillaCCMP1545303279096XP_003058841 MASVIHAASRVAPAPPRARARFARASSSSSVARGSAVARRTMLNRRRSAGRSAIAPILRAVASSDADEAMKSEES AYDKRKEESEAAGVKENEIRPPRGGDPPPTGGRWEWTLNWDPVVFDASASPPAAIESPTDAQLASSKLVIGSCPR SPADVDRLIDEGGVEAIICLQCTLCHGALEIDWEPIRRRALDRDVPIVRVAVRDFDRLDQAKMLPEMVRKLALFQ AMGKRTYVHCTAGINRASLTVLGYLTFCKARPLASMTPFPYDRVGVVNADP*

>Volvoxca2_rterif.nagariensis302831047XP002947089 MGWSHLNPLEYHFERGLYYHEIVPNLICGTQPRNASDVDILAESERITHILNLQQDKDMHYWGVKLEDIRRACSR HSINHMRRPARDFDPHSLRRTIPGAVHSLAQALNSGGSRVYVHCTAGLGRAPAVCIAYLYWFTQLQLDEAYSYLT SLRPCGPKRDAIRGATYDVLANGADFQAFDSLPSDSFATLTEDDRFALQYRILRGLC*

>Chlamydo2_monasreinhardtii__159474008XP001695121 MGWSHLNPYEYHWDRGLYYHEIIPNLICGTQPRNAGEVDTLADNEGITHILNLQEDKDMHYWGVKIEDIRRACAK HSINHMRRPAKDFDKGSLRKAIPGAVHTLAGAMAGGGRVYVHCTAGLGRAPGVCIAYLYWFTDMQLDEAYSHLTT IRPCGPKRDAIRGATYDVAHAPPLPFESLPEQAYATLSEDDRFALQYRVLKGLC*

>Chlorell2_avariabilis307103658EFN51916 MDNPFEYDFNRGLYYHYVAPDVIVGSQPRNALDVDALAAEGVGVILNLQQDKDMAYWKVSLKEISERAAHHGMRL VRTPAVDFSPHSLRDTLPTAVSALERSRAAGDKVYVHCTAGLGRSPAVAIAALYWFTDMQLDEAYAYLTGIRPCG PSKDAIRGATYDLLSGRPHEHFQHEPSHAFATLSSWDRDAIRAKLLHG*

>Homosapi1_ens11321613NP005661 MRFRFGVVVPPAVAGARPELLVVGSRPELGRWEPRGAVRLRPAGTAAGDGALALQEPGLWLGEVELAAEEAAQDG AEPGRVDTFWYKFLKREPGGELSWEGNGPHHDRCCTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIA GHQAMHYSRILPNIWLGSCPRQVEHVTIKLKHELGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMIKLYREEG LAYIWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWNLRKVQYFLMAKR PAVYIDEEALARAQEDFFQKFGKVRSSVCSL*

>Homosapi2_ens66346728NP001018051 MRFRFGVVVPPAVAGARPELLVVGSRPELGRWEPRGAVRLRPAGTAAGDGALALQEPGLWLGEVELAAEEAAQDG AEPGRVDTFWYKFLKREPGGELSWEGNGPHHDRCCTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIA GHQAMHYSRILPNIWLGSCPRQVEHVTIKLKHELGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMIKLYREEG LAYIWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWNLRKVQYFLMAKR PAVYIDEEAASQDTFPL*

>Oryctola1_guscuniculus_291397114XP002714906 MRFRFGVVVPPAVAGARPELLVVGSRPELGRWEPRGAVRLRPAGTAAGAEALVLQEPGLWLGEVELAAEEAAQDG AEPGRVDTFWYKFLKREPGGELAWEGNGPHHDRCCTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIA GHQAMHYSRILPNLWLGSCPRQVEHVTIKLKHELGVTAIMNFQTEWDIVQNSSGCNRYSEPMTPDTMIKLYKEEG LVYIWMPTPDMSTEGRVQMLPQAVCLLHALLENGHTVYVHCNAGVGRSTAAVCGWLQYVMGWSLRKVQYFLMAKR PAVYIDEDALARAQDDFFQKFGKVRSSTCSL*

>Musmuscu1_116063575NP034276 MLFRFGVVVPPAVAGARQELLLAGSRPELGRWEPHGAVRLRPAGTAAGAAALALQEPGLWLAEVELEAYEEAGGA EPGRVDTFWYKFLQREPGGELHWEGNGPHHDRCCTYNEDNLVDGVYCLPVGHWIEATGHTNEMKHTTDFYFNIAG HQAMHYSRILPNIWLGSCPRQLEHVTIKLKHELGVTAVMNFQTEWDIIQNSSGCNRYPEPMTPDTMMKLYKEEGL SYIWMPTPDMSTEGRVQMLPQAVCLLHALLENGHTVYVHCNAGVGRSTAAVCGWLHYVIGWNLRKVQYFIMAKRP AVYIDEDALAQAQQDFSQKFGKVHSSICAL*

>Monodelp1_hisdomestica126311178XP001381051 MLFRFGVVVPPGVAQGRPQLLLVGSRPELGSWEPERAVPMKRAAGGAGPSGARGLQEPGLWLAEVELACQPPAEG AGPGVPETFWYKFLKREPGGEFSWEGNGPHHDRCSTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIA GHQAMHYSRILPNIWLGSCPRQLEHVTIKLKHELGVTAVMNFQTEWDITQNSSGCNRYPDPMTPETMIRLYKEEG IVYVWMPTPDMSTEGRVQMLPQAVCLLHGLLENGHTVYVHCNAGVGRSTAAVCGWLKYVMGWNLRKVQYFLMSKR PAVYIDEEALARAEEDFYQKFGKVCSSLCNF*

>Gallusga1_llus71894761NP001026240 MLFRFGVVLPARIAEGGGALLVAGSRPELGEWDPQRAVPMQPARPAAALAAQEPVLWLGEVLLSDEDTASPFWYK FLRREGGQLLWEGNGPHHDRSCVYNQSNIVDGVYCLPIAHWIEVSGHTDEMKHTTDFYFNIAGHQAIHYSRILPN IWLGSCPRQLEHVTVKLKHELGVTAVMNFQTEWDIVQNSWGCNRYPEPMSPEVLMRLYKEEGLAYVWMPTPDMST EGRIQMLPQAVCLLHGLLQNGHTVYVHCNAGVGRSTAAVSGWLKYVMGWSLRKVQYFLASRRPAVYIDEEALIRA EDDFFQKFGPLRSSCKVEE*

>Parameci1_umtetraureliastraind42145478487XP001425266 MKKKSVSTSASLPMPYFNVYFKIHYHTQPGKAIYIVGDCNILGNWVSTKGVRLQWNENDEWTVCVKIDRSQYVKI EYKFIVNNYDYPTLNDTLWEPGENRVITNHMIENETKSEYFNCEYWGYRTIKLKLNYNLPDKQRMMIVGSIEQLG SWIHPVLMKQQSKIDIISREIVQQWSISFIVDSMHFSFRYFYVIRNDGSMIWERGNGRYLKSSDLKSFRLVQDQY STSPIKIKTQMLTACQHQRLHNKKNGSFCSEKQQKLQKQISMGYSFSDKEPSFFYYESFGRLNKLDWNFVVQFSI TQINENIIIGPYPQNEQDIVVLKDFGVKAVLNLQTRLDVYHRGVDWDEILSSYKKHNIQMKNFEIFDMDPQDFEK

>Parameci2_umtetraureliastraind42_145496242XP001434112 MKKQSISTTATLPIPYFKVYFKIHYHTQPGRAIYIVGDCSILGNWVPTKGLRLQWNENDEWTICIKIDRSKYSKI EYKFIVNNYDYPTMNDILWEPGENRVITNHMIQNETKSEYFNCEYWGYRTIKLKLNYNLPEKQRMMVIGSIEQLG QWIHPVLMKQQTKIDILNGESVQQWSISFIVDSMNFSFRYFYVIRNDDTGSMIWERGNGRFLKSSDLRSFRQVQD QYAKSPIKIKTQLLTACQHLRHHNKKNGSFCSEKQQKSNKETSMGYSFSDKEPSFFYYESFGRLNKLDWNFVVQF SITQINENIIIGPYPQNEQDIINLSNYGIRAVLNLQTRLDVYHRGVDWDEILASYKKHNIYMKNFEIFDMDPQDF EKKITKAVQILKKLINQYEFVYIHCTSGIGRAPSLAVIYLASVLQIPLDQAIAFVKSKREHFYINLSMLKKALQK TMIYNNGLGYDQIPEINEFQIQSSGPMLIQQFDYNILC*

>Parameci3_umtetraureliastraind42145552513XP001461932 MKLQLYSLFSQEVYCKNQTQVESQIKTKNDLKLYGNSRIASRIQTVRNGNAWNQLNDVIDNEILYLNLPQIIMKQ QSMSTNMTLGSLYFKVYFKIHYHTESGKAIYIVGDNKQLGNWNPVKGLRLQWNENDEWTICIKIDRSQYQKIEYK FIVNNYENPSLQASIWEPGENRVITTHMIQNETKSEYFNCEYWGYRTIKLKLNYNLTQRQRMMIVGSIEQLGQWS HPVLMKCQQKIDILNGEPVQQWSISFIVDPMNFSFRYFYVIRNDENGSMIWERGNGRYLKSSDLRSFRQVQNAYA NSPIKIKTQLLAACQNYRLQKFKNGSFCSDRQLKINKETNLGYSFSDKEPSFFYYESFGRLNKLDWNFVVQFTIS QISENIIIGPYPQNEQDILVLKQNGIKAVLNLQTRLDIYHRGVDWDEIQNTYKKNDMVMKNFEIFDMDPVDFEKK AFKAVQMLKKLINNYEFVYVHCTSGIGRAPSLVVLYLATVLQVPLNEAISFVKSKREHFYINHDMLRKSLQKTTM YNNGEGYEKAQVYNQFQIGSQTKIYEKKFDYNLLY*

>Parameci4_umtetraureliastraind42145478153XP001425099 GEVNKDPMSIIKKLHKQMFQDQTQDVVVGYVNTYNKDNLITELPYLLQQMEYKQIYNQIRQQVLVIFHYIGGATE TVPITLIKKARNRYGQNPIEANQPKDYYVSTNMDLKQVRRNYQETVQSEAEPFKIKEDIQNDEINKLYYLCLKEC EAKHLRNYLQTKEYQLEYFKFQSDGQMFRMNRVKKVQKQRANEFLVLNNEIQQLNLQQMIMKQQSISTNTTLGSQ FFKVYFKIHYHTQPGKAIYIVGDNKQFGNWNPIKGMRLQWNENDEWTICIGVDRSQYQKIEYKFIVNNFDNPTLQ DPIWEPGENRVITTHMIQNETKSEYFNCEYWGYRTIKLKLNYNLAQKQRMMVIGSIEQLGQWSHPVLMKPQQKID ILNDEPVQQWSISFIVDPMNFSFRYFYVIRNDENGSMIWERGNGRYLKSSDLRSFRQVQTVYAKSPIRIKTQLLA ACQNQRLQKLKNGSFCSDKQLKINKETSIGYSFSDKEPSFFYYESFGRLNKLDWNFVVQFAITQINENIIIGPYP QNEQDILVLKSQGIKAVLNLQTRLDIYHRGVDWDEIQNSYKKNDIIMKNFEIFDMDPIDFERKAFKAVQLLKKLI NNYEFVYVHCTSGIGRAPSLVVLYLSTVLQIPLNEAISFVKKKREHFYINHNMLRKSLQKTEIFNNGEGYEEISV FNQYQIGSLQKIYEQKFDYNLLY*

>Tetrahym2_enathermophila118368910XP001017661Dualspecif MKKGDVWEYKKVILRLFESTRIISCDSPNCISQKCPQLQKSLSTPPKLNNHNSQDSKQSLIDNQLVVQQSREHSS SISSNSTQASEESSGSPGTPEKFGSSGNLIKERYYVTANIESMGNMIKLQKMKKYTYRSQANTASNCSEQKEIVF WEKQFLVKASQQNLKYKYAYKSEAQYNDQMERKSHKINFSKNDVDYQGAFCINDKWQMNQFNFDKITDNISLGPY PENQEQIKMLAQSGVKAVFNLQTEQDMEYHGTNWESIKKLYSSNGIKVIHYPVTDMDVHDMAYKLHDAVDKFAMA IEKWNHVYIHCTSGIYRSPQVIVAYLNLYHEIDVNKAISQVESKRPITKKNRDYLRQVYAIKGGYGKIIRQINCQ ILEESNRKSKSPSTNQFSPLLKIDPKVSKVHHNLQKLSKASSVQIESNEKICQHNSSNKQVEQNIKSHHNVHKTS STSNINSKCFYNSTTTHSRKYKPSEENIQSSNRKHHSPPQVSTFKIQIKKAKHQETLASGINSPSKKNCQEEDTL QQILFNMKAILPKYGHEQIEKDQIDSTRQ*

>Parameci5_umtetraureliastraind42_145535153XP001453315hypothetic MNSVSTNISFSDIRDVKIYLKIHYNTSYGQAIYLCGDDERLGIWDSTKAIRLQWNENNEWTACLKLPRICKKFEY KFLLNDYDNPSREKEFWEPGENRIITKHLLLNGKKGEYFNQEYWGYRTIKLKLNHNLQFGERMMIIGSIPEIGSW KTPVLMKQQQKIDILTQEPIQQWSISFIVNPLNFYFRYYYVIRNDESGNMIWERGNGRYLKTADLSSLRQVLDQY ALHPIKVKTQIYTAFQTKPQYKNGSFSTTKIPKQKIKPNNQGYQFADKEPSFFYYEEFGRLNKLDWNFVVQFQTY EINENILIGPYPQNEQDILLLKQKQVKAVLNLQTRLDMFHRGVNWEQIVDAYKRQNIVMKNYQIFDMDAEDFEKK SNKAVQILKKLINEHEYVYVHCTAGIGRAPSIIVLYLSSILQYDLKDAIEFVKQKRQQFYVNYSMLKKSLQKTLV FNHGLGYQNLAQTL*

>Parameci6_umtetraureliastraind42145511746XP001441795hypothetic MNSISTNASFTDIRVVKIYFKIHYNAEYGQAIYLCGDDERLGIWDSTKAIRLQWNQNNEWTTCLKLPRICKTFEY KFLLNDYNNPSPQKELWEPGENRIITKHLLLNGKKGEYFNCNKEYWGFRTIKLKLNHNLQPGERMMIIGSIPEIG SWKTPVLMKQQQKIDILTQEPIQQWSISFIVNPLNFYFRYYYVIRNDESGNMIWERGNGRYLKTADLSSIRQVLD QFALHPIKVKTQIYTAFESRTQKRNGSFQATKIPKSKMKLNNQGYQFADKEPSFFYYEEFGRLNKLDWNFVVQFQ TYEINENIMIGPYPQNEQDILMLKQKQVKAVLNLQTRLDMFHRGVNWEQIVDAYKRQKIVMKNYQIFDMDAEDFE KKSNKAVQILRKLINEYEYVYVHCTAGIWRAPSIVVLYLSSILKYDLKEAIELVKQKRQQFYVNYSMLKRSLQKT LVFNHGLGYQNLASTL*

>Parameci7_umtetraureliastraind42145516795XP001444286hypothetic MNSVSTNTSFTDVRIVKIYFKIHYITQFGQAIYLCGDDESLGMWDPCKALRLQWNQNNEWTICVKMPRIARKFEY KFLVNDYNEPSICKAFWEPGENRIITKHLLLNGKKSEYFNQEFWGYRTIKLKLNHTLQPKERMMIIGSIPEIGSW KSPVLMKQQLKIDILTQEPIQQWSISFIVNPLNFFFRYYYVIRNDETGNMIWERGNGRYLKTADLSSIRQVLDQY DLHPIKVKTQIYTAFQARQIHKNGSFSSSKIPKSKIKKNNQGYSFADKEPSFFYYEEFGRLNKLDWNFVVQFQTY EINENILIGPYPQNEQDILYLKQKQVRAVLNLQTRLDMFHRGVNWEQIVDAYKRHNIVMKNYQIFDMDSEDFEKK SNKAVQILKKLINEYEYVYVHCTAGIGRAPSIVVLYLASILQYDLKEAIEFVKQKRQQFYINYSMLKKSFQKTLV FNHGLGYQDLTQTL*

>Parameci8_umtetraureliastraind42145526228XP001448925hypothetic MNSVSTNTSFTEVRIVKIYLKIHYITKFGQAIYLCGDDESLGMWDPCKALRLQWNQNHEWTICIKLPRISRKFEY KFLVNNYNEPQIHYAFWEPGENRIITKHLLLNGKKNEYFNCNAVLILFIEEFWGYRTIKLKLNHTLQPKERMMII GSIPEIGSWKSPVLMKQQMKIDILTEEATQQWSISFIVDPLNFFFRYYYVIRNDDTGKMIWERGNGRYLKSADLS SIRQVLDQYDLHPIKVKTEIYSAIQARQFNKNGSFSSSKIPKSKIKQKNQGYSFADKEPSFFYYEEFGRLNKLDW NFVVQFQTYEINENILIGPYPQNEQDILYLKQQQVRAVLNLQTRLDMFHRGVNWEQIVDAYKRHNIVMKNYQIFD MDSEDFEKKSNKAVQILKKLINEYEYVYVHCTAGIGRAPSIVVLYLASILQYDLKDAIEFVKQKRQQFYINYSML KKSYQKALVFNHGLGYQELALTF*

>Cyanidos1_chyxonmerolae MARIRTSDRRNTNDQAGSESRHRVPSMARAPAADSSGAQSTPAARRASEGVSVAEPPSKPAADSSGAQSTPAARR ASEGVSVAGPPSKPAADSSGAQSTPAARFASEGVSVPEPPSKPAADSSGAQSTPAARGASEDISVPGPPSDIADT ISKNDRSVTPTIPTLFRVYCHTEFGDAVVAAGSHDKLGNWEPAKALRLRHQCQVDTPFRDCWEGEVDLVPETSFE FKFVRLIGGDPQRALWETGPNRRAVIQRNSKDGCLIEVEWERTRVLFSIYYPTKEKQHLCVTGDLPEIGRWVEPG PVPMALSTTEERLETGGKGRRWSLTVSVPSTVGKFAYRYVLVDDNRQQTIWEREPNRYATLERAVNGRLECFDAN FVASLEFDEICPDIYIGPYPQTPEHVEMMHEAGITAVLNLQTDEDFAHRSIPWSTLMETYTALEMQVIRCPIPDF NAEALMQLLPDAVRALDAALKAKRVVYVHCTAGMGRAPAVVVAYLVWRRGMTLEDALSHVKARRAVAAPNVTVLE KVLRNPL*

>Tetrahym1_enathermophila_146162797XP001010090.2Dualspec MGFLADNPQDDSEALSTAAQATPSTTQGSSESSTDTISSSTQISGFSTASLVKKHQSQVKFKIHYQATFGQSLYV VGSIKALGKWRIEKSLLMKWTEGHYWVASLASSEIDSAELDKLIEYKYCLANTVYPEQYLILWEEGPNRSFYFPE VSQLCLNDFWSLRKVILRLFEAKNDKDLYVLGDCEEIGTYESPSKMERVVQTNFADLSQTVFFEKTLLVKAHKQK IKYTYAKKDTNQMLIIDRQPRKLQIKQPPKHIHIEINDNSFSSKFQVTKIDDNIYLGPYPQSEEDVKELSERGIR AVLNLQTEKDMQLKGAAYIKLLRFYKTYNIQPFHFPVIDMDVIDMCYKLQDVSRLLNYLVSTMKRVYVHCTAGMF RSPQCVIGYYTYFKNMKVQQAIKYVENQHPHSKINKGYIETIMNCTPVPTYNSMNDIDKKYQKNHISDSDFFDDD DDDDNNNLLISHTESIININNITDISPILT*

>Tetrahym3_enathermophila118388320XP001027258Dualspecif MNNNNYNNIHRYDFNIVEFKVVCKVEWGQTVGVMGNIASLGNWEDQGIFPLTWTENHVWKGEMLIMPGCEQNFEC KFVIFHKDDKNNTNTLIEMEKGANRQYNLNNPAQFITEQQQESQQSTQDNKFIISEIFNMRKVTFSVYSTPNSII HISGNIPQFKVPQQLCYEQQIPSMNCDQQLSLYYQDFYVDMREDMMRYYYIKNFVQVERGIGRYIIFNNHITKPF AIQYSLAIPEVFKEELSFDNITENLSIGSFINSANDISSLHKLGVTAIVNLQTKRDMERKYVNAQEIRKICKSKG ILFINTPIRDNDPVDYVQRAPEVLDIIEDLYKANHHIYIHCTAGIGRAPQTAILHLVLHRNYKINEASELIFSKR PVSSPNKEAIISVLKMLSESSSESTSDRSIDEEEEDYRSEEFCSSNQISAKIM*

>Leishman1_iamajorstrainFriedlin157864661XP001681039dualspecif XMGGGATPATACSQVVDLRALDPSPCPSRGSRSPLRSGHVHQRGSRCEDVPATQILEFLYLGSVKDAQDAAFLAR HQIRYIINVSQEEYWSVDKKVQIFTFKVDDSATADIAALFQPTRDLITSIRGRYYRYARGESSTRPAVLVHCQKG RSRSATIVLAYLIYTNGWSVAEAMKYVGARRPCAEPNIGFMEELRKLQESLSFEERTRRYSELCWFMRNLNAETS QAHVRELFEKHIGMVRHVVTYVVAGGGAGSAGNSGTVGTCGDGSRVAVEGDARFPASWPAVPSTGDSGPADPLVA FDLGVAPGKQNDGVDEQRAAKDGDTHVYKDAGAAVSLLRGRSTEKAMLCFVFFTCREDVLHGIKSGQFQQLLLQL PPAAGKQIKYATGPKLKKIMTEHQSMSNSFVQDMAGFSDHLTTPTGGESAAADADGAVRTQGVVAPASKEAEVTL PA*

Unambiguously aligned character set used as input for phylogenetic analyses

At3g52180 NYNFIRPDLIVGSCLQEDVDKLRKIGVKTIFCLQQDPDLEYFGDISSIQAYAKKYSDIQH
Arabidly3 NYNFIRPDLIVGSCLQEDVDKLRKIGVKTIFCLQQDPDLEYFGDIRSIQAYAKKHSDIQH
Populust1 NYNFIRPDLIVGSCLQEDVDKLRKIGVKTVFCLQQDPDLEYFGDISAIRDYAKACGDIQH
Orysasat1 NYNFIRPDLIVGSCLQLDVDKLRDIGVKTVFCLQQDPDLEYFGDICAIQEYCLQCKDIEH
Zeamays01 NYNFIRPDLIVGSCLQLDVDKLRKIGVKTVFCLQQDSDLEYFGDIRAIQDYSLQFKDIVH
Zeamays02 NYNFIRPDLIVGSCLQLDVDKLRDIGVKTVFCLQQDPDLEYFGDICAIQEYCLQCKDIEH
Ricinusc1 NYNFILPDLIVGSCLQEDVDKLRRIGVKTIFCLQQDPDLEYFGDITAIREYAKKCGDIQH
Glycinem1 NYNFIRPDLIVGSCLQEDVDKLRRIGVKTIFCLQQDSDLEYFGDINAIREYAKTCNDIQH
Vitisvin1 NYNFIRPDLIVGSCLQEDVDKLRSIGVKTIFCLQQDSDLEYFGDINAIIEYANTFDDIQH
Selagine1 NYNRVLPNLIVGSCLQADVDLKKDENVTTVCNLQQDPDMAYFNDISEIRDHAKEVGDFNH
Physcomi5 NYAHVLTDLIVGSCLQADADKLKDAGVGVIFCLQQDPDLAYFGDLPAIQAHVKELDGIDH
Volvoxca1 NYNRILPDLIVGSCLQEDVDLAEKEGVRTVFCLQEDSDMAYFNDVKPIQARCEERGDIKH
Chlamydo1 NYNRILPDLIVGSCLQADVDLYNKENVRTIFCLQEDPDMAYFSDIIPIQERCAELG-LKH
Chlorell1 NFSRILEDLIVGSCLQADVDVADGEDVRTVLCLQEDSDMAYFDDLTPILERIGERGDVRH
Ostreoco3 YYHRIKPFLIVGTQPQADVDLRETEGVTCVFNTQQEKDWKYWNVDYDSVRARAIETGMRH
Ostreoco4 YYHRIRPNLIVGTQPTADIDLRDVEGVTCVFNTQQDKDMEYWKVDFASVKRQIEKRGMKL
Micromon2 YYHFILPNLIVGTQPQEDIDLKDVEGVTCMFDTQQDKDKDYWKVDAGAIRDQMNKRGVLH
Micromon1 YYHFILPNLVVGTQPTEDINLRDVEGVTCMFDTQQDKDKEHWGVDAHAIREQMRARDVLH
At3g01510 RYSKITEQIYVGSCIQEDVENLSEAGITAILNFQGGTEAQNWGIDSQSINDACQKSEVLM
Arabidly1 RYSKITEQIYVGSCIQEDVENLSEAGITAILNFQGGTEAQNWGINSQKINDACQKSEVLM
Oryzasat4 QYSKITEQIFVGSCLQRDVKLSETMGITAVLNFQSESERTNWGINSEAINNSCRENNILM
Zeamays04 QYSKITEQIFVGSCIQKDVKLSETMGITAVLNFQSESERINWGINSEIINSSCRENNILM
Sorghumb1 QYSKITEQIFVGSCIQKDVKLSETMGITAVLNFQSESERINWGINSETINSSCRENNILM
Ricinusc2 RFRKITEQIYVGSCIQADVKNLSSVGITAVINFQSVAEAENWGINSNSINESCQRSNILM
Vitisvin2 RYSKITEQIYVGSCIQADVETLSNAGITAILNFQSGIEAENWGINSRSINESCQKFNILM
Selagine3 RYTKMTEHIVVGSCLQAEMQQLKQMGITAILNLQSESEQLNWGIDKSSITEAAQANGMLS
Physcomi1 TYTKLTDYIYVGSCIQEDISLADNFGITAVLNLQRKSEQVNWGINGQEIDNMARQKGIIV
At3g10940 NYTLIRDELIVGSQPQEDIDLKQEQNVAYILNLQQDKDIEYWGIDLDSIVRRCKELGIRH
Arabidly2 NYTLIRDELIVGSQPQEDIDLKQEQNVAYILNLQQDKDIDYWGIDLDSIVRRSKELGIRH
Populust2 NYTLITDNVIVGSQPQEDIELRHEENVAYILNLQQDKDIEYWGIDLQSIKQRCQQLGIRH
Oryzasat7 NYAIISDSLIVGSQPQEDIDLKDEEKVAFILCLQQDKDIEYWGIDFQTVVNRCKELGIKH
Oryzasat9 NYAIISDSLIVGSQPQEDIDLKDEEKVAFILCLQQDKDIEYWGIDFQTVVNRCKELGIKH
Zeamays03 NYAVISESLIVGSQPQEDIDLKNEERVAYILCLQQDKDIEYWGIDFQSIVNRCKELGIQH
Sorghumb2 NYAVISESLIVGSQPQEDIDLKDEERVAYILCLQQDKDIEYWGIDFQSILNRCKELGIQH
Ricinusc3 NYTLITNNLIVGSQPQEDIDLKHEENVAYILNLQQDSDIEYWGIDLQSIRERCQELGIRH
Vitisvin3 NYTLITDHLIVGSQPQEDVDLKQEENVAYILNLQQDKDVEYWEVDLPSIIKRCKELEIRH
Selagine6 HFTVIEKNLIVGSQPQEDITLYEEEGVRAILNLQQDKDVEYWGIDLPAIMKQSASHGIAY
Physcomi2 NWNQITPNIIVGSCPRGDIDMVNEAGIDAILNLQCDLCFDALKIPFDAIRTRAVERGVRL
Physcomi3 NWDQITPNIIVGSCPRGDIDMVDEAGIDAVLNLQSDLCFDALKIPYDSIRKRALERGIRL
Volvoxca2 YYHEIVPNLICGTQPRSDVDLAESERITHILNLQQDKDMHYWGVKLEDIRRACSRHSINH
Chlamydo2 YYHEIIPNLICGTQPRGEVDLADNEGITHILNLQEDKDMHYWGVKIEDIRRACAKHSINH
Osteroco1 AKDCARARMLIGSCPREDVDLVDEAGVEAIVCLQCAMCHSAMEIDWQAVRRRALEREVMI
Ostreoco2 KEDMARARVLIGSCPREDVDLVDEAGVEAIVCLQCSLCHAAMEIDWQSVRRRAIERGVMI
Micromon3 EAELNAAGCVIGSCPRSDVDLIDEGGVEAIICLQCELCHGALMIDWEPIRARALERGVPI
Micromon4 AQL-ASSKLVIGSCPRADVDLIDEGGVEAIICLQCTLCHGALEIDWEPIRRRALDRDVPI
Chlorell2 YYHYVAPDVIVGSQPRLDVDLA-AEGVGVILNLQQDKDMAYWKVSLKEISERAAHHGMRL
Homosapi1 HYSRILPNIWLGSCPREHVTLKHELGITAVMNFQTEWDIVQNSMTPDTMIKLYREEGLAY
Homosapi2 HYSRILPNIWLGSCPREHVTLKHELGITAVMNFQTEWDIVQNSMTPDTMIKLYREEGLAY
Oryctola1 HYSRILPNLWLGSCPREHVTLKHELGVTAIMNFQTEWDIVQNSMTPDTMIKLYKEEGLVY
Musmuscu1 HYSRILPNIWLGSCPREHVTLKHELGVTAVMNFQTEWDIIQNSMTPDTMMKLYKEEGLSY
Monodelp1 HYSRILPNIWLGSCPREHVTLKHELGVTAVMNFQTEWDITQNSMTPETMIRLYKEEGIVY
Gallusga1 HYSRILPNIWLGSCPREHVTLKHELGVTAVMNFQTEWDIVQNSMSPEVLMRLYKEEGLAY
Parameci1 SITQINENIIIGPYPQNEQDVLKDFGVKAVLNLQTRLDVYHRGVDWDEILSSYKKHNIQM
Parameci2 SITQINENIIIGPYPQNEQDNLSNYGIRAVLNLQTRLDVYHRGVDWDEILASYKKHNIYM
Parameci3 TISQISENIIIGPYPQNEQDVLKQNGIKAVLNLQTRLDIYHRGVDWDEIQNTYKKNDMVM
Parameci4 AITQINENIIIGPYPQNEQDVLKSQGIKAVLNLQTRLDIYHRGVDWDEIQNSYKKNDIIM
Tetrahym2 NFDKITDNISLGPYPENQEQMLAQSGVKAVFNLQTEQDMEYHGTNWESIKKLYSSNGIKV
Parameci5 QTYEINENILIGPYPQNEQDLLKQKQVKAVLNLQTRLDMFHRGVNWEQIVDAYKRQNIVM
Parameci6 QTYEINENIMIGPYPQNEQDMLKQKQVKAVLNLQTRLDMFHRGVNWEQIVDAYKRQKIVM
Parameci7 QTYEINENILIGPYPQNEQDYLKQKQVRAVLNLQTRLDMFHRGVNWEQIVDAYKRHNIVM
Parameci8 QTYEINENILIGPYPQNEQDYLKQQQVRAVLNLQTRLDMFHRGVNWEQIVDAYKRHNIVM
Cyanidos1 EFDEICPDIYIGPYPQTPEHMMHEAGITAVLNLQTDEDFAHRSIPWSTLMETYTALEMQV
Tetrahym1 QVTKIDDNIYLGPYPQSEEDELSERGIRAVLNLQTEKDMQLKGAAYIKLLRFYKTYNIQP
Tetrahym3 SFDNITENLSIGSFINSANDSLHKLGVTAIVNLQTKRDMERKYVNAQEIRKICKSKGILF
Leishman1 PATQILEFLYLGSVKDAQDAFLARHQIRYIINVSQEEYWSVDK--KVQIFTFKVDDSATA

IRCEIRDFDAFDLRMRLPAVVGTLYKAVRNGGVTYVHCTAGMGRAPAVALTYMFWVQGYK
IRCEIRDFDAFDLRMRLPAVVSTLYKAVRNGGVTYVHCTAGMGRAPAVALTYMFWVQGYK
LRAQIRDFDAFDLRIQLPAVVSKLRKAIQNGGVTYIHCTAGMGRAPAVALAYMFWVQGHK
CRAEIRDFDAFDLRLRLPAVISKLHKLVHNGGVTYIHCTAGLGRAPAVTLAYMFWILGYS
CRAEIRDFDAFDLRLRLPAVVSKLHKLICNGGVTYIHCTAGLGRAPAVALAYMFWILGYS
CRAEIRDFDAFDLRLRLPAVISKLHKLVHNGGVTYIHCTAGLGRAPAVTLAYMFWILGYS
LRAEIRDFDAFDLRIRLPAVVSKLYRAIQNGGVTYIHCTAGLGRAPGVAMAYMFWVQGYK
LRAEIRDFDAFDLRRRLPAVVSKLYKAISNGGVTYIHCTAGLGRAPAVALAYMFWVLGYK
LRAEIRDFDAFDLRLQLPAVVSKLYKAIRNGGVTYIHCTAGLGRAPAVALAYMFWVQGYK
LRLPIRDMDGFDLRMRLPSVIASLYQELDREGTLYVHCTAGLGRAPAVALGYMFWVLGYD
YRCQIRDFDPYDLRMRLPVAVAQLHNAIAHKGTAYVHCTAGLGRAPGVALAYMYWLRGLS
VRFPIRDFDPFDLRRKLPKAVTRLARDHPANGTVYIHCTAGLGRAPATALAYMFWLRGYQ
VRFPIRDFDGFDLRRKLPKAVARLARDHPTAGTVYIHCTAGMGRAPATALAYMFWLRDFQ
VRHRIRDFDPFSLRMELPGAVAALAQNAANGGTAYVHCTAGLGRAPATALAYMWWFKGWH
VRYPFEDFSADSLREGLPSAAAMLDAEIERGETVYLHCTAGMGRSPGLAIAYMYWFLDNT
ERYPFVDFSADSLREGLPAAAAALDAAARRGETVYLHCTAGMGRSPGLAIAYMYWFLDDS
VRQPFVDFNADSLRVGLPKAVAQMDKVLREGHVVYCHCTAGMGRSPGVAIGYLYWCLNDS
VREPFLDFDADSLRVGLPKAVASMDKALREGHVVYCHCTAGMGRSPGVAIAYLYWCLNES
INYPIKDADSFDLRKKLPLCVGLLLRLLKKNHRVFVTCTTGFDRSSACVIAYLHWMTDTS
INYPIKDADSFDLRKKLPLCVGLLLRLLKKNHRVFVTCTTGFDRSSACVIAYLHWMTDTS
VNYPIREVDSMDLRKKLPFCVGLLLRLIRKNYRIYVTCTTGYDRSPACVIAYLHWVQDTP
VNYPIREVDSLDLRKKLPFCVGLLLRLIRKNYRIYVTCTTGYDRSPACVISYLHWVQDTP
VNYPIREVDSVDLRKKLPFCVGLLLRLIRKNYRIYVTCTTGYDRSPACVISYLHWVQDTP
INYPIRDADSFDMRKKLPFCVGLLLRLLKKNHRVFVTCTTGFDRSPASIIAYLHWITDTS
INYPIREVDSYGMRKKLPFCVGLLLRLLKKNHRVFVTCTTGFDRSPACVVAYLHWMTDTS
VFLRFRDVDTVDLRRKLPLAVGILYRLLRAGHHVYVTCTSGMDRAPACVIAYLHWIQDVP
VDAPIRDVDTVDLRRKLPYAVGVLHRLLRRCHRVYVTCTTGLDRAPSCVIGYLHWIQDVS
MRRPAKDFDPLSLRSQLPKAVSSLEWAVEGKGRVYVHCSAGLGRAPGVSIAYMYWFCDMN
MRRPAKDFDPLSLRSQLPKAVSSLEWAVEGKGRVYVHCSAGLGRAPGVSIAYMYWFCDMN
MRRPATDFDPDSLRSALPKAVSSLEWATEGKGRVYLHCTAGLGRAPAVAIAYMFWFCCMN
IRRPAVDFDPDSLRTQLPKAVASLEWAIEGKGRVYVHCTAGLGRAPAVAIAYMFWFENMN
IRRPAVDFDPDSLRTQLPKAVSSLEWAIEGKGRVYVHCTAGLGRAPAVAIAYMFWFENMD
IRRPAVDFDPDSLRSQLPKAVSALEWATQRKGRVYVHCTAGLGRAPAVAIAYMFWFENMD
IRKPAVDFDPDSLRSQLPKAVSALEWAIQRKGRVYIHCTAGLGRAPAVAIAYMFWFENMD
MRRPAKDFDPDSLRSILPKAVSSLEWAIEGKGRVYVHCTAGLGRAPAVTIAYMFWFCDMN
MRRPARDFDPDSLRSGLPKAVSSLEWAIEGKGKVYVHCTAGLGRAPAVAIAYMFWFCGMD
FRIPARDFDPNSLRNELPRAVAALESAISGS--VYVHCTAGLGRSPAVAIAYLYWFCDMD
ERVAIRDFDHADQSLMLPVAIRVLNSLVGRGMKVYVHCTAGINRATLTTVGHLTFVQQMD
ERVAIRDFDHADQSLMLPVAVRLLNSLIGRGMKVYVHCTAGINRATLTTVGHLTFVQQMD
MRRPARDFDPHSLRRTIPGAVHSLAQALSGGSRVYVHCTAGLGRAPAVCIAYLYWFTQLQ
MRRPAKDFDKGSLRKAIPGAVHTLAGAMAGGGRVYVHCTAGLGRAPGVCIAYLYWFTDMQ
VQVSVRDFDRLDQAKMLPEAVRKLAAFQAMGKRTYVHCTAGINRASLTVVGYLTFVKQFN
VQVNVRDFDRLDQAKMLPEAVRKLAAFQAMGKRTYVHCTAGINRASLTVVGYLTFVKMFD
VRVSVRDFDRLDQAKMLPEMVRKLALFRAMGKRTYVHCTAGINRASLTVLGYLTFVEGMT
VRVAVRDFDRLDQAKMLPEMVRKLALFQAMGKRTYVHCTAGINRASLTVLGYLTFCKARP
VRTPAVDFSPHSLRDTLPTAVSALERSRAAGDKVYVHCTAGLGRSPAVAIAALYWFTDMQ
IWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWN
IWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWN
IWMPTPDMSTEGRVQMLPQAVCLLHALLENGHTVYVHCNAGVGRSTAAVCGWLQYVMGWS
IWMPTPDMSTEGRVQMLPQAVCLLHALLENGHTVYVHCNAGVGRSTAAVCGWLHYVIGWN
VWMPTPDMSTEGRVQMLPQAVCLLHGLLENGHTVYVHCNAGVGRSTAAVCGWLKYVMGWN
VWMPTPDMSTEGRIQMLPQAVCLLHGLLQNGHTVYVHCNAGVGRSTAAVSGWLKYVMGWS
KNFEIFDMDPQDFEKKILKAVQILKKLINQHESVYIHCTSGIGRAPSLAVIYLSSVLQIP
KNFEIFDMDPQDFEKKITKAVQILKKLINQYEFVYIHCTSGIGRAPSLAVIYLASVLQIP
KNFEIFDMDPVDFEKKAFKAVQMLKKLINNYEFVYVHCTSGIGRAPSLVVLYLATVLQVP
KNFEIFDMDPIDFERKAFKAVQLLKKLINNYEFVYVHCTSGIGRAPSLVVLYLSTVLQIP
IHYPVTDMDVHDMAYKLHDAVDKFAMAIEKWNHVYIHCTSGIYRSPQVIVAYLNLYHEID
KNYQIFDMDAEDFEKKSNKAVQILKKLINEHEYVYVHCTAGIGRAPSIIVLYLSSILQYD
KNYQIFDMDAEDFEKKSNKAVQILRKLINEYEYVYVHCTAGIWRAPSIVVLYLSSILKYD
KNYQIFDMDSEDFEKKSNKAVQILKKLINEYEYVYVHCTAGIGRAPSIVVLYLASILQYD
KNYQIFDMDSEDFEKKSNKAVQILKKLINEYEYVYVHCTAGIGRAPSIVVLYLASILQYD
IRCPIPDFNAEALMQLLPDAVRALDAALKAKRVVYVHCTAGMGRAPAVVVAYLVWRRGMT
FHFPVIDMDVIDMCYKLQDVSRLLNYLVSTMKRVYVHCTAGMFRSPQCVIGYYTYFKNMK
INTPIRDNDPVDYVQRAPEVLDIIEDLYKANHHIYIHCTAGIGRAPQTAILHLVLHRNYK
DIAALFQPTRDLITSIRGRYYRYARGESSTRPAVLVHCQKGRSRSATIVLAYLIYTNGWS

LMEAHKLLMSKRSCFPKLDAIRNATIDILT
LMEAHKLLMSKRSCFPKLDAIRNATIDILT
LNEAHDLLMSKRSSFPKLNAIKSATADILT
LNEGHQLLQSKRACFPKLEAIKLATADILT
LNEGHRLLQSKRACFPKLEAIKLATADILT
LNEGHQLLQSKRACFPKLEAIKLATADILT
LSDAHDLLLSKRSCFPKLDAIKSATADILT
LNEAHTLLQSKRSCFPKLDAIKSATADILT
LNEAHSLLMSKRSSFPKLDAIKSATADILT
LHEAYLLLQSKRKCVPSMENIRAATCDLLT
LKEAND------LDAAYELLRGKRMCSPRIEAIRSATVDLLV
LDAAYELLRGKRMCSPRIEAIRSATVDLLV
LEDAYQHLTGIRTCKPNAQAIRNAAADVLY
LDGAYEGLTSIRPCGPKKESIRGATCDLLA
LDGAYEALTSIRPCGPKKESIRAATCDILA
LDQAYDFLTSKRPCGPKKESIRLATVDMLR
LDQAYDFLTSKRPCGPKKESIRLATCDMMW
LHAAYSFVTGLHACKPDRPAIAWATWDLIA
LHAAYSFVTGLHACKPDRPAIAWATWDLIA
LHIAHKFITGLHSCRPDRAAIVWATWDLIA
LHIAHKFITGLHSCRPDRAAIVWATWDLIA
LHIAHKFITGLHSCRPDRAAIVWATWDLIA
LHAAYNFVTGLHFCKPDRPAVAWATWDLIG
LHAAYNFVTGLHSCRPDRPAIAWATWDLIA
LQSAVDFVTNLHLCGPDRPALVWATWDLIA
LPQAYDFVTSLHRSGPDRPALVWATWDLIA
LNTAYDTLVSKRPCGPNKGAIRGATYDLAK
LNTAYDNLVSKRPCGPNKGAIRGATYDLAK
LNTAYDTLTSKRPCGPSKRSIRAATYDLAK
LKTAYEKLTSKRPCGPNKRAIRAATYDLAK
LRTAYEKLTSKRPCGPNKRAIRAATYDLAK
LNTAYQKLTSIRPCGPSKRAIRAATYDLAK
LNTAYKKLTTIRPCGPSKRAIRAATYDLAK
LNAAYDELTSQRPCGPNKRSIRGATYDLAK
LNTAYDTLTSKRPCGPSKQAIRGATYDLAK
MDTAYSLLTSKRPCGPKKEAIRGATYDLAN
LEDAVASVKSSRPVAHPYIDCWSEARRRLL
LEDAVALVKSCRPVAHPYIDCWIEVRRRLL
LDEAYSYLTSLRPCGPKRDAIRGATYDVLA
LDEAYSHLTTIRPCGPKRDAIRGATYDVAH
LEDALRVVRTCRPQANPYVVSWQIARARLL
LEAALHAVRTSRPQANPYVVSWEIARARLL
YDQALAIVRESRPQANPYAVSWEIARERLL
LASMTPFPYDRVGVVNADP------LDEAYAYLTGIRPCGPSKDAIRGATYDLLS
LRKVQYFLMAKRPAVYIDEEALARAQEDFF
LRKVQYFLMAKRPAVYIDEEAASQDTF---
LRKVQYFLMAKRPAVYIDEDALARAQDDFF
LRKVQYFIMAKRPAVYIDEDALAQAQQDFS
LRKVQYFLMSKRPAVYIDEEALARAEEDFY
LRKVQYFLASRRPAVYIDEEALIRAEDDFF
LNESIALVKNKREHFYINFNMLKRALQKTM
LDQAIAFVKSKREHFYINLSMLKKALQKTM
LNEAISFVKSKREHFYINHDMLRKSLQKTT
LNEAISFVKKKREHFYINHNMLRKSLQKTE
VNKAISQVESKRPITKKNRDYLRQVYAIKG
LKDAIEFVKQKRQQFYVNYSMLKKSLQKTL
LKEAIELVKQKRQQFYVNYSMLKRSLQKTL
LKEAIEFVKQKRQQFYINYSMLKKSFQKTL
LKDAIEFVKQKRQQFYINYSMLKKSYQKAL
LEDALSHVKARRAVAAPNVTVLEKVLRNPL
VQQAIKYVENQHPHSKINKGYIETIMNCTP
INEASELIFSKRPVSSPNKEAIISVLKMLS
VAEAMKYVGARRPCAEPNIGFMEELRKLQE
