>seq_1 NALIASKKQLIDGIAHELRTPLVRLRYRLEMSDSQALNRDISQLEALIEELLTYARLDEPDLPLWLSTHLADIQAVTPDKTVRIKTLAALDMRLWPMNTI VFVEDDAEVGALIAAYLAKHDMQVTVEPRGDRAEETILRENPDLVLLDIMLPGKDGMTICRDLRAKWSGPIVLLTSLDSDMNHILALEMGACDYILKTTP PAVLLARLRLHLRQLHFGTLTIDPINRVVTLANTEISLSTADFELLWELATHAGQIMDRDALLKNLRGVSYDGLDRSVDVAISRLRKKLLDNAAEPYRIK TVLF >seq_2 GALINSKKTLTNAVAHELRTPLARLRYRLALLE-QAIERDLTAIDKLVEELLFHARLDSFNALPWARDRIAGQAALAQ-EIHWIEI--TADEHLWPTHRI VFVEDDADLAELISDFLSRHEMEVVIEPRGDTALDTIAREAPDLVLLDIMLPGKDGLSICRELRPKFDGPIIMLTSLDSDMNQILGLELGANDYILKTTP PSVLVARLRAQLR--NFGQLSIDPVSRDVVLCGEKIPLSTSDFDLLWLLASHAGETLNRDLLLKEMRGVDYDGLDRSIDVAISRLRKKLGDNPSEPFRIK TVLF >seq_3 ERQLETRQALSHAIAHEIRTPIARLRFGLTMLERDGMERDMVELEELISTSMEFAKLRTVDLFDWFDDLITPL-KPPNLTLTLC-ADRKLMYIAILIVED DARLAELIAAYLTKHGYQVSLHGRGDTAPAFILETRPDLVVLDVMLPGKEGFDVCRDVRP-HGGRILMMTARDEDVDEILGLELGADDYLAKPVEPRRLL ARIRALLRRITFGQFSISQATREALLGDEQIELTTAEFDLLWLLATHAGQVLSRDDIMSELRGIGFDGLDRSIDARISRLRRKLGDNPDAPARIKTVRGK GYLF >seq_4 ERQMEQRTAMLNGVSHDLRTILTRFKLQLALAGLEGLDKDVDDMQSMLEAYLTFARTETLELPALLEKI-GH--DF-E--GKK-FSQLIIARPNIPAAHL LIVDDDARIRNLLQRFLGDKGYRVSVAGDAAEARRKMVGIRFDLLILDVMMPGENGLSLTRSLNENKSVPVILLTARSEADSRIAGLEAGADDYLAKPFD PRELVLRINNILRRIMFGPYSFS-P---LRRG-ETIRLTDREQEIMLLFALRAGDTIPRHE---VE-----ESEVGER-IDVQINR-RR-I-DDPANPVY LQ-R >seq_5 ERQIEQRTAMLSGVSHDLRTILTRFKLQLALA-TEPLNQDIADMQTMLEGYLAFARGETFDVTRLCEKLEEA--RLRERG---FKYIINVRPNAPAPHLL VVDDDTRIRSLLSQYLTNSGFRVTMAGSAAEARRKLEGIDFDLLILDVMMPGETGVSLTRSLREQKNVPILMLTALSETDSRIDGLAAGADDYLPKPFDP RELTLRINNILRRIVFGPYTFFIPRRELKKGTETIKLTDREQDIMAIFAERAGETIPRHE--LT----GQDGDVGERTIDVQINRLRRKIEQDPANPVWL QTVL >seq_6 ERTMEQRTAMLAGVSHDLRTILTRFKLELALIGLEGMRKDVDEMSMMLEDYLAFARGDPTDMAQALEELRDA--ERHGHAAT-VAF-VTVKPASPAPHLL LVDDDRRIRDLLSRFLAGEGYRVSTAASASDARSKLMGLHFDLLILDVMMPGETGFDLARFIRTSSSVPIVMLTARHEAEARIEGLQIGADDYVAKPFEP RELALRINNILKRIAFGPYVYHLDRGELRQGEEVIHLTDREREMLRILSETPGETVPRSA--LT-----GNGSVNERAVDVQINRLRRKIETDPANPLFL QAVL >seq_7 QRHIEQRTALLASVSHDLRTPLTRLKLEMAMAEMEAMKGDLAEMEHMIDEYLAFARGEVVDLSDLVESVV-ADAERGGAAIETITRRPLTFRRALLVVDD DDRLRKLIKEFLSRAGFRVTAASSAAAADKLFDALDFDLMVLDVMMPGEDGMAFTKRLRAKGRTPILMLTARDQTADRIEGLSSGVDDYLGKPFEPQELL LRIEAILRRLSLGRCTFDADRGELTCDGEAVRITEAEVTLLRRLARSLHEPVDRLELAR----DTADATGRAVDVQVTRLRRKIEPDPKNPRYLQTVRGV GYRL >seq_8 AESDSVRRTLLSGLPHDLKGPLSRMWLRIEMA-KEGLRKDLQDMQHMVDQFIGFVRGTPLALGEWLAERVQG----AGTDIRLAIQDAVALGRLLLVVDD DPALRQLLADYLNRHGYDTLLAPDASDLAARITRYAPDLLVLDRMLPGGDGADACRRLRDQGDIPVILLTARDEAVDRIIGLEAGADDYLGKPFDPRELL ARIEAVLRRVSFGPFVFDPATRQLLRDDTPVKLTGGEINLLEALVRNAGKPLSRERLLALARDDDGERNDRAIDIAILRLRRAIEEDPKQPRWIQTVWGI GYRF >seq_9 DNAARERRLMLAGLSHDLRTPLTRLKLTLEMQE-HDMLSDIDELSRIVRQFIDFARAEPVALADLAASVVR----REDMDVRLILKDALALERLILVVDD DAKLRELLTRYLTQQGYIVETLPDPKDLDRKLARNRPDLIVLDVMMPGEDGLAVVRRLRAQGTLPVIMLTARGEDIDRILGLEMGADDYLPKPFNPRELT ARIQSVLRRVEFGEFVLNLGQRELRRAGQPVSLTSAEFAVLSVLVSHPRRPLTREQLMELALGKGNESLDRSIDVHISRLRKALESGDSQLRYIQTVWGY GYVF >seq_10 RQAEADRELMLAGISHDLRTPLARMRLEIEMSGRQAIDEDLGQIDHSIGQLMEYARPAATDISSVLAELYRSHTASLGGELEAIARTALDLKRIILVVDD DPRLRDLLRRYLSEQGFNVFVAEDAKEMGKLWQREHFDLLVLDLMLPGEDGLSICRRLRGGHNTPIIMLTAKAEEIDRIVGLEMGADDYLSKPFNPRELL ARINAILRRIAFGPYVLNLSTRTLTRNGEQVPITTGEFSVLKVFARHPKIPLSRDKLMELARGREYEAFDRSLDVQISRLRKLIEPNPSKPVFIQTVWGL GYVF >seq_11 ARLMSDRTRMLAAISHDLRTPITRLRLRAEFIE-KRMLIDLDQMRSMLESVLSLLRNDAVTLVDIAST-LADQFG-MGHVVHYGAARPDDLHRGILVVED DRETRTLIAKYLRNNACNVTAVSDGREMSRAMADHRVDLIILDVMLPGEDGLSLCRKVRSEAQTPIIMLTARGEDVDRIVGLEMGADDYLTKPFNPRELL ARINAVLRRLAFEGWRIDLRLRELRNEGARVAVTSAEFDLLRTFCERPGRVLSRDSLLDLTQGRNTGSFERSIDVLVSRIRRKIEPNPHDPTIIKTVRSG GYLF >seq_12 TSLMNDRTRMLAAISHDLRTPITRLRLRSEYIERTQTLRDLDQMQAMLESVLILLRG---GLVDIAAL-VCEEFA-CGHAVHYGLTKPDEIRRAILIVED DQETRNLVARYLRENSFNVGMAANGREMDRYISQNRVDLIVLDLMLPGEDGLSLCRRLRVDTTTPIIILTAKGDDVDRILGLEMGADDYLPKPFNPRELL ARINAVLRRLRFEGWIIDSRMRELRDEGAQVPLTSAEFDLLQSFCERHGRILSRDTLLNMTRGRPGGAFGRSIDVLVSRLRGKL-DRTEGTSMIKTVRTG GYIF >seq_13 RRYVEDRTAMVGAIAHDLRTPLTRLKFRIEAAP----LAADIDQMEAMISATLGFVRDTKLELSSLLESVAAETGGDAT--VES-IGDPVALKRRILIVD DDPGIRDVVSDFLAKHGYVVETAQDGRTMEQVLARGPIDLIVLDVMLPGEDGLAICRRLSATPAPAIIMLSAMGEETDRIVGLELGADDYLPKPCNPREL LARVRAVLRR-EFAGWRLDLVRRELRPQSIVVNLSSGEFSLLRAFVERPQRVLTRDQLLDLARGRDSDAYDRAIDVQISRLRRKL-DDGGGSELIRTIRS EGYM >seq_14 QDLIRNRTQMLAAISHDLRTPITRMKLRAQFLD--NALVKDLNEMEVMINETLSFARAD--DLVSLVCSLMQ----DMGYNIQHSIGRASALKRHILIVD DDSDIRDLLGKFLRRHGFEASLAKDGSEMQAILLKQAVDLVILDIMMPGDDGLTLCRQMRANSTIPILMLTAISEEVDRILGLEMGADDYLSKPFNPREL LARVRAILRRYEFAGWSLDPAERRLKPDSLEITLSSGEFDLLHALVQRSQQVLSRDKLLDLTKNREAGPFDRSIDIQISRLRHKLEQDPKNPQIIKTIRG GGYV >seq_15 QRYLQTRSQMLAGVSHDLRLPITRVRLRLEQLERQAIERDLSEMDQLIGDTLAYLRMGLLNVGALLEGVIEAL----GAEVAYALYRPQALRRAVLIVDD DPDLRDLLSDYLSRQDMVVSAVGDGEAMNRALAEQSFDILILDLMLPGADGLTLCRDLRSRSNMPILMLTARGDELDRIIGLEMGADDYLPKPFNPRELL ARVRSILRRLQFGDWRLELGAQHLVDGGVVTPLSGGEFKLMQALAENPQRVMSRDQLMEAMNGKEAGPFDRTVDVMIGRLRRRLGDDAREPLLIKTIRSG GYML >seq_16 LDYVAEREQLAAALAHDLRTPLTRMKLRMELL-RQSLSRDLNDIEAISRSVIDFATSERLDLWSLLLSVAQVSLDEKG----SYCLQPISIQRCILIVDD DKDIRDLLHEFLKRRGMHVSIACNGDEMLDVLSRTPIDLVILDVMLPGKSGIEICQDVRRTSRVPIIMLTAIADAADKILGLEIGADDYIAKPFDPRELL ARIRAVLRRRFAG-WTLDCARRRLTSHDVRVELTTAEFNLLEAFVKSSQHILSRDQLMEMAGHQAVYGYDRSVDILISRLRKKLEDDPCAPKLILTIRGG GYQF >seq_17 QRANGFKNEILGTVAHDLKNPLGVILGRTEMLKVDHIRDATKRLTTMVDHLISDAMADPVDVAALVKEVAQPLAVNKQQAISVATMDTDRIREAIMIVDD EAPAREMVGDYLKMHGFTVTLCDGGKSLRAAIDGGMPDLVVLDLNMPEEDGLSIIRDLKS-RNVPVIMLTATASPIDRVVGLELGADDYVAKPCELRELM ARIRSVLRRVRFGTKWLDLEAQALRDDGNEHPLTASEFGLLKVFAANPKRVLSRERLLELANARDAEAFDRAVDLRIMRIRRKIEPDPAKPAVIRTIRGG GYLF >seq_18 YSRIEAIEMFAADVAHELKNPLTSLRSAVETLPLEVIEHDVKRLDRLISDISDASRLDPVDLRRLLGTLVLGHDVAVEARFEGGVTHDSRLGQVIALVDD DRNILTSVSIALEAEGYRIMTYTDGASALDGFRTTQPDLAILDIKMPRMDGMETLRRLRQKSDLPVIFLTSKDEEIDELFGLKMGADDFIRKPFSQRLLV ERVKAVLR---RGLLRMDPERHTCTWKNEPVTLTVTEFLILQALATRPGVVKSRNALMDAAYDDQVYVDDRTIDSHIKRLRKKFKVVDNEFEMIETLYGV GYRF >seq_19 YTRIEAIESFAADVSHELKNPLTSLRSAVETLPLDVIQHDVRRLDRLITDISDASRLDRVDMKKLLTSLVREV-RRNKVGTEIFVAHDLRLGQVIALVDD DRNILTSVSIALESEGYRVETYTDGASALDGLMARPPNLAIFDIKMPRMDGMELLRRLRQKSDLPVIFLTSKDDEIDELFGLKMGADDFITKPFSQRLLV ERVKAVLRRLERGQLVMDQERHTCTWKGEPVTLTVTEFLILHSLAQRPGVVKSRDALMDAAYDEQVYVDDRTIDSHIKRLRKKFKAVDDSFEMIETLYGV GYRF >seq_20 SERMDAIERFAADVAHEIKNPLTSIRSAIETLDLAILQNDVNRLDRLVTDISNASRLDALDLGRLLTEVVENQWRPGSVRVSLLILRETPIGQVITLIDD DENIVASVSLALESHGHTVKAYYDGASGLEAVEASPPDLVILDVKMPRMDGMEVLRRLRQTSEIPVIMLTSKDDEIDEILGFNLGADDYMHKPFSQRLLL ERVKAVLR---RGKLTLDPARHDSLWDGRPVRLTVTEFLLLQALAQRPGFVKSRDNLMDAAYDDQVYVDDRTIDSHIKRMRKKFRQVDPEFDSIETLYGV GYR- >seq_21 ARRSDYIATFSAHLTHELKSPLTSIKGAAELLQIANILSDTQRLEAMAQRLRELARAERTELAPVIADLRSRF-PESSIEASGLMSEKALI--VILIVDD EGHIREVIRVALKKAGMDVIEARDGKEALARFAADRPDLIVLDVGMPEFDGLDVCREIRKGSDVPILFLSARDEEIDRVLGLEIGGDDYVTKPFSPRELV ARVNVILRRLAQGGLLIDPEQHVASFAGTPLKLTAIEFGILRAFLTRPTSVFNREQLMRAAYQLNIQVSDRTIDSHIRNIRAKLAAL-SCDNVIETIHGV GFKL >seq_22 QDNADDMIAWNAAIAHELRTPLTILKGRLQGIAIGNLLLQIDGLSRLVDDLRTVTLADPVALAAEIQNVAEPVLANAGFSLELLALDGIRIRQAILIAED EPQICEILDAYLTREGFRTVRAGDGRAALDLHLALKPDLILLDVTMPRLDGWEVLAEIRRRGDTPVIMITALDQDIDRLQGLRIGADDYVVKPFNPVEVV ARTKAVLRRLRVDCVAIDLDGHMVKVEAQPLPVTLTEFRLLAHMARTPTKAFTRSELVDACLP-GSDALERTVDSHISNLRKKLDVAGA-PGMLSGVRGI GYRL >seq_23 KELEKMRKEFVAGVSHELKTPIGIIEGYAEGLKTDVIIDEAARMGKLVSDMLDLSQLEKFDIGEMVFKCSYAILDEKKIDMSIIVMDQYRLEQVILIVED EDRMRELIKAYLRREGYSVLEAADGKEALEMFDRNSISLVILDIMLPLLDGWTVCTNIREKSEVPIIMLTAKSEEDDKLLGYELGADDYVTKPCSPKVLT AKVKVLLKR--FDGLKIDAISHEVTIEDKEIYLSPKEYDLLIYFSNNKGITLTRDKILDNVWGEDYYGDLRTVDTHVKRLREKLQD---KAYLVATVRGS GYKF >seq_24 RELEKLRKDFIAGASHELKTPIGIISGYAEGIKLDIIIDEAEKMNKLVMDMLELSKLEDFSLTELTEEVLSVDINKNNLTVIKYVQDDFKIEQVVLIVED EIRIRFLVRDYFKKEGFNVLEASDGEEALRIFSENIVDLAILDIMMPKLDGLAVCRNIREVSNTPIILLTAKSQEEDKLLGYELGADDYMTKPFSPKVLL AKAKALLRRLDFNGLTINKLSHEVKLNGEELLLSPKEYDLLIYLSSNEGIALSRDRILDNVWGYDYFGDIRTVDTNIKRLREKLLD---KANYIATVRGS GYKF >seq_25 RKLEKTRKEFISGVSHELKTPLSIMKSCISILKFQAMEREVDKMDTLILDMLELAKFEPFYIDTVMEAICSVEIEKKELRVHKIVVNQGRIEQVILIVED EDILREILKDYFLSEQYVVFEARDGKEALVVFEEEEVDLVILDIMLPELDGWSVCRRIRKTSEVPIIMLTARVDEDDTLLGFELGADDYVTKPYSPPILL ARAKRLLESLSIHGIHVHFPSRTVTVNKTDINLTHTEFEILAYFMKNPGIVLTREQLISRIWGYEFAGDDRTVNSHIRNLRNKLGE---KAKYITTVVRT GYKF >seq_26 TDRNEHLKRFMGDVTHELKTPIALVKAYSMGIKVDTIIKQTDQISNLIEELLRFSKLEEFPIEPLVQSILQIELDSKEINLQVCVYDLNKMRMVVLIADD EQDMLRILKAYFEKEGFEVLLAKDGEEALQIFYDEKIDLAILDWMMPKHSGITVCQEIKKNSSVKVLMLTAKSESEDELAALQSGADEYVKKPFHPGVLI TRAKKLIQHIQVQDVKINFAKNKVYKNDIELEITKTELELIKCFLNHKGTILTRKKLLDIVWGFDYFGEERTVDTHVRRLRKKIGED-----IIKTHRGL GYSL >seq_27 HELEETQRYFFAAASHELKTPIAAVSVLLEGMLLRECIKMMDRQGKTISEILELVSLNPLDIGRTVAELLQTLAEANNQRFVTIVLDPKLIQKAILLVED DDHICNTVRAFLAEARYEVDACTDGNEAHTKFYENTYQLVILDIMLPGMNGHELLREFRAQNDTPILMMTALSDDENQIRAFDAEADDYVTKPFKMRILL KRVEALLRRFRVGRLTLLPEDFRVLCDGTELPLTRKEFEILLLLVQNKGRTLTHEIILSRIWGYDFDGDGSTVHTHIKNLRAKLPEN-----IIKTIRGV GYRL >seq_28 LTLEKTRTEFFNNITHELKTPLTNISGYAQIMSLERINKESKRMHELIVSLIEISKIENVNLKTLIEDLCKVKGDKQGICISNLFMREDEIRRIILIIED EPDIQNILRYSFMKEGFKVRCVDKGRQGIDIFKEFNPSLVILDLMLPDISGFDVCRELNKLNN--IIMLTARDDIVDKVLGLELGADDYITKPFDIRECV ARVKVALRRFIIGPIKIKRQSRKVYLNEEEIRLKPKEIELLLYFLDNPNISLSRDQILDGVWGEEYFGDFRTVDVHVRRLRQKL----NNEEIIETIFGL GYMF >seq_29 IKLEKSRREFFNNVTHELKTPLTAISGYAQILSYNRIYMESERLHGLVLDLIDVSKVEVIDMKKLVIEISSIKANKYNLNLISIIYQSNKIRQLIIVIED EFSINDILTFALKEEGYRVKSTFSSQEARKIMKEFRPNLVLLDINLPDESGFELCKFITCKYKIPILMITARNDLVDKVLGLELGADDYITKPFHIKEVM TRVRVALRR---DEVKINLESREVFKNNREIKLKPKEYDLLVFLAQNRNIVFSRETILDRVWELEYDGEIRTVDIHVRRLRAKL-DSNNGQSIIETVFGV GYVM >seq_30 EKAEQMKNEFIASISHEIRTPLTGIKGWSETLKMGIISGETDRLIHLVEELLDFSRLQKVQLYDILEETITPNAEEKKMQFIKILIDRNRLKQIILVLED EMPIRSFIVLNLKRAGFYVLEASTGEEALQILCEHTVDVALLDVMLPGMDGFQVCKAIREENKIGIIMLTARVQNEDKVQGLGIGADDYIAKPFSPVELT ARIQSLLRRITSGPFLLNIIEERLYKSGQLVDLTPTEYMILQYLMNQASKPVSRDEILNMIWGTNYVGETKVVDVNMRRLRQKIECNPSEPEFILTVWGK GYVW >seq_32 TYMKKERGEFLASVAHELLTPLTYMKGYAKVAKLQIIEDETDSVTDLVQDLFMLVQLEKVLLRPFLERMVKTTLTNKQMQLHVCVCDERRMEQVILLVDD EERMLRLLDLFLSPRGYFCMKATSGLEALKLIEQKDFDIILLDVMMPNMDGWDTCYQIRQISNVPIIMLTARNQNYDMVKGLTMGADDYITKPFDEHVLV ARIEAILRRVSFNGIEWDKTKHTVTVYDEKISLTPIEFSLLRLFLQNTNRAYSRDDLIEKIWGYETDIEYRTIDSHIRNIRDKLRKKGFVENYLETVYKV GYKW >seq_33 KQLTESRNELISNISHELRTPLTYIKAYAALLKAMIIHEEAIRMERLVGDLFELMKLEKTDLEAVVERSVTIEGKKKNVTLAIKSLDSGRLEQVVLVVDD EKKMQVLITVCVSEAGYLVQTAASGLAAMDRLKAEPFDLVLLDIMLPDVDGFSLIAQLRHLQDVAIIMLTALGETEHIVRGLNEGADDYIVKPFEPDELN ARILSVLRRLKVNE-----KAHDISFAGQSLGLTKTEYRVLHRLLSNPGRTYTREQLLDLAWDRQVEIYDRNVDAHIKNIRDKLEKVGADSTVIQTVWGV GYK- >seq_34 ERVEEKRREFLADVSHELRTPLSYMKGYAEGVELSIIQKEANRLERLVNDLLDLAQLEPIAFAQLVYEVVRYLAAQKRLQVHLLISDPDRIEQVILIVDD ELDLRELVTSYLRKEGFAVYTAETGDEAIKRLEQEPMDLVVLDVMMDEMDGFTACKEIRAFSQIPIIMLTARGGEDDKVMGLQIGADDYIVKPFSPRELV ARIEVALRRYRFNELRIQPSGRKVFVNGQEISLTKKEYDLLVFLLEHRGRVFTREHLHDRLWGMDTQQTLRTVDTHIKTLRLKLKPAD---RFIKTVWGV GYKF >seq_35 -RHDK-LRKDFIANVSHELRTPISMLQGYSEAIIAKIIYDESLRMGRLVNELLDLARMEEIEVKTFCERIFQGLAKEQQLTLDIYIVDPDRLEQKILVVD DEDRIRNLLKMYLEREAYDVEEASDGKEALEKALAFDYDVILLDLMMPEMDGIEVCQKLRKQKATPIIMLTAKGEEANRVQGFEVGTDDYIVKPFSPREV VLRVKALLRRLVFPHLSIDNDAHRVTVADQEINLTPKEYELLYYLAQSPDKVFSREQLLKDVWNYDFFGDLRTVDTHIKRLREKLNVSPQAASMISTVWG VGYF >seq_36 -RLDK-LREDFIANVSHELRTPISMLQGYSEAIVAQIIYDESLRMGRLVNDLLDLARMEKINVNEFLEKIFSGVAKEKNIALDDIFFDEDKMEQKILVVD DEARIRRLLRMYLERENYAIDEAENGDEAIAKGLEANYDLILLDLMMPGTDGIEVCRQIREKKATPIIMLTAKGEEANRVQGFEAGTDDYIVKPFSPREV VLRVKALLRRLVFSHLSIDHDAHRVTADGTEVSLTPKVYELLYFLAKTPDKVYDREKLLKEVWQYEFFGDLRTVDTHVKRLREKLNVSPEAAKKIVTVWG VGYF >seq_37 EKSEKNRREFISNVSHEIRSPITSIKGFIGGMLLSLTYDEINRLTRLVNDLLDLSSIEKIDINEIIRFTVETKIKEKKLNVDVFVIDKDRISQVVLIVDD DENICEVIKLYLESSGYATKISNDGKSAQNVFVEYKPDIVLLDIMLPQEDGIDVLKWIRKADNTPVIMLTAKGETFDKVLTLELGADDYIVKPFEPKELV ARVKAVLRRLNFNGLKIDMGSYTVIYNEKDIKMPPKEFELLYYLANNKNKVFTREQLLCEVWGYDYPGDSRTVDVHVKRLREKLHEGN--GWDIQTVWGV GYKF >seq_38 EKIEENRRSFISNVSHEIRSPITSIKGFISGILLALAYDETQRLTRLVNDLLDLSAIEEVNINEIIRLTVKPLIDRKNLDVDILVVDKDRLFQVILIVDD DQNICEVIKMYLESAGFDTRVCHDGKESQSIFIEYNPDLVLLDIMLPSMDGIDVLKWIRKEHETPVIMLTAKGETFDKVLGLELGADDYMVKPFEPKELL ARVKAVLRRLQFNGLTIDIDSYTVIYNGKEIKMPPKEFELLYYLASNKNRVFTREQLLCEVWGYDYPGDSRTVDVHIKRLREKLEEGS--NWQIETVWGV GYKF >seq_39 EEAELNRREFISNVSHELRSPMTSIKGFITAILLVIVNDEISRLTRLINDLLDLSAMQELELNRIIETTVNQKAASKNIKIEVLVYDNDRLIQVILIVDD DENICEVIKMYLETTGYNVKVAHDGKAAKEEFVNFSPNLVVLDMMLPGIDGMEVLKWIRKDSNVPVIMLTAKGETFDKVLALEIGADDYVVKPFEPKELL ARVKAVMRRLNFPGLTIDANSYKVIYNGEEVKTPPKEFELLHYLASNKNKVFTRDQLLCEVWGYDYPGYSRTVDVHIKRLREKLNGGE--DWQLETVWGV GYKF >seq_40 NRQEERRRQFMADASHEMRTPLTTINGLLEGLQIKLMQNETARLIRLVNENLDYEKIRKFNGTEALENIVTAKAEAAGNQLYLTVYDYDRFVQVILMIED NESVSEMMQMFFLNEGEATFK-DDGKEGLATFLASKWDMITLDLNLPSMDGMAVCREIRKVSNVPIIMLTARDSESDQVIGLEMGADDYVTKPFSPLTLI ARMKALHRRFK-----MNTKTRETYLDNQLIELTPKEFDLLYTLAKKPRQVFSREQLLELVWDYQYFGDERTVDAHIKKLRQKIEKVG--PQVIQTVWGV GYKF >seq_41 KKLEQVRKDFVANVSHELKTPVTSIKGFTETLLLHIIWKESERLQSLIHDLLELSKIEQTNLFAVVSEVMKGKAEEKGIDISLALEDPERLKQILLVVDD EESIVTLLQFNLEQSGYEVVTAMDGASGLQLAKTQTFDLIILDLMLPEMDGLDVCKQLRQSKMTPILMLTAKDDEFDKVLGLELGADDYMTKPFSPREVV ARVRAILRRLSFGNVEIYPDNYEVYLKGQPLELTPKEFELLLYLANHKGRVLTRDQLLNAVWNYEFVGDTRIVDVHISHLREKIEPNTKKPIYIKTIRGL GYKL >seq_42 KKLEQMRKDFVANVSHELKTPITSIKGFTETLLLSIILKESERLQSLVQDLLDLSKIETFEPAKMLGEIEKHKADEKGISLHLVVSDPYRLKQVILVVDD EESIVTLLQYNLERSGYDVITASDGEEALKKAETEKPDLIVLDVMLPKLDGIEVCKQLRQQKMFPILMLTAKDEEFDKVLGLELGADDYMTKPFSPREVN ARVKAILRRIVIGDLKILPDHYEAYFKESQLELTPKEFELLLYLGRHKGRVLTRDLLLSAVWNYDFAGDTRIVDVHISHLRDKIENNTKKPIYIKTIRGL GYKL >seq_43 RQLEKMQKDFVSNVSHELKTPVTSLLGFTETLILHIMQKDAQRLQQLIQEILELSRGSEITLEKFITEILQQQLAAKQLKTVVGFFKYELFYPIVLVVDD EPSILTLLTFNLEKEGYQVTTSEDGKNGFELALSNQYDFIILDVMLPGMDGLEITKALRREKDTPILILTAKDEQVDKIIGLEIGADDYLTKPFSPREVL ARMKAIFRRLVIGEIRVDEQNYEVFVRNQPIELTPKEFELLVYFMKRKDRVINRETLLERIWQYDFAGQSRIVDVHISHLRDKIEPDPKRPVYLVTVRGF GYRF >seq_44 RKLENMRTQFVANVSHELKTPLTSIKGFAETLRLDIIDDEVDRLTRLISDILTLSDIEEFDVNEIIKAVCEKSASRKNISLKVGLIDNDKFKQMILVVDD EEHIVKLIKFNLENNGYKVITAADGGEALEKAKGEVPQLVLLDLMLPVMDGYDVCREIRRDQ-MPVIMITAKGEELDKILGLELGADDYITKPFSVRELV ARVKAVLRRFKFGNIQIDFQRHNVTKEGEKVELTLKEFELLQVLIKNKGRVMTRDFLLDKIWGYEYIGETRTVDVHVRHLRQKIEDDDKNPKYIETIRGI GYRF >seq_45 KKLENIRSQFVANVTHELKTPLTSIRGFAETLKLNIINDESDRLTRLINDILALSHIENISINEMLEDIVENSAKDKNIELFIGIKDRDRFKQMILIVDD EEHIRELLRYNLEKEGYKIFCAENGKEALEIAKEKKPTLILLDVMLPQMDGYDVCKEIRKDNTTPIIMITAKGEELDKVLGLELGADDYITKPFSIRELI ARIKAVLRRFKFEGLKMDFEKHEVMKDGEKVDLTLKEFQLLEILIKNKGRVLTREYLLDKIWGYEYIGETRTVDVHIRHLRQKVEDDDKNPKYIETIRGV GYRF >seq_46 EKIEQERREFVANVSHELRTPLTTMKSYLEALDLHVTQNETERMIRLVNDLLQLSKIDWVDLGPYLHDMIEIIAKEKDIHFIRIVEDQDKLTQVILVVDD EKPIADILKFNLEKEGFTVICAYDGAEAVEQVNQEKPDLILLDIMLPNKDGMEVCREVRKQFEMPIIMLTAKDSEIDKVLGLELGADDYVTKPFSTRELL ARVKANLRRISVGDLSIHPEAYLVKKRGESIELTHREFELIHYLAKHLGQVMTREHLLQAVWGYDYFGDVRTVDVTVRRLREKVEDNPSYPTWIITRRGV GYYL >seq_47 EKMDQERREFVANVSHELRTPLTTMRSYLEALALMVTQNETERMIRLVNDLLQLSKFDWIQIVRFMSLIIEMT-KEQHVEFIRLVEDQDKITQVILVVDD EKPIADILEFNLRKEGYEVHCAHDGNEAVEMVEELQPDLILLDIMLPNKDGVEVCREVRKKYDMPIIMLTAKDSEIDKVIGLEIGADDYVTKPFSTRELL ARVKANLRRIHIGSLVIFPDAYVVSKRDETIELTHREFELLHYLAKHIGQVMTREHLLQTVWGYDYFGDVRTVDVTVRRLREKIEDNPSHPNWIVTRRGV GYYL >seq_48 EKNERERREFVSNVSHELRTPLTSMRSYIEALSLKVTLEETDRMIRMINDLLNLSRMDYVNFNELINFVLDMMIENKNYKICRFVEDTDKVIQVILVVDD EKPISEIVKYNLVKEGYEVFTAYDGEEALEKVEEVEPDLIILDLMLPKMDGLEVAREVRKTHDMPIIMVTAKDSEIDKVLGLELGADDYVTKPFSNRELV ARVKANLRRLTIGDLTIHPDAYMVSKRGEKIELTHREFELLYYLAKHIGQVMTREHLLQTVWGYDYFGDVRTVDVTVRRLREKIEDSPSHPTYLVTRRGV GYYL >seq_50 RM-ESARRDFVANVSHELKTPVGGMALLAEALMFGSLHREAHRMADMINELISLSKLQPVQADDIISEAIQLAADNANIEIIRGVEDRSLLVTAILIVED EESLADPLAFLLRKEGFDTIIAGDGPTALVEFSRNEIDIVLLDLMLPGMSGTDVCKELRSVSTVPVIMVTARDSEIDKVVGLELGADDYVTKPYSSRELI ARIRAVLRRLEGGRVRMDVDSHTVTVGGEPVSMPLKEFDLLEYLLRNAGRVLTRGQLIDRIWGADYVGDTKTLDVHVKRLRSKIEEEPSRPRYLVTVRGL GYKF >seq_51 RM-EAARRDFVANVSHELKTPVGGMALLAEAITGNRLYKEAHRLADMINELISLSKLQPVSVDRVIDEALVLAADNAGIELNRGVMDMTLLVTAILLVED EESLADPLAFLLRKEGFDVVIAGDGPSALVEFDRNAIDIVLLDLMLPGMSGTDVCKRLRAVSSVPVIMVTARDSEIDKVVGLELGADDYVTKPYSSRELI ARIRAVLRR---GRIHMDVERHTVTVDGEEISMPLKEFDLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHIKRLRSKIEPEPSAPCHVITVRGL GYK- >seq_52 RF-EAVRRDFVTNVSHELKTPAGAIALLAETVTSGRISKESARLTELVHHLIDLQKAQRISALDVARAAIQTQADSRHVDIRLVIKDKEAMQTAILIVED EESYREPLVYQLTREGYDVSAAATGEEGLELFTKGGIDLVLLDLMLPGIDGTALCRRIREQSRVPIIMLTAKSAEIDKVVGLEIGADDYVTKPYSFRELL ARIRAVMRRLVCGDIVMQVGQHQVTVRGETVFFPLKEFELLEYLMQNKGRVMTRHQLIDRIWGSDYVGDTKTLDVHVKRVRSKIEEDPAHPKYLTTVRGL GYKI >seq_53 ERVQTVHRDFVANVSHELRTPLTVVGGFLETLSLPMMMEQSRRMQSLVEDLLTLSRLELVDMRVMLDTLMEGLSQGRH---QVLLWSGQELHSAILLVED EPAIQELIAFNLAQAGHHVLRAGTAEAALTLVRNALPDLVLLDWMLPGASGIDVAKRLRADEHIPIIMLTARSDEQDKVAGLETGADDYITKPFSPRELL ARIKAVLRRVQG--LRLDPVTHRVTGNGSVIELGPTEFRLLHFFMTHPERVHSRAQLLDQVWGDHVFVEERTVDVHIRRLRSALE-GTQHDGLIQTVRGT GYRF >seq_54 NQLERIRSDFVANVSHELRTPLTVIRGYLETLLFQQMYQHSARMETIIDDLLLLSRLENVAVPEILKTLCERISGEKQHFIQLAISSEEELKSLLLIIED EIAIRDMVRFSLPDE-FELLDAEDTAKAIKQLAAQIPDLILLDWMLPGKSGIEFIEWIKQKEDIPIVLLTAKAEEENKVRGLMSGADDYITKPFSPDELI ARIRAILRRIKIGDIKINTAKHLVTVKDEHIALSPTEYKMLHFFMKHPNKTYSRDQLITYIWGGNIYIDDRTVDVQVRRLRDKLKKYDHHH-LIKTIRGA GYQL >seq_55 KHQNLLRRRLISDISHEIRTPLNVLQNNLEAMILVKLNEEVIRFGELLNNLDLLKRFEKIPLDKIIRGICLNVAKENNIEIHTIILDENKLKQVILVVED EENVLEIVRAYLEKEGYNVYCTTQGLDGIELFKKVKFQLVILDLMLPDIDGEEVCRILRRISDVYIFILTAKVALKDRIEGLNMGADEYLLKPCSPRELT ARVNALFRR---GNLQICSDKRIVRIKDQQISLTPNEFDILYALVLNKGRVLSREQLIERVFGLDFDGFDRTIDVHIKNIRKKIEEDTKKPKYIITVTKL GYKF >seq_56 EKQDSLRKRLVSDISHEIRTPLNILQNNLEAMILIGLNEEVIRFGKLLDNLNILKEIEKVSLKDILEGVISIIAKDKHIFFEKIICDENALKQVILIIED EEKVSSVLKAYLGKEGYKVISTTSGIEGIEIFKKGNFKLIILDLMLPDIEGEEVCKIIRGISDVYIFMLTAKGDLSNRIEGLNIGGDEYLVKPFSPRELT ARVNALFRRFNNGELKINKIKRIVEIEKRDVKLTPNEFQILYALASNKGKVFTREMLINSALGLDFQGFDRTVDVHVKNIRKKIEKDSRNPKYVLTVTRV GYKF >seq_57 KEQENIRKRLTGDISHELKTPLTNIQSHLEAMILLSVKEEAERLSSLVSDMQKLNKYDNVNISDIICFVISNLAKSKNIKIEY-LYDKDKITQAILLVDD EEKIIEVLEAYLKKEGFSVFSCSNGEEALKIFDNNKIDLILLDLMLPDLSGEEVCKVIRAKS-VPIIMITAKTEEEDLLEGFDIGADDYVTKPFSVKQLI ARIKAIIRRFNNGELKINVDTREVWVRDELITLTSTEFNILLCLSSYPKKIFTRDEIIELVLGDKSDSFDRVIDSHIKNLRGKIEENTRKPKFIVTVYGV GYKF >seq_58 EEQVRLKRQLTQNIAHELKTPVSSIQGYLETIVLERCYAQSNRLSRLLRDISVLTRMDRVDISVLVGNIISLELEEKHISIVDLIKNYSLLYSIILVVDD EEDLCEILKFNLENEGYEVDTANSAEEALK-MDISSYHLILLDVMMGEISGFKMANMLKKDKRVPIIFITAKDTENDTVTGFNLGADDYISKPFSLREVI ARVKAVLRRLTYQSLVIDITKKKVSIDDEEVQLTKKEFEILLLLVQNKGRVFSREDILARIWSDEVYVLDRTIDVNITRLRKKIGEYGK---CIVTRLGY GYCF >seq_59 KRSEESRKQLLSSISHDIRTPLTSIIGYIDALKLKILYMKSNNLKHLVDEIFNMAKLDKLDFSEVTREVLLPEISKHNIKLQVMIIDHLSLMRIILIADD DKEIRNLLKIYLERELYMVDTAIDGEVALQLFNQNNYSIVILDLMMPKVDGIEVCRKLRDKTNVPILMLTAKDHEVDKILGLSIGADDYITKPFSIHEVV ARVKALIRRLAFKGLTINLNTYTVHTNKEEISLTGKELELLKFFTSNPGQVFTKTQLFRNVWDDNYIEDDNTVMVHIRKLRKKIEIDPSNPKFIQTVWGI GYKF >seq_60 KEYEDERRNMITNISHDLRTPLTSLLGYIEFIKINIIEEKGNNLRNLMSEFFQLSKLEKINLSEIIRQSIYNDFNNKEIEPVILVMDKIAVERVVLIVED DKEINNLISKALIKEGFSVVQAFEGIEGYSKYRKIDFSIIILDIMLPYLNGIEIMKKIREES-VPIILISAKDEEYDRIIGLELGADDYIVKPFFVGELV ARIKSQLRR---GDIILDLNNYCIYKNGEKIDLTAKEFGLLKLFFTNPNRVFTKMQIFENVWHEEYRGDDNTIMVNIRRLRNKIEDDPNNPEYILTVWGI GYKL >seq_61 QRTELSMRKMLANISHDLKTPLTVIHGYVETILLEKVQTKTIEIIELINKFFDLAKLEKVAVNELCKNRIFDLIQNEKLQVEIIVLDEKALDRIILLIED DVSISEMVKEHLQLEGYHVCCAFDGEEATRLFEQNTYDLVLLDLMLPRRNGIDCLQWIRTKS-VPVLIMSAKDSDVDKALGLGFGADDYITKPFSMIELT ARVKAAIRRIKIHELVVELSSFTVKKKGEEIKLTAKEFQILKLFVTHPKKVFTKEQLYRSVWNDDYYSDENVINVHIRRLREKIEDTPSKPQYIKTIWGI GYRM >seq_62 EKSEKLRKEFVRNIAHDLKTPLSSIIGYSNIIKISIIENNGERANEMIMELFEFYTLQYQDLSEFLRNLIIYLLEEKDFDFDFILEDNKRLERAILIAED EQDIRNLLELHLLKEGYTVFKAVNGLEALDILKKEHIHLALLDVMMDGIDGFNLLKMLREWSQIPVIFLTAKIEDDDKILGLGLGADDYVIKPFKPVELM ARIKSNLRRIKIGNLELSKEGCTVKKDGEIVTLNAKEFKILEMLMSNIGRVFTKKQIYENVWQEEYLGDDNTIMVHISHLRDKIEEDPKNPCKIKTIRGI GYRF >seq_63 EKSEDNRKRLMLDISHDLKNPLASIRGYSNYLILKIIENNSIRVNDLITDLFELSKFEKMDICELIREIIIPQMEDKDMVYSFYIMNEKNLYRAILIAED EQDIRELLGLHLIKEDYNVYEACNGIEALDVFNKNDIDLAILDIMMPGLDGFKVLKKIREKSNIPVIFITARGEDENKILGLGLGADDYIVKPFSPMEVV ARVKALFRRIENGELSLNKDSCTFYKNGNPIELNAKEYKIMEFLMENKGRVHTKKQIYEKVWGYEYYGDANTIMVHISHIRDKIEDNPKEPIYLKTIRGI GYKL >seq_64 RQAEQAKSELITNVSHDLRTPLTSIVGYVNLIHIQVIYDKVTRLNALMNDLFEYTRVQPIDIVELLGQLTRIQLQEANIDCRPFVLDGDKLVRVILIIDD DKEIVELLAVYLRNEGYNIYKAYDGDEALQMISTYEVDLMILDIMMPKRNGLEVCQEVRENNTVPILMLSAKAEDMDKILGLMTGADDYMIKPFNPLELV ARVKALLRR------EIHKHNHTVKVNGEYIKLTSIEFDILYLLASNTGRVFSSEEIFERVWNEDGYGSNKTVMVHISNLRDKLETGMNGEKFIHTVWGV GYKI >seq_65 EK---TKIDLITNVSHDLRTPLTSILGYLELVEIDIAYNKTKRLKVLIDDLFQLTTLNEINIAELLKQLIMLNFQKAGIRCRLLVLDAVLLIRAILVVDD DIEILNLVSIYLSNEGYEIIKATNGCEAISKINNEKPKLVILDVMLPDIDGIEVCRKIREKLNVPIIILSAKVQNSDKIKGLLTGADDYITKPFNQLEFI VRVKTLLRRIVLGTLTIKKSTHTVLINNNEIILTATEFEILKLLAINKGRIFSAEEIFETVWKEKYFQSNNTVMVHISNLRDKIERKLNGEKLIHTVWGV GYKI >seq_66 RRIEKSKDELITNVSHDIRTPLTSIIGYLGLIETHTAYVKAKQMKLLVDDLFEYTKVRTFDMAQLIEQLAELEA--KKINMQIVMEDTEKLVRVILVADD DKEIVELLSIYIHNEGYEVVKAYDGKEALSKLHTTDIDLLILDIMMPIMDGMEVVKELRKESQIPIIMLTAKTTDMDKIKGLVAGADDYVTKPFNPLEVM ARVKSILRRLEVGPLMINKDSHEVKIEGKEIQLTALEFGILYLLASHPNRVFSADEIFERVWQQESIVSAKTVMVHVSHLRDKIEEATGGEKVIQTVWGV GYKI >seq_67 AKSERLKTELITNVSHDLRTPLTSIITYTELLKIEIIDRKARRLKILIDDLFEASKMSTVDVLQLLQQALGEALEQSSLQLRIVIKDGQKIWRVVLVVDD DKEIREGITIYLKNEGMNVFQAKDGIEALERLNEEDIHLMLLDVMMPRLDGIQTTLKIREARNIPIIFLSAKSEEADKVLGLQVGGDDYVTKPFNPLELI ARVKSQLRRINLNGITLDLDAKEVKAYGEPVKLTPIEYRIIELLMTNAGRVFSIAEIYERVWNEPSQSSENTVAVHIRKIREKIEIDPSNPRHLKVVWGI GYKM >seq_68 VKSERLKSELITNVSHDLKTPLASIINYVDLLKIQILDKKSKRLKVLIEDLFEASKISKVNISEILRQALYEKIEGSSLNFKVIANDGKKTWRVVLVVDD EDEIRDAIGIYLKNEGIKVLKAKDGIEALMVLEEEEVHLIIMDIMMPRMDGIKATFKIRESKKIPIIMLSAKSEDMDKILGLNIGADDYVTKPFNPLELI ARVKSQLRRIVVRGLTLNKDAKTAMVDGKEVSLTLTEYKILKLLMENKGRVFSIEHIYESVWQEP-YNGENTVAVHIRRIREKIEINPREPEYLKVVWGI GYKI >seq_69 VKSQRMKTELISNVSHDLKTPLTSIIAYVDLLKLETLERKSQRLKHLIEDLFEVSKATDVDISYLMKQVVDDKINEAGLDMRLLLPDGQRTYRVVLVVED EKEIAEAIEIYLKNQGYNVFKGSNGLEGLEIVEKEEVHLAVVDIMMPKMDGATMVMKIRE-SDFPIIMLSAKSEDMDKILGLNIGADDYVTKPFNPLELL ARVNSQLRRYSIGGLEVNSDRKEVILDGDVVKVTPIEFKILQLLIKSPGRVFSAEEIYERVWNENAVNTDT-VMVHVRNIREKIEIDPKNPKYLKVVWGV GYKI >seq_70 IKNEKLKTELISNVSHDLKTPLTSIINYVNILQIEILDKKSQTLKKLIDDLFEVSKMSNIDIIQLVYQCIEDVYSEKEIEFKIGVKDPQRMSRVVLVVDD EKEIRDAIDIYLRGEGINVIKAGDGFEALEILDKEDIHLVVLDIMMPKLDGMRTCLKIRESRNIPIILLSAKSEDSDKILGLNIGADDYVTKPFNHLELV ARVKSQLRRIVIKDLTIDTVNKQVSLRGENIKLTATEYKILTLLASHPGRIFSIKEIYERVWEEP-YKSENTVTVHIRRMREKIEINSKEPEYIKVVWGL GYKI >seq_71 REAEKTKNDLITNVSHDLRTPLTSVKGYLLLLKIEIAYNKSEKLENLINDLFEYTKLSSICLDELLEQLVYVICKENNTEIEKIVKDGDKMVRVVLVVDD EKEIRDLIGIYLNNEGFNVIKAENGVEALEILKDTEVQLVLLDIMMPKMDGITTCMKIREDKNTPIIMLSAKGEDMDKILGLTTGADDYISKPFNPLELI ARVKSQIRRIEVGEITINTDTHEVKIGDRTVSLTKREFDILVLLSRNKGVVFSTEKIYESVWHEE-YDCHNTVMVHIRKIREKIEKNPRKPEYIKTVWGV GYKI >seq_72 REIEKTKSELITNVSHDLRTPLTSILGYLNLIKIQIAYNKSEKLRVLIDELFEYTRLSEIALDELIEQLVIPVFLENKIEIKRIIDDGDKLVRVIIVVDD EKEIRDLICIYLENEGYRVIKAENGIKALELLEKEKVDLIILDIMMPNMDGIEACTKIREEKNMPIIMLSAKTEDMDKIWGLTAGADDYMTKPFNPLELI ARVKSQLRRIEIDELTINTATHEVQVGEKKVRLTPREFDILELLARNKGIVFSIEKIYERVWKEE-FKSDNTVMVHIRKLREKIEENPRKPKYVKTVWGV GYK- >seq_73 KESIEMKNEMISNISHDLRTPVTSLIGYADLLGVSILKRKSYELKNQVDDLLEYCQINEVDMKALIEQIMVPQLDDANMSFYISVEDVALIVRLILIVDD DQDIVRFVKANLMQEGFKVFSAHNGEESLEIINNNSIQLAILDIMMPQMDGIELCRRIREKHSLPIMFLSAKSSDVDKVVGFSTGADDYIVKPFSTIEFI ARVKAQLRRINIRGLEIDEASRTVMLYGETINLTKTEYDILFLMAAAKNRVFTIEEIYESVWKERAYESNNTVMVHIARLRNKIEEDPKRTEV------- ---- >seq_74 QQAISSKDQLVVNLAHDLRTPLTSVIGYLDLILLMIAFTKSQRLERLIDELFEITRMNNIDLSDLLLQLKYPLFEKNHLIARTMILDGELLARVILVADD EQEIRDLIAIHLEKEGYHVIKVSDGKEAVDVIGRQVIDLLILDIMMPNIDGYEVARQIREQHNMPIIFLSAKTSDFDKVQGLVIGADDYMTKPFAPIELV ARVNAQLRRLEFGGLVIAPEQRKVTLYGETIELTPKEFEILYLLASHPKKVYSVENIFHQVWGEAYFEGGNTVMVHVRTLRKKLKDDQRKSKWIKTVWGV GYAF >seq_75 KRLEKVREEWLAAMSHDLRTPLSSIQGYGQLLEGEVISEKGAYMLQLIKDFLTFELK-KVDLADLAERTVRHDATLKEAVFAFILYNSRFLERLILIVDD EKAIVKMLETVLRKEGFTIYTAYNAEEAFHYVKNYTLDVILLDVMLPDRSGFDLCPQIRGLTQAYILFLTAKVSDLDKLTGFAIGADDYVTKPFNPLEIV ARINARLRRFTFDRFTVNELAGELLVDGRKVPCPAQVFLLLCYFCKHPNRILSKEQLLEAVWGMDSFVDDNTVMVHIRRIRERIEIDPSHPRYLVTVRGL GYKL >seq_76 EKIQATREEWIAGLSHDLKTPLSSIYGYSMMLEGQVVREKSEYMSKLIEDLLTYRKNDLTSLIPFFKNVIKKNPFSEGYDISFSFADEAWFRRIILIVDD EKAIVDMIKRVLEKEGYRNIDAASAEEAIPVVKANKVDLIVLDVMMGGMSGFEACTLIREYSDAPIFFLTARSSDADKLSGFAVGADDYITKPFNPLELA ARIRAHLKRYTYDYFTFSPQNAELIVGGEAVACSAQLLQLLQYFCEHPNVVLSKDQIYEKVWGYPSYGDNNTVMVHIRKLREKTERDPSNPEYIVTVRGL GYRF >seq_77 KDSIDNRKGLISSISHDLRTPITSIKGYVEGILLRTIYSKAEHVDLMIDDLLLYSKLDKVDIVEYFNYCIEPELKKDNIKINNLVMDRERLKRVILIVED DLSIAELQKDYLEISDFEVKICTDGVSGLNEIKENKYDLIILDVMLPKMDGFDILRIIHDTKDVPVLMVSAKKEEIDKIKGLSLGADDYITKPFSPGELV ARVKSHIQNISIRGLEINKDSRQVIINDKEVNLAQKEFDLLLHMAQYPNRVFGKDELFESIWGLDSIGDSATVTVHIRRIREKIEFNPSKPQYIETVWGV GYRI >seq_78 EKYEANRKELIANISHDLKTPITSIIGYVEGLMLTVIHEKSLGLNDLIEELFLYSKLDKTNFTRFIAHILRLE---QELVITSLVQDPTQMNRVVLIIED DPSIADLQKDYLEINDMTVTIEHDGKKGLEAALNEPFDLIILDVMLPTMDGFEICRAIRKKKQTPIMIVSAKKEDIDKIRGLGLGADDYIIKPFSPNELV ARAKAHMNRLKINEIAVDTAAHKVFVLENEVIFTSKEYKLLVFLMEHPNRVWNKEELFESVWGFDALTEVSTVVVHIKRIREKLKKANLSDSPIETLWGS GYRF >seq_79 SESLETRKEQIGALAHDIKTPLTIIRGNSELLKNDDILNEIKNMEFYIKSLIEITRSEQVNLIKFIDNIMHLISKNKNLSFKSIIFDEIALKRVILAIDD EEGILTIIKSALEKEGHNVTTVSNPTCLLK--DKYKYDLILLDVMMPEIDGFSLCKEIRN-LDCPIIFLTAKTMEQDIVMGLSLGGDDYISKPFGISELR ARVAAHLRRFSISNVKFNITSKEAYYNEKLIPFTKSEYNICEYLAMNHGQVFSKDRIYEKVYGTYGNSDTTAIVEHIKNIRNKLKNMGINP--IETVWGI GYKW >seq_80 ELMKETQKEFLANLSHELKTPLAVVMNTLETLLIGRALKRVKEAISLTETVLSFSKGKEVNLEEVLSEVLEENIKEKEISVEILLKNREEIKVIIALIED DKDLAFLVKLNLEREGFEVEHFERATPFFKFISENSVDLILIDIMLPDLDGFRIANFLKSRAEIPVIFITAKGEEEDKLKGFELGADDYITKPFSMKELI ARVRAVLKRLKYEGVELDTERQKLFVDGREVYLTPAEFKILKTLMENFGRPVSRSSLVEKLWDYERETTERAIDVHVKHIRDKLGKYKG---LIKTVRGV GYKF >seq_81 TLKNTQLASVISAISHELKNPLSVIDLSLEMLKLEKISRQSVKLNALTHKLVFNLNSEEFDLFSLCEKITKNPG------FERAVKDEFLIEQVILIIED DIDLNELLVLKLKSSGYEVISLADFFGVEDLLDNEQIDLLIVDRNLPSGDSLEKIQDLREQGKEAVIFLTAKALHQDLLEGFESGCDDYVCKPFDFNELL LRIKAILKRLSFGDFILDLVSYEFFYKNQKLEISNLDYELLKCFFENPNTLLTRQFLSESVWKDDT-TSDKTINIALTRLRNKF---PKLKDHIISVRGI GYKL >seq_82 EQHSRSY-HFANRWTHQMKTPLSVIRLILQEQDVDQIEEEVGKLEDGLHMMLSTVRLQRIALLSVIRSVL----HGYR-----IAADKKWLIFVIFVIED DRKMASIICQYFKKYGYEAQYAIDFDMIKQDFLDYRPDLVLLDINLPAFDGFYWCRQLRRLSNLPIIFLSARASEMDQVYAIENGGDDYITKPFHLEVLL AKVKGVLRRLEVDGLFFYRDRNTIELNGQKVECSPKEFRLLICLAEHVENIVSRDKLLEAIWDEIDFVDDNTLNVNIRRGRRRLEDIG-ITDAIQTVRGQ GYCL >seq_83 LKKLDIQKHYLDFINHQMKTPVSVIDLILQEEDLDSIGEENEKISQGLNIMLYNARINDIDILLILRKVI----DNHK--IFPIVQDKKWIYFVIMIIED DKKMAKLIKNHLERYGYKTFLIKDFSNIKDEFLEHKPDLVLMDINLPFFDGFYWCSHIRSHSKVPIIFISSRDSDMDQVMAIDNGGDDFITKPFSYDVLL AKIKGVLRRLKIDDLILYTNKNVLEYKGKKTELSKNEFSLLLHLLKNINKIVSRDTLLGILWSDIDFIDDNTLSVNVTRLRKRLEEIG-ITNAIETKRGQ GYIL >seq_84 SQYINQQKQFTNQWVHHMKTPVSVISLMIQEGKLEELEDENERFRHGLDMMLQTARLETFDLAEMVRSLI----QERR--LTLVISDQKWLSFVILLVED DERIASLLGGHLQKYGYEVKIAEQLNDIKLEFAEMKPDLVLLDINLPFFDGFYWCRQIRTISNAPIIFISARTDELNQVMAIENGGDDYITKPFHLEVVM AKIKSVLRRVELGGLTIYPDQNEAEWNSVRILFSQKEFQLLSIFVREHKKIVSRDELLEALWDDVDFVDDNTLTVNVNRLRRKLENAGLT-DCISTIRGQ GYQF >seq_85 TALEQEKDDLLAWI-HEIKTPLTAMHLIIDRLEKGQLTYEWMRIHLLLDQQLHQKRIPKVNLESVLHQEIQSWCIQKGIGFDLLVLDAKWLSFIIMLIED DHTLFSEIKERLSQWSYDVFGVSNFEKVIEEFTSLKPDLVIIDIQLPKFDGFHWCRMIRQHSNVPIIFLSSRDHPTDMVMSMQLGADDFIQKPFHFDVLV AKIQAILRR------TIDYESNTVSNHVGSIELSKNEFFILKRLIERKNKIVTRDDLIRSLWEDERFISDNTLTVNVNRLRKRLDELGLGA-YIETKVGQ GYM- >seq_86 QAINQDQKDYIDSWVHEIKVPLAAITLLVQSVE-YLLENELGKIDEYVEQVLYYARLDEYSLKEIVQSVVANYFIQKRLQFSIGVLDRKWVIFIIMIVED ETTIRELISEELQKWQFETIGTTDFNDVLDDFQEENPQLVLMDINLPVYDGYYWCQKIREVSKVPIIFISSRSTNMDMIMAMNMGADDFVTKPFQIDVLI AKINALLRRLSHNGITLNVDNGRMEIRGEMIDLSKNEYRLLYLLMKKHGKILTREKLLRALWDDERFVDDNTLTVNINRLRKKIEQAGIAG-YIETKVGV GYM- >seq_87 KKEKEHLADSLADIAHQLRTPLTSANLILSLLAVRETEELLVRMDWLITSLLKLSRLDQINVNNLICAALLIPMELHDIDLQIAIQDSGWLSEAIFLVED DKAIARNLMLLLRSEGFTVTHAPTRSEAFAALAGNKFDLALIDISLPDGNGFTVCTEIKETQDVPVIFLTASGDEASVVTGLNMGADDYITKPFRSRELI ARIGTALRKFEIRGLHVDTASGIVKKNGNEVFLSALEYRLLLVFISNPKSIITRGRLLDELWDAAGFVNDNTLTVYIKRLREKIENDPASPQIILTVRGT GYRL >seq_88 KEDKIFLKNIISDISHQLKTPLTSLLTINELLLLQKSGSQLNRMEWLIISLLKIARLENTLGINFIQDALKFKSDAKHQKIMVGFYDEQWTIEAILLVED DSILAYSIEYTLNGEGFNLKCADSIKKARDNFKENSFDLIILDVMLPDGNGYDFCKEIRKASEVPVIFLTACDEEANVVIGLDIGADDYITKPFRIKELI SRIRAILRRLISGDIKVQTLQGKIEKNDKEIILTALEYRLLLIFINHPKQILSRNIILEELWDAEGFVDDNTLSVYIRRLREKIEDNAGEPKYITTHRGL GYRW >seq_89 AETDRLRTALLTSISHDLKTPLAAILGSAGTLKLSTIVEESERLNRFISNLLDMTRIGLHFLGDMVGTALAKIVVRH--E--VLDIKVDPVLFEVRILIV DDEPPIRKLLRVGLSTQGYAVSEAMNAKVAMELVREDKPDLVVLDLGLPDVSGHDLLEKWRSDGALPVIILSSRTDEAGIVRALELGADDYISKPFGMNE LVARIRVALRHFQTGDLSVDLVRRIVKVEDREVKLSPKEYDILRVLVQHAGKVLTHRFLLEHVW--DELTDVQYLRVYVRQLRQKIEKLPDQPRYILTET GVRL >seq_90 VESERLRSALLTSISHDLKTPLASVLGAASTMRLATVIDESERLNRFIANLLDMTKLELHDLSEIVGSALAKILTAH--K--VLVLQLDAVLFEIKVLVI DDEPPIRKLLRMGLTTQGYEILEASNGKIALEKLTE-EPALIILDLGLPDIQGHELLRTIRARNAVPIVVLSSRGDEAGKVQALDLGADDYLTKPFGMDE LLARLRAALRHFRTGDLSVDLVRRIVKVGEREVKLSPKEYDLLRVLVQHAGKVLTHRFLLKELW--DELTDAQYLRVYVRQLRQKIEADPERPQYVLTET GIRL >seq_91 RRADRFRSALMNSVSHDLRTPLSTVLGASTTLIVVSIREEAERLNRYVANLLDMTRIEWIDVRDVLNAAASRRLGGR--ELARFIQDASLLEQVILVIDD EPQIHRFLGPALDAAGYEALRADSGQEGLRGIALWSPDAVVLDLGLPDMDGKDVLEKARAFYEGPILILSARDREVEKIEALDRGANDYVEKPFGVGELL ARLRVALRQLTFGDVIIDLDLRLVTKAGAAVKLSPKEFELLARLALSPGKVLTHKELLVGIWGASHADDTQYLRVFVGQLRQKLEDDSAHPRLILTEPGV GYRL >seq_92 ES-ERLRNALLAAVSHDLRTPLTSLLGMAETLQVAAMQDQARRMHALVVNLLDMARLQWQSVEELVGSALNAALQDRPVTVAPLVEDGILIERVVLIVED DAHIRRFVRSALEAEGCEVHETDTVRRGLIEAGTRQPDAVVLDLGLPDEDGMALLRELRGWTELPVLVLSARSSEGDKIAALDAGADDYLTKPFGVGELL ARLRVLLRR----DVQIDLARRVVTRAGEHVHLTQIEYRLLAALVASRGKVMTHRELLREVWGPSHVESHHYLRIYMGHLRQKLEADPARPRYLLTEIGV GYR- >seq_93 ES-ERLRNSVLSVVSHDLRTPLTTMLGLANMLNAQSIQDEAIRMTKLVTNLLDMAKLQWQLLDEVVGSAVEHSLRHHKLELKL-LEDAVLIERVVVIIED EKPIRRFLSAALEEEKLTVYEAETGKQGQIEVATRKPDLVILDLGLPDMNGIDVIKSLREWTDIPILVLSARTQETEKVAALDAGADDYLTKPFGVAECL ARIRVLLRRFQFGDIRVDLVNRLVHKGDAPVHLTPIEYRLLSTLIRNAGKVITHRELLLAVWGPSFSEHNQYLRVYMGHLRQKLEDNPAMPRHIVTETGV GYRL >seq_94 ER-ERLRSNLLRSISHDLRSPLAGIKGAATILEINGIYEDTEWLIRLIENLLSMTKFDVELVEEVVSEAVSKYFKNH--KIKVVVSDGSLIEQVILVVED DKPIRNFITTALSTQGYKYSETDKGNEAIALSMANSPDLIVLDLGLPDIDGIEVIGKIREWSKVPIIIVSARENERQKVEALDKGADDYITKPFGIGELL ARIRVSLRHFKVKNLSVDFEKRRVSVNGKEVHLTPIEYKIMFLLCKYSGRVLTHNFIINEIWGASVGNENQSLRVFMANLRRKIEKDPAQPEYIYTEVGV GYRL >seq_95 EQNYQKQQQFVSDASHELKTPLTVIESYASMLRVEAIHEEAVRMKAMTEQMLQLANEDEVDLLEVAEKACQHIHVSFEREIQVVVWDENKLKQVILIIED EKKIARVLQLELEHEGYETDAAFSGSDGLETFQAHAWDLVLLDVMLPELSGLEVLRRIRMTDVTPIILLTARNSIPDKVSGLDLGANDYITKPFEIEELL ARVRACLRTLMFQELTINEKTRDVQRGNETIELTPKEFELLVFFIKNKGQVLSREQILTNVWGFDYYGDTNVIDVYVRYLRKKL-----SLTEALQTVRG VGYR >seq_96 KEHYDKQQQFVQDASHELKTPLTIIESYSSLMKIEAIHSEAVHMKKLTNQLLALAKSHTIDLIKAARAVMQ-SVYQRDILLET-VKDEERIKQLILIVED EEKIARVLQLELEYEGYSVTIKHNGTEGLDAAAEGGYSLVLLDVMLPGLSGLEVLRRLRKTDQTPVILLTARDSIPDKVTGLDIGANDYVTKPFEIEELL ARIRAALRQLTYDDLRVNEKTREVRRGDKEVELTPREFDLLVYMLKHPQQVLTREQILSSVWGFDYIGDTNVVDVYIRYIRKKL--DYPYEKQLIHTIRG VGY- >seq_97 RRYIEQQEQFVEDVSHELRTPVAIMEGHLNLLNLKASLQEISRMKSLVQEMLDLSRAERTDAKQVVYQVFQLVYPE----FHILVEKRNHFEQLILIIED EKNLARFVELELKHEGYTTEVHYNGRTGLEAALNNEWDAILLDLMLPELNGLEVCRRVRQVKNTPIIMMTARDSVIDRVSGLDHGADDYIVKPFAIEELL ARLRALLRRITYRDLTIEKENRVVRRNSEMIELTKREYELLLTLMENVNVVLARDVLLNKVWGYETEVETNVVDVYIRYLRNKI-DVPGEESYIQTVRGT GYVM >seq_98 --LKKKMQQFISDASHELRTPLTSIRGFVEVLLLNTILIESERLTELVNNLLTLTRLDKQNMSDIIEEIYKILAKNRQVNLNLYFLNKNQIKQVIMVVDD EEHIVEFLKMGLEAEGFTVYTAFNGNDAVIYSRKLNPNLIILDVMLPYMNGYEVCSLIKKASNIPIIMLTAKDEIDDKVLGLNLGADDYMIKPFSFKELL ARINARLRNVVIGPFLIKDSAHEIIYKDDILSLSPTEYNLLKYLLLNNGIALSKDQILEKVWGYDFTGEKNIVEVYIRYLRDKISDK--NHTIIRTVRGV GYK- >seq_99 EASFEAQRRFTSDASHELRTPVTAISGHASYLLLNIIRSESERLTNLITSLLQLARSDPIFSRLFLDDVSAPLA-TGGSELRVGFEDPDRLRQVVLVIED EKDIARFIELELAAEGYATEVAFDGVTGLSKFREVNPDLVILDLMLPVLDGLEVARRIRKTSNTPIIILTAKDGIQDKVEGLDSGADDYLIKPFSIEELL ARVRAHLRRVRVADLVMNLDGREIFRGGRRVELSAKEFELLELLARNPGKVFSRFEIEEKVWP-EYTGGSNVVDVYI-YLRRKLEEG-GERRLIHTVRGV GYVL >seq_100 EKSFNSQKMFVSNVSHELRTPMAALTAELDLALIGNALQDSRRIINLIDGLLNLAKADEVRLDELLLDARLKAHPDYHIELEQAVINSYLLTTAILIIED EPRVASLLMNGLEENGYQTMVAYDGLMGLRLFQAHTFDLVISDIVLPKMDGFELTKEIRKSNRIPILMLTALGSTNDKLDGFDAGADDYMVKPFDFRELN ARIKVLLKRLVYADLRIDLQRKDVERNNVSIKLSPKEYNLLLYMVENAERVLSRVEIAEKVWNTHFDTGTNFIDVYINYLRKKIDRNF-EPKLIHTKAGM GFIL >seq_101 DSAFKAEKSFVSHASHELNNPITAIQGECEISLLQRISSESKRISNLIRHLLFLSRQDAMSLPDMLNDLIE------RIRLHYEVKNPYLLKIAILLAED EVNIASFIERGLKEFGHSVTVCHDGNTGWRILQEEPFDLVILDIIMPKINGLELCHLYRQMFQAPVIMLTALGTTEDIVKGLDAGADDYLVKPFSFQELE ARIKALLRRLTCDNLVLDCNTRRAKRGDADIDLTVKEYRLLEYFMTHQGVALSRITLLKDVWDKNFDTNTNIVDVYVNYLRVKIDRDFDK-KLIHTVVGL GYIM >seq_102 ETAYNDQRQFVDDAGHELRTPITVVRGQLELLEIELATTELDRMARMVNDLLTLAVAD---PVDVTELTIDLEDKARTGRVMLDVIDEQRVTEAILIAED DDGIADFIRRGLVQEGFECEVAVSGAAAFARAHSGDFDLMILDLGLPHMDGADVLEQLRVLKSLPIIVLTARTNIEDRIRSLEGGADDYMPKPFQFAELL ARVKL--RLLKHGTLELDLRTQKVFVADKWRDLSRREFDLLETLMRHPGQILSRAQLLNMVWDMSFDPGSNVVDVYIRALRKKIG-----TEKVETIRGS GYRL >seq_103 EIAYNDQRQFVDDAGHELRTPITVVRGQLELLAIELATTELDRMSRMVNDLLTLAVADHAHPTDVTDLTIDIEDKARTDRILLS--DEQRVTEAILLAED DAGIADFIVRGLIREGFECEVTESGAEAFARAHSGDFDLMVLDLGLPHMDGTDVLEQLRNLQTLPIIVLTARTNIEDRLRTLEGGADDYMPKPFQFAELL ARIKLRLAKLRNGDLELDLRTQRVLIDGSWHDLSRREVDLLETLMRHPGQILSRVQLLRLVWDMDWDPGSNVVDVYIRALRKKIGAH-----RVETIRGS GYRL >seq_104 RQVEARRRQLLGDISHELRTPMTAIRGEAEVTLLQRITLASGQMGALIEDLLMMARSDDMDPRQALEEAISPIAHVREVELRIIVADGQRLQQVILVVED DTRVADFLERGLKAEGYKVRVARDGVSGLEAARDDQRGVILLDLMLPKMTGMEVCQTLRASGLTPVLMLTALGAVDDRVTGLRTGADDYLVKPFSFEELL ARIEALLRRLKAGNVELDRTTMRVSRDGEEVVLTARELALLELFLSSPGRVLSRERILSNVWGVDEDPLTNVVDVYVRRLRAKIDP-PLASSFITTVRGL GYRL >seq_105 QEKESQMRRFVGDASHELRTPLTSVKGYSELYHLSKISGEAQRMSVLVEDLLSLTRAEPVDVLELSLSVARAAWPERSITVVNTVEDATRLHQVVLVVDD EPNIVELLTVSLKFQGFNVMTASDGNEGLKIAREFRPDAYILDVMMPGMDGFELLTKLRAEGDGPVLYLTAKDAVENRIHGLTIGADDYVTKPFSLEEVI TRLRVILRRLTYADLTLNDETHEVTKAGELVDLSPTEFNLLRYLMLNAEVVLSKAKILDNVWHYDFGGDGNVVESYISYLRRKVDTHEPQ--LIQTVRGV GYVL >seq_107 RDKEAQMRRFVGDASHELRTPLTSVKGYAELYRIEKIEDEAKRMSLLVEDLLALTRAEPVDLLDLALNVSRGAYPDRDIDVRSCVEDAARLHQVVLVVDD EPNIVELLKVSLKFQGFEVETAQSGIEALEKARSFQPDAFILDVMMPGMDGYELLPKLRADGEGPVLYLTAKDAVEHRIHGLTIGADDYVTKPFSLEEVI TRLRVILRRLVYADLTLNDDTHEVTKAGQVVELSPTEFNLLRYLMLNAEVVLSKSKILNNVWHYDFGGDGNVVESYISYLRRKVDTQEPA--LIQTVRGV GYVL >seq_108 EETTDKMKRFVSDASHELRTPLAAIHGYAELMQIEHIERSSQRMTVLVEDLLSLARLDTVKLSSLVTDAVLEPARDHPAEFSLELPDASRLRQVIVVVDD EPSIRELLVASLHFAGFEVNTAASGSEAIEVIEKVQPDLIVLDVMLPDIDGFTVTRRIRQEGNAPVLFLTARDDTQDKIMGLTVGGDDYVTKPFSLEEVV ARIRAILRRIRVADLEINEDSHDVTRAGQPVDLSPTEYKLLRYLMDNEGRVLSKAQILDHVWQYDWGGDAAIVESYISYLRKKVDGIEVDDPLIETKRGI GYMI >seq_109 RVSRMRQSQLVADAGHELKTPLTSMRTNIELLMECDVLAQMEEMSTLIGDLVDLTREEPVHLNRVLETAVERR--RHDVEILT-LNDGFSLTRAVLVVDD EPAVRESLRRSLTFNGYNVVTAEDGVQALEVIEREHPEIVILDVMMPRMDGLEVCRTLRSQGDRSILILTARDNVSDRVGGLDAGADDYLTKPFALEELL ARVRSLVRRLTFGDLTLNPETRDVSRAVRPISLTRTEFALLELLMKNPRKVLPRNTILEEVWGYDFPTSGNALEVYIGYLRRKTEQEG-ESRVIYTVRGV GYVL >seq_110 RESRTRQSQLVADAGHELKTPLTSMRTNIELLLLDGVLAQMTEMSDLIGDLVDLAREEIVDLNQVLEIALE----SRRMTVRIVLLDDFSLTRAILVVDD EQAVRDSLRRSLSFNGYNVVLAEDGIQALEMIDKEQPALVILDVMMPGMDGLEVCRHLRSEGDRPILILTARDNVSDRVGGLDAGADDYLAKPFALEELL ARVRSLVRRLSCGDLTLDPESRDVYRNGRAISLTRTEFALLQLLLKNQRKVLTRAQILEEVWGCDFPTSGNALEVYIGYLRRKTELEG-EDRLIHTVRGV GYVL >seq_111 RRSRQRQTELVADAGHELKTPLTSMRTNIELLMLQSVIAQMEELSTLIGDLIDLARQE--DLLDVVNSSVRRR--RPDVRFEIALEDSFALGRAIVVVDD EQAVRESLCRSLSFNGYEVHLAQDGVEALEVIEREQPELVILDVMMPRMDGLEVCRTLRGSGDRPILVLTARDGVSDRVAGLDAGADDYLPKPFALEELL ARVRSLLRRLSFEDLRLNPDTRDVTRGGRPISLTRTEFALLQLLMANSRRVLSRASILEEVWGYDFPTSGNALEVYIGYLRRKTESEG-EPRLIHTVRGV GYVL >seq_112 EHTEQVRRQMLSDLAHEMGTPLSVLTAYLDGLQ-TIMADQLTRLTRLMEDIDYVSRAQEEGLGDLLHTAAGEAYADKGVDLQVTVVDRQRFGQVVLVVDD EQPLAQMVASYLIRAGFDTRQAHTGTQAVDEARRFSPDVVVLDLGLPELDGLEVCRRIRTFSDCYILMLTARGSEDDKISGLTLGADDYITKPFSIRELV TRVHAVLRR---GRIHMDVERHTVTVDGEEISMPLKEFDLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHIKRLRSKIEPEPSAPCHVITVRGL GYK- >seq_113 EQTEQVRRQMLSDLAHEMGTPLSVLAVYLDGLQ-KVISDQLSRLTRLTEDMDEVSRVQEENLAEVLSTSLREAYQAKSVDLQLIVVDQQRFGQVVLIVDD EKPLAQMVETYLLRAGFETVQAHTGIDAVHEARRFSPDVVILDLGLPELDGLEVCRQVRAFSDCYILMLTARGSEEDKITGLTMGADDYITKPFGIRELV TRVRAVMRRLIIGDLVIDPSAHTVRVGGGAVDITPTEFDLLLALALRPGRVCSRRELVTEVWDTTWVGDERIVDVHIGNLRRKLGTDARGRGLIDTVRGV GYRL >seq_114 EHTEELRRNMLSDLSHEMNTPLSVLLVYVDGLQHAVFAEQLGRLSRLTSDLDDVSRAQTVAIGGLIHNAAAGSYQEKGVALEVGIRDSQRFAQVVLIVED ERPLARMISLYLSKAGFDTTTIHDGAAAPDKVAHLRPDVVILDLGLPGLDGLEVCKRIRAFTDCYILMLTARGSERDRITGLEIGADDYITKPFNIRELV IRIQSVMRRLTYGHIELDTLAHEVTVKGVGVTLTRTEFELLQALMHKPGEAVSRRDLVSQVWDTTWVGDERIVDVHIGNLRRKLEAPAPGSHFIDTIRGV GYRM >seq_115 EHADKMRRDMIANVSHELRTPVSALQAMVENMALESILTQTQRLSDLIAFLLDLSRMEKFNFADFLDETIEIADGGHAHDIRVVMEDQDRLRQLVLVVED EPTLATAIAQRITAEGWTARVAGDGASAVQAASQLRPDLVIMDIMLPVMDGLEATKRIVAER-VPVLILTARDDEADKVIGLGAGADDYMTKPFSMRELI ARCKALLRRLDFGSMVIDPAQRIVTVNGEQVHLTPTEFDLLATLARRPKSVLTREKLLEEVWDWVDASGTRTVDSHVKALRHKLGADT-----IRTVHGV GYAF >seq_116 SRQINLQKQFTSDVSHELRTPLTTVRMAADLIVSELMVRELDRFEALLADLLEISRHDTLDIRICISSAHDHLAQELGVDIIIVIKDSRRIERIILVVDD DPAISEMLTIVLEAEGFEPVAVTDGAVAVDAFRTESPDLVLLDLMLPGMNGIDICRIIRQESAVPIVMLTAKTDTVDVVLGLESGADDYINKPFKPKELI ARLRARLRRIEIGDLTIDVLGHEVTRGDEEIQLTPLEFDLLLELASKPGQVFTREELLQKVWGYRNASDTRLVNVHVQRLRSKIEKDPENPHIVLTVRGV GYK- >seq_117 SAQINLQRQFTSDVSHELRTPLTTVRMAADLIASQLMNRELDRFESLLSDLLEISRHDLHDVRIPVRSALQHLATELDVELLVLIQDSRRIERIILVVDD DPAISEMLTIVLSAEGFDTVAVTDGALAVETASREQPDLILLDLMLPGMNGIDICRLIRQESSVPIIMLTAKTDTVDVVLGLESGADDYVNKPFKAKELV ARIRA---RIEVGDLSIDVPAHTVKRNGAEISLTPLEFDLLLELARKPQQVFTREELLGKVWGYRHASDTRLVNVHVQRLRAKIEKDPENPQIVLTVRGV GYK- >seq_118 EEASVFQKRFVSDVSHELRTPVTTMRMASDLLEVELLAGQISRFQDMLADLLEISRYDETDLCEPIETAVDGIAQAKRVPIHTLTRDSRRVIRIIFIVDD DQAIGEMLSLVLENEGFQTVTCLDGLRAVEMFPIVKPDLILLDVMLPGLDGTGVARRIRATSNVPIIMLTAKSDTLDVVAGLEAGADDYVPKPFKVAELL ARIRARFRILERGPIVIDRLEHTATKDGKDLNLTPMEFELLFMLAAAAGEAISRSSLLKNVWGYENSGDTRLVNVHVQRLRAKVEDDPENPQIVQTVRGI GYKF >seq_119 AQRLDGIMRMTHQLSHEVGNMIGIITGSLGLLEINRIRKAADRGRQLASSMLSIGSQQYVDVAALLKGMVEIAVGTRN-QINLLLADAALFEQSILIVED DPDMAELIADLVEAEGWSPLVVHSAEDAAATLGHEQIDLVLLDHNLPGISGRTFAQRIRTELNVGIVMVTAAGSATERVLGLETAADDYVVKPFEPIELT ARIKAVLRRLRLGDWAVDLSGRRVCLADRSRTLTTAEFALLEILAETPNKPVSRSHILDRLGAESDRFIDRNVDVLVLRLRRKIERNPDLPQHIQTRRGK GYVL >seq_120 ARALEAQRQVHALLSHELRTPAATISAAAQSLELARIRRAVTRMIELMNQVLSPERLRPIELGELARDTVRLDT-AHPLVLNA-AWDPLLTALVIGLLED DEDFRDELALGLGGYGFNVVACGDAPSLYRFLQSQSCDIVILDANVPGEDGFSVATRLRAQSTTGIVMLTGRGALEDRVRGLEGGADVYLTKPVDLLELS SVIRSLARR------------------GVSLHLNAQERIFMTSLLQSGGDAVTRQMLAEAFQPNPDDFEPRRVDVMVSRLRAKAQSAGF-KLPVLSVRGQ GYVF >seq_121 RRREAELREATAVLSHEFRTPVAALRGVLEALEVRQGLQETERLARLVEDLVGFRRARTLPLAEAFARAEAAEVAARQVKLAF-VRDPDKLLQVVVVIED ESTVRDVLRFHLERAGLRVSAYASTQAAEEAGALAGADALVLDWMLPGESGIGFLRRLRTDNRLPVLMLTARAAEAERVEGLETGADDYLTKPFSAAELV ARVRALLRRLGNGPLTVDLAAAEAQLAGRTLNLTRREFDLLAFLTANAGRVYSRTELLDRVWGADFLGGERTVDQHVTQLRAHLGDDPGKPSFLETVRGK GY-- >seq_122 EALMAGLKEVSDNIAHDLKTPLTRLRNRAEEALLERTIEESDGLIRTFNALLMIARAEDFDAADVAGGIHEPLAEDDGMTLKVAVHNRELISQALLIIED DRESADYLVKAFREVGHIADHAPDGEEGLAMAENGDYDVLVVDRMLPKRDGLSLIGALRDKGTAPVLILSALGQVDDRIKGLRAGGDDYLPKPYSFAELL ARVEVLSRRYRVGDLELDRLSHRVARGKDELTLQPREFRLLEYLMKHAGQVVTRTMLLENVWDYHFDPQTNVIDVHISRLRSKIDKGFERP--LLHTIRG AGYM >seq_123 LELNEGLKQVSDNIAHDLKTPLTRLRNRAEEALLEDIIGESDQLIRTFNAILMISRLEDMPVAPIMRDVAEPVAEDAGVTLTLALHNRELVGQTILVIED DREAARYLEKAFAEAGHSADIAGDGETGYALAENGNYDVLVVDRMLPKRDGLSVVAGLRAKGETPVLILSALGEVDDRVTGLRAGGDDYLTKPYAFSELL ARVEVLQRRYRVGDLELDRLTHTARRQSVDITLQPREFRLLEYLMRHAGQVVTRTMLLENVWDYHFDPQTNVIDVHISRLRSKIEKGFDEP--LLHTVRG AGYM >seq_124 EKLNEGLRQVSDNIAHDLKTPLTRLRNKAAAALLEGIIAESDQLIRTFNALLMISRVEDVDMSAIAADSAEPVAEDAGLALESIVQNRELIGQAILVIED DLEAAAYMTKAFREAGIVADHASDGESGLFMGCENAYDVLVIDRMLPRRDGLSVISELRRRGETPVLILSALGQVDDRVTGLRAGGDDYLPKPYAFSELL ARIEVLGRRYRVGDLELDRLSHDVRRGGKEILLQPREFRLLEYLMKNAGQVVTRTMLLENVWDYHFDPQTNVIDVHVSRLRSKIEKDFEKP--LLKTIRG AGYM >seq_125 ETLMGEVKQVSDNVAHDLRTPLTRMRGRLEKAYIGDTIADLDAVLGMFASITRISEIEALDLAEIAGEVVDAAAEQVATRLSLGITDRDLLFDAILVVED DPETAGQLVEELTTSGYEVDLAATGREALSHGAARDYAVITIDRMLPDIDGITVMRQLRDDGAAPFLIISALGEVDDRVRGLRAGGDDYLVKPFSFVELL ARLEALGRRLRVGDLAIDLIARNASRRGRKIPLLPREFQLLEYLVRNEGRVVSRAMLLQHVWDLHFDPSTNIIDVYVGRVRRKVDDAQAYP-LIHTIRGI GYCL >seq_126 ETMINALAGVGNDIAHDLRTPLTRVRLALERGRTDKAIAGIDQSLAIVTALLRLTEIEEVALDEILREVCEPIAEDKRIALGVIVWDRDLLFEAILLIED DAETAEAIVAELADRGFEVQWAGDGVDGLDRARTSSPDAMIVDRMLPGMDGLAVIEALRKDQRTPVLVLSALGAVDDRVRGLRMGGDDYLTKPFAIIELV ARIEALLRRLHVGPLEIDLIERTARRGERELELLPREFRLLEYMMRRNDQLLTRAMLLEEVWNYKFVATTNLIDVHMGRLRHKV-DGPGEVQLIHNVRGS GFVL >seq_127 ELLMKSARHISDTIAHNLRTPLTRIVGALRTARNQHAIESIERLNVLLEKLLQIAELEPCELDVIVADVLGTLAEEKGVSLLRKLHDANLLASACLLIED DQANARYLANGLTELGHMVTVSGDGAGGLENAMKDQWDIIILDRMLPNVDGLSILSTLRALGKTPVLVLSALSAVDDRVDGLRAGGDDYLVKPFSFSELV ARLDALVRRLRIADLSVNVATQKVQRAGVPISLQPQEFRLLVYLMLHAHRIVTRTMLLETIWGYSFDPQSNVIDVQVSRLRRKIDTASATP-LIHTVRGA GYTL >seq_128 ERLETTRRDFVANVSHELRTPLTVLAGFLETLRMAMMHEQARRMQAIVEDLLTLSTLEAVDVGALLRTAREALSNGRH-VFEWIVLSGTELASAILVIED EPKLADYLHKGLSEQSHIVDVARDGVNGRHLALEGDYELVILDVMLPDIDGFAVLAALRAAANTPVLMLTARDRVEDRVRGLEGGADDYLVKPFAFSELL ARVHALQRRKA-------------QRGGRRLDLTAKEFSLLALLLRRQGQILSRTTLAEQVWDMNFDSDTNVIDVAIRRLRGKL-DDPYDAKLLHTVRGM GYVL >seq_129 QEAYQQMEGFNADVAHELRTPLATLINGAQVTLLASNLEELEDLKTLVNDMLFLARADPARLEQEALRVAEATLEAQALHVAVGCANPGLVRRAILVIED EPKLADYLHKGLSEQSHIVDVARDGVNGRHLALEGDYELVILDVMLPDIDGFAVLAALRAAANTPVLMLTARDRVEDRVRGLEGGADDYLVKPFAFSELL ARVHALQRRKA-------------QRGGRRLDLTAKEFSLLALLLRRQGQILSRTTLAEQVWDMNFDSDTNVIDVAIRRLRGKL-DDPYDAKLLHTVRGM GYVL >seq_130 AARLEREEEFTRAAAHDLRSPLAALKVRLQGSLMREALADVDRMQRLTDHLLLLARGTPVDLADLAGEAVRETAPDVRLDFETGIPDEALLTRLLLLVED DPRIALPTVRALEEAGHEAAWEPDGVRGLAAARAGAFDALLLDVMLPGLGGFELARELRAGGEVPIVFLTARSDLADRVEGLDLGGDAYLVKPFELPELL AVLRAVVRRRL------DTSGRQVWWQGELTGLTAREYALLEALTLSRERWFTREELLTKVWGPEFSGDMRVVDVYVSYLRRKL-----APEAVQSSRGL GYR- >seq_131 QQSFEREQAFLRAAAHDLRSPLAALQARVDGTLLRELGRDLTRLSTLANHLLLLARDPPVPLRDLAADAVRELDPLAD----VLVQDRVLLGQALLLVED DPRIAEPTAAALREAGYAVTWAQTGPEGLEAAMLGEFPLVVLDVMLPGLDGFEIAGQLREAEDSPILFLTARGEVSDRVQGLDLGGDAYLVKPFAMPELL AQLRALTRR-L------DTVARTVTWDGQEVAVTGREYELLSALALAPERWLTREELLDRVWGPEFGGEARIVDVYVRYLRRKL-----APEAISSERGR GYR- >seq_132 DDGYERHKRFILDAAHELRTPVAILQTRVETLLRSRILADASRIAVLAEQLLDLQRLGPLDIVGLCRSVVAPLAISQGYNLSFPALDDASLQRAMRLLLV EDEREMAAALSAALHRFDIIVDTVGTLDLARHAIMDSVHDLVVLDRQLPDGDGIQLIEDLRLLPTPPVIVLTAQGGLADRINGLNLGADDYLAKPFAVEE LLARIRALMRR-RLGALEFCFETREVRIKGEFLPLTRRELLILEALLRRQGRTVLRSMLEEAVYNFDDEIQSNALDSHISRLRRKL-AAADAGVEVHGIR GVLM >seq_133 DQGYARHKRFVADAAHELRTPIAILNTRLESLAKTRLLEDAARLATLAEQLLDIQRLDRVDLVRVAQGAAAPLAIAGGYELALATIDAAALERAMRILLV EDEAEMAGALASALKRYDMVVDHAPTLADAEEAISADVHVAVLLDRQLPDGDGLALIPKLRARAGVPIIVLTARGELADRIAGLDSGADDYLAKPFAVEE LLARLRAVMRRIRAGRLAFDVGHREASIDGQPFELPRRELLVLEALIRRIGRTVIRSALEEAVYNFDDEIQSNALDTHISRLRRKLAEA-DAGIEIHGIR GVLL >seq_134 DDGYNRRNRFLADAAHELRTPIAIVRTRADLLPSRQIRADIDRLTRVAHQLLEMQAVGPCDLNKLVEHIAAPIAMDAGYEFDFPFTQASIIEMAMRILLL EDEPEMARALLEALRRRDVLADHVSTISDADALARDGSYDVLVLDRRLPDGEGLNLVASLRRSKSVPILVLTALGNVDHRVDGLDAGADDYLAKPFAIEE LLARLRALHRR-RFGNLGIDPRSNEVSVAGSSIELRRREYLVLEALMRRPNRIVTRSSLIEAVYTLDDEIESNALDAHISRIRKKLAQA-EATVEIRAVR NI-- >seq_135 DNGFEATERFFVNAAHELRTPIAVLQVRIDTLSKTHLQTAIKRLTAIANQLLDTEKYRPVDLNSVVSKVVAPLAIAEGYEISFSVPDAESLERALLLIED DPEMTDALKVALSQHGIVLDAVGDLATAREAIIMADYDIVLIDRQLPDGDGSTFLADLRRAGNTRSIIISALRSTDERISGLNDGADDYLPKPFEIPELV ARMSAVLRRLSAGNVTYDRVSCDVHVNGIRLALTRRELLIIETLLRNRGRTVLRSSLEGQVYSFDDEIQSNSLESNMSRLRRKLGEA-EADIVIKNIRGI GYYL >seq_136 DAYIRMQRRFLGNAAHQLRTPLTLLRAKIDDV-KVELVRDVRRLTSLVSAMLDLARLQPIDLAEITLDVLGPSALDAGIELALRVQVDAAIRSASLVIED EPQIGAYVSRLLGQL-HGVVLVGSIADARQALVNFKYDLAIVDRMLPDGDALEVVTALSRSPRPAIIMLTSKDAKEDVVDGLNGGADDYLGKPFEPQELI ARVRAVLRR-SLGNVELHLGSNEAVVADTKILLRRREALILGALLMRRDRVITRAALIEEIYGFDDEIESNTLEAQVSRLRKKLAALG-GDVEIRSMRGI GYIL >seq_137 SANIETQRRFVADAAHQLKTPLAGLRTQAELALLRQLVTGSERATRLVNQLLLLARAELTDLNAIAYEQTVPQALALSTDLGFGISNAILLAELILIAED DSILADGLSRSLRHNGYAVDAVRDGLAADSALAVQPFDLLILDLGLPHLPGLEVLRRLRARNVLPVLILTAADSIEQRVKGLDLGADDYMAKPFALSELE ARVRALTRRVRHGRLVFDQTGRIATVDDQTLDLSAREVSLLEILLARSGRMVSKTQLVDHLCEWGEEVSTNAIEVYVHRLRKKLEPSGVK---IITVRGL GYCL >seq_138 DASLAAQRRFIGHAAHQLRTPLSGLRLESELMLAERIKAVSDRMIRLGQQLLVMARADRLDLCEWVRASGIPRVRAAQAEIDLAIDDPLLLDEMVLVIED DTTLGHALQEFLADQGYAVDWLTEGDKVLGALAGQPYDLLLLDLNLPGMSGLDVLRQLRQDGQVPVLILTARDGLDDRVAGLDAGADDYVTKPFDLPELA ARVRALGRRIEVGPLVFDTVGREVRANGTRLALSVRELSVLEMLMARSGRVVTKRQIVNSLSAWDADFSENAVEVYVYRLRKRLEGSGAS---IQTVRGF GYML >seq_139 QSALDALRHFTGNASHQLRTPLAIISTQLALSAALKGGASVAHAEHILAQLLRMANIDAIDLVAVAQNVTVPRAADAGIDLGFGISEPLLIGELILLVED NQVLSEGLSALLRGSGYAVDVVSDGASADAAIAAESFDLVILDLNLPEMDGIEVLRSMRSRQKAAVLILTARGTPEEKVKGLDLGADDYMIKPFDITEFE ARVRVLLRRLSFGKVLFDLTSRTFSADGRPLDIPAREVALLEVLFMRAGKVVAKEAIVQSLTGFDDDISANAIEQYVSRLRKRLAPHGLT---VKTARGI GYYL >seq_140 TALARVQRQFLDDASHQLRTPLSVLRTQTAYALLLAMQDGLDRAVRTTNQMLALARAETVDLAELADGVILPAARARQLDLGLVVQVEWLLREAILLIED EAELARWLSRSLAHAGFVVEWADDGLLAERRLAVEEFDAIILDLGLPGMDGHTLLTRIRARDRTPVLVLTARDSLAERVGTLHQGADDFLPKPFVLEELE ARLTALIRRLTLGDLALDTASQRFTVKGQPLALSPREHAVLRALIQRSGEPLNKQQILDRIQSSDSDVNLEAIEVLVHRLRKKLADTGVQ---IVTMRGM GYCL >seq_141 EDGFRVQRQFTADAAHQLRTPLAILRTRIETL-RQALHADIEGMSRIVAQLLEIAELDTADLRAVCTEVVAPFALAQHKDIALGVHNSEMLQRALLIVED NAELSRLVAAGLSAAGYESDIVSSAAEAREAVGSVSYAAMILDLGLPDGDGLSVLRELRRQMPLPVLVLTARGGLQDRVNGLRSGADDYLAKPFAMEELV ARLEAILRRLRLANLVYDTESRQIFIDDQPRIISAREASVLEILLRRQGRVVPKKNVEDHIFGLEGEVASNAVEVYVSRLRKQLTEHGAK-VVIHTIRGV GYLM >seq_142 HTALARERRLIADAAHELRTPLAALKMQAEVALLCKLLDAARRAARLSEQLLDQARLDEVDLATLTAMIIQARAQARHQRIQLVVSDLDSLGILLLLVED DPMLGDALQTALQELGARVDWVRDGAQAQLALVDHAYQLILLDIGLPGQSGLALLRALREGYTTPVLLVTARDQLSDRIRGLDGGADDYIVKPFELDELL ARLRAVLRR-RHGELLVDPAARQVSRAGRPVALSQSEYRTLLALMQRRGRTVSREQLEHDVYGASVALESNTVAVYVHQLRRKLGDD-----LIETVHGY GYRI >seq_143 SLLLGQQRRFIADAAHELRSPLTALSLQAENLRLVPLQSGIERARQLTAQLLDLARVQPVDLPALARELILPLAEARGIDLGLDLDDPESFTLIVLLVED DPMIAEAVSVALKDAAYAVDWVRDGEKAADALRYGEHQAVLLDLGLPGRGGLEVLAALREGGSIPVIIITARDGLGERIAGLDSGADDYLVKPFDLDELL ARLRAVIRRIGNGQITLDPATHTAFCGEERAVLSSREFAVLHALLLQPGRILSRSALEERVYGWDEEVESNAIDYLIHQLRRKLGATS-----IKNIRGA GWM- >seq_144 GQMLEHERRFTADAAHELRTPLAALRVQAEVLALSQLQLGIARAGRLVEQLLALSRLDPVDWGRLAEGLMAAAADGVEWRLDWCLADATLLGLMILVVED DAQIGDGLKMGLQQLGFAVDWLRDGRQALAALPAAPYDAVVLDLGLPGMDGMDILAAWRRAGDEPVLVLTARDALADRVGGLDAGADDYLAKPFALSEVA ARLRALTRRLSHGRLSLDPVARTATLDGALLELTARELALLELLLSSKGRVLSRELIEEKLYGWGQELGSNALEVHVHHLRKKLGAG-----FIRTLRGI GYTL >seq_145 KEAFEREKRFTADAAHELRTPLAALNTHTQVALLLKVLAGVNRGTHVVQQLLTLNRMVWMDLGKEAADIAAPEAIAKNIDLELIVKNNTAISILVLLVED DEFLGDGIRAGLKQYGHTIDWVRDGQAAHDVLTHETFDIIILDLGLPKRSGLDVLKTIREKNPTPVVILTARDTVDDRIKGLDAGSDDYMTKPFDLEELC ARMRAMQRRISHGDITLDPASHVVTLKKKEVMVARREFALLQKLLENAGRVISREQLNQTLYGWGENIDSNALEVHIHNLRKRFGA-----KLIRTIRGV GYM- >seq_146 GQALQRERRLACDAAHELRTPLTVIKTHLQVAQLGHALEGTSRLQRVLEQMLTLARLDRAGADDVVGRAVDATW---GLCWRVCALVALPLELAVLLVED NRLIAQGIVTGLRANGCIVDAVGSAAQADLALATLNVDVMILNLGLPDENGMSLLRRLRTRGRVPILVLTARDSVEDRVAALRAGADDYLLKPFDLDELV ARLHALTRRIEDGRLRLDPARGEVWLDESPVALTRREMALLMALLRADGRILSPDQLKDRLYDYSEEIESNALNVHICHLRRKLGPN------------- ---- >seq_147 RSTIEAEREFTANSAHELRTPIAGALAQAQRL-VENIEKSLQHLAHLAEKLLQMSRAETSDLIPVLELVVSRTAIGSR-RFTNCIGD--AFGIIVLLVED DYVLGEALRDHVAAAGHAVDWFKTLGDAMAATLTMGYGLILLDMRLPDGEGITLLQSLRKRDATPVIILTAHDQVSDRIAGLNAGADDYVVKPFDLNELS ARMLAVSRRIRLPGIEINQVARNITVDGTAQTLSAREWAVLEKLVEHPGAVVSKAQLHDTLYEFGAEIESNTVEVYISRLRKKIGHD-----RVETVRGV GY-- >seq_148 KRALEAERSFTANSAHELRTPIAAALAQTQRLIGEQIEAALHRLSRLSEKLMQLAKAEPIDMGVILRMVVSQPHGNEN-RLSIDIDD--AFAILIILIED DPILGAAVRDHIAAEGHCVDWVSRLDTAHDYMESARYDLVLLDLMLPDGQGLSFLRDLRTKGSTPVIILTALDQISDRIQGLNAGADDYLTKPFDLSELS ARLNAVARRIEIGGLQIDLAAKSVMRDGRALCLTAREWALLEAFLQHPGQTLSKAQLEEHLYSFDTEVESNTMEVHVSRLRKKLGHA-----IIETVRGI GYRL >seq_149 REIVERARTHVGNLAHAIKTPLSVIVNEAGVHAAAKVMEQADVMRDQVAHHLERARI-VTEVAPAIEALREKIHRDRGIMVEAAFRERQDLEEMRLLVVE DDPDLNRQLTKALTDAGYVVDRAFDGEEGHYLGDNEPYDAVVLDIGLPKKDGISVLEAWRRNGTMPVLILTARDRWSDKVQGFDAGADDYVAKPFHLEEV LARIRALLRRLSCGPVTLDTRTGRVSVSGNPVKMTSHEYRLLAYLMHHSGRVVSRTELVEHLYDQDFDRDSNTIEVFVGRIRKKLDVD-----IIQTVRG LGYL >seq_150 RRIMERSRTQVGNLAHSLKTPLSVLVNEARAMG--IVQEQSEAMQVQIQHYLQRARIA-TPVTPVLERLHAKLHPTFNISFRNLFAEREDLEEIRILIVE DDKDLNRQLSEAMIAAGYVVDSAYDGEEGHYLGDTEPYDAVVLDIGLPQMDGISVVERWRRSGTIPVLMLTARDRWSDKVAGIDAGADDYVAKPFHIEEV LARLRALIRRFVCGPLHLDTKTSKASIDGVALKLTSHEYRLLSYMMHHMDEVVSRTELVEHLYDQDFDRDSNTIEVFVGRLRKKMGVD-----LIETVRG MGYR >seq_151 RRLVERARMQVGNLAHSLKTPIAVLLNEARTLEGDLVRAQADAMQAQVQSYLSRARIARVEAEPALERLVRRLNPDKQ--FVLFVLMEQQLEEVRILVVE DDANLNRQLTDALKEAGYVVDQAFDGEEGHYLGDTEPYDAIVLDIGLPQLDGITVVEKWRAAGAMPVLILTARDRWSDKVAGIDAGADDYVTKPFHVEEV LARVRALIRRIMFGPYSFSIPKRELRRGAETIRLTDREQEIMLLFALRAGDTIPRHELVEA----ESEVGERTIDVQINRLRRKIEDDPANPVYLQTVRG IGYR >seq_152 DRSIAREREFSVNVSHEVRTPLAAIRSDSEMMLLVRIVANVDNVSTALESARAMARDERVDLAACMDDAWEVNAEAAGLAFANIHVDRYAMLTVVLIVED DLTIAANLYDYLQVRGFVPDAAYDGRSALALLDEHAFDAMVLDVGLPGMDGHAVLHALRVERALPVLMLTARDGLDDKLAGFAHGADDYLTKPFALAEVE ARLLALIQRRSFGPLQFDSATREVAVHGKPVHLTRKCGMIVEVLLRDPGRVVSREQLENALWGDD-PPSSDALRSQVHLLRRALADAGFD--GIETVHGT GWRL >tr|L0G4T2|L0G4T2_9BACT Signal transduction histidine kinase OX=926556 OS=Echinicola vietnamensis DSM 17526. GN=Echvi_4346 PE=4 SV=1 -EINQLKLRFFTNISHELRTPLMLIKLPLEQLILNSIHNNASRLLKLINQLLEFRKQETNDPIYFIKGVYSAMAKQRNIDFRINVEDVERFTKKLLLVED NVELLSLMKSALKEH-FHILTACNGREGLQVATSHSPDIIISDVMMPEMDGVELCRLLKSESHIPVLLLTARSNHDYQMEGYSSGADDYLSKPFPLDLLI AKIRNLLHTRKKFKEAF---RLKPEIMPSEISISPADTELLEKAIKVAEK-------------------------------------------------- ---- >tr|C7PH22|C7PH22_CHIPD Histidine kinase OX=485918 OS=2034). GN= PE=4 SV=1 -EVHQLKLQFFTNISHEFRTPLTLILGPLEKMIFKLMQKNAYRLLDLVNEVMDFRKVESANPALFLEEIAEEWAVEKQIRFTIRVVDEERYLATILVVDD NSELREFLKGALLPT-YRIIEASDGQMGFEKAKAEMPDIIISDVMMPVLDGIAFCKLVKDNSHIPFIMLTAKTALSSNIEGAASGADFYFGKPLSIELLL ITIRNTLEQRRKARERF---LKDYYYEAKELVTSTKDKEFMDTLLEKIEE-------------------------------------------------- ---- >tr|I8YR86|I8YR86_BACOV Uncharacterized protein OX=997886 OS=Bacteroides ovatus CL03T12C18. GN= PE=4 SV=1 -EFHQTKIRLFTNFSHELRTPLTLIISPIDELVLDLVLKNARRLLLLVNQLMDLQKNQSTDLNAFLLEIYYQIAESKQITFLIQVADTDPFYLTILLVED NEDIRSYVKEHLQRH-YRVLEAGNGEEAFQIVLKEFPDLVVTDIMMPGVDGLELCAMIKNDGHIPVILLTARTMVMHVKEGFLSGADDYVVKPFNIDVLL VRIYNLLLQREKLKSMY---GKNFSLQSMGIETTSADDKFMQKLFEIIQD-------------------------------------------------- ---- >tr|J2SG60|J2SG60_9FLAO Signal transduction histidine kinase OX=1144313 OS=Flavobacterium sp. CF136. GN=PMI10_01844 PE=4 SV=1 -ELDQMKLRFFINVSHEFRTPLTLILNPVEKILANLIQKSSKRLLYLVDQLLDLRKIESLDIVKFSKDIFYDLAKTKDIRFVIKITDKQRFKPVVLLIED NKVLRNHIKNELSDQ-FRVKEASNGVEGLEKIRKYFPDVIISDVMMPKMDGFELCHEVKNDSHIPILLLTARNLDEDILQGYQTGADGYLSKPFNMNILK ARVANLLEARKRSRDRF---SSGGIVPSSDMTINSLDEQFLEKSTKIVIA-------------------------------------------------- ---- >tr|F4C3M9|F4C3M9_SPHS2 Histidine kinase OX=743722 OS=Sphingobacterium sp. (strain 21). GN= PE=4 SV=1 -EIHEAKLNFFTNISHEFRTPLSLIVGPIERLQLHNAYKNANRLLNLVNQLLDFRKQESTNLVRFVEEIVSFLAAKKNVQITIEVWDKAQFQSVVLIVED NVELRRFIVGSLQDK-FKVLEAGDGLEAWPIVESIQPDIVVTDVMMPTCDGISLLRKIKGENHIPVILLTARTADPYMMEAFKEGGDDYITKPFSFKLLA WKIANMLESRKRLKEKF---VQEYLLQPHK-TDVVETNSFMDDVITIVEE-------------------------------------------------- ---- >tr|K2Q462|K2Q462_9FLAO Histidine kinase OX=555500 OS=Galbibacter sp. ck-I2-15. GN=I215_06767 PE=4 SV=1 -QLNEKKFQFFTNISHEFRTPLTLIMNPIQDILHGIIHKNTERLHRLVNELLDFRKLELLNMVHLIEYVIGEEALDKNIDLSIVISDEERFTYTLLLVED NVELREYLEQAFKAQ-YKVLVANNGAEGIKIAKDILPDVIITDVVMPKMNGFDFCKEIKTDSHIPVLMLTAKAKIEDRIEGIEIGADAYMVKPFDIRLLK LRLSQLISSRQLIFNKY---FSAISDIGENSNTTSLDKEFIQKALDYITT-------------------------------------------------- ---- >tr|G0L8H7|G0L8H7_ZOBGA One-component system sensor protein OX=63186 OS=/ Dsij). GN= PE=4 SV=1 -ELDQMKLQFFVNVSHEFRTPLTLILNPVDKILAQTIQRSARRLLHLVNQLLDYRKMDVGNIVAFCEDIFMDLADQKDLAYTIVVKDVSRFRPTVLVVED NKELRTHLVNDLREF-YQVKQAANGEKGLKMAKKHFPDVIISDVMMPVMDGFELCKKLKNECHIPVLLLTAKSLDDDRIEGYHSGADGYLSKPFVTRVLI ARINNLLETKKRLRQRF---SEGGIFPASEVTSNNMDEVFLDKVTKTILD-------------------------------------------------- ---- >tr|E6X3N7|E6X3N7_CELAD Histidine kinase OX=688270 OS=Cellulophaga algicola (strain DSM 14237 / IC166 / ACAM 630). GN= PE=4 SV=1 -RLNEKKLQFFTNISHEFRTPLTLIINPLEDIIHHIIHKNTDRLYRLINELMDFRKLELLDVVEFTKEVTSEEAANRNIHLAIIITDDERFQHTLLVVED NAELRNYLRDEFKQQ-YKVLVAKDGQEGLQMAKEFLPDVILTDVIMPEMDGFTFCKNIKEDSHIPLLMLTAKAKIDDRIEGIGLGADAYMVKPFDMRLLS LRLKQLITSRQLIFDKY---FGSISGADENANASSLDKDFIHKVLNYINE-------------------------------------------------- ---- >tr|C3PYI4|C3PYI4_9BACE Two-component system sensor histidine kinase/response regulator OX=457395 OS=Bacteroides sp. 9_1_42FAA. GN=BSBG_01352 PE=4 SV=1 -EINHAKLQFFTNITHELLTPLSIISASVDELKCPVIADNTVRLIRLIQQILEFRKVENGNVSMFLKKSVSPLVKKQKLSIQLSVNNEERFASTILLVED NEELLALMVRLLHGK-YHILKAANGTEALEILAKQEVDLIVSDVMMPEMDGMELCRRVKTQCHIPLILLTAKTSDEDRVEGYESGADGYICKPLRLSVLF AKIDNLLKRRKRMGVDF---RKQLVFEAKELNYTSMDEAFIRKAVDCVNA-------------------------------------------------- ---- >tr|Q8A1I3|Q8A1I3_BACTN Two-component system sensor histidine kinase/response regulator, hybrid (One-component system) OX=226186 OS=10582 / E50 / VPI-5482). GN= PE=4 SV=1 -ELNHAKLQFFTNITHELLTPLTIISATVDELKYTVMNSNIQRLIRLLQQILEFRKAETGDIAAFVKNAAEPLVKKRKIHFSLKIKDEKRFANTILLVED NGELLHLMTKLLSRE-YNVFTAQNGKEGIAVLEKEDVDLIVSDVMMPEMDGIEFCKYVKGHSHIPMILLTAKNKEEDRAEAYEIGADAFISKPFNLTVLH ARIRNLLKYKERMARDF---KNQIVFEVKDLNYTSLDEDFIQRAIHCVNN-------------------------------------------------- ---- >tr|E5C889|E5C889_9BACE Uncharacterized protein OX=556259 OS=Bacteroides sp. D2. GN=BSGG_0636 PE=4 SV=1 -EFHQTKLHLFTNFSHELRTPLTLIISPLEELMLDMILKNARRLLLLVNQLMDLQKNQSSDLNAFLLEIFYQIAESKQIHFEIQVADDTPFSLPVLLVED NEEIRSYVKGHLEQY-YNVLEADNGSDAFEIVLKEFPDLVVTDIMMPGIDGLELCSLIKNNGHIPVILLTARTMVMHVKEGFLSGADDYVVKPFNIDVLL VRIYNLLAQREKLKSVY---SKNFSLQSMGIETTSADEKFMQKLFKIIEK-------------------------------------------------- ---- >tr|F0SAS3|F0SAS3_PEDSD Histidine kinase OX=762903 OS=10337 / NBRC 100064 / NCIMB 13643). GN= PE=4 SV=1 -EMHQMQLQFFTNISHELRTPLALIMGPVERLLYHTIHNNANRLLNLINELMDFRKVESLNLEAFFVEIEEELAQEKNIDFRIKVIDDERYHKKILVVDD NEEIRTFLKETLNKD-YLIFEAENGEQGLILCKEIYPDLIISDVMMPKVNGISFCKQAKDDSHIPFMMLTAKTSLEAEIEGKESGADFYFAKPINIDLLE ITLRNIFEQQNKLKEHY---QKNYQVQAKELVNNARDKEFLEKLLDIIHK-------------------------------------------------- ---- >tr|K1YLB8|K1YLB8_9BACT Uncharacterized protein OX=77133 OS=uncultured bacterium. GN= PE=4 SV=1 -TLLRNQDDFYLRTAHELRTPLTLIRIPAKQLSLDIILRATARLQRLTEQMFQ-AAIHGIDLKAVMAPLFEEVAERKAITFCLCVRDDDRLNQSLLIIED DDDMQQILKSLLEDK-YQLVITGSAAEGLRKAQEQTPDLVLCDVMLPDGSGFDIIHSLKSHSHIPLILLTAVGDLSGRKTGWEKGADDYIVKPFAKDDLL CRIGGLLANRKRLQEWY---KRKFSYGVDNGQICDNELDFIVKLETQTLI-------------------------------------------------- ---- >tr|I8YUW5|I8YUW5_9BACE Uncharacterized protein OX=997887 OS=Bacteroides salyersiae CL02T12C01. GN= PE=4 SV=1 -ELYQAKLQFFTNASHELKNPLTLILAPLEKLLLSLIKRNTMRVIKNVNEVIDIRKIDYMDVVSFFKEIADGVIEDKNINFVIEISDKDCFKYKILVVDD ESEIRDFLAMELHEE-YEVYTAADGIEGFKSALNSIPDLIISDIIMPNMDGIELCAKIKSNSHIPIILLTAKETHEDRLAGLEVGANSYIPKPFDIRHLR IRIEQLIKYQEVVKEKF---MKKVALVSGETPDAETDDILIQKIINYINE-------------------------------------------------- ---- >tr|K9E092|K9E092_9BACE Uncharacterized protein OX=742727 OS=Bacteroides oleiciplenus YIT 12058. GN=HMPREF9447_03870 PE=4 SV=1 -EFHQAKMHLFTNFAHELRTPLTLIITPFEELMLGIIYKNAQRLLLLVNQLMDLQKNQSNNVYEFVTEIYCQIAQTNEISFTMEVIDQTPFKKPILLVED DKDVREYLHKSLEDE-YEVIEASNGIKGYDKAVQFFPDLVLSDIMMPKRNGLELCSMIKNDGHIPVILMTARSMVVHIKEGFQAGADDYVIKPFSMDVLR IRIQSLLQSREQLKRLY---GKRFSPEVVGVSTNSADERFSQKLYEIIEK-------------------------------------------------- ---- >tr|K9EI48|K9EI48_9BACE Uncharacterized protein OX=742727 OS=Bacteroides oleiciplenus YIT 12058. GN=HMPREF9447_02075 PE=4 SV=1 -ELNRAKLQFFTNITHELLTPLTVISVTLNEIKYSIMDNNINRLKRLLQQILEFRKAESGNITSFIMNSVNPLVKKKKMQLEIAVSDENRFRKTLLLVED NEDLLNVMSHSLARE-YHILKATNGKEAIAQLEKEDIKIVISDVMMPVMNGIELCHYVKKQLHIPIILLTAKNSESDIIEGYEAGADDYITKPFQLTLLL AKIKSLLKNKENLFKDY---SDAIYFKMQKPDIDSTDKVFLQEAINCVYE-------------------------------------------------- ---- >tr|I9QSN9|I9QSN9_9BACE Uncharacterized protein OX=997874 OS=Bacteroides cellulosilyticus CL02T12C19. GN= PE=4 SV=1 -ALNQSKLRFFTNISHEFRTPLTLIVSQVETLMILSIYQNSIQLRELITELLDFRKQEQHNIVHFLYENYLEYASSKQINFNIRVTDKDRFDVKMVIVED NASIREMLENIFRPF-YQVLTAADGEEGLELIQKEMPNIVVSDVVMPKMSGTELCKLIKNDCHIPVVLLTARTAVEQNIEGLRIGADDYITKPFNTNLLI SRCNNLVNSRILLQEKF---SKQPQAYAQMLATNPIDKEILDRAISIIEK-------------------------------------------------- ---- >tr|K5CCR5|K5CCR5_9BACE Uncharacterized protein OX=997888 OS=Bacteroides finegoldii CL09T03C10. GN=HMPREF1057_02271 PE=4 SV=1 -ALNQSKLRFFTNISHEFRTPLTLIVGQVETLLVLGIYKNSLQLRELITELLDFRKQEQHNLVEFLYENYLEYASSKQINFNIEIKDADRFDAKMLIVED NESIKQMLVSIFETF-YQVTTASDGEEALEKVKEEMPSIILSDVVMPRMSGTELCKRIKTDCHIPVVLLTARTAIEHNIEGLKIGADDYITKPFNTNLLI SRCNNLVNSRRLLQEKF---SKQPQAFAQMLATNPMDKEMLDRAMAIIER-------------------------------------------------- ---- >tr|D1PB74|D1PB74_9BACT Two-component system sensor histidine kinase/response regulator hybrid OX=537011 OS=Prevotella copri DSM 18205. GN=PREVCOP_04453 PE=4 SV=1 -EMRDARLQFFTMIAHEIRTPVTLIIGPLESLKLSVIDRNAQRLLLLVNQLLDFNKVQQNNISKLMHAVAEPTFEQKSIRLDIEVEDDGAFAGVMLIVDD DEDMRQFVKAHFEKM-YTVYTADNGKDALRKLEKHPVSLIISDWMMPEMDGPEFCRRVRENSHLPFVMLTAKTDDAAKTESMNCGADVYIEKPFSMKYLE ASVRQLLEMRRLLRSKF---SHTPLEPIAEIAPTQVDNAFLERMSRIIEE-------------------------------------------------- ---- >tr|F3PEZ2|F3PEZ2_9BACE ATPase/histidine kinase/DNA gyrase B/HSP90 domain protein OX=762984 OS=Bacteroides clarus YIT 12056. GN=HMPREF9445_00556 PE=4 SV=1 -LLNQSKLRFFTNISHEFRTPLTIIIGQMEMLLILNVYKNALQLRELISELLDFRKQEQHDLIDFIYENYLEYAATRQIDFHVEVQNETRFNAKMLIVED NDSLREMLAGLFRPY-YDVALAVDGEDGTEKAREFCPDIILSDIVMPKMTGTEFCKQIKTDCHIPVVLLTARTAVEHTLEGFRIGADDYITKPFNTALLI SRCNNLVNGRRILQEKF---SKQPQAPVKMLAINPMDEDLLERTMEIIER-------------------------------------------------- ---- >tr|C3R026|C3R026_9BACE Two-component system sensor histidine kinase/response regulator OX=469590 OS=Bacteroides sp. 2_2_4. GN=BSCG_04488 PE=4 SV=1 -ELYNSKIDFFTNIAHEIRTPLSLIIGPLEYLMLSIIEQNYKRLYALVTQLLDFRKVDTYRIKEIICKVSCLSARQKKVTIDVTVTDQDAFQYAIMVVDD NPEILDFLSKILSEE-YFVISASSGEEALQILEKNNIDLIISDVMMEEMDGFELCGKIKSNSHVPVILLTAKTDTESKIKGLEAGADAYIEKPFSPFHLK AQLLNLLKKRESQQKTY---ASTPLSDLHSAVHNKLDEEFMNKCTEIIQN-------------------------------------------------- ---- >tr|A6EKH6|A6EKH6_9SPHI Two-component system sensor histidine kinase/response regulator hybrid OX=391596 OS=Pedobacter sp. BAL39. GN=PBAL39_17204 PE=4 SV=1 -ELYHAKINFFTHVTHEIRTPLTLINGPLEEVILTIMKKNTDRLLELTNQLLDFRKTEQLDVNELLRDTCVPLIDEKKILMLIVIINKEPFLPVILVVED NPEIRTFITNILQED-FTIVAVENGEKALKALNDVSVQLIISDIMMPVMDGLELCKTIKGNAHIPIVLLTAKNTLQSKIEGLEVGADAYIEKPFSPNHLL SQVVNLLTLRDKIKNHF---AHSPLVHLKSMAYNKADEAFLDKLNDTIIK-------------------------------------------------- ---- >tr|D7NET6|D7NET6_9BACT Two-component system sensor histidine kinase/response regulator, hybrid (One-component system) OX=563008 OS=Prevotella oris C735. GN=HMPREF0665_02066 PE=4 SV=1 -ERNQSKIRFFTNVSHEIRTPLTVIIGLAESLLLQGIYRNSNLLRDLISELLDFKKHEQVDWSAFVGSICNEYANSNDVTLNLRVTDTDCFEATILIVED NDDIRQLLMTVLSPY-YRILTAVDGQEGLEVIRDEMPDIIISDVLMPNMSGIELCKIIKEDCHIPVVLLTARTAVEQELEGLKTGADDYITKPFNNELLI SRCNNLINSRRLLQRKF---GEHPHTEANMLASNPLDKEMLDRAMNIIDK-------------------------------------------------- ---- >tr|E4RWG2|E4RWG2_LEAB4 Histidine kinase OX=649349 OS=Leadbetterella byssophila (strain DSM 17132 / KACC 11308 / 4M15). GN= PE=4 SV=1 -ELAELKVNFFTNVSHELRTPLTLIMSPAEYLMAELLYKQANKLLVLVNQLMNFRKVESLDLLPLITEVYLIKADEQNIHYDLSISDEEPF---LLIVED NEDLRNYLVSLFHSH-FETHSAANGKEGMDKALQILPDIILSDVMMPVMNGLEFCERIKNNAHIPLILLTARAATLHELEGLESGADDYIVKPFNPKVLH TKILSLISNRKKAQEFF---HKQLIASPSETVIPDADRIFLQTAMQIIEE-------------------------------------------------- ---- >tr|F7M1N5|F7M1N5_9BACE Putative uncharacterized protein OX=457387 OS=Bacteroides sp. 1_1_30. GN=HMPREF0127_01369 PE=4 SV=1 -NMNQQKINFFTYISHDLKTPLTLILSPLQRLILEVIYRNANRMNYLINELLTFSKIEMGDIMHFLEELSHIVAGEREIDFIISVKDDESYRESIMIVED NKEMNDYLASIFGEK-YDIIRAYNGAEACKKIARQLPNLIISDLMMPVMDGLEFTERVKQDSHIPVILLTAKTDENDHTEGYLRGADAYITKPFNAKNLE LLVQNIQKSRKQNIEHF---KQAEELNIKQITNNPRDEVFMKELVELIMA-------------------------------------------------- ---- >tr|G5SNE4|G5SNE4_9BACT Response regulator receiver domain protein OX=762968 OS=Paraprevotella clara YIT 11840. GN=HMPREF9441_00873 PE=4 SV=1 -ELNQNKVEFFTEIAHEIRTPLTLINGPLEIIQLNVIATNTKRLLNLASQLLDFQKMGAVNISELLQETVNPTFTHQHKELKVSVCSEEPFGNVILIVED EESIRNFMKERLSPL-FIVETASNGKEALDIIRKEHIDLVISDVMMPEMNGYELCTAIKSDCHIPIIFLTAKNDIDSKVKGLKVGAEAYIEKPFSYDYLK AQILSLLNNRQKEREAF---SKRPFFPVQNMQMSKEDEEFMNKVIEVINA-------------------------------------------------- ---- >tr|F5J3C4|F5J3C4_9PORP Putative uncharacterized protein OX=742766 OS=Dysgonomonas gadei ATCC BAA-286. GN=HMPREF9455_03841 PE=4 SV=1 -EVYQSKISFFVNLIHEIRTPLSLIKLPLDKLTLTVINKNVNYLLDIVNQLLDFQKIENENINSLLLEIYTHSAELKNIDLSISVSDEDAFDFTVLLVED NIELLDMVADSLAPF-FSIMKSGNGKEALKILSENNVDLVVSDVMMPEMDGFELCKTIKSDSHIPVVLLTAKVTLDAKIEGMEYGADVYLEKPFSIKQLH KQIENLLKLRLSLQKII---TTSPASSAIDIAMPKKDKEFIERLHTEIEK-------------------------------------------------- ---- >tr|E2N716|E2N716_9BACE Putative uncharacterized protein OX=537012 OS=Bacteroides cellulosilyticus DSM 14838. GN=BACCELL_00058 PE=4 SV=1 -ELLKEKESFFESLGHDLITPLSLILAPANDMLLSIITKNAAFLSDLFSTILDFKRVEFIEIVSFCRIIVNYLASSKKIQLSVFIKDLNKFNSAILLVED NEQILKYLAGKLAEH-FNIVTATHGEEALKLVGEYLPEIVISDIMMPGMDGLTLCRHIKENADIFVILLTAKTSSEDELRGYKEGADIYIKKPFDTEALI NQIMNILNTRQKRKEQL---LKNLIAKDSDTIEFNSKEVFLQQAMKVIEE-------------------------------------------------- ---- >tr|E4RVE7|E4RVE7_LEAB4 Histidine kinase OX=649349 OS=Leadbetterella byssophila (strain DSM 17132 / KACC 11308 / 4M15). GN= PE=4 SV=1 -EVNQSKLRFFMNVSHEIRTPVTLIMAQLDMLLLQNIKKNAANLKKLITELLDFKKQEEEDLVKYLHTVFVDYANSLNIHLAIILSDSDRFLHKLFIAED NSDLQGVLSDLFSPM-YDVLTEGDGKAVFERVKAGKPALVLLDVMLPNVSGFEICKRIKSDKEIPVVLLTAAASGDKKIEGLQMGADDYVTKPFDARELV IRCNSLINNRQELKAGL---NKE-----KVYGSNELHELFIKEATEVVTR-------------------------------------------------- ---- >tr|A0YCM6|A0YCM6_9GAMM Adenylate/Guanylate Cyclase OX=247633 OS=marine gamma proteobacterium HTCC2143. GN=GP2143_08344 PE=4 SV=1 --------------RHELRNLTSAIIGYSEFVL-EEMEQT-GLLYDSLQTILALCHQLS----------AYDDR-------------IKTIGGTILIVDD QIESRDLLKRYLQLNKTTVLEASNGTSMFAVLTENEVDLILLDLILPEMDGDELLELLKQDRAIPVIVVSGNKETDRVIRCIEAGAEDYLFKPFNPILLQ ARISAGVERKRWHN----K---ELQYR---RELERNQDFIRSV----FGRYLSETRILENPEGLDLGGSQRKVTVLMADIRG------------------ ---- >tr|B8KFH0|B8KFH0_9GAMM Adenylate cyclase 1 (ATP pyrophosphate-lyase 1) OX=566466 OS=gamma proteobacterium NOR5-3. GN=NOR53_2381 PE=4 SV=1 --------------NHELRNLAGALRGYAEMLS-ESASNPGAALASVIARVLEGTGEPV----------LSGASASSE-TIEPI-PGLTGEPGFILAVDD REENRELLARYLTRSGHFVVTAPGGAEALEMLANADVDVVLLDRMMPGMDGREVLRRIKAERATPVIMISGEQDMQGIIECIEAGADDYLFKPFNPVLLQ ARIKAGIERKQWHD----R---EQLYR---DQLERNERFIRAT----FGRYLSDTEILERPEGLELGGDLREVTIMMSDIRG------------------ ---- >tr|A4ADE9|A4ADE9_9GAMM Adenylyl cyclase class-3/4/guanylyl cyclase OX=314285 OS=Congregibacter litoralis KT71. GN=KT71_18127 PE=4 SV=1 --------------THELRNLAGALRGYAEMLS-EVVAQPGAALKSAIDRVMEGTAEPA----------LSGTAADTG-AVQQI-AKITSDPGFILAVDD REENRALLARYLTRSGHFVVTAPSGEEALEMLANADVDVVLLDRMMPGMDGREVLRRIKAERATPVIMISGEQDMQGIIECIKAGADDYLFKPFNPVLLQ ARIKAGIERKQWHD----R---EQLYR---DQLERNERFIRAT----FGRYLSDTEILERPEGLELGGDLREVTIMMSDIRG------------------ ---- >tr|A0Z7L2|A0Z7L2_9GAMM Adenylate cyclase PLUS two component hybrid sensor and regulator OX=247639 OS=marine gamma proteobacterium HTCC2080. GN=MGP2080_14546 PE=4 SV=1 --------------RHDRRNIIGAIRGYSEMLL-EDSEVLPAAVRAHLLQILAAAKNEP----------KP-ASESAT-PTKSV-TLPSEEPGVILAVDD LPENRELVSRLLQKTGHTVISAESGEEALELLDTMGVDVVLLDLVMPGIGGAEVLKRLKEDRATPVVMISGQQDMDQIVMCIEAGADDYLLKPFNPVLLQ ARISAGIERKRWHD----R---EELYR---EQLERREQFIRAT----FGRYLSDDEILERPEGLELGGDLREVTIMMSDIRG------------------ ---- >tr|B7RSY3|B7RSY3_9GAMM Response regulator receiver domain protein OX=247634 OS=marine gamma proteobacterium HTCC2148. GN=GPB2148_2905 PE=4 SV=1 --------------RHDLMNVLSAIRGYGEMLR-EDLATEHAELDSALTRLLKGIQAAN----------SGGDTSEPA-SATQ--RTITTEPGFILAVDD LQENRELVARYLSRSGHIVVTAAGGEEALRSLDQADVDVVLLDLVMPDMDGREVLRRIKEHRATPVIIISGRQDMDGIIECIEAGADDYLFKPFNPVLLQ ARIKAGIERKRWHD----R---EQLYR---QQLERNEKFIRAT----FGRYLSDTDILERPEGLELGGDLRRVTIMMSDIRG------------------ ---- >tr|B8KXE5|B8KXE5_9GAMM Adenylate/guanylate cyclase OX=565045 OS=gamma proteobacterium NOR51-B. GN=NOR51B_2093 PE=4 SV=1 --------------NHDLLNVIGAIRGYAEMLN-EESADLHPGITTTLPAILSVVRSTV----------VAGASNTAK-P-------PSADPGVILAVDD MPENRELISRLLHRSGHTVITAESGEEALELLGTMAVDVVLLDLMMPGIGGAEVLRRLKDDRATPVVMISGRQDMQQIIGCIQAGADDYLLKPFNPVVLQ ARISAGIERKRWHD----R---EQQYR---RQLERNEAFIRAT----FGRYLSDDEILESPEGLELGGDLREVTIMMSDIRG------------------ ---- >tr|H3NWR0|H3NWR0_9GAMM Family 3 adenylate cyclase OX=745014 OS=gamma proteobacterium HIMB55. GN=OMB55_00014680 PE=4 SV=1 --------------RHDLLNAVAAVLGYAEMVL-ELDHDLPSSIRERVQGIADALSTST----------KTPEATTKS-GRAL----DTFEGCSVLVVDD LPENLDLMSRLLRKMHCSVITAESGEQALGLLASEPVDLVLLDLVMPNIDGREVLARIKQSRAIPVIMISGRQDMDQIVDCIQVGADDYLLKPVNSVLLN ARIKSGLERKRWHD----K---EEQYR---EQLEKREQFIRQT----FGRYISDEEILESPEGLKLGGDLKSVTLMMSDIRS------------------ ---- >tr|F3L3Z7|F3L3Z7_9GAMM Adenylate cyclase OX=876044 OS=gamma proteobacterium IMCC3088. GN=IMCC3088_2378 PE=4 SV=1 --------------RHDLLNHYTSACGYAEILL-EELGNTQEALQHGLTELVAAIRELS----------ATPEAVAKA-SPSYK-SDPRNTNGSILVVDD QNDNRAVLNRLLSRVGHTVYTADSGEAALEQLSTQAIDVVLLDLSMPGMGGKTALTKIKADRSIPVIVISGHQELDSVVECITAGADDYLFKPFNSVLLH ARIAAGLERKQWHD----R---EVNYL---HQLEQREKFIRAT----FGRYLTDADILEKPEGLRLGGDLKEVTILMADIRG------------------ ---- >tr|G7VY50|G7VY50_PAETH Two component transcriptional regulator, winged helix family protein OX=985665 OS=Paenibacillus terrae (strain HPL-003). GN= PE=4 SV=1 -------------------------------------------MPKQANQITNRFKAAE--MPKRVKGNML-PAPSAPALQT----VPCPTTRRIVLISS VPGVVHGLVRTLSDACFDVMVFHRWEPEVQ--RHLGNDLLIYDLTATDIRSELLIRKLEEQKDTPVMYVIQDRM---AVRAHLLPSEEVLVWPLASNHMV HDIERMIRRTVFKDIWIDRDKMLVYKEENQLNLTKTEYDLLLKLIDAQGKVISREQMMHDIWETDFVGGSNVVDVHVKSLRKKLGDRPAEPEYIATVRGV GYRL >tr|E5YNS2|E5YNS2_9BACL Putative two component transcriptional regulator, winged helix family protein OX=715225 OS=Paenibacillus vortex V453. GN=PVOR_00125 PE=4 SV=1 -------------------------------------------MPNQTEEMMEPGTVYP--FPEWKSGNLF-VRPTAAMAIG----DACPTTLRAALLSR SPGRVHEMFITLTDNCFEVMMFRKWEPRLQ--SALNTGLIIADMTGCRNMAAFNEKELLERSSIPILYLVGEEL---MSNAGSLLDEELMVWPARSKDMM YQVQRTIRGLVYKDLSIDRDKMTIHRGSTPIHLTKTEYHLLMLMLDSEGAVCTREDLMCKIWDTDFMGGSNVVDVHIKSLRKKLSDNAGAPRYIATVRGV GYRL >tr|G4HCH6|G4HCH6_9BACL Putative two component transcriptional regulator, winged helix family OX=743719 OS=Paenibacillus lactis 154. GN=PaelaDRAFT_1679 PE=4 SV=1 -------------------------------------------MSNQTEEIMEMATVYS--FPDWKTTSLY-QHPGAGMAIG----DACPTTMRAALLSR SPGRVHEMFMTLTENCFEVMMFRKWEPRLQ--TALNTGLIIADMTGCRDMAAFSERELLEHPSIPVLYLVGEEL---MSNAGSLMNEELMVWPARSKDMM YQVQRTIKGLVFKDLTIDRDKMTVHRGQAPIHLTKTEYMLLLLLMDSKGAVCTREELMSKIWDTDFMGGSNVVDVHIKSLRKKLSDNAGAPRYIATVRGV GYRL >tr|H6NEC3|H6NEC3_9BACL Transcriptional regulator domain-containing protein OX=1116391 OS=Paenibacillus mucilaginosus 3016. GN=PM3016_7329 PE=4 SV=1 ------------------------------------------E------TLTAPSRAGA--YESWTGSGMP-QTPGS---TS----A-CTAPRRVGLLSP APDRLYPLIMELYRGCCDLFLMHQADESLL--GTGPVDLLIVDETLSEEAGSASMASLRTGLDIPALILVRDRQ---HLTSIADLRKQQGVWFAVPQEAW TAVQSAV--LTFKDLCVDEKRMSVTRGQTSLSLTKTEYDLLRALLAAEGAVLSREELIHRIWSSDFYGGSNVVDAHMKSLRKKLGDSAAAPKYILTVRGA GYRL >tr|C6IXU0|C6IXU0_9BACL Transcriptional regulator domain-containing protein OX=621372 OS=Paenibacillus sp. oral taxon 786 str. D14. GN=POTG_00967 PE=4 SV=1 -------------------------------------------MQNGIS-----GNTVP--FR--RPSGVE-VAEPEGLELG----RACPVTQRVILISP FPSEVHELVRELSESCFDVLVFHHLEQGIR--NALAADLLIFDLTSYQENIDTAIRSVGETGGTPSLLLVRESM---LGYLDALRNQELLVWPARPVEIV YHAQRII---IFKDLWIDRKKMTVYRGGIKIELTKTEFELLIKLLEHEGTVLSREELLSEVWGTTFLGGSNVVDVHIKSLRKKLGDNASRPTYITTVRGV GYRL >tr|G4F145|G4F145_9GAMM Signal transduction histidine kinase OX=550984 OS=Halomonas sp. HAL1. GN=HAL1_00405 PE=4 SV=1 --ANQAKSDFLAMVSHEIRTPLNGVIGMSELLREHTIHDSANQLLAMINEILDFSKIEPTALKPLIDSVVVLRAKVKGLTLTVVLVDAAFSGTTLLLVED NAVNRKVAIGLLSRLGCDVVWAENGHDALAMAQSQQVQLIFMDIQLPDMDGLLVTQRLREQGGVPIVAMTAGGAEDDRQRCLAAGMNDYITKPLSLSVLS NVL----ALQLFATPDATPTEGPPLLNDDMLKASLGESSLIQLAMLYHQQVSDYTEQLAACLAAT----------------------------------- ---- >tr|H2FZV1|H2FZV1_OCESG Signal transduction histidine kinase OX=511062 OS=Oceanimonas sp. (strain GK1). GN= PE=4 SV=1 --ASKAKSEFLATMSHEIRTPMNAIIGLSGLLLEATIKSSADLLLHLINDILDYSKVEPMSLQAEVKGLQAMRDNADRVDFVCLYYDVDMAEASLLLVED NAVNQEVARALLEKMGLEVTVADGGEAALQLCREQRFDLVLMDIQMPGMDGMETVRQLRCQPELPIIAMTANVMPGERERCLAAGMDDYLSKPVNPEILR RML----AGYLAATSERNGQPAGELLDVESLFEGLGGDSMEHLFAMFFTRLDERCEALRQAAGQE----------------------------------- ---- >tr|F7SHW0|F7SHW0_9GAMM Signal transduction histidine kinase OX=999141 OS=Halomonas sp. TD01. GN=GME_00315 PE=4 SV=1 --ANHAKSDFLAMVSHEIRTPLNGVIGMSELLRERTIHDSANQLLAMINEILDFSKIEPTALKPLVDSVISLRAKAKGLQLTLVMIDASFAGISLLLVED NAVNRKVATGLLSRLGCDVVCAENGQDALDMVQAQQVHLIFMDIQLPDIDGITVTKRLRSQGGVPIVAMTAGGVEGVRQRCLAAGMSDYITKPLSLLTLS NVL----SLQLFTVSRVEQHELQPLINRETLAVSLGAESVDQLIMLYHQQVSDYVAQLAAYLRST----------------------------------- ---- >tr|G9EEM4|G9EEM4_9GAMM Hybrid signal transduction histidine kinase K OX=1072583 OS=Halomonas boliviensis LC1. GN=KUC_2807 PE=4 SV=1 --ANQAKSDFLAMVSHEIRTPLNGVIGMSELLREHTIHDSANQLLAMINEILDFSKIEPTELKPLVDSVVVLRAKKKGLALTVVLVDAGFSGTTLLLVED NAVNRKVAMGLLARLGCDIVWAESGHDALSMAQSQQVHLIFMDIQLPDMDGLTVTQRLREQGGVPIVAMTAGGAEDDRQRCLAAGMDDYITKPLSLSALS NVL----ALQLFATPDATLTGEQSLLNGDMLKTTLGEASMIQLIMLYHQQVSDYSAQLASCLAAT----------------------------------- ---- >tr|H0IYI6|H0IYI6_9GAMM Signal transduction histidine kinase OX=1118153 OS=Halomonas sp. GFAJ-1. GN=MOY_02124 PE=4 SV=1 --ANHAKSDFLAMVSHEIRTPLNGVIGMSELLRESTIHDSANQLLAMINEILDFSKIEPTAIKPLVDSVMALRAQAKGLQLVLVMIDAGFTGTSLLLVED NAVNRKVATGLLSRLGCDVVCAENGQDALEMIKSQQVHLIFMDIQLPDMDGLTVTRKLRAVGGVPIVAMTAGGVEDDRSRCLSAGMNDYITKPLSLLSLS NVL----SLQLFPSSQPTPSEGQTVLNSAMLLGSLGQESLAQLIMLYHQQVDDYAAQLGAYLSSP----------------------------------- ---- >tr|G9E7J0|G9E7J0_9GAMM Hybrid signal transduction histidine kinase K OX=1072583 OS=Halomonas boliviensis LC1. GN=KUC_0309 PE=4 SV=1 --ASLAKSEFMAVMSHEIRTPLNGVVGMADLLSEAALKRSAESLRAVINDILDYTKIEPFDLHQCIDQLCESRETKAKVTFSYVMGDIALTTKHILVVED NPLNQTVARVMLERLGQQVTIAKNGLEALDLLQHSRVDLVLMDMQMPKLDGTETTQRWRDYEALPIVSMTANVMPEHRERCMQSGMDDMIHKPFTRDELN LVI----CRY-L-------------LDKRELKSTFEPRALGALLSTFLTRLGERSARLNAYWQSE----------------------------------- ---- >tr|Q1QU44|Q1QU44_CHRSD Multi-sensor hybrid histidine kinase OX=290398 OS=13768). GN= PE=4 SV=1 --ANQAKSDFLATVSHEIRTPLNGVLGMTELLREETIHESGSQLLTLISDLLDFSKIETFSLQALIDSLLRLRAQH--VELISSLGDAGTCPAPLLVVED NPVNRQVAVAMLERLGQTVTVVDSGEAALELTADTPFALIFLDIRMPGFDGPETARRLRQQTGTPLIAMTASVTTLEHQRALDAGMVEVLTKPIRQQALR DVL----IRYPMPAAPATDAGEPPLLDNQMLGETLGKAQRSALVAAWQHQARLLEDALEAAIAQS----------------------------------- ---- >tr|E1V8C8|E1V8C8_HALED Signal transduction histidine kinase OX=768066 OS=2198 / 1H9). GN= PE=4 SV=1 --ASQAKSEFLATVSHEIRTPLNGVIGMSELLSEDTIHDSAQRLLELINDILDFSKIESLSLSELVNGALSLHAEAKGIHLVAIVSDPGLSGARLLVVED NPVNQQVASAMLTKLGCRVSVACSGREALERVETECFDLIFMDVQMPDMDGLEVTRRLRERGDVPIVAMTAGGPGGDQARCLAAGMNGYLVKPLFQDVLQ AIL----RRHLQRGADEAVVGPETLLDGEALKASLSSDELAALVERYRGQAREHLAALNDAVAHG----------------------------------- ---- >tr|Q1QYN9|Q1QYN9_CHRSD Hpt sensor hybrid histidine kinase OX=290398 OS=13768). GN= PE=4 SV=1 --ASQSKSEFMAIMSHEVRTPLNGVVGMLDVLEDATLRESTASLRAVINDVLDYSKIEPFPLTRMVSRLAEGRQTSD-VSLIVVIGDATLASRHLLVVED NPINRDLVAALLKRLGQTFDLAEDGEQGLAAMKCKDYDLVLMDMQMPCMDGVETTRRFRAEEGLPIVAMTANVMPEHRQACREAGMDDILSKPFTRHELA NVL----RAY-Q-------------LDADELEQSLEHETLKTLLAAFFSRLDGRVEVLREAVDGS----------------------------------- ---- >tr|E8PCA8|E8PCA8_ACIB1 Sensor protein OX=696749 OS=Acinetobacter baumannii (strain 1656-2). GN= PE=4 SV=1 -RANDAKSRYVIGISHELRTPLNSILGYSQLLQKQVISRSGQHLTSLIDGLLDLARIEDVHFPNFIEQIIQPQFEQKNLQFVYEITDKKRVRKRILVVDN EAVDRGLVANFLKPLGFMIEEAESGIDCLRRVPIFQPNLILMDLNMPLMGGWETARLLRQNNITPILIISANAGEREVNPQDAVLSEDFMLKPIDLNLLL SKIGDK-----LGLVWIDSKSETLLENNEVQAIYKQEQVAVHTLIQQPENN------------------------------------------------- ---- >tr|I4ZRK6|I4ZRK6_9GAMM Signal transduction histidine kinase OX=1173062 OS=Acinetobacter sp. HA. GN= PE=4 SV=1 -KANEAKSRYVIGISHELRTPLNSILGYSQLLQKQVISRSGQHLTSLIDGLLDLARIENIDLPKFLKQIVHPQFAQKQLNFDVEIIDKKRLRRKILVVDN EATDRGLVLNFLKPLGFILEEAESGLACLRQVPEFNPDLILMDLSMPLLGGWETAKLLRQNKITPIIIISADANERMINPETDVLNEDFLVKPVDLNLLL NKIGDK-----LGLSWLHQHEQQPEQEQP-----SDQKVQISTVVVESRNE------------------------------------------------- ---- >tr|K2QQ64|K2QQ64_9BURK Sensor histidine kinase/response regulator protein OX=864073 OS=Herbaspirillum frisingense GSF30. GN=HFRIS_00519 PE=4 SV=1 -SANLAKSRFVAGMSHELRTPLNSILGYAQILHVDIIRRSGEHLLSLIDGLLDIAKIEELALEAFLEQIVGPQAEQKGLQFRFEKADPKRVSRHILVVDD QASQRRLLKDMLAPLGFAVSEADSGLACMERLAHEVPDLILLDIAMPQMDGWSVARAIRARGLTPILLVSANAFENLHERNGAPLVNDFLVKPVSYNELL GKIRQH-----LQLDWVAPGTASASATTP-----PTTATTAKPLMLVPSQD------------------------------------------------- ---- >tr|D6Z1L0|D6Z1L0_DESAT Response regulator receiver modulated metal dependent phosphohydrolase OX=589865 OS=Desulfurivibrio alkaliphilus (strain DSM 19089 / UNIQEM U267 / AHT2). GN= PE=4 SV=1 ----------VATARHDLRNPLNNILGYTEMLLEDHENPEVPDFTGQLQEFYVLSRLVLQEVNKLQQGVTAPTPKRTEAK-ALQAIDDRGEKPLILVVDD ITANREVLSRRLRRLGYLVMEAENGRQALAMLETGAFDLVLLDILMPELDGYEVLRRIKANRHVPVLMISAVGDINSVVRCLEAGADDYLAKPFNSVLLR ARVNASLVKKRLYDQELHYK--------QEI---EQHNQLLEQRVRQQVQEISRAQL------------------------------------------- ---- >tr|F9DTZ3|F9DTZ3_9BACL Diguanylate cyclase OX=1027292 OS=Sporosarcina newyorkensis 2681. GN=HMPREF9372_2274 PE=4 SV=1 ----------------ELYYFLHKLKGTSGTIELQQLSLFCASQLEILSSSN-DQKIPVSSLENLKNRIRS-HF---DGSEQLP-DNQLDQETFILIIDD DLEFVSYLKELLEKMGAQVIISLNGKRGIEQFYSMRPSIILVDTKLPDMSGFEVLDQIARQKNTMVALTSEQATKENEIESYRRGAMDFIPKPFDMDIFF PYLFNRQQRQHAISSSIITDSLTGIGN---RRYFDEVINNFAKKADQSDTTFSLVMV------------------------------------------- ---- >tr|Q2BA82|Q2BA82_9BACI Putative uncharacterized protein OX=313627 OS=Bacillus sp. NRRL B-14911. GN=B14911_14727 PE=4 SV=1 ----------------DVYRFLHSIKGTAGTLQLVGLHQVAGKLMDQVEKSS-EKIWGNRELRDFLYELMGLSY-EYEHFQSLP-RD--ENMPLIQVIDD DVSMLILLKDALESKGWMVMANTEPEKAVDQFFDMNPDCLIIDVNLPGKSGFHVLEDIQNKKFIPKLMISIMNDRETRIKAYRLGADDFISKPIDLEEFL IKVERHLDRKQIFDQSVLIDELTQVYN---RRFLKDSLKRYLKELERSNQYFSIAVL------------------------------------------- ---- >tr|I0JQH0|I0JQH0_HALH3 Diguanylate cyclase domain protein / two-component response regulator OX=866895 OS=NBRC 102448/ NCIMB 2269) (Sporosarcina halophila). GN= PE=4 SV=1 ----------------EIYKFLHSVKGTAASIQLPELSTEAEKVMDHVSPAS-TASWQEGSWSELLQPIFLAAF-NKELDLELP-VQKEEKQPTILILEN NLDFLHHFRNNMESHGYNILIAATEERALNLFYDETPDLVVIDFYLENKNGLEILEEISTSILTPVIIISEDGSPELAKKVYETSALDFIPKPIDFEVFT TLIRNRLLHVRSLKKQITTDKLTKAYN---RTYLQDLLARKKRTFHRNEQTFCLAIL------------------------------------------- ---- >tr|F2F133|F2F133_SOLSS Response regulator containing a CheY-like receiver domain and a GGDEF domain OX=1002809 OS=Solibacillus silvestris (strain StLB046) (Bacillus silvestris). GN= PE=4 SV=1 ----------------EIYTFLHNLKGTAGSIGMEGLSEIAATKLEQLEEKS-SKRWEKENWKAFLFQIEESFF-EKNVKRLPF-IKRSKQNNFILVIDD DIVFASYIKNILESKGFMVIVAHNGKRGLELIYELNPALVFLDIKLPDINGFSILENINKSNHMFVTIMSVDDSKGNRAKAYDLGALDFIKKPLDADILI SYVRNRLVFKQALELSIMTDELTQLYN---RKYMNTQLEFLMEQFEKRGEQFSIAIA------------------------------------------- ---- >tr|K9ANF9|K9ANF9_9BACI Response regulator receiver modulated diguanylate cyclase OX=1231627 OS=Lysinibacillus fusiformis ZB2. GN=C518_1962 PE=4 SV=1 ----------------ELYSFLHTLKGTAGSIGLHELTSIAREKLELLNEDS-NKQWTKSEWKKYLAPFIEHFY-QENNSMTTI-IKNPAKQDFILIIDD DVVFISYLKNVLEKKGYSVIAAHNGKRGLALIYELQPSIVFLDIMLPDSNGFSILDNIKKKELMFVTLISSNDSKENRVRTLEMGAMDFMAKPIDEELLV AYVSNRLAYKRELEQAIVIDELTQIYN---RKFMESQMKLFIKHHQQNKEHFSIAMI------------------------------------------- ---- >tr|D7D0W3|D7D0W3_GEOSC Response regulator receiver modulated diguanylate cyclase OX=691437 OS=Geobacillus sp. (strain C56-T3). GN= PE=4 SV=1 ----------------ELARFFNAVGRTAAVIGRSDIAAEAERLIHRLNGQT-KRQWTAEEALMEMVPLLRCFY-EAGEAATIP-SHRQGPKAAILLCGN DPLFFAYVRGALQTVPWCFTTTPSLEQAAASMFRLSPDCIIVSVKEGEWENPDLTVLLEQRPYLPVVIVRRDEGKEGRLKGYELGADDVITTP-AADELF VRVRRLIEKKRKIDDLVLIDELTGVYN---RKYLPRVYARLRSDLERYGAPSCLALL------------------------------------------- ---- >tr|D6XY81|D6XY81_BACIE Response regulator receiver modulated diguanylate cyclase OX=439292 OS=Bacillus selenitireducens (strain ATCC 700615 / DSM 15326 / MLS10). GN= PE=4 SV=1 ----------------DVFLFFHKLKGTAATVGLTDWEDHLQEGLSTLSRSS-ERSLSDEELNRWLEPFRQLQS-GSDSEELTD-REHDDDSLLILMIQE ETQMAEQSRKEIESYGIQVITAFSVQKGVELLYRFSPSTIVADFETIEHEGTTASKRLFSQDFIPVVYLTNQSEDRDKIKAYEEGASDYLVRPVAIDVFI AILANRMMFRSRVQKALTLDELTGALN---RGMLNNRIQDLVRGNRRKGESWSFVML------------------------------------------- ---- >tr|D3FZ83|D3FZ83_BACPE Signal transduction diguanylate cyclase (HPT-REC-GGDEF-REC domains) OX=398511 OS=Bacillus pseudofirmus (strain OF4). GN= PE=4 SV=1 ----------------QIYRFLHSVNGTAASIGLPDYSQAASEKLTELNKEL-KDKWSEEVMTGYFSPFIKTEQ-AEGLRELET-IEEQDAKAVVLMIDE DLDFLSGMKSGLEQRGFIVLLATSPEKGLEMFLNERPNCVIVDYVRPYERGYESAQSLSTSEFIPVLMISSSMDKHIELNAYKSGVADFIKKPVNSEELE VRMNNRLRYQKVIQQTIMTDDLTGVYT---RRYLSTEGKKQLSQALRSKQTLTAAMF------------------------------------------- ---- >tr|C0Z5E7|C0Z5E7_BREBN Putative uncharacterized protein OX=358681 OS=Brevibacillus brevis (strain 47 / JCM 6285 / NBRC 100599). GN= PE=4 SV=1 ----------------ELYRFLHSLKGTSGTIGLTDLSDLSQILLDRMEDMP-AKDWSLGEWRLFLHELISLCY-EQRPEQTLQ-RESPEQQPLVLILDD DVTLLMYLKEYLENHNWSVIATVYPHMALDYFHDMNPDCFILDLNIPETGGFQVIQTISKKQYVPTTIISIDCGRETRLNAYRLGADDVMCKPLDMEELV VRLERQLRRKRWMNSILFLDELTGVNN---RNSFVDTYQRLLSDAQRTNTPFSLAFL------------------------------------------- ---- >tr|L5MS54|L5MS54_9BACL Uncharacterized protein OX=1246477 OS=Brevibacillus agri BAB-2500. GN=D478_18116 PE=4 SV=1 ----------------ELYRFVHSLRGTAGTIGLADLSELAQTLLEQLDQQD-AKPWPTHEWRDFLQELIALCY-EHRPEQTLP-QESTDNQPLLLILDD DVTLLMYLKEYLEKQNWSVIATVYPHKALDYFHDLNPDCLILDLNIPETGGFQVMQTLSKKQYVPTTIISIDCDRETRLRAYRLGADDVMCKPLDLEELT VRLERQLRRKRWMDKILFLDELTGVYN---RNCLADTYQKLLADSMRSHAPFSLAFL------------------------------------------- ---- >tr|A6CTQ4|A6CTQ4_9BACI Putative uncharacterized protein OX=161544 OS=Bacillus sp. SG-1. GN=BSG1_15905 PE=4 SV=1 ----------------DVKRFFHSISGTAPTIELDEIGRQASESMKAVDHRG-DDSWNRKNIQEIISPLLKECY-QFEYENPIR-ADSNQRRKIVFLVEN DTTFLMYIKDFLERKGFHVFGFTDPDKAISALYDVKPDCILLDIFLGDQNGLDTLSHITQKQYVPVVMMSENNSDSLRMESYKKGADDFLLKPFILEELF IRVNRQIERKEFVDHLILVDELTRLYN---RKYLHSSFSQLCAKQWEEEQELSIAFL------------------------------------------- ---- >tr|E8SS64|E8SS64_GEOS2 Response regulator receiver modulated diguanylate cyclase OX=550542 OS=Geobacillus sp. (strain Y412MC52). GN= PE=4 SV=1 ----------------DVYRFFHNIAGTAAVIGMEELGEKARRLMKRLEEER-NVLWTPEALKEHVYELMHWYY-DETYRDSLS-KAAVEEAPLVLMIDD DPLFLMFMKEYMEKTGWHIVTVAQPEKAVAQFYEVKPDCIVIDVHMNGTDGLIVLKELKGPQFVPTVIISADDREEVRLQSYALGADDFIVKPFSLSEFL IRVNRLVERKRQLEALLLVDELTRLYN---RKYLPEAYRQFESERERHGDPYCIALL------------------------------------------- ---- >tr|B7GIE3|B7GIE3_ANOFW Signal transduction diguanylate cyclase (HPT-REC-GGDEF-REC domains) OX=491915 OS=Anoxybacillus flavithermus (strain DSM 21510 / WK1). GN= PE=4 SV=1 ----------------DVYRLLHSIAGTAAMIGMTDIGNYARQLMEQWGEDE-EKKWKSSDVKERLSPLFQLCY-EQQLGDSEQ-GKKQGDEPTILLIDD DPSFLMYVKEQLEQNGWYVVAIADPVKAVASFYDVRPDCVVIDIHMGKKTGFEVLTFLKKQQFVPMIMVSIDDRKETRMKSYQMGADDFIPKPFDIDEFI VRIYRQLERKQLIDELLLLDELTHVYN---RKYLKQAYEQLKSDWHRTHEPFCLAVL------------------------------------------- ---- >tr|E7RDI8|E7RDI8_9BACL Response regulator diguanylate cyclase/phosphodiesterase OX=933115 OS=Planococcus donghaensis MPA1U2. GN=GPDM_02580 PE=4 SV=1 ----------------EMYRFLHTMKGTAGTIGLMELSQFCATQLELFAEDS-DTLLPVDSLQSLMVALRN-HF-KEENSSNLQ-KEIPTNDVFVLVIDS DAELASYIKESLEEHGIQVVIALDGKKGMELFYTLQPQMVILDLQLPDVDGFELISRIYKNQYVPLAIVSSDDRIENQIKAMEIGATDFLSKPLNMALFV PYVLNRLRLQKMILQETLYDELTGAGN---RKYFNDVLSQMTTLSEKSKKTFTLVLF------------------------------------------- ---- >tr|G2RTM0|G2RTM0_BACME Signal transduction diguanylate cyclase (HPT-REC-GGDEF-REC domains) OX=1006007 OS=Bacillus megaterium WSH-002. GN=BMWSH_4307 PE=4 SV=1 ----------------EVYRFLHSISGTSATIGLHYLGDRSRELMEAVEKKQ-KKTWNYAELNTFLLDIIKICY-QDEIQSNKP-LEISEQTANILLVDD DLSMLMYLKEQFEKQGWYVLATASKEKAITAFYELKPDCFVVDVHMKDGSGFDILSFLRKQLFVPIVMMSVDNQREARLKAFKLGADDFLVKPLDMEESL LRIGHHIERKQMFNSYLMFDELTGAYN---RTYLKEIYSQQLSSFDRLKDPFCLAIL------------------------------------------- ---- >tr|E0RNS3|E0RNS3_SPITD Putative histidine kinase OX=665571 OS=Spirochaeta thermophila (strain ATCC 49972 / DSM 6192 / RI 19.B1). GN= PE=4 SV=1 -KANRAKSEFLANVSHEMRTPLHAIMGFAEALKGQLLLSEAQRLKVLIDEILDVEKMEPFNLHEVVCRVMEENARRKGLSFSFEIGDPFRLQQRILVVED YMPNQKIVSLVLSEAGAEVEVAVNGREALEKVAQGRYDLVLMDVHMPVMDGLEATRRIRALGDLPIVGLTADAYREDVRRCREAGMDGVLIKPIRRKEMV EEVRRLLSGR--GGSEGEEDAREGIFVEEFGDLRERAFEILEGFLDEAERQWER---------------------------------------------- ---- >tr|A3UMT7|A3UMT7_VIBSP Putative sensor histidine kinase OX=314291 OS=Vibrio splendidus 12B01. GN=V12B01_12405 PE=4 SV=1 -KANEAKSLFLATMSHELRTPMNGVLGIAQIIKEQIIIDSGQHLVTILNDILDFSKVEPFSVADVLDKTLTPLAEDKGLAFTIKIGDSALDDFSVLLVED NKINAMVIRKFCESINLTVENAYDGLQALDKLVSNQYDLIIMDNHMPNMSGIEAIQKIRNELKLTTVVFTADVFKEAHDEFIMSGADFVLTKPLQKNSLQ NAINEFHDQFEGN---LTGASNITMHPQNKLPLTEEELSRSQFLANDELKSDEKLKCLTSL--------------------------------------- ---- >tr|F9RT86|F9RT86_9VIBR Putative sensor histidine kinase OX=870967 OS=Vibrio scophthalmi LMG 19158. GN= PE=4 SV=1 -NAAKAKSQFLATMSHELRTPMNGVLGIAQIIAGGIILESGQHLMTVLNDILDFSKIEHFHLNQIVVSAITPLAEEKNLTLEINIGDSATRPLHILLAED NRVNAIVAKGFCNKLGHRVDIAENGQIAIDKATKESYDLILMDNHMPEMNGIEATHYLRHVLGIKTLIFTADVFREAHDDFINAGADHVLTKPLQNESFY DAISNFSHRIQTQNTPALPGTNVIHENVQDLRLTEEELSKSELLNEVKQDIDIYQSLLQSL--------------------------------------- ---- >tr|D0X6R7|D0X6R7_VIBHA Putative uncharacterized protein OX=673519 OS=Vibrio harveyi 1DA3. GN=VME_07800 PE=4 SV=1 -RAAKAKSQFLATMSHELRTPMNGVLGISQIIASQVILDSGQHLMTILNDILDFSKVEPFNLPQVVCSAIQPLIEEKNIKLYVENGDCAKRNLHILLAED NRVNAIVAKGFCEKLGHTVEIAENGLIATKKAQENQYDLILMDNHMPEMNGVEATRFIREKLGVNTLLFTADVFREAHDNFIEAGADHVLTKPLQRESFA DALKQFASRLPTQGDDAPQADNIVQEPIEKLRLTEEELTHSETVEILKQEPEALVDLLESI--------------------------------------- ---- >tr|C9NY32|C9NY32_9VIBR Sensor histidine kinase OX=675814 OS=Vibrio coralliilyticus ATCC BAA-450. GN=VIC_004356 PE=4 SV=1 -KANEAKSIFLATMSHELRTPMNGVLGIAQIIKDQILIDSGQHLMTILNDILDFSKVEPFSIGELLEKTLGPQAANKGLKFVIQHGDPALDNFTVLLVED NRVNAMVARKFCESMGLLVDHANDGLEAISRLKHSNYDLIIMDNHMPNMNGLDAIEHIRTKLRLSTVIFTADVFKEAHDEFIARGADFVLTKPLQKSSLQ NALSQFSRQLSTH---PQANNNISTFPVDKLPLTEEEVSQSPLINQLELQDREKVDLLESL--------------------------------------- ---- >tr|Q87H65|Q87H65_VIBPA Putative sensor histidine kinase OX=223926 OS=Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633). GN= PE=4 SV=1 -RAAKAKSSFLAAMSHELRTPMNGVLGISQLIAEKVILDSGQHLMTILNDILDFSKVEPFHLEQVVCSAIQPLIDEKSIDLIVETGDCATQNLRILLAED NRVNALVAKGFCEKLGHAVDVAENGLVAVEKARDNDYDLILMDNHMPEMNGVEATRFIREKLGVKTLLFTADVFREAHDHFIAAGADHVLTKPLQRESFA DALKQFSARLKVKQTEEVPVSNVLQKPIENLRLTEEELSNSEMLASLKEHPNELLDLLNSI--------------------------------------- ---- >tr|A7JY83|A7JY83_VIBSE Sensor histidine kinase OX=150340 OS=Vibrio sp. (strain Ex25). GN= PE=4 SV=1 -RAAKAKSSFLAAMSHELRTPMNGVLGLSQMIAEKVILDSGQHLMTILNDILDFSKVEKFNLDQVVCSAIKPLIDEKNIELIVETGDCATHNLQILLAED NRVNAIVAKGFCEKLGHEVDVAENGLVAVKKARDNEYDLILMDNHMPEINGVEATRVIRQELGIKTLLFTADVFREAHDSFIQAGADHVLTKPLQQESFS DALKQFATRLEPKENLPLNDNNVLQKPIEKLRLTEEELSSSEMLISLQEAPEAIDELLKSI--------------------------------------- ---- >tr|Q1VBC9|Q1VBC9_VIBAL Putative sensor histidine kinase OX=314288 OS=Vibrio alginolyticus 12G01. GN=V12G01_01090 PE=4 SV=1 -RAAKAKSNFLAAMSHELRTPMNGVLGLSQMIAERVILDSGQHLMTILNDILDFSKVEKFKLDQVVCSAIKPLVDEKNIELILETGDCASHHFKILLADD NRVNAIVAKGFCEKLGHEVDVAENGLIAVKKAKENEYDLILMDNHMPEMNGVEATRVIRQELGIKTLLFTADVFREAHDSFIEAGADHVLTKPLQHESFS DALKQFAARLEPKQNPPLSDSNVLQKPIENLRLTEEELSRSGMFTSLRDEPEALDELLKSI--------------------------------------- ---- >tr|F9RXU1|F9RXU1_9VIBR Putative sensor histidine kinase OX=870968 OS=Vibrio ichthyoenteri ATCC 700023. GN= PE=4 SV=1 -NAAKAKSQFLATMSHELRTPMNGVLGIAQIIAGNIILASGQHLMTVLNDILDFSKVESFHLDQVVISAITPLAEEKNLHLEVDIGDSALRPLKILLAED NRVNAIVAKGFCSKLGHQVDIAENGRIATQKAANKHYDLILMDNHMPEMNGVEATHYLRHTLGLKTLIFTADVFREAHDDFIAAGADHVLTKPLQNESFY DAISSFNHRIEQQQPVTAPDDKVVHEDVRDLRLTEEELSNSELLNEVKQDVDTYHSLVQSI--------------------------------------- ---- >tr|F9T8W9|F9T8W9_9VIBR Putative sensor histidine kinase OX=1051646 OS=Vibrio tubiashii ATCC 19109. GN= PE=4 SV=1 -KANHAKSIFLATMSHELRTPMSGVLGVAQMIRDDIIINSGNHLVTLLNDILDFSKVEPFSLRELLESTLLPLAQKKNITLTIPVGDVALHKKRILLVED NRVNAVVAKGFLKDYAQDITWAEDGLQALEVLENEQFDLIVIDNHMPNLSGVETIKRIRETLKLDTVIFTADVFKEAHDSLIDAGANFVLTKPLQKPSLE LALKQFSREIMAS----EEGSTVVPHPASQLALTEEELSASSSFNDPSLSRVTKLELLIGL--------------------------------------- ---- >tr|A5KWE1|A5KWE1_9GAMM Putative sensor histidine kinase OX=391574 OS=Vibrionales bacterium SWAT-3. GN=VSWAT3_03111 PE=4 SV=1 -KASEAKSLFLATMSHELRTPMNGVLGIAQIIKEQIIIDSGQHLVTILNDILDFSKVEPFSVTDVLDKTLTPLATDKGISFVIKVGDSALNDFTVLLVED NKINAMVIKKFCESINMTVENAYDGLQALDKLATNQYDLIIMDNHMPNMSGIEAIQKIRNELKLTTVIFTADVFKEAHDEFLVSGANFVLTKPLQKNSLQ NAINEFHHQFEIN---TESNSNVTMPPKNKLPLTEEEISRSPLLEGDKLEGNEKLKCLNNL--------------------------------------- ---- >tr|K9PNT6|K9PNT6_9CYAN Integral membrane sensor hybrid histidine kinase OX=99598 OS=Calothrix sp. PCC 7507. GN=Cal7507_4243 PE=4 SV=1 ERANHAKSEFLANMSHELRTPLNVILGFAQVMGLAIINRAGEHLLNLINDILEMSKIESFDLLRLLASLKEMRAAAQHLQLVFEVLDPAQPEYRILVVDD ATDSRLVLVKLLTSIGFAVREAVNGQEAIAQWLEWQPHLIFMDMRMPVMDGYEATRVIKARPACPIIALTASAFEEERQKILSTGCDDFIRKPFAQNLLL EKVSEHLGV-----KYISQETANITVASQQTQVLPSEAELLRHLSQMPPEW------------------------------------------------- ---- >tr|B3E6L4|B3E6L4_GEOLS Integral membrane sensor hybrid histidine kinase OX=398767 OS=Geobacter lovleyi (strain ATCC BAA-1151 / DSM 17278 / SZ). GN= PE=4 SV=1 -SANRAKSDFLSSMSHELRTPLNAILGYAQILRLDIMRNSGEHLLTLINDILDVGKIEVFDLPALIAQVFNLQAEEKELQFHYEVADPDGPRKRILVVDD TVGNTALLVSLLEPLGFDLDTAQNGQEALLQASEQRPDLVLLDLVMPEVDGLEAARLLRQDASTKIIGASATVTDSNHKEAFVNACDDFVTKPIRIDLLL EKISGLLGI-EWETAVVTTDDRESWKDDEPVVPPSAELEVLHELAMM------GDMLEIEAWATALEAQDTTYRCFAERLREL----------------- ---- >tr|I5AUN4|I5AUN4_EUBCE Signal transduction histidine kinase OX=633697 OS=Eubacterium cellulosolvens 6. GN=EubceDRAFT1_1729 PE=4 SV=1 -AANQAKSHFLANMSHEIRTPINAVLGMDEMILAESIKTAGNTLLGLINDILDFSKIEEYDLSSVINDLVSSRAEKKGLQIILELTDPNAPDANVLVVDD TPMNLEVFRSLLKRTGVIIDTAESGDECLDFTAQKKYDLIFLDHMMPKKDGIQTLQELRSQPGTPAVCLTANAVSGAREQYLAAGFDDYLSKPIDAGRLE EMMMRYLPEDKAGTAVIPEWLHAVEIDGGLLHCGSSYLDTLKIYAKNA--PDSADEI-EGLWNAG-DLSNTTVKIHIKSLSRAIGAE------------- ---- >tr|F8EYV1|F8EYV1_SPICH Two component transcriptional regulator, LuxR family OX=744872 OS=Spirochaeta caldaria (strain ATCC 51460 / DSM 7334 / H1). GN= PE=4 SV=1 -ELDETRSRFLAAASHELRTPVSLIITPIDAILFSLIRRNCERLKHLAESLLQVLKIDQVDLEDFVQNYVAAEASKKHIRLESRLEDPSPKAQSILIVED DEDMRTFLQENLSRA-LTVHTARSGQEGLNKIHALVPDLIISDVMMHPMDGLAFREQLRSIPAIPFLFISANPEPEVRSRALGTGAVDFIQKPFYIDDLT SKLFALMALRDCKGLVLDPSSKTTSLEDSYINLTERDREVLELLLQ----GLSDKEIAVKL-----NASVRTISNRVSSLLKRTGQPSR----------- ---- >tr|D2TZS7|D2TZS7_9ENTR Two-component system, NarL family, sensor histidine kinase OX=638 OS=Arsenophonus nasoniae (son-killer infecting Nasonia vitripennis). GN= PE=4 SV=1 -RASNEKSAFLAHISHEIRTALYVIIGILELEVLHTASRAANSLLGVIGDVLDFTRIEPVALYALLEQCAEPIAQDKGLGFTLELTDEQPESLNILLVDD LPANLQVLSLQLAASKHWVTLAEGAEQALTLMEENYFDMVLTDCQMPGMNGYELTRVLRTSRNLPILGCTANAFSSELSLCLAAGMDGVLVKPLTQDHLL SEIARYYQLVNQNELSFDGIS--ALASG----DKQKEYQLLQAILEGIEQDLATL---HNL--RSTAIDEEALSLHVHRMKGVF---------------- ---- >tr|A5EP63|A5EP63_BRASB Putative two-component hybrid sensor histidine kinase and response regulator OX=288000 OS=Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182). GN= PE=4 SV=1 --SNVAKSRSLAAACHDLRQPLQNLVLLQELLAVERLDQALGNMSGMLNGLLDLNRIEVFPICALLHRIISDQAEARQIALEVSIWDTSSSSPIVYLVNH DRELRRSIRNTLDHDGWVVEDHENGASFLQAYKPGCDACLLIDTDGPGMSGSTVLSRLQDLHSIPVIMMARHGDVATTVAAMKAGAVDVIEKPIHSSELL DGVARVMRLVRMSDSTA---GRSDGSADRIPCFTRRQQEVMQRVLA----GHPSKNIAVDL-----GISRRTVESHRASIMKKAGVRSL----------- ----