>seq_1 GVVPQYGGGGNHGGGGNNSGPNSELNIYQYGGGNSALALQTDARNSDLTITQHGGGNGADVGQGSDDSSIDLTQRGFGNSATLDQWNGKNSEMTVKQFGG GNGAAVDQTASNSSVNVTQVGFGNNATAHQY >seq_2 GVVPQWGGGGGHNNGGNNSGPESTLSIFQYGSANSALALQSDARKSDLSIKQYGHGNGADVGQGADNSGIDLTQNGYRNSATIDQWNAKNSDIVVSQFGG RNGALVNQTASDSQVSVTQVGFGNNATANQY >seq_4 GTVPQFGGGGGHNGNGNNNGPNSELNIYQYGGGNSAVALQTDAKNSDLTITQHGGGNGADVGQGSDDSSIDLLQRGFGNSATLDQWNSKDSIMKVKQYGG GNGAAVDQTASNSQVNVTQVGFGNNATAHQY >seq_5 GVVPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSDARKSETTITQSGYGNGADVGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGG NNAALVNQTASDSSVMVRQVGFGNNAPANQY >seq_6 GSVPQWGGGGNHGGGGSSTGPESTLSIYQSGVNNAALALQSDARKSETTIRQDGFGNGADVGQGADNSTIELTQSGFRNNATIDQWNGKNSDISVSQYGG NNAALVNQTASDSSVLVSQVGFGNNATANQY >seq_8 GAVPQFGGGHGGGGGGNN-GPDSTLSIYQYGGGNSALALQTDARDSELTITQHGGGNGADVGQGSDDSSIDLLQKGFGNSATIDQWNSKDSVINVKQFGG GNGAAVDQTASGSTVTVHQVGFGNNATAHQY >seq_10 GLINQ--GGWGD-HGGGYGGPNSTMNIYQSGGGNSAVALQSDARNSTMNISQTGGGNGADVGQGSDDSTISLTQNGFGNSATLDQWNSKDSTMTVSQYGG LNGASVDQTASNSSVSVTQVGIGNHVSAHQY >seq_11 GAIPQYGHGG--GWGGGNSGPNSTLSIYQTGGGNSAVALQSNAKDSVLSISQHGGGNGADVGQGSDDSSIELVQHGFGNSATLDQWNGKDSTMTVKQFGG GNGAAVDQTASGSTVSVTQVGFGNNATAHQY >seq_12 GVVPQWGGN--HHGGGSNYGPDSSLSIYQYGSNNSANALQSDARKSDVTITQHGRGNGAVVGQGADDSTISLKQTGFQNSATIDQWNAKNADISVTQFGG RNGALVNQTASDSNVLIQQVGFGNNATANQH >seq_13 GLINQ--GGWGHGGG-NNNGPDSTLSIYQYGGGNSALALQSDARDSSLSISQSGGGNGADVGQGSDDSTITLTQNGFGNSATLDQWNGKDSTMTVSQFGG GNGAAVDQTASGSTVTVQQVGFGNNATAHQY >seq_14 GMFPQYGGDH---GNGNQSGPDSTMSIYQYGSGNNATALQSDARKSDLTIKQFGSSNGADVGQGSDSSTIDLLQKGTANNATISQWNSKNSDIQVQQFGA LNGAVVHQTASDSSVTVHQVGFGNHASASQY >seq_16 GLIDQGGWGHGHGHGQGGDGPNSTLNIYQNGGGNSAVALQTNARDSTLSISQSGGGNGADVGQGSDDSTISLTQNGFANSATLDQWNSHDSTMNVSQYGG FNGAMVDQTASNSTVNVTQIGFGNHAAAYQY >seq_37 SVTIDQLGSDNYANAVQEEGSRNKVDIDQNGTGNQVAAAWTWGSDNKLVVAQKGTDNWAYADASHSDSVVKVTQDGELNNAKVESYLGSDNTLTVSQFGG NNEVDVAQTDSLNTANVTVYGSDNMAYVYQG >seq_38 TITITQSGHHNYAGGSTDDGGLSTVTVNQTGHHNEVNSKGTWGANNIVTVDQDGHHNKAYADAA-DTSIIDIDQSGHHNEAYAQSEWGVANEIDIDQTDG SNVVYVTQTDMNNSAVVSSIGFGNTVTITQG >seq_39 NISVTQSGTLNYAGAATELGGGSSITITQAGTNNSVKNNGTYGANNTLVISQSGSDNTAYADASHDNSSVDIAQSGEFNDATVESWFGSDNDLYVSQTGS MNAVSVTQTDSYNTANVTSSGSNNLVTVTQG >seq_40 AMLAQTGNSNDAGVTQSDVTQVGDLNIAQVTQRNEAIVYSAGEAN-EAYVTQSAINNLADINQDGSLNIANVNQSAADNLAGVEQ-TGDENTADVDQTGA ENSATVAQTSFGSTATVLQTGSLNDATVTQA >seq_41 AMVTQTGNGNDATVTQSDVTQLGNLNIAEVTQRNEAIVYSNGEVN-QAYVTQSARDNLADINQDGYLNIANVNQTAADNLADVDQ-TGDENTVDIDQTGT ENSVTVDQNSFGSSATVLQTGSLNDASITQT >seq_43 ---------------------------SVVSQYNSAVVLQNGNLN-TANVTQSAESNGADVAQDGDSNTATVAQTAMGNTADVDQ-VGNGNTADVDQSGP SNGVTIAQTSSGSFAQVAQTGESNGATVTQT >seq_45 GTWHGYGHPG-WGWGYMADGNNQQASISQWGWGNSAWITQT-GNNVDATITQGGHGNDGGIGQGGASLVAQLTQSGHHNDGYINQ-DGANLQAYVSQSGI GNDAVVLQKGNDYIANVTQVGYRNNAYVNQR >seq_46 STVNQTGGHDNWAYGDQREGTGGTIAIGQYGGGNSVEVWQDTQIGSQASVVQTGQSNEGYIDQSGVSNKVSLSQQGNANASWSDQFETNNSTTTITQTGN NNLHFTYQNGENLNLTINTQGNGNSITASNW >seq_47 SVITQYGGEGNYAYGDQRNGTGGTITINQFGNLNGTEIWQDSQLASQATVNQYGDSNETVVDQSGENNTAAVTQVGNTNAIYADQFESVNSTLALYQVGN GNVHFTYQNGDSHTLNATSVGNDNKVYASNW >seq_48 SLVMQAGGKQNWAYGDQRDGLGGTIGISQYGTGHSVEVWQDNQAASQANVYQTGQLNEGYIDQSGQGNSASLSQNGRANASWSDQFESNQSVTSISQTGN NNLHFTYQTGDNHSLSIDTTGNGNKIMASNW >seq_49 SLVNQDGGKQNWAYGDQRDGTGGLIGIDQNGTGNSVEVWQDTQVGSQATVNQNGQLNEGYIDQSGQDNTASLYQQGKSNASWSDQFETTNSTTSISQAGT SNLHFTYQTGDNQSLTVTTQGTGNKVMASNW >seq_50 SSINQTGGTGNWAYGDQRDGTGGTIGIGQYGGGNSVEVWQDNQTASQASVTQSGQMNEGYIDQSGMNNKVSLSQQGTANASWSDQFETSNSTTTITQTGS NNVHFTYQNGENLNLTINTKGTGNSVTASNW >seq_52 SISVSQTGSDNYAGAASQGGNASEITIVQTGTGNRVLGAGTYGNDNVLNITQTGTENLAYADAADNGSQIDIAQTGEFNSATVESYSGAGNDIDVVQYGA GNMVSVSQYDAYNTANVTSHGSYNQVTIMQA >seq_53 SISVTQTGSDNYAGAATQGGNASEIIIVQSGTGNRVLGAGTYGNDNIIHITQKGTENLAYADAADSGSQIDIAQTGEFNSATVESFTGAGNDIDVVQYGA GNIVSVSQYDAYNTANVTSYGSNNQVTIMQA >seq_54 SISINQNGEGNYAGAGTRYGSGSQIIIDQQGSDNVIVGAGTWGADNILMVSQDGTSNRVYADASHDQSVVDISQTGEFNEASVESWWGAENDITVAQTGA GNHVSIMQYNASNTAVVTSTGSHNSATIIQG >seq_55 SISVIQSGASNYAGAATELGGGSSITITQTGTNNSVIGAGTYGANNMLVISQSGNDNTAYADASHDNSSVDIAQSGEFNDATVESWFGSENDLSVSQTGS MNSVSVTQTDSYNTANVTSSGSNNLVMVTQG >seq_56 DVFIYQTGNENYAGASTELGGDSTVTINQEGNNNYVVGAGTYGAENVITIDQIGNENKAYADSSDFSSEVDIDQTGDLNTAIAESWLGTDNDIMISQTGG NNMVSITQTDASNTANVTSIGNGNSVTVVQG >seq_57 GYIDQ-------------SGVSNKVSLSQQGNANASWSDQFETNNSTTTITQTGNNNLHFTYQNGENLNLTINTQGNGNSITASNWKGAKMG---GQFGT NQNATINQTGNGNGVNLTQKGADQQATLSQT >seq_58 TVVDQ-------------SGENNTAAVTQVGNTNAIYADQFESVNSTLALYQVGNGNVHFTYQNGDSHTLNATSVGNDNKVYASNWKGPQHG---GQFGS NQRATVNQTGNGNIASFTQDGIGQIMTTNQT >seq_59 GYIDQ-------------SGQGNSASLSQNGRANASWSDQFESNQSVTSISQTGNNNLHFTYQTGDNHSLSIDTTGNGNKIMASNWKGDKSG---GQFGD SQRAVVNQAGNGNSVNFEQKGSSQLARLNQN >seq_60 GYIDQ-------------SGQDNTASLYQQGKSNASWSDQFETTNSTTSISQAGTSNLHFTYQTGDNQSLTVTTQGTGNKVMASNWKGEKLG---GQFGS DQTAVINQKGTENTVNLTQNGTLQLATLGQK >seq_61 GYIDQ-------------SGMNNKVSLSQQGTANASWSDQFETSNSTTTITQTGSNNVHFTYQNGENLNLTINTKGTGNSVTASNWKGAKMG---GQFGT NQTATINQNGNGNGVNLTQNGDNQLATLTQK >seq_63 AVISQYGNSL-------------NASISQQGLSNRAIVSQTGFST-QIDVNQIGSSNRVTAFQSGAEIEAAILQSGFNNVVVSSQ-QGSYLNLDIEQQGN DNLASVLQVGNESSVTIYQNGTGHGVSVIQY >seq_64 ASIMQLGDAL-------------QADITQSGEANRALFTQFGVAN-SAELQQQGIGNQAYLLQQGEAIEASLLQVGVNNSLIASQ-LGANLSLVAEQRGS DNQAYVQQSGYDNEVSIYQNGSGHGVIVTQW >seq_65 ANVNQFGDDM-------------LVNINQLGSNNHFISTQSGFNN-QTLTTQQGTNNYAIANQSGNDIEANILQAGFNNVVLLNQ-FGSSLMADVGQIGT DNLAIINQTGSENAIWIQQYGSGNAVSVTQW >seq_66 ASVSQVGDAH-------------SANISQQGISNELYLAQTGYNT-SLIANQQGASNDILLLQSGSDIEASIQQTGYGNLVIASQ-LGTALDIDVTQNGY GNQAYIHQTGYENSVWIQQNGSGHVVSVAQW >seq_67 SSVNQ-------------LGADNYSSVSSTGRLNSSQVQQVHNNN-ESYVTQTGQQNGSRVLQGGNDNYSEVTQTGFRNSSYVAQQPSEFNDSYVVQIGD LNSSSVAQLGDVNFSDVIQNGDGNEGTVYQN >seq_68 AYTNQ-------------GGSDHYSYIEQSGSDNLANVYQRGDEN-ESYVVQDGYQNEAYVVQSDHENDSDVNQDGRRNQAWVTQ-SGGHNDSYVEQDGS YNDAAVDQSGDYNDSDVVQDGGGNDAWVTQS >seq_69 AYSYQ-------------YGNLNDAAISQLGDSNFSQVVQQIEGN-TAEISQEGAVNGDYVYQYGTENDATVEQ--------------DFNYGYSDQSGA QNSVTLTQWGNDNAAYQIQNGNTNSATITQG >seq_70 SYVTQ-------------TGQQNGSRVLQGGNDNYSEVTQTGFRN-SSYVAQPSEFNDSYVVQIGDLNSSSVAQLGDVNFSDVIQ-NGDGNEGTVYQNGD GNFSYAHQGGFDNLSQVSQVGDDNYSHVDQY >seq_71 SYVVQ-------------DGYQNEAYVVQSDHENDSDVNQDGRRN-QAWVTQSGGHNDSYVEQDGSYNDAAVDQSGDYNDSDVVQ-DGGGNDAWVTQSGD YNTSAVNQDGFDNDASVVQTGHDNNSSVVQT >seq_72 AEISQ-------------EGAVNGDYVYQYGTENDATVEQ--------------DFNYGYSDQSGAQNSVTLTQWGNDNAAYQIQ-NGNTNSATITQGVD YTGWASSQIGF---AQQEQYGDANDATITQR >seq_73 SRLTQ-------------AAGAQQAYLQQLGQGNLAELRQNGQAL-SAQVLQQGSDQEAFILQHGEDLLAVIEQVGHGNYAEIRQ-TGSDNQASISQYGA YNDARIEQVGDGLRSTVTQYGYGQQINIVQG >seq_74 IRLLP-------------ASAGQAAVIEQQGAGNRAALDQNGQAL-LGRIVQAGGAQEAYILQEGSDLMALISQQGNGNNASIRQ-TGSSNSAAIEQIGN DNSASIVQSGTGLNSSVTQAGNGQHVQITQY >seq_75 VRLLP-------------VGSGQAAVIEQQGNGNRAALDQNGQAL-LGRIVQAGGAQEAYILQEGSDLMATISQQGYGNSATIRQ-SGSGNSAAIEQIGN QNSATIDQRGTGLNSSVTQAGNGQHIHITQY >seq_76 IRLLP-------------AGAGQVAVIEQLGSGNRAALDQNGQAL-LGRIVQAGGAQEAYILQEGSDLMASISQQGNGNSASIRQ-SGTSNNASIEQIGN DNSASIVQAGSGLNSSVTQAGNGQHVQITQY >seq_77 ---LP-------------PPVGQQALIDQNGQVNLALLSQNGQSL-LGKIVQSGSNQEAYILQQGSDLMALITQNGSGNAASITQ-TGSHNRAQISQNGN NNDASIEQAGTGLQSAVSQSGNGMSVSVKQY >seq_78 -----P-------------AAGRDAAIRQLGVGNQALIDQRGTAL-RAQVAQSGAAQEARILQDGTELSAVILQGGYGNVARIEQ-LGSGNQADILQLGV QGNARIEQHGSGLSSRIVQYGSNQNTVVRQY >seq_79 SFIDQ--LGDDNAAGVEQYGLSNDSDVDQDGNDNAAVVYQYGESN-DSDVDQDGNGNVADVDQYGLSNYSDVDQDGNDNAAVVYQ-NGESNDSDVDQDGS DNYAFVGQDGDDNDSDVDQDGTDNYAYVYQN >seq_89 ANIIQDGGHGFWTGTALQDGELNDADVLQLGDGNASFIDQLGDDN-AAGVEQYGLSNDSDVDQDGNDNAAVVYQYGESNDSDVDQ-DGNGNVADVDQYGL SNYSDVDQDGNDNAAVVYQNGESNDSDVDQD >seq_91 AFVEQLAGSD-----------GSTSLITQLGDENSATVSQSGAL-NVSELTQSGFNSTADVLQSGSGNTSRLSQNGTVNTADVNQ-LGNDNLSDIAQDGS GNSATVTQNSDANTSYVNQNGNGNTASVTQG >seq_92 ALVTQ-------------SGGSNISDIQQIGGDNSAIVEQLGSAD-VSNILQDGANQFANVLQNGSSEYSSIMQNGDGNSALVDQ-SGSNNESYIDQNGT GNAATVTQTGTSDYSSVAQNGTGNTATVTQG >seq_93 AIVSQ-------------TGFSTQIDVNQIGSSNRVTAFQS-GAEIEAAILQSGFNNVVVSSQQGSYLNLDIEQQGNDNLASVLQ-VGNESSVTIYQNGT GHGVSVIQYGQGQSAQVTQGYISN------- >seq_94 ALFTQ-------------FGVANSAELQQQGIGNQAYLLQQ-GEAIEASLLQVGVNNSLIASQLGANLSLVAEQRGSDNQAYVQQ-SGYDNEVSIYQNGS GHGVIVTQWGNAQRASVTQGYR--QQ----- >seq_95 FISTQ-------------SGFNNQTLTTQQGTNNYAIANQS-GNDIEANILQAGFNNVVLLNQFGSSLMADVGQIGTDNLAIINQ-TGSENAIWIQQYGS GNAVSVTQWGVSQHVVVTQGSLSN------- >seq_96 LYLAQ-------------TGYNTSLIASQQGVSNEILLLQS-GSDIEASIQQTGYGNLVIANQLGTALEIDVTQNGYGNQAYIYQ-TGYENSVWIQQNGS GHVVSVAQWGSSQTAVITQGHTSN------- >seq_97 AFVSQVGDNLVSNVSQTGYSNFSSAAVDQVGASNTATINQYSARQMEASITQEGSGNSASSLQTGTNGYSLIMQDGHDNLATLID-SGDLNRSTITQSGD LNSALVTQGGSSNVSTIAQDGAGNSATVAQN >seq_98 AMIDQIGDDNYASIDQSGNARVATASISQDGSNNSSTIDQFGARLMEASSSQGGSGNFASVDQGGNDNISTVMQDGALNEAMVDQ-SGNGNESWVSQAGS GHSATVTQSSDMNNSVVNQTGMNNTATVTQG >seq_99 GDITQMGNGNTAGMVSLNVGNNNDVTVYQEGDNNLGAVKGVAGDDNNFDIEQAGNDNTGFVYDLGSDNEFEIDQEGDNNTAYMAAVQGDDNFIEIKQDGM GNDLRVDQEGDNNTAQFQVFGNDNDVSLDQE >seq_100 GDIKQYGNNNQAGLIALNVGNNNDVSVEQIGNNNFGAAKGIAGNDNSVDIYQKGDNHTGFVYALGSENDISMEQEGSNNTAYLSMTTGDDNTVDITQDGD SNDITIKQKGDSNGAEFQVWGDSNDVDLKQR >seq_101 GDITQYGDNNVGGMVSLNLGNNNDVEVYQEGNNNLGAIKGVAGDNNNFLVTQEGDSNTGFVYSLGSDNEIEINQEGNSNTAYLALTTGDDNDVEITQRGD NNNIDLELEGNNNGAEFQVLGNENDIDLDQE >seq_102 GDIKQYGDNNQAGLIALNVGNNNDVSVEQIGNSNFGAAKGVAGNDNSIDIYQKGDSHVGFVYALGSDNDITMKQEGNSNTAYLSMTTGDDNSIDIAQDGD SNDISIKQKGDSNGAEFQVWGDSNDVDLKQR >seq_103 GEITQLGNNNQGGLIALNVGNNNDVSIYQEGDDNVGGVRGVAGDNNEVEIEQVGNNNVGFVYALGSNNDLTMTQDGDRNVAVLEFTTGDNNDVEINQSGS ENMIDIEQEGFSNSAQFIVDGDDNDVDLEQE >seq_104 GDISQEGNNNQAGMIALNVGNNNDVTLGQVGDDNLAGVRGVAGDDNTVTIMQEGDNNTGFIYALGSDNEIDIDQFGNNNNTVLQFATGNDNDVSVLQDGN GNHIDIEQTGDTNLAEFEVTTDDNDADLEQD >seq_105 GKITQIGDNNQAGLIALNVGNNNDVSVNQKGNNNFGAAKGIAGNDNTVDMDQKGDNHVAFVYALGSDNDISMDQKGNGNTAYLAMTTGDDNTVDITQNGS GNDITITQKGDVNGAEFQVWGDSNDVDLKQK >seq_106 GDIKQYGDNNQAGVIALNVGNNNDVSVEQVGNNNFGAAKGIAGDNNSVDMVQKGDNHVGFVYALGSDNDIAMDQKGRGNTAYLAMTTGDDNSIDITQNGS GNDITITQKGDVNGAEFQVWGDSNDVDLQQR >seq_107 FLSEQSGSLNESGYF--------------------------DGDDNEVYAQQYGSSNIFIARDLGNENFVDVFQDGSAN-FLIQPIQGEANFIVVQQQGN SNLIDAYQRGDNNYAV--TEGDMHNVELEQA >seq_108 EIKLDQEGSNNAIESGLFDGSFNEVTVNQIGDENLATTELMGD-NNELTAMQVGNTNEAYMGVIGSNNEFTVTQVSDFNSVHLANFNGSGNDVDLSQSGD ENAIDVSLTSNDNDIDVNQVGNQNEAIVTLA >seq_109 EIDLDQEGTSNTADSALFLGNNNEITITQLDTLNSATAELIGD-NNELTATQNGFGNEAYMGVIGSDNEFLINQVGDLNAAHLVNFNGTGNDVDVFQNGD ENTTNTALTSNDNNIDIMQSGVANETMVTLA >seq_110 EIDIDQEGSNNTADSALFEGQNNEVTITQVDTYNTATAELIGD-NNELTSNQNGLFNDAYMGVLGSDNEFTIDQVGEFNSAHLVNFNGSDNDINLTQTGQ DNTADTSLTSNDNDIDIVQDGSLNETMLTLA >seq_111 EIDLDQEGSLNTIESNLFEGQNNEVTVTQVDLSNSTTVDIIGD-DNELTATQVGLLNDAYMGVLGTDNEFMINQVGESNSAHVVNFNGSENDVNVIQSGQ ENTADISLTSNDNDIDIMQNGSQNEAMVTLA >seq_112 EVDIDQEGESNNVDSALFEGENNEVMAIQNGDFNSATAELIGN-DNELTAIQNGSANEAYMGVLGSNNEFSINQFGDVNSAHLVNFNGSGNDVDLTQLGE ENITDVSLTSNENDIEITQDGLQNETMVTLA >seq_113 EVGIDQEGLANNVDSALFEGENNEVMAMQNGDFNTTTAELIGN-DNELTAIQNGSENEAYMGVLGSDNEFTISQFGDVNSAHLVNFNGLGNDVNLTQLGE ENIADVSLTSNENDIEITQDGLQNETMVTLA >seq_114 SLMLDQNGNKNAIVSDVFSGDDNEVAINQIGDDNKASADVMGN-DNDFKVTQLGNTNASYMGVIGDNNDFTLVQVGDSNSSHIANFNAADSQVTVSQTGD GNSSNVALTSFNNEIMVTQSGDENESTVTFS >seq_116 SRIVQWGTHNEVGGGQ--YGFGNRLTVEQDGWSNSSIS------------TQDGRRNRAVVGQNGHRNSANTQQFGSCNVSGIAQ-FGGHNRARATQSGG CNASAIIQAGHGNRANTRQYGSGNVTVIVQ- >seq_117 VRIEQYGWSNSAGGAQ--EGYGNRIRTYQNGGYNRIVG------------HQYGRHNLSAVGQEGHDNYGSTYQNGSRNVAGIGQ-FGSNHTTILTQDGN GNIAAGVQVGRGCSANVSQGGNDNVAAFVQA >seq_118 IHINQFGWGHSAGGTQ--SGTGNTIGIFQDGWWNSSTN------------HQSGHGNVSASGQTGWNNEAETWQNGNFNEAGVGQ-FGSNHTSVLTQDGN GNVAAGVQVGNGCTASVDQNGSGNVAAFVQV >seq_119 VRFDQYGWSNSAGGSQ--QGYRNRIRVHQDGRYNRSVG------------EQRGSHNLSVIGQEGRRHYGATYQNGSRNSAGIGQ-FGSDHTTILSQDGH GNIAAGVQVGRGCSADVAQGGSGNVAALVQA >seq_120 FRIEQFGWANSSGGSQ--HGYRNRMRVHQDGRYNTSVG------------EQRGKRNLSVVGQNGRGNFGATYQTGKRNAAGIGQ-FGSNHTTILTQDGN GNIAAGVQVGHGCDANVAQRGRGNVAAIVQA >seq_121 IWIEQHGWSHSVGGSQ--DGRRNEIGIYQNGARNSAIA------------KQRGRGNVAAVGQEGRRNRGQAEQRGRNNAAGIGQ-FGSSHNSIMVQDGN GNIAAGVQVGHGCDAATSQSGRGNVAAIVQT >seq_122 QFTPQTG------------ALGGEVQVYQQGTSNISSVNQDSAVFSKTSVSQGGTGNYASVNQNGNVSVVDINQQGNNNAATSSQ-NGNWSVTEVSQSGF GNTATTNQSGDFSRTAVVQSGFGNNAVTSQS >seq_123 SINTQLGGDSNFASTTQT-GTFNDSSIDQDGTGNGAITAQLGTAN-DSDVEQDGTGNGSFVAQAGVGNISDVKQDGTTNGSLVLQ-LGVANNSDVDQVGT SNGSFVAQLGAANSSIVSQDGAGNGSAVLQA >seq_124 ARVYQGGGNGNFADTAQQ-GTLNDSTVDQFGDGNSSEVDAQGNSN-IARTYQGGNNNDSSIFSTGDSNFADTAQQGNGNSSEVIQ-TGDSNFSEVDQQGN GNLNFVSQVSNNSSSDVFQVGDANDTRVNQL >seq_125 ANVDQ--GGSFETSRARQDGQNSEIDVDQFGTGNESTILQGSEGNNDATINQEGASGQAFITQIGNLNTSEIVQNWADNTATSSQ-NGSELQSMITQNGE FNTATVNQTGDGHMSTVTQTGTGNSAMVTQG >seq_126 ADIDQ--TGDSNIVDAFQGTAGNDLTVLQSGDANTS--------NNSANVTQDGIGATSTIDQTG--NVSELTQSGFNSTADVLQ-SGSGNTSRLSQNGT VNTADVNQLGNDNLSDIAQDGSGNSATVTQN >seq_127 SIVNQ-------------VGNINTSTITQSGNDDLAFVDQIGERN-RSTITQGSNRNIADVDQNASDGISTITQTGDDNFAQLIQ-GGTFNESVITQDGT LNSAVVTQGGMMDYSGVMQTGTSNVATVNQS >seq_128 --VNQ-------------VGDENISRVTQSGTDDTAFVDQIGNEN-VSNVIQGGLENLADVNQDGDEGFSRIVQSGTTNEAELNQ-DGLLNTSIILQDGT SNMATVNQDGTGNFSRVDQAGTMNSVVVNQN >seq_129 TFAAV-------------TGNDNEIKLDQEGSNNAIESGLFDGSFNEVTVNQIGDENLATTELMGDNNELTAMQVGNTNEAYMGV-IGSNNEFTVTQVSF NSVHLANFNGSGNDVDLSQSGDENAILVQSS >seq_130 NFAAI-------------VGSDNEIDLDQEGTSNTADSALFLGNNNEITITQLDTLNSATAELIGDNNELTATQNGFGNEAYMGV-IGSDNEFLINQVGL NAAHLVNFNGTGNDVDVFQNGDENTTVVESS >seq_131 TFAAV-------------VGNDNEIDIDQEGSNNTADSALFEGQNNEVTITQVDTYNTATAELIGDNNELTSNQNGLFNDAYMGV-LGSDNEFTIDQVGF NSAHLVNFNGSDNDINLTQTGQDNTAVVESS >seq_132 TFVAV-------------LGDENEIDLDQEGSLNTIESNLFEGQNNEVTVTQVDLSNSTTVDIIGDDNELTATQVGLLNDAYMGV-LGTDNEFMINQVGS NSAHVVNFNGSENDVNVIQSGQENTAVVESS >seq_133 NFTAI-------------VGNNNEVDIDQEGESNNVDSALFEGENNEVMAIQNGDFNSATAELIGNDNELTAIQNGSANEAYMGV-LGSNNEFSINQFGV NSAHLVNFNGSGNDVDLTQLGEENITVVESS >seq_134 NFTAI-------------VGSDNEVGIDQEGLANNVDSALFEGENNEVMAMQNGDFNTTTAELIGNDNELTAIQNGSENEAYMGV-LGSDNEFTISQFGV NSAHLVNFNGLGNDVNLTQLGEENIAVVESS >seq_135 TIAAL-------------EGNGNSLMLDQNGNKNAIVSDVFSGDDNEVAINQIGDDNKASADVMGNDNDFKVTQLGNTNASYMGV-IGDNNDFTLVQVGS NSSHIANFNAADSQVTVSQTGDGNSSLVQSS >seq_137 STVTQYGWSQAAYVDQVGDGNTSSIDQALNGSDQYASVAQNGDDG-MSTITQRGQDQRAELTQGGLSNESFIDQSGSDHLAEVTQ-DGADNYSSVIQSGS GSSATVTQTSDLNNSFVNQSGNGHSATVVQG >seq_138 ATLTQTGDENIANFSAIGNSN----DITQTGDLNLVDLLVDGDLN-QSLIAQTGNSNRV----GGFSA---FSVDGNDNQLNIAQ-NGNNNLVTGSQSGM GNMIDVQQTGNLNVANVVQN----------- >seq_139 SEIIQ-------------DGEEGDVDVDQTGMGNESFVDQDDNTDRRILITQSGNENESEAFQDDDDSIITHTQSGNDNVADTDQ-GSEDSESTITQSGN NNEATVVQGSDDQFSTIVQSGSRNTAEVQQG >seq_140 GA-PS---------------FANHSRIVQWGTHNEVGGGQYGFGN-RLTVEQDGWSNSSISTQDGRRNRAVVGQNGHRNSANTQQ-FGSCNVSGIAQFGG HNRARATQSGGCNASAIIQAGHGNRANTRQY >seq_141 AAAPA---------------MANDVRIEQYGWSNSAGGAQEGYGN-RIRTYQNGGYNRIVGHQYGRHNLSAVGQEGHDNYGSTYQ-NGSRNVAGIGQFGS NHTTILTQDGNGNIAAGVQVGRGCSANVSQG >seq_142 MSAPA---------------MANNIHINQFGWGHSAGGTQSGTGN-TIGIFQDGWWNSSTNHQSGHGNVSASGQTGWNNEAETWQ-NGNFNEAGVGQFGS NHTSVLTQDGNGNVAAGVQVGNGCTASVDQN >seq_143 AVTQA---------------SANDVRFDQYGWSNSAGGSQQGYRN-RIRVHQDGRYNRSVGEQRGSHNLSVIGQEGRRHYGATYQ-NGSRNSAGIGQFGS DHTTILSQDGHGNIAAGVQVGRGCSADVAQG >seq_144 PVASA---------------SANDFRIEQFGWANSSGGSQHGYRN-RMRVHQDGRYNTSVGEQRGKRNLSVVGQNGRGNFGATYQ-TGKRNAAGIGQFGS NHTTILTQDGNGNIAAGVQVGHGCDANVAQR >seq_145 SM-PA---------------MANSIWIEQHGWSHSVGGSQDGRRN-EIGIYQNGARNSAIAKQRGRGNVAAVGQEGRRNRGQAEQ-RGRNNAAGIGQFGS SHNSIMVQDGNGNIAAGVQVGHGCDAATSQS >seq_146 SIVLQDNGTG---GGSNASDDNNSATVDQTGDSWSRQVTQNGDGN-SSDILQTGDNDDAFVDQQGSDLTSTIAQSGNSNDANVSQ-SGDTNQSGIIQSSG ANVAIVTQEGSLNESSILQNGLGNTANVEQT >seq_147 GQTATTTQSG---AVNTSTGDDNTITSTQNGSLNSEFVNESGDGN-QIDVVQDGNFNNNNVNQDGDNNTVDVDQIGNRNSTTTEQ-IGNNNFATFQVDGN RNDGVIKQTGDNNQAGLLSAGFSNSGDNNDV >seq_148 AAFAQ--------------SVGSDSSIYQYGSDNTNDVTQWGTQKSRINIEQTANGNEAIVDQLGAYNVSGIYQTSSDNDANVTQ-DGKMNDSYVKQSGN GNEAVVVQDGKFNDSVIEQQGSYNTASVDQE >seq_150 SNATQ--------GSPNRLTSLQQLDLLQDGTDNNSTITQFGSQN-VAAVEQYGERNTSIVDQPGLDNMVGVTQDGFDNYSSVTQ-SANFGSAFVSQVGD NLVSNVSQTGYSSSAAVDQVGASNTATINQY >seq_151 SVLNQ--------GDGNPFQTNGQQALITQGGDNNQSYTQSGSSG-DLNVNQSGDNNLSDVVQSGFDNMVGVNQEGNSNEAYVTQ-AGQLSDAMIDQIGD DNYASIDQSGNAATASISQDGSNNSSTIDQF >seq_152 SSLNQ------------------AAIIGQQGMLNDAQVRQDGSKL-LSIVSQDGAGNRARVDQS-----------GTYNIAWIDQ-SGSANDAGITQDGY GNSAKIIQKGSGNRANITQYGTQKTAVVVQR >seq_153 SSLNQ------------------AAIIGQKGAYNDAQVRQDGSKL-LSIVTQDGVGNRARVDQS-----------GTYNYAYIAQ-SGYANDADISQDGY GNTAKIIQQGSGNRASITQYGTQKTAVVVQK >seq_154 SSFNQ------------------AAIIGQVGTNNSAKIRQDGSKL-LSVVSQEGGSNRANVDQS-----------GTYNFAYIDQ-TGNANDASISQSNY GNTAMIIQKGSGNKANITQYGTQKTAVVVQR >seq_155 SSFNQ------------------AAIIGQVGTDNSARVRQEGSKL-LSVISQEGGNNRAKVDQA-----------GNYNFAYIEQ-TGNANDASISQSAY GNRVAIIQKGSGNKANITQYGTQKTAVVVQK >seq_156 SSNNQ------------------AAIIGQQGGYNNANIQQGGSKV-LSVITQDGVGNRANIDQS-----------GTYNFAYIAQ-AGSANDADIQQGGY GNTAAIIQQGSGNKASITQYGTQKTAVVVQK >seq_157 SSFNQ------------------AAIIGQVGTANSANTRQEGTKL-LSVISQEGSGNRAKTEQT-----------GSNNFAYIDQ-TGSSNDASIKQSSY GNTAMIIQKGSGNKANITQYGTQKTAVVVQR >seq_158 SSFNQ------------------AAIIGQAGTNNSAQLRQGGSKL-LAVVAQEGSSNRAKIDQT-----------GDYNLAYIDQ-AGSANDASISQGAY GNTAMIIQKGSGNKANITQYGTQKTAIVVQR >seq_159 ASYNQ------------------AAIIGQQGSGNNADVRQGGSKL-LSVISQEGGNNRANVDQS-----------GTYNLAYIDQ-TGNGNDASIKQGAF GNTAMIIQKGSGNRANITQYGTQKTAVVVQR >seq_160 SSLNQ------------------AAIIGQQGVNNDAQVRQGGSKL-LSVVSQEGTGNRARVDQS-----------GTYNFAYIAQ-SGSSNDASITQGAF GNTAMIIQKGSGNKANITQYGTQKTAVVVQR >seq_161 ATVGA-------------ASACNIATANQFGWDNSAGVQQFGNCN-NTAIGQNGWNNTAAGISNGFANTVVVGQDGAWNTGVVGQ-NGSFNAGAVQQSGA LNYGEVVQNGNFQTGAVIQSGVGNTGVLNQT >seq_162 GFTVL-------------PASADSIHIEQYGWANAAGGDQHGSRN-RIGIYQNGKRNGAVVRQQGRNNTAAIGQQGRRNAADIWQ-RGKGNAAGVGQFGR DHNAVVTQDGNGNIAAGVQVGKGCDGEVSQS >seq_164 AEIAQ--------------GDGNRLALVQDGNYNDADIHQGDYHN-ELSFTQSGDDNRLTVDQNGYGGVISGSSTGNRNSVDIDQ-RFASNRASVTQNGD DNLASIEQGNWGHQATITQLGSANEAMIRQG >seq_165 STIIQ--VGDNNDGQVQDYSSGNTASITQTGNNNKSGVFRDGGSENTFTSVQTGNNNRILAPQYGVGNMIDITQTGNSNLTTLRQ-LGNGNDLMVSQADG NQTAILVQSGNTNMLSVTQSGLGNMATVSQL >seq_167 AEITQRGDNNSLSFSQQTYYEGSRLSVSQDGVSNQAEIAQGDGN--RLALVQDGNYNDADIHQGDYHNELSFTQSGDDNRLTVDQ-NGYGGVISGSSTGN RNSVDIDQRFASNRASVTQNGDDNLASIEQG >seq_168 AMQDQ--------SSMSENAPGNFADIDQDGNRNMASQTQRGEAN-MAFITQDGNRNDATQTQTGDMNSATSTQTGNRNTSLESQ-DGDLNVSINTQLGD DNMTDHAQKGDSNFASTTQTGTFNDSSIDQD >seq_169 SVINQ--------YSAT---PGQLS------NDNEAVVSQVGDDN-ASVVSQDGDFNFADTAQNGNDNDAIVNQVGDSNFSEVDA-QGNSNVARVYQGGQ SNTSTILSSGNGNFADTAQQGTLNDSTVDQF >seq_170 SLVTQ-------------SGLDAEIIVSQNGQINRSNVVQSIANRAIVEVSQTGVFNSSFVNQSGDGSSADVVQGGARNVSGVIQ-SATDSTIEVTQTGE NNESFASQSGTLQTAAIIQDGDEHLSNLTQS >seq_172 RYFNTQVGSNATANGSQQSGDQNTLFGTQKGAFNAVDTGQSDTNN-KLTYSQDSGDSTATVTQNGSWNYSHGSQAGGGNTATIDQ-SNVGNRSTYSQVGS TNTLTVTQVHNGNLSNVNQNGTGNTATISQH >seq_173 SYVSNQSGSNNTANGTQQGGTNNSIAGSQTNNNNNVNNSQFGSYS-KLTYTQTGSQNTAVVEQSGFGLYSSGQQSGGGNSASVTQ-DGSLHASYYVQSGA TNTIAVAQTGAGNYSNLNQTGTGNYATIKQH >seq_174 TYINTQTGSSNSANGSQQSGGSETVRGTQVGTYNYVSNSQAGSSN-LLDYKQTAVSNTAYVVQDGNGLYSKGLQTGGYNSANVSQ-TGQFSASYYTQDGS SNSIIVAQVGNYNVSNVSQIGAGNHANIQQK >seq_175 SYLAYQTGYGNRANGSQQTGTRNDLSGSQTGFGNGVTSFQSGNGE-VLSYSQTGGSDHADLTQIGNNNTIVGAQAGGGNLATVIQ-NGNGNIGLYTQIGS GNVLTLTQIGNGNLANTSQTGSNNSIKIVQ- >seq_176 INANQNAGGYTYASLAQGGGNGNSINVSQSGAVLSADVNQNGNFN-LFQSSQSGSRNGVGTDVGAY-NDTPITQTGFGNRYFNTQ-VGSNATANGSQNGV ANRVTNNQSGDQNTLFGTQKGAFNAVDTGQS >seq_177 ATISQNSAGLNYAVAVQGGGDSNSLGIYQNGDLLGANAWQEGAGN-VFSSNQSGVGNTIGITGGIYGSDTPIKQKGNGNSYVSNQ-SGSNNTANGTQTGD VNVVFNNQGGTNNSIAGSQTNNNNNVNNSQF >seq_178 ASITQNSAGLNYAVAVQGGGNSNSLSISQSGNLQGANAWQNGNAN-VFSSIQTNVGNVVGLTGGIYGTDSPITQTGDNNTYINTQ-TGSSNSANGSQTGN FNQVFNTQSGGSETVRGTQVGTYNYVSNSQA >seq_179 VDVNQTTAGWTYAAVAQGGGSGNTVNIGQNGSYLGAGVSQSGFDN-RFNAVQSGDSNNIGLQGNA-GPDTPIRQFGDRNSYLAYQ-TGYGNRANGSQTGN SNDVWTSQTGTRNDLSGSQTGFGNGVTSFQS >seq_180 TLRTSWGQGG-------------AVDVYQSGKRNSVTVSQSGAIGELVTVKQIGNGNSGVITQTADINSVKLRQTGDNNTATLSQIAAGNDYISVVQTGN NNSSTVSQTGAGESATVTQTGSFNSATVNQN >seq_181 SDINELAIGINNAVNVTQLGDELSINLNQSGFNNQFNSNQLGYNN-QIFTHQQGMFNGVTAFQSGADIEASIYQSGFGNRVITSQ-VGSNLLTDVSQIGT QNLAIIHQTGSNNTIMIQQNGYGSAVGILQW >seq_182 ALIER-------------SGRDNLIDLVQQGTANQGIVFQSGSDN-SAYVTQAGNDNISLVTQIGTNNEVQLLQVGAQNKASITQ-IGNDNLVQLNQLGS GN-FSIQQIADGAAISITQY----------- >seq_183 ALLER-------------SGRDNLITFIQQGNINIGGVLQAGNDN-SAYLIQQGNNNNAIVIQVGIENEVQLQQAGNSNSALITQ-WGDANLVQLNQTGS NN-FSITQIADNAAISITQY----------- >seq_184 ALLER-------------SGRDNLIELVQQGNANQGLLFQSGSDN-KADVTQVGNDNDALITQIGSNNEVQLLQVGSQNTATVTQ-IGNDNLVQLNQLGS GN-FSIQQIADGAAISITQY----------- >seq_185 ALLER-------------SGRDNLIDLVQQGTANQSLLFQSGSNN-DAYVTQVGMDNSAFVTQIGSDNEVQLLQVGTQNTASITQ-IGNNNLVQLNQLGS GN-FSIEQIADGAAITITQY----------- >seq_186 SVIYQ-------------NGNNNLATTYQTGSGNSGVIRQSGSEN-TALVNQRGNGNNADISQSGSDNDASITQQNFGNTAYILQ-KGR-SVTEIRQNGT NQSSGVVQNSSGMAIRVTQH----------- >seq_187 VSVAQQGYQNDIAGRQ--TGHRQSMVINQRGERQYVTVIQDGARN-GTTVGQNGRYNEAHVDQYGRNNDAVVGQQGYNNYTRNIQ-TGRRNIVGVSQMGR NNTAVTDQYGNNNAAGVIQVGNGNNANVRQT >seq_188 IEIRQSGYANELAARV--DGHRNRLSLEQRGVRQFISTLQDGTRN-GAQVSQEGGDNQAGLSQQGRRNRVVIGQEGWLNDAFSSQ-SGYGNVVGVAQIGE GHQAITHQEGSNNALGVIQIGRGQFTDVHQS >seq_189 AYLQQ-------------LGQGNLAELRQNGQALSAQVLQQGSDQ-EAFILQHGEDLLAVIEQVGHGNYAEIRQTGSDNQASISQ-YGAYNDARIEQVGD GLRSTVTQYGYGQQINIVQGR---------- >seq_190 AVIEQ-------------QGSGNRAALDQNGQALLGRIVQSGGAQ-EAYILQEGSDLMAIIRQQGNGNSASIRQSGSSNNAAIEQ-IGNDNSASIVQSGS GLNSSVTQAGNGQHVQITQYR---------- >seq_191 AVIEQ-------------QGNGNRAALDQNGQALLGRIVQAGGAQ-EAYILQEGSDLMATISQQGYGNSATIRQSGSGNSAAIEQ-IGNQNSATIDQRGT GLNSSVTQAGNGQHIHITQYR---------- >seq_192 ALIDQ-------------NGQVNLALLSQNGQSLLGKIVQSGSNQ-EAYILQQGSDLMALITQNGSGNAASITQTGSHNRAQISQ-NGNNNDASIEQAGT GLQSAVSQSGNGMSVSVKQYR---------- >seq_193 AAIRQ-------------LGVGNQALIDQRGTALRAQVAQSGAAQ-EARILQDGTELSAVILQGGYGNVARIEQLGSGNQADILQ-LGVQGNARIEQHGS GLSSRIVQYGSNQNTVVRQYR---------- >seq_203 DVITA----DNYAEVSMENAIGGEVTIEQYGRGNKAKVIQHDSRRSQATIWQEG-----------SANTALITQTGNQNKAVIGQ-SGYGHEAWINQNGN FNVAAIIQTGAASSLSINQTGHNNQAYIVDK >seq_204 GVITQ------TIVADHFYREINSVKLRQTGDNNTATLSQIAAGNDYISVVQTGNNNSSTVSQTGENISATVTQTGSFNSATVNQNPADGATTRITQTGN NAVANVTQTGDDNSVKLFQTGSDQTATISQT >seq_205 ASVNQ-------------NGNVSVVDINQQGNNNAATSSQN-GNWSVTEVSQSGFGNTATTNQSGDFSRTAVVQSGFGNNAVTSQ-SGDAAVIVVRQSGA SNYANSSQTADFSYSNVNQVGAGNASYVNQN >seq_206 SLVTQ--GEGLAPPSPTDVGRDNSATVIQSGDGNTSAIDQR--RDSTALVTQSGGSNISDIGAGSDNSAINILQDGANQFANVLQ-NGSSEYSSIMQNGD GNSALVDQSGSNNESYIDQNGTGNAATVTQT >seq_207 TLLES-------------SGRDNLIDLFQIGVQNEAMVAQSGDSN-SLIVTQIGVANQALVRQLGTDNEVDLFQAGNHNSAEITQ-IGDNNLVQLKQLGS ANFS-IQQIGDGASIAVTQY----------- >seq_208 TLLES-------------NGRSSLVNLVQLGNLNTANIMQTGNNN-YVDLMQLGDNNEANITQDGKNNQVELIQVGGDNQADITQ-IGNDNLVNLNQLGS ANFS-IEQIADGAEITITQY----------- >seq_209 GLLES-------------HGRDNLVNLFQHGSFNQAILSQTGIEN-SAYLTQLGIGNQTSVIQIGSNNELELLQQGDNNRADVTQ-IGNDNLIQINQLGS ATFA-IEQIADNAAITITQY----------- >seq_210 SDIFQ-------------SQRLNDADVQQTMAGNSSEIIQDGEEG-DVDVDQTGMGNESFVDQDTDRGRILITQSGNENESEAFQ-DDDDSIITHTQSGN DNVADTDQGSEDSESTITQSGNNNEATVVQG >seq_211 SYIDQ-------------STINSTAVTSQVGNFNKATIEQFAALNGQAVINQTGDENQAYIGQGSYNNNAQITQSGDFNVAGVIQ-TGEGNQAVFQQIGS GNAIFVLQQGNNNSLTVTQTGMDNLLQIQQT >seq_212 AYIMQDGGELNYAGVGQN-GDLMEAFIYQDADRSFVRVSQFGGEGSFADITQYGYLNRIRTRQDGDFHSVTVDQLGELNYVLVEQ-FGEGQIATVSQDGD RNGTFVEQRGYDNLANVMQVGNWNDAEIHQN >seq_213 SNIEQ-------------TANGNEAIVDQLGAYNVSGIYQTSSDN-DANVTQDGKMNDSYVKQSGNGNEAVVVQDGKFNDSVIEQ-QGSYNTASVDQEGY INDSWVDQSGYGNEANVDQSGSNNQSGIQQS >seq_215 SSFNQ------------------AAIIGQAGTNNSAQLRQGGSKL-LAVVAQEGSSNRAKIDQTGDYNLAYIDQAGSANDASISQ-GAYGNTAMIIQKGS GNKANITQYGTQKTAIVVQRQSQMAIRVTQR >seq_216 SSFNQ------------------AAIIGQVGTDNSARVRQEGSKL-LSVISQEGGNNRAKVDQAGNYNFAYIEQTGNANDASISQ-SAYGNSAAIIQKGS GNKANITQYGTQKTAVVVQKQSQMAIRVTQR >seq_217 SSFNQ------------------AAIIGQVGTANSANTRQEGTKL-LSVISQEGSGNRAKTDQTGSNNFAYIDQTGSSNDASIKQ-SSYGNTAMIIQKGS GNKANITQYGTQKTAVVVQRQSQMAIRVTQR >seq_218 VTAVQ-------------SAPRAAARIAQDGARNIAHVEQRGEAA-AAAVTQRGDDNATRVTQQAGQNMLALAQIGHGNSAVVQQRAASSNQATIAQTGN GNQVALTQDGGDNQATLAQTGDNNAMTASQN >seq_219 ATIEQ-------------SSNTQTAEVRQEGSDNEATLEQNDNGS-HIAVGQDGAGNTAEVAQSDGEAVANLIQQGDNNTILLSQTDLLGTAAEIAQSGN DNMIGLVQDGSDNQARLVQNGNNNTMTASQL >seq_220 TTISQSGGVNGNNATVNQIAAGNTTSITQTADGNNATVNQWFEFGDNATINQSAASNNATITQQGYGNSASITQTAAFNTASISQNNSAGSSASITQASA DNQASVAQTASGSSTTVSQSGIGNYANASQT >seq_221 ALLWQ-------------LGSEQQLRLTQQGELNQARLLQQGWAN-ELALSQVGDENWVNGAQLGEGNSAELYQAGINNRIWLLQ-QGSGNEAWISQQGA NNQARAIQLGDGNQASIEQNGYGLAATVIQT >seq_222 VYIEQ-------------VGSYNRVTAVQSAPRAAARIAQDGARN-IAHVEQRGEAAAAAVTQRGDDNATRVTQQGAGQNMLALAQIGHGNSAVVQQRAS SNQATIAQTGNGNQVALTQDGGDNQATLAQT >seq_223 VFVEQ-------------IGERHEATIEQSSNTQTAEVRQEGSDN-EATLEQNDNGSHIAVGQDGAGNTAEVAQSGDGEAVANLIQQGDNNTILLSQTDL GTAAEIAQSGNDNMIGLVQDGSDNQARLVQN >seq_225 VTVIQ-------------DGARNGTTVGQNGRYNEAHVDQY-GRNNDAVVGQQGYNNYTRNIQTGRRNIVGVSQMGRNNTAVTDQ-YGNNNAAGVIQVGN GNNANVRQTGSGNVTLVIQGDH--------- >seq_226 ISTLQ-------------DGTRNGAQVSQEGGDNQAGLSQQ-GRRNRVVIGQEGWLNDAFSSQSGYGNVVGVAQIGEGHQAITHQ-EGSNNALGVIQIGR GQFTDVHQSGSGNVTLVIQGGF--------- >seq_227 ATANQFGWDN-------------SAGVQQFGNCNNTAIGQNGWNN-TAAGISNGFANTVVVGQDGAWNTGVVGQNGSFNAGAVQQ-SGALNYGEVVQNGN FQTGAVIQSGVGNTGVLNQTGTGNTALIIQA >seq_228 IHIEQYGWAN-------------AAGGDQHGSRNRIGIYQNGKRN-GAVVRQQGRNNTAAIGQQGRRNAADIWQRGKGNAAGVGQ-FGRDHNAVVTQDGN GNIAAGVQVGKGCDGEVSQSGSGNVAAFVQT >seq_229 STFAQ------------------TAQITQEGDQNLAEIYQENGADNVATTFQSGYGNVSYIDQSTINSTAVTSQVGNFNKATIEQFAALNGQAVINQTGY NNNAQITQSGDFNVAGVIQTGEGNQAVFQQI >seq_230 SEVPD------QIFAGARNSSGNSSVIYQNGNNNLATTYQT------------GSGNSGVIRQSGSENTALVNQRGNGNNADISQ-SGSDNFAYVSQTGG G------------DASITQQNFGNTAYILQK >seq_231 SYILQ---------GAAGTGDTNTALVGQSGMDNDSYVSQLNGDNNFAGVAQLGTDGESDIFQNGSGNTALVGQFGEGDVSFIIQ-NGNGKTATVLQDGW NNISTITQSTVNANASVTQVGSGNRASTIQY >seq_232 SYVFQ---------DSRNAGAGNSAVITQSGSDNDSYADQVGGGNEVT-VTQSGNDALSTIYQRGDNNVADVSQSGGFDASNILQ-FGDGHEANVIQSGW NNISTITQSAANASATVNQMGSGNRASTIQY >seq_234 SVPLQ-------------------------------TLLESSGRDNLIDLFQIGVQNEAMVAQSGDSNSLIVTQIGVANQALVRQ-LGTDNEVDLFQAGN HNSAEITQIGDNNLVQLKQLGSANFSAVTQY >seq_235 PITLQ-------------------------------TLLESNGRSSLVNLVQLGNLNTANIMQTGNNNYVDLMQLGDNNEANITQ-DGKNNQVELIQVGG DNQADITQIGNDNLVNLNQLGSANFSTITQY >seq_236 PVTLQ-------------------------------GLLESHGRDNLVNLFQHGSFNQAILSQTGIENSAYLTQLGIGNQTSVIQ-IGSNNELELLQQGD NNRADVTQIGNDNLIQINQLGSATFATITQY >seq_237 GIVNI-----------NQDGLDQGAMVSQSGTDNMADIDQTASLN-RVTVAQMGDFGAADIDQTGNNNEGWLNQTGEYNDAVLLQ-QGDDNYAMIEQSGA HNLATINQLGMNDSASIMQAGFGNTATITQN >seq_238 GAFISQIGSGNTAEADQQ-GTSAYARVIQNGDANQVDLRQDDGNH-YTEIAQDGDSNTVFAGQDNGQAALLLAQRGSGNSAELSQFESGDSAAAISQSGS GNRLNLVQDGSDNQARLAQSGDNNIMTATQL >seq_239 AQIVQ-------------DGDDNTASATQTGDSNDSETLQVGDFN-TAATVQLGNDNISIIDQNGDSNTATTTQTGDDNGSFVGQ-FGVDNTSTVTQTGG GNTSTVTQTGDSNLSIVGQLGSMNISTVTQM >seq_240 TASITQTGNNNKSGVFRDGGSENTFTSVQTGNNNRILANGGYILP-----PQYGVGNMIDITQTGNSNLTTLRQLGNGNDLMVSQADGNQTAILV-QSGN TNMLSVTQSGLGNMATVSQLGSGNSASVVQT >seq_242 AQVTQWAGFS-------------SFTFTQEGSGNELNARQA-TRDGIIRGNTVGNDNRVNIDQSYDSPVLDIAQNGSANEIDVVQ-HTPYSTVSIAQAGD GNVALLNQTTQFEFTAIIQNGTGNSASITQR >seq_243 SDVNQ-------------IGNDNDSRVIQFGDRNDAIVNQNGDDN-KSVVRQFQDDNVATVDQNGNDNDSNILQVGNFNNADVDQ-DGNENEARVDQLGD MNTSQTIQTGDDNDSDIDQQGDNNRAGIFQS >seq_244 QITQANPALGNDAFVYMTNTYKNDTVILQTGNENTATVSATYASRSDFDITQTGNQNTATITTAINRNDLDIVQQGENNTSTVYVSDGDFNDADINVRGN GNVTVANGEGADSNHI--------------K >seq_245 NIFQLSPAVGNDADVLIRNSDDNDVDIYQMGRHNEAVVRARNGSDNDLMIEQDGRHNRAVLKAGSDDNDFSIEQDGRNNLGKVIAYGSDDNNGLIDQRGR SNKANIDNADNNANNKIVQRGRRNEG----D >seq_246 NIFQMSPAVGNDADVLIRNSDDNDVDIYQMGRHNEAVVRVRNGSDNDLMIDQDGRHNRAVLKAGSDDNDFAIEQDGRLNLGKVIADDSDNNNGLIDQRGR KNEAYIDGASDNSNNKIVQRGRRNEG----E >seq_247 TITQVSSSQGNSALIDSSFSSGNTLSVYQNGGDNAATIRA-F-VGSTLAINQTGDTNTGSITAGAKNANFSINQGGDLNDASI---------VSVVQNGT ENTTSVKRGTDDSNVSLAASGSGNQANVDS- >seq_248 GLLER-------------QALANHAEIAQLGSYN------------LTSVIQSGEQNYAYLVQSGFENTLSLEQQGFNNSVTAEQ-SGRGNSAIILQLGN SNLIQLQQLGNDNAITIQQSGSAAEMSITQF >seq_249 DVSTQ-------------VGDDNTATINQTGWLNFNVLDQFGDDN-VATVDQDGWFNENEALSQGNNNSIDVDQVGTWNFNTTEQYDGNNNDTYAMQDGT GNYADQWQEGNGNDASSEQDGNGNEVYQDQD >seq_250 SSLNQ------------------AAIIGQQGTRNNALVRQEGATL-QATIVQNGIANQAAIDQQGEANVAAVMQTGAANQATISQ-EGYGNLASVTQQGV GNRASIIQAGTQKTAVVVQRQSMMAVRIIQR >seq_251 SYVFQ-------------LGAGDIAFVDQIGESNFSQINQNGA----------GGGNDANVNQDGVFSESGIEQTGTLNDANVFQQATSDTSISIFQDGT SNTATVTQAGTGDFSTVSQTGTSNTATVNQG >seq_252 SLVIQ-------------ADDASTVFVEQLNSNNISQVEQALG----------GGGNFASVFQRGLGGFSGVEQTGNGNFAELTQGDLSEDSVSIVQNGD TNIADVDQTGVGDFSSVSQTGIGNTATVNQG >seq_253 STVTQ-------------NGSDSTVLVNQLGSGNESTVTQAGSTQNDTEVNQDGSGNISNVDQNGDKSLVAVDQIGTGNDSDVGQGGDGPNDIDVVQEGT TNRSTVSQAENGSSVDVDQFGTSNVSTIVQG >seq_254 ALILQSGVGSNVAHAITQDGDDNTLRIEQIGRQNFVGAGSVDAVD----------NDYAGFLQQGDDNTATIRQDGDRNRVFQLKQIGNDNGATVSQTGS FNGVTISQSSDGNSVSVTQTGSQNLVGRSQY >seq_255 NILAIQWGDNNSATLKQSNLTGSTIFVEQASSGDTATVTQRASDYATAEVTQGGTGNVATLLQAASGSGIYLGQVGNVNKANLTQNNAPASTMTVAQVGD YNQITATQAANLSNIQSFQTGNSNSATIRQR >seq_256 AEITQ--IGDNWAHIGQAGSYMNEAYIMQDGYYNEGEVNQRGEFN-LADIDQKGELNYAGVGQNGDGSFADITQYGYLNSVTVDQLGGELNYVLVEQFGE GQIATVSQDGDRNGTFVEQRGYDNLANVMQV >seq_257 AFIIQ---------GDLGGLGGNEASIRQTGADNLSAIGQNSATN-FAAVNQSGDGNTAFVDQGGSSNNAFIVQPGAGNSATINQ--------NLNTTGG GNDADVFQFSDDNVAFIEQEDLTNVAFVVQG >seq_258 GIVNQ-------------NGQTNDSKIDQLGDSNKSEVYQYNAVNNKADVKQDGNSNGAFISQSNHDNQAYQTQKGNSNSATIWQSTGGFDKAWQTQTGN NNTATVDQGTTGNEATQTQKGDNNVAYASQG >seq_259 AIGEQWGGGGNHAEIKQGDSMDATGAFAQSGATNEAYATQFLSSD-ESYQEQLGTDNKAVVNQSGGDNLVEQFQDGDRNEARSTQ-NGNNNEVNQEQYGV GNMATSIQDGADHHSVINQRANGNEAMVDQT >seq_260 AIGEQWGGGGNHAEIQQGDSTEATGAFAQSGATNEAYATQYLSSD-TSYQEQLGTDNKAVVEQSGGNNLVEQFQDGDRNEARSDQ-NGNGNSVNQEQYGA GNMATSIQDGSLNNSVINQQSHGNMSMVDQT >seq_261 AIGTQYGGAGNYAQADQGDADDATNAFAQTGELNEAYASQKGTDN-TLFQEQIGSGNKAVASQSGSSLYAEQYQDGTDNEARSKQ-NGSNNEAYQEQYGP GNVALSIQDGSLNHSMIEQKARGNSAIVDQT >seq_262 AIGTQYGGGDNYAEANQGDTVTSERAFSQSGGYNNAYSTQYGSDN-SSFQEQNGLLNTAEVHQTGGDNYAEQYQLGGLNEAYIEQ-DGQNHSANQEQYGS GNQALSIQRGMNNDSYINQMANGNIADVDQL >seq_263 IFVDQIGTGNDSDVGQGGDGPN-DIDVVQEGTTNRSTVSQFGFRN-VVDVDQFGTSNVSTIVQGSDDNQALVDQDGTGNESTIQQNNGDFQIADVTQDGT GNMSFVLQDEDLNEATVVQIGTTNFSDIDQN >seq_264 VTIEQYGQAGNKAKVIQHDSRRSQATIWQEGSAN------------TALITQTGNQNKAVIGQSGYGHEAWINQNGNFNVAAIIQ-TGAASSLSINQTGH NNQAYISQNGNDYLAIIG--GSGINVYVTQY >seq_266 SASITQTAAFNTASISQNNSAGSSASITQASADNQASVAQT-ASGSSTTVSQSGDGNYANASQTGADDVLSLSQAGGFNIINASQVSGSGNQGYVTQNGY FNNATLVQQGNANYASINQNGAYNTVSLKQH >seq_267 NTVDYVG-----------GGTNGKVSIQQNGAGNAFSGYSWGDQQ-YIAVKQSGDNNGVYIDGLGPQNKIFLTQNGTGNMITNDTSKSNDNHIQVGQIGS HNRALVGQKGNNNLADVIQTGDSNSATINQG >seq_268 NGIDYAG-----------IGTDGRVSIHQIGAGNTATANSNGDDQ-YIALKQRGDNNSISIAGLGPQNKVFFTQNGAGNIIESDPSTSNNNRLQVAQIGN NNNAQVGQKGNMNSADVSQTGDNNSATIMQG >seq_269 STVVQTGAENIANVNQMNNAERSVVAINQITNSSVATATQDGSADSSITINQNGSENLASVTQDSLNATASVSQTGSFNQVNTLQQHAINTSLSVMQNGE SNIASVNQEHNDSVATVNVTGDTNTTTVRQY >seq_270 DASPQ------------------RVSARQLGDDNAVQVNLTGHGN-RLELQQQGNRNSAGVLIGGEDSRLILTTQGNDNEISAVG-VGDNLELSVEQLGA SQYASISQAGAGNSLDLRQSGQGNRATIQQ- >seq_271 SIIVQQGNSN-------------QGRITQSSSNNNALIAQRGSGN-SADITQLSSNNNAVIAQLGNGNSDSIIQDSFGNSAYIIS-FGKNNITQITQTGT NRSAGVVQNASGMAIRVTQH----------- >seq_272 STYNQ------------------SALINQIGSDNRAFTHQQGTNN-HSIIVQQGNSNQGRITQSSSNNNALIAQRGSGNSADITQ-LSSNNNAVIAQLGN GNSDSIIQDSFGNSAYIISFGKNNITQITQT >seq_273 NYADQWQEGNGNDASSEQDGNGNEVYQDQDGNNNDAIAVQDGNLN-YTHQLQVGNGNYAIHEQDGNLNYADAHQTGNNNDAIGWQ-DGNGNTIWQYQTGN GNTADDWQVGDGNTADITQVGNGNMMTVTQN >seq_274 SYQEQ-------------LGTDNKAVVNQSGGDNLVEQFQDGDRN-EARSTQNGNNNEVNQEQYGMDNYSRVSQGHGGNMATSIQ-DGADHHSVINQRAN GNEAMVDQTGNGQRSLINQTSGQNTATVIQR >seq_275 SYQEQ-------------LGTDNKAVVEQSGGNNLVEQFQDGDRN-EARSDQNGNGNSVNQEQYGQDNFSSVKQRYGGNMATSIQ-DGSLNNSVINQQSH GNMSMVDQTGDGQTSLINQAGGYNNATVIQR >seq_276 LFQEQ-------------IGSGNKAVASQSGSSLYAEQYQDGTDN-EARSKQNGSNNEAYQEQYGDQNFSSIEQRGGGNVALSIQ-DGSLNHSMIEQKAR GNSAIVDQTGDGQMSVVNQGNGYNNATVIQR >seq_277 SFQEQ-------------NGLLNTAEVHQTGGDNYAEQYQLGGLN-EAYIEQDGQNHSANQEQYGIFNESVVIQNGGGNQALSIQ-RGMNNDSYINQMAN GNIADVDQLGMNHQSVINQTGGSNTAIVTQR >seq_278 AFVDQ-----GLGGGAGVDGSSNNAFIVQPGAGNSATINQN--------LNTTGGGNDADVFQFSDDNVAFIEQEDLTNVAFVVQGDAGTNTATISQNGS GNLVAIGQNGTLNEAVGNQQGDGNSLFIDQG >seq_279 DTVILQTGNENTATVSATYASRSDFDITQTGNQNTATITTTAINRNDLDIVQQGENNTSTVYVSGDFNDADINVRGNGNVTVAQNGEGADSNHI------ --KTKIVNGNSNKVTSVVNDGHANDVHVTVR >seq_280 DVDIYQMGRHNEAVVRARNGSDNDLMIEQDGRHNRAVLKAFGSDDNDFSIEQDGRNNLGKVIAYSDDNNGLIDQRGRSNKANINDNADNNANNKIVQRGR RNDISIMGGSNNGTLKIHQKGRDNDSTIHVY >seq_281 DVDIYQMGRHNEAVVRVRNGSDNDLMIDQDGRHNRAVLKALGSDDNDFAIEQDGRLNLGKVIADSDNNNGLIDQRGRKNEAYINDGASDNSNNKIVQRGR RNEISVMGGSNNGTLKIRQKGHDNDSKIHVY >seq_282 TLSVYQNGGDNAATIRA-F-VGSTLAINQTGDTNTGSITALGAKNANFSINQGGDLNDASI--------VSVVQNGTENTTSVEKRGTDDSNVSLAASGS GN-----------------QGTDNVADVLID >seq_283 VSVYQDGGSNEAAVNQQGGYYVGQVELRQLGSANRAQVTQWAGFS-SFTFTQEGSGNELNARQATRDGIIRGNTVGNDNRVNIDQ-SYDSPVLDIAQNGS ANEIDVVQHTPYSTVSIAQAGDGNVALLNQT >seq_284 GTITQDGGNRNTFALSQSGGKGNVIQLNQADTENNTQVTQTSGLFSVIYVGQNGYKNDNIIIQSTKDSSIWLSQFGQKHYAVLSQQETTNSSIAVSQYDL SQTADVTLYTADSTVLLRQSGEANHAVLGQW >seq_285 SLNLQ-------------------------------ALLERSGRDNLITFIQQGNINIGGVLQAGNDNSAYLIQQGNNNNAIVIQ-VGIENEVQLQQAGN SNSALITQWGDANLVQLNQTGSNNFISITQY >seq_286 PITLQ-------------------------------ALIERSGRDNLIDLVQQGTANQAIVIQSGSDNSAYATQAGNDNVSLVTQ-IGSNNEVQLLQVGT QNTASITQIGNDNLVQLNQLGSGNFISITQY >seq_287 PVTLQ-------------------------------ALLERSGRDNLIELVQQGNANQGLLFQSGSDNKADVTQVGNDNDALITQ-IGSNNEVQLLQVGS QNTATVTQIGNDNLVQLNQLGSGNFISITQY >seq_288 PITLQ-------------------------------ALLERSGRDNLIDLVQQGTANQSLLFQSGSNNDAYVTQVGMDNSAFVTQ-IGSDNEVQLLQVGT QNTASITQIGNNNLVQLNQLGSGNFITITQY >seq_289 SKVVQEGNGNDARVRHSSSSRPSTVDIAQRGDLNRADVNVY-GMGSQVTLAQTGNSNGASATVSGEGNQLDLASNGDGNGISAYY-LGSDGQLKVDQQGD NLGVTAYVTGNASSISVAQSGSQHTADLTQN >seq_290 AVVVQESPG---------NSAGNKAKIVQSGSSNTALIGQKGAAN-TAYIIQEGDNNAAAIGQLGRNGQALVAQKGDNNLAVIGQIFHPSSKLSINQEND NNIAFVA-GSGGANLGVSQNGGDNRIYIDQS >seq_291 SAIIQYSPG---------IKANNRAKIKQSGHSNSANINQVGRNN-IAYINQDGSENTAAIGQIGGNSEALISQDGNHNLAVIGQFSGQGSQLSINQKGN HNVAFMA-GSGGDNLGISQDGHDFRIFINQA >seq_292 LRLTQ-------------QGELNQARLLQQGWANELALSQVGDEN-WVNGAQLGEGNSAELYQAGINNRIWLLQQGSGNEAWISQ-QGANNQARAIQLGD GNQASIEQNGYGLAATVIQTGANQQVNVVQT >seq_293 SYIRQ--QWDTNQALAVQVGNDGISHIKQAGSSNLAVLADDGSFN-SSYILQTGDTNTALVGQSGMDNDSYVSQLNGDNNFAVAQ-LGTDGESDIFQNGS GNTALVGQFGEGDVSFIIQNGNGKTATVLQD >seq_294 SFIKQ--GRQDNYADAQQHGNDGLNMMRQRGNDNDAYLMQTGNDN-ESYVFQAGAGNSAVITQSGSDNDSYADQVGGGNEVTVTQ-SGNDALSTIYQRGD NNVADVSQSGGFDASNILQFGDGHEANVIQS >seq_295 ASLDQAGHNG--------------ALIWQAGDGNAIVARQTGAQN-WIAASQVGLGNTLNATQRGNGNTLQVQQNGVGNSVESTQ-VGTSLSARVTQNGI NNAVNIVQGGSNTGIQVIQTGNGARATVVTR >seq_297 STVDIAQRGDLNRADVNVYGMGSQVTLAQTGNSNGASATVSGEGN-QLDLASNGDGNGISAYYLGSDGQLKVDQQGDNLGVTAYV-TGNASSISVAQSGS QHTADLTQNTTGNAINLTQSGFSIHAVISQ- >seq_298 VSIDQREASQSHAYAYQGHGQRNTVEIVQRGLLDEALVRQGGDDQ-RLRIEQEGARNGLNLFQDGRDSAARLRQVGEDHLQDVLS-LGARNEVELLQGGA ANRAVVEQRGDDNRVQLIQIGNRNQAAVQQN >seq_299 ATVRQSGGTGNNAFAIQEGGNTNGNGITQNGNDNFSDVFQAGSRFVTAQTDQEGDRNDSIIIQSNGNVDARVDQLGDDNISSIVQTGVAGNGIDIIQSGS DNLSRVEQNNDASGILVEQNEDGNDSFISQS >seq_300 ATIDQGGGDLNTAVTVQLGGNSGRATTTQTGNRNNAETFQTTNRFLRSTINQTGDDNDAFVSQNDAGANVRIDQSGNDNAASIVQTGDRGGQADLDQNGN DNVSRVVQVNTPSTILVTQNDDGNDSFVDQS >seq_301 AITLQ-------------------------------TLLETNGRGNFINLVQSGVLNQAYLLQSGHDNSIELDQVGLDNQANITQ-HGDNNEVELLQVGV RNQADITQIGNDNLVQLNQLGSANFITITQY >seq_302 IDVTQ-------------LGASNSNITNQLGITNTATVNQDGSSN-IGRQNQIGMSNNAATDQNGFLNISAQNQLGFSNTATVDQ-DGSLNLTVQNQIGM GNTAASIQNGIANLTVQTQLGSNNGAYANQT >seq_303 IDVAQ-------------IGVSNSNMTNQLGITNSATVDQDGFLN-VSRQNQLGMSNNVVTDQYGSLNISTQNQLGFSNSASVDQ-DGSLNLATQTQIGM GNGAASIQNGVANLVIQTQLGNNNGAYANQT >seq_304 SSLTYQFGNANDLFSEQNKAHGSNILAIQWGDNNSATLKQSNLTGSTIFVEQASSGDTATVTQRSDYATAEVTQHGTGNVATLLQANASGSGIYLGQVGN VNKANLTQNAPASTMTVAQVGDYNQITATQA >seq_307 SIIAQIGNNNRTHVSQSASQAGNFSSIYQFGNYNAAMVTQTGGNN-VSNVSQIGRYHKADITQSGSTSRQL--------NSYVRQ-LGNRSDVQISQSGS GYR----------GISVEQQAFSNNARPVTV >seq_308 ----NGGVEGVFLHGVNEDDFNDLLN--VDDIDNYSEVHITNATDSLAVVVQESPGNS-------AGNKAKIVQSGSSNTALIGQ-KGAANTAYIIQEGD NNAAAIGQLGRNGQALVAQKGDNNLAVIGQA >seq_309 SIAVASGVEGDFLHNVDQDDYSDLYNISSGDLDNYSEIDITNSSDSSSAIIQYSPGIK-------ANNRAKIKQSGHSNSANINQ-VGRNNIAYINQDGS ENTAAIGQIGGNSEALISQDGNHNLAVIGQA >seq_310 TLLET-------------NGRGNFINLVQSGVLNQAYLLQSGHDN-SIELDQVGLDNQANITQHGDNNEVELLQVGVRNQADITQ-IGNDNLVQLNQLGS AN-FSIEQIADGAAITITQY----------- >seq_311 GFNNQFNSNQ--------LGYNNQIFTHQQGMFNGVTAFQS-GADIEASIYQSGFGNRVITSQVGSNLLTDVSQIGTQNLAIIHQ-TGSNNTIMIQQNGY GSAVGILQWGTMQNVSVTQSN---------- >seq_313 ASVFQ-------------DGKNNDADVDQ---RNRALVGQD-GKNNDADVDQ---RNRAFVDQDGKNNDADVKQ---RNKAFVDQ-DGKNNDADVDQRGK NNDASVDQR---NTADVDQDGKNNSADVDQE >seq_315 IAILSLGDPRAAARSVGSGGSLNVTTVRQVGDGNVASVTSVGDRT-RASLSQEGSGNRGALFLSGDGATMDVVQRGDDNATRLIG--GPGTTVTIRQFGS GN-AVSGVAPDGRTVSVRQVGDGLSAGIAQA >seq_316 TLIPE-------GAPDLVSTNGNSASIFQDGSGN------------DARIRQVGTGNEASASQLGDNNEIAIAQSGTGNRAVAVH-SGNRNQTAIAQLGT NNVAGIRLDGSDNTMNLLQTGNANRFLMDTS >seq_318 SLVEQ-------------TGEENDATVDQTGTGNVADVSQTGAPGNLVTVDQTGTENFAFVTQDSATNTARV-------AAIVVQ-DGVNSLADIDQVGA GGSATIGQNADFSFATIEQTGAANTATIVQT >seq_320 ALVGQ-------------DGKNNDADVDQ---RNRAFVDQD-GKNNDADVKQ---RNKAFVDQDGKNNDADVDQ---RNKAFVDQ-DGKNNDASVDQ--- RNTADVDQDGKNNSADVDQE---NTAVIDQS >seq_321 AVPVQ---------------EDRGAFISQIGSGNTAEADQNGDAN-QVDLRQDDGNHYTEIAQDGDSNTVFAGQDGNGQAALLLAQRGSGNSAELSQFES GDLAAISQSGSGNRLNLVQDGSDNQARLAQS >seq_322 TYATQTGGQNNQAGGSGVVGGANTSFTAQFGDNNMITASQKTADASGTGGFANGYGNASAALQRGADNIATIDQKGVFNASATVQIGEANSVTVVGQRGA LNNADINQKNGGNTQATFQTGYDNQVTVAQK >seq_323 ALAKQEGGKNNQGVGDMPVGGANNAETKQFGKENQAAVSQKSADGNADGEFQPGHSNASMSLQVGNRNEVATEQTGVRNTAATVQFGNDNLASTIGQSGA KNVAEVEQANGANTQALFQRGTKNAASIEQS >seq_324 ALAGQTGGKNNQAVGKG----TNGSFTSQFGSNNQSLTSQTTQDHTD-GVFSTGGQNSAATFQTGDANSAITSQDGVGNQALTVQAGSKNEAVTTGQFGS ANSAIVAQANAANTQSTLQYGSQNSAITSQQ >seq_325 GFLPSVSNSN-----------ANLATISQEGSSNQARLQQTGDLN-VATIMQTG--NGNIVRNADASLNSSALQQGSNNKLTVEQIAGSGNIANVAQIGS DNIANILQNGSANTVNLSEIGNGQKISVSQS >seq_326 ATVNQ-------------DGSSNIGRQNQIGMSNNAATDQNGFLN-ISAQNQLGFSNTATVDQDGSLNLTVQNQIGMGNTAASIQ-NGIANLTVQTQLGS NNGAYANQTGALNATIQTQSGDGQSAQSMQN >seq_327 ATVDQ-------------DGFLNVSRQNQLGMSNNVVTDQYGSLN-ISTQNQLGFSNSASVDQDGSLNLATQTQIGMGNGAASIQ-NGVANLVIQTQLGN NNGAYANQTGTLNAAIQTQSGDGQSAQSMQN >seq_328 GSVVQ-------------GGTDQFADIDQDSSGNFALVVQGSGALSA--------DNDALVNQDGDDNFSEIYQNGGNNLADVDQFGTDGNFSRITQNGS DHQAFVTQY-DGSSSFVTQSGTGSIATVTQG >seq_329 ASIFQ-------------DGSGNDARIRQVGTGNEASASQLGDNN-EIAIAQSGTGNRAVAVHSGNRNQTAIAQLGTNNVAGIRL-DGSDNTMNLLQTGN ANRFLMDTSVQDLDMNVLQNGNGNSLQTNVP >seq_330 TITQQQSAPEGGLGGN-------SATINQAGSANIATIVQSNSNANLATISQEGSSNQARLQQTGDLNVATIMQTGNGNNSSALQ-QGSNNKLTVEQIAS GNIANVAQIGSDNIANILQNGSANTVNLSEI >seq_331 PAFDY---------------SGNIVAVRQSGVGNNASLDQAGHN--GALIWQAGDGNAIVARQTGAQNWIAASQVGLGNTLNATQ-RGNGNTLQVQQNGV GNSVESTQVGTSLSARVTQNGINNAVNIVQG >seq_332 GAQQH------------------QVVITQYGILNKATVNQQNESTNEAYIVQNGSNNIAAILQYGSDNLINLSQQGNNNQAEVLQ-QGNANIANISQSG- EQAFKVQQIGNDLVVNVTFSQQ--------- >seq_333 GYLVAPSGQ-----------GQNSATVTQVGSNNSAVVDVQGSRN-TTLQTQTGVKNESSILLTGNQNRVRETQIGSGNSADITV-SGNNNNITNTQIGS NLGIGVNQIGNGKSVSITQIGVGR------- >seq_334 GYLVNRAQQGAVADGALPSGSHNTAIVNQTGNANSVTSDVRGSLN-VTLQSQLGSGNESGFTVNGNQNVLHNSQIGDNNSARLDV-TGNRNAYSSTQIGS NLSYGLSQVGNNGGAVIVQMGG--------- >seq_335 ATTMSYAHGGNSGNGGGELGVNVKLDVDVIDAGNMNDDFWSTQAS----IGQYGDENTASITQTSTSQLAGILQVGDQNKAGISQ-TGNNEYAAIGQYGD LNSAYINQTLQNAAAITIQWGNSNIASISQ- >seq_336 GTTAQFGGSDNAATGIDLAGTNNQLGIRQIGTENAVSLSMIAGLANNVGIEQFDFGNDASVSINSNGNQVGISQFGRTNDATVTINNGSDNTFTVQQTGG NNTATVSLTAGNSNLNIDQTGSRNEATVTLW >seq_337 ALIDS-------------SSSGNTLSVYQNGGDNAATIRAFDVDGSTLAINQTGDTNTGSITALAKNANFSINQGGDLNDASIRAYSYDNDTVSVVQNGT ENTTSVERGTDDSNVSLAASGSGNQANVVLD >seq_338 GMKDGGASGGNRAHGTGQQGEMNYAMIEQDGQDNRAKTRQQFDNN-QARTVQEGDDNVAGVFQNGPDGS-------DGNTAIAEQ-YGDENVTRIDQDGS RNKAHSVQEGDLNISNQYQTGDDNTALVDQG >seq_339 GMKDGGASGGNKAHGTGQQGEMNYAMIEQDGQTNSATTIQQYDNN-QARTIQEGDNNIAGVTQNGPDGS-------DGNTALAEQ-YGDSNATMIDQDGS RNTAHSNQLGDLNVANQIQTGDDNWALTDQG >seq_340 VVILQ----KNYG-----SDGKNKAKVSVSGAGNRVLAAQLGGRN-EAYIAQEGNNNTAIVGQVSSESVAVVIQEGDDNEAYIGQYRSASSNVSVNQIGN GNIAAVV-AGSGANLGISQVGNGSELIINAS >seq_341 SFIVQ-AGSTNHAIAGQTGGNNKQGTV-QVGRGNSALTAQSAAKPNESGVLQIGAQNGAVALQTGGNNKQGTVQGGVRNFAVTSQ-KGRQSDSTTAQFGA FNGSIVNQKDGNNKQTTLQVGGSNFAATSQD >seq_342 GYSTQ-------------NGNNNLVRARQAGDLNSMYATQDGDGN-LVSMRQTGNTNNGGFFISAIDNDAQANQVGDQNDARIEQ-QGDFNDAQVYQENT GNSADIAQNAENNYAGVDQRGNDNSARILQT >seq_343 ELSELNTSNDNNAIIELTNATNSQVTIIQHNVNKTGSN--------KAKVKQTGYDNTAAVMQLGSNNVGLIAQNGTNNSATLKQ-LGLNHEGAILQEGN DNIAHLIQTGNGKQNVVNQTGDSNIAAAVNK >seq_344 VTIIQ-------------HTGSNKAKVKQTGYDNTAAVMQLGSNN-VGLIAQNGTNNSATLKQLGLNHEGAILQEGNDNIAHLIQ-TGNGKQNVVNQTGD SNIAAAVNKNNGAGFSINQTGNQGIILVNGM >seq_345 MAVRQ-------------IGDDNSFTFLQNGTANAAAADFAGERN-RVALRQNGTGNLAVVSVSGNSNNADIVQAGTLNTVDVIV-TGDLNAFAFSQKGT GNQAVVTQAGTANVASFSQRGKGNGLVIQQ- >seq_346 AFVAQ--VGNNNGGLNFSSGSGNSQAIIQQGNGNGAAQFVAGNDN-ATGVYQDGGANFALNGARGDSNLVGVAQIGPENYAAAVA-VGGSNAVGIAQIGS GNRSVLRVRGDGNAVGTLQVGASHIANIRVR >seq_348 SDYAK------------------TVTI-QMGNQNEAYQIQNGIAN-TAFIYQEGILNYASQLQEGDHNEAYIIQSGSGNKAYQNQ-YNDYNTGIIIQIGH GNEAHQNQYGYAKGI-IIQNGNGHIITQN-- >seq_349 TAFAQ----------------NNTVTLTQIGTAQEASVSQAGSKL-TATVTQEGADQNAIISQKGFDNQATITQVASGNKATITQTNGGDNFATITQGST NNQATITIEGSNNRSRILQVGDNNSLTTSQY >seq_351 AVFAQ------------------DAFVAQVGNNNGGLNFSSGSGN-SQAIIQQGNGNGAAQFVAGNDNATGVYQDGGANFALNGA-RGDSNLVGVAQIGP ENYAAAVAVGGSNAVGIAQIGSGNRSVLRVR >seq_352 QAITI-------------DDETVFISVSQFGLNNITNIIQSGNGANLSNVVQNGSNNEAIITQLGEGNVVNLLQQNNNNYFEIIQ-DGFDNVANVNQLGE QS-FTVYQIGNEMVINITQYKE--------- >seq_353 FTAMQ-----------------------------DALPKGRAARGNVIMVTQIGQHNRLFSHSYGEGNTTVALQKGRGNTAELLQ-DGSGNRLDLVQQGN GNRADLQQIGDDMYLGLVQKGNGNRYTYRQT >seq_354 KTIVSVGGP--------------PVVLNQNSQLNMAGVFMVGGST-SATVVQNGTNNATGVLQFGGTNSASVGQAGMNNFAFVGQ-TGQSATSLVSQLGA MNTGTVAQFSTVNTSTIVQTGP--------- >seq_355 AVNRQ-------------SGDRNVTLIDQTGPSNVAALQAVGNDN-TGRIVQQGAGNVAQGSLNGNSIRFDLQQVGNANQADLTVSSGPGTSLAVQQVGN GNVAVVNQLGSGLSADVSQAGATKSITVMQV >seq_356 AMILQIGDDNSVSYVTSDDSTGNLYGFSQEGDGNGATGTVSGDEN-EMAVRQIGDDNSFTFLQNGTANAAAADFAGERNRVALRQ-NGTGNLAVVSVSGN SNNADIVQAGTLNTVDVIVTGDLNAFAFSQK >seq_358 VTLTQ-------------IGTAQEASVSQAGSKLTATVTQEGADQ-NAIISQKGSGNKATITQTGGDNFATITQTSTNNQATITI-EGSNNRSRILQVGD NNSLTTSQYGEGNLVEATQLGVGNSLSVVQS >seq_359 ETIVQ-------------FGNDQPVTIEENSRVNIARVIQIGGSTVDATIIQNGTRNYANVIQMGGTTNAAVGQSGLSNTADITQ-IGNSTNALLLQIGD MNSGAVRQFGRFNWLAIFQFGR--------- >seq_361 AAVPQ--------------GPGNVSVIGQRGTSNTASVEQQAILG--------RSANVASIGQAGSSNGVTLTQTGGGNNAAVGQ-VGDGNSTTLSQPGN AT-AAVAQVGTNLGVNINQSANSSIGVVQFG >seq_362 ATINQNLTSS--------GSVGNQATITQEGQSTVTGRKVPSDLN-QATISQEGDAHVASLMQGGLSNQATLTQTGTGNVVKGVD-SGSIISDTAQQLGD MNLLNVIQGGS-SVANVLQIGTANMATISQN >seq_363 TLIDQTGPSNVAALQA--VGNDNTGRIVQQGAGNVAQGSLNGNSI-RFDLQQVGNANQADLTVIGPGTSLAVQQVGNGNVATGTL--PAGRDVVVNQLGS GLSADVSQAGATKSITVMQVRTR-------- >seq_366 ELYDKSVPYSNYAEVVLENVFDSEVYISQTSAANKAKVVQRNVTGNTAVVSQSGSENLAVIEQSGQDNYAEIIQAGYNHNSYISQ-SGSHNIAYLRQCQ- --GFDCSYSDYGSDISIVQENNHNLAIVVDK >seq_367 DLLQREFEGNNFVELDITDSSKSEISITQINSPNKSKVIQRRVYSSSASIDQYGSNNIALIEQKGQYNNASIRQNGTGYKGIIIQ-DGSFNEAQLTQCD- --LSECAKENHGGEVSILQQNDHNVAIIIDN >seq_368 ITLMQ----------------ENVSIIEQRGNQNQASTFQS-----RSTSYQSA--NFSHIYQRGNNNHANITQNNGNNIGVIWQ-VGNNHNANISQQGN SLRADIYQLGFSGDVSISQSGSGQRGVSVQQ >seq_369 ASVSQSGANNNFNYRA--DGNSNQFAVRQTGGNTSLVASVVGDSN-QLLIQQFGSNQLVSLDVDGDSNRFSVNQANRGNRATGSI-AGDGNQAVVNQDGV SNTATFSQVGSGNTLGVRQ------------ >seq_370 SAIYS------------------ELYDKSVPYSNYAEVVLENVFDSEVYISQTSAGNKAKVVQRVTGNTAVVSQSGSENLAVIEQSDGQDNYAEIIQAGY NHNSYISQSGSHNIAYLRQSDYGSDISIVQE >seq_371 SLEYQ------------------DLLQREFEGNNFVELDITDSSKSEISITQINSGNKSKVIQRVYSSSASIDQYGSNNIALIEQKNGQYNNASIRQNGT GYKGIIIQDGSFNEAQLTQENHGGEVSILQQ >seq_372 SVINQ---------NAATNGVGNFANIDVAGVGNEVDADQGGQRYNTVDIDQNGGDGTVDAGISGRVNTITVTQSGSGNFVDGTT-SGIGQADGLYVEGD FNTVTVTQSSANNAADIEVVGSNNNTTITQQ >seq_386 SVINQ-------------SNWNNQATVSQNGKNDNSTIKQSGANSNTTNVTQKGVENKSDVDQSSNYSGVYVSQDGGVNHSEVQQWGGTDNYAAVQQSGS QNTSWVTQKGSYNTVSINQH----------- >seq_387 SIVQA-------------LAGAGTATIHQNGSQNTFSVGQ------RATISQMGGNHTASIDQSSANNQSYITQTGGGNTAITQQ-NSGQNIITMNQSGS LLNATVEQSSYNNNVKLTQSGSQNNATITET >seq_389 ATITQ-------------EGNAHEVSVVQQGSGNTSVINQSTAEGNRATVNQSGNGNIVTINQSSNDDYSA---TGSKNSVNTTQ-SGQGETI-ITQTNS GNTISVYQGPASPGKKKNQKGKQK------- >seq_390 ATIHQNGSASQNTFSVGQ-----RATISQMGGNHTASIDQSAYSNNQSYITQTGNGNTAITQQNSGQNIITMNQSGSLLNATVEQFESYNNNVKLTQSGS QNNATITETSDGNSVESSQSFVFNNLYVEQK >tr|H5V126|H5V126_ESCHE Major curlin subunit OX=1115512 OS=Escherichia hermannii NBRC 105704. GN= PE=4 SV=1 -----------------------ELKIYQDGAHHVVNSVQTGADGSLVDIKQQHYNNFANVQQSAKDSNIWLDQDGVKNTADIKQNGGFGSDTDVTQKGN YNAVNVNQNAVR---DVTQRGYNNFVNAAQ- >tr|H2J0L3|H2J0L3_RAHAC Uncharacterized protein OX=745277 OS=105701 / NCIMB 13365 / CIP 78.65). GN= PE=4 SV=1 ------TSPG-HGGGGSDHGHNSEIQIYQQGTANLANATQSGAQKSLTAISQDGYRNSAGTNQTGDGSLISVVQKGDYNGANVSQ-SANGSKVLVSQNGS GNYAQASQGANGSLTSITQVGTANLAYASQN >tr|I2BCF6|I2BCF6_SHIBC Curlin major subunit CsgA OX=630626 OS=105725 / CDC 9005-74) (Escherichia blattae). GN= PE=4 SV=1 ------DSSS-DRGR-PDHGHNSEIQIYQDGKSNYAGATQSSAKKSLTDISQQGDKNSATTIQTGDNSLIDVNQKGHFNNAYVNQ-SADSSKVLISQTGF ANSATAQQHTSGTLTSVTQVGFGNVAVTNQ- >tr|G8LIR0|G8LIR0_ENTCL Major curlin subunit OX=1045856 OS=Enterobacter cloacae EcWSU1. GN= PE=4 SV=1 -SINQ-GGWGNGHGHGHDSGPNSTLSIYQYGGGNSALALQTSARNSTLTINQSGGGNGADVGQGSDDSSISLTQNGFGNSATLDQWNGNHSVMNVSQYGG GNGAAVDQTASGSTVTVQQVGFGNHATAHQY >tr|Q7X237|Q7X237_CROSK Curlin-csgA protein OX=28141 OS=Cronobacter sakazakii (Enterobacter sakazakii). GN= PE=4 SV=1 -MINQ-GGWGHGHGHGGYGGPNSTLNIYQNGGGNSALALQTDARNSVLNISQTGGGNGADVGQGSDDSSINLTQNGFGNSATLDQWNSKDSVMNVSQYGG LNGALVDQTASNSTVNVTQIGFGNHATAHQY >tr|I6H081|I6H081_SHIFL Major curlin subunit OX=766154 OS=Shigella flexneri 1235-66. GN= PE=4 SV=1 -LVPQLNSN--HHHGGSNYGPDSSLSIYQYGSSNVANALQSDARKSDVTITQHGHGNGATVGQGADDSTISLKQTGFQNSADINQWNAKNADISVTQFGG RNGAVVNQTASDSNVLIQQVGYGNNATANQH >tr|K8CDA8|K8CDA8_9ENTR Major curlin subunit CsgA OX=1208591 OS=Cronobacter malonaticus 681. GN=BN131_3958 PE=4 SV=1 --FP-F----------HQSAENSTLKIYQQGVNNNTTALQSDAKNSTTEINQLGTANGADVGQGSDDSKILLNQNGFANNSTIDQWNGHDSGVTVNQNGV QNGALVNQTASGSQVYVTQTGYGNHASASQY >tr|J0W9T3|J0W9T3_9ENTR Major curlin subunit OX=1202448 OS=Enterobacter sp. Ag1. GN= PE=4 SV=1 -TYP-------------QQAQHSTLKIYQQGNGNNATALQSNAFYSKTEIKQLGTVNGAKVGQGSDSSDIKLLQDGYGNNATISQWNGKNAQIDVQQFGT NNGAVVNQTASSSLVSVTQFGNGNHATASQY >tr|K1J4R5|K1J4R5_9GAMM Uncharacterized protein OX=1073385 OS=Aeromonas veronii AMC35. GN=HMPREF1170_00015 PE=4 SV=1 ---------------------GNQALLWQLGSEQQLRLTQQGELN-QARLLQQGWAN-----QAGINNRIWLLQQGSGNEAWISQ-QGANNQARAIQLGD GNQASIEQNGYGLAATVIQTGANQQVNVVQT >tr|L0GSR5|L0GSR5_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_3879 PE=4 SV=1 -----------------PLSSSQAAYVQQMGQGNLADLRQSGQAL-NAQILQQGSDQEAFILQHGDNLLAIVEQVGQGNFADIRQ-TGSDNQASISQYGA YNDARIEQTGSGLRSTVTQFGIGQQINIVQG >tr|I4CYE8|I4CYE8_PSEST Curli fiber surface-exposed nucleator CsgB OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_19420 PE=4 SV=1 -----------------QLTDAKAAYIQQLGQGNLAELTQNGQTL-SAQMLQQGGDQEAFILQHGENLMAVIEQVGQGNYAEIRQ-TGSDNQASISQYGA YNDARIEQTGQGLRSAVTQYGVGQQINIVQG >tr|H7EWC9|H7EWC9_PSEST Curli fiber surface-exposed nucleator CsgB OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_11514 PE=4 SV=1 -----------------HLSDAKAAFVQQQGQGNLAELVQTGQAL-NAQVLQQGSDQEAFILQHGENLLAIIEQAGHGNFAEILQ-TGSDNQATISQYGA YNDARIEQAGSGQRSAVTQYGTGQQINIVQG >tr|K6CQ69|K6CQ69_PSEST Curli fiber surface-exposed nucleator CsgB OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_23597 PE=4 SV=1 -----------------ALAGSRQAILQQQGEGNLAELLQNGQTL-GAQILQQGTDQEAFILQHGHDLLAVIEQVGQGNFAEIRQ-AGSDNQATISQYGA YNDARIEQVGQGLRSAVTQFGVGQQINIVQG >tr|J3GV84|J3GV84_9PSED Curlin associated repeat-containing protein OX=1144334 OS=Pseudomonas sp. GM60. GN=PMI32_05286 PE=4 SV=1 -----------------PPPIGQHALIDQNGQANVALLQQNGQSL-LGTIVQSGSNQEAYILQQGSDLMAMITQQGSGNSASITQ-TGSQNRAQIAQNGN NNDASIDQAGTGLSSAVTQSGNGMSVSVKQY >tr|J2R4G3|J2R4G3_9PSED Curlin associated repeat-containing protein OX=1144329 OS=Pseudomonas sp. GM33. GN=PMI26_02418 PE=4 SV=1 -----------------PPPAGQHALIDQNGQANVAMLQQNGQSL-LGTIVQAGSNQEAYILQQGSDLMAMINQQGSGNDASITQ-TGSHNRARISQNGN DNSASIDQAGTGLQSAVNQSGNGMSVSVKQY >tr|L0GSR5|L0GSR5_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_3879 PE=4 SV=1 ------------------MGQGNLADLRQSGQALNAQILQQGSDQ-EAFILQHGDNLLAIVEQVGQGNFADIRQTGSDNQASISQ-YGAYNDARIEQTGS GLRSTVTQFGIGQQINIVQG----------- >tr|I4CYE8|I4CYE8_PSEST Curli fiber surface-exposed nucleator CsgB OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_19420 PE=4 SV=1 ------------------LGQGNLAELTQNGQTLSAQMLQQGGDQ-EAFILQHGENLMAVIEQVGQGNYAEIRQTGSDNQASISQ-YGAYNDARIEQTGQ GLRSAVTQYGVGQQINIVQG----------- >tr|H7EWC9|H7EWC9_PSEST Curli fiber surface-exposed nucleator CsgB OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_11514 PE=4 SV=1 ------------------QGQGNLAELVQTGQALNAQVLQQGSDQ-EAFILQHGENLLAIIEQAGHGNFAEILQTGSDNQATISQ-YGAYNDARIEQAGS GQRSAVTQYGTGQQINIVQG----------- >tr|K6CQ69|K6CQ69_PSEST Curli fiber surface-exposed nucleator CsgB OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_23597 PE=4 SV=1 ------------------QGEGNLAELLQNGQTLGAQILQQGTDQ-EAFILQHGHDLLAVIEQVGQGNFAEIRQAGSDNQATISQ-YGAYNDARIEQVGQ GLRSAVTQFGVGQQINIVQG----------- >tr|J3GV84|J3GV84_9PSED Curlin associated repeat-containing protein OX=1144334 OS=Pseudomonas sp. GM60. GN=PMI32_05286 PE=4 SV=1 ------------------NGQANVALLQQNGQSLLGTIVQSGSNQ-EAYILQQGSDLMAMITQQGSGNSASITQTGSQNRAQIAQ-NGNNNDASIDQAGT GLSSAVTQSGNGMSVSVKQY----------- >tr|J2YFM6|J2YFM6_9PSED Curlin associated repeat-containing protein OX=1144337 OS=Pseudomonas sp. GM78. GN=PMI35_00202 PE=4 SV=1 ------------------NGQANVALLQQNGQSLLGQIVQSGSNQ-EAYILQQGSDLMALINQQGSGNAASITQTGSHNRAQISQ-NGNNNDASIVQAGT GQQSAVTQAGNGMSVSVTQY----------- >tr|J2R4G3|J2R4G3_9PSED Curlin associated repeat-containing protein OX=1144329 OS=Pseudomonas sp. GM33. GN=PMI26_02418 PE=4 SV=1 ------------------NGQANVAMLQQNGQSLLGTIVQAGSNQ-EAYILQQGSDLMAMINQQGSGNDASITQTGSHNRARISQ-NGNDNSASIDQAGT GLQSAVNQSGNGMSVSVKQY----------- >tr|I0QV40|I0QV40_9ENTR Curlin minor subunit CsgB OX=932213 OS=Serratia sp. M24T3. GN= PE=4 SV=1 -----------------------LA----AGSNNRAFINQHGNEN-TAVISQRGSANNADINQYGNGNNASISQDSSGSSASIEQ-QGFGNTALIAQKGR RNVAQISQSGTDRSAAVVQNASGMAIKVTQ- >tr|K8AHR5|K8AHR5_9ENTR Minor curlin subunit CsgB, nucleation component of curlin monomers OX=1208656 OS=Cronobacter dublinensis 1210. GN=BN134_504 PE=4 SV=1 -----------------------AAVIGQEGSRNNARIGQEGTKL-QATIVQNGIANQAAIDQRGDANVASVTQTGAANQATISQ-EGYGNLASVTQQGV GNRASIIQAGTQKAAVVVQRQSMMAVRIIQ- >tr|J0MMU0|J0MMU0_9ENTR Curlin minor subunit CsgB OX=1202448 OS=Enterobacter sp. Ag1. GN= PE=4 SV=1 -----------------------TAIIGQSGSNNNAAVVQRGQKQ-LSEVTQSGAGNRANIEQSGSYNLAYISQEGNANEAGIKQ-DSFGNAALIIQRGS GNKANITQYGTQKSAVVVQNQSQMAIRIIQ- >tr|Q2CGY2|Q2CGY2_9RHOB Putative uncharacterized protein OX=314256 OS=Oceanicola granulosus HTCC2516. GN=OG2516_13429 PE=4 SV=1 -------------------SFANHSRIVQWGTHNEVGGGQYGFGN-RLTVEQDGWSNSSISTQDGRRNRAVVGQNGHRNVSGIAQ-FGGHNRARATQSGG CNASAIIQAGHGNRANTRQYGSGNVTVIVQ- >tr|K0VJ27|K0VJ27_9RHIZ Uncharacterized protein OX=1223565 OS=Rhizobium sp. Pop5. GN= PE=4 SV=1 -------------------TMANEVRIEQYGWQNSAGGAQEGYGN-RIRTYQNGGYNRIVGHQYGRHNLSAVGQEGNDNVAGIGQ-FGSNHTTILTQDGN GNIAAGVQVGHGCSANVSQGGNGNVAAFVQ- >tr|G4RCK8|G4RCK8_PELHB Putative uncharacterized protein OX=1082931 OS=Pelagibacterium halotolerans (strain JCM 15775 / CGMCC 1.7692 / B2). GN= PE=4 SV=1 -------------------AMANNIHINQFGWGHSAGGTQSGTGN-TIGIFQDGWWNSSTNHQSGHGNVSASGQTGWNNEAGVGQ-FGSNHTSVLTQDGN GNVAAGVQVGNGCTASVDQNGSGNVAAFVQ- >tr|Q1YNP8|Q1YNP8_MOBAS Putative uncharacterized protein OX=287752 OS=Manganese-oxidizing bacterium (strain SI85-9A1). GN=SI859A1_01780 PE=4 SV=1 -------------------ASANDVRFDQYGWSNSAGGSQQGYRN-RIRVHQDGRYNRSVGEQRGSHNLSVIGQEGRRNSAGIGQ-FGSDHTTILSQDGH GNIAAGVQVGRGCSADVAQGGSGNVAALVQ- >tr|Q0G483|Q0G483_9RHIZ Putative uncharacterized protein OX=314231 OS=Fulvimarina pelagi HTCC2506. GN=FP2506_14234 PE=4 SV=1 -------------------ASANDFRIEQFGWANSSGGSQHGYRN-RMRVHQDGRYNTSVGEQRGKRNLSVVGQNGRGNAAGIGQ-FGSNHTTILTQDGN GNIAAGVQVGHGCDANVAQRGRGNVAAIVQ- >tr|H0HU69|H0HU69_9RHIZ Curlin associated protein OX=1107882 OS=Mesorhizobium alhagi CCNWXJ12-2. GN=MAXJ12_18553 PE=4 SV=1 -------------------AMANSIWIEQHGWSHSVGGSQDGRRN-EIGIYQNGARNSAIAKQRGRGNVAAVGQEGRRNAAGIGQ-FGSSHNSIMVQDGN GNIAAGVQVGHGCDAATSQSGRGNVAAIVQ- >tr|I0QV40|I0QV40_9ENTR Curlin minor subunit CsgB OX=932213 OS=Serratia sp. M24T3. GN= PE=4 SV=1 ------------------AGSNNRAFINQHGNENTAVISQRGSAN-NADINQYGNGNNASISQDSSGSSASIEQQGFGNTALIAQ-KGRRNVAQISQSGT DRSAAVVQNASGMAIKVTQH----------- >tr|C9Y0V4|C9Y0V4_CROTZ Minor curlin subunit OX=693216 OS=Cronobacter turicensis (strain DSM 18703 / LMG 23827 / z3032). GN= PE=4 SV=1 ----------------GQQGTRNNALVRQEGATLQATIVQNGIAN-QAAIDQQGEANVAAVMQTGAANQATISQEGYGNLASVTQ-QGVGNRASIIQAGT QKTAVVVQRQSMMAVRIIQR----------- >tr|K8AHR5|K8AHR5_9ENTR Minor curlin subunit CsgB, nucleation component of curlin monomers OX=1208656 OS=Cronobacter dublinensis 1210. GN=BN134_504 PE=4 SV=1 ----------------GQEGSRNNARIGQEGTKLQATIVQNGIAN-QAAIDQRGDANVASVTQTGAANQATISQEGYGNLASVTQ-QGVGNRASIIQAGT QKAAVVVQRQSMMAVRIIQR----------- >tr|J0MMU0|J0MMU0_9ENTR Curlin minor subunit CsgB OX=1202448 OS=Enterobacter sp. Ag1. GN= PE=4 SV=1 ----------------GQSGSNNNAAVVQRGQKQLSEVTQSGAGN-RANIEQSGSYNLAYISQEGNANEAGIKQDSFGNAALIIQ-RGSGNKANITQYGT QKSAVVVQNQSQMAIRIIQR----------- >tr|A3W9G7|A3W9G7_9SPHN Curlin-associated protein OX=237727 OS=Erythrobacter sp. NAP1. GN=NAP1_02525 PE=4 SV=1 ------------------AGAMNTSTVTASSNNSDVFVDQIGFMN-TSTVTQNGDLNTADVNQDGDNGTSTMTQSGTSNMSILNQ-DGTFNISVITQNGT SNMATTNQGGTGDFSSTSQIGTGNVATVSQ- >tr|J3ISQ9|J3ISQ9_9PSED Curlin associated repeat-containing protein OX=1144337 OS=Pseudomonas sp. GM78. GN=PMI35_00203 PE=4 SV=1 ------------------SETYNELYFDQNGTDNILIADQRGTNN-LAQGSSTGTHNTTDINQDGTGNQAYTTQYGEDNMITIKQ-TDTMNVAYVTQGGT GNIASVDQSGMTQMANVQQFGTANSATVLQ- >tr|J2YS79|J2YS79_9PSED Curlin associated repeat-containing protein OX=1144329 OS=Pseudomonas sp. GM33. GN=PMI26_02417 PE=4 SV=1 ------------------DNLYNELYFDQNGTDNILISDQRGETN-LAMGSTMGTGNSTEFNQDGSSNRAYTQQSGSDNMITVKQ-ADTMNVAYVTQGGT GNIANIDQSGMAQTATVQQYGSTNQATVMQ- >tr|J2PIY4|J2PIY4_9PSED Curlin associated repeat-containing protein OX=1144326 OS=Pseudomonas sp. GM24. GN=PMI23_04806 PE=4 SV=1 ------------------AESYNELYFDQNGSDNILISDQRGTGN-LVQGSSNGTGNSAEFDQSGTGNQAYTAQYGSDNMITVKQ-ADTMNVAYVTQGGT GNIANVDQSGVTQTANIQQFGSANQATVLQ- >tr|J2NXQ3|J2NXQ3_9PSED Uncharacterized protein OX=1144325 OS=Pseudomonas sp. GM21. GN=PMI22_01591 PE=4 SV=1 ------------------AESYNELYFEQNGTDNILIADQRGTTN-LAQGSSEGIGNSTEFNQSGSDNQAFTTQYGSDNIITVKQ-TDNLNVAYVTQGGT GNIASVNQSGLTQTAIVQQNGSTNQATVFQ- >tr|Q1I9S2|Q1I9S2_PSEE4 Putative curlin OX=384676 OS=Pseudomonas entomophila (strain L48). GN= PE=4 SV=1 ------------------ADSNNELYFEQNGSDNLLIADQRGSGN-YAEGVAAGNGGTVTLDQSGSGNQSFTYQYGSGNQATVKQ-ADGVNVAYVTQGGN GNQAFVDQSGASQTATITQFGNTNMATVTQ- >tr|E4RBC2|E4RBC2_PSEPB Curlin-associated protein OX=931281 OS=Pseudomonas putida (strain BIRD-1). GN= PE=4 SV=1 ------------------DDVSNELYFEQNGTDNILIADQRGTDN-YAYGSSQGTGNTITLDQSGYSNQSYTNQYGSGNEATIKQ-TDSINVAYVTQGGQ SNKAFVDQSGVAQSATITQLGSANLATVTQ- >tr|B1J9K3|B1J9K3_PSEPW Curlin associated repeat protein OX=390235 OS=Pseudomonas putida (strain W619). GN= PE=4 SV=1 ------------------ADSNNELYFEQNGTDNILIADQRGIDN-YATGDTTGTGNTITLDQSGYANQSFTTQYGSGNSATIKQ-TDNANVAYVTQGGA GNMAFVDQSGANQAATITQMGNGNTATATQ- >tr|L0FL50|L0FL50_PSEPU Curlin-associated protein OX=1215088 OS=Pseudomonas putida HB3267. GN=B479_14245 PE=4 SV=1 ------------------ADMNNELYFEQNGTDNILIADQRGTDN-YAYGNSQGNGNVINLDQSGYSNQSYTNQYGSGNEANIKQ-VDSYNIAYVTQGGQ GNKAYVDQSGVTQTATISQLGSANVATVTQ- >tr|J3E050|J3E050_9PSED Curlin associated repeat-containing protein OX=1144340 OS=Pseudomonas sp. GM84. GN=PMI38_03957 PE=4 SV=1 ------------------ADNNNELYFEQNGSDNILVADQRGTEN-YAFGSTTGSGNTITLDQSGYANQSFTTQYGSGNTATIKQ-ADTANVAYVTQGGT GNQAIVDQAGANQSATISQMGNGNTATATQ- >tr|J3B1Q4|J3B1Q4_9PSED Curlin associated repeat-containing protein OX=1144334 OS=Pseudomonas sp. GM60. GN=PMI32_05287 PE=4 SV=1 ------------------AENYNELYFDQNGTDNMLIADQRGTTN-LVEGSSNGMNNRSEFDQSGTGNLASTSQWGNDNKITLKQ-TDSMNVAYVTQSGI GNIANVDQSGVTQMATIQQVGMDNQATVLQ- >tr|Q1NBA3|Q1NBA3_9SPHN Putative uncharacterized protein OX=314266 OS=Sphingomonas sp. SKA58. GN=SKA58_07108 PE=4 SV=1 ------------------SGNDNSSTVTQYGWSQAAYVDQVGDGN-TSSIDQGGSDQYASVAQNGDDGMSTITQRGQDQRAELTQ-GGLSNESFIDQSGS DHLAEVTQDGADNYSSVIQSGSGSSATVTQ- >tr|F2N2Q5|F2N2Q5_PSEU6 PPE repeat-containing protein OX=996285 OS=Pseudomonas stutzeri (strain DSM 4166 / CMT.9.A). GN= PE=4 SV=1 ------------------QGRGNELHFQQDGVGNELNAVQNGSDN-EIVGVSHGWHNSSDIEQTGGDNLATVRQEGTLNELMLTQ-SGYDHLAKVTQHGF ANDALVSQAGYGNAAYINQNGTNNSALVTQ- >tr|L0WHL8|L0WHL8_9GAMM Curlin-associated protein OX=1177179 OS=Alcanivorax hongdengensis A-11-3. GN=A11A3_02197 PE=4 SV=1 ------------------QGQSHQASIQQSGGHNQAELLQYGTDQ-QADITQLGWENLAVVNQYGLGNQADIYQRGAYNSATVLQ-NGSYNTASVNQIGT ANTAVISQNGFSNNVSVTQVGAGMYVSIQQW >tr|J3ISQ9|J3ISQ9_9PSED Curlin associated repeat-containing protein OX=1144337 OS=Pseudomonas sp. GM78. GN=PMI35_00203 PE=4 SV=1 ------SGGTNYAYGDQRYGDGGNLTINQYGDGNGSEVWQDTQVASRTTIDQNGQTNETIVDQSGDHNVAQVFQVGDLNAIYADQFESMGSTVTLNQQGI GNIHFTYQTGTDHVLNATSSGNDNKVYA--- >tr|J2YS79|J2YS79_9PSED Curlin associated repeat-containing protein OX=1144329 OS=Pseudomonas sp. GM33. GN=PMI26_02417 PE=4 SV=1 ------NGDGNYAYGDQRNGQGGNLSINQHGNGNGTEVWQETQVASQTTINQYGNTNETIVDQSGDNNVASVLQVGNLNALYADQYESTNATLALYQQGN SNVHYTYQNGTDQVLNVTTTGDGNKVYA--- >tr|J2NXQ3|J2NXQ3_9PSED Uncharacterized protein OX=1144325 OS=Pseudomonas sp. GM21. GN=PMI22_01591 PE=4 SV=1 ------SGNDNYAYGDQRNGEGGTVRITQEGDFNGTEVWQETQVSSHATINQSGQTNETVVDQSGQENVATVLQVGDLNAIYADQFESIGSTVALYQNGT SNVHFTYQSGDSHVLNANSVGTGNKVYA--- >tr|J3E050|J3E050_9PSED Curlin associated repeat-containing protein OX=1144340 OS=Pseudomonas sp. GM84. GN=PMI38_03957 PE=4 SV=1 ------QGKENWAYGDQRDGIGGMLTIDQQGTGNSAEVWQDNQTGSHASVSQTGQLNEAYIDQSGQGNTATLYQQGNTNASWSDQFETNNSKTSISQSGT GNLHYTYQTGDNQSLTVTTQGTGNKVMA--- >tr|J3B1Q4|J3B1Q4_9PSED Curlin associated repeat-containing protein OX=1144334 OS=Pseudomonas sp. GM60. GN=PMI32_05287 PE=4 SV=1 ------TGANNYAYGDQRNGDGGNLTINQNGDGNGTEIWQDGQAGSTATVDQMGETNETVVDQSGENNTAQVFQVGNLNAIYADQFESTGSTLALYQTGT GNVHFTYQSGEGHSLTATSVGSDNKVYA--- >tr|Q0HRH2|Q0HRH2_SHESR Curlin associated repeat protein OX=60481 OS=Shewanella sp. (strain MR-7). GN= PE=4 SV=1 -------GNNNFA-TFQVDGNRNDGVIKQTGDNNQAGLLSNSGDNNDVSVEQIGNNNLGGAIGLGNNNNVDVYQWGNDNQSIVYSLTGSGNDVDVDQIGQ QNAALSMASGDDNNVTVFQDGNNNNVGA--- >tr|A8AI39|A8AI39_CITK8 Uncharacterized protein OX=290338 OS=Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696). GN= PE=4 SV=1 ------------AAIIGQKGTHNSAKTRQDGSKLLSVISQEGGNN-RAKIDQSGAYNLAYIDQAGYANDASISQGSYGNTAMIIQ-KGSGNKADITQYGT QKTAVVVQRQSRMAIRVTQR----------- >tr|G5RH29|G5RH29_SALET Minor curlin subunit CsgB, nucleation component of curlin monomers OX=913083 OS=Salmonella enterica subsp. enterica serovar Uganda str. R8-3404. GN=LTSEUGA_2943 PE=4 SV=1 ------------AAIIGQVGTDNSARVRQEGSKLLSVISQEGGNN-RAKVDQAGNYNFAYIEQTGNANDASISQSAYGNSAAIIQ-KGSGNKANITQYGT QKTAVVVQKQSHMAIRVTQR----------- >tr|D4BB97|D4BB97_9ENTR Minor curlin subunit CsgB OX=500640 OS=Citrobacter youngae ATCC 29220. GN=CIT292_07745 PE=4 SV=1 ------------AAIIGQVGTANSANTRQGGSKLLSVISQEGSGN-RAKTDQTGSYNFAYIDQVGSSNDASIKQGSYGNTAVIIQ-KGSGNKANITQYGT QKTAVVVQRQSQMAIRVTQR----------- >tr|B7LT94|B7LT94_ESCF3 Curlin nucleator protein, minor subunit in curli complex OX=585054 OS=Escherichia fergusonii (strain ATCC 35469 / DSM 13698 / CDC 0568-73). GN= PE=4 SV=1 ------------AAIIGQYGTDHSAQIRQGGSKLLSVISQEGGSN-RAKIDQSGDYNLAYIDQSGSANDASISQGSYGNTAIILQ-KGSGNKANITQYGT QKTAIVVQRQSNMAIRVTQR----------- >tr|G2S881|G2S881_ENTAL Curlin associated repeat-containing protein OX=640513 OS=Enterobacter asburiae (strain LF7a). GN= PE=4 SV=1 ------------AAIIGQKGAYNDAQVRQDGSKLLSIVTQDGVGN-RARVDQSGTYNYAYIAQSGYANDADISQDGYGNTAKIIQ-QGSGNRASITQYGT QKTAVVVQKQSQMAIRVIQR----------- >tr|Q7X244|Q7X244_9ENTR Nucleation component of curlin monomers OX=213763 OS=Citrobacter sp. Fec2. GN= PE=4 SV=1 ------------AAIIGQVGTNNSAKMRQEGSKLLSVVSQEGGSN-RAKVDQSGAYNFAYIAQSGHSNDASISQSNYGNTAMIIQ-KGSGNKANITQYGT QKTAVVVQRQSQMAIRVIQR----------- >tr|K4YIW5|K4YIW5_9ENTR CsgB Protein OX=1216470 OS=Enterobacter sp. SST3. GN=B498_1543 PE=4 SV=1 ------------AAIIGQQGVFNDAQVRQEGSKLLSIVSQDGAGN-RARVDQSGSYNIAWIDQSGTANDAGITQEGYGNSAKIIQ-KGSGNRANITQYGT QKTAVVVQKQSQMAINVIQH----------- >tr|I6R8U6|I6R8U6_ENTCL Curlin minor subunit CsgB OX=1104326 OS=Enterobacter cloacae subsp. dissolvens SDM. GN= PE=4 SV=1 ------------AAIIGQQGVLNDAQVRQDGSKLLSIVSQDGSGN-RARVAQSGTYNFAYIAQSGFANDADITQDGFGNSAKIIQ-KGSGNRASITQYGT QKNAVVVQKQSQMAIRVIQR----------- >tr|H5V125|H5V125_ESCHE Minor curlin subunit OX=1115512 OS=Escherichia hermannii NBRC 105704. GN= PE=4 SV=1 ------------AAIIGQYGSNNRAITRQDGHKLQTVVIQNGNGN-RANVDQSGNYNLAYVSQDGFSNDADIQQNGFGNSAVVIQ-KGSNNRANVTQYGT QKTAVVVQKNSFMNIRVTQR----------- >tr|G9Z6W9|G9Z6W9_9ENTR Minor curlin subunit CsgB OX=1002368 OS=Yokenella regensburgei ATCC 43003. GN=HMPREF0880_02975 PE=4 SV=1 ------------AAIIGQQGGYNNANIQQGGSKVLSVITQDGVGN-RANIDQSGTYNFAYIAQAGSANDADIQQGGYGNTAAIIQ-QGSGNKASITQYGT QKTAVVVQKQSQMAVRVIQR----------- >tr|F5RT61|F5RT61_9ENTR Minor curlin subunit CsgB OX=888063 OS=Enterobacter hormaechei ATCC 49162. GN= PE=4 SV=1 ------------AAIIGQQGSGNNADVRQGGSKLLSVISQEGGNN-RANVDQSGTYNLAYIDQTGNGNDASIKQGAFGNTAMIIQ-KGSGNRANITQYGT QKTAVVVQRQSQMAIRVIQR----------- >tr|F5MEV9|F5MEV9_SHIBO Curlin associated repeat family protein OX=766141 OS=Shigella boydii 5216-82. GN=SB521682_2381 PE=4 SV=1 ------------AAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSN-RAKIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQ-KGSGNKANITQYGT QKTAIVVQRQSQMAIRVTQR----------- >tr|D2ZAV4|D2ZAV4_9ENTR Minor curlin subunit CsgB OX=500639 OS=Enterobacter cancerogenus ATCC 35316. GN=ENTCAN_05594 PE=4 SV=1 ------------AAIIGQQGVNNDAQVRQGGSKLLSVVSQEGTGN-RARVDQSGTYNFAYIAQSGSSNDASITQGAFGNTAMIIQ-KGSGNKANITQYGT QKTAVVVQRQSQMAIRVIQR----------- >tr|Q2CGY2|Q2CGY2_9RHOB Putative uncharacterized protein OX=314256 OS=Oceanicola granulosus HTCC2516. GN=OG2516_13429 PE=4 SV=1 -RIVQWGTHNEVGGGQ--YGFGNRLTVEQDGWSNSSISTQDGRRN-RAVVGQNGHRNSANTQQFGSCNVSGIAQFGGHNRARATQ-SGGCNASAIIQAGH GNRANTRQYGSGNVTVIVQ------------ >tr|K0Q5B3|K0Q5B3_9RHIZ Putative curlin nucleator protein, minor subunit in curli complex OX=1211777 OS=Rhizobium mesoamericanum STM3625. GN=BN77_p11016 PE=4 SV=1 -RIEQYGWLNSAGGAQ--EGYGNRIRTYQNGGYNRIVGHQYGRHN-LSAVGQEGNDNYGATYQSGDRNIAGIGQFGSNHTTILTQ-DGNGNIAAGVQVGH GCSANVSQGGNGNVAAFIQA----------- >tr|G4RCK8|G4RCK8_PELHB Putative uncharacterized protein OX=1082931 OS=Pelagibacterium halotolerans (strain JCM 15775 / CGMCC 1.7692 / B2). GN= PE=4 SV=1 -HINQFGWGHSAGGTQ--SGTGNTIGIFQDGWWNSSTNHQSGHGN-VSASGQTGWNNEAETWQNGNFNEAGVGQFGSNHTSVLTQ-DGNGNVAAGVQVGN GCTASVDQNGSGNVAAFVQV----------- >tr|Q1YNP8|Q1YNP8_MOBAS Putative uncharacterized protein OX=287752 OS=Manganese-oxidizing bacterium (strain SI85-9A1). GN=SI859A1_01780 PE=4 SV=1 -RFDQYGWSNSAGGSQ--QGYRNRIRVHQDGRYNRSVGEQRGSHN-LSVIGQEGRRHYGATYQNGSRNSAGIGQFGSDHTTILSQ-DGHGNIAAGVQVGR GCSADVAQGGSGNVAALVQA----------- >tr|Q0G483|Q0G483_9RHIZ Putative uncharacterized protein OX=314231 OS=Fulvimarina pelagi HTCC2506. GN=FP2506_14234 PE=4 SV=1 -RIEQFGWANSSGGSQ--HGYRNRMRVHQDGRYNTSVGEQRGKRN-LSVVGQNGRGNFGATYQTGKRNAAGIGQFGSNHTTILTQ-DGNGNIAAGVQVGH GCDANVAQRGRGNVAAIVQA----------- >tr|H0HU69|H0HU69_9RHIZ Curlin associated protein OX=1107882 OS=Mesorhizobium alhagi CCNWXJ12-2. GN=MAXJ12_18553 PE=4 SV=1 -WIEQHGWSHSVGGSQ--DGRRNEIGIYQNGARNSAIAKQRGRGN-VAAVGQEGRRNRGQAEQRGRNNAAGIGQFGSSHNSIMVQ-DGNGNIAAGVQVGH GCDAATSQSGRGNVAAIVQT----------- >tr|K6DKI9|K6DKI9_PSEST Curli fiber surface-exposed nucleator CsgB OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_05222 PE=4 SV=1 ---------------------ASLAIVQQNGQDMSGRIGQTGAEL-EAYIIQSGYANDAAIEQIGLGNAALISQNGFGNDARIDQ-FGSDNRAAIAQQGV SNSALIEQTGSGHTSSVSQSGQGLTVVVRQY >tr|F8H7C0|F8H7C0_PSEUT Curli fiber surface-exposed nucleator CsgB, putative OX=96563 OS=5965 / LMG 11199 / NCIMB 11358 / Stanier 221). GN= PE=4 SV=1 ---------------------GALAVVQQAGQGMTGRIAQSGAEL-EAYIFQNGYANSASIEQIGQGNAALISQDGFGNEAQIEQ-TGADNRAAIAQQGS SNRALIEQTGSGHSSNVSQSGRGLTVVVRQY >tr|L0GJ52|L0GJ52_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_1442 PE=4 SV=1 ---------------------GAMAVVQQSGQDMAGRIVQRGAEL-EAYIIQSGYANSASIEQIGLGNAALISQYGFDNDAHIEQ-FGADNRAAIAQQGS SNRALIEQTGSGHSSNVSQSGRGLTVVVRQY >tr|I4CVW6|I4CVW6_PSEST Curli fiber surface-exposed nucleator CsgB OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_14970 PE=4 SV=1 ---------------------GAVALVQQSGESMTGRIAQTGSEL-EAYIIQSGYANGASIEQIGQGNAALISQNGFDNDAHIEQ-TGSDNRAAIAQQGS SNRALIEQTGSGHSSNVSQSGRGLTVVVRQY >tr|H7EQV4|H7EQV4_PSEST Curli fiber surface-exposed nucleator CsgB OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_01667 PE=4 SV=1 ---------------------GALAVVQQSGQDMAGRIVQTGAEL-EAYIIQSGYANSASIEQIGQGNTALISQNGFGNDALIEQ-NGSDNRAAIAQQGA SNRALIEQTGSGHNSNVSQSGQGLTVVVRQY >tr|K5ZA45|K5ZA45_9PSED Curli fiber surface-exposed nucleator CsgB OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_06872 PE=4 SV=1 ---------------------TALASIQQSGSALVGNIAQSGSEL-EAYILQAGYANTADINQIGTDNAALIMQSGDYNQARIDQ-DGSGNVAAIAQQGA SNSALIEQTGSGLSSSVNQMGQGLTVVVRQY >tr|A8AI39|A8AI39_CITK8 Uncharacterized protein OX=290338 OS=Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696). GN= PE=4 SV=1 ----------------------QAAIIGQKGTHNSAKTRQDGSKL-LSVISQEGGNNRAKIDQSGAYNLAYIDQAGYANDASISQ-GSYGNTAMIIQKGS GNKADITQYGTQKTAVVVQRQSRMAIRVTQ- >tr|B7LT94|B7LT94_ESCF3 Curlin nucleator protein, minor subunit in curli complex OX=585054 OS=Escherichia fergusonii (strain ATCC 35469 / DSM 13698 / CDC 0568-73). GN= PE=4 SV=1 ----------------------QAAIIGQYGTDHSAQIRQGGSKL-LSVISQEGGSNRAKIDQSGDYNLAYIDQSGSANDASISQ-GSYGNTAIILQKGS GNKANITQYGTQKTAIVVQRQSNMAIRVTQ- >tr|G2S881|G2S881_ENTAL Curlin associated repeat-containing protein OX=640513 OS=Enterobacter asburiae (strain LF7a). GN= PE=4 SV=1 ----------------------QAAIIGQKGAYNDAQVRQDGSKL-LSIVTQDGVGNRARVDQSGTYNYAYIAQSGYANDADISQ-DGYGNTAKIIQQGS GNRASITQYGTQKTAVVVQKQSQMAIRVIQ- >tr|Q7X244|Q7X244_9ENTR Nucleation component of curlin monomers OX=213763 OS=Citrobacter sp. Fec2. GN= PE=4 SV=1 ----------------------QAAIIGQVGTNNSAKMRQEGSKL-LSVVSQEGGSNRAKVDQSGAYNFAYIAQSGHSNDASISQ-SNYGNTAMIIQKGS GNKANITQYGTQKTAVVVQRQSQMAIRVIQ- >tr|K4YIW5|K4YIW5_9ENTR CsgB Protein OX=1216470 OS=Enterobacter sp. SST3. GN=B498_1543 PE=4 SV=1 ----------------------QAAIIGQQGVFNDAQVRQEGSKL-LSIVSQDGAGNRARVDQSGSYNIAWIDQSGTANDAGITQ-EGYGNSAKIIQKGS GNRANITQYGTQKTAVVVQKQSQMAINVIQ- >tr|I6R8U6|I6R8U6_ENTCL Curlin minor subunit CsgB OX=1104326 OS=Enterobacter cloacae subsp. dissolvens SDM. GN= PE=4 SV=1 ----------------------QAAIIGQQGVLNDAQVRQDGSKL-LSIVSQDGSGNRARVAQSGTYNFAYIAQSGFANDADITQ-DGFGNSAKIIQKGS GNRASITQYGTQKNAVVVQKQSQMAIRVIQ- >tr|I4ZL05|I4ZL05_ENTCL Curlin minor subunit CsgB OX=1177927 OS=Enterobacter cloacae subsp. cloacae GS1. GN= PE=4 SV=1 ----------------------QAAIIGQQGSGNNSDVRQDGSKL-LSVISQEGGNNRANVDQSGTYNLAYIDQTGNGNDASIKQ-GAFGNTAMIIQKGS GNRANITQYGTQKTAVVVQRQSQMAIRVIQ- >tr|H5V125|H5V125_ESCHE Minor curlin subunit OX=1115512 OS=Escherichia hermannii NBRC 105704. GN= PE=4 SV=1 ----------------------QAAIIGQYGSNNRAITRQDGHKL-QTVVIQNGNGNRANVDQSGNYNLAYVSQDGFSNDADIQQ-NGFGNSAVVIQKGS NNRANVTQYGTQKTAVVVQKNSFMNIRVTQ- >tr|G9Z6W9|G9Z6W9_9ENTR Minor curlin subunit CsgB OX=1002368 OS=Yokenella regensburgei ATCC 43003. GN=HMPREF0880_02975 PE=4 SV=1 ----------------------QAAIIGQQGGYNNANIQQGGSKV-LSVITQDGVGNRANIDQSGTYNFAYIAQAGSANDADIQQ-GGYGNTAAIIQQGS GNKASITQYGTQKTAVVVQKQSQMAVRVIQ- >tr|D2ZAV4|D2ZAV4_9ENTR Minor curlin subunit CsgB OX=500639 OS=Enterobacter cancerogenus ATCC 35316. GN=ENTCAN_05594 PE=4 SV=1 ----------------------QAAIIGQQGVNNDAQVRQGGSKL-LSVVSQEGTGNRARVDQSGTYNFAYIAQSGSSNDASITQ-GAFGNTAMIIQKGS GNKANITQYGTQKTAVVVQRQSQMAIRVIQ- >tr|I1DYJ0|I1DYJ0_9GAMM Minor curlin subunit CsgB, nucleation component of curlin monomers OX=562729 OS=Rheinheimera nanhaiensis E407-8. GN=RNAN_2108 PE=4 SV=1 -----------------RHALANGVDITQLGDLNVTDVTQSGQGH-YAVLLQQGVLNQLSLTQSGFANQTQVSQFGNGNSAHIIQ-NGSNNLIQLQQWGN RQ-FTIEQSGVGAEISIIQY----------- >tr|Q11JV4|Q11JV4_MESSB Curlin associated OX=266779 OS=Mesorhizobium sp. (strain BNC1). GN= PE=4 SV=1 --------------------SADSIHIEQYGWANAAGGDQHGSRN-RIGIYQNGKRNGAVVRQQGRNNTAAIGQQGRRNAADIWQ-RGKGNAAGVGQFGR DHNAVVTQDGNGCDGEVSQSGSGNVAAFVQ- >tr|Q28WJ4|Q28WJ4_JANSC Uncharacterized protein OX=290400 OS=Jannaschia sp. (strain CCS1). GN= PE=4 SV=1 -TANQFGWDNSA--GVQQFGNCNNTAIGQNGWNNTAAGISNGFAN-TVVVGQDGAWNTGVVGQNGSFNAGAVQQSGALNYGEVVQ-NGNFQTGAVIQSGV GNTGVLNQTGTGNTALIIQA----------- >tr|Q11JV4|Q11JV4_MESSB Curlin associated OX=266779 OS=Mesorhizobium sp. (strain BNC1). GN= PE=4 SV=1 -HIEQYGWANAA--GGDQHGSRNRIGIYQNGKRNGAVVRQQGRNN-TAAIGQQGRRNAADIWQRGKGNAAGVGQFGRDHNAVVTQ-DGNGNIAAGVQVGK GCDGEVSQSGSGNVAAFVQT----------- >tr|K1IA39|K1IA39_9GAMM Uncharacterized protein OX=1073384 OS=Aeromonas veronii AER397. GN= PE=4 SV=1 -----------------QSGNDNTAEVTQLGLANVATVEQSGSGE-TALLTQTGSRNQIDSEQRGSEDLLTITQNGNENIANARQ-SGTGETAVVTQLGN GNQADTNQQGSGNTAQVTQNGSNNQAVVDQ- >tr|Q084F0|Q084F0_SHEFN Curlin associated repeat protein OX=318167 OS=Shewanella frigidimarina (strain NCIMB 400). GN= PE=4 SV=1 -----------------QVGNGNIGLVEQTGLSNMADVYQNGESN-DAHVTQAGSNHDAQVNQYGLNQTATVTQMDFDHVATVNQ-SNEGNTATVTQLGT GNNATVDQEGMNDTATIVQAGFSNEAVVLQ- >tr|D4ZLW7|D4ZLW7_SHEVD Uncharacterized protein OX=637905 OS=DSS12). GN= PE=4 SV=1 -----------------QDGQGNDATINQNAMHNMASVEQNGDGN-FATVTQNGESNASTVMQTGTGHSADITQNDLSNLANVTQ-TGSGNGLTLTQFGT GNTASVSQNGDNDTASLSQNGNSNQVVIAQ- >tr|A3Q9V0|A3Q9V0_SHELP Curlin associated repeat protein OX=323850 OS=Shewanella loihica (strain ATCC BAA-1088 / PV-4). GN= PE=4 SV=1 STILQT-N-NNSANTDQDTSTDALSTVVQTGAENIANVNQ-------------GSADITINQ-----ATASVSQTGSFNQVN-QQ-HAINTSLSVMQNGE SNIASVNQTGDTNTTTVRQ-----EASV--- >tr|F4D7K1|F4D7K1_AERVB Curlin associated protein OX=998088 OS=Aeromonas veronii (strain B565). GN= PE=4 SV=1 ATITQSGEGGHIADVDQRESDGAVASVAQEGGNNTALVAQSGNDN-TAEVTQLGLANVATVEQSGSGETALLTQTGSRNQIDSEQ-RGSEDLLTITQNGN ENIANARQSGTGETAVVTQLGNGNQADTNQ- >tr|Q084F0|Q084F0_SHEFN Curlin associated repeat protein OX=318167 OS=Shewanella frigidimarina (strain NCIMB 400). GN= PE=4 SV=1 ATINQSGNSNNNAYIQQEASDAASATLTQTGSNNDGDILQVGNGN-IGLVEQTGLSNMADVYQNGESNDAHVTQAGSNHDAQVNQ-YGLNQTATVTQMDF DHVATVNQSNEGNTATVTQLGTGNNATVDQ- >tr|D4ZLW7|D4ZLW7_SHEVD Uncharacterized protein OX=637905 OS=DSS12). GN= PE=4 SV=1 ATISQNTGSDNHGTIYQESSDSATASLTQTGGANTGTINQDGQGN-DATINQNAMHNMASVEQNGDGNFATVTQNGESNASTVMQ-TGTGHSADITQNDL SNLANVTQTGSGNGLTLTQFGTGNTASVSQ- >tr|A3XJL1|A3XJL1_LEEBM Putative uncharacterized protein OX=398720 OS=(Flavobacterium sp. (strain MED217)). GN=MED217_04492 PE=4 SV=1 ------------------LGDANQALVWQDGDSHNSVITQTGDAN-LAYTQQNGDNSDSEVMQTGDANWSEVYQLGS-HMSIVTQ-TGDMNSSYVSQTGM THMSTVTQLGNSNVSSVVQIN---------- >tr|D7VSF1|D7VSF1_9SPHI Curlin associated repeat containing protein OX=525373 OS=Sphingobacterium spiritivorum ATCC 33861. GN=HMPREF0766_13905 PE=4 SV=1 ------------------SGTNNGAAITQDGIANASIQTQNGDAN-NATSVQDGFLNLNLQDQSGNGNIAVASQTGSLNLISQSQ-TGNTNSAFDFQTGI ANSSYVTQLGDQNYHMGIQTGMGNMMTIFQ- >tr|A3XID8|A3XID8_LEEBM Putative uncharacterized protein OX=398720 OS=(Flavobacterium sp. (strain MED217)). GN=MED217_15465 PE=4 SV=1 ------------------IGDNNEVVAYQGNDNNYSRQYQEGDNN-FANTQQYFSTQTSFQEQVGDGNYANTLQTGDLNFATVAQ-WGDDNYALTRQLGD SNGAAVTQLGLSHSSTIMQTGAFNAADVFQ- >tr|L0GKU7|L0GKU7_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_1441 PE=4 SV=1 -------------------GRGNHLHFHQDGTGNELNAVQNGTDN-EIIGVSYGWANSSDIEQNGRDNLATVRQEGSLNDVMLSQ-SGADHLIKVTQHGV GNEAIASQAGYGNAAYITQNGVNNSALVTQ- >tr|K6BQM5|K6BQM5_PSEST PPE repeat-containing protein OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_05217 PE=4 SV=1 -------------------GRGNDLHFRQDGTGNELSATQNGADN-EIVGVSYGWANASEIDQNGHDNLATVSQNGTLNDVTLSQ-GGADQVAQVTQQGF GNDAIASQAGYGNAAYVNQNGWNNTALVTQ- >tr|I4CVW7|I4CVW7_PSEST PPE repeat-containing protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_14975 PE=4 SV=1 -------------------GRGNDLHFHQDGTSNELNAVQHGTDN-EIVGVSYGWGNSADLEQDGRDNLATVRQEGSLNDVMLSQ-SGADHLIKVTQNGV GNEAIASQAGYGNAAYINQSGWNNTALVTQ- >tr|K5YCJ2|K5YCJ2_9PSED PPE repeat-containing protein OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_06877 PE=4 SV=1 -------------------GRGNELYFGQNGTGNELTAVQDGTDN-QIVGLSVGFGNSVDVAQDGRDNLAGIQQNGVANEASLTQ-TGAAHLASVNQTGW DNSAVATQSGYGNAAYINQNGWNNSATVTQ- >tr|K5YIF3|K5YIF3_9PSED Uncharacterized protein OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_15565 PE=4 SV=1 -------------------GDGNRLRLEQSGVLNSAAIRQDSYHN-ELDFAQRGNDNRLDVAQTGYGSRIEGSTSGNRNAVEISQ-SHALNRASVVQNGD DNLARIEQAYENHQAEISQLGSANEAVIRQ- >tr|H0JBD3|H0JBD3_9PSED Putative uncharacterized protein OX=1112217 OS=Pseudomonas psychrotolerans L19. GN=PPL19_09461 PE=4 SV=1 ----------------------DTVELTQRGQANVATVLSSGKFN-ELSFTQDGNRNTLNVNQAGRGNRFSGSSTGDDNLASVTV-QGDSNQTSIVQTGN GNEARLDVTSAISTASVSQVGDSNRASILQ- >tr|L0GSJ7|L0GSJ7_PSEST Uncharacterized protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_3880 PE=4 SV=1 -------------------GDGNRMTLAQIGSYNTADIHQSDYHN-ELDFTQNGDSNRLTVEQNGFGGRISGSSTGSGNSVDIAQ-RFMSNQATVIQNGN DNLASIEQGNYSHQATITQMGSANEAIIRQ- >tr|K6CQF4|K6CQF4_PSEST Uncharacterized protein OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_23592 PE=4 SV=1 -------------------GDGNRMTLAQNGAYNLAEIQQSDYQN-ELNFSQNGDANRLNVDQDGFGGIITGSSSGSRNSVDIVQ-SFMSNQATVIQNGT DNLASIEQANYGHQASITQLGSANQAHILQ- >tr|I4CYE9|I4CYE9_PSEST Uncharacterized protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_19425 PE=4 SV=1 -------------------GDGNRMNIAQAGTYNIAEIQQNDYHN-ELDFTQSGDSNRLTVDQNGFGGRISGSSTGSGNSVDIAQ-SFMSNQATVIQNGN DNLASIEQGNYGHQATITQLGSANEAMIRQ- >tr|L0GEK3|L0GEK3_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_0556 PE=4 SV=1 ------------------DNAFNDTVVDQFGEGNEALISQTGQEG-IIDVSQAGNMNVADIAQEGLANSVDLMQQGDGNLALVDQ-FGEANQAVVMQNGM DNFANVTQAGFADYANISQTGSNNTAIITQ- >tr|I4CP14|I4CP14_PSEST PPE repeat-containing protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_02825 PE=4 SV=1 ------------------DGAYNSTIVDQFGEGNEALISQSGQEG-IVDLIQTGNMNVADIDQAGIGNSVDLEQQGDGNLALVDQ-FGDSSQAVLLQTGA DNFATVTQAGFADYASVSQTGSNNTAIVTQ- >tr|H7EVV8|H7EVV8_PSEST PPE repeat-containing protein OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_10629 PE=4 SV=1 ------------------DGALNSTAVDQFGEGNEVMIVQSGQEG-SIDVSQAGNMNVADIDQAGLGNSVDLMQQGDGNLALVEQ-FGESSQAVILQSGM DNFANVTQAGFADFASVSQTGSNNMAIVTQ- >tr|A4VNF5|A4VNF5_PSEU5 Curli fiber surface-exposed nucleator CsgB, putative OX=379731 OS=Pseudomonas stutzeri (strain A1501). GN= PE=4 SV=1 ---------------VQQAGQGMTGRIAQSGAELEAYIFQNGYAN-SASIEQIGQGNAALISQDGFGNEAQIEQTGADNRAAIAQ-QGSSNRALIEQTGS GHSSNVSQSGRGLTVVVRQY----------- >tr|K6DKI9|K6DKI9_PSEST Curli fiber surface-exposed nucleator CsgB OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_05222 PE=4 SV=1 ---------------VQQNGQDMSGRIGQTGAELEAYIIQSGYAN-DAAIEQIGLGNAALISQNGFGNDARIDQFGSDNRAAIAQ-QGVSNSALIEQTGS GHTSSVSQSGQGLTVVVRQY----------- >tr|L0GJ52|L0GJ52_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_1442 PE=4 SV=1 ---------------VQQSGQDMAGRIVQRGAELEAYIIQSGYAN-SASIEQIGLGNAALISQYGFDNDAHIEQFGADNRAAIAQ-QGSSNRALIEQTGS GHSSNVSQSGRGLTVVVRQY----------- >tr|I4CVW6|I4CVW6_PSEST Curli fiber surface-exposed nucleator CsgB OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_14970 PE=4 SV=1 ---------------VQQSGESMTGRIAQTGSELEAYIIQSGYAN-GASIEQIGQGNAALISQNGFDNDAHIEQTGSDNRAAIAQ-QGSSNRALIEQTGS GHSSNVSQSGRGLTVVVRQY----------- >tr|H7EQV4|H7EQV4_PSEST Curli fiber surface-exposed nucleator CsgB OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_01667 PE=4 SV=1 ---------------VQQSGQDMAGRIVQTGAELEAYIIQSGYAN-SASIEQIGQGNTALISQNGFGNDALIEQNGSDNRAAIAQ-QGASNRALIEQTGS GHNSNVSQSGQGLTVVVRQY----------- >tr|K5ZA45|K5ZA45_9PSED Curli fiber surface-exposed nucleator CsgB OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_06872 PE=4 SV=1 ---------------IQQSGSALVGNIAQSGSELEAYILQAGYAN-TADINQIGTDNAALIMQSGDYNQARIDQDGSGNVAAIAQ-QGASNSALIEQTGS GLSSSVNQMGQGLTVVVRQY----------- >tr|L0WHL8|L0WHL8_9GAMM Curlin-associated protein OX=1177179 OS=Alcanivorax hongdengensis A-11-3. GN=A11A3_02197 PE=4 SV=1 --------------------GNNRASITQQGQSHQASIQQSGGHN-QAELLQYGTDQQADITQLGWENLAVVNQYGLGNQADIYQ-RGAYNSATVLQNGS YNTASVNQIGTANTAVISQNGFSNNVSVTQ- >tr|B1KDE0|B1KDE0_SHEWM Curlin associated repeat protein OX=392500 OS=Shewanella woodyi (strain ATCC 51908 / MS32). GN= PE=4 SV=1 --------DMNFGYADL-IGNGNDTDIEQDGDENEAVLTVVGMDNTDLDMTQEGDLNLIDLIIVGDENSALITQTGDGNWVGGDDVTGDGNSLTIAQTGN DNLVMGSQTGSDHSISVTQTGDFNTATVIQN >tr|Q0HMB6|Q0HMB6_SHESM Curlin associated repeat protein OX=60480 OS=Shewanella sp. (strain MR-4). GN= PE=4 SV=1 --------DTNFAYVDA-VGNDNEVDVEQDGDQNETIISVTGNNNADVTALQHGDLNLIDLIIEGDENSAQITQAGNGNWVGGSGVRGDNNSLMITQTGN DNLVVGSQAGNSNSISVNQTGDMNVATVVQY >tr|E6RQT4|E6RQT4_PSEU9 Putative secreted major subunit of curlin, may bind calcium OX=234831 OS=Pseudoalteromonas sp. (strain SM9913). GN= PE=4 SV=1 --------DENAILVSL-TSNDNDIDVNQVGNQNEAIVTLAGSNANQIDIAQNGDINVIDLMVEGSNHSIDIAQQGEGNWVGGGDLGGENVSFSVTQTGN YNTVEGNVIGFNSTFSVTQLGDGNTATITQM >tr|D4ZCE8|D4ZCE8_SHEVD Uncharacterized protein OX=637905 OS=DSS12). GN= PE=4 SV=1 --------DTNFGYADI-LGSDNEVDIDQDGNENEALVTVIGFDNTDVEATQNGDLNLIDLIIMGDENSVMVTQTGDGNWFGGDEVEGMDNSLMVAQNGK DNLVTGSQTGMGNSISVTQTGDFNSAVVIQN >tr|F7RJ49|F7RJ49_9GAMM Minor curlin subunit CsgB OX=327275 OS=Shewanella sp. HN-41. GN=SOHN41_00379 PE=4 SV=1 --------DVNFAYVDA-TGNDNEVDVEQDGGQNETIITVAGNNNADVTALQHGDLNLIDLIIEGDENSAQITQSGSGNWVGGSGVSGDNNSLLISQTGN DNLVLGSQAGDNNSISVTQSGDMNVATVVQY >tr|Q3ID61|Q3ID61_PSEHT Putative secreted major subunit of curlin, may bind calcium OX=326442 OS=Pseudoalteromonas haloplanktis (strain TAC 125). GN= PE=4 SV=1 --------DENTTVTAL-TSNDNNIDIMQSGVANETMVTLAGSNTNRIDISQTGEINLVDIMVEGSGHSIDISQEGEGNWVTGDQIAGEDVSFTVNQIGN DNLVQGGIIGSNNMVSVTQIGDGNTATVTQM >tr|A1S9U3|A1S9U3_SHEAM Putative uncharacterized protein OX=326297 OS=Shewanella amazonensis (strain ATCC BAA-1098 / SB2B). GN= PE=4 SV=1 --------DVNYAYSVA-FGNDNEVDIAQMGNDNEAIVTIEGNNNTDIIGTQSGDLNVLDLLIQGDENLAQITQTGNGNWVGGAGVVGEGNSFIVAQSGN DNLVTGAQMGSSNVINVTQVGNENVATVIQN >tr|G7FYD0|G7FYD0_9GAMM Putative uncharacterized protein OX=386429 OS=Pseudoalteromonas sp. BSi20495. GN=P20495_0643 PE=4 SV=1 --------QDNTAVTSL-TSNDNDIDIVQDGSLNETMLTLAGSNGSQIDIAQTGEMNVIDLMVEGNDHSIDISQQGENNWVGGEQVGGENVSFSVTQLGN DNIVEGAVIGFNSTVSVTQMGDGNTATITQM >tr|G7FNP2|G7FNP2_9GAMM Putative uncharacterized protein OX=386428 OS=Pseudoalteromonas sp. BSi20480. GN=P20480_1229 PE=4 SV=1 --------EENIAVVSL-TSNENDIEITQDGLQNETMVTLAGSNFNQIDIAQNGDFNLIDMMVEGSGHSIDISQEGENNWVVGEQVGGENVTFNVTQLGN DNIVEGGVIGVNSTVTVTQIGDGNTATITQM >tr|G7F5H6|G7F5H6_9GAMM Putative uncharacterized protein OX=1097676 OS=Pseudoalteromonas sp. BSi20429. GN=P20429_2526 PE=4 SV=1 --------QENTAVISL-TSNDNDIDIMQNGSQNEAMVTLAGSNMNQIDLAQTGEMNVVDMMVEGNGHSIDISQTGEGNWVVGEQVGGENVSFSVTQMGN DNIVEGAVIGFNSTVSVTQIGDGNVTSITQM >tr|Q12JK4|Q12JK4_SHEDO Curlin associated OX=318161 OS=Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013). GN= PE=4 SV=1 --------DENYAYVEM-QGDDNEVDVIQDGGMNEAIVSVMGYDNTDIRVAQDGDLNLVDLIITGDENFADIAQTGSANWVGGGVVNGNENSLTIVQTGN DNLVEGSQTGMNNIANATQIGDFNVATIVQN >tr|E6XM90|E6XM90_SHEP2 Curlin associated repeat protein OX=399804 OS=Shewanella putrefaciens (strain 200). GN= PE=4 SV=1 --------DANFAYVDA-VGNDNEVDVEQNGNQNESIISIVGNNNADATALQKGDLNLIDLIIKGDENTAQIAQSGNRNWVGGDSVKGDNNSLLIAQKGN DNLVLGSQAGNNNSISVNQSGNMNIATVVQY >tr|G0DL20|G0DL20_9GAMM Curlin-associated protein OX=693970 OS=Shewanella baltica OS117. GN=Sbal117_3641 PE=4 SV=1 --------DINFAYVDA-VGSDNEVDVEQDGAQNEAVISIAGNNNADVTALQDGDLNLIDLIIDGDENTAQIAQSGNGNWVGGASVSGDNNSLLITQTGN DNLVLGSQAGNNNSISVNQSGNMNVATVVQY >tr|Q5QXG9|Q5QXG9_IDILO Uncharacterized conserved secreted protein with internal repeats OX=283942 OS=Idiomarina loihiensis (strain ATCC BAA-735 / DSM 15497 / L2-TR). GN= PE=4 SV=1 --------DANTAEVVF-VGDDNDVMSSQVGDGNVTLAYSAGTTSGNTLDSEQGDINLIDTIIGGNENLISFYQEGNSNVIAGDGVGGSNGSVDIQQVGD YNVVTGGQASDGNTMSVYQQGDYNVATVVQY >tr|K2KIC7|K2KIC7_9GAMM Uncharacterized protein OX=745411 OS=Gallaecimonas xiamenensis 3-C-1. GN=B3C1_02405 PE=4 SV=1 --------DDNYAYVAF-TGDQNFVDVNQDGAANAVALTGEG--NGNELALAQGDLNLVDLLVAGNQNLIDVSQDGNGNLVGGGGVTGTGGGVSITQSGN DNLVFGAQASSGNVMAVTQVGDYNTATISQN >tr|I1DYJ1|I1DYJ1_9GAMM Uncharacterized protein OX=562729 OS=Rheinheimera nanhaiensis E407-8. GN=RNAN_2109 PE=4 SV=1 --------DENFALVDV-LGDSNDLYVDQSGNANEVLLTSETVVENTVVSIEQGDINLVDMLVAGNANSIDILQNGSGNWVAGDNVSGNNGMLTVTQQGN DNLVLGSQAGEGNMMSITQVGDFNTAIVSQH >tr|A4C978|A4C978_9GAMM Putative secreted major subunit of curlin, may bind calcium OX=87626 OS=Pseudoalteromonas tunicata D2. GN=PTD2_08864 PE=4 SV=1 --------DGNSSLVAL-TSFNNEIMVTQSGDENESTVTFSNSNNSSVDIAQSGDLNSLDMMIEGSGHNIDIAQTGNYNMIGGGGVGGGDVMLSINQIGD GNLVEGSIISASGSIAITQVGDFNSATIVQQ >tr|K6X1B8|K6X1B8_9ALTE Uncharacterized protein OX=1127673 OS=Glaciecola lipolytica E3. GN=GLIP_1820 PE=4 SV=1 --------DENTLLSGF-YADDNIIDASQVGNQNDAVIRLGGSNFNTLDVAQTGDLNIIDLLIDGSDNTIDIMQEGNDNWVVGETITGDGMSLSISQVGN DNLVTGSMSGSGGSISVTQIGDYNVASITQQ >tr|K0D7Q3|K0D7Q3_ALTMS Putative secreted major subunit of curlin OX=1004785 OS=Alteromonas macleodii (strain Black Sea 11). GN= PE=4 SV=1 --------DFNLALSTY-IADGNLIEMSQEGIENTASVELARSGYNEIMVSQSGELNLLDLLIDGSDNIVSVMQEGAGNWVTDAGISGDMNTFEVTQMGN DNLVTGSITGNGGTVSVTQVGDYNVATVVQM >tr|K7RPD1|K7RPD1_ALTMA Putative secreted major subunit of curlin OX=1004786 OS=Alteromonas macleodii AltDE1. GN=amad1_16600 PE=4 SV=1 --------DANLALAAY-IADANLIEMSQEGNENIASVELARSSGNEIMVAQTGELNLLDLLVNGNDNVISVMQEGAGNWVTDMGISGDMNTFEVTQMGN DNLVTGSITGNGGTVSVTQVGDYNVATVVQM >tr|J2IIZ5|J2IIZ5_9ALTE Uncharacterized protein OX=1197174 OS=Alishewanella aestuarii B11. GN= PE=4 SV=1 --------SDNFALLDV-AGDRNRLNVNQAGNANEVIVTGESVNNNNVTVNQRGDLNLVDVLLEGNGNTLNVLQRGSQNWVGGNAVAGDNGTVLITQQGS DNLVLGAQGGVDNVLQVVQIGSFNTAIVNQN >tr|E5YGC9|E5YGC9_9ENTR Putative uncharacterized protein OX=469613 OS=Enterobacteriaceae bacterium 9_2_54FAA. GN=HMPREF0864_01583 PE=4 SV=1 ------------------IGSDNRAFTHQQGTNNHSIIVQQGNSN-QGRITQSSSNNNALIAQRGSGNSADITQLSSNNNAVIAQ-LGSGNSDSIIQDSF GNSAYIISFGKNNITQITQTGTNRSAGVVQ- >tr|A1S9U2|A1S9U2_SHEAM Minor curlin subunit CsgB, putative OX=326297 OS=Shewanella amazonensis (strain ATCC BAA-1098 / SB2B). GN= PE=4 SV=1 ----------------------------SSGRDNLIDLFQIGVQN-EAMVAQSGDSNSLIVTQIGVANQALVRQLGTDNEVDLFQ-AGNHNSAEITQIGD NNLVQLKQLGSANFS-IQQIGDGASIAVTQY >tr|B1KDE1|B1KDE1_SHEWM Curlin associated repeat protein OX=392500 OS=Shewanella woodyi (strain ATCC 51908 / MS32). GN= PE=4 SV=1 ----------------------------SNGRSSLVNLVQLGNLN-TANIMQTGNNNYVDLMQLGDNNEANITQDGKNNQVELIQ-VGGDNQADITQIGN DNLVNLNQLGSANFS-IEQIADGAEITITQY >tr|Q12JK5|Q12JK5_SHEDO Curlin associated OX=318161 OS=Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013). GN= PE=4 SV=1 ----------------------------SHGRDNLVNLFQHGSFN-QAILSQTGIENSAYLTQLGIGNQTSVIQIGSNNELELLQ-QGDNNRADVTQIGN DNLIQINQLGSATFA-IEQIADNAAITITQY >tr|K6BQM5|K6BQM5_PSEST PPE repeat-containing protein OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_05217 PE=4 SV=1 -------------------GGNNTATTEQSGFHNEADVYQHGRNQ-TAKTLQEGAGNVASVDQQGRGNDLHFRQDGTGNASEIDQ-NGHDNLATVSQNGT LNDVTLSQGGADQVAQVTQQGFGNDAIASQ- >tr|I4CVW7|I4CVW7_PSEST PPE repeat-containing protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_14975 PE=4 SV=1 -------------------GGNNTATTEQSGIGNEADVYQDGRNQ-TAKTIQEGGFNVASVEQQGRGNDLHFHQDGTSNSADLEQ-DGRDNLATVRQEGS LNDVMLSQSGADHLIKVTQNGVGNEAIASQ- >tr|H7EQV3|H7EQV3_PSEST PPE repeat-containing protein OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_01662 PE=4 SV=1 -------------------GGNNTATTEQFGTGNEADVYQNGRNQ-TAKTIQEGSYNVASVDQQGRGNDLHFHQDGVGNSSDIEQ-NGRDNVATVRQEGS LNDVMLSQSGADHLIKVTQHGFGNEAIASQ- >tr|K5YCJ2|K5YCJ2_9PSED PPE repeat-containing protein OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_06877 PE=4 SV=1 -------------------GGNNTATTEQFGTGNSADVFQDGRNQ-TAKTTQVGAFNSADVDQQGRGNELYFGQNGTGNSVDVAQ-DGRDNLAGIQQNGV ANEASLTQTGAAHLASVNQTGWDNSAVATQ- >tr|J8UTF3|J8UTF3_PSEPU Curlin-associated protein OX=1218169 OS=Pseudomonas putida S11. GN=PPS11_39368 PE=4 SV=1 -------------------GTNQTATINQNGNGNGANLTQNGDNQ-LATLTQKGTGNQMETKQADMNNELYFEQNGTDNVINLDQ-SGYSNQSYTNQYGS GNETNIKQVDSYNIAYVTQGGQGNKAYVDQ- >tr|B6QZQ1|B6QZQ1_9RHOB Curlin associated repeat protein, putative OX=439495 OS=Pseudovibrio sp. JE062. GN=PJE062_5083 PE=4 SV=1 --LVQDAGATQSY---VIEGDMNDAGYRQTGDLQDVDVSVN-SNF-WIRTRQAGNSNTVDY-GGGTNGKVSIQQNGAGNAFS-YS-WGDQQYIAVK-SGD NN---IDGL-G-NLNKFY-LQTTNDNG---- >tr|G8PUY3|G8PUY3_PSEUV Curlin associated repeat protein OX=911045 OS=Pseudovibrio sp. (strain FO-BEG1). GN= PE=4 SV=1 --LTQDGGSNGSY---VIEGNMNNAVYFQTGDSQDADVAVT-DNF-MIRTKQSGADNGIDY-GIGTDGRVSIHQIGAGNTAT-NS-NGDDQYIALK-RGD NN---IAGL-G-NKNKFY-LQKTNGST---- >tr|B1KDE0|B1KDE0_SHEWM Curlin associated repeat protein OX=392500 OS=Shewanella woodyi (strain ATCC 51908 / MS32). GN= PE=4 SV=1 --IKQD-GMGNTIGDTLVQGDDNDLRVDQEGDNNTAQFQVFGNDN-DVSLDQEGNTNVATFGAYGNDNDFELRSDGAENEVTAFA-EGDDNKIEISQEGD MNFGYADLIGNGNDTDIEQDGDENEAVLT-- >tr|D4ZCE8|D4ZCE8_SHEVD Uncharacterized protein OX=637905 OS=DSS12). GN= PE=4 SV=1 --ITQR-GDNNTIGDTLIQGRDNNIDLELEGNNNGAEFQVLGNEN-DIDLDQEGDTNFATFGIYGDDNDIDMSSDGDNNEILAFA-AGDDNDIEVHQDGD TNFGYADILGSDNEVDIDQDGNENEALVT-- >tr|F7RJ49|F7RJ49_9GAMM Minor curlin subunit CsgB OX=327275 OS=Shewanella sp. HN-41. GN=SOHN41_00379 PE=4 SV=1 --IAQD-GDSNTVGDTLIQGSDNDISIKQKGDSNGAEFQVWGDSN-DVDLKQRGDTNFATFGAYGTDNDFDLSSKGDNNELVAFA-TGEDNSIEISQEGD VNFAYVDATGNDNEVDVEQDGGQNETIIT-- >tr|A1S9U3|A1S9U3_SHEAM Putative uncharacterized protein OX=326297 OS=Shewanella amazonensis (strain ATCC BAA-1098 / SB2B). GN= PE=4 SV=1 --INQS-GSENAIGDSLIEGSDNMIDIEQEGFSNSAQFIVDGDDN-DVDLEQEGDLNYAEFVAVGNDNTLNLSSEGNGNQLLAGA-FGEDNSLEVAQEGD VNYAYSVAFGNDNEVDIAQMGNDNEAIVT-- >tr|Q12JK4|Q12JK4_SHEDO Curlin associated OX=318161 OS=Shewanella denitrificans (strain OS217 / ATCC BAA-1090 / DSM 15013). GN= PE=4 SV=1 --VLQD-GNGNMIGDTNINGSNNHIDIEQTGDTNLAEFEVTTDDN-DADLEQDGNNNYAAFGTYGNDNDFDLSSEGNDNQIFAFA-IGEDNDADISQEGD ENYAYVEMQGDDNEVDVIQDGGMNEAIVS-- >tr|A1RG63|A1RG63_SHESW Curlin associated repeat protein OX=351745 OS=Shewanella sp. (strain W3-18-1). GN= PE=4 SV=1 --ITQN-GSGNTIGDTLIQGDDNDITITQKGDVNGAEFQVWGDSN-DVDLQQRGDANFATFGAYGTDNDFDLSSKGNYNEIVAFA-EGEDNSIEVDQKGD ANFAYVDAVGNDNEVDVEQNGNQNESIIS-- >tr|G0DL20|G0DL20_9GAMM Curlin-associated protein OX=693970 OS=Shewanella baltica OS117. GN=Sbal117_3641 PE=4 SV=1 --ITQN-GSGNTIGDTLIQGNDNDITITQKGDVNGAEFQVWGDSN-DVDLKQKGDDNFATFGAYGTDNDFDLSSKGDNNEIVAFA-SGEDNSIEVSQEGD INFAYVDAVGSDNEVDVEQDGAQNEAVIS-- >tr|Q5QXG9|Q5QXG9_IDILO Uncharacterized conserved secreted protein with internal repeats OX=283942 OS=Idiomarina loihiensis (strain ATCC BAA-735 / DSM 15497 / L2-TR). GN= PE=4 SV=1 --VQQQ-GNSNVVEAANMIGNDNLIDAYQRGDNNYAVFATEGDMH-NVELEQAGDNNTAVGESYGEQNDIMVSSRGDDNFNGVAV-QGVSNDIQSRQRGD ANTAEVVFVGDDNDVMSSQVGDGNVTLAY-- >tr|K2KIC7|K2KIC7_9GAMM Uncharacterized protein OX=745411 OS=Gallaecimonas xiamenensis 3-C-1. GN=B3C1_02405 PE=4 SV=1 --ADQR-GDANLIEAPYLVGNDVLIDIQQRGDENYSFVATDGDSH-NISINQYGDLNEAEALVLGNRNEATIVTRGDANYAGIGL-DGAANYAEVVQEGD DNYAYVAFTGDQNFVDVNQDGAANAVALT-- >tr|I1DYJ1|I1DYJ1_9GAMM Uncharacterized protein OX=562729 OS=Rheinheimera nanhaiensis E407-8. GN=RNAN_2109 PE=4 SV=1 --VEQR-GDANSVTAPILIGNDNQIDVLQRGNGNSSVFATAGDGH-NVEVRQIGDNNTSEMVLAGADNRGSIYSRGDNNLAAIGV-SGLGNTADVQQFGD ENFALVDVLGDSNDLYVDQSGNANEVLLT-- >tr|K6X1B8|K6X1B8_9ALTE Uncharacterized protein OX=1127673 OS=Glaciecola lipolytica E3. GN=GLIP_1820 PE=4 SV=1 --FEQS-GDDNYAQLGLFEGSDSTAEITQIGTFNDAQIDNLGDDN-DYEITQIGNDNIGYAGATGFYNDFSVFQNGDLNLASTVNFDGFYNEVTLSQEGD ENTLLSGFYADDNIIDASQVGNQNDAVIR-- >tr|K0D7Q3|K0D7Q3_ALTMS Putative secreted major subunit of curlin OX=1004785 OS=Alteromonas macleodii (strain Black Sea 11). GN= PE=4 SV=1 --FEQD-GDGNEITTGVFEGFENAVNIDQIGDFNLATVETLGGGN-TFEINQIGDSNTAYAGVIGLFNEFDLTQVGDANQISTVNFDGWFNTVEIEQDGD FNLALSTYIADGNLIEMSQEGIENTASVE-- >tr|K0CVW0|K0CVW0_ALTME Putative secreted major subunit of curlin OX=1004788 OS=Alteromonas macleodii (strain English Channel 673). GN= PE=4 SV=1 --FEQD-GDGNEITTGVFEGSGNEVNIDQIGDVNLATVETIGGQN-TFEINQVGEGNTAYAGVIGLFNEFDLTQVGDANEISTVNFDGLFNTVEIEQDGD ANLALATYYADGNIIEMSQEGVENTASVE-- >tr|J2IIZ5|J2IIZ5_9ALTE Uncharacterized protein OX=1197174 OS=Alishewanella aestuarii B11. GN= PE=4 SV=1 --VDQV-GANNLIEAPIISGDNNTYDFVQAGDDNYIFFASVGSGH-DLMISQTGNNNNGEMILVGSANVGTIETTGDNNYAGIGL-VGNANNGRISQNGS DNFALLDVAGDRNRLNVNQAGNANEVIVT-- >tr|K5YIF3|K5YIF3_9PSED Uncharacterized protein OX=440512 OS=Pseudomonas sp. Chol1. GN=C211_15565 PE=4 SV=1 --VSQIGVGNLLSVVQQGFYEGSTLVVAQDGEANLAMIEQG-DGN-RLRLEQSGVLNSAAIRQDSYHNELDFAQRGNDNRLDVAQ-TGYGSRIEGSTSGN RNAVEISQSHALNRASVVQNGDDNLARIEQ- >tr|H0JBD3|H0JBD3_9PSED Putative uncharacterized protein OX=1112217 OS=Pseudomonas psychrotolerans L19. GN=PPL19_09461 PE=4 SV=1 --LEQNGDNNQVRVDQSG----VTASLSQSGTDNRIDLTQ----D-TVELTQRGQANVATVLSSGKFNELSFTQDGNRNTLNVNQ-AGRGNRFSGSSTGD DNLASVTVQGDSNQTSIVQTGNGNEARLDV- >tr|L0GSJ7|L0GSJ7_PSEST Uncharacterized protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_3880 PE=4 SV=1 --VDQRGDDNQLTFRQEGFFEGAQLSVSQDGLGNMADIFQG-DGN-RMTLAQIGSYNTADIHQSDYHNELDFTQNGDSNRLTVEQ-NGFGGRISGSSTGS GNSVDIAQRFMSNQATVIQNGNDNLASIEQ- >tr|K6CQF4|K6CQF4_PSEST Uncharacterized protein OX=1218352 OS=Pseudomonas stutzeri KOS6. GN=B597_23592 PE=4 SV=1 --VEQRGNNNRLTFSQQGYFEGSNMNVSQDGLGNMADIFQG-DGN-RMTLAQNGAYNLAEIQQSDYQNELNFSQNGDANRLNVDQ-DGFGGIITGSSSGS RNSVDIVQSFMSNQATVIQNGTDNLASIEQ- >tr|I4CYE9|I4CYE9_PSEST Uncharacterized protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_19425 PE=4 SV=1 --IEQRGDDNVLTFDQQGYFEGSRMNVSQDGLGNLADIYQG-DGN-RMNIAQAGTYNIAEIQQNDYHNELDFTQSGDSNRLTVDQ-NGFGGRISGSSTGS GNSVDIAQSFMSNQATVIQNGNDNLASIEQ- >tr|H7EWC8|H7EWC8_PSEST Putative uncharacterized protein OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_11509 PE=4 SV=1 --VEQRGDDNVLTFSQQGYYEGARMNVSQDGLGNMADIFQG-DGN-RMTLAQAGTYNVADIHQSDYHNELDFTQSGDGNRLTVDQ-NGFGGRISGSSTGS GNSVDIAQTYMSNQATVVQNGNDNLASIEQ- >tr|L0WFL0|L0WFL0_9GAMM Curlin-associated protein OX=1177179 OS=Alcanivorax hongdengensis A-11-3. GN=A11A3_02202 PE=4 SV=1 ALVDQTGDTNSTNISQQNVSDYAYAEVNQAGDNSSVDVLQDQSHDSYAYVDQNGSMLTAEVDQGGDRNLADINQTGTDNFARVEQ-SGSDNEGMITQSGG NNSADIDQWGGNGHAAITQSNVNNDAYINQT >tr|K4KLN9|K4KLN9_9GAMM Curlin-associated protein OX=1117647 OS=Simiduia agarivorans SA1 = DSM 21679. GN=M5M_09005 PE=4 SV=1 AYISQEGQGHNASISQTAISDLATADIAQTGANNSASIGQDGSLFSEAYVAQSGESNSTNVYQGQGIHLALVDQDGLGNSAWVNQ-YGIADTALVSQNGE GLYADVYQSGDLNDAIVDQSGMYNQTYISQY >tr|A5PB09|A5PB09_9SPHN Curlin associated protein OX=161528 OS=Erythrobacter sp. SD-21. GN=ED21_23541 PE=4 SV=1 -----------------------SATINQEGASGQAFITQIGNDQNTARITQNGDDSIATITQNGGLNTSEIVQNWADNTATSSQ-NGSELQSMITQNGE FNTATVNQTGDGHMSTVTQTGTGNSAMVTQG >tr|A5PB11|A5PB11_9SPHN Putative uncharacterized protein OX=161528 OS=Erythrobacter sp. SD-21. GN=ED21_23551 PE=4 SV=1 ------------------------AAVDQVGASNTATINQYSDSGNASTIGQTNTLNSASSLQTGTNGYSLIMQDGHDNLATLID-SGDLNRSTITQSGD LNSALVTQGGSSNVSTIAQDGAGNSATVAQN >tr|A5PB10|A5PB10_9SPHN Curlin associated protein OX=161528 OS=Erythrobacter sp. SD-21. GN=ED21_23546 PE=4 SV=1 -----------------------TSTIDQTGRNDSAFVEQLAGSENSATVSQSGALSTADVLQSGSGNTSRLSQNGTVNTADVNQ-LGNDNLSDIAQDGS GNSATVTQNSDANTSYVNQNGNGNTASVTQG >tr|Q1NBA4|Q1NBA4_9SPHN Putative uncharacterized protein OX=314266 OS=Sphingomonas sp. SKA58. GN=SKA58_07103 PE=4 SV=1 ------------------------ASISQDGSNNSSTIDQFGGDNNQSTISQTNTL--ASVDQGGNDNISTVMQDGALNEAMVDQ-SGNGNESWVSQAGS GHSATVTQSSDMNNSVVNQTGMNNTATVTQG >tr|L0GEK3|L0GEK3_PSEST Curlin associated repeat-containing protein OX=644801 OS=Pseudomonas stutzeri RCH2. GN=Psest_0556 PE=4 SV=1 ------------------AGLANSADITQDGLNNDVVLIQDNAFN-DTVVDQFGEGNEALISQTGQEGIIDVSQAGNMNVADIAQ-EGLANSVDLMQQGD GNLALVDQFGEANQAVVMQNGMDNFANVTQ- >tr|I4CP14|I4CP14_PSEST PPE repeat-containing protein OX=1196835 OS=Pseudomonas stutzeri CCUG 29243. GN=A458_02825 PE=4 SV=1 ------------------AGFANSADVTQDGVSNDVVLIQDGAYN-STIVDQFGEGNEALISQSGQEGIVDLIQTGNMNVADIDQ-AGIGNSVDLEQQGD GNLALVDQFGDSSQAVLLQTGADNFATVTQ- >tr|H7EVV8|H7EVV8_PSEST PPE repeat-containing protein OX=32042 OS=Pseudomonas stutzeri ATCC 14405 = CCUG 16156. GN=PstZobell_10629 PE=4 SV=1 ------------------AGLANSADVIQDGLSNDVVLLQDGALN-STAVDQFGEGNEVMIVQSGQEGSIDVSQAGNMNVADIDQ-AGLGNSVDLMQQGD GNLALVEQFGESSQAVILQSGMDNFANVTQ- >tr|E4RJ05|E4RJ05_HALSL Curlin associated repeat-containing protein OX=656519 OS=Halanaerobium sp. (strain sapolanicus). GN= PE=4 SV=1 ------------------NGEENYGCIDQHGYWNWAGIYQKGDNN-TSNLDQYGDFNFAGILQYGNDNQANIYQNGFKQWAGIGQ-FGDGHTASISQYGS NNMALILQGGNDKNAIIIQDGHDLK------ >tr|D4ZLW5|D4ZLW5_SHEVD Uncharacterized protein OX=637905 OS=DSS12). GN= PE=4 SV=1 ------------------TGDENTADVDQTGAENSATVAQTSFG-STATVLQTGSLNDATVTQAGDEQTANITQNDLSNLAQVNQ-SGASNSLTLTQSGT GNTATINQAGTNDSAVVAQTGNSNTVLIDQ- >tr|A9DHH5|A9DHH5_9GAMM Curlin associated protein OX=314608 OS=Shewanella benthica KT99. GN=KT99_10748 PE=4 SV=1 ------------------TGDENTVDIDQTGTENSVTVDQNSFG-SSATVLQTGSLNDASITQTGDEQTVNVTQNDLSNIANIDQ-SGASNSLTLTQGGS GNTATINQAGTNDTALVAQTGNSNTVLIDQ- >tr|A8H7L4|A8H7L4_SHEPA Curlin associated repeat protein OX=398579 OS=Shewanella pealeana (strain ATCC 700345 / ANG-SQ1). GN= PE=4 SV=1 ------------------IGNQNQASVNQSGAGQVANITQASDL-SVVSLSQQDQGNSATVNQTGTAQTATITQFDESNVANVTQ-GGSNNQLTLTQSGN GNEAIVSQAGNGDAAVIVQVGENNTAVVEQ- >tr|A9DHH0|A9DHH0_9GAMM Curlin associated protein OX=314608 OS=Shewanella benthica KT99. GN=KT99_10738 PE=4 SV=1 ------------------VGNGNTADVDQSGPSNGVTIAQTSSG-SFAQVAQTGESNGATVTQTGADHSAIITQNDFSNTAAVTQ-SGLGNGLTLTQSGT GNTATVNQDGDNDTATLSQTGNSNQVVIAQ- >tr|Q084E8|Q084E8_SHEFN Curlin associated repeat protein OX=318167 OS=Shewanella frigidimarina (strain NCIMB 400). GN= PE=4 SV=1 ------------------QGDNDTAYVSQSGQDGYASVSQSGNTLNTGLVYQSGFSNSATIAQVGSDHMSTITQSEYDNVANVDQ-TNVDNNATVTQSGM GNLATVMQDGLGDSALVEQIGSDNTAIVEQ- >tr|L0WFL0|L0WFL0_9GAMM Curlin-associated protein OX=1177179 OS=Alcanivorax hongdengensis A-11-3. GN=A11A3_02202 PE=4 SV=1 -YVDQ-NGSMLTAEVDQGLGDRNLADINQTGTDNFARVEQSGSDN-EGMITQSGGNNSADIDQWGGNGHAAITQSNVNNDAYINQTPAEFGVADITQSGS NNQADILQTGDPNTATITQTSSFNNASIM-- >tr|K4KLN9|K4KLN9_9GAMM Curlin-associated protein OX=1117647 OS=Simiduia agarivorans SA1 = DSM 21679. GN=M5M_09005 PE=4 SV=1 -YVAQ-SGESNSTNVYQGPQGIHLALVDQDGLGNSAWVNQYGIAD-TALVSQNGEGLYADVYQSGDLNDAIVDQSGMYNQTYISQY-------------- ---------GMNNTVFVQQTGAYDYANVV-- >tr|Q0AMM3|Q0AMM3_MARMM Curlin associated repeat protein OX=394221 OS=Maricaulis maris (strain MCS10). GN= PE=4 SV=1 -------------------GERQYVTVIQDGARNGTTVGQNGRYN-EAHVDQYGRNNDAVVGQQGYNNYTRNIQTGRRNIVGVSQ-MGRNNTAVTDQYGN NNAAGVIQVGNGNNANVRQTGSGNVTLVIQ- >tr|A3UDQ5|A3UDQ5_9RHOB Putative uncharacterized protein OX=314254 OS=Oceanicaulis sp. HTCC2633. GN=OA2633_04391 PE=4 SV=1 -------------------GVRQFISTLQDGTRNGAQVSQEGGDN-QAGLSQQGRRNRVVIGQEGWLNDAFSSQSGYGNVVGVAQ-IGEGHQAITHQEGS NNALGVIQIGRGQFTDVHQSGSGNVTLVIQ- >tr|I3C6C3|I3C6C3_9FLAO Curlin associated repeat-containing protein OX=926559 OS=Joostella marina DSM 19592. GN=JoomaDRAFT_2176 PE=4 SV=1 ------------------DGRRNEADVDQIGVLNVASVSQDGRRN-SSTQFQLGAGNLAVDSQTGRRNDADTYQLGFLNQSYVTQ-DGRRNSALAVQTGI GNMSNVNQSGMDHRALTVQTGTGNMSGITQ- >tr|A6EMN4|A6EMN4_9BACT Curlin associated repeat protein OX=50743 OS=unidentified eubacterium SCB49. GN=SCB49_02229 PE=4 SV=1 ------------------YGESNDSDVDQDGNGNVADVDQYGLSN-YSDVDQDGNDNAAVVYQNGESNDSDVDQDGSDNYAFVGQ-DGDDNDSDVDQDGT DNYAYVYQNGELNDSGVVQNGSTNAATVI-- >tr|I0QV39|I0QV39_9ENTR Major curlin subunit OX=932213 OS=Serratia sp. M24T3. GN= PE=4 SV=1 ----------------------EELQIYQEGSLNLANATQTSAQNSLTTVNQHGNANVASSNQVGDRSLLDIHQAGNFNRADALQ-SGDKSKLLISQSGT ANYANATQSASGSLANIVQVGTANRAYASQ- >tr|J0LL29|J0LL29_9BACT Uncharacterized protein OX=1144253 OS=Pontibacter sp. BAB1700. GN=O71_11009 PE=4 SV=1 ------------------IGDMNVAETSQIGDANSSVTAQDGTTN-SAAVSQSGSLQTATVAQMGNSNSVAVNQTGEGNVAGSLQ-VGDLNSIVTNQSGS GNYSGTYQSGTSNSALINQSGV--------- >tr|Q26FL3|Q26FL3_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02233 PE=4 SV=1 ------------------FLSSDESYQEQLGTDNKAVVFQDGDRN-EARSTQNGNNNEVNQEQYGMDNYSRVSQGHVGNMATSIQ-DGADHHSVINQRAN GNEAMVDQTGNGQRSLINQNSPQNTATVIQ- >tr|Q26FL5|Q26FL5_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02231 PE=4 SV=1 ------------------YLSSDTSYQEQLGTDNKAVVFQDGDRN-EARSDQNGNGNSVNQEQYGQDNFSSVKQRYAGNMATSIQ-DGSLNNSVINQQSH GNMSMVDQTGDGQTSLINQNNPYNNATVIQ- >tr|Q26FL4|Q26FL4_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02232 PE=4 SV=1 ------------------KGTDNTLFQEQIGSGNKAVAYQDGTDN-EARSKQNGSNNEAYQEQYGDQNFSSIEQRGPGNVALSIQ-DGSLNHSMIEQKAR GNSAIVDQTGDGQMSVVNQNLAYNNATVIQ- >tr|Q26FL6|Q26FL6_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02230 PE=4 SV=1 ------------------YGSDNSSFQEQNGLLNTAEVYQLGGLN-EAYIEQDGQNHSANQEQYGIFNESVVIQNGSGNQALSIQ-RGMNNDSYINQMAN GNIADVDQLGMNHQSVINQNSLSNTAIVTQ- >tr|A8AI38|A8AI38_CITK8 Uncharacterized protein OX=290338 OS=Citrobacter koseri (strain ATCC BAA-895 / CDC 4225-83 / SGSC4696). GN= PE=4 SV=1 --IFQY-GSANSALALQSDARKSDLSIKQYGHGNGADVGQGAD-NSGIDLTQNGYRNSATIDQNAKNSDIVVSQFGGRNGALVNQ-TASDSQVSVTQVGF GNNATANQY---------------------- >tr|H5V126|H5V126_ESCHE Major curlin subunit OX=1115512 OS=Escherichia hermannii NBRC 105704. GN= PE=4 SV=1 --IYQD-GAHHVVNSVQTGADGSLVDIKQQHYNNFANVQQSAK-DSNIWLDQDGVKNTADIKQGGFGSDTDVTQKGNYNAVNVNQ-NAVR---DVTQRGY NNFVNAAQ----------------------- >tr|H2J0L3|H2J0L3_RAHAC Uncharacterized protein OX=745277 OS=105701 / NCIMB 13365 / CIP 78.65). GN= PE=4 SV=1 --IYQQ-GTANLANATQSGAQKSLTAISQDGYRNSAGTNQTGD-GSLISVVQKGDYNGANVSQSANGSKVLVSQNGSGNYAQASQ-GANGSLTSITQVGT ANLAYASQN---------------------- >tr|K6W1F1|K6W1F1_ESCBL Major curlin subunit OX=1115514 OS=Escherichia blattae NBRC 105725. GN= PE=4 SV=1 --IYQD-GKSNYAGATQSSAKKSLTDISQQGDKNSATTIQTGD-NSLIDVNQKGHFNNAYVNQSADSSKVLISQTGFANSATAQQ-HTSGTLTSVTQVGF GNVAVTNQ----------------------- >tr|D2TT49|D2TT49_CITRI Major curlin subunit OX=637910 OS=4280). GN= PE=4 SV=1 --IYQS-GVNNAALALQSDARKSETTIRQDGFGNGADVGQGAD-NSTIELTQSGFRNNATIDQNGKNSDISVSQYGGNNAALVNQ-TASDSSVLVSQVGF GNNATANQY---------------------- >tr|L1YXF5|L1YXF5_ECOLX Major curlin subunit OX=1240777 OS=Escherichia coli O104:H4 str. 11-03943. GN=C221_00080 PE=4 SV=1 --IYQY-GGGNSALALQTDARNSDLTITQHGGGNGADVGQGSD-DSSIDLTQRGFGNSATLDQNGKNSEMTVKQFGGGNGAAVDQ-TASNSSVNVTQVGF GNNATAHQY---------------------- >tr|K8VV74|K8VV74_SALTM Major curlin subunit OX=1218153 OS=Salmonella enterica subsp. enterica serovar Typhimurium str. STm5. GN= PE=4 SV=1 --IYQY-GSANAALALQSDARKSETTITQSGYGNGADVGQGAD-NSTIELTQNGFRNNATIDQNAKNSDITVGQYGGNNAALVNQ-TASDSSVMVRQVGF GNNATANQY---------------------- >tr|I6SCT4|I6SCT4_ENTCL Major curlin subunit OX=1104326 OS=Enterobacter cloacae subsp. dissolvens SDM. GN= PE=4 SV=1 --IYQN-GGGNSAVALQTNARDSTLSISQSGGGNGADVGQGSD-DSTISLTQNGFANSATLDQNSHDSTMNVSQYGGFNGALVDQ-TASNSTVNVTQIGF GNHASAYQY---------------------- >tr|I4ZL04|I4ZL04_ENTCL Major curlin subunit OX=1177927 OS=Enterobacter cloacae subsp. cloacae GS1. GN= PE=4 SV=1 --IYQY-GGGNSALALQTDARDSELTITQHGGGNGADVGQGSD-DSSIDLLQKGFGNSATIDQNSKDSVINVKQFGGGNGAAVDQ-TASGSTVTVHQVGF GNNATAHQY---------------------- >tr|G8LIR0|G8LIR0_ENTCL Major curlin subunit OX=1045856 OS=Enterobacter cloacae EcWSU1. GN= PE=4 SV=1 --IYQY-GGGNSALALQTSARNSTLTINQSGGGNGADVGQGSD-DSSISLTQNGFGNSATLDQNGNHSVMNVSQYGGGNGAAVDQ-TASGSTVTVQQVGF GNHATAHQY---------------------- >tr|Q7X237|Q7X237_CROSK Curlin-csgA protein OX=28141 OS=Cronobacter sakazakii (Enterobacter sakazakii). GN= PE=4 SV=1 --IYQN-GGGNSALALQTDARNSVLNISQTGGGNGADVGQGSD-DSSINLTQNGFGNSATLDQNSKDSVMNVSQYGGLNGALVDQ-TASNSTVNVTQIGF GNHATAHQY---------------------- >tr|I6H081|I6H081_SHIFL Major curlin subunit OX=766154 OS=Shigella flexneri 1235-66. GN= PE=4 SV=1 --IYQY-GSSNVANALQSDARKSDVTITQHGHGNGATVGQGAD-DSTISLKQTGFQNSADINQNAKNADISVTQFGGRNGAVVNQ-TASDSNVLIQQVGY GNNATANQH---------------------- >tr|G2S882|G2S882_ENTAL Curlin associated repeat-containing protein OX=640513 OS=Enterobacter asburiae (strain LF7a). GN= PE=4 SV=1 --IYQS-GGGNSAVALQSDARNSTMNISQTGGGNGADVGQGSD-DSTISLTQNGFGNSATLDQNSKDSTMTVSQYGGLNGASVDQ-TASNSSVSVTQVGI GNHVSAHQY---------------------- >tr|A4W957|A4W957_ENT38 Curlin associated repeat protein OX=399742 OS=Enterobacter sp. (strain 638). GN= PE=4 SV=1 --IYQT-GGGNSAVALQSNAKDSVLSISQHGGGNGADVGQGSD-DSSIELVQHGFGNSATLDQNGKDSTMTVKQFGGGNGAAVDQ-TASGSTVSVTQVGF GNNATAHQY---------------------- >tr|D2ZAV5|D2ZAV5_9ENTR Major curlin subunit CsgA OX=500639 OS=Enterobacter cancerogenus ATCC 35316. GN=ENTCAN_05595 PE=4 SV=1 --IYQY-GGGNSALALQSDARDSSLSISQSGGGNGADVGQGSD-DSTITLTQNGFGNSATLDQNGKDSTMTVSQFGGGNGAAVDQ-TASGSTVTVQQVGF GNNATAHQY---------------------- >tr|G9Z6X0|G9Z6X0_9ENTR Major curlin subunit CsgA OX=1002368 OS=Yokenella regensburgei ATCC 43003. GN=HMPREF0880_02976 PE=4 SV=1 --IYQY-GSGNNATALQSDARKSDLTIKQFGSSNGADVGQGSD-SSTIDLLQKGTANNATISQNSKNSDIQVQQFGALNGAVVHQ-TASDSSVTVHQVGF GNHASASQY---------------------- >tr|C9Y0V5|C9Y0V5_CROTZ Major curlin subunit OX=693216 OS=Cronobacter turicensis (strain DSM 18703 / LMG 23827 / z3032). GN= PE=4 SV=1 --IYQQ-GVNNNTTALQSDAKNSTTEINQLGTANGADVGQGSD-DSKILLNQNGFANNSTIDQNGHDSGVTVNQNGVQNGALVNQ-TASGSQVYVTQTGY GNHASASQY---------------------- >tr|J0W9T3|J0W9T3_9ENTR Major curlin subunit OX=1202448 OS=Enterobacter sp. Ag1. GN= PE=4 SV=1 --IYQQ-GNGNNATALQSNAFYSKTEIKQLGTVNGAKVGQGSD-SSDIKLLQDGYGNNATISQNGKNAQIDVQQFGTNNGAVVNQ-TASSSLVSVTQFGN GNHATASQY---------------------- >tr|A2TNE1|A2TNE1_9FLAO Putative uncharacterized protein OX=313590 OS=Dokdonia donghaensis MED134. GN=MED134_06424 PE=4 SV=1 --------------------NDNEAVVSQVGDDNASVVSQDGDFN-FADTAQNGNDNDAIVNQVGDSNFSEVDAQGNSNVARVYQ-GGQSNTSTILSSGN GNFADTAQQGTLNDSTVDQFGDGNSSE---- >tr|G4F9Z0|G4F9Z0_9GAMM Putative uncharacterized protein OX=550984 OS=Halomonas sp. HAL1. GN=HAL1_16111 PE=4 SV=1 --VKQSWGNGNEAVVVQ-DGKFNDSVIEQQGSYNTASVDQEGYIN-DSWVDQSGYGNEANVDQSGSNNQSGIQQSGLFGGANVEQ-TGGDNESWIEQSGS NNSDSVIQDGNYLDSTIITIGYNNT------ >tr|G9EB64|G9EB64_9GAMM Minor curlin subunit OX=1072583 OS=Halomonas boliviensis LC1. GN=KUC_1221 PE=4 SV=1 ----------------------NVSVVEQIGYYNDADITQSGKFN-DSYAYIEGFGNETTVNQDGFANKSVVEQNGAFNDAEINQ-TGSYNDSYAYMGGV RNDAVVNQNGYSLESTILANGDHNT------ >tr|F6GJ92|F6GJ92_LACS5 Curlin associated repeat-containing protein OX=983544 OS=Lacinutrix sp. (strain 5H-3-7-4). GN= PE=4 SV=1 --TVQEGDDNVAGVFQNS---GNTAIAEQYGDENVTRIDQDGRNKAHS-----------ISNQYGDDNTALVDQGD----------TGDNNVAYIAQNGD VNGAILGQNGRNHLAYQRQYGDGNIVSSSQ- >tr|F6GJ93|F6GJ93_LACS5 Curlin associated repeat-containing protein OX=983544 OS=Lacinutrix sp. (strain 5H-3-7-4). GN= PE=4 SV=1 --TIQEGDNNIAGVTQNS---GNTALAEQYGDSNATMIDQDGRNTAHS-----------VANQIGDDNWALTDQGN----------TGDNNAAFIAQNGD SNIAILGQNGRDHLAYQRQYGDGNIVASSQ- >tr|I0KE93|I0KE93_9BACT Curlin-associated protein OX=1166018 OS=Fibrella aestuarina BUZ 2. GN=FAES_4447 PE=4 SV=1 --IKQAGESHTAEVLQTSVSAENEARIDQSGQSGLALVYQTGATNNQAGVTQNSAQPNAAIYQTSYYNQADIQQEGAGSTVSIQQSSSTGNVAVVEQGTK KSVVTIGQENDANVAQIQQGGENNQVTATQ- >tr|D2QPP3|D2QPP3_SPILD Uncharacterized protein OX=504472 OS=Spirosoma linguale (strain ATCC 33905 / DSM 74 / LMG 10896). GN= PE=4 SV=1 --ISQEKSGQTADLRQTDGSNDNRASIEQLGTNGSATLYQSNASNNTASILQTSAGSQATIYQTSTYNTATIDQALDANRATINQSSGSGNQATITQGSD LNQATISQEGDAHVASLMQGGLSNQATLTQ- >tr|I0KC59|I0KC59_9BACT Curlin associated repeat protein OX=1166018 OS=Fibrella aestuarina BUZ 2. GN=FAES_3705 PE=4 SV=1 --ITQTGEGSVGTVEQNNNSRGNEVTITQIGATYENTVRQSNSRNNEASVTQTGRGDVVNVYQTSVGNSATIEQ--DGSQATIRQTDGDNNTAQINQGSR NNTALITQENSENQARLLQVGNGNSATLSQ- >tr|I2GP08|I2GP08_9BACT Curlin associated repeat protein OX=1185876 OS=Fibrisoma limi BUZ 3. GN= PE=4 SV=1 --ITQLTSGAEATINQIDNSGGNEASISQAGASGAATINQTNSYNNLAAVQQSTGGHTATINQTSHDNQAFVNQNDGGNQATINQRNGSGNTAEIVQGAS GNVATLTQDGSSNTAKLYQAGDLHEATITQ- >tr|J0LL29|J0LL29_9BACT Uncharacterized protein OX=1144253 OS=Pontibacter sp. BAB1700. GN=O71_11009 PE=4 SV=1 ------------------AGLDNNASISQIGDMNVAETSQIGDAN-SSVTEQ---------AQDGTTNSAAVSQSGSLQTATVAQ-MGNSNSVAVNQTGE GNVAGSLQVGDLNSIVTNQSGSGNYSGTYQ- >tr|Q26FL3|Q26FL3_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02233 PE=4 SV=1 ------------------SGATNEAYATQFLSSDESYQEQLGTDN-KAVVNQSGGDNLVEQFQDGDRNEARSTQNGNNNEVNQEQ-YGMDNYSRVSQGHV GNMATSIQDGADHHSVINQRANGNEAMVDQ- >tr|Q26FL5|Q26FL5_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02231 PE=4 SV=1 ------------------SGATNEAYATQYLSSDTSYQEQLGTDN-KAVVEQSGGNNLVEQFQDGDRNEARSDQNGNGNSVNQEQ-YGQDNFSSVKQRYA GNMATSIQDGSLNNSVINQQSHGNMSMVDQ- >tr|Q26FL4|Q26FL4_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02232 PE=4 SV=1 ------------------TGELNEAYASQKGTDNTLFQEQIGSGN-KAVASQSGSSLYAEQYQDGTDNEARSKQNGSNNEAYQEQ-YGDQNFSSIEQRGP GNVALSIQDGSLNHSMIEQKARGNSAIVDQ- >tr|Q26FL6|Q26FL6_FLABB Putative uncharacterized protein OX=156586 OS=Flavobacteria bacterium (strain BBFL7). GN=BBFL7_02230 PE=4 SV=1 ------------------SGGYNNAYSTQYGSDNSSFQEQNGLLN-TAEVHQTGGDNYAEQYQLGGLNEAYIEQDGQNHSANQEQ-YGIFNESVVIQNGS GNQALSIQRGMNNDSYINQMANGNIADVDQ- >tr|E4RJ05|E4RJ05_HALSL Curlin associated repeat-containing protein OX=656519 OS=Halanaerobium sp. (strain sapolanicus). GN= PE=4 SV=1 -----------------------KAFIIQNGEENYGCIDQHGYWN-WAGIYQKGDNNTSNLDQYGDFNFAGILQYGNDNQANIYQ-NGFKQWAGIGQFGD GHTASISQYGSNNMALILQGGNDKNAIIIQ- >tr|H2J0L2|H2J0L2_RAHAC Curlin associated repeat-containing protein OX=745277 OS=105701 / NCIMB 13365 / CIP 78.65). GN= PE=4 SV=1 ------------------NSSGNASMIYQNGNSNLAATNQTGYGN-AGVIRQAGSENTALLNQRGNGNNADISQSGSDNFAYVSQ-TGGG-DASITQQNF GNTAYILQKGR-GVTEIRQNGTNQSSGVVQ- >tr|K4KYG6|K4KYG6_9GAMM PPE repeat-containing protein OX=1117647 OS=Simiduia agarivorans SA1 = DSM 21679. GN=M5M_09000 PE=4 SV=1 -----------------QLGSVNSADIQQSGIGQRASVQQLGDNH-SARVSQQGEDNLLFLIQSGSLNQLVLQQSGFANSANVHQ-DGSANTATIQQQGN QNSVVLQQYGNHHAASITQYGDQLSVSVRQ- >tr|K8RLN5|K8RLN5_9BURK Uncharacterized protein OX=406819 OS=Burkholderia sp. SJ98. GN=BURK_013498 PE=4 SV=1 ------------------GGATQVARVSQHGQGNRLDAIQAGHN--GLMANQIGIANAAATAQAGNDNWTVLQQSGAANLYAGYQ-SGQYNRSIAMQNGS GNEATVTQVGNSMSSSVTQLGNNNEISILQ- >tr|Q5QXG8|Q5QXG8_IDILO Minor curlin subunit CsgB, nucleation component of curlin monomers OX=283942 OS=Idiomarina loihiensis (strain ATCC BAA-735 / DSM 15497 / L2-TR). GN= PE=4 SV=1 -------------------ALANHAEIAQLGSYNLTSVIQSGEQN-YAYLVQSGFENTLSLEQQGFNNSVTAEQSGRGNSAIILQ-LGNSNLIQLQQLGN DNAITIQQSGSAAEMSITQF----------- >tr|K4KYG6|K4KYG6_9GAMM PPE repeat-containing protein OX=1117647 OS=Simiduia agarivorans SA1 = DSM 21679. GN=M5M_09000 PE=4 SV=1 ---------------------NHAALVEQLGSVNSADIQQSGIGQ-RVALVQSGYNQHASVQQLGDNHSARVSQQGEDNLLFLIQ-SGSLNQLVLQQSGF ANSANVHQDGSANTATIQQQGNQNSVVLQQ- >tr|I3C6C3|I3C6C3_9FLAO Curlin associated repeat-containing protein OX=926559 OS=Joostella marina DSM 19592. GN=JoomaDRAFT_2176 PE=4 SV=1 --------------------PLNLAIVDQDGRRNEADVDQIGVLN-VASVSQDGRRNSSTQFQLGAGNLAVDSQTGRRNDADTYQ-LGFLNQSYVTQDGR RNSALAVQTGIGNMSNVNQSGMDHRA----- >tr|K8RLN5|K8RLN5_9BURK Uncharacterized protein OX=406819 OS=Burkholderia sp. SJ98. GN=BURK_013498 PE=4 SV=1 ---------------------HNGLMANQIGIANAAATAQAGNDN-WTVLQQSGAANLYAGYQSGQYNRSIAMQNGSGNEATVTQ-VGNSMSSSVTQLGN NNEISILQSRNGSGLSVTQTG-GARAAVL-- >tr|K8RD90|K8RD90_9BURK Curlin associated protein OX=406819 OS=Burkholderia sp. SJ98. GN=BURK_013503 PE=4 SV=1 ----------NTAAVHQTDGGSNLAAAVQVGSNNNMDLTQQGAFESTLLAGQGGGFNEAKGKQSGSHEDMTLTQVGYFNSADASQ-TGAYNTASIVQIGN ANYGSVTQSGIGNAAAITQGGSFNKAVVVQK >tr|C6X5S4|C6X5S4_FLAB3 Putative uncharacterized protein OX=531844 OS=Flavobacteriaceae bacterium (strain 3519-10). GN= PE=4 SV=1 ------------------YSQANNDVSTQVGDDNTATINQTGWLN-FNVLDQFGDDNVATVDQD-NNNSIDV----NGNDAQTDQ-NGLLNEVVQLQDGN NNDTYAMQDGTG---W---EGNGNDASSEQ- >tr|H7FR16|H7FR16_9FLAO Minor curlin subunit CsgB OX=1086011 OS=Flavobacterium frigoris PS1. GN=HJ01_01802 PE=4 SV=1 ------------------YAQSNMSHVDQMGMTHDAMVNQIGWNQ-DSDISQKGTSNKSEVYQGNNGNEADVIQDGKRNTAFISQ-NNWGNDATQTQTGD DNKATIWQDETSGGATQTQTGKHNIATIDQ- >tr|A5FHE3|A5FHE3_FLAJ1 Curlin associated repeat protein OX=376686 OS=(Cytophaga johnsonae). GN= PE=4 SV=1 ------------------IAQGNTSSVNQVGNSDTGIVNQNGQTN-DSKIDQLGDSNKSEVYQGN--NKADVKQDGNSNGAFISQ-SNHDNQAYQTQKGN SNSATIWQDQIVNAAWQTQTGNNNTATVDQ- >tr|A5PB12|A5PB12_9SPHN Curlin associated protein OX=161528 OS=Erythrobacter sp. SD-21. GN=ED21_23556 PE=4 SV=1 --------------SPTDVGRDNSATVIQSGDGNTSAIDQR--RDSTALVTQSGGDNSAIVEQLGSADVSNILQDGANQFANVLQ-NGSSEYSSIMQNGD GNSALVDQSGSNNESYIDQNGTGNAATVTQT >tr|F7SNL8|F7SNL8_9GAMM Putative uncharacterized protein OX=999141 OS=Halomonas sp. TD01. GN=GME_10456 PE=4 SV=1 ----------------IQFGSANQSYIRQQWDTNQALAVQVGNDG-ISHIKQAGSSNLAVLADDGSFNSSYILQTGDTNTALVGQ-SGMDNDSYVSQLNG DNNFAVAQLGTDGESDIFQNGSGNTALVGQ- >tr|H0J186|H0J186_9GAMM Curlin-associated protein OX=1118153 OS=Halomonas sp. GFAJ-1. GN=MOY_06972 PE=4 SV=1 ----------------EQSGAANESFIKQGRQDNYADAQQHGNDG-LNMMRQRGNDNDAYLMQTGNDNESYVFQAGAGNSAVITQ-SGSDNDSYADQVGG GNEVTVTQSGNDALSTIYQRGDNNVADVSQ- >tr|J2X9G7|J2X9G7_9PSED Curlin associated repeat-containing protein OX=1144325 OS=Pseudomonas sp. GM21. GN=PMI22_00543 PE=4 SV=1 ------------------------LAQSQVGIGNRAVINQSNVVN-SISAKQEGLNNGLEINQSGAYNQTGVVQQGTSNLASVVQ-NGAYNQAFVGQYGQ ANQATVDQTGFGGTVSIIQHGDNMSANATQ- >tr|H1R220|H1R220_VIBFI Putative uncharacterized protein OX=1088719 OS=Vibrio fischeri SR5. GN=VFSR5_2495 PE=4 SV=1 --------VGNDADVLIRNSDDNEVDIYQMGRHNEAVVRVNGSDDNDLMIEQDGRHNRAVLKAGSDDNDFAIEQDGRLNLGKIIADDSDNNNGLIDQRGR KNEAYINFDGASSNNKIVQRGRRNEGEI--- >tr|J2X9G7|J2X9G7_9PSED Curlin associated repeat-containing protein OX=1144325 OS=Pseudomonas sp. GM21. GN=PMI22_00543 PE=4 SV=1 ----------------------SVATISQIGSYNKTRVDQNGAAQ-RLAQSQVGIGNRAVINQSNVVNSISAKQEGLNNGLEINQ-SGAYNQTGVVQQGT SNLASVVQNGAYNQAFVGQYGQANQATVDQ- >tr|D2QT94|D2QT94_SPILD Curlin associated repeat protein OX=504472 OS=Spirosoma linguale (strain ATCC 33905 / DSM 74 / LMG 10896). GN= PE=4 SV=1 -------------QVQDYFSSGNTASITQTGNNNKSGVFRDGGSENTFTSVQTGNNNRILAPQYGVGNMIDITQTGNSNLTTLRQ-LGNGNDLMVSQSGN TNMLSVTQSGLGNMATVSQLGSGNSASVVQ-