>seq_1 STEGVTDDMKKIIEEVRVGVDDLVEEMVKSAGHCCSIEKKHALHNLQCAADAECRIIPGELYRILLDENKEESTDFYCLKKLDGNILSHYDRYKKRHRHA EFSDKVGKTEYQPSDVEEWLECIKCRTHHHATCTKYHPAKDGKQFICSRCDSSKITICRAEELETTKATEFFERELSDAFPELKILCRILSAVQKYGDHF QKNQETEFPNLGKLPPVPYIKKTYGFFVERDGVDVLFFTLVTHEYKNVASRKNNTIHLEYLDSVKYVDGQFCAKVHISVFQTLCSYARDAGFESIHFWAC PPYENDDYVLNAHPQPARKATFKELFGWYDKAAVCEAVEQVIRGYYKKTNDETINELLDAVCYENYLITAELEQTLGKHEALSDEEKMKKLEYLLQKKYR ANMYSIKLKKADQGEVQDYPEILFPGSFALSDTFVAKQRAHKLEFHTIQSAKYATQIIIFSV >seq_2 --------------------DQFLN--IMTSAFCCKRFLVATSVKQQCNHETTCWIKDGDRCYKNGE-------KYYCMEEKNSQ--------------- --TSRMKKVIYRHFQPEKYETCTTCDTVWHPKCRLTRLLKDEDSLPCRNHHIK---------FKETPLQKALSKNYHKNRGPELQFVHAICETVTPGEEL EKVK-------VKPEDYTHEANQIFVIVMIKNVETLIFALNTQEYR----SGPKWVVIEYLDSLSIL-TKDRSAIYKRIIRSYLKYARDSGFKIAHLWSC APDNGVDYIFSGHPKSQRFLRKDQLDEWYLSIFQECDGGKVDVGYAKYFDEV--LKKTTVEEVVAGFWTKEIEEIIKKMKENKNDSLENFKIQKIKDASI NSLFYVELANGDEGKETNETTE--ECEEFSNDKFLDQCS-YDIEFTDLRQAQFATRKILMEV >seq_3 AVHKFFLELLKVNRGRKTEE-AKVTRKLRNLGLCCG-ELREYKATTMSCGGNCDGIKANESYYGYKE-------YCYCVDADCMAATYLTLSEEK----E VHKNRFEECCNVSI-AEPLIECNACCNFTHEPCKALQEDSIM--CKCDVK------TNMLTNFVEEALSQLNF--------IKLLKVRIVRNVTALAPIG QKMKSFLEKSLPLPQQVKCKERKIVVYQKQGAVYVNFLSFSVKEYESG------WIVLYYLDSVQHLEPKKRSAVYRKVLNAYLSYAENCGYQMAHIWAD APKKGVEFFFNERPEKQHWCSQDQLEGFYKKMLEEGNRYRIEYNFK--------KTKSYTKLLQFDFFMDCMEEYIGSMDGKKVTFKEFKSLVKKAESRD GALFYINLCADPFNGEVNDVDEEIKCDFAVSEDWINFQFQNKLSFDT-RRNAFATIKTIEAL >seq_4 -----EVDLNKIMIGLENGLCSSADPIMQKLGYCCA--TYYYKDITFTCTGNRCPIKSGHKYLEYVN------SYRICYEKLEYK--GIFTDDLNKSAVL IPRNQLSDILVNKAHADELIRCRECGRRFHALCVLYKEHMWP--YICKSCVQRLEKRNRLSDFIEERVHRFLRNKKSIT--TGKVRIRVLSSVDKFMLAK PLMRAYLHAKRMSTEPFSYRQKVVFAFQEVEGNEVCIFGFYSHEHGLNS-LSQNRVFIECVDCLRYFQESECAVLHQIIITAYLEYVRNLGFWCAHLHSG SKEEGEIAIFFGNSQNPLIRNRAARRVLYNGIFQILINELIAQSVEDIYDE---SLRSDARNFEGDLWPDVIEF---CLQQAVNNVRQITQLYDTLKDYR KDFHVVRLNSP-------DPDPLIQC--LQNEKFVMEVERNKLDFAILREAKFLTTSMLFEL >seq_6 KALS-TVEQEDIVTKLKKAH-CEFIDMFKALGLCCGSIKEELLQTIGCDEGDKCRIRRDDKYYHYKN-------FAICVDLLEDR--FRTRDNKK----I LHKTDFDFKTHDEVENEAMEVCSHCGKHWHDSCRMNQLFANRSRSMCFKNHDDSIKENIPITVASDRIEEFVNDKIIGTFPAKKISIREVATKITTIKSN NKMSKFLKKTTGESLKLKYRYRQFIAVQKMQGRELLLFSFSVQEYLDGF----KWTNVEYVDSVKYVLPKLRTTIYQTILLGYFKFAASIGFQHAHIFAM APQEGDSFFFRGRPESQKVSNQEHLINWYHNFLNIGK-EDIIDSFKDYSSD---YEMLAAPEFVGGLFTYHFEQILKEIKISLFNNKSRQA--------- -------------------PTPVVYVEYKTSDEWINFQETEKLEFSSLDQAHYATLMIV--- >seq_7 IVQH-PMDLGTIKRNLAAGEQHACLVCRGNTCIVCNQQCLEVTQPHLQCGGCSTDIRKGSVYFVSED-------RVWCQTRLARD--RNSPDREPDDDTA LLDSLIKKKCEAGVEP--WVKCGECDRWLHQVCGLYNPVIGAPSYACPLCRCRRKRSCELSIFIQNFLRRGLDKIGEREA-AQTLYVRALSFPGERMAVP EAVVRTFDENQRIPAHISYLSRGLYLFQKHEGMEVCLFTIYAQEFGDDCELEANAVYIAYLDSVRYLKASARTAAYHLILLAYFDYVRRHGFSRVHIWSC PPQKRISYVFWCRPSFQKTPSAEHLRRWYNKLLTKAKDCGIVKDWTTMYDR-----------FSE---------SVCGAVAKDDSKSSASF--------- ------------------NPNELELPPIFDGDRILGIMSREKQKRAS-ENKK---------- >seq_8 IVQH-PMDLGTVKRNLAAGEQHVCLVCRGNTCIVCNQQCLDVTPPHLQCAGCSTEIRKGSIYFISED-------RVWCQTRLARD--RNAADRAPDVDTD AMLDALIKKKCEA-GVEPWVKCGECDRWLHQVCGLYNPVIGANPYVCPLCRCRRKRSCELSIFIQNFLRRGLNEIGELEA-AQTLYVRALSFPGEHLTVP EGVVRVFDENQRLPARISYLSRGLYLFQKHEGMEVCLFTIYAQEFGDDCELEANAVYIAYLDSVRYLKTSARTAAYHLILLAYFDYVRRHGFSRVHIWSC PPQKRISYVFWCRPAFQKTPSAEHLRRWYSNLLTKAKEYGIVKDWTTLYDRC--GSQSNGESFDGDIVPSELERLGRIISRNEKQKRASENDVKVREVFT KCQFAVQRLKNDLLVVDLEQ----VPRFFSSFMFHQLCSLAGYQFDSLRRAKHSTMMMVHHY >seq_11 IVQH-PMDLGTIKRNLAAGEQHVCLVCRGNTCIVCNQQCLDITPPHLQCAGCSSEIRKGSIYFITED-------RVWCQTRLARD--RNSTDRAPDDDTD ALLDSLIKKKCEG-GVEPWVKCGECDRWLHQVCGLYNPVIGTNPYVCPLCRCRRKRLNIPSCELSIFIQNFLRRGLNDIGEAQTLYVRALSFPGEHLTVP EGVVRAFDENQRIPARISYLSRGLYLFQKHEGMEVCLFTIYAQEFGDDCELEANAVYIAYLDSVRYLKTSARTAAYHLILLAYFDYVRRHGFSRVHIWSC PPQKRISYVFWCRPSFQKTPSAEHLRRWYNNLLTKAKEYGIVKDWTTLYDR-----------FSD---------SVCGSTANVDATPSSSF--------- -------------------PDELVAHQLFDGERILGIISREKQKRAS-ENKK---------- >seq_12 IVAH-PMDLGTIKKTLDAGKAHACNVCAGHICALCDDGCLELTLPHYQCGNCGTVFRKGMSYYVTRD-------RMWCMNRGMKEEKTKRDEDDENGGNA AEMAPLLSKKKCEVDVEPWVRCGTCDRWMHQVCALFNAVEDANAFVCPLCELELEAESPLSLYLEARLRTEIPDVD-----ADSLFVRESTFANHKFVLP PSIVAFQMNARELPVRVSFATKSIFLFQKQQGVDVCLFAMYVQEYNDVCELVENSAYLAYIDSVRYLQPTIRTRVYHRILLAYFDYARHHGMDRIHIWSC PPTRSQSYVFWCHPSFQKTPSVDHLRAWYKRVLQKAHDEKIIDGYTTLYERLPEDKTTNMAIFDGDIVPSELDRVIRQLKSKKRKKIWAADFLSAMKIVK DDLLVVDLAPPTSPLAIPTSER--TSAMIGSFSFHQLSLRASYQFDSLRRAKHSTMMLLHHM >seq_13 IIAR-PMDLGTIKKTLDAGKAHACSVCAGHTCALCDDGCLELTLPHYQCGNCGTVFRKGMSYYRDGT-------RMWCMNRGMKE--EKTKRDEDEGG-A AEMAPLLSKKKCEVDVEPWVRCGSCDRWMHQVCALFNAVEDA-AFTCPLCELAAKQESPLSIFLEQRLRAAIPTGD-----ADALYVRESTFANHKFFMP PSIVEAFKMNRELPVRVSFATKSIFLFQKQQGVDVCLFAMYVQEYNDDCELVENSAYLAYIDSVRYLQPSIRTLVYHRVLLGYFDYARHHGLDRIHIWSC PPTRSQSYVFWCHPTFQKTPSVDHLRAWYKRVLQKAQDEHIIDGYTTLYERAKAAKAPELPPFDGDIVPSELDRVIRQQKSKKRKKIWAADFLSAMKIVK DDLLVVDLAPP----TAPPPRSMMMPAFIGSFSFHQLSLRASYQFDSLRRAKHSTMMLLHHM >seq_14 IVRH-PMDLGTIKARLESAQGHACDVCCGHTCAACDQRCLELLVPFSQCGNCGTTFRKGSTYFRDGT-------RVWCANKSMKE--ERILQEDDE---A SEVSAWLVKRKMDVGVEPWVCCSCCGRWMHQVCVLFNPVEAA--FVCPHCQLQSRRETELSRFLEANLGDAAGDA------IDSLCVRTMTFTGQESHLP TEMVHLFQSNVDVPDKLTHNTKTIFLFQKHHGVDVCLFAMYVQEYDDSVEYAPNSVYLAYIDSVRYMEASIRTGVYHSILTSYFDYIRRHGMERVYIWSC PPQRSQSYVFWCHPPFQKTPSGDHLRSWYKQVLDKAQARGIIQSYGTLYDRAAQWKKDAAKGFDGDFIPGELDRIARALKAKHKPRRDKEPFLTAMKAMR DDLLQVDLCRSNEGAVVPAKDPLP--PFVGSFAFHQMSSHASYQFDSLRRAKHSTMMLLHQM >seq_15 VVRH-PMDLGTVKASLDKGLAHSCDVCLGHTCAACDQRCLELVVPFNQCGNCGTTFRKGSTYFVTRD-------RMWCATKSMRE--ERLLQEDDECFQA SEVSAWLVKRKCEADVEPWVQCSCCAQWMHQVCVLFNPVEDANRFVCPHCRRVASIETELSRFLEASLGDAAGDA------IDSLCVRTMTFTGQVAHLP TEMVRIVREHVDVPDRLVHSTKTIFLFQKHHGVDVCLFAMYVQEYDDSVEYAPNSAYLAYIDSVRYMEASIRTAVYHSILASYFDYIRRQGMERVYIWSC PPQRSQSYVFWCHPHFQKTPGVDHLRMWYKRVLDTAKARGIIASYGTLYDIM--QLKKDSAKFEGDFIPGELDRVARALKAKNKPRREKEPFMTALKGMR DDLLQVDLTPPPAGCPL--PDP--PTKLVGSFAFHQMSSRASYQFDSLRRAKHSTMMLLHQM >seq_16 IIKK-PMDLGTVRDRLASGYNHFCSSCRGTPCRICGEKCLRYSPPVFVCGDCHGRILRCSTYYRGQK-------GRYCQAK------KIAGMDK-----A DRKNLLVKKKNDEMFPESWVQCSRCHEWLHCICGLVHPRQVTNNYVCPICLSEDPRSEYLTERVRQRVDEVVQKLPKRFRPKQSLIVRVVSNITTSVTVK RSIAPLFTTNHSDELSLPYRSKCIAFFQHRNGVDILLFVLYVHEFDENT-IPANCVYISYLDSVHFLSRYLRTPLYHTLLNGYLAYAKSNGYCRAHIWAC PPSRGDDYIFPHHPRDQRTPNADHLIGWYRQLLAEAVEAGIVSHASCQLDE-----------INARSNNPDINQQIRCFSIREGAETELPGSPFSLSSCS SVYET-------------ENDSEVGMDIINEEAFFS----SDISSMSLPTPS------IPSI >seq_17 IIRE-PMDLGTAIKKLNSGQNAFTSSLMKSLGYCCN-KRYVYTPQPLYCSSDFCNIPVNTYYYKYDE------RIFYCVESFGDY-VDCIVDGIGSK--R IKKDLFLKTRNNETIYEPFVSCEVCKKKFHKVCVGHVDTI----FYCDRCFEEFKIECSLSNYIESRVNEFLLKE----CKYLYVSIRMICNIQKILMTK QGLCRFG---SKFPSRFPFRSKALFAFQEVEGREVCFFSMHVQEYGEDC-PHPNRVYLSYLDSVHYLQKNYKTRVYHEILLSYLGFCKERGFEYCHIWSC PPCEGDDYVFHCHSYEQKLPKPRRLLDWYNKMIELGIQKKVIIECQDIYKYF-NYVGEMAHSFEGDYWSNSLEDLIIEHEKESLNSSNHQSLHSLMDKYK EVFFVLKLNEPG------DNNPILQSELMEGSGFLSLCRENHWEFSSFRHAVYSTRAMMYVV >seq_26 VIKH-PMDLTTIRNNLEDGIESQVDQVMQAMGFCCG-HEYSYQ-QILYCSANVCTIGKDAHYYMYTNV-LICDTYYQCENE-AGD--EIMLADEANQ--P IRKELFERKKNNVTIEETDVHCKECGRRWHKVCALHMDEIWT--FVCSGCLRERGLTNKLSNFLERRVNDFLKKK----EVTGEVTIRVLASSDKLVEVK PLMRRFT--EGELSESFPYRLKAIFAFQEIDGQDVCFFGLYIQEYGSES-PQPNRVYVAYLDSVFYFRKQYRTDVYHEILVGYLHYAKQLGYTMAHIWAC PPAEGDDYIFHMHPLEQRIPKAKRLQDWYKRMLQKAMIEGIVADFKDILKDA-DHHLVSPTEFEGDFWPNTLEEILQDLDKEEERRRREEAVYDTMEKLK EIFFVIRLHKHG---ATADPDALISSELMDSDSFLQMAREKHLEFSSLRRAKYSTLVMLYEL >seq_33 IVPN-PMDLSTIEKKLQIGDESLIDPLMRDLGFCCG-RAYVHHPHVLSCTEKSCNINRDDIYYVCETKKEFLDKYIVCDQS-AGS--TITID-DQSS--H LEKSRFKKQVNNTIVHEKFVHCRECGRKWHKVCACHMDEIWP--FVCDNCRKTYNIKCKLSEFLEKRVNDFMKKN----EPTKEVTIRVLASGYKSVEVK KYMRVFY--AGKFPESFPYKTKAIFAFQEVDGQEVCFFGLYVQEYSSDC-PPPNRVYIAYLDSVYFFQKQYRTDIYHEILIGYMHYAKKQGYANAHIWAC PPGEGDDYIFHMHPVDQKIPKPKRLQDWYRKMLQKGHNERIVFDYKNIYEDA-ETKHLLPTDFDGDLWPNTMEELLKPLVENRRR--------------- -------------------------------------------------------------- >seq_35 IVPF-PMDLSTIEKKLKSGQEGIVDPLMQKLGFCCG-HAYVHHPHVLTCTDKSCNINRDDIYYVYETKKELLDKYIVCEQN-AGS--TITID-DQSS--H LRKNLFEKQVNNTIVNEKFVNCRECGRKWHKVCACHMDEIWP--FVCNTCRKMYGITCKLSDFLEKRVNDFMKKN----ESTNEVIIRVLASANKTVEVK KYMRIFS--DGKFPESFPYRAKVIFAFQELDGQEVCFFGLYVQEYSSDC-PAPNRVYIAYLDSVYFFQKQYRTDVYHEILIGYLHYAKKQGFANAHIWAC PPGEGDDYIFHMHPPDQKIPKPKRLQDWYRKMLQKAHNERIVVDYNNIYQDA-DTNCLSPVDFDGDLWPNTLEELLKPAVESRRK--------------- -------------------------------------------------------------- >seq_39 IITH-PMDLSTIRKKLENREETTIDPVMQQLGFCCG-REYYYLPPPLTCTPKFCTIQRDAVYYVYKNPGLLEQKYTVCENE-AGD--EIRLDSESST--T VQKNLMEKCKNDRKEKEPFVFCKKCGRKWHRVCAIHADEVWP--FVCNRCMRDYGLTCRLSNFLEKRVNDFLKKK----GTASDVIIRVLASADKTVEVK PGMKRFC--EGDMPESFPYRVKAIFAFQEIDGQEVCFFGLHVQEYGSEC-PPPNRVYLAYLDSVYFFRKQYRTDVYHEILVGYIHYAKLLGYAVAHIWAC PPSEGDDYIFHMHPTDQKIPKPKRLQEWYQKMLKKALLERIVVDYKDICQEA-ESGLVSPTEFEGDFWPNTLEELFKEMDEEDAKRKREQEVYEIMEKHK ENFFVIRLHPQN---SVSDLDPLINSDLMECGAFLERAREKHLEFSSVRRAKYSTLVMLYEL >seq_43 IVKN-PMDLSSIKKKLEEGGEADIEPIMKKLGYCCG-RKYVFNPQVLYCGKNLCSIPRDTPYYSYQN------RYHYCEKEFEGE--SIDLGEDSGL--I INKNEFEKLKNDHLDLEPFVTCVECGRKMHQVCVLHLDIIYE--YICPTCREEKKLQSKLAEHIENRVNNFLKTQ-VT-DEPGYIHIRIVSSVEKVVEVR QGMRRYG---NEFPEKFPYRARALFAFEEIDGVDVCFFGMHTQEFGSDC-PEPNRIYLSYLDSVHFFRPHLRTSVYHEILIGYFDFCKNRGFHYGHIWSC PPSEGDDYIFHCHPVEQKIPKHKRLVEWYKVMLNKACEDGVMAGYKDMFTQA-EDEITSPTQFEGDYWPNAIEESIREISQEEEERRKTEAVFTTMEKFK EIFFVLHLHS-----PANDPDPVMSCDIMDGDAFLTLSRERHWEFSTLRRAKFSTLAMLVEL >seq_45 IIKN-PIDLSTIKFKLQMGLRSVIDQPMQSLGYCCG-QRYAFEAQVLCC-TQACSIPVGAIYWEDNN------GIKYCQSKI-SE--DCTLGDNSVD--S LKAKQFQQRKNNSLKPEPFIHCKNCNRKQHEICVLYMEIMWP--FLCNTCLKIKGKKTELGLYLENHVNSYLEKM----NCAFRVIIRVLSSSEKSVEVK PEMKKFF--EEEQPLWFPYRAKAIFAFQRINATDVCFFGMYVQEYGSEC-PFPNRVYIAYLDSVHFFQKQHKTAVYHEILLGYLSYSKHLGYIMAHIWAS PPSEGDDYIFNCHPPEQKIPKPGRLHEWYKILIDKGIKDGIILDYKDIFMQA-EDNLKSVSEFDGDYWPDVLEHIIKELAIEERKAKACVVLYSTMEKFR KAFYVIRLHSTE---TAEDPDPLISCDLMDSDVFLTFTRENNYEFSSLRRAKFSTMGLLCEL >seq_48 VVKR-PMDFYTIKKKLNEGNVKWVEPIMKDLGYCCG-NKLTFTPLALFCGTSMCVIGRDQPYYLFEAGVTVSDKYIYCVEALPEE--GINLSDSPNE--M APKKSFQLMKNDQLDNEPFENCKICKRKWHKICALYNKKIFA--FICETCRCEKLIHCHLSRFIEDRVNIFMNKN-GGIEADYEVVIRVLAVQDKEVEVK PHMKRYG--KG-FPEKFPYRTKAIFAFEVVDGVEVCFFGLHVQEYGSNC-PPPNRVYIAYLDSVHFFQRHLRTDVYHEILLAYLDYVKDLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKKMLDKGLRDKVVSEYKDIYKQA-DDNLNTPMAFEGDFWPNVIEDCIKEVEKEENERRKLEALLGVLEKHK EVFFTIRLAT-KS--DKDDPDDLMPSDLMDGDTFLNKAREEHWEFSSLRRAKYSTICFSYAL >seq_58 IIKH-PMDLSTICVKLDSGRVSEMDQVMQQMGYCCS-RKLSFTPLALFCGASMCTIARDQPYWVYEQ----TSSYTYCLDALPPE--GISLSENPNDQSM APKDKFVQMKNNVIDYEPFEVCKYCHRKWHRICALHDKKVFP--FICDTCRKEKGYHNKLSQFLEDRVNTFLKNAMPS-NPQYEVIIRTLCVQDKEVEVK PLMKKYGPQG--FPDKFPYRTKAVFAFEIIDGVEVCFFGLHVQEYGSNC-KDPNRVYIAYLDSVHFFQRELRTEVYHEILLGYLDYVKRLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKKMLEKGVAEKTVVEFKDIYKQ---DNLTTPMSFEGDFWPNVIEDCIREAGNEEAQRRKEVALYSQFEKHK EVFFTIRLVTQQ----SADPDPLMASDMMDGDTFLTRARDEHWEFSSLRRAKYSTLALCHAL >seq_69 IVKH-PMDLLTIKEKLLAGEMEEIDPVMRRLGYCCG-RKLAFTPLALVCGQPMCTIPRDAQYYCYETIGVNSERYTYCKEEYPGD--SINMSEDPNS--L ISKSNFSLMKNDQIEYESFVECGVCGRKWHQICALYHEQIYSFRFVCSSCSGLSSSPCKLAAHIECRVNNYLRKK--D--SAGEVIIRVLASSDKEVEVK PLMKKFC--TGEIPEKFPYRTKAIFAFEVLDGIEICFFGLHVQEYGSNC-PQPNRVYIAYLDSVHYFQKQYRTAVYHEILLGYLEYVKLLGYTMAHIWAC PPSEGDDYIFHCHPAEQKIPKPKRLQEWYKKMLDKGIVERIVVDYKDIYKHA-DEHLQSATEFEGDYWPNVLEDCIKELEAEEAERRREAEIYAMMEKHK EVFFVVRLHSAQ---AAADPDPLISCDLMDGDSFLSIARERHWEFSSFRRALFSTICLSYEL >seq_75 KIKN-PMDLSTIEMKLNNRQ-LSVNCLATLVIYDCY-NLSVIIYTVITVSDNAYLVGMNAFSYSHMLLLLLKNRYAYCENDLPSP--VIDI-DEPTQ--K VNKSEFTRERNDKLEGEAFVTCTVCDRKMHQICVSHFEPVTT--FVCKNCQASTGMNTKLGTYLENRVNQFLKKK--D--SSVEVSIRVLSSFDKFAEIK QGMKRFS--DLVDPPQYPYRVKAIFAFAEIDSVDVCFFGMHVQEYGSEC-PDPNRVYIAYLDSVNFFQRQYRTSVYHEILLGYLEYAKSLGFLTAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYRKMLDKAVEERVIVEYKDIIQDA-DGQMQCLAGFEGDYLPMTIEEVIK----------------------- -------------------------------------------------------------- >seq_77 VIRK-PMDLSKIRRKLDTGVEPEIEALMKSLGYCCG-RYYYFNPQVLCCGKPMCTIPVKASYFGYLN------KYVYCENDIKSK--DVELTDDPTQ--I IEKASFSRLKNNKQEPELFVECTHCGRKQHQICSLYYQPIWP--FVCHHCCKHANIQTKLGTYLENRINSFLRDN--K-S-SGDVIIRVHSSSEKFVEVK PGMKRFCASQSEFPESFPYTAKAIFAYEDINGTDVCFFGMHVQEYGSKC-PPPNRVYIAYLDSVNFFQRHFRTSVYHEILIGYLDYVKNLGYTTAHIWAC PPSEGDDYIFHCHPPEMKIPKPKRLADWYKKMLDKAIIERVVIDYKDIIKDA-ENGMKNVTEFEGDFWPNVIETCISEMEQEEIGRK------------- ----------------------------------FAIAESCSGDFAATA------------- >seq_80 IIKH-PMDLSTIKDKLDNGIDNEIDEAMISLGYCCG-RRHVFSPQVLCCGKQLCSIPRDTVYYNYLDVVDDVVMYIYCENELQGE--EVEMVDDPSQ--K IKKGQFVREKNDKLDHEQFVNCKECDRKMHKICVLHMENIWP--FLCDYCLKALQTTTKLGTYLETRVNSLLKKK--N-S-AGEVTIRVLSSYDKVVEVK SGMKRYV--NSEMPEGYPYRVKAMFAFEEIDGIDVCFFGMHVQEYGSDC-PMPNRVYIAYLDSVHFFQRHLRTTVYHEILIGYLEYAKIQGYEWAHIWAC PPSEGDDYIFHCHPPDQKIPKPKRLQDWYKKMLDKAYQDRVIIDYKDIYKDA-EGNITKVTDFEGDFWPNILEECIKDLEQEEVEKRKRAELLSTMEKHK EGFFVIRLQPIN---NCPDVDPLMNCDLMDGDAFLTLARDKHYEFSSLRRAKYSSMGLLYEL >seq_81 VIKK-PMDLSTIKKKLDAGFDHEIDNMMKSLGYCCG-HKHIFSPIVLICGKHMCTIPVNTYYYTYQN------RFSYCENDIRTD--YIELNEDASQ--R LLKTLFLKMKNGEFDYEPLVACSICNNKHHNICVMHMDQISS-RFVCSICTKSQNLQTKLGSYLELRVNSFLKRK--D-A-------------------- -GMRKFCNDEKEMEKSFPYIAKAIFAFMDVDGVDVCFFGMHVQEYASSV--QPNRVYLAYMDSVNFFRKHLRTSVYHELLIGYLDYAKNLGYMYAHIWAC PPSDGDDYIFNCHPPDMKIPKPKRLQEWYKKMLNKAVNDQVIHDYKDLLKDV-EKNLQSVTEFEGDYWPNVLEECIKELDEEEKKRK------------- -------------------------------------------------------------- >seq_96 IVKV-PMDLSSIKRKLDTGQEGEVDAVMQSLGYCCG-RKHVYHPQVLCCGKPVCTIPRDSVYFSFQNTGLFSNRYVFCENEIKGD--EVELTDDPTA--M ITKDQFLKEKNNKFDYEPFVQCMDCGRKLHQICVLHFDPIWP--FICDHCHKARGSNTKLGTYLENRVNSFLKRK--D-S-AGEVTIRVLASGDKIVDVK PGMRRFV--EGEMVESYPYKAKAMFAFEELDGVDVCFFGMHVQEYGSDC-AAPNRVYIAYLDSVHFFQRMFRTAVYHEMLIGYLDYAKSMGYTMAHIWAC PPSEGDDYIFHCHPAEQKIPKPKRLQDWYKKMLDKAIIERVVIDYKDILKDA-ENHMQSATEFEGDFWPNVLEESIKELETEEEEKRKREELFNTMEKHK EVFFVIRLHSTQ---TAADPDPIISNDLMDGDAFLTLAREKHYEFSSLRRCKFSTMALLYEL >seq_97 IIKH-PMDLSTIKRKLDTDGEGEIDGVMRALGFCCG-RKYMFSPQALCCGKAMCSIPVNAVYYSHQN------RYVFCELDNKSE--EMELNEDATQ--R IRKELFEKTKNNQLEMEPFIECSLCGKKQHQICSLYLEAVWQ--FVCQQCSTANNIQTKLGTYLEARVNSFLTKS--D-C-AGEVTIRVLSSSEKTVEVK PLMRRHG--D-EMPDSFPYTAKSIFAFEEIDGCSICFFGMHVQEYGSNS-PAPNRIYIAYLDSVNFFQKHMRTPVYHEILIGYLDYVKKLGYATAHIWAC PPSEGDDYIFHCHPPDMKIPKPKRLQDWYKKMLDRAIMEKIVVDYKDILKDA-ENNISSATEFEGDFWPTVLEDSIKEL--------------------- -------------------------------------------------------------- >seq_99 IVKT-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKLEFSPQTLCCGKQLCTIPRDSTYYSYQN------RYHFCENEIQGE--TVSLGDDPSQ--T ISKEQFSKKKNDTLDPEAFVECSECGRKMHQICVLHHELIWP--FVCDPCLKKIGKTTKLGVYLETRINDFLRRQ--N-HPAGECTVRVVHSSDKTVEVK PGMKRFV--DGEMAESFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSDC-PPPNRVYISYLDSVHFFRRCLRTAVYHEILIGYLEYVRKLGYTAGHIWAC PPSEGDDYIFHCHPLDQKIPKPKRLQEWYKKMLDKAVTERIIHDYKDIFKQA-EDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREENLYATMEKHK EVFFVIRLIAGP---NSNDPDPLIACDLMDGDAFLTLARDKHMEFSSLRRAKWSTMCMLVEL >seq_301 IVKN-PIDLSTIKRKLDTGQEAEIDPVMQGLGYCCG-RKLEFSPQTLCCGKQLCTIPRDAAYFSYQNLLFLPITEYLLSSEIV-K--LLKIKFCSYL--S ISKDQFQRKKNDTLDPEWLVECTDCGRKMHQICVLHNDTIWP--FVCDSCLKKANKQTKLGSFLEGRVNDYLRRQ--N-HPSGDVTIRVVHVSDKVVEVK PGMKRFV--DGEMSESFPYRTKALFAFEDIDGADVCFFGMHVQEYGSDS-PPPNRVYISYLDSVHFFQRHLRTGVYHEILIGYLEYVKKLGFTTGHIWAC PPSEGDDYIFHCHPVDQKIPKPKRLQEWYKKMLDKAVAERIVHDYKDVFKQA-EDRLTSANEFEGDFWPNVLEESIKELEQEEEERKREENLYACMEKHK EVFFVIRLIAGP---TANDPDPLMACDLMDGDAFLTLARDKHLEFSSLRRAKWSSMCMLVEL >seq_320 IVKN-PIDLSTIKRKLDTGQEQEIDPVMQELGYCCG-RKLEFSPQTLCCGKQLCTIPRDAAYFSYQNYGLLADRYHFCENEIQGE--NVSLGDDPTQ--S INKDQFQRKKNDTLDPELLLECGDCGRKMHQICVLHNETIWP--FICDGCLKKSNSQTKLGNFLETRVNDYLKRQ--N-HPSGEVTIRVVHISDKVVEVK PGMKRFV--DGEMSESFPYRTKALFAFEEIDGTDVCFFGMHVQEYGSDC-PPPNRVYISYLDSVHFFQRHLRTGVYHEILIGYLEYVKKMGFVMGHIWAC PPSEGDDYIFHCHPSDQKIPKPKRLQEWYKKMLDKAVTERIVHDYKDIFKQA-EDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREENLYATMEKHK EVFFVIRFIAGP---AANDPDTLMACDLMDGDAFLTLARDKHLEFSSLRRAKWSSMCMLVEL >seq_366 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKLEFSPQTLCCGKQLCTIPRDAAYFSYQ------NRYHFCENEIQGE--SVSLGDDPSQ--S ITKEQFEKKKNDTLDPELFVECLDCGRKMHQICVLHHETIWP--FVCNGCLKKANKPTKLGIYLENRVNDYVKRQ--N-HPAGEVTIRVVHVSDKVVEVK PGMKRFV--DGEMAESFPYRSKALFAFEDIDGADVCFFGMHVQEYGSDC-PPPNRVYISYLDSVHFFQRCMRTTVYHEILIGYLEYAKKLGFTTGHIWAC PPSEGDDYIFHCHPTDQRIPKPKRLQEWYKKMLDKAVAERIVHDYKDIFKQA-EDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREENLYATMEKHK EVFFVIRLSAGP---NSNDPDPLMACDLMDGDAFLTLARDKHLEFSSLRRSRWSSMCMLVEL >seq_370 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQGLGYCCG-RKV--------CHTLLLSAENIAKYFFYQN------RYHYCENEIQGE--TVSLGDDPTQ--S INKDQFEKKKNDTLDPELLVECMDCGRRMHQICVLHNETIWP--FVCDGCLKKTNKQTKLGTFLESRVNDFLKRQ--A-HPSGEVFIRVVHVSDKVVEVK PGMKRFV--DGEMSESFPYRTKALFAFEDIDGVDVCFFGMHVQEYGSDC-PQPNRVYISYLDSVHFFRRCLRTAVYHEILISYLEYVKKLGYTTGHIWAC PPSEGDDYIFHCHPLDQKIPKPKRLQEWYKKMLDKAVAERIVHDYKDIFKQA-EDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKKEESLYATMEKHK EVFFVIRLIAGP---MANDPDPLMACDLMDGDAFLTLARDKHLEFSSLRRAKWSSMCMLVEL >seq_397 IVKN-PIDLSTIKRKLDTGQEAEIDPVMQGLGYCCG-RKLEFSPQTLCCGKQLCTIPRDAAYFSYQN------RYHFCENEIQSE--CVSLGDDPSQ--S ISKDQFQRKKNDTLDPELLVECTDCGRKMHQICVLHNDTIWP--FVCDNCLKKANKQTKLGSFLEGRVNDYLRRQ--N-HPSGDVTIRVVHVSDKVVEVK PGMKRFV--DGEMSESFPYRTKALFAFEDIDGSDVCFFGMHVQEYGSDS-PPPNRVYISYLDSVHFFQRHLRTGVYHEILIGYLEYVKKLGELLGEFIAL ------KIIFICGPTKT--------------------------GFDLCLRQL-EKVL----------------------------------LYATMEKHK EC----------------------------------QCRNANCSLPSCQKMKRVVQ------ >seq_398 IVKGLSHDLSTIKRKLDRGQKQRLIAVMAGPGYCCG-RKYEFSPQTLCCGKQLCTISRDGTYYSYQN------RYHFCENEIQGN--SVTLGDDPSQ--M ISKEQFEKKKNDMLDPEPFVECKDCGRKMHQICVLHYDVIWP--FICDNCLKKSGKTTRLGTYIEDRVNKYLKRQ--N-HPAGEVFVRVVASSDKNVEIK PGMKRFV--DGEMVETFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSEC-PFPNRVYISYLDSIHFFRRALRTAVYHEILIGYLEYVKKLGYVMGHIWAC PPSEGDDYIFHCHPFDQKIPKPKRLQEWYRKMLDKAFAERILHDYKDIFKQA-EDRITTANEFEGDFWPNVLEESIKELEQEEEERKKEENLYASMEKHK EVFFVIHLHAGP---VVNDPDPLLTCDLMDGDAFLTLARDKHWEFSSLRRCKWSTMCMLVEL >seq_401 IVKN-PIDLSTIKRKLDTGQEAEIDPVMMSLGYCCG-RKYEFSPQTLCCGKQLCTIPTGGTYYSYQN------RYHFCENEIQGE--NVTLGDDPAQ--M ISKDQFERKKNDTLDPEPFVECKDCGRKMHQICVLHYDVIWP--FICDNCLKRTGKVTRLGTYIEDRVNKYLKRQ--N-HPAGEVFVRVVASSDKVTEVK PGMKRFV--DGEMAETFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSDC-PIPNRVYISYLDSIHFFRRQLRTAVYHEILIGYLEYVKKLGYTQGHIWAC PPSEGDDYIFHCHPADQKIPKPKRLQEWYRKMLEKAYAERILHDFKDIFKQA-EDRLTGANEFEGDFWPNVLEESIKELEQEEEERKKEENLYATLEKHK EVFFVIHLHSPQ---MANDPDPLLTCDLMDGDAFLTLARDKHWEFSSLRRCKWSTMCMLVEL >seq_754 IVKK-PMDLSTIKKKLDTGQEAEIDPVMVSLGFCCG-RKYVYQPQTLCCGKQLCAISKDAKYFSYQN------RYMYCVEDIPGD--TVTISDDPSGA-T IPKNQFVEMKNDHLENEPFVDCTDCGRKLHQVCVLHFEAIWP--FTCEGCMKAKGMTCKLSNFLETRVNNYLKKK--N-S-AGEVYIRVVSVSDKVVEVK PGMKRFV--DGTWPEQFPYRAKCMFAFEDIDGADTCFFGMHVQEYGSDA-RPPNRVYIAYLDSVHFFKKQYRTAVYHEILLGYLDYAKQLGYVMAHIWAC PPSEGDDYIFHCHPAEQKIPKPKRLQDWYKKMLDKAIMERIVIEYKDIHKQA-EDNLKSASEFEGDFWPNVLEDSIKELDQEEEERRKQEAVFATMDKHK ENFFTIRLHPVQ---VAADPDSMMTCELMDGDAFLTMAREKHYEFSSYRRCRYSSMAMLYEL >seq_755 IVRN-PMDLSTIKRKLDNGQESEIDPVMKLLGYCCG-RKYVFHPQVLCCGKQLCTITRDTAYYTYQN-----QEIVTFPNEIQGD--MVNLGEDPTL--L IPKDQFVKLKNDHLDVEPFVECIDCGRKVHQICVLHHDLIWP--YQCDACLNKRGVHTRLGEKIETRVNTFLQKQ--G-C-PAEVTIRVVSSVQKQAEIK PGMKRYE--GGQLPEYFPYQAKALFAFEEIDGTDVCFFGMHVQEYGSDC-QQPNRVYVSYLDSVHFFQRQLRTAVYHEILIGYLEHVGQIGFQMAHIWAC PPSEGDDYIFHCHPPEQKIPKHKRLVDWYKKMLDKAIMDKVVIEYKDILKQA-DDGLTNPKEFDGDFWPNALEDSIRELDQEEEERKANAALYSTMEKHR EVFFVIRLNGRK---TPDDPDPMMTCDLMDGDAFLTLARDKHYEFSSNRRAQFSTMAMLVEL >seq_761 IIKE-PMDLSTIAWKLDNGQEQAIDPVMKSLGYCCG-RKYTFNPPVLCCGKPMCTISREAKYYSYQN------RYTYCFNDIAGD--TVRLADDPTQ--V IKKEEFREMKNDELEHEPFVECSDCGRQVHQICVLHLEPIW---FTCDNCLKKKGVVTKLSTYIETRVNNFLKKK--E-A-AGDVHIRVVASSEKIVEVK PGMKRF----GEMCETFPYRAKALFAFEEIDGVDVCFFGMHVQEYGSEC-SPPNRVYIAYLDSVHFFKRQYRTAVYHEILLGYLDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPHEQKIPKPKKLQDWYRKMLEKGFDEKIVVDFKDILKQA-EDNMVSVSEFEGDFWPNILEESIKELDQEEEEQKKAVA--------- -----------------ADPDPTINCDLMDGDAFLTMARDRHYEFSSLRRAKFSTMAMLFEL >seq_762 IIKR-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKYVFQPQVLCCGKQLCTIPRDAKYWSYQN------RYTYCQSEIPGD--TVTLGDDPTQ--N IKKEQFVEMKNDHLEQEPFVECSDCGRKLHQVCVLHFETIWP--FTCDNCLKSKGKQSKLGQYIENRVNSFLKKK--D-C-AGEVTIRVVSSSEKFVEVK PGMRRYV--DGEWPEQFPYRAKALFAFEEIDGVDTCFFGMHVQEYGSEC-ALPNRVYIAYLDSVHFFRKQFRTAVYHEILLGYLDYAKQLGYTMAHIWAC PPSEGDDYIFHCHPTEQKIPKPKRLQEWYKKMLDKGIMDRIVMDYKDILKQA-EDNLKSASEFEGDFWPNVLEESIKELDQEEEEKRKAEAIYSTMEKHK EVFFVIRLHSVQ---AAADPDPFITCDMMDGDAFLTLAREKHYEFSSLRRTKFSTMAMLYEL >seq_775 IVTK-PIDLSTIKRKLDTGQESEIDSVMQSLGYCCG-HKYTFSPQVLCCGKQLCTIPRDSMYRSYQN------RYVFCEQEIQGD--EVELSDDPTQ--R IKKDQFTLQKNDQMDYEQFVNCIECGRKHHVICALWFEPIWP--YTCDNCLKAKGMTTKMGNFLENRVNNYLKKK--D-V-AGDVSIRVLSSSDKIVEVK PGMKKF----DEVPDSFPYRSKAMFAFEEIDGVDVCFFGMHVQEYGSEC-QMPNRVYISYLDSVHFFQRHLRTAVYHEILIGYLEYAKNMGYTVAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKKMLDKAIIERVCTDYKDILKDA-ENNFQSATEFEGDFWPNVVEESIKELNQEEEEKKKREELYQTMEKHK EVFFVIRLHPQQ---NPPDLDPLCSCELMDGDSFLTLAREKHYEFSSFRRAKYSTLAMLYEL >seq_883 VIKC-PMDLGTIKKRLEAEHNKRCELRREEMCQLCGGDSFKFAPCMLFCGPCAGRIRRHTHYYSDPR-------YHWCSKQMKDGASTNATGEEPLKT-V LLKAHLLKKKNSEVAEEPWVECDSCKLWYHQICGLFNERNHAEPFVCPFCTKKQRQATRMATRIEKRVVDALAKAQDEEMSSFAITIREVLSVDKQVQMK PRMAKMFALKLKKADKFEYRSRCICVFQELDGVDVLIFTLYVQEYGEQC-AEPNRVYISYLDSVAYFQKKFRVLMHQQVILGFLDDAKMRGYHTAHIWSC PPLKGDDYIFFCKPENQKIPKAARLRSWYSKLLQGAKKEGLVYNISNLYAE---VKKKTALEFEGDYWPRLAEDLIKQVEDKASGK-------------- ----------------------------GNG---------KTAGKAS-RKT----------- >seq_885 VIKC-PMDLGTIKKRLESEHKSRCEQRREEMCQLCGGDSFKFAPCMLFCGPCTGRIRRHTHYYSDPR-------YHWCSKQMKDG--PLDMSLLPVSSSP LTKAMLLKKKNSEVAEEPWVACDTCKLWYHQICGLFNERNHAELFVCPFCTIKTRQVTRMAARIEKRVVDALAKAQGEELASFAVTIREVLSVDKQVQIK PRMSLFVAKEREKKDKFDYRSRCICVFQELDGVDVLIFTLYVQEYGDTC-AEPNRVYVSYLDSVAYFQKKFRVLMHQQVILGFLEDAKIRGYHTAHIWSC PPLKGDDYIFFCKPENQKIPKAARLRSWYSKLLQGAKKEGLVYNISNLYAE---MKRKTALEFEGDYWPRLAEDLIKQVEEKAKPSTSSKS--------- --------------------DP--NAAI--GAAVPT----QADTVDPL-------------- >seq_888 IIKC-PMDLGTIKKRLEAEH-KRCEQRREEMCQLCGGDSFKFAPCMLFCGPCAGRIRRHTHYYSDPR-------HHWCSKQMKDG--PIDLSLLPQLSSV LKKSSLVKRKNSEVAEEPWVECDSCKLWYHQICALFNERNHAEPFVCPICVKKQRIITRMAKLIETRVMEAIEQAGKDEAVQYGITIREVLSIDKQVQVK PRMGKLLAGKASLSLQLTYRSRCICVFQELDGVDVLIFTLYVQEYGPDA-LSPNRVYVSYLDSVNYFQKKLRTLMHQQVMLGFLEDCKNRGFHTCHIWSC PPLKGDDYIFFCKPENQKIPKSARLRGWYQKLLQQAKHDGLVVNISNLYAEM---KKKAAHEFEGDYWPRLAEDLIKQLEEKDHGNRAKTG--------- ------------------APDA------SPS-----GAKTANGSKTPISRSPSPAMTP---- >seq_899 IIKC-PMDLGAIKKRLENEHCRRCEQRREEMCQLCGGDSFKFAPCMLFCGPCRGRIRRHTHYYSDPR-------HHWCSKQMKDG--PIDLTALPQLASA LTKAVLVKRKNSEVAEEPWVECDSCKLWYHQTCALFNERNHAEPFVCPICVKEQRTITRMAKLIEDRVNQAIDKANEEEATRLAITVREVLSVDKLVQVK TRMGHYCKSKDTSPLHLTYRSRCICVFQELDGVDVLIFTLYVQEYGPES-LPPNRVYVSYLDSVNYFQKKWRTLMHQQVMVGFLQDSKQRGFHTAHIWSC PPLKGDDYIFFCKPENQKIPKSARLRQWYHKLLQEAKKEKIVVNISNLYAE---LKKKEAHQFEGDYWPRLAEDLLKQLQDKSGRG-------------- ------------------------EINI---DDL------THPGESG-LKADPSSLSV---- >seq_911 IIQH-PMDLSTVKTKCIQLEQHSCSSCLSNVCGICLEKCIHFEPPLLICGPCRQRIKRYAIYYLDSK-------HHWCATTLPK---WIEVDNTQDR--S IAKCELIKDKFTDKVTEPWVQCDECNGWAHQVCSLFNAAVNNTPYVCPLCLEKVNGETSLSKFMETWIMDRFSRMGESEL-AESITVRVVSSIKTNHKIP EAVRHFGTKNLKYPDQVEYTSKAICVFQRINKIEVCIFFMYVQEYDHQCSLVFNRVYLAYIDSLIYMRRRIRTALYQEILLSYFAFCKERGYMWIHIWAC PTTRGGDFIYWCHPTSQKNPAKERLQKWYLQMVERGQQVGVIASCDDLYGTL-EETLRNEVKFDGDFWPAQVELLLKYPGKRGRGSSASLEVVDSVQACR DSLFVISLVRKR---VEDP-DI--SCPFLDYTDMLRNSEQAHYQFDTFRRAKYSTMMLMYEV >seq_912 DLAL-VKNKCLNLEYP----EHSCPFCLDNVCGICNEKCINFEPPFVVCCR--QRIKRHAMYYKTPD-----GQYHWCSSSLPKT---LTLKAAPSS--T SDNGATDPFATEYTVSKLLLKAKFLDELTEPWVQCDQCNGW-CALFNACENADEEENEELGMSPI---ETSVEDFPVKVEKRRPYTKDFTRALGFDDEIE EKVFYLTQVDVVKVVSAIYTSKVIFVFQMINGVEVCIFSMYVQEYDKHCQLPANRTYIAYLDSLVYMRRH------------------------------ ------------------NPGKERLLQWYLSMAKKAKERKRVSESVKSAREF-IALQPTCAA-------------------------------------- -------------------------------------------------------------- >seq_913 VVKH-PMDLAIVKNKCLNLEEHSCPFCLDNVCGICNEKCINFEPPFVMCGACRQRIKRHAVYYGH---------YHWCSTSLPKM---LTLKVAPSSN-S LSKFALLKAKFMDELTEPWVQCDQCSGWVHQICAMFNACENAVMYTCPLCRLEELDQDLQSCGLSRFMQKWVQQHLENLGEAESIAVKVVSAIKSSCHVS SVVRNFRSPSQEYPQTIDYTSKVIFVFQMINGVEVCIFSMYVQEYDKHCQLPANRTYIAYLDSLVYMRRHVRTSLYHQILISYLASCKAKGFEYAHIWAC PTTRGGDFIYWCHPSFQKNPGKERLLQWYLSMAKKAKELGVVFACDDLYTRE-LEETLDTQLFDGDYWPSEAERVAASPLKRGRKEPNTAGCQ-VVVKTT ATASGQETVLTEVSSANFTEEPEMSCPFLDYPNMLKNCEEHHYQFDSFRRAKYSTMMLVYQI >seq_918 ---------------------------TSQNLQSCGVSRF------------------------------------------------------------ ------------------------MQKWVQQH------------------------LENLGEHEA----------------AQSIVVKVVSSIKSSCHVS SVVRNFRSASQEFPQTIDYTSKVIFVFQMINGVEVCIFSMYVQEYDKYCQVPVNRTYIAYLDSLVYMRRHVRTSLYHQILISYLASCKAKGYEYAHIWAC PTTRGGDFIYWCHPSFQKNPGKERLLQWYLIMAKKAKELGVVFACDDLYKHELEETLDTQL-FDGDYWPSEAER--VAASPPKRGRKEANTAYYCSKCQG AVAEAIAPASGQKAVLTKEANSKTSCPFLDCPNMLKNCEEHHYQFDSFRRAKYSTMMLVYQI >seq_919 VVKH-PMDLAIVKNKCLNLEEHSCPFCLDNVCGMCNEKCINFEPPFVMCGACRQRIKRHAVYYGQ---------YHWCSTSLPKEPSTITDHEHPTPSTT LSKFALLKAKFMDELTEPWVQCDQCSGWVHQICALFNACENAVMYTCPLCRLEELDQNLQSCGVSRFMQKWVQQHLENLGEAQSIVVKVVSSIKSSCHVS SVVRNFRSASQEFPQTIDYTSKVIFVFQMINGVEVCIFSMYVQEYDKYCQVPVNRTYIAYLDSLVYMRRHVRTSLYHQILISYLASCKAKGYEYAHIWAC PTTRGGDFIYWCHPSFQKNPGKERLLQWYLIMAKKAKELGVVFACDDLYKHELEETLDTQL-FDGDYWPSEAERAASPPKRGRKEANTAGV--------- -------------------------------------------------------------- >seq_921 -------------------------------------------------G------------------------EFLCPLEHRLK--------------- --------------EPGK--------------------------------YRVTRHRTKLSDYLERWIAKVLQEEREDEAVAENLTIRQVSNIDKQLMVR DKMFRYKESH-NYSSDFRFKSKCICMFQEIHGVSVLLFGMYVHEFDEQE-APANRVYISYLDSVSYFESHLRTKVYHELLIAYLDFVKKRGFYAAHLWAC PPLKGDDYILYCHPETQKTPKSERLRQWYVDMLVKAQEKGIVWHITNMYDDR--NNNPACK-FEGDYWVGLAEDLIEKIESEQSKKATSKK--------- -------------------T----ASSK---KTSKKLTMKTKAKKQK-RRKSGSSLTS---- >seq_923 -------------------------------------HSITISPGS----------------------------EFLCPLEHRKS--------------E PQKYKIGRHAFSAKDLQ---------------------------------------RTKLSDFLERRLTKSLQAEREDEAKAEGLTVRLVSNIEKQLMVR DKMFRYKDSH-KYTSEHRFKSKCICMFQEIDGVSVLLFGMYVHEFDEQE-ADCNRVYISYLDSVNYFKAHLRTKVYHELLIAYFDFVKQRGFHTAHLWAC PPLKGDDYILYCHPEVQKTPKSDRLRAWYVDMLVKAEEEGVVWQITNMYDDR--NDNTPCA-FEGDYWVGLVEDLIEKLDAE------------------ ------------------KPKK--------------IVNKGSKSHKR-KNSD---------- >seq_932 IIRR-PMDFGTIKKKLDANLDEASARARDGACKLCGMESMVFEPAILYCGECNTRIRRNCYYYCAPD-------YHCCTPALPE---SISAPSNDGG--P YEKSGLVRKKNDEVHEEPWVQCDHCNQWVHQVCALFHHKPGSTAFHCPECRAKSPSRSKLSDFLERRVQHLLAKEAEADPSADALVIRQVSNIEKQLIVR DKMYRYKEHK--IPSEHRFKSKCLCMFQETHGVSVLLFGMYVHEFDEQE-SAANRVYVSYLDSVNYLEVHLRTKVYHEILIGYLEFVKQRGFHTAHIWAC PPLKGDDYILYCHPESQKTPKSDRLRQWYVQMLLKAKEEGIVVDIHNMYDEA----------FEGDYWVGLAEDLIEKMDEEEKADVKTKK--------- -------------------PASLSKPNKRQGDKDKS---EEVIDFSDPLMAK------LGEV >seq_943 ------------ARQLQKEAERAVINATESSCRACGVERLTFEPPPLYCYSCVGRIKRGQVFHTRRD--------AWCNNAIQG---YVDVEGQ-----R FPKATLIKKKNDDDLEEPWVQCDYCEDWYHQLCVLFNGRRNEAPFTCPNCLDKNERKTKMSTFLEERLASKLSAERVERAKAENLTIRVVSQTLKQMDTK PHYYHFKEQG--IPAHFTYRSRVILLFQKLEATDVCLMAIYVQEYDDEC-PEPNRIYLSYLDSVKYFRGENCALVYHNILIAYLDYVKQRGFTSCFIWAC PPFQGDDYILYCHPKVQKTPKADKLREWYLKMLRSAQKDGIVISTSNVYDEN--HDIRCATEFDGDYFSGIAEDIPTIMKELEEAKNIEAK--------- ------------------SGAA--DAELNKE-----LMKKLGTTISN-MRND----FMLAHL >seq_944 --------------------AERAVLNAESACRACGVERLTFEPPPLYCYSCVTRIKRGQVYHTRRD--------AWCNNGIQGF--------------R FPKQALIKKKNDDDLEEPWVQCDYCEDWYHQICVLFNGRRNEAPFTCPNCLEKDERKTKMSNFLEERLATVLANEREARSKAEGLTIRVVSQTMKQMDTR SHYY------EGIPMHFAYRSRVILLFQNLEAVDVCLMAIYVQEYDDEC-PEPNRIYLSYLDSVKYFRGESCALVYHNILVAYLDYAKARGFTSCFIWAC PPFQGDDYILYCHPKVQKTPKADKLREWYMKMLRSAQQDGIVLSTSNVYDEN----------FDGDYFSGIVEDIPTIMKELEEAKNIEAKCKDFEDALP EHERRYTGRKLSSEECPEKEEEKLESEFFDTQAFLSLCQGNHFQFDSLRRAKHTTMMVLYHL >seq_946 VIKR-PMDFGTIRKKLESNYEEVMAREKDGACRLCGLESLVFEPAILYCGECNTKIRRNNYYFADNK-------FHCCVPNLPES--IPNADGVPYV--- --KAELTRKKNDEVHEEPWVCCDKCNQWVHQICGLFNMKNDKQDFVCPSCVLKSGRRSKLSDYLERRVAKVMEAEITAEMRTDKLIIRQVSNIEKTLMVR DKMYRYK--DAKYPSEHRFKSKCLCMFQEIHGVSVLLFGMYVHEFDQQE-AKCNRVYVSYLDSVNYLEAYLRTKLYHEILIGYLDYVKQRGFHTAHIWAC PPLKGDDYILYCHPETQKTPKSDRLRQWYIQMLMRAKEEGIVVDINNIYDES--TAQASPMEFEGDYWVGLAEELIEKLDEDDKTNKKKKDILNQAKKET PPYIPIVL----------DPDDIVESEVFDTQAFLSLCQTNHYQFDELRRAKHTSMMTLFNL >seq_948 VIKR-PMDFGTIKKKLEGNVDEAAARAKDGACQLCGSEGLVFEPAILYCGDCNSKIRRNNYYFCSPD-----NKFHCCVPGLPDT--ITKPADGGGP--P YIKAELCRKKNDDVHEEPWVQCDSCNQWVHQICGLFSDKEKGSEFQCPTCLLQLENRSKLSDYLERRVAKVLQAE--DQAMHDKLIIRQVSNIDKTLMVR DKMYRYKDQA-KYPSEHRFKSKCICMFQEIHGVSVLLFGMYVHEFDEQE-AQCNRVYVSYLDSVNYLEAYLRTKLYHEILIGYLEYVKQRGFHTAHIWAC PPLKGDDYILYCHPETQKTPKSDRLRHWYIQMLMKAKEEGIVVEINNMYDEA-AHASPTD--FEGDYWVGLAEELIEKLDEEGKSAKKKKDCYQTQMSTQ S-----NVKETPPYVPIKDPDDVVESEIFDTQAFLSLCQANHYQFDELRRAKHTSMLTLFHL >seq_1023 IITH-PMDLGTVARKLAKEGEENGRKERGETCNLCGYSAKTFEPMTYYCQCNGKRIGRGRYFYGSNQ-------WHWCSNDLKDG-EIIALAET-----A VRKADLKRKKNDEQAEVGVDNASKLSWSFTGVCTCERSRRGTAGNIAPTAHKLGGKHGPLSAYVEAQVKKRLDAAYEAEAKANTLYIREVSVMDTVHLVK PGFHRRYGPAGEYPADFPVRSKCIVLFQELDGVDVLLFGMYVYEYGHTC-PAPNRVYISYLDSVHYFRRNYRTMVYHEILIAYLEEVKTRGFHTAHIWAC PPAKGDDYILYCHPPEQQTPKDDRLQQWYVTMLEEAKKRGIVEGLTNLFDEP---ETADARQLEGDYWIGEAEN-------------------------- ------------------------------------IIKD------------------LPE- >seq_1024 ---------SAVEGMCDLKE-------EEESCQLCDDGTLLFPPQPLYCLLCSRRIDDRSFYYEELS-------HQICSSRCKTK--FPLCGV------F IDKHKMLKRSNFDNADTEWVQCESCEKWQHQICGLYNKLKDEAEYICPTCLLEECQETVLSYFLEQRLFKRLKEERYQTAKPEGLTLRVVFSADRTLTVN KQFASLLHKE-NFPSEFPYRSKVILLFQKVHGVDICIFALFVQEFGSEC-SQPNSTYIFYLDSVKYFKEALRTFVYHEVLIGYLEYCKLRGFTTSYIWAC PPKIGQDYIMYSHPKTQQTPDTKKLRKWYVSMLQKAAEQRVVMNVTNLYDRDTEEYMTAAR-FEGSFWSNRAEIMIQDIEREGNNELQKKV--------- ------------------TMKTTGDVDVDDVNILL-MEKLEKEVFPN-KKDL------M--- >seq_1026 IVQH-PMDLALVETKL---EKEEPKKKDDTSCTLCGNHRRLFEPTTLYCCG-MQKIRRNASYYTDRY-----RQNQWCEDVLMEE---KPVLLDDGK--E TKKSLLVKMKN-DSTPEEWVQCDNCHNWAHQICALFN---E-NAFTCPKCFLKQQDQCKLSTVIEEGLATTLSVEYEKIAKAEGLCVRVVSSLEKKHKVR DEMLRY-SKK-GYPSEFPVTSKCILLFQKIHGVDVLLFGMYVYEYGDKC-AAPNRVYISYLDSVQYLESSYRTTTYQSIIVEYLRYARMRGFHTAHIWSC PPSKGDEYIFYCHPSSQLVPKDDMLCAWYIETLKKAQDQGIVLETRTIYDEK-NGI-------------------------------------------- ------------------EPFDPMSLPYFEGDYIPGEIEKIIRDFNKTKLKELKSAPAEGNR >seq_1044 VI------------------TEEESGSDAYKCQLCSMATLHYAPQPIYCFCCGNSIRRNAHYYEDTD-------NCFCAKNGRGG--NITCNGT-----T VSKTDLNKKENNRKCEEAWIECSKCKRWQHQICALYNDKRDLAEYVCLLC------KTMLSDHIEERLLKRLAKIGLK---LPELFVRVVLSVDKQIEVK KQFLNIFEKQ-DYPADFSYTSKVILLFQKIDGVDR-------------------SLYVSYLDSVKYLRESLRTLVYHEIL-------------------- ----GEDYIFYCHPEFQKTPKKDQLRHWYHSVLRKAGEEDIVVGLCNMYDYTCESKVTAAG-FDGDFWSGAAMDEASQIEQCTGGDREKMLQYVCIHCHK VIECGKRWFCTECKIFQENEDIIFDNGLFGNYNFLSFCQRNRFQFDSLRRAKYSSMMILHFL >seq_1045 VIKK-PMDLGTIKQQLGSLEAHSCPHCQSHVCGLCNEKCINFEPPLVMCGKCGQRLRRHAAYFTTVD-------LNWCTPKLK----EVVIGDR-----R IGKNDLVKAKFQDELTEPWVQCDSCSGWVHQICALFNASTVDTPYTCPLCLDTSSATSHLGIFMETWIRDHLISLGEPNA-ATSIRVKLAASIHITAPIS SRVQHFVAVDSMYPSHVTYTSKTILVFQKIHGIEVCLFSMYVQEYEDNCGIPSNRTYIAYLDSLGYFRRHARSSVYQQLVIAYLAFCKLRGFTHAHIWAC PTTRGGDFIYWCHPTYQRNPNKDRLLLWYRSIIASAKRKQVAFGHDTLWST---FSASSATEFEGDYW--VAEARIVGIKPRKTRRKKSQ---------- ------------------DPEE--DPPMPQP-----LVKS---------------------- >seq_1046 VIKT-PMDLGTIKQRLGSLEAHSCPHCQSHVCALCNEKCINFDPPLVMCGKCGQRLRRHAAYFTTVD-------LNWCAPKLK----EVVVAGR-----K IGKRDLVKAKFQDELTEPWVQCDSCTGWVHQICALFNASSVETPYTCPICMEVAKATSHLGTFMETWIRDHMIALGEPHA-ATSISVKLAASIRKTVPIN PRVQHIVAADSTYPSHVTYTSKTILVFQKIHGIDVCLFSMYVQEYGDNCGVSSNRTYIAYLDSLGYFRRHARSSVYQQLVIAYLAFCKLRGFTHAHIWAC PTTRGGDFIYWCHPTYQRNPNKDRLLLWYKAIIASAKRSHVAFGHDTLWST---HFQSSSAEFEGDYWVA------------------------------ ------------------------EADRIAG-----M---RPRKVRR-KKTE---------- >seq_1049 ---------------------------------------------------------------------------------------------------- --------------------------------------------------------TTHLGTFMQTWIRDHLVSLGESPAVANSLFVKLASSIRVTAPVT ANAQHFRLNGFAYPSEVTYISKTILVFQEIDGTDVCLFSMYVQEYGPDCGIPSNRTYIAYLDSLGYFRRHARTSVYQQLVIAYLAFAKARGFTHAHIWAC PTTRGGDFIYWCHPSHQRNPSKERLLLWYKAVIAAAKARHVAFGHDTLWKTA--SGISNAPLFDGDYWPAELDRVLTPLKPRAKKKTDEMPPYWQSPTGA VCDVCAAVSVDSDVPLQRND--VLACPFLDHSTLLKNCEERHYQFDTLRRAKYSTMMLLYHM >seq_1050 VIAR-PMDLGTVKQNLSALLEHSCPHCQGHVCGLCEEKCINFEPPLVLCGKCGQRLRRHATYFPDGQ-------LNWCAPKQA----EFTYGDR-----V VTKAELLKAKFQEELTEPWVQCDGCMGWVHQICALFNSEIAAVPFTCALCRLRSLETTHLGTFMQTWIRDHFLFLGESPAVANSLFVKLASSVRVTAPVT ATAQHFRLNGFAYPSEVTYTSKTILVFQEIDGTDVCLFSMYVQEYGPDCGIPSNRTYIAYLDSLGYFRRHARTSVYQQLVIAYLAFAKARGFTHAHIWAC PTTRGGDFIYWCHPSHQRNPSKERLLLWYKAVIAAAKARHVAFGHDTLWSSA-LHSGISNAPFDGDYWPAELDRAVLTPLKPRAKKKPSDAAYWQSPTCA VCDVCAAVSVDSNVPLQR-ANDVLPCPFLDHSTLLKNCEERHYQFDTLRRAKYSTMMLLYHM >seq_1061 QIKE---HITSLRKQFNQSTMVEESGSDVYTCQLCGMGTLSFAPVPIYCFCCGIRIKRNACYY-YRR-------HCFCSRTSRGG--NIKFNGT-----S VSKTDLDKKTNNREFEESWVECNKCKCWQHQICALYNDKRDLAEYTCPICLKEIGNRTMLSDHIESRLFKRLWQEDEDWAKAESLSVRVVLSVDKQLKVK KQFLDIFGEE-NYPSEFPYTLKVILLFQKIEGVDVCLFAMYAQEFGSEC-GYPNSVYISYLDSVKYFREALRTIVYHEILIGYLDFCKKRGFTTCYLWAC PPMKGEDYLLYCHPDTQKTPKKDKLRQWYHSMLRKAAEENIVVGLTNLHDHS--SKVTAAR-FDGDFWSGAAMDKARHIEQECGGDYK------------ -------------------HPP------SEGAKDILVMHKLGQTILP-FKED------FLVV >seq_1065 ---------SSLRKKFVRVANEEETGIDANTCQLCLMQKLYFTPVPIFCLSCGIRIKRSTPYFCSKE-------RCFCSKASKDG--YIKFNGA-----S VSKANLEKRNNGDVLEEPWVECNKCKRWQHQICALYNNKKDLAEYTCPFCIENGMHRTMLSDHLEKRLIERLMQERADWEKAKSLYVREVFSVDKQLKVN KEFLDIIPEE-NYPAEFSYRSRVILLFQKIEGADVCIFGMYTKEFGSQC-GGPNCIYITYLDSVKYFREALRTFVYHEILIGYLDFCKKRGFLNCYIWAC PPSKGDDYILNCKPGDQKTPKNDKLRRWYYSLLKKATEENIVVGLTNIYDHK--FKLTASR-FDGDYWCYHAMEIAKKIEKECGGEYETMLCHEVIVSGK RWFCTECKKFQECERCHSGFNQIIENGLFENNNFLSFCQKNQFQFDTLRRAKYSSMMIIYYL >seq_1068 ----------------------------------SKGEKITFNGT------------------------------------------------------S ISKKNLEKRNNDEVLEEP-VECNKCERWQHQICALY---------------------------------------------------------NKKEDVD SGLI--------MSFSYSYQKKIF-------QLEKCYLSTTIPEENYPTELSYR--------------------------IGYLDFCKKGGFSTCYIWAC PSKKGDDYILYCHPEEQKTPKSDKLR------------QNIVVGLTNVYDRL-TEK-------------------------------------------- ------------------GKSKVIRLPCFMGDCWCRAMVVETLEKESLKQVSNKTIKGMGHA >seq_1070 AKEKVSEPTPDQEKQTKLSNSEEEAGTEANTCQLCERKKLYFAPVPIVC--SCCGIRVKRIYFCRKE-------GCICSKTSKGG--KITFNGT-----S ISKKNLEKRTNDEVLEEPWVECNKCKRWQHQICALYNNRRDLAEYICPICIENGKHRTVLSDHLEKRLIECLIQERAKWENAKGLSIREVLSVDKQLKVN KQFLDIIPEE-NYPAEFSYRSRVILLFQQIEGADICIFGMYVQEFGSEC-GNPNCVYISYLDSVKYFMEALRTFVYHEILIGYLDFCKKRGFLTCYIWAC PSRKGDDYILYCHPKEQKTPKNDKLRRWYLSMLKKATEENIVVGLTNVYDHA--SKVTASR-FDGDCWSGNAMEAAKTIEKECGGDYEKML--------- ------------------NPDTILGQNIWPTESFLI----AHLQYACI-------------- >seq_1071 SLRKESVQIT----------SKEEAGIDANTCQLCQRKKLYFAPVPIFCSCCGVRIRR--TYFAQGC---------ICSKTSKGG--KITFNGA-----F VSKTNLEKKNNDEVFEEPWVECNKCKRWQHQICALYNNKRDLAEYICPVCIENGIHRTMLSDHLEKRLFERLVEERANWEKAESLSIREVLSVDKQLKVN KLFLDIIPEE-NYPAEFSYRSRVILLFQQIEGVDICIFGMYAQEFGSEC-GNPNCVHISYLDSVKYFREALRTFVYHEILIGYLDFCKKRGFSTCYIWAC PSRKGDDYILYCHPEEQKTPKNDKLRRWYLSMIKKATEENIIVGLTNVYDHK--SKVTTSR-FDGDYWCGYVMEAARTIEKESGGDYEKML--------- --------------------DTILGQKILPTENFII----AHLQYSCI-------------- >seq_1072 ---------SSLRKESVQITSKAEARIDANTCQLCEKERLYFAPVPLFCLHCGNRIKRTYFCTFDAH-------GCICSKTSKGG--KIAFNGT-----S ISKKNLEKRNNDEVLEEPWVECTKCERWQHQICALYNKKADLAEYICLLCRLKEIERTVLSDHLEKRLFERLLQERENWEKAENLSIREVLSVDKQLKVN KQFLDIIPEE-NYPTEFSYRSRVILLFQKIEGADVCIFGMYVQEFGSEC-SNPNCVYISYLDSVKYFREALRTVVYHEILIGYLDFCKKNGFSTCYIWSC PSEKGDDYILYCHPEEQKTPKNDKLRRWYLSMLKKASEENIVVGLTNVYDRLWKSKVTASR-FDGDCWCG------------------------------ --------------------NAMVVANTLEKVNYEKLLKQSNRTIKDMGHAKKDILVMVGQN >seq_1076 IIKK-PMDLGTIQKRLESSAEDLERRQNERACTLCGSEKLLFEPPVYFCNCQSQRIRRNSHFYIGGN-------YFWCSNELDDK-IPIELADL-----T VMKNNLKKKKNDEIHEESWVQCDTCERWVHQICGLFNTRQNKSEYCCPKCLLE---RTTLSEWLERSVTKKVEKRKRELAEGGPIIIRQVTAMDRKLEVR ELMKRYAHKN--YPDEFPFRCKSIVVFQHLDGVDVILFALYLYEHGEDN-PPPNTVYISYLDSVHFMRRKLRTFVYHEILIAYLDYARRRGFATAHIWAC PPLKGDDYIFYAKPEDQKTPRDSRLRLWYIDMLVECQKRSIVGKVTNMYDIA--DPNLDATALEGDYFPGEAENIIKMLEEGGGKKLGSVG--------- ------------------DEEALIASGILDGKSLKDLDRD---------------------- >seq_1077 IIKK-PMDLGTIGKKLEQGSEHAENSKKAHACGLCGCEKLNFEPPIYFCNCQTKRIRRNTHYYADKQ-------YSYCNGEIKGD--HIDLGTT-----K IKKSDLAKRKNDEIHEESWVQCDDCERWIHQICGLYNTRHDKSAYSCPKCLQKRKRRTKLSEWLEGHVHKKVEERMNMLARGGPVTIRQVTSTDRKHEVR ELMKRYA--DKNYPDEFLYRGKCIVVFQNIDGVDVVLFALYVYEHGDDN-PLPNTVYVSYLDSVHFMKRKMRTFIYHEILISYLDYARAKGFQQAFIWAC PPLKGDDYIFYAKPEDQRIPKEHRLRQWYVDMLEECQRRDIVGKVTNMYDEN--DPNLDFSAFDGDYFPGEAENIIKQINDDVKKSGKKGK--------- ------------------DPEK--FCECISGDSFMVFLNCKGAKAENMREANKESLAPLIEM >seq_1078 VIKK-PMDLGTISRRLDNGSEHRENSMRQQACGLCGCEKLNFEPPVFFCNCPSKRIRRNTHFYADKQ-------YAWCSNELGGE---IDLGTS-----V LKKVDLAKKKNDETHEESWVQCDDCERWIHQICGLYNTRQDKSAYSCPLCRKKEGERTNLSDWLERDVHKKVNQRLKELAQGGPLTIRQVTSTDRKLEVR DQMRRYAHKN--YPEEFPYRCKCIVVFQNIDGVDVVLFALYVYEHGDDN-PFPNTVYVSYLDSVHFMKRQMRTFLYHEILISYLDYARQKGFLQAFIWAC PPLKGDDYIFYAKPEDQKTPKDVRLRQWYLDMLVECQKRNIVGMVSNMYDQ---NKSLDAASFDGDYFPGEAENIIKDLEESNSKRKGGAG--------- --------------------DP--------------------------SKSK---------- >seq_1079 IIKN-PMDLGSIKKRMENNCEEEHRKSNGDVCVLCGCEKLLFEPTVYYCSCNGQRIRRNSYYYTGGR-------YHWCQNELREK-EPLEFADC-----T LWKKELQKKKNDEMHEEPWVECSQCNRWVHQICALFNGRMNKTIYHCPFCMARRGAHTKMSRFLEDRVIKSLDDAYALRSSASAVYVRQLSNIEKAHQVK PRILRYA--DQKYPREFPVRSKCILLFQEMDGVDVILFGMYLYEYGHNC-PQPNRVYVSYLDSVYYFRRQYRTLVYHEMLIAYLAHTKERGFHTAHIWAC PPCKGDDYIFFCHPEDQKTPKDDRLRSWYITLLEKAKEEGIVTHITNLWDE----HFQADYDFEGDYWPGEAENVVKALEDEANERNESKS--------- -------------------RGS--ATKSMKG-----TQRG--------LRSD------GSIE >seq_1080 IEQL----------------NMTVLPISENVCQLCGTDRLVFVPTPVYCSSCCKCIKRNLVYYAGGR-------HCFCTRKSCGD--DVSSQGL-----S INKNKFQKAKNNDQNEESWVQCDKCEGWQHQVCALYNAKKDFAKYICPFCCLKEIERTMLSDHIEQRLFRRLKLERNERAKAADLIVRVVLSVNRNLKVK QQFLDLCHNE-GYPPEFQYKSKVILLFQKIGGVDICLFGMYVQEFGSEC-APPNCVYISYLDSVKYFKEALRTFVYHEILIGYMDYCRKRGFTTCYLWAC PPIKGEDYILYCHPESQKTPKPEKLRSWYWSMLRKASEEDIVVNYTNLYDHV--ARISAAH-FDGDYWSGAAEDIVRNIEKESRGDSQNKV--------- ------------------DN--LS----ADAKDIL-VMQKLGQTILP-VKED----FIIVNL >seq_1082 IKEH----LSNLRQSIDQSILMEDRENSEKVCQLCALGKLFFAPAQIYCSCCGLRIKRGVNYYHSSR-------YCFCNKGSRRG--NISFHGI-----R VSKATLGRKKNDEETEEPWVQCDKCKGWQHQICAVLNDKSALAENTCLKCLLKETETTMLSDHIEQRLFRRLKQEREERAKAEDLVVRVVLSVQKTLKVK QKFLDLFHDE-NYPAEFPYKSKVILLFQKIEGVDVCLFGLYVQEFGSEC-SHPNCVYISYLDSIKYFREALRTFVYHEILIAYLEFCKKRGFVTSYIWAC APLKGEDYILHCHPEMQNMPKPEKLRQWYQSMIKKAAEEKIVVSHTNLYDRT--SKVTATR-FDGDYWSGATEDAIRAIEQGMADTQKKSK--------- ------------------------HTNPSDGTKDILLMQRMGQTILQ-SKEDFIIVDMQYVC >seq_1083 ----SSLEHSMCQRKSNEE-DKIINHVNENRCQLCAEDKLWFSPVPLYCSCCGARIKRGVIYYENGT------QPCFCSKSSPPR--KITFYGI-----T ILKEKLHKRKNDEATDEPWVECDKCKRWQHQICALFNDKRDMAEYICPKCCLKEMKRTILSDFIEERLFRRLNQEREERAKAEDLVLRVVLSVNKQLKVK EKFLEIFHGE-NYPAEFPYRSKVILLFQRIEGVDVCLFGLYVQEFGSEC-SQPNSVYISYLDSVKYFREALRTFVYHEILIGYLEYCKKRGFATCYLWAC PPIKGEDYILYCHPENQKTPKSDKLRQWYHLMLRKAAKENIVVNCTNLYDHVFYSKITAAR-FDGCYWYDAAEDILKNIEQKTGVYAERKV--------- ------------------TLKAMGHTESSGGTKAILVTNHRGGPMCG-RKED------FMVV >seq_1086 VIKK-PMDLGTIRKKLENGVEEDVKRKNGEACCLCGCEKLLFEPPVFYCNCPSKRIRRNSYYYIGGN-------YHWCHQELRDN-STIDLGDL-----S VKKESLVKKKNDEVHEESWVQCDRCERWVHQICALFNTRQNKSEYACPKCRKAKGESEYLENHVREKVDEFVEQRSQDMVVGGAITIRQVTSMDRRLEVR DRMKRYAFKN--YPEEFNFRCKCIVVFQNLDGVDVVLFGLYVYEHDEKN-PAPNAVYVSYLDSVHYMRRDMRTFIYHEILISYLDYVRRRGFSTAHIWAC PPLRGDDYILYAKPEDQKTPKDDRLRQWYIDMLIEAQRRGIVGKLTNMYDLS--NEKNDATVMDGDYFPAEVENIIKDIEEGKTGKKGSSQ--------- -------------------QKK------KSGGGTRS----TGLDEDALKASG------FL-- >seq_1092 AEKN----------------QAMEHSMSENSCQLCAVEKLTFEPPPIYCTPCGARIKRNAMYYTVGA---GDTRHSFCINEARGD--TIAIDGT-----A IPKIKLDKKKNDE-ETEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPYCYMQEIEKTILSDHIEQRLFKRLKQERLDRARADGIVVRVVSSVDKKLEVK PRFLEIFREE-NYPTEFAYKSKVVLLFQKIEGVEVCLFGMYVQEFGSEC-LPPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKLRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKATKENIVVDLTNLFDHVSKAKVTAAR-FDGDYWPGAAEDMIYLINQEEECRKLNKK--------- ------------------------QSDLSSNSKDLLLMHKLGETISP-MKED------FIMV >seq_1098 AEKN----------------QAMEHSMSENSCQLCAVERLTFEPPPIYCSPCGARIKRNAMYYTMGA---GDTRHYFCINEARGD--SIVVDGT-----A IPKSRLEKKKNDE-EIEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPNCVERGERRTILSDHIEQRLFKRLKQERQERARAEALVIRVVSSVDKKLEVK QRFLEIFQEQ-NYPLEFPYKSKVVLLFQKIEGVEVCLFGMYVQEFGSEC-AFPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKKRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKASRENIVVDLTNLYDHE--AKITAAR-FDGDYWPGAAEDLINQFRLEEDGRKLNKK--------- ------------------------QSDLSANSKDLLLMHKLGETICP-MRED------FIMV >seq_1115 AEKN----------------QAMEHSMSENSCQLCAVEKLNFEPPPIYCTPCGARIKRNAMYYTIGT---GDTRHYFCINEARGD--TIVVDGT-----T IPKARMEKKKNDE-ETEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPNCVERGERRTILSDHIEQRLVRRLKHERQERARAEGLVVRIVSSVDKKLEVK SRFLEIFQEE-NYPPEFPYKSKVLLLFQKIEGVEVCLFGMYVQEFGSEC-AQPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKKRGFSSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLSMLRKALKENVVVDLTNLYDHTCKAKVTAAR-FDGDYWPGAAEDMIYQLQQEEDGRKQHKK--------- ------------------------QSDLSGNSKDILLMHKLGETISP-MKED------FIMV >seq_1130 AERN----------------QAMEHSMSENSCQLCAVEKLTFEPPPIYCTPCGARIKRNAMYYTIGA---GETRHCFCINDARGD--TIVVDGA-----T LPKARAEKKKNDE-EIEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPNCYMAEVERTNLSDHLEQRLFAKLKHERHERARAEALVVRVVSSVDKKLDVK PRFLEIFQEE-NYPVEFPYKSKVILLFQRIEGVEVCLFGMYVQEFGSEC-QQPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKRRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLSMLRKASKENIVVDLTNLYEHTCKAKVTAAR-FDGDYWPGAAEDILYQLQQDEDGKKQHKK--------- ------------------------QTDLSSNSKDLMLMHKLGETISP-MKED------FIMV >seq_1138 AEKN----------------QAMEHSMSENSCQLCAVEKLTFEPPPIYCTPCGARIKRNAMYYTVGT---GDTRHYFCINEARGD--TIEVDGT-----P ILKAKLEKKRNDE-ETEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPNCYIGEIERTILSDHIEQRLFRRLKQERQERARAEALVIRVVSSVDKKLEVK PRFLEIFQED-NYPTEFPYKSKVILLFQKIEGVEVCLFGMYVQEFGSEC-QLPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKKRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKAAKENIVVDLTNLYDHVCKAKVTASR-FDGDYWPGAAEDMINQLRQEEDGKQQKKG--------- ------------------ALKAAGQSDLSANSKDLLLMQKLGETICP-MKED------FIMV >seq_1139 AEKN----------------QAMEHSMSENSCQLCAVEKLTFEPPPIYCTPCGARIKRNAMYYTVGA---GDTRHYFCINESRGD--TILAEGT-----P IPKARLEKKKNDE-ETEEWVQCDKCEAWQHQICALFNGRRNDAEYTCPYCFIAEVERTILSDHIEQRLFKRLKQERTERARAESLVIRVVSSVDKKLEVK SRFLEIFRED-NYPTEFAYKSKVVLLFQKIEGVEVCLFGMYVQEFGSEC-AFPNRVYLSYLDSVKYFREALRTFVYHEILIGYLEYCKLRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLKKASKEGIVAETINLYDHMCKAKVTAAR-FDGDYWPGAAEDLIYQMSQEEDGRKGNKK--------- ------------------------QTDLSGNSKDLLLMHKLGETIHP-MKED------FIMV >tr|Q4SVN3|Q4SVN3_TETNG Chromosome undetermined SCAF13749, whole genome shotgun sequence OX=99883 OS=nigroviridis). GN=GSTENG00011900001 PE=4 SV=1 IVKN-PMDLSTIKRKLDTGQESEIDPVMQGLGYCCG-RKFEFSPQTLCCGKQLCTIQRDAAYFSYQN------RYHFCENEIQGE--NVSLGDDPSQQTS INKDQFQRKKNDTLDPELLVECLDCGRKMHQICVLHNETIWPSGFVCDNCLKMSNKQTKLGNYLETRVNDFIKRQN--HPESGEVTIRMVHVSDKVVEVK PGMKSRFVDSGEMAESFPYRLKALFAFEDIDGTEVCFFGMHVQEYGSDC-PPPNRVYISYLDSIHFFKRHLRTAVYHEILLGYLEYVKRQGYTTGHIWAC PPSEGDDYIFHCHPVDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREEKLYATMEKHK EVFFVIRLIAGPSLPPITDPDSLMACDLMDGDAFLTLCRDKHLEFSSLRRSKWSSMCMLVEL >tr|F8W518|F8W518_DANRE Uncharacterized protein OX=7955 OS=Danio rerio (Zebrafish) (Brachydanio rerio). GN= PE=4 SV=1 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKLEFSPQTLCCGKQLCTIPRDAAYFSYQNSSPLADRYHFCENEIQGE--TVSLGDDPSQQTS INKDQFEKKKNDTLDPELFVECMDCGRKMHQICVLHNETIWPSGFVCDGCLKKSNKQTKLGNFLETRVNAYLKRQN--HPESGEVTVRVVHVSEKMVEVK PGMKSRFVDSGEMSESFPYKSKALFAFEEIDGVDVCFFGMHVQEYGSDC-PPPNRVYISYLDSVHFFQRFLRTEVYHEILIGYLDYAKRQGFTTGHIWAC PPSEGDDYIFHCHPADQKIPKPKRLQEWYKKMLDKAVAERVVHDYKDIFKQATEDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREEKLYATMEKHK EVFFVIRLFAAPALLPIVDPDPLMACDLMDGDAFLTIARDKHLEFSSLRRSKWSTMCMLVEL >tr|F6RXU0|F6RXU0_ORNAN Uncharacterized protein OX=9258 OS=Ornithorhynchus anatinus (Duckbill platypus). GN= PE=4 SV=1 IVKS-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKLEFSPQTLCCGKQLCTIPRDATYYSYQN------RYHFCENEIQGE--SVSLGDDPSQQTT INKEQFSKRKNDTLDPELFVECTECGRKMHQICVLHNEIIWPSGFVCDGCLKKTGRSTRLGTFLENRVNDFLRRQN--HPESGEVTVRVVHASDKTVEVK PGMKARFVDSGEMAESFPYRTKALFAFEEIDGVDLCFFGMHVQEYGSDC-PPPNRVYISYLDSVHFFRKCLRTAVYHEILIGYLEYVKKLGYTTGHIWAC PPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKSVSERIVHDYKDIFKQATEDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREQKLYATMEKHK EVFFVIRLIAGPALPPIVEPDPLIPCDLMDGDAFLTLARDKHLEFSSLRRAQWSTMCMLVEL >tr|K9J4E6|K9J4E6_DESRO Putative histone acetylation protein OX=9430 OS=Desmodus rotundus (Vampire bat). GN= PE=2 SV=1 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKYEFSPQTLCCGKQLCTIPRDAAYYSYQN------RYHFCETEIQGE--NVTLGDDPSQQTT ISKDQFEKKKNDTLDPEPFVDCKECGRKMHQICVLHYDIIWPSGFVCDNCLKKTGRTTRLGNHLEDRVNKFLRRQN--HPEAGEVFVRVVASSDKTVEVK PGMKSRFVDSGEMSESFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSDC-PPPNRVYISYLDSIHFFRRCLRTAVYHEILIGYLEYVKKLGYVTGHIWAC PPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAFAERIIHDYKDIFKQATEDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKKEQKLYATMEKHK EVFFVIHLHAGPVLPPIVDPDPLLSCDLMDGDAFLTLARDKHWEFSSLRRSKWSTLCMLVEL >tr|G3NM42|G3NM42_GASAC Uncharacterized protein OX=69293 OS=Gasterosteus aculeatus (Three-spined stickleback). GN= PE=4 SV=1 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKLEFSP-TLCCGKQLCTIPRDAAYFSYQN------RYHF--NEIQGD--TVSLGDDPTQQTS INKEQFEKKKNDSLDPELLV-CLDCGRRMHQICVLHHETIWPSGFVCDGCLKKTNKQTKLGSYLEMRVNDFLKRQT--HTGA--VFIRVVHVSDKVVEVK PGMKSRFVDSGEMSESFPYRTKALFAFEDI-GSDVCFFGMHVQEYGSDC-PQPNRVYISYLDSVHFFRRGLRTAVYHEVLIGYLEYVRKLGYTTGHIWAC PPSEGDDYIFHCHPADQKIPKP---QEWYR--LDKAVAERIVHDYKDVFKQATEDRLTSAKEFEGDFWPN-L-ESIK-LEQEEEERKREQKLYATMEKHK EVFFVIRLIAAPMLPPITDPDPLMACDLMDGDAFLTLARDKHLEFSSLRRSLWSSMCMLVEL >tr|L7M8Z9|L7M8Z9_9ACAR Putative histone acetylation protein OX=72859 OS=Rhipicephalus pulchellus. GN= PE=2 SV=1 IVKN-PMDLSTIKRKLDTGQEQEIDPVMQSLGYCCG-RKYVYQPQVLCCGKQLCTIPRDAKYWSYQN------RYTYCQAEIPGD--SVTLGDDPTQPTT IRKDQFVEMKNDHLELEPFVECLDCGRKLHQVCVLHFDSIWPEGFTCDNCLKQKGKTCKLANFIENRVNCYLKKK---ESGAGEVTIRVVSCTEKLVEVK PGMKMRYVDTGHWPEQFPYRAKALFAFEEIDGVDTCFFGMHVQEYGSEC-HPPNRVYIAYLDSVHFFRKRFRTAVYHEILLGYLDYVKQLGFAMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKMMLDKGIMERIVLEYKDILKQATEDNLKSACEFEGDFWPNVLEESIKELDQEEEEKRKAAKIYSTMEKHK EVFFVIRLHSAQAAVQIHDPDLLMQCDLMDGDAFLTLAREKHYEFSSLRRAKYSTMAMLYEL >tr|D2A0K2|D2A0K2_TRICA Putative uncharacterized protein GLEAN_08222 OX=7070 OS=Tribolium castaneum (Red flour beetle). GN= PE=4 SV=1 IVKR-PMDLSTIKKKLDIGQEMEIDPVMQSMGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYYSYQNSLKGSDRYTFCQNDIQGD--TVTLGDDPTQATA IKKDQFKEMKNDHLEMEAFVHCTDCGRKLHQICVLYNENIWTQGFTCDECLKKKGQVTKLGVYIETRVNNFLKKK---EAGAGEVSIRVVSSSEKTVEVK PGMRGKFVETGELASEFPYRAKALFAFEEIDGVDVCFFGMHVQEYGSEC-PPPNRVYIAYLDSVHFFKRQFRTAVYHEILLGYMDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPVEQKIPKPKRLQDWYKKMLDKGIIERIVLDYKDILKQAMEDNLRSAADFEGDFWPNVLEESIKELDQEEEEKRKQAKIFATMEKHK EVFFVIRLHSVQSAAPIQDPDPFINCDLMDGDAFLTLAREKHYEFSSLRRAKYSTMCMLYEL >tr|E0VCF3|E0VCF3_PEDHC CREB-binding protein, putative OX=121224 OS=Pediculus humanus subsp. corporis (Body louse). GN=Phum_PHUM086710 PE=4 SV=1 -------------------------------------------TKVLCCGKQLCTIPKDAKYFSFQN------RYTFCQNDIPGD--TVTLGDDPSQSSV IKKDQFKELKNDHLELEPFVACSDCGRKLHQICVLYMEAIWPNGFTCDNCLKKKGAVTKLGTYIETRVNNYLKKK---EAGAGEVAIRVVSSSDKIVEVK PGMRNRFVDNGDLPEQFPYRAKALFAFEDVDGADVCFFGMHVQEYGSDC-PTPNRVYIAYLDSVHFFRRQFRTAVYHEILLGYLDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPAEQKIPKPKRLQEWYKKMLDKGIIERIVLDYKDILKQAMEDKLSSAAEFEGDFWPNVLEESIKELDQEEEEKRKQAKIFATMEKHK EVFFVIRLHSAQSAAPIQDPDPFINCDLMDGDAFLTMAREKHYEFSSLRRAKYSSMAMLYEL >tr|J9JSK5|J9JSK5_ACYPI Uncharacterized protein OX=7029 OS=Acyrthosiphon pisum (Pea aphid). GN= PE=4 SV=1 IIKK-PMDLSTIREKLNTGQEQEIYPVMRSLGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYFSFEN------RYIYCVNDIPGD--AVTLGEDPTQAQV IKKEQFMEMKNDHLELEPFILCTHCGRKLHQICVLHNENINPLGYVCDNCLKKRGEITKLGTYIETRVNNFLKKK---EAGAGEVAIRVVSSSDKIVEVK PGMRSRFVDNGEMIGEFPYRAKALYAFEEIDGTDVCFFGMHVQEYGSEA-PSPNRVYIAYLDSVNFFRKQYRTYVYHEILLGYLDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKKMLDKGIIERIILDYKDILKQAIEDKLSSAAEFEGDFWPNALEDSIKELDQEEEQKRKLAKIFDNMDKHK EVFFVIRLHSVQSAAAIQDPDPLISCDLMDGDAFLTMAREKHYEFSSLRRAKFSSMAMLYEL >tr|B0WGE7|B0WGE7_CULQU Putative uncharacterized protein OX=7176 OS=Culex quinquefasciatus (Southern house mosquito) (Culex pungens). GN=CpipJ_CPIJ005540 PE=4 SV=1 IVRK-PMDLSTIRKKLDSGQEQEIDPVMQSLGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYYSY--Q----NRYTYCQNEIPGD--MVTLGDDPMQTTQ IKKDQFKEMKNDHLELEPFLDCLDCGRKQHQICVLYLEQIWPGGFVCDNCLKKKGTTSKLGTYIETRVNNFLKKK---EAGAGEVHIRVVSSSDKMVEVK PGMRSRFVDNNEMLAEFPYRAKALFAFEEVDGIDVCFFGMHVQEYGSEC-AAPNRVYIAYLDSVHFFQRQYRTSVYHEILLGYMDYAKQLGYTMAHIWAC PPSEGDDYIFHCHPPEQRIPKPKRLQEWYKKMLDKGMVERTIQDYKDILKQAMEDKLQSASEFEGDFWPNVLEESIKELDQEEEEKRKQAKIFATMEKHK EVFFVIRLHSAQSAAPIQDPDPLINCDLMDGDAFLTMARDKHYEFSSLRRAQFSTLCMLYEL >tr|F4WL85|F4WL85_ACREC CREB-binding protein OX=103372 OS=octospinosus echinatior). GN=G5I_06509 PE=4 SV=1 IVKK-PMDLSTIKKKLDTEKEQEIDPVMQALGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYYSYQNSGLLSDRYTFCQNDIPGD--TVTLGDDPTQPTA IKKEQFQEMKNDHLELEPFVTCTDCGRRVHQICVLHMEAIWPLGFTCDNCLKKKGQVTKLGTYIETRVNNFLKKK---EAGAGEVAIRVVASSDKVVEVK PGMRSRFVENGDMPGEFPYRAKALFAFEEVDGTDVCFFGMHVQEYGSEC-TPPNRVYIAYLDSVHFFRRQFRTAVYHEILLGYLDYAKQLGYTMAHIWAC PPSEGDDYIFHCHPAEQKIPKPKRLQEWYKKMLDKGMVERIVLDYKDILKQAMEDRLSSAADFEGDFWPNVLEESIKELDQEEEEKRKQAKIFATMEKHK EVFFVIRLHSAQSAAPIQDPDPVINCDLMDGDAFLTMARERHYEFSSLRRAKFSSMSMLYEL >tr|E9HPX2|E9HPX2_DAPPU Putative uncharacterized protein OX=6669 OS=Daphnia pulex (Water flea). GN=DAPPUDRAFT_64953 PE=4 SV=1 IIKK-PMDLATIRRKIDNGQEQEIDPVMQSLGYCCG-RKYTFNPQVLCCGKQLCSIPRDAKYFSYH-------RYTYCLNDIPGD--TVSLGDDPTQPTV IRKEQFTELKNDALDLEPFVDCGECGRKLHQICVLHMDCIWPQGFTCDNCLKARGPTTKLSTFIENRVNNFLKKK---EAGAGEVSIRVVASADKITEVK PGMKSRFVDNGEMCETFPYRAKALFAFEEIDGTDVCFFGMHVQEYGSDS-PSPNRVYLAYLDSVHFFRKQFRTAVYHEILLGYLDYVKQLGYCMAHIWAC PPSEGDDYIFHCHPPDQRIPKPKRLQDWYKKMLDRGIIERIVLDYKDILKQAMEDNLKSAAEFEGDFWPNVLEESIREMDQEEEEKRRQTKIFTTMEKHK EVFFVIRLHSVQSLGPVVDPDPLIQCDLMDGDAFLTLARDKHYEFSSLRRCKYSSMAMLYDL >tr|B4IDI6|B4IDI6_DROSE GM11349 OX=7238 OS=Drosophila sechellia (Fruit fly). GN= PE=4 SV=1 IVKK-PMDLGTIRTNIQNGFEAEIDPVMQALGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYYSYQNYGVASNRYTYCQNDIQGD--TVTLGDDPLQQTQ IKKDQFKEMKNDHLELEPFVDCQECGRKQHQICVLWLDSIWPGGFVCDNCLKKKNSTTKLGVYIETRVNNFLKKK---EAGAGEVHIRVVSSSDKCVEVK PGMRRRFVEQGEMMNEFPYRAKALFAFEEVDGIDVCFFGMHVQEYGSEC-PAPNRVYIAYLDSVHFFRRQYRTAVYHEILLGYMDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPTDQKIPKPKRLQEWYKKMLDKGMIERIIQDYKDILKQAMEDKLGSAAEFEGDFWPNVLEESIKELDQEEEEKRKQAKIYATMEKHK EVFFVIRLHSAQSAAPIQDPDPLLTCDLMDGDAFLTLARDKHFEFSSLRRAQFSTLSMLYEL >tr|F4IDH2|F4IDH2_ARATH E1A/CREB-binding protein OX=3702 OS=Arabidopsis thaliana (Mouse-ear cress). GN= PE=4 SV=1 -------------AKAEKNQ-AMEHSMSENSCQLCAVEKLTFEPPPIYCTPCGARIKRNAMYYTVGA---GDTRHYFCYNESRGD--TILAEGT-----P MPKARLEKKKNDEETEEWWVQCDKCEAWQHQICALFNGRRNQAEYTCPYCFIAELPRTILSDHIEQRLFKRLKQERTERATAESLVIRVVSSVDKKLEVK PRFLEIFRED-SYPTEFAYKSKVVLLFQKIEGVEVCLFGMYVQEFGSEC-AFPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLEYCKLRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKASKEGIVAETINLYDHFCRAKVTAARLFDGDYWPGAAEDLIYQMSQEEDGRKGNKKLGETIHPMK EDFIMVHLQPSDIPADTRDKDEILESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMVLYHL >tr|K7KYS1|K7KYS1_SOYBN Uncharacterized protein OX=3847 OS=Glycine max (Soybean) (Glycine hispida). GN= PE=4 SV=1 -------------SKAEKNQ-AMEHSMSENSCQLCAVEKLTFEPPPIYCTTCGVRIKRNNMYYTTGT---GDTRHYFCYNDARTE--NIIVDGT-----P IAKSRLEKKKNDEETEEWWVQCDKCEAWQHQICALFNGRRNQAEYTCPNCYIQELPRTILSDHIEQRLFKRLKQERQERAGAEALVIRVVSSVDKKLEVK PRFLEIFQEE-NYPTEFPYKSKVVLLFQRIEGVEVCLFGMYVQEFGSEC-QFPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLEYCKKRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLAMLRKAAKENIVVDLTNLYDHFCRAKVTAARLFDGDYWPGAAEDLIYQLRQEEDGRKQNKKLGETICPMK EDFIMVHLQHADVPSDTKDKDDILESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMVLYHL >sp|Q6YXY2|HACL1_ORYSJ Probable histone acetyltransferase HAC-like 1 OX=39947 OS=Oryza sativa subsp. japonica (Rice). GN=OSJNBa0026E05.8, OSJNBa0081C13.32 PE=3 SV=1 -------------AKAEKNQ-LMGHNENENSCQLCKVEKLTFEPPPIYCSPCGARIKRNAPYYTVGT---GDTRHFFCYNESRGD--TIEVEGQ-----N FLKARFEKKRNDEETEEWWVQCDKCECWQHQICALFNGRRNQAEYTCPNCYVEELPRTVLSDHIEDRLFKRLKQERQDRAGAEGLVVRVVSSVDKKLEVK PRFLEIFQED-NYPTEFPYKSKAVLLFQKIEGVEVCLFGMYVQEFGAEC-SYPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLEYCKQRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLSMLRKATKEEIVVELTNLYDHFCKAKVTASRLFDGDYWPGAAEDMINQLRQEEDDRKQQKKLGETIYPMK EDFIMVHLQYSGLPKDTKDRDDILESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMVLYHL >tr|D7L0M4|D7L0M4_ARALL Histone acetyltransferase 5 OX=81972 OS=Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). GN=ARALYDRAFT_897509 PE=4 SV=1 -------------TKAEKNK-AMGLSMSENSCQLCAVERLAFEPTPIYCTPCGARVKRNAMHYTVVA---GESRHYVCYNEARAN--TVSVDGA-----S VPKSRFEKKKNDEEVEESWVQCDKCQAWQHQICALFNGRRNQAEYTCPNCYIQELPASTLSNHLEQRLFKKLKQERQERAGADSLVIRVVASVDKILEVK PRFLDIFRED-NYSSEFPYKSKAILLFQKIEGVEVCLFGMYVQEFGTDS-ASPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLDYCKKRGFSSCYIWAC PPLKGEDYILYCHPEIQKTPKTDKLREWYLAMLKKASKEKVVVECTNFYDHFCRAKVTAARLFDGDYWPGAAEDLIDQMSQEEDGKKSNRKLGEIILPMK EDFIMVHLQHCDIPTEIKDNDAILESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMILYHL >tr|K4BP17|K4BP17_SOLLC Uncharacterized protein OX=4081 OS=Solanum lycopersicum (Tomato) (Lycopersicon esculentum). GN= PE=4 SV=1 -------------AKAEKNQ-AMEHSMSENSCQLCAVEKLNFEPPPIYCTPCGARIKRNAMYYTIGA---GDTRHYFCYNEARGD--TIVVDGT-----S VPKARMEKKRNDEETEEWWVQCDKCEAWQHQICALFNGRRNQAEYTCPNCYIAELPQTTLSDHIEKRLANSLREEREKRAGAEGLVVRIVSSVDKKLEVK PRFLEIFQEE-NYPLEFPYKSKVLLLFQRIEGVEVCLFGMYVQEFGSEC-AQPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLEYCKKRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLSMLRKAKEQNIVVELTNLYNHFCKAKVTAARLFDGDYWPGAAEDMIYQLQQEEDGRKQPKKLGETISPMK EDFIMVHLQHAEVPHDTKDEDEILESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMVLYHL >tr|A2YGS5|A2YGS5_ORYSI Putative uncharacterized protein OX=39946 OS=Oryza sativa subsp. indica (Rice). GN=OsI_24386 PE=4 SV=1 -------------AKAEKNQ-VIGYSESESLCQLCKVENLTFEPRPIYCSPCGARIKRNASYYTGST---AMGRLFFCYNASLGN--TIEVE-------L ISKADLEKKRNSDEPEEGWVQCDKCECWQHQICALFNARRNEAEYTCFKCYIEELPRTLLSDHIEEQLFKRLREERQERAGADGLVVRVVSSVDKKLEVK PRFFKILQED-NYPAEFPYKSKAILLFQKIEGVEVCLFGIYVQEYGAEC-KFPNRVYLSYLDSVKYFRPALRTYVYHEILIGYLEYCKQRGFTSCYIWAC PPVKGEDYILYCHPEIQKTPKSDKLRQWYLSMLQKAIKENIVVELTNLYDQFCKIKVSASRLFDGDYWPGAAEDIINQLQLEGDG-KLLKKLGEIICPIK DDLIMVHLQYSGVSEDTKDRDIILENEIFDTQAFLSFCQGYHYQYDTLRRAKHSTMMMLYHL >tr|A9TAD6|A9TAD6_PHYPA p300/CBP acetyltransferase-related protein OX=145481 OS=Physcomitrella patens subsp. patens (Moss). GN= PE=4 SV=1 -------------AKAEKNQ-ALEYTISENACQLCAVEKLTFDPPPIYCTACGARIKRNALYYTTGS---GDTKHYFCYNEVRSE--NVEMEGM-----I YPKSKLEKRKNDEETEEAWVQCDKCNLWQHQVCALFNGRRNDSEYICPECCCVELPKTFLSDHLEQRLARKLKQERSERASAEALVVRVVSSVDKKLEVK QRFLEIFQEE-DYPTEYPYKSKVVLLFQKIEGVEVCLFGMYVQEFGMEC-SLPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLDYCRRRGFSSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLRDWYLTMLRKASNDGIVVEITNLYDYFCKAKVTAARLFDGDYWPGAAEDMIVQLQQEEEDKNGLKKLGESIFPMK EDFIMVHMHHADIPSDTKDKDELMESEFFDTQAFLSLCQGNHYQYDTLRRAKHSSMMVLYHL >tr|I1GWT3|I1GWT3_BRADI Uncharacterized protein OX=15368 OS=Brachypodium distachyon (Purple false brome) (Trachynia distachya). GN= PE=4 SV=1 -------------AKVGKSQ-AMEHSENENSCQLCKVVKLNFEPPPIYCSPCGVRIKRNALYYTVST---IETSHNFCYNESRNH--KIEVEGK-----L IDKDKLAKKRNDVETEESWVECGKCQSWQHQICALFNSKRNEAEYICPKCYVWELPKTVLSDHIEERLFKRLREERHNRAGAAGLVVRVVSSVDKKLEVK PRFFEIFQED-KYPAEFPYKSKAILLFQKIEGVEVCLFGMYVQEFGAEC-AAPNRVYLSYLDSVKFFRPALRTYVYHEILIGYLEYCKQRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPRSDKLREWYLSMLRKAMKEHIVVELTNLYDHFCKAKVTASRLFDGDYWPGAAEDMINQLFLEEND-SKLQKLGETIYPMK EDFIMVHLQHSGVPGDTMDKDDILECEFFDTQAFLSLCQGNRYQHDTLRRAKHSSMMVLYHL >tr|K1S5V1|K1S5V1_CRAGI CREB-binding protein OX=29159 OS=Crassostrea gigas (Pacific oyster) (Crassostrea angulata). GN= PE=4 SV=1 IVKK-PMDLSTIRRKLDTGQEGEIDGVMQSLGYCCG-HKFVFSPQVLCCGKQLCTIPRDAIYYSYQN------RYIYCEKEIQGD--EVELSDDPTQPTK ISKGQFSKLKNDQLDYEPFVECDECGRKMHQICVLHFEPIWPNGFTCDNCHRAKGTPTKLGTYLENRVNNFLKNKD---AGAGDVTIKVLSSGDKVVEVK PGMKARFCDNGEMQETFQYRAKAMFAFEEIDGTDVCFFGMHVQEYGSDC-PQPNRVYISYLDSVHFFQRQLRTAVYHEILIGYLEYVKQQGYSWAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKKMLDKAIIERCVIDYKDILKDAIESNVTSATYFEGDFWPNVLEESIKELDQEEEEKRKRQKVYATMEKHK EVFFVIRLHSQQSLPPIIDPDPMITCDLMDGDAFLTMARDKHQEFSSLRRAKYSTLAMLYEI >tr|Q8MTV9|Q8MTV9_APLCA CREB-binding protein OX=6500 OS=Aplysia californica (California sea hare). GN= PE=2 SV=1 IVKK-PMDLSTIRRKLDSGLESEIDSVMQSLGYCCG-HKYVFCPQVLCCGKQLCTIPRDSMYYSYQN------RYVYCENEIQTD--EVELSEDPTQPMK IRKDQFDRVKNDQLDYEPFIDCQECGRRWHQICALWFESIWREGWSCDSCLKAKGVTTKLGTYLENRVNNFLRKKD---SGAGEVTIRVLSSYDKMTEVK PLMKKRF--GNDMEDSYPYRAKAMFAFEEIDGVDVCFFGMHVQEYGSSC-PGPNRVYISYLDSVHFFKRHLRTAVYHEILIGYLEYAKTLGYTTAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKKMLDKAIIDRVVVDYKDIFKDAIDSGISSAKDFEGDFWPNVIEESIKELDQEEEEKRRRAKLYNHMEKHK EVFFVIRLHNQQILPPITEPDPAMSCELMDGDAFLTTAREKHYEFSSLRRAKLSSMALLYEL >tr|Q4SSB2|Q4SSB2_TETNG Chromosome 3 SCAF14475, whole genome shotgun sequence. OX=99883 OS=nigroviridis). GN=GSTENG00013510001 PE=4 SV=1 IVKN-PIDLSTIKRKLDTGQEAEIDPVMQGLGYCCG-RKYEFSPQTLCCGKQLCTISRDGIYYSYQNYGLIATRYHFCENEIQGN--SVTLGEDPAQQTM ISKEQFEKKKNDTLDAEPFVECKDCGRKMHQICVLHYDVIWPSGFICNNCLKKSGKSTKLGTYIEDRVNKYLKRQNH--PEAGEVFVRVVASSDKTVEIK PGMKSFV-DSGEMVENFPYRTKALFAFEEIDGVDVCFFGMHVQEYGSEC-PFPNRVYISYLDSVHFFKPLLRTAVYHEILIGYLEYVRKLGYVTGHIWAC PPGEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLEKAFAERIIHDYKDIFKQATEDRLTSANEFEGDFWPNVLEESIKELEQEEEERKKEQKLYATMEKHK EVFFVIHLHAGPVLPPIIDPDPLLTCDLMDGDAFLTLARDKHWEFSSLRRCKWSTMCMLVEL >tr|A8P7P7|A8P7P7_BRUMA TAZ zinc finger family protein OX=6279 OS=Brugia malayi (Filarial nematode worm). GN=Bm1_18500 PE=4 SV=1 IVKK-PMDLSTVSRKLDTGEYEEINPVMQKMGYCCG-QKLSFTPLALFCYQSMCTIARDQTYHVYETGVTVSERYTYCTKALPES--GISLSENPNDTSM VPKSKFVQMKNDQIDFEPFEKCIKCFRKWHRVCALFNKKVFPEGFICATCRREKNPHCQLSRFIEERVNKFMKNN---AGKDYEVIIRVLCAADKEVEVK PLMKQKYGPLG-FPEKFPYRTKAIFAFEVIDGVEVCFFGLHVQEYGSNC-PPPNRVYIAYLDSVHFFQRQLRTEVYHEILLGYLNYAMMLGYTMAHIWAC PPSEGDDYIFHCHPPEQRIPKPKRLQDWYKKMLDKGVGEHTVFDYKDIYKQARDENLVTPMEFEGDFWPNVIEDCIREAQNEEQERKRQEKLYINFEKHK EVFFTIRLMSPQSELEIKDPDPLIASDLMDGDTFLTRARDEHWEFSSLRRAKYSTICFCHAL >tr|F1KPX9|F1KPX9_ASCSU Protein cbp-1 OX=6253 OS=Ascaris suum (Pig roundworm) (Ascaris lumbricoides). GN= PE=2 SV=1 IVKN-PMDLSTISEKLDNGLYEEINPVMRKMGYCCG-QKLAFTPLALFCYQSMCTIARDQPYHVYEAGVTVSDRYTYCTKALDEK--GISLSENPNDTSM VPKSKFTLMKNDQIDLEPFEQCKVCHRRWHRICALYNKKVFPDGFTCETCRAEKSPHCALSRFIEDRVNKYLKSN---AGKDCEVIIRVLCATDKEVEVK PLMKQKYGPMG-FPDKFPYRTKAVFAFEVIDGTEVCFFGLHVQEYGSNC-PPPNRVYIAYLDSVHFFQRQLRTDVYHEILLGYLHYAMQLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKKMLDKGVGEHTVFDYKDIYKQARDDNLTTPMAFEGDFWPNVIEDCIREVQNEEQERKRQEKLYNNFEKHK EVFFTIRLMSPQSELEINDPDPLMPSDLMDGDTFLTRARDEHWEFSSLRRAKYSSICFCHAL >tr|G0MKA3|G0MKA3_CAEBE CBN-CBP-1 protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN= PE=4 SV=1 IIKR-PMDLDTIHKKLHSAQYAEMDPVMKVMGYCCA-KKLAFTPLSLFCYAAMCTIAREQQYWVYEQNVTVTERYTYCQKALPPE--GISLSENPNDRSM APKTAFVEQKNSVIDYEPFERCKYCMRKWHRICALHDKKVYPEGFICECCRTAKKPHNKLSTFLEDRVNKFIKNQLQAEAHKYPVIIRTLCVQDKEAEVK PQMKQKYVEGGTFPEKFPYRTKAVFAFELIDGVEVCFFGLHVQEYGSNC-PAPNRVYIAYLDSVHFFQRELRTDVYHEILLGYLDYAKKLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQDWYKKMLEKGVQEGSVVEFKDIYKQARDDNLTTPTQFEGDFWPNVIEDCIREASNEEAQRKVKEKLYSQFEKHK EVFFTIRLVTVQNEPPIDDPDGLMPSDMMDGDTFLTKAREEHWEFSSLRRAKYSTLCLAYSL >tr|H9IUC3|H9IUC3_BOMMO Uncharacterized protein OX=7091 OS=Bombyx mori (Silk moth). GN= PE=4 SV=1 IVSK-PIDLSTIKSKLDRGVEQEIDPVMQSLGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYFSYKN------RYTFCQNEIQGD--TVTLGDDPLQQTA IKKDQFKEMKNDHLEQEPFVMCMDCGRKQHQICVLHHDSIWPQGFCCDNCLKKKGGTSKLGIYIETRVNNFLKKK---EAGAGEVHIRVVASSDKIVEVK PGMKTRFVEPGELSPEFPYRAKALFAFEEVDGTDICFFGMHVQEYGSES-PSPNRVYIAYLDSVHFFQRQCRTAVYHEILLGYLDYAKQLGYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKKMLDKGIIERIILDYKDILKQAMEDNISSAAEFEGDFWPNVLEESIKELDQEEEEKRKSAKIFATMEKHK EVFFVIRLHSAQSAAPIQDPDPLLNCDLMDGDAFLTMARDRHYEFSSTRRARYSTLCMLYEL >tr|F6R7T5|F6R7T5_CIOIN Uncharacterized protein OX=7719 OS=Ciona intestinalis (Transparent sea squirt) (Ascidia intestinalis). GN= PE=4 SV=1 IVKN-PIDLTTIRKKLEVGEEQEIDPVMQELGYCCG-RKLEFMPMTLCCGKALCTIQTGAVYYEYQNYGLLSDRYDFCENDIQGD--LVSLSDEPGQGNS VPKRDFMKKKNDVLDPEPFVQCDECGRKLHTICVLYNPQIWPEGFVCDNCHRSRGSHNKLSEHIEHRVNSYLKRHDTNN-EAGYIHIRSVYSGEKTVEVK PGMKSKFVDSKEMPETFPYKAKAFFAFEEIDGVDICFFGVHVQEYGSDA-PIPNRVYLSYLDSVHFFRRHLRTAVYHEILIGYFEYCKNLGYEFAHIWAC PPSEGDDYVFHCHPPEQKIPKPKRLQEWYKKMLDKAIVDQVVLEYKDIYKQAKDDRLQAAREFEGDFWPNVLEESIKELEQEEEDRKQADKLYQIMEKHK EVFFVIRLQAADTLGSIADPDPLMPCELMDGDSFLTLAREKHYEFSSLRRTKWSTMAMLVEL >tr|H3IXS8|H3IXS8_STRPU Uncharacterized protein OX=7668 OS=Strongylocentrotus purpuratus (Purple sea urchin). GN= PE=4 SV=1 IVKT-PMDLQTIKNKLDTGQEQEIDPVMQELGYCCG-RKHVFHPQVLCCGKQLCTIPRDAHYWTYQN------RYHFCENDIQGE--TVVLGEDPTQSTT IEKKTFCRKKNDILEPEPMYNCLECGRKLHQICVLHVDVIWTDGFICDGCRKQKNQPSKLGTHLENRVNKFLRDR---NAGAGEVVIRVMSCTEKIVEIK SGMKSRYLYGDDLPDTFPYRSKALFAFEEIDGVEVCFFGMHVQEYGSDS-PKPNRVYISYLDSVHFFQRAFRTAVYHEILIGYLEYTGCLGYEFAHIWAC PPSEGDDYIFHCHPVEQKIPKPKRLQDWYRRMLDKALGDGVINIYNDIMNAALEDGLTCATEFEGDFWPNVLEESIKELDQEEEERLKAQKLYATMEKHR EVFFVIKLKRAHN-EKIHDPDPLITCDLMDGDAFLTMARERHYEFSSLRRAKFSSLCMLCEL >tr|C3ZUG6|C3ZUG6_BRAFL Putative uncharacterized protein OX=7739 OS=Branchiostoma floridae (Florida lancelet) (Amphioxus). GN=BRAFLDRAFT_126580 PE=4 SV=1 IVKH-PMDLSTIKRKLDTGQEQEIDPVMANLGFCCG-RKYVFQPQALYCGKSVCTIPWGARYWTYEN------RYHYCQKEIGGD--TVTLGDDPSQAST IPKSSFEEKKNDVLDLEP--------------------------YQCDACLKQRLQTTKLGTFLENRVNTYLRERCVE---AGEVTIRVVHVSDKMVEVK PGMKNKFVDSGDMPEQFPYKAKALFAFEEIDGVEVCFFGMHVQEYGSDC-PQPNRVYISYLDSVHFFKRNLRTAVYHEILIGYLDYVKKLGYTMGHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKKMLDKAVAEGVVKDYKDILKQATEDGVSMANEFEGDFWPNVLEESIKEIDQEEEEERRSQKLFQTMEKHK EVFFVIRLIASNNLPSISDPDGLIQCDLMDGDAFLTLAREKHYEFSSLRRAKFSAMAMLYEL >tr|B7FZC4|B7FZC4_PHATC Predicted protein OX=556484 OS=Phaeodactylum tricornutum (strain CCAP 1055/1). GN=PHATRDRAFT_54505 PE=4 SV=1 IIKK-PMDLGTIQKRLESSAYHSIDDQNERACTLCGSEKLLFEPPVYFCNGQSQRIRRNSHFYIGGN-----NQYFWCSPELDDK-IPIELADL-----T VMKNNLKKKKNDEIHEESWVQCDTCERWVHQICGLFNTRQNKEEYCCPKCLLEKRPRTTLSEWLERSVTKKVEKRKRELESGGPIIIRQVTAMDRKLEVR ELMKKRYAHK-NYPDEFPFRCKSIVVFQHLDGVDVILFALYLYEHGEDN-PPPNTVYISYLDSVHFMRRKLRTFVYHEILIAYLDYARRRGFATAHIWAC PPLKGDDYIFYAKPEDQKTPRDSRLRLWYIDMLVECQKRSIVGKVTNMYDIYFNLDATAVPYLEGDYFPGEAENIIKMLEEGGGKKLGSVGLGETIQPMK ESFIVAFLNWKDAGKVIDDDAEDLDCEFLNNQAFLNLCRGNHYQFDELRRAKHTSLMLLWHL >tr|F0YI83|F0YI83_AURAN Putative uncharacterized protein OX=44056 OS=Aureococcus anophagefferens (Harmful bloom alga). GN=AURANDRAFT_38626 PE=4 SV=1 IIKN-PMDLGSIKKRMENNCYKSISESNGDVCVLCGCEKLLFEPTVYYCNGNGQRIRRNSYYYTGGR-----NQYHWCQQELREK-EPLEFADC-----T LWKKELQKKKNDEMHEEPWVECSQCNRWVHQICALFNGRMNKGIYHCPFCFMARRRHTKMSRFLEDRVIKSLDD-----LTASAVYVRQLSNIEKAHQVK PRILQRYADQ-KYPREFPVRSKCILLFQEMDGVDVILFGMYLYEYGHNC-PQPNRVYVSYLDSVYYFRRQYRTLVYHEMLIAYLAHTKERGFHTAHIWAC PPCKGDDYIFFCHPEDQKTPKDDRLRSWYITLLEKAKEEGIVTHITNLWDEHFDYDVNHIPYFEGDYWPGEAENVVKALEDEANERNESKSMGKILEPMK DAFIVAYLQPRDVSDETEDTDEIIESEFYDTQQFLNLCQGNHYQFDDLRRAKHTSMMSLYHM >tr|B8C9A0|B8C9A0_THAPS Putative uncharacterized protein ZFP16 OX=35128 OS=Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). GN= PE=4 SV=1 IVQH-PMDLALVETKLENGVYKDLDSKDDTSCTLCGNHRRLFEPTTLYCSGGMQKIRRNASYYTDRY-----RQNQWCEKVLMEE-KPVLLDDG-----K ETKKSLVKMKNDSTPEEKWVQCDNCHNWAHQICALFNEVQSSN-FTCPKCFLKQQPQCKLSTVIEEGLATTLSVEYEKIEKAEGLCVRVVSSLEKKHKVR DEMLGRYSKK-GYPSEFPVTSKCILLFQKIHGVDVLLFGMYVYEYGDKC-AAPNRVYISYLDSVQYLESSYRTTTYQSIIVEYLRYARMRGFHTAHIWSC PPSKGDEYIFYCHPSSQLVPKDDMLCAWYIETLKKAQDQGIVLETRTIYDEYFGFDPMSLPYFEGDYIPGEIEKIIRDFNKDENLREETKLLDLALAKMK QNFIVAQLLSDDDG-NTIDEDPLMEQEFIDTLQFLNYCQKNNLQFDELRRAKHTTMMLLCNL >tr|B8LBW8|B8LBW8_THAPS Putative uncharacterized protein ZFP14 OX=35128 OS=Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). GN= PE=4 SV=1 --------------------------KNGDACSLCGCEKLLFEPPVFYCNGRSKRIRRNSYFFVGGN-----NQYHWCQPDMKDN-QKLELADM-----T LTKNQLDKKKNNEVPEESWVQCDRCERWIHQICALFNTRQNKDEFVCPSCTIEDRPRTKLSEHLEKHVREKFHEEMARMDGGGEIYIRQVSSMTRTLDVR ERMRKRYSFK-NYPSEFKYRCKCIIVFQNLDGVDVILFGLYVYEHDETN-PMPDAVYVSYLDSVHYMRRRMRTFVYHELLISYLDYVRNKGYSTAHIWAC PPLKGDDYILFAKPEDQKIPKDDRLRQWYLDMLKDCQKRGIVGKLTNAYDLYFKNDATVLPYMEGDYFPAELENIIKDLEEGKNLSKKPDGLGETIHPMK ESFLVAFLDWEGSG------DDEMDCEFLNNQLFLNLCQGNHYQFDQLRRAKHTSMMVLWHL >tr|Q2VJ10|Q2VJ10_SCHMA CREB-binding protein OX=6183 OS=Schistosoma mansoni (Blood fluke). GN= PE=2 SV=1 VIKE-PMDLTTIRNNLEDGKQNRIDQVMQAMGFCCG-QDYEYQPQGLYCSSNLCTINRDATYFVYINIGLVCDKYYQCEKCFAGD--VIILADEPNQSIP IKREMFEKRKNNVKEKEEFVICVECGRRWHKVCALHMNEIWPSGYICPGCLRERGIVTRLSNFLEKRVNDFLKKKE---VGTGEVTIRVLASSDKVVEVK PLMRARFTESGELSESFPYRLKAVFAFQEIDGQDVCFFGLYVQEYGSES-PQPNRVYVAYLDSVFFFRPQYRTDVYHEILVGYLLYAKRCGYAMAHIWAC PPGEGDDYIFHMHPTEQKIPKAKRLQEWYRRMLQKAIIEGIVVDYKNILKDALDHQLVSPTEFEGDFWPNTLEDILKELEEEEARRRRERKVYDTMEKLK EIFFVIRLHRHNSAPPTTDPDQPVHSELMDSDAFLQMARERHLEFSSLRRAKYSSMVLLYEL >tr|I1GCN2|I1GCN2_AMPQE Uncharacterized protein OX=400682 OS=Amphimedon queenslandica (Sponge). GN= PE=4 SV=1 IIKN-PIDLSIIRRKLEDGSYENIDQAMVSLGYCCG-QKHVFHPQVLYCYGKLCPIPRDSPYYGYQN------KYVYCTTEIPGD--SVAVSLDGTSNTS LPKGAFKEMKNDSVGIEPMVECIHCKRSFHKICVLYHDAIWPEGYQCCNCLQRLRPTTKLGSFLEERVNSFLRNAN---VPGAEVTIRVVSSSNKLLDTR LGMMDRF---SYFPSQFPYRTKALFVFEEIDGAEVCFFGMHVQEYGSDC-PAPNRVYISYLDSVHFFRPEYRTLVYHEILIGYLEFCKQNGFQTAHIWAC PPGEGDDYIFHCHPMEQKIPKPKRLVEWYRNMLERGKDQSVLCDFQDIFQYCVEEEITTVTYFEGDFWPNVIEETIKELDQERKASELTQKIYQTMDKHK EVFFVVYLQPPNQLPTTSDPDSLVTCELMDGDAFLNLAREKHWEFSSLRRAKFSTMSMLYEL >tr|B4R761|B4R761_DROSI GD16050 OX=7240 OS=Drosophila simulans (Fruit fly). GN= PE=4 SV=1 IVKK-PMDLGTIRTNIQNGKKAEIDPVMQALGYCCG-RKYTFNPQVLCCGKQLCTIPRDAKYYSYQNYGVASNRYTYCQNDIQGD--TVTLGDDPLQQTQ IKKDQFKEMKNDHLELEPFVDCQECGRKQHQICVLWLDSIWPGGFVCDNCLKKKNPTTKLGVYIETRVNNFLKKK---EAGAGEVHIRVVSSSDKCVEVK PGMRRRFVEQGEMMNEFPYRAKALFAFEEVDGIDVCFFGMHVQEYGSEC-PAPNRVYIAYLDSVHFFRRQYRTAVYHEILLGYMDYVKQLGYTMAHIWAC PPSEGDDYIFHCHPTDQKIPKPKRLQEWATSVLEESIK--------ELDQEE-EEKRKQAEAAEANLFSIEENEVSGKAKQRKNSKDLSAKIYATMEKHK EVFFVIRLHSAQSAAPIQDPDPLLTCDLMDGDAFLTLARDKHFEFSSLRRAQFSTLSMLYEL >tr|D8T1P3|D8T1P3_SELML Putative uncharacterized protein OX=88036 OS=Selaginella moellendorffii (Spikemoss). GN=SELMODRAFT_40635 PE=4 SV=1 ---------GQSKAKAE-KHQAMEQQPSESACRVCAVERLTFEPPPLYCTTCGVRLKRNSVYYTAGS---GETRHYFCYNASPAD--TVGLDGQ-----L YPKSKLEKKKNDEETEEAWVQCDKCNVWQHQICALFNGRRNETEFICPYCLLNDMEKTLLSDHLEQRLARSLRKERADRAEAEGLVVRVVSSVDKKLEVK QRFLEIFKDS-NYPSEYLYKSKVVLLFQRIEGVEVCLFGMYVQEFGAEC-PEPNRLYIAYLDSVKYFRPALRTFVYHEILIGYLEYCKRRGFGSCYIWAC PPLRGEDYILYCHPEIQKTPKSDKLREWYLAMLGKATKENIVVELTNFYDYFFKAHVTAARYFDGDYWPGAAEEVLLQLHQDEEDKRRLHKVLQLMHPLK EDFILVHMHHNDIPAETEDKDEIMESEFFDTQAFLSLCQGNHYQHDTLRRAKHSSMMVLYHL >tr|D7TT54|D7TT54_VITVI Putative uncharacterized protein OX=29760 OS=Vitis vinifera (Grape). GN= PE=4 SV=1 ---------GSIQQKVA-GQK-ITNSTSENSCQLCMADNLLFAPEPMYCSLCGTRIKHGVLYYTQAE---NGATHCCCYKMSRGG--NITFCGF-----T ISKAKLDKKKNDKETEESWVQCDKCEGWQHQICALFNDKRDMAKYICPICYLKEIESTMLSNHIEQRLFSRLEQERKERASAEDLVVREVSSVDKQLKVN KEFLDFFHDD-NYPAEFPYKSKMILLFQKIEGVDVCLFGMHVQEFGSEC-RQPNSVYISYLDSVKYFRPALRTFVYNEILIGYLDYCKKRGFTTCYLWSC PPLKGEDYILYCHPRTQKTPKTDKLRQWYRSMLAKAAKENIAVELTNLYDHFFNTKVTAARYFDGDYWTTAAEDLIRNIVRESGGNLQKATLILLMQPKK EDFIMAHLQSNNVPVDTEDKDATLNNGFFENQSILSFLQTNHYQFDTLRRAKHSSMMMLHYL >tr|A8IRU2|A8IRU2_CHLRE CREB-binding protein OX=3055 OS=Chlamydomonas reinhardtii (Chlamydomonas smithii). GN=CHLREDRAFT_145462 PE=4 SV=1 ---------------------------PDDACKVCALTKLSFEPPVIYCSSCGLRIKRGQIFYSTPPDHGNDLKGYFCFTDQKGE--RILVEGV-----S IKKSDLVKRKNDEEIEEGWVQCDHCEGWVHQICGMFNKGRNDVHYLCPDCLAVGLPTSRLSEFITERLNRELEKEHHKRAKPEPLTVRMINSVMKKCEVK PRFHETFPTD-GYPGEFGYRQKVLLLFQSLDGVDVCLFCMYVQEYGKDC-PAPNVVYLSYLDSVKYFRPSLRTFVYHQLLIAYVEFTRNMGFEQMYIWAC PPMQGDDYILYCHPTKQKTPRSDRLRMWYIEMLKLAKEEGIVKHLSTLWDTYFEGGRTYIPYMEGDYWPGEAENQLMAINDAAKGKPGTKRLGEILGGMR EDFIVVHMQVPCAGVWDRDPDGDMESEFFETQTFLSLCQGNHYQFDTLRRAKHSSMMVLYHL >tr|I0YZA8|I0YZA8_9CHLO DUF906-domain-containing protein OX=574566 OS=Coccomyxa subellipsoidea C-169. GN= PE=4 SV=1 ----------------------------SDPCAACGVCRYTFEPPSIFCTSCSQRIKRNQVYYTTPAKKSEGVKGFWCYSEHRGE--IIQMEGM-----R VRKNELEKRKNDEETEEGWVQCDVCDCWVHQICGLFNKGRNETPYVCPMCLMDGLPRCDLSEFLEDRVAKALERDLHMRAAATGLTLRVINNVVKKMDTK SKFYDAFRKD-GYPANFEYKQKVIMLFQKQDGVDVCLYCLYMQEYPDSC-PAPNWIYLSYLDSVKYFRPALRTLVYHEVLQGYLSYVKARGYTSMFIWAC PPLQGDDYILYCHPSRQKTPRSEHLREWYLRMLRQAQVEGSVTHLSNLWDTFFEGGRTHLPYFEGDYWPGEAESLLMNMGEEARGKKGSKKLGDTISGMK ADFIVVHMQEPCSGVTHDEGNPEMHCEFFDTQAFLSLCQGNHYQFDSMRRAKHSSMMVLYHL >tr|G4VIH1|G4VIH1_SCHMA Putative creb-binding protein 2 OX=6183 OS=Schistosoma mansoni (Blood fluke). GN=Smp_127010 PE=4 SV=1 IVSD-PMDLSTIRRKLEDREYKTCTPVMQSLGFCCG-REYYYQPPTLTCLPKFCTIYRDAVYYVYKSPGLLEQKYTVCERCYNEALDQIALDSDSSNPVN VQKSLMEKCKNDIKEKEPFVFCKHCGRKWHRVCAIYLEEIWPDGFICNHCIVKKLTTCKLSNFLEKRVNDFLKKKE---ADAGDVIIRVLAAADKTVEVK SGMKARFCDNGEMPESFPYRVKAIFAFQEIDGQEVCFFGLHVQEYGSEC-PLPNRVYVAYLDSVYFFRPQFRTEIYHELLVGYIHYAKLLGFTMAHIWAC PPSEGDDYIFHMHPPDQKIPKPKRLQEWYQKMLKKALIERIVVDYKDICQDANESHMISPSEFEGDFWPNTLEEIFKEMDEEDAKRKQERRVYEIMEKHK ENFFVIRLHPQNSVAPIKDPDPLINSELMECGAFLEKAREKHLEFSSLRRAKYSTLVMLYEL >tr|C1MXM8|C1MXM8_MICPC Histone acetyltransferase OX=564608 OS=Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga). GN=MICPUCDRAFT_60218 PE=4 SV=1 ------------ARRRDAESEKAIAGATESACQACGVERLTFEPPPLYCYACVGRIKRGQVFYHIPHTGGEVRKDAWCNNQIQG---SIEVEGV-----K YPKNQLAKKKNDDDLEEPWVQCDYCNCWYHQICVLFNGRKNEAPFTCPSCILSQLPHTRLSFFLEERLKKVMTHERNERAKADGLVIRVVSSVEKTLEVK KNFKDAFKDQ-NFPERFPYRSKVLLLFQKTEGVDVCLLGIYVQEYGSDC-PAPNRVYLSYLDSVKYFRPALRTYCYHQILIGYLQYIKQRGFTSCFIWAC PPFQGEDYILYCHPKEQKTPKSDKLREWYLKMLRQAQKEGIVLSLQNLYDEFHLHDIASATEFDGDYFPGVAEDWIPGIQKEQAELAKTKKLGTTIHTMR HDFIMVHLAHQCPLPDVKESDEVINSEFFDTQAFLSLCQGNHFQFDSLRRAKHTTMMVLYHL >tr|K8EH95|K8EH95_9CHLO Histone acetyltransferase OX=41875 OS=Bathycoccus prasinos. GN=Bathy08g02820 PE=4 SV=1 ------------ARRRDAEADKIIANATESSCRACGVERLTFEPPPMYCYSCVGRIKRGQVYYHVPNPGGEMRKDTWCNNTIQG---HIDIEGQ-----R FPKNSLIKKKNDDDLEEPWVQCDFCNNWYHQICVLFNGRKNEAHFTCPTCILSQITKTVLGDYLEKRVLDALAEERVARAKAEDLTIRVVSQTDKKCDTK QRFMDAFKKD-GFPEEFLYKNRVILLFQKIEGVDVCLMAIYVQEYGEDC-PNPNRIYLSYLDSVKYFRPALRTFVYHQILIGYLDYAKQRGFTSCFIWAC PPFQGDDYILYCHPKVQKVPKSDKLREWYMKMLRGAAKEGIVHSVTNIYDEYNLQEVRSARDFDGDYFPGVAEEWIPGIIAEQEEQKKAKKLGVTISAMR NDFILAHLAPKCPLPAVKEEDKAMQSEFFDTQAFLSLCQGNHFQFDSLRRAKHTTMMVLYHL >tr|A4RWA9|A4RWA9_OSTLU Predicted protein OX=436017 OS=Ostreococcus lucimarinus (strain CCE9901). GN= PE=4 SV=1 ------------ARQLQKEAERAVINATESSCRACGVERLTFEPPPLYCYSCVGRIKRGQVFHQM-S-GGETRRDAWCNNAIQG---YVDVEGQ-----R FPKATLIKKKNDDDLEEPWVQCDYCEDWYHQLCVLFNGRRNEAPFTCPNCILSQLPKTKMSTFLEERLASKLSAERVERAKAENLTIRVVSQTLKQMDTK PHYYAHFKEQ-GIPAHFTYRSRVILLFQKLEATDVCLMAIYVQEYDDEC-PEPNRIYLSYLDSVKYFRPALRTYVYHNILIAYLDYVKQRGFTSCFIWAC PPFQGDDYILYCHPKVQKTPKADKLREWYLKMLRSAQKDGIVISTSNVYDEFRLHDIRCATEFDGDYFSGIAEDWIPTIMKELEEAKNSKKLGTTISNMR NDFMLAHLAHQCEIPTLTKEEEKLESEFFDTQAFLSLCQGNHFQFDSLRRAKHTTMMVLYHL >tr|C1EDZ3|C1EDZ3_MICSR Histone acetyltransferase OX=296587 OS=Micromonas sp. (strain RCC299 / NOUM17) (Picoplanktonic green alga). GN=MICPUN_98056 PE=4 SV=1 ------------ARRRDAEAEKAIAGASESACQGCGVERLTFEPPPLYCYACVGRIKRGQVYYHIPSTGGEVRKDAWCNNAIQG---HVELEGQ-----K WPKNVLAKKKNDDDLEEPWVQCDYCNSWYHQICVLFNGRRNEAPFTCPQCILSQLPQTKLSFFLEERLRKVLGRERQERAKAEGLTIRVVSSVEKKLDTK SNFMRVFKDQ-KFPESFPYRSKVLLLFQKIEGVDVCLLGIYVQEYGSEC-PAPNRVYLSYLDSVKYFRPALRTYCYHQILIGYLQYVKQRGFTSCFIWAC PPFQGEDYILYCHPKEQKTPKSDKLREWYLKMLRQAQKEGIVLSLGNLYDEFHLHDIASATEFDGDYFPGVAEDWIPGIEKEQAENAKTKKLGATIQNMR HDFIMAHLAHQCALVEKTGDDEEMQSEFFDTQAFLSLCQGNHFQFDSLRRAKHTTMMVLYHL >tr|B9SJ01|B9SJ01_RICCO Putative uncharacterized protein OX=3988 OS=Ricinus communis (Castor bean). GN=RCOM_0597450 PE=4 SV=1 ----------------EEKEEQEIHCANENTCQLCAADKLLLAPVPIYCSSCGSRIKRSVIYYNASE--ENGTRHSFCTLCYKARGASITFYGI-----T IPKAKLDKKKNDEEIEEPWVQCDKCKSWQHQICALFNDKRDESEYICPKCCLEEIRSTLLSDFIEQRLFRRLQQEKEDKAKAENLVVRVVLSVKKQLKVK KQFLEIFRDG-NYPDEFSYSSKVILLFQKIEGVDVCLFGMYVQEFGSEC-SQPNCVYISYLDSVKYFRPALRTFVYHEILIGYLEYCKKQGFAACYLWAC PPLKGEDYILYSHPAIQKTPKSDKLRQWYNSMLRKAAKENVVVNVTNLYDHLFHSKVTAARYFDGDYWSGAAETIINNIEQQNGKSSGRKKLGQTIFPVK EDFFVLHLQFVDIPSDTKDEDVILDSWLFENHTLLGFCQKNHHQFDTLRRAKHSSMMILHHL >tr|B9H3R2|B9H3R2_POPTR Histone acetyltransferase OX=3694 OS=subsp. trichocarpa). GN= PE=4 SV=1 ----------------SNEEDKIINHVNENRCQLCAEDKLWFAPVPIYCSCCGARIKRGVIYYTSSD--ENGTQPCFCSLCFKSSPPKITFYGI-----T ILKEKLHKRKNDEATDEPWVECDKCKRWQHQICALFNDKRDKAEYICPKCCLKEMKRTNLSDFIEERLFRRLNQEREERAKAEDLVLRVVLSVNKQLKVK EKFLEIFHGE-NYPAEFPYRSKVILLFQRIGGVDVCLFGLYVQEFGSEC-SQPNSVYISYLDSVKYFRPALRTFVYHEILIGYLEYCKKRGFATCYLWAC PPIKGEDYILYCHPENQKTPKSDKLRQWYHLMLRKAAKENIVVNCTNLYDHFFYSKITAARYFDGCYWYDAAEDILKNIEQKTGVYAERKHLGGPMCGRK EDFMVVHLQHVGIPSDTEDNDAILENWHFDNHTFLGLCQKNHYQFDTLRRAKHSSMMILHNL >tr|H3FSY6|H3FSY6_PRIPA Uncharacterized protein OX=54126 OS=Pristionchus pacificus. GN= PE=4 SV=1 IVKR-PMDLSTISTNLQSGKYDLINPVMEKMGYCCG-EKRCFTPLALFCYGA-------SIYETTSNGVVVSERYTYCVKGLPDE--GISLSENPNDKNM VPKNQFKQCKNNVIEVEPFENCKYCHRKFHKICVLHDKKVYPEGFVCNRCRNDKLAPCRLSNHLEERVNGFIKTRLKG-GEQKEVIIRVLSVVEKEVEVK PMMKSKYDAPGSFPQKFPYKTKAVFAFEVIDGVEVCFFGLHVQEYGSKS-GSPNRVYIAYLDSVHYFEKHLRTDVYHEILLGYLDYARKLGYTMAHIWAC PPSEGDDYIFHCHPTEQKIPKPKRLQDWYKKMLDKGVQEQCVVEYKDIYKQARDDGLTSPADFEGDFWPNVIEDCIREASSEEAQRKKTDKLYAQLEKHK EVFFTIRLVTQQSALDIEDPDGLMSSELMDGDTFLTRAREEHWEFSSLRRAKYSTLCLAHAL >tr|B3S9N8|B3S9N8_TRIAD Putative uncharacterized protein OX=10228 OS=Trichoplax adhaerens (Trichoplax reptans). GN=TRIADDRAFT_60973 PE=4 SV=1 IIKH-PMDLSTISRRLHQGMEKEITPVMRGFHYCCG-KKYVFNPQVLCCGKQLCTIPRDKVYWTYQD------RYHFCEKEIEGE--NVRIGDDPILDSA IPKKYFKKKTNDHLEPEPWTACIDCGRKFHSICVLHYDQLSSNGYICSHCLKTSGSRSNLSDHIEKRVNNFLKGK-----GAGNVTVRMVSNVEKTVEVK QGMRTKF--QDQFPESFPYRAKALFVFEEIDGTDVCFFGMHVQEYGTDC-SPPN-----------------------------------KSYTMAHIWAC PPSEGDDYIFHCHPPEQKIPKPKRLQEWYKRMLDKAKKESVVVDYKDIHRMSQEENITSANYFEGDFWPNVLEENIKEIDQEEQERKASQRLFATMEKHK EVFFVIRLYSVEEAKDIKDPDLVMSCDLMDGDAFLTLAREKHFEFSSLRRAKYSTIAMLVEL >tr|E5S9Y5|E5S9Y5_TRISP Putative bromodomain protein OX=6334 OS=Trichinella spiralis (Trichina worm). GN= PE=4 SV=1 IVKK-PMDLLTIKENLLGGEVEEIDPVMRMLGYCCG-RKLSFTPLALVCYQPMCAIPRDAHYYCYEP-------KFVVENGI------------------ -----------------RFVHCT---------MIRFIPA--------------GLPHCKLSMHVENRVNKFLRKSN---SCAGEVIIRVLASSDKEVEVK PLMKTKFCTIGEIPEKFPYRTKAIFAYEVVDGVEICFFGLHVQEYGSNC-PQPNRVYIAYLDSVHFFQPHIRTAVYHEILLGYLEYVKVLGYTMAHIWAC PPSEGDDYIFHCHPAEQRIPKPKRLQEWYKKMLDKGIVERIVVDYKDIYKHAQDEHLQSATEFEGDYWPNVLEDCIKELDAEEAERRREDKIYTMMEKHK EVFFVVRLHSAQAAAPVNDPDALITCELMDGDSFLSIARERHWEFSSLRRAKFSTMALCYEL >tr|E9CA03|E9CA03_CAPO3 E1A binding protein p300 OX=595528 OS=Capsaspora owczarzaki (strain ATCC 30864). GN=CAOG_04903 PE=4 SV=1 VIKR-PMDFSTIRTRLDRPTENRVEPALRKLGYCCALRLLQFSPPSLYCGKCVSTIPRDTQYHTYEN------RIHYCARCAPDP--IVGIDGEE----S MAKAQFSTATNNKLDDEPFNDCDECGRRAHVICALHNKHTDT-RFLCPLCKKKSGKKCELSTHLEARISTHLRSV----PDCGEVTIRVILVRDHQVNFK EQMKKKSVEE-GGPTGLPYRHKAMFAFQKQDGVEVCFFGMHVQEYGSDC-PEPNRVYVSYLDSVFWFRPQHRTFVYHEILIGYLEYCRNLGYRYAHIWAC PPTPGDDYIFYCHPEDQKVPKPKRLQEWYRQMLTRARDDKVILDFCDIYSQCMTDDILTVRQFEGDYWPGVIEDSLTEAAKERKGKKVCEKLYQMMEKAK DGFFVATLRPPTDPVDIKDPDPLVACALMEGEAFLDLARENHYEFSTLRRAKYASLMFLYHL >tr|H3AHW2|H3AHW2_LATCH Uncharacterized protein OX=7897 OS=Latimeria chalumnae (West Indian ocean coelacanth). GN= PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------------------------------------------------------------------------HPESGEVTIRVVHASDKTVEVK PGMKARFVDSGEMAESFPYRTKALFAFEDIDGVDVCFFGMHVQEYNSDC-PPPNRVYISYLDSVHFFRKCLRTGVYHEILIGYLEYVKRLGFTTGHIWAC PPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLTSAKEFEGDFWPNVLEESIKELEQEEEERKREEKLYATMEKHK ET------HLGCLRPPACTHTLILQCVLS-----VPLPTDKHLEFSSLRRAKWSTMCMLVEL >tr|F2TVQ2|F2TVQ2_SALS5 CBP-A protein OX=946362 OS=50818 / BSB-021)). GN=PTSG_00169 PE=4 SV=1 IIKQ-PMDLGTIGKKLKAGDNAEIEPIMKRMGFCCG-QELVFNPPTLYCGKNLCTIPRESKYMSYRTSTLIVNYCPKCWKKLPTG--PVKITDENGQEVP VERSQFEEKVNNELEYEKLIECHVCKRRQHQICELYHEHFNQ-PFVCSHCREALLPTTHLSKYLEDRVRGALRDD----PVAKDVLVRVVSVRSKSFEAK EGISQYYKRDSPFPESFPYKSKALFVFQRIDGRDLCFFGLHVQEYGEDC-PEPNRVYISYLDSVNLFEPSYRTTVYHELLIGYLYYVAKLGFLHAHIWAC PPSPGDDYMFHCHPPNQRVPTSKRLQAWYRDMLIKAKERGVIQDWCDLQQYVDSERLSEARQFDGDFWPNQLEQIVQEINKDEEEEAAKDRAYLDVQRHK DVFFAIKLVANTKNPMPHDPNPEIKCELMDGDSFLTLCRENHYEFSSKRRAKFSTLMMLYHI >tr|K3X9X7|K3X9X7_PYTUL Uncharacterized protein OX=65071 OS=Pythium ultimum. GN= PE=4 SV=1 IIRR-PMDLGTIKKKLDLGIYKNIVREKESSCRLCGVERLVFEPAVLYCGECNSRIRRNCYYYTSSD-----NKYHCCHPCYGGD--SVKHS----EGRT YKKAELSRKKNDEVHEEPWVQCDKCNRWVHQICALFNGKIDASEFLCPECLLDDLMRTKLSDFLERRIYKALQDEREDEAKVESLTVRLVSNIEKQLMVR DKMFQRYKDSHKYTYEHRFKSKCICMFQEIHGVSVLLFGMYVHEFDEQE-ADSRRVYISYLDSVNYFEPHLRTKVYHELLVAYLDFVKQRGFHTAHLWAC PPLKGDDYILYCHPESQKTPKSDRLRQWYVDMLVKAQDEGVIWQITNMYDDYWRNDNSACPYFEGDYWVGLAEDLIEKIESEKPKKTKKAKLGEVIEPMK DDFLVVKLYPCCIPLKCTDPDEVNDSEFFDTQAFLSLCQGNHYQFDELRRAKHSSMMALYHL >tr|E3X6V0|E3X6V0_ANODA Uncharacterized protein OX=43151 OS=Anopheles darlingi (Mosquito). GN=AND_15406 PE=4 SV=1 IVRQ-PMDLSTIRKKLESGAYQDPREYVDD-------VWLMF---------------DNAWLYNRKT----SRVYRYCT--------------------- --KDQFKEMKNDHLELEPFVDCLDCGRKQHQICVLYLETIWPGGFVCDACLKKKAPTSKLGTYIETRVNNFLKKK---EAGAGEVHIRVVSSSDKLVEVK PGMRNRFVESGEMLPEFPYRAKALFAFEEVDGIDVCFFGMHVQEYGSEC-AAPNRVYIAYLDSVHFFRRQYRTAVYHEILLGYMDYAKQLGYTMAHIWAC PPSEGDDYIFHCHPPEQRIPKPKRLQEWYKKMLDKGMVERTIQDYKDILKQAMEDKLQSASEFEGDFWPNVLEESIKELDQEEEEKRKSAKIFATMEKHK EVFFVIRLHSAQS------------------------------------------------- >tr|K4BCD0|K4BCD0_SOLLC Uncharacterized protein OX=4081 OS=Solanum lycopersicum (Tomato) (Lycopersicon esculentum). GN= PE=4 SV=1 --------------------NMTVLPISENVCQLCGTDRLVFVPTPVYCSSCCKCIKRNLVYYWAVD--EAGGRHCFCTKCFCGD--DVS-----SQGLS INKNKFQKAKNNDQNEESWVQCDKCEGWQHQVCALYNAKKDQAKYICPFCCLKEIPRTMLSDHIEQRLFRRLKLERNERAKAADLIVRVVLSVNRNLKVK QQFLDLCHNE-GYPPEFQYKSKVILLFQKIGGVDICLFGMYVQEFGSEC-APPNRVYISYLDSVKYFKPALRTFVYHEILIGYMDYCRKRGFTTCYLWAC PPIKGEDYILYCHPESQKTPKPEKLRSWYWSMLRKASEEDIVVNYTNLYDHFFVARISAAPYFDGDYWSGAAEDIVRNIEKESRGDSQNKVLGQTILPVK EDFIIVNLHVVDIPASTEDQDAIIENDFFENHSFLSFCEKNHYQFDSLRRAKHSSMMILYHL >tr|F2DVT8|F2DVT8_HORVD Predicted protein OX=112509 OS=Hordeum vulgare var. distichum (Two-rowed barley). GN= PE=2 SV=1 ---------------------------DQNTCNLCGMERILFEPPPRFCALCFKIINSTGSYYVHVE--NGIDKASICAKCHHL---------------S TAKVKYQK-RLNYAETDAWVECDKCKAWQHQICALFNPKVAGVEYTCAKCLLKEKDRTKLSDHIEQRLSVRLEHERLQRARAEGLTVRVVSSAARVLQVQ PLFREFFKEG-KYPGEFPYKSKAILLFQKNEGVDVCLFAMYVQEYGSDS-PLPNRVYLAYIDSVKYFRPALRTFVYHEILIGYLDYCKKQGFVSCSIWAC PSTKRDDYVLYCHPTSQKMPKSDKLRSWYQNLIKKAVKDGVVVERNTLYDFFLQPSISAAPYCENDFWPGEAERLLEKKDDNTSQKKEPQKLGEKMRTMK EDFIMLSLQRFEPLPETDDGDPTMESKYFDSRDFLKHCQDNQYQFDTLRRAKHSTMMILYHL >tr|K4BWE5|K4BWE5_SOLLC Uncharacterized protein OX=4081 OS=Solanum lycopersicum (Tomato) (Lycopersicon esculentum). GN= PE=4 SV=1 ----------------------------NNVCQLCSMDSLDFVPTPIHCSSCYKCIKRNLIYYWAVD--ESDRRHCYCNNCFRK---------------C SGKNKFQKAKNNYRNEEPWVQCDKCECWQHQVCGLYNANEDLAKYICPFCRLKEIERTMLSDHIEQRLFRRLELDRNERAKYADLTVRVVLSVNRNLKLN QQFLDIFQND-EYPPEFQYKSKLILLFQKIGGVDVCLFGMYVQEFGSEC-ASPNRVYISYLDSINYFTPTLRTFVYHEILIGYMDYCKKRGFTACYLWAC PSLKGEDYIFYCHPKSQKTPKPEKLRLWYKSMLRKASEEGIVVNHTNLYDQFFGP--SAAPYFNGDYWSGAAEEIIREKENRADKAKKLMKLGQTILPVK KEFIVVNLRFVDDIPSTEDQDAIIENDIFENRSVLSFCEKNHYQFDSLRRAKHSSMMILYHL >tr|G7ZZ62|G7ZZ62_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_082s0046 PE=4 SV=1 --------ISSLRKESDQIISEDQAGIDANTCNLCKRERLYFAKVPLFCLCCGARIKK--IYFCKKEEEFV-AQGCVCYNSVKGE--NVAFNGT-----S ISKKNLAKRNNDEVIEEPWVECNKCERWQHQICALYNKKADSAEYICPLCRLKESTRTVLSDHLEKRLFERLMQERKNWEKAESLSIREVLSVDKQLKVN KQFLDIIPEE-NYPTEFSYRSRVILLFQKIEGVDVCIFGMYVQEFGSECGGNPNCVYISYLDSVKYFRPALRTFVYHEILIGYLDFCKERGFSTCYIWAC PPKKGDDYILYCHPKEQKTPKNDKLRHWYLSMLKKANKENIVVGLTNVYDHFFNSKVTASPYFDGDWWCSNAVVVAKTLEKENRADYEKLLDILVMQKTK ENFLVAHLRYSEIPSDTKCNDIVLESELFENDNFLIFCQKSQFQFDTLRRAKYSSMMILYHL >tr|K3Y149|K3Y149_SETIT Uncharacterized protein OX=4555 OS=Setaria italica (Foxtail millet) (Panicum italicum). GN= PE=4 SV=1 ---------------------------NVNSCQLCKVEKLFFEPPPKYCSPCGARIKRNAPYYSDTV--TESGPYYFCISESRSD--SILVDNI-----Q LLKSKLVKNRNDDELEEAWVACDKCKRWQHQICALFNAKRNDAEYICHSCYIQEIPRTVLSDHIEERLLQRLKEERQNRAGAEGLVVRVVSSVDKKLKVK PHFLEIFRED-NYPAEFPYKSKAILLFQRIEGVEVCLFGIYVQEFGAEC-AFPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLQYCKQRGFTSCYIWAC PPFKGEDYIMYCHPEIQKTPKSDKLREWYLSMLRKATNEGIVVELTNLYEHFFNAKVTAARYFDGDYWPGAAEDIINQIRLPEDDRNLKGKLKKLMQKMK EDLIMVHLHPVGVPKDTKDRDGILESEFFDTQAFLSLCQGNHYQYDTLRGAKHSSMMVLYHL >tr|C5Z9A0|C5Z9A0_SORBI Putative uncharacterized protein Sb10g029285 OX=4558 OS=Sorghum bicolor (Sorghum) (Sorghum vulgare). GN= PE=4 SV=1 ---------------------------NVNSCQLCKVEKLFFEPPPKFCSPCGARIKKNAPYYSGTI--TESGPYYFCVNESRSD--SVLVDSI-----Q FLKSKLEKKRNNDEFEEAWAQCDRCERWQHQICALFNAKRNAAEYICHSCCIEEIPRTVLSDHIEEHLCQRLKEERQNRAGAEGLVVRVVSSVDKNLVVK PKFLELFQEE-NYPIEFPYKSKAILLFQRIDGVEVCLFGIYVQEYGAEC-ALPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLQHCKQRGFTSCYIWAC PPLKGDDYIMYCHPEIQKTPKSEKLREWYLSMIQKATDAGIVVELTNLYEHFFNAKVTAARYFDGDYWPGAAEDIINQIFLPEDGKNLKGKVKKLMQKMK EDLIMVHLHPVGVPKDTKDRDGILESEFFYTQAFLSLCQGNNYQYDTLRAAKHSSMMLLYHL >tr|H3FSY7|H3FSY7_PRIPA Uncharacterized protein OX=54126 OS=Pristionchus pacificus. GN= PE=4 SV=1 IIKH-PMDLSTIYAKLQSCFVELINPVMENMGYCCG-EDRYFTQLPLFCYG--ATI-----YETTSYGVVVSDRYTYCVKVLPEK--GISVSENPNDKNM VAKNLFKQYKNNV-YPEG--------------------------FVCNRCREKQLAKCQLSDHLEDRVNRFIRRQLKE--EGKEVIIRVLSAVTKEVEVK PMMKSKYDAPDAFPHKFPYTTKAIFAFEIIDGVEVCFFGLHVQEYGSKS-GSPNRVYIAYLDSVHYFEKHLRTDVYHEILLGYLDYARKLGYTMAHIWAC PPSEGDDYIFHCHPTEQKIPKPKRLQDWYKKMLDKGVQEQCVVEYKDIYKQARDDGLTSPADFEGDFWPNVIEDCIREASSEEAQRKKEDKLYAQLEKHK EVFFTIRLVTQQSALDIEDPDGLMSSELMDGDTFLTRAREEHWEFSSLRRAKYSTLCLAHAL >tr|G7J5R6|G7J5R6_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_3g101100 PE=4 SV=1 ---------NSLRKESVLITSEEEAGTEANTCQLCERKKLYFAPVPIVCSCCGIRVKR--IYFCRKE--ELDVQGCICYKTSKGG--KITFNGT-----S ISKKNLEKRTNDEVLEEPWVECNKCKRWQHQICALYNNRRDLAEYICPICRSKEIERTVLSDHLEKRLIECLIQERAKWEAAKGLSIREVLSVDKQLKVN KQFLDIIPEE-NYPAEFSYRSRVILLFQQIEGADICIFGMYVQEFGSEC-GNPNCVYISYLDSVKYFMPALRTFVYHEILIGYLDFCKKRGFLTCYIWAC APSKGDDYILYCHPKEQKTPKNDKLRRWYLSMLKKATEENIVVGLTNVYDHFFDSKVTASPYFDGDCWSGNAMEAAKTIEKECGGDYEKMLKLKVMRKTR ESFLIAHLQSSDIPSNTKHNDIVLESRLFENDNFLSFCQKSQFQFDTLRRAKYSSMMVLYHL >tr|G7JBQ0|G7JBQ0_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_3g107960 PE=4 SV=1 ---------SSLRKESVQITSKEEAGIDANTCQLCQRKKLYFAPVPIFCSCCGVRIRR--TYFCRKE--EFDAQGCICYKTSKGG--KITFNGA-----F VSKTNLEKKNNDEVFEEPWVECNKCKRWQHQICALYNNKRDLAEYICPVCRLKEIERTMLSDHLEKRLFERLVEERANWEAAESLSIREVLSVDKQLKVN KLFLDIIPEE-NYPAEFSYRSRVILLFQQIEGVDICIFGMYAQEFGSEC-GNPNCVHISYLDSVKYFRPALRTFVYHEILIGYLDFCKKRGFSTCYIWAC APSKGDDYILYCHPEEQKTPKNDKLRRWYLSMIKKATEENIIVGLTNVYDHFFNSKVTTSPYFDGDYWCGYVMEAARTIEKESGGDYEKMLKLKVMQKTR ENFIIAHLQSSDIPSNTMHNDIILESGLFENNSLLSFCQKYQFQFDILRRAKYSSMMILYHL >tr|J3MHF0|J3MHF0_ORYBR Uncharacterized protein OX=4533 OS=Oryza brachyantha. GN= PE=4 SV=1 ---------------------------SANLCQLCKVEKLNFEPPPMYCSPCCARIKRNASYYTGST---AMGRLYFCYNASLGK--TIEVELI-----K LSKADLEKRRNNVETEEGWVQCDKCECWQHQICALFNARRNQAEYTCSKCYIEELPRTLLSDHIEERLLKRLREERQKRAGADGLVVRVVSSVDKKLEVK PRFFKLLQED-NYPAEFPYKSKAILLFQKIEGVEVCLFGMYVQEYGAEC-KFPNRVYISYLDSVKYLRPALRTYVYHEILIGYLEFCKQRGFTSCYIWAC PPTKGEDYIFYCHPEIQKTPKSDKLREWYLCMLQKAIKENIVVELTNLYDHFFKTKVAAAPYFDGDYWPGAAEEIINQLLLEDNGMLQKKGEAILMQKLK DDLIMVHLHPVEVPEDTKDRDAILENVFFDTQAFLSFCQGKNYQYDTLRRAKHSTMMILYHL >tr|K0SP97|K0SP97_THAOC Uncharacterized protein OX=159749 OS=Thalassiosira oceanica (Marine diatom). GN=THAOC_19472 PE=4 SV=1 IIKK-PMDLGTIGKKLEQGSYHSENSKKAHACGLCGCEKLNFEPPIYFCSGQTKRIRRNTHYYITAD-----KQYSYCNGEIKGD--HIDLGTT-----K IKKSDLAKRKNDEIHEESWVQCDDCERWIHQICGLYNTRHNTSAYSCPKCVLQKRPRTKLSEWLEGHVHKKVEERYKELSAGGPVTIRQVTSTDRKHEVR ELMKERYADK-NYPDEFLYRGKCIVVFQNIDGVDVVLFALYVYEHGDDN-PLPNTVYVSYLDSVHFMKRKMRTFIYHEILISYLDYARAKGFQQAFIWAC PPLKGDDYIFYAKPEDQRIPKEHRLRQWYVDMLEECQRRDIVGKVTNMYDEYLNDPNLAVPYFDGDYFPGEAENIIKQINDDVKKSGKKEKFCECISGMK DSFMVAFLNCKGAIQILDDDADEIDSEFFNTQDFLDLCRGNHYQFDDLRRAKHTSMMVLWHL >tr|B8LBX3|B8LBX3_THAPS Predicted protein OX=35128 OS=Thalassiosira pseudonana (Marine diatom) (Cyclotella nana). GN=THAPSDRAFT_24331 PE=4 SV=1 VIKK-PMDLGTISRRLDNGSYHAENSMRQQACGLCGCEKLNFEPPVFFCNGPSKRIRRNTHFYITAD-----KQYAWCSNELGGE---IDLGTS-----V LKKVDLAKKKNDETHEESWVQCDDCERWIHQICGLYNTRQNKSAYSCPLCLLDKRPRTNLSDWLERDVHKKVNQRYADLSAGGPLTIRQVTSTDRKLEVR DQMRQRYAHK-NYPEEFPYRCKCIVVFQNIDGVDVVLFALYVYEHGDDN-PFPNTVYVSYLDSVHFMKRQMRTFLYHEILISYLDYARQKGFLQAFIWAC PPLKGDDYIFYAKPEDQKTPKDVRLRQWYLDMLVECQKRNIVGMVSNMYDQYFANKSLSVPYFDGDYFPGEAENIIKDLEESNSKAGKKQKFCDAIQGMK ESFIVAYLNAKDAVRVLNDDDEEIDCEFFNTQCFLDLCRGNHYQFDELRRAKHTSMMVLWHL >tr|G4YXF0|G4YXF0_PHYSP Putative uncharacterized protein OX=1094619 OS=(Phytophthora megasperma f. sp. glycines). GN=PHYSODRAFT_483048 PE=4 SV=1 IIRK-PMDLGTVKKKLEAGIDEKAARAKESSCRLCGIERMVFEPAVLYCGECNSRIRRNCYYYTSAD-----NKYHCCHQGLPDS---IKHNE--GR--Q YKKNELARKKNDEVHEEPWVQCDKCERWVHQICALFNGKIDSSEFLCPECLLEHLQRTKLSDFLERRIQQSLKAEREDEAKAEGLTVRLVSNIEKQLMVR DKMFQRYKDSHKYTSEHRFKSKCICMFQELDGVSVLLFGMYVHEFDEQE-ADSRRVYISYLDSVNYFKPHLRTKVYHELLIAYFDFVKQRGFHTAHLWAC PPLKGDDYILYCHPESQKTPKSDRLRAWYVDMLVKAEEEGVVWQITNMYDDYWRNDNTPCPYCEGDYWVGLAEDLIEKLDAEKPKKIVSHKLGEVIEPMK DDFLVVKLVPCCRPERCTDPDEINDSEFFDTQAFLSLCQGNHYQFDELRRAKHSSMMALYHL >tr|K7MWI3|K7MWI3_SOYBN Uncharacterized protein OX=3847 OS=Glycine max (Soybean) (Glycine hispida). GN= PE=4 SV=1 --------ITSLRKQFNQSTMVEESGSDVYTCQLCGMGTLSFAPVPIYCFCCGIRIKRNACYYYRRE--EDDTQHCFCSRTSRGG--NIKFNGT-----S VSKTDLDKKTNNREFEESWVECNKCKCWQHQICALYNDKRDRAEYTCPICRLKEIPRTMLSDHIESRLFKRLWQEDEDWAKAESLSVRVVLSVDKQLKVK KQFLDIFGEE-NYPSEFPYTLKVILLFQKIEGVDVCLFAMYAQEFGSEC-GYPNSVYISYLDSVKYFRPALRTIVYHEILIGYLDFCKKRGFTTCYLWAC PPMKGEDYLLYCHPDTQKTPKKDKLRQWYHSMLRKAAEENIVVGLTNLHDHFCDSKVTAARYFDGDFWSGAAMDKARHIEQECGGDYKMIKLGQTILPFK EDFLVVQFQYDDVLGDTKENDIILDNGLFDSHNFLSFCQRNRFQFDSLRRAKYSSMMILYL- >tr|G7KH97|G7KH97_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_5g085310 PE=4 SV=1 --------VSSLRKESVQITSKAEARIDANTCQLCEKERLYFAPVPLFCLHCGNRIKR--TYFCTKE-DDFDAHGCICYKTSKGG--KIAFNGT-----S ISKKNLEKRNNDEVLEEPWVECTKCERWQHQICALYNKKADLAEYICLLCRLKEITRTVLSDHLEKRLFERLLQERENWEKAENLSIREVLSVDKQLKVN KQFLDIIPEE-NYPTEFSYRSRVILLFQKIEGADVCIFGMYVQEFGSEC-SNPNCVYISYLDSVKYFRPALRTVVYHEILIGYLDFCKKNGFSTCYIWSC APSKGDDYILYCHPEEQKTPKNDKLRRWYLSMLKKASEENIVVGLTNVYDRFWKSKVTASPYFDGDCWCGNAMVVANTLEKESRVNYEKLKDILVMQKTK ENFLVAHLRSSDIPSDTKRNDIVLESELFENDNFLIFCQKSQFQFDTLRRAKYSSMMILYHL >tr|D7KN57|D7KN57_ARALL Histone acetyltransferase HAC4 OX=81972 OS=Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). GN=ARALYDRAFT_337685 PE=4 SV=1 --------------------------MSVNSCQLCAVEWLVFEPVPLYCSPCGIRIKKNALHYSIAA---GESRHYVCYNEAREN--LVFLDGT-----S IPKTRLEKKKNDEQVPEGWVQCDKCEAWQHQICALFNSRRNTTKYTCPNCYIQEVPVTALSNHLEERLFKKLREERQERPGAESLTVRVVASVDKVLEVK QRFLELFREK-NYPTEFPYKSKAILLFQKIENVEVCLFGMFVQEFGTDS-GPPNRVYLSYLDSVKYFRPALRTFVYHEILIGYLDYCKKRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKTDKLREWYLAMLRKASKEDVVVECTNLYNHFCRANVTAARYFDGDYWPSAAEDLLRQMNQEDDGETKLQKLGETICPMK EDFIMVHLQHCDVPVKIEDTDDNLESEFFDNQAFLNLCQGNNYQYDTLRWAKHSSMMILYHL >tr|I1FGK9|I1FGK9_AMPQE Uncharacterized protein OX=400682 OS=Amphimedon queenslandica (Sponge). GN= PE=4 SV=1 VIKN-PIDLSVIRMKLESGSYETIDEAMVSLGYCCG-QKYFFHARVLYCDGKSCPIPRDSVYYNYKD------IYAYCTKETPGD--SVTLSLDDTSNIS LPKGVFKQIKNDTVEIEPMVECIHCRRSFHKICVLHHDEIWPEGYQCSNCLQSLLPTTKLGLFLEERVNSFLKAAN---VPEAEVTIRVLSSSNKLLDTG SGMMDRF---SYFPSQFPYRTKALFAFEEIDGAEVCFFGMHVQEYGSDC-PAPNTVYISYLDALPFFKPEYQTFVNHEIVLGYLEFCKRNGFQTAHLWAS LPAEDYDYIFYCRPMDQDTTKPKRLVEWYRNMLEKGKAQSVLCDFQDIFQYCVKEHITDIPYFEGDFWPNVIEETIKELDQEGMAKRLTQKIYQTMGKHK EVFFVAHLQPPKKLPTTSDPDSLVTCKLMDGGAFLNLARENHWEFSSLRRAKFSTMSMLYEL >tr|A9V411|A9V411_MONBE Predicted protein OX=81824 OS=Monosiga brevicollis (Choanoflagellate). GN=9670 PE=4 SV=1 IIKK-PMDLSTIEDKLEKGTEDTIDLVMKRLRFCCG-KELTFNPQVLYCSKQICPIRRDEEYMKYDA--NLNSAFNCCMAEGEKSTFTISAELTPDQKIE VSRTHFTKAINNHIDYEEFVTCAGCGRRHHQICVMYHASIHK-EFICQACSKSNSPRTKLSDYLEKRVNDCLHDETVSSPEARRVTVRVHSSRERGVVVK DNIRRYYADEDPLPAEYPYRTKAIFVYQESEGHEVCFFGLHVQEYGDDC-PGPNRVYISYLDSVFFFEPSLRTKVYHQLLIGYLQYMAALGYMYAHIWAC PPAAGDDYIFHCHPTEQKTPTPKRLSQWYAKMLKAAKEQNAIHDFYDMYQYVCSRDIVRGPYFEGDYWVNRLESLVQEIDQADGNDKQAARAWPEIEKNK SVFFVVQLHADPVNLPKADPDPDITCDFMDGEPFLQHCRENNHEFSTRRRALYSSMMMLYKL >tr|D8SJY6|D8SJY6_SELML Putative uncharacterized protein OX=88036 OS=Selaginella moellendorffii (Spikemoss). GN=SELMODRAFT_422911 PE=4 SV=1 -----------------------ELQANEHACRLCAVEKLFFDPPPIYCTSCGIRVKRNALYYTTAV---RETRHYFCYNDVRTE--NVELEGL-----T YAKSCLEKRKHDEEQEEAWVCCDKCNLWQHQTCALFNSRRNEAEYTCPECCMAELPRTWLSDHLESRLTLKLRQERSERAGAEGLVVRVVSSVDKRLEVK PRFLEIFQEE-DYSTEFPYKSKAVLLFQKIEGVEVCLFGMYVQEFGVES-AQPNRVYLSYLDSVKYFRPALRTFVYHELLIAYLEYCKRRGFTSCYIWAC PPLKGEDYILYCHPEIQKTPKSDKLREWYLTMLRKASRENIVVEITNFYDFFFKAKVTAARYFDGDYWPGAAEDMILQLQQQDDDDGSKQKLGESIMQMK EDFIMVHLQHTDVPPDTKDNDDLMESEFFDTHAFLTLCQGNHYQYDTLRRAKHSSMMVLYHL >tr|F0WT92|F0WT92_9STRA Histone acetyltransferase putative OX=890382 OS=Albugo laibachii Nc14. GN= PE=4 SV=1 --------------------------LKEAACRLCGVERMLFEPSVLYCGECNARIRRNCYYYASVD-----NKYHCCHPCYGNL--GDAVKSTEGQ--T YNKSSLCRKKNDEVHEEPWVQCDKCNRWVHQVCALFNGKIDAGEFLCPECLLEHLMRTKLSDYLERWIAKVLQEEREDEATAENLTIRQVSNIDKQLMVR DKMFFRYKESHNYSSDFRFKSKCICMFQEIHGVSVLLFGMYVHEFDEQE-APANRVYISYLDSVSYFEPHLRTKVYHELLIAYLDFVKKRGFYAAHLWAC PPLKGDDYILYCHPETQKTPKSERLRQWYVDMLVKAQEKGIVWHITNMYDDYWRNNNPACPYFEGDYWVGLAEDLIEKIESEQS---------------- -------------------------------------------------------------- >tr|D0NXJ5|D0NXJ5_PHYIT Histone acetyltransferase, putative OX=403677 OS=Phytophthora infestans (strain T30-4) (Potato late blight fungus). GN=PITG_18027 PE=4 SV=1 -------------------DEHSCPFCLDNVCGMCNEKCINFEPPFVMCGACRQRIKRHAVYYKTPD-----GQYHWCSKCFTSLPKELTIKVLPSTDNT LSKFALLKAKFMDELTEPWVQCDQCSGWVHQICALFNACENADMYTCPLCRLEELQSCGVSRFMQKWVQQHLENLGEHEA-AQSIVVKVVSSIKSSCHVS SVVREFRSASQEFPQTIDYTSKVIFVFQMINGVEVCIFSMYVQEYDKYCQVPVNRTYIAYLDSLVYMRRHVRTSLYHQILISYLASCKAKGYEYAHIWAC PTTRGGDFIYWCHPSFQKNPGKERLLQWYLIMAKKAKELGVVFACDDLYKHGFELTTQLPPYFDGDYWPSEAERVAASPPKRGRKEANTAKVSESVKSAC ESLFVIALQPTCAAANSKTEELEMSCPFLDCPNMLKNCEEHHYQFDSFRRAKYSTMMLVYQI >tr|H3GFF9|H3GFF9_PHYRM Uncharacterized protein OX=164328 OS=Phytophthora ramorum (Sudden oak death agent). GN= PE=4 SV=1 -------------------DEHSCPFCLDNVCGICNEKCINFEPPFVMCGACRQRIKRHAVYYKTRD-----GQYHWCSKCYASLPKTLTLKAAPCTEYT ISKLGLLKAKFLDELTEPWVQCDQCNGWVHQICALFNACENADLYTCPLCRVKELQSCDLSRFMQKWILQHLAKLGEHDA-AQSIVVKVVSSIKISCHVS SVVREFRSESQEYPQAVDYTSKVIFVFQMINGVEVCIFSMYVQEYDKYCQLPANRTYIAYLDSLVYMRRHVRTSLFHQILISYLASCKANGYEYAHIWAC PTTRGGDFIYWCHPSFQKNPGKERLLQWYSSMAKKAKELGVVFACDDLYAREFEQASQLPPYFDGDYWPSEAERVAASPPKRGRKEANTARVSESVKSAR ESQFVIALQPMCAAATNRVKEVEMSCPFLDYTNMLKNCEEHHYQFDSFRRAKYSTMMLVYKI >tr|G7ZZ59|G7ZZ59_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_082s0043 PE=4 SV=1 -----------------KQSVQSTTEVDNDTCNLCGMNELPFSPVQIFCSSCGKCINRNVNYFGKKG-EEFDPVCCFCSKMSKGG--HITFNGT-----S VSKTLLEKKTNDEVINEPWVECSKCNKWQHQICALYNKDLDCSVYTCPLCLLKEIEKTVLSDHLEKRLFERLMQEREERQKTESLTVREVISVDKQLTVK KQFQDITPEE-NYPAEFSYRSRVILLFQKIEGADVCIFAMYVQEYGSEC-GNTNCVYISYLDSVNHFTEALRTFVYHEILIGYLDFCKKRGFTTCYIHAC APKKGDDYILNCHPKTQKTPKDDKLRNWYISMLTKATKENVVVGLTNMYDHFFYSKVTTARYFDGDCWSGAAMDQAVIIEKECGSDYGNAKDILVMQKTK QNFIVAHLQTADIPFNTKENDIILENALFENSNFLSFCQKNHFQFDTLRHAKYSSMMILYHL >tr|I1NLU2|I1NLU2_ORYGL Uncharacterized protein OX=4538 OS=Oryza glaberrima (African rice). GN= PE=4 SV=1 ---------------------------DQNTCNLCGMERLLFEPPPRFCALCFKIINSTGSYYVEVENGN--DKSSICGRCHH-------L--------S SAKAKYQKRFSTDAEAEWWVQCDKCKAWQHQICALFNPKIVEAEYTCAKCFLKEKPRTRLSDHIEQRLSERLVQERQQRAIAEGLTVRVVSSADRTLQVQ PRFKDFFKKE-QYPGEFPYKSKAILLFQKNEGVDVCLFAMYVQEYGSAC-PSPNHVYLAYIDSVKYFRPALRTFVYHEILIGYLDFCKKRGFVSCSIWTC PSTKRDDYVLYCHPTIQKMPKSDKLRSWYQNLVKKAVKEGVVVERNTLYDFFCKTNISAAPYCDNDFWPGEAERLLEKKDDDTSQKKETQDLGERLRTMK EDFLMLCLQTPEPLPETDDVDPTMESKYFDSRDFLKHCQDNQYQFDTLRRAKHSTMMILYHL >tr|K3ZQ44|K3ZQ44_SETIT Uncharacterized protein OX=4555 OS=Setaria italica (Foxtail millet) (Panicum italicum). GN= PE=4 SV=1 ---------------------------DQNTCSLCGMEKLLFEPPPRFCALCFKIINSTGCYYAEVENGK--DKSSICSKCHH-------L--------S SSRAKYVKRFDTDAETEWWVQCDKCKAWQHQICALFNKKCEKAEYTCANCFLKEKPRTKLSDHIEQRLSERLEQDRQQRASTEGLTVRVVSSADRVLQVQ PRFHEFFKQE-KYPGEFPYKSKAILLFQKIEGVDVCLFAMYVQEYGSDC-PSPNHVYLAYIDSVKYFRPALRTFVYHEILIGYLDYCKKRGFVSCSIWAC PSTKRDDYVLYCHPTVQKMPKSDKLRSWYQNLIKKAVKEGVVVERNTLYDFFCKANISAAPYCENDFWPGEAERLLEKKDDKTSQKQETQDLGERMRTMK EDFMMLCLQTAESLPETDDGDPIMESKYFDSRDFLKHCQDNQFQFDTLRRAKHSTMMILYYL >tr|I1GT16|I1GT16_BRADI Uncharacterized protein OX=15368 OS=Brachypodium distachyon (Purple false brome) (Trachynia distachya). GN= PE=4 SV=1 ---------------------------DQNTCSLCGLERLIFEPPPRFCALCFKIINSTGSYYVEVENGN--DKTSICCKCHH-------L--------S SAKAKYQKRFSTDAEPEWWVECDKCKAWQHQICALFNPKVLEAEYSCAKCLLKEKPKTKMSDHIEQRLSQRLVQERLLRARAEGLTVRVVSSAARVLQVL PRFRDFFKQG-NYPGEFPYKSKAILLFQKNEGVDVCLFAMYVQEYGSAS-PLPNHVYLAYIDSVKYFRPALRTFVYHEILIGYLDYCKKQGFVSCSIWAC PSTKRDDYVLYCHPTSQKMPKSDKLRSWYQNLVKKAVKEGVVVERNTLYDFFRKADISAAPYCENDFWPGEAERLLEKKEDNTSQKEETQDLGEKMRTMK EDFIMLCLQMPEPLPETDDGDPTMESKYFDSRDFLKHCQDNQYQFDTLRRAKHSTMMILYHL >tr|D7KV09|D7KV09_ARALL Putative uncharacterized protein OX=81972 OS=Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress). GN=ARALYDRAFT_894451 PE=4 SV=1 --------------------------EEEEACQLCVNGRLLYPPPPLYCSLCSRRIDDESFYYTPGEEELTDAKHQICSPCHTKCKTKFTLCGI-----F IDKNKMLRRNNVNANTEEWVQCESCQKWQHQICGLYNKHKDTADYFCPECLLEELPETILSYFLEQRLFRRLKEERYQTAKAEGLTLRVVFSADKTLTVN KQFANLLHRE-NFPSEFPYRSKVILLFQKVDGVDICIFALFVQEFGSEC-GQPNSTYIVYLDSVKYFRPALRTFVYHEILIGYLEYCKLRGFMTSYIWAC PPKKGQDYIMYSHPKTQQTPQTKKLRQWYMSLLKKAAERRIVMNVTNLYDRFTEEYMTAARYFEGSFWCTRAEIMTQEIEKEGNNELQKKVKYLVMEKSK EDFMVVDLNYSVLSSTTEDNDIIQENDMFESQAFLAFSQKHNYNFHTLRHAKHSSMMILHHL >sp|Q9FYH1|HAC2_ARATH Histone acetyltransferase HAC2 OX=3702 OS=Arabidopsis thaliana (Mouse-ear cress). GN=F1N21.4 PE=2 SV=1 ----------------------------EESCQLCDDGTLLFPPQPLYCLLCSRRIDDRSFYYTPGEEELSNAQHQICSPCHSRCKTKFPLCGV-----F IDKHKMLKRSNDNADTEEWVQCESCEKWQHQICGLYNKLKDTAEYICPTCLLEECPETVLSYFLEQRLFKRLKEERYQTAKAEGLTLRVVFSADRTLTVN KQFASLLHKE-NFPSEFPYRSKVILLFQKVHGVDICIFALFVQEFGSEC-SQPNSTYIFYLDSVKYFKPALRTFVYHEVLIGYLEYCKLRGFTTSYIWAC PPKIGQDYIMYSHPKTQQTPDTKKLRKWYVSMLQKAAEQRVVMNVTNLYDRFTEEYMTAARYFEGSFWSNRAEIMIQDIEREGNNELQKKVKYLLMEKNK KDLMVVELNYSQLFSTTEDNDIIQENDMFESQAFLAFSQKHNYNFHTLRHAKHSSMMILHHL >tr|H3H0Z0|H3H0Z0_PHYRM Uncharacterized protein OX=164328 OS=Phytophthora ramorum (Sudden oak death agent). GN= PE=4 SV=1 IVQH-PMDLGTIKRNLTAGEQHVCLVCRGNTCIVCNQQCLPITQPHLQCAGCSTDIRKGSVYFISED-----GTRVWCQKRLARD--RSSTDRMT--DTN SFLDSLIKMKS-EPSVEPWVKCGECDRWLHQVCGLYNPVIGTNPYLCPLCRCRRKRSCELSIFIQNFLRRGLRDIGEHEA-AQTLHVRALSFPGERMTVP EGVVQAFDENARLPAHVSYLSRGLYLFQKHEGMEVCLFTIFAQEFGDDCELAANRVYIAYLDSVRYLKPSARTAAYHLIMLAYFDYVRRHGFSRVHIWSC PPQKRISYVFWCRPIFQKTPSAEHLRRWYNKLLTKAKEHGIVKGWSTMHDRYFSESVQLPPIFDGDIIPSELERILGRIISRNEKKRASEKCQFAVQRLK QDLLVVDLETKGC-----HPETLVPSWFFSRFMFHQLCSYSSYQFDSLRRAKHSTMMMLHH- >tr|G4ZB91|G4ZB91_PHYSP Putative uncharacterized protein OX=1094619 OS=(Phytophthora megasperma f. sp. glycines). GN=PHYSODRAFT_491564 PE=4 SV=1 IVQH-PMDLGTIKRNLAAGEQHACLVCRGNTCIVCNQQCLEVTQPHLQCGGCSTDIRKGSVYFVSED-----GTRVWCQKRLARD--RNSPDREPDDDTD ALLDSLIKKKC-EAGVEPWVKCGECDRWLHQVCGLYNPVIGTGPYACPLCRCRRKRSCELSIFIQNFLRRGLDKIGEREA-AQTLYVRALSFPGERMAVP EAVVRTFDENVRIPAHISYLSRGLYLFQKHEGMEVCLFTIYAQEFGDDCELEANRVYIAYLDSVRYLKPSARTAAYHLILLAYFDYVRRHGFSRVHIWSC PPQKRISYVFWCRPSFQKTPSAEHLRRWYNKLLTKAKDCGIVKDWTTMYDRYFSESVQLPPIFDGDIIPSELDRILGRIMSRNEKKRASEKCQFAVQRLK QDLLVVDLVAEGC-----RPEALVPSWFFSRFMFHQLCSYAGYQFDSLRRAKHSTMMMLHH- >tr|Q01AN4|Q01AN4_OSTTA CREB binding protein/P300 and related TAZ Zn-finger proteins (ISS) OX=70448 OS=Ostreococcus tauri. GN= PE=4 SV=1 --------------------------NAESACRACGVERLTFEPPPLYCYSCVTRIKRGQVYHHAPTLGGETRRDAWCNPCFNGIQGFVDVEGQ-----R FPKQALIKKKNDDDLEEPWVQCDYCEDWYHQICVLFNGRRNEAPFTCPNCILSQLPKTKMSNFLEERLATVLANEREARSTAEGLTIRVVSQTMKQMDTR SHYYSHFKGE-GIPMHLAYRSRVILLFQNLEAVDVCLMAIYVQEYDDEC-PEPNRIYLSYLDSVKYFRPALRTYVYHNILVAYLDYAKARGFTSCFIWAC PPFQGDDYILYCHPKVQKTPKADKLREWYMKMLRSAQQDGIVLSTSNVYDEFRLHDIRNATYFDGDYFSGIVEDWIPTIMKE------------------ -------------------------------------------------------------- >tr|K3WEQ4|K3WEQ4_PYTUL Uncharacterized protein OX=65071 OS=Pythium ultimum. GN= PE=4 SV=1 ------------------REEHSCPSCLANVCGICNEKCINFEPPFVMCGPCQQRIKRHSLYYKTPD-----GFHHWCGKSLPK---LVTLKSAQNQDKQ IAKNFLVKAKFLDELTEPWVQCDHCNGWVHQICALFNASEDEVPYTCPLCRINELQSCALSRFMQTWVRQHLVALGEHEA-AQSIAIKVASSIKSSCQVS QVIREFQSGNQSYPESIDFTSKAIFVFQKINGIEVCIFSMYVQEYDENSNLARNRTYIAYIDSLVYMRRHIRTSLFHETLISYLAFCKGRGFHYAHIWAC PTTRGGDFIYWCHPSFQKNPSKDRLLQWYLKMAEVGKEANVVFACQDLYTCEFENLEQLPPYFDGDYWAAEAERLAACPPKRGKGEKFRKRVVESVKASR ESLFVISLQPKC-------------------------------------------------- >tr|D8UIB3|D8UIB3_VOLCA p300/CBP acetyl-transferase OX=3067 OS=Volvox carteri (Green alga). GN=VOLCADRAFT_121714 PE=4 SV=1 ----------------------------EDACKVCLLAKLSFEPPVIYCSSCGLKIKRGQIYYSTPPEHGNDLKGHQCFTDQKGE--RILVEGV-----A IKKTDLVKRKNDDEIEEGWVQCDHCEGWVHQICGMFNKGRNNVHYLCPECLAYDLPTSKLSEWITERLNRELERERIRRAAPEPLTVRMINSVKKKCEVK PKFHETFAPD-GYVSEFEYRQKVLLLFQSLDG-----------EYGRDC-PAPNTVYLSYLDSVKYFRPSLRTFVYHQLLIGYIEFTRNMGFEQMYIWAC PPMQGDDYILYCHPSKQKTPRSDRLRMWYIDMLKQAREEGIVVHLSTLWDTYFEGGRTYIPYMEGDYWPGEAENQLATIAD------------------- -------------------------------------------------------------- >tr|I1EGZ1|I1EGZ1_AMPQE Uncharacterized protein OX=400682 OS=Amphimedon queenslandica (Sponge). GN= PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- -----------YFPSQFPYRSKALFAFEEIDGAEVCFFGMHVQEYGSDC-PAPNCVYISYLDVLPFFKPEYQTLVSHEIVLGYLEFCKQNGFQTAHLWAS PPAEGDDYIFYCHPMDQDTTEPKRLAEWYRSMLERGKAQSVLFDFQDLFQYCVKEEITTVPYFEGDFWPIAIEETIKKLDQERNAPKRTQKIYETMDNHK EVFFVAYLQPPNKLPTTTDPDSLVTCELMDGDAFLNLARENHWEFSSLCRAKFSTMSMLYEL >tr|K3WB48|K3WB48_PYTUL Uncharacterized protein OX=65071 OS=Pythium ultimum. GN= PE=4 SV=1 --------------------VHSCNACHGSTCILCDQQCLPFAQPHLQCGSCGTDIRKGSIYYVARD-----GTRVWCHKCLVGGASHLSQSNTPTGSAD DVNANLVRMKCDT-TVEPWVKCSGCDRWMHQICGLYNPVQGPHDYVCPLCCWRQIPPCELSEFIQSYLRHELEAIGEGEA-AKSLNVRVLSFPEEKMTIP EGVVRAFDENSRLPSQVSFLSRGIYLFQKHDGVDISLFTMYSQEFGEDCEFAANRVYIAYIDSIRYLKPSARTSAYHLMMMAYFDYIRRRGFERVHIWSC PPQKRISYVFWCRPAFQKTPSAEHLRSWYNRLLGKAKARNIVRDWTTLYDRYLSHDIVLPPIFDGDFIPAELDRILGRIGARNGKIRRASKLQFAVKRLK QDLLVVYLAVNDVVDPTTVPDWCRQVRFFNRFMFHQLCSSAGYQFDSLRRAKHSTMMILHH- >tr|D8MAH7|D8MAH7_BLAHO Singapore isolate B (sub-type 7) whole genome shotgun sequence assembly, scaffold_8 OX=12968 OS=Blastocystis hominis. GN=GSBLH_T00004707001 PE=4 SV=1 ---------------------HFCSSCRGTPCRICGEKCLKYSPPVFVCGDCHERILRNTTYYIIRG-----QKGRYCQKCYAKK---IVGMDK-----A DRKNLFVKKKNDEVFPESWVQCSRCHEWLHCICGLVHPRQVTNNYVCPICLSEDPPTCPLSEYLTEQVYQRVDQVVKRSQLKQNLIVRVVSNITTSVTVK KAIAPLFTPDYSDELSLPYRSKCIAFFQHRNGVDILLFVLYVHEFGENT-TPANRVYISYLDSVHFLSRYLRTPLYHTLLNGYLAYAKSNGYCRAHIWAC PPSRGDDYIFPHHPRDQRTPNADHLIGWYRQLLAEAVEAGIVSHASCQLDEL--LHINAVRS----NNPDINQRQIRCFSIRGVAETE------------ -------------------------------------------------------------- >tr|G7ZY55|G7ZY55_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_067s0085 PE=4 SV=1 ----------------------------------------------------------------------------------KGG--RITFNGT-----S VTKQLLERKTNDEVINEPWVQCNK---YLHCN----------SVYTCPLCRINEIKKTVLSDHIEKRLFERLMQEREERQVADSLTVRKVISVDKQLTVK KQFRDIIPEE-NYPAEFSYRSRVILLFQKIEGADVCIFAMYVQEYGSEC-GNTNCVYISYLDSVNHFTEALRTFVYHEILIGYLDFCKKRGFATCYIHAC APKRGDDYILNCHPKTQKMPKDNKLRKWYISMLTKATKENVVVDLTNMYDHFFYSKVTTARYFDGDCWSGAAMDQAVIIEKECE---------------- -------------------------------------------------------------- >tr|G7K6L2|G7K6L2_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_5g017020 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ----------------SRWVECNKCERWQHQICALYNKKADL---DCSGMGAKDFPRTVLTDHLEKRLFERLNKREKTVLAAESLSIREVPSIDKQL--- ---------------------KVILLFQQIEGADVCIFGMYVQEFGSEC-GNPNCLCISYLDSVKYFRPERCTFVYHEILIGYLDFCNKRGFSTCYIRAC APSKGDDYILYCHPEEQKTPKNDKLRRCYLSMLKKATEENIVVGLTNIYDHFFLSKVTALPHFEGDCWCGNAMEVAKTFEKESVGDYEKLLIYEVIVSGK RWFCTECCHSDDISSNTKHNDIVLESRLFGRDNFLIFCQKSQFQFDTLRRAKYSSMMILYHL >tr|B7FYK8|B7FYK8_PHATC Predicted protein OX=556484 OS=Phaeodactylum tricornutum (strain CCAP 1055/1). GN=PHATRDRAFT_45703 PE=4 SV=1 ---------------------------NGEACALCGLQKRQLEPLSLYCHGNMQPIERHSSYFTDHS-----KSNLWCYDQLHEE--KIILLDDGS---D IRKKDLQEFKNDTCPEEAWITCDECNSQVHEVCALFSRRNEKASYTCPNCYTSKSPHCKMSIDIEKGLHRTLQDLYDAKAQAEGLTVRVLSNVEKKQSVG ARMQRCFSEK-GYPLEFPVRSKCIALFQKIHGVDTLLFSVYVYEYGQEC-PAPNRVYISCLDSVQYFEPSYRKAAYQAIIVEYLRYVKERGFHTAHIWSC PLTPEDGYIFYCHPSHQLIPREDMLQSWYHQLLEKAKSSGVAISTTTLYHEYFEGGATCLPYFEGDYIPGEIENILETIDEKENQSSVQKLIMSRIMKMK DNFLVVHLHNDGV------------------------------------------------- >tr|E5SB68|E5SB68_TRISP TAZ zinc finger family protein OX=6334 OS=Trichinella spiralis (Trichina worm). GN= PE=4 SV=1 --------------------SVKMDDVMRMLGYCCC-RSLSYTVMPLPCGNPSCFIRRNGCYYYYQK---------------------I----------M IRKEKFVILKNNQAIYETFVTCNICSKKWHRICALHHDGIHPEGF------------------------------------------------------- ------CD-SGLMSGTFPYRSKAIFAFQIIDDKEICFFGMFVQEYGSSC-PPPNSIYIAYLDSVKYFEPQLRTTVYQEILLGYMEYAKSLGYRKVNIWAC PSNRNNEYIFYCHPLEQKIPNEKKLQEWYKCLLDKGIMENIIVDYKEMYKHIQDSHFTSVTEFEGDWIPEMLENLIMELKKGIQKRTIFEDALLHIKKHR ETHFVAILYTNNVLKPIDDIDMLISCKLMRTNHFFSMAKEFKWEFSSMRRTKFSTMAICYRL >tr|K3WVF0|K3WVF0_PYTUL Uncharacterized protein OX=65071 OS=Pythium ultimum. GN= PE=4 SV=1 ---------------------------------------------------------------------------------------------------V LKKSSLLKRKNSEVAEEPWVESSEYL---ASNL--------------PLKMAGKDEAERMGNLFSRGVDEPVS------ETHFGVTIREVLSIDKQVQIK PKMGKLLAAHYGKSLQLTYRSRCICVFQELDGVDVLIFTLYVQEYGPDS-LEPNRVYVSYLDSVNYFQPKLRTLMHQQVMLGFLEDCKNRGFHTAHIWSC PPLKGDDYIFFCKPENQKIPKSARLRQWYQKLLQQAKKDGLVYNISNLYAEY-YMKKKAAHEFEGDYWPRLAEDLIKQIEEKTSGGR------------- -------------------------------------------------------------- >tr|K3XE25|K3XE25_SETIT Uncharacterized protein OX=4555 OS=Setaria italica (Foxtail millet) (Panicum italicum). GN= PE=4 SV=1 ---------------------------DQNTCSFCGMERLLFEPPPRFCALCFKIINSTGCYYAEVENG--KDKTSICSKCHHL---------------S SSRAKYVKRFNTDAEAEWWVQCDKCKAWQHEICALFNRKCAKAEYTCAKCFLKEKPRTKLSDHIEQRLSERLEQDRQQRANAEGLAVRVVSSADRVLQVQ PRFHDFFKQE-KYPGEFPYKSKAILLFQKIEGVDVCLFAMYVQEYGSDC-PSPNHVYLAYIDSVKYFRPALRTFVYHEILEGSYVLVHKLGERMRHSWVC TSCKNFHLCDRCHAEEQNTAQKDRH--------------------------------------------------------------------PATTKQK HAFQRIEVEPLP---ETDDGDPTMESKYFDSRDFLKHCQDNQFQFDTIRRAKHSTMMILYYL >tr|F2U9Z8|F2U9Z8_SALS5 Putative uncharacterized protein OX=946362 OS=50818 / BSB-021)). GN=PTSG_05280 PE=4 SV=1 VIKD-PIDLASIQTKLQSQAYATITEDLSSHRYCCG-QLRHLAPEVFRCGPCFVQWGSHYWYYDL----TSDEQIIYCTSKLQDE-FRIPRYGNGTGGEC LKKAAFTKTKHKPLLPEKYVACSQCGREEHEVCVLYVSYTHE-QHICLLCRARQGLSTAMADNIARELRGAVPPEYHN-----NVVVRSVNHSLETCETK PNIRRRY---PQHPTKFPYHRKVVLAFLDMGQFDVCFYGFIVYECDDKC-PQPNRVYISLLDSVKMLPSDMRTGIYHSLVRGYLRHCAQRGFRFCHIFTC PPRKGQNYIFPFKPADQREISVTRLRAWYDELLGDAMFDPAVLNFVNIADAYPDPCFTQLPYFEGDNWPDILEDIIKFEERDQLKRAVMDRLKAVMQRSC KDFLVVELMHFGRPFTP-DPDADKQIAVTGEDSVSSFFKAERMQFNSIRHAAFSTVCLLYVL >tr|D2XMR0|D2XMR0_SACKO CREB-binding protein OX=10224 OS=Saccoglossus kowalevskii (Acorn worm). GN= PE=2 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ------------------------------KMLDKAVSDKVVLDFKDILRQANDDNLTSAKEFEGDFWPNVLEESIKELDQEEEERKMAEA--AAA---K EVFFVIRLQ---SYPHTSDPDPVITCDLMDGDSFLTIAREKHYEFSSLRRARWSSMTMLVEL >tr|E3MDI8|E3MDI8_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_17822 PE=4 SV=1 --------------------------------RCCKLKDLRYGTSNFCMAEE-CRIPPGAKYMCPKKEM-DDEVQNYCLKCFEDA--------------D LDVNDWEEKENTNPALEEIDECGICKKLFHRVCELNIR--SKSSFICKNCSPRRVFEDECAKFMARKLNEFILEN--NRQKKHKIPVTVVSFTKKEVATS EMCPELDASDQKYSETVKFVYRAIYAYHMIDGIDVPFFSMFVTEYPSHA--GQSWCTINYLDTVPYFESIKRGAMHGEIVLTYIDYMKSIGYENAHIWSN PPNQGDDFIFNIHPDYQTFLGQNGLNDWYIRILQKGKEDGIIQSYKTFEEKMKENSMVDIPIFPDSLWSNVMKETNME---TSNKNTFKKKMQANYKKHA VDNFWLKLNEPSGSEPIPT---LFPHPIMDQETFMDKCAEMNLEFSTLRRAKFSSVSLIKL- >tr|E3NLA1|E3NLA1_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_22722 PE=4 SV=1 --------------------------------VCCHQKNLKYSASNVCCGKEGCRIQPGDDYMCEALDA-NGMARTYCISCWNAG--------------T --VDAWSRMANVNKEKEPIKECTVCGDLWHESCSMSN----EEQFRCVHCVPRRKVENQFSVFMADRINRFCKSH--ETRRSHE--VVVVSFTNKTVDLV EERPQHLKKEKLFGRATEYTERLIYVFQ----TDVLFFSMVTHEYPNHC--GKSYCLIDTLDSIPFLDIVRRGEVYQEIILAYFDYMRRIGFEKGHIWAD APIQGDDLFFTCHPSTQLYLTQKKLEGWYEAMLRKGKEDGIFKEWMNFA------RPTDIPIHKGSLWDRLIQD---------------TYLTSEFKNHS KDTFWMDLAPPTQP-----R--SYSHEKLDKYSFLELCEERNWEFSTFQRARFATLGIIEE- >tr|E3MUM2|E3MUM2_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_21375 PE=4 SV=1 --------------------------------SCCMLKNISYGTSNHCKEKDTCRIRKGEEYMCSKAK---PEAENICLDCFDTA--------------D VYKDKWTKKVNINNKREETIDCSECGDSWHKLCALHF----EESFVCPNCCGGERDDPNCEVFLAEKANKLVNEG--N--------VSVASFTTKSVTTR TLMPDFYQKDKKYGSTVDYVARAIYFFQIVDNISVAFFTMFAQEYHNLA--GKSWCVIDYLDSVPWMTSKKPSRVYSQLILAYFEHMGKKGFKHGHLWAN PPCPGDDYAFNVHPEFQKYLDRNALICWYQSLLEEGRKAGLISNFRNFREESADGKPIDLPVFVKSLWQEVLLATQDKMLVAYNKINFEKSLEEDYKERA EDNFYIDLCGSQRLKPLPTQ-NLNSHEILDRETFLDKCIAENWEFSSLRRAKYSSVGIIGL- >tr|I3L466|I3L466_HUMAN CREB-binding protein OX=9606 OS=Homo sapiens (Human). GN= PE=4 SV=1 IVKN-PMDLSTIKRKLDTGQYQEPWQYVDD-------VWLMFN---------------NAWLYNRKT----SRVYKFCSKL------------------- -AEDQFEKKKNDTLDPEPFVDCKECGRKMHQICVLHYDIIWPSGFVCDNCLKKTGQTTRLGNHLEDRVNKFLRRQN--HPEAGEVFVRVVASSDKTVEVK PGMKSRFVDSGEMSESFPYRTKALFAFEEIDGVDVCFFGMHV---------------------------------------------------------- ---------------------------------------------------------------------------------------------------- -------------------------------------------------------------- >tr|G0MAE8|G0MAE8_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_16363 PE=4 SV=1 ------VEQEDIVTKLKKAHCEFID-MFKALGLCCGSIKEELLQTIGCDEGDKCRIRRDDKYYHYKNA---VYDFAICVEHFDED--RFRTRDNK-K--I LHKTDFDFKTHDEVENEAMEVCSHCGKHWHDSCRMNQLFANRSRSMCFKNHDDSIKITVASDRIEEFVNDKIIGT---FPGAKKISIREVATKITTIKSN NKMSKFLKKTTGESLKLKYRYRQFIAVQKMQGRELLLFSFSVQEYLDG--FKEGWTNVEYVDSVKYVNPKLRTTIYQTILLGYFKFAASIGFQHAHIFAM APQEGDSFFFRGRPESQKVSNQEHLINWYHNFLNIGK-EDIIDSFKTMDYSS-DYEMLAAPYFVGGLFTYHFEQILKEIKISLFNNKSRQDIEKIANENA NNLFYIDLKPTET--------PVVYVEYKEEDEWINFQETEKLEFSSLDQAHYATLMIV--- >tr|E4Y1K6|E4Y1K6_OIKDI Whole genome shotgun assembly, reference scaffold set, scaffold scaffold_634 OX=34765 OS=Oikopleura dioica (Tunicate). GN=GSOID_T00014064001 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- -------------------------------------------------------------------------IIFQTIFLGYLEYCRNLGYVWAHIWSC PPSEGDDYIFHCHPQEMKMPRSKRLCDWYRTILNKAQDRGVITEFKDIHSQAKDDGVVCATQFDGDFWPTAIEDLIAEVLKEQAQEKKKYKLNSLMERHK EVFFVIRMHTVEALPPIEDPDKDMPCDLMDGDPFLNKARDEHWEFSTLRRTKYSSMCFLYTL >tr|F0YM02|F0YM02_AURAN Putative uncharacterized protein OX=44056 OS=Aureococcus anophagefferens (Harmful bloom alga). GN=AURANDRAFT_55416 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------GESAAHAPSVAVRLVSCVPQSLRVQ GVLRKFVGGGAELPEAIPFESRSVVLFQKNDGVDLCVFSMYVHEFGDACGPSAKRVYVAYLDSVEYFRPTARTEVYHELLVAYLDWSRRRGFTAAHIWAC PPQRGNNFIFWCHPTHQRTPSRERLTEWYRAMVRRAVETGAVSRVASLYDE------------------------------------------------- -------------------------------------------------------------- >tr|G0PBT5|G0PBT5_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_06504 PE=4 SV=1 ---------GTSKENLENSGPS-----EAICCNICKQGDLEYHVSNLFCHQSSCRIQAGDEYFEAAS-------KTYCKPCFEKT--------------S Y--KRCKKFTHRSQATEKMLQCRTCKKHFHKVCVLFLGDHDM--YVCSACQEPSPRKTDQSKWMEKDVNDDLWRTQSETAEKHHIFIRTMNSSRTEEEMK EQIPEYLKEKEMYGDKFEYNERTICAFQKIDGTDVLFFVMFTQEYKDL--HGKNWVIIDYLDSVPLCNPIDRKEIFGTILNSYLAYMGLIGFTHAHFWAK PPKQGDDYIFHIHPAYQIFHGQIDLQNWYQKSLEIGKRKRKIADFHTSYNSETIKQPTELHVFKDNVWAFMLD-WAKWAVNEKRSKKEQKKLRMLMEKHG KEVFYIDLVKPNDEVANGD-GELKHIVMGDRDKFLNECGDNHWEWFDLRSAKYSTAAVI--- >tr|D8M0W6|D8M0W6_BLAHO Singapore isolate B (sub-type 7) whole genome shotgun sequence assembly, scaffold_14 OX=12968 OS=Blastocystis hominis. GN=GSBLH_T00001829001 PE=4 SV=1 ------------------------------------------------------------------------GRFVWCCAALPPR---GTVDGV-----E VVKSELIEREASPAAKEPWVACCRCRRWFHRDCLLFNEKLARTRILCPLCVAETPRQTPLGRFMEKRIVERVERQRRETPFAGDLHVRVMYAEKTTIPTG KRFQWYFQ-NFEIPAEISAWQKCLCLFQTVDSVDLLLFVMYVTECPDETPCNAHTIYVSYLDSVPYFPPNFRRLVYQEFVLSYLEYSREIGFQRAFLWAC PPTVGDEYILLSHPPEQKMPRELPLLRWYREIAQEGNSRGVIYELTNLVDEC------------------------------------------------ -------------------------------------------------------------- >tr|G0PH64|G0PH64_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_00814 PE=4 SV=1 --------------ELKKAH-SEFIHLFKALGLCCGLVKEEVLETIPCDEEEKCRIRQNDKYYHYKD---ASYDFAICAQSLNDG--FMTRDNKK----A LDKTDFDFKIHDEVENEALEECGHCGKLWHESCRMNLLFTDSDRLTCSKSHDDSIKITLASDRIEKFVNDKIIKAFPD---AEKITIREVATKITTAESN NKMSKFLEKTTKESPILSYRFRQFIAVQKIQGREVLFFSFSVQEYLNE--FKRKWTNVEYVDSIKYISPKLRTSIYQTILLGYFAFAASIGFTNAHIFAM APQEGDSFFFRGRPESQKVSNQQHLLYWYHNFLNVGK-EDIIDSYKTMDYSS-DFKMLNAPYFVGGLFTNHFEEILKKTKVSF----------------- -------------------------------------------------------------- >tr|E3MQM1|E3MQM1_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_10410 PE=4 SV=1 -------------------------------VVCCTNQDTQFNASNRYCLANECRIPPGAKYMYKKGE---RDVANYCIPCYETK---FPA--------H GNKRDWKMAENINTASDVILKCSTCQEKWHKTCALFFNP-DASKFVCMECGGGRSEHAHAAAFMTEKLNELLRSRIGI-TVARKDKIRVCGLSEKSIWTK ALVPAIVEAKKKYSGKFKYVKRAFHVFQRIDGVDVILCSLYTQETP-----MGKRWMIDYFDSLPYFKNLKSGELHQEVFHAYIEYMSSINYLNGHIWSS PPKPGDDFCFNIHPSAMKYLDRNGLIAWYQRILKNGRSLGIIKEYRNFREEL-EHKPSDLPVFVDSLWCQVMKEINFEHPN--SPSEFMEALQKQYEIHA TDNFVIELPGAKTPRER-DTW-YYSHPIGDRMGFLKKCVKENWEFPTLARAMYASVALIQ-- >tr|E3N4F3|E3N4F3_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_23003 PE=4 SV=1 --------------------------------KCCNLNDLRYGTSYKICMAEECRIPPGAKYMCSKAEV-GKEVQNYCLKCFED------A--------D LDVNDWEQKENTNPALEEIEECGICRELFHRVCELNIRS-KSS-FICTTCSPLRLNKDACAKFMTKKLNEFIVAN-NQ-EKRHQQPVTVVSFSEKEVDTS EMCPELDASTTKYSETVKFVYRAIYVYYRIDGIDVPFFSMFVSEYPS---HAGQSWIINYLDTVPYFKGIKRGAMHGEIILTYIDYMKSIGYENAHIWSN PPEQGNDYIFNLHP---DYQKVPG-SKWIERL--KGKDDGIIQSFKTFEEKM-GESCVDIPIFPESLWSNVMIEANFETSY--KKC-FMKKLQVLYKKHA VDNFWINLNKSSGPQPT-IPA-LYPHSIGDRMMFLDKCAEMNLEFSTLRRAKFSSVYLIE-- >tr|G0ND54|G0ND54_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_04501 PE=4 SV=1 ---------------------------RQSGFACCDKHVPPYNSFIRFCAK-GCEIAIGEKYRSNHV------GTVYCLECFDAK---HAVR-------P FKEENWKELENKNDSEEKTLTCD-CGAKYHQCCSLELQ---ESQFTCSACITSNLIRTKFEEFMEDRLNTLLKNR----LPADQLACRLVCYSIKQIVSR PLHRDFT---RLYGEKIEYDMRTIYLFQRLEGVDVIVFAMVCQEYKDI--RGKSWVVIDYLDSVPFVEPAARGAVFREAILSYFAWAKKMGFNHAHFFSN PPQQGTDFILSIHPTSQKYKKPAALLGYYNNLLAKGVERKILAEVRTLEREH-ENQPTDFLPFDGGLWPNCLREVTKKFGKLPAEEYAKKLVPFMKRKFK KNNFFIDLNLTSR--KVSDPDDIKSHVLGDRDAFLEKCRKENWEFSSLRRAKFTSVAVI--- >tr|G0MDL6|G0MDL6_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_31096 PE=4 SV=1 --------------------------------ACCTGPNTKFSSFVRFCET-GCTIKIGDAYWINKQ------GTVYCSACYEAK--------------- EEAKNFNKCKNVNRDEEKTLTCKKCHRKWHECCSLDLQHKN---FTCSSCTKKRTIRTTLTGYSNNRFESFMEENLNPEQMHQPLSV--ASYKEKKFVPR PLRSAFV---EKYGETIKYDVRTIYLFQRVDGVDVLVFVMVCHKYKNI--LGKSWTVIDYLDSVPFVEPEARGTVFRGVILLYLEWAKKIGFNHAHLFSN PPDQGTDYILSIHPIDQLYKTADELLKWYNALLRKGVERGVLVEVRTFKQEM-ERKPTDLLPFEGGLWMNCMSEFDRDIRKEFGKTLPKDQYFEKFKANA KNNFFIDLDLKSKRV--TDPDPRKPHTLGDRDAFLEKCRKHNWEFSSLRRAKYTSVA----- >tr|A8XYN2|A8XYN2_CAEBR Protein CBG20801 OX=6238 OS=Caenorhabditis briggsae. GN=CBG20801, CBG_20801 PE=4 SV=1 ---------------------------VASGPVCCDHQDLQFLTSNGFCGDRSCRIAAGAKFMCAKSKSL--TYDIFCMKCFANK--------FPD---H LNKKNFKETTNVNKKSEDVINCMNCGDAWHRICALHLKK---DQFLCVKCGGGAKEYCRSSKYMSQKANQFLEKEIGED-GARRNPISVVTFSDKNVATR EMAPDLYTGREKYTGTIQFRTRSIYVFQKIDDVDVLFFVMHTQEYKKHA-DELSWFTIDYLDTVPFFQPHLKGFMAGELILIYAEYMKSIGFQKGYLWAN PPKSGDDFIFNIHPQDQKYLNRTRLEKWYRKVFQKGKEDGILNGFHAFKEEFKNQKPTDLPFFHNSLWSRTMKDINLQLAETRHPSFMRELEFEFKDHLT DNF-FLDLANHDETSPTSDVRTKPHKILGDREAFLIHCAKKNWEFSSLRRAKFASVAIIN-- >tr|G0NE11|G0NE11_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_14019 PE=4 SV=1 -----------------------------TSAFCCK-RFLVATSVKQQCNGTTCWIKDGDRCYKNGE-------KYYCMTWFEEK--------------N SQTSRMKKVIYRHFQPEKYETCTTCDTVWHPKCRLTRLLKDEDSLPCRNHNDMLHIKQFSKQFKETPLQKALSKNYHKNRGTELQFVHAICETVTPGEEL EKVK-------VKPEDYTHEANQIFVIVMIKNVETLIFALNTQEYRSG--PKKGWVVIEYLDSLSILT-KDRSAIYKRIIRSYLKYARDSGFKIAHLWSC APDNGVDYIFSGHPKSQRFLRKDQLDEWYLSIFQECDGEKVDVGYAKKYDEV--LKKTTVEVVANGFWTKEIEEIIKKMKENKNDSLENQEIQKIKDASI NSLFYVELANGDEGKETNLEKKTEECEEFSRDKFLDEQCSYDIEFTDLRQAQFATRKILMEV >tr|D8M903|D8M903_BLAHO Singapore isolate B (sub-type 7) whole genome shotgun sequence assembly, scaffold_6 OX=12968 OS=Blastocystis hominis. GN=GSBLH_T00004264001 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ----------------------------------------------------ENFLETDLSRYIEQRIMKLVIDMQKAEAESLNTTLQGLSVRKKTWTMK ENFQKYFQGC-SFISKWPYWVKCICLFQKIDSMDVLLFIMYVYECPEKGSPNEKTAYISYLDSIRYLSPRYRKPIYQEMVLGYFAYVKQHGFKSALIWVC PPKKGDDYIIFSHPLEQQTPNEKRLGDWYHEMLANAHKQGIVTEVSNLVNEYLDVALLAVPQFSDDHWVQDSEE-------------------------- -------------------------------------------------------------- >tr|G0MWI7|G0MWI7_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_11977 PE=4 SV=1 ---------------------------VKRAGHCCSIERMHAMDNLYCVGGRNCLISLGEQYRRLLNENEEESMEVCCLKKVEGK-THVELYVDKHHQ-A KFSNTVERREYQPSDVEEWLECTKCRTQHHATCTVYHPAKDGKKFVCPRCDPLNIRTYRVEDLDTTQATDFFEKELKNPNEDQIIWSRILSAHKKPGDLF QQNRNNSYPNLRRLPPVSYVRKTYGFFIKRDGLDVLFLVLMTHEYKNVASRKNNTIHVEFLDSVKYVEHKPLAAVAESVFQAFCSYARDAGFETIYLWAC PPFTNEGYVFNARPGPPKQTNNQHLYDWYRSAGKCGSVQRVYQGCVTK-DHGTLDDHLDALCYENNPIAGALEQSLEESKDLPEKNRI-RHLRGTLGMDN KKMFYLRLKKADQVGVQQYEEALFPGSF-ASSAFLENQRAHKLEFHNLQSAKYATQLIIF-- >tr|A8WYZ9|A8WYZ9_CAEBR Protein CBG05019 OX=6238 OS=Caenorhabditis briggsae. GN=CBG05019, CBG_05019 PE=4 SV=1 ---------------------------------------LSYSSSNLLCGFQDCRIHPDDHFWIPHR-QNGEFQEAICDTCFDY---------------M ENKKGWRKKINTNEKMEEVFTCQSCSGLWHQCCSFFYGEPE--EFMCRNCAPETSGSSPDSDFIEERINGFLENALERNQKFQKISVRTYFNPEDYAKTE DLAEKFI---RKYGNVVKYGSRAIHVFQQQDGVDQIFFSIFASEYRDPVRDGKSWLVIDCLDSIKLFQPSLRTQIYRELILSYFHLARSMELQNSFLWAD PPLHGDDYVFNIKPANQKTPKKMKLQNWYIKMMEKGKSDGIIKEFRSFAEEK-EKKPTDIPIFQKSLWPPLMCAY----DVEG--KELWESMSVEWKIRG SDYWFIEFNESEKSEQLEEMQERLHQILLRKEEMQYHCFKNNWQFNNPRRARFASVGLI--- >tr|A8X3J8|A8X3J8_CAEBR Protein CBG07171 OX=6238 OS=Caenorhabditis briggsae. GN=CBG07171, CBG_07171 PE=4 SV=1 ---------------------------------------LTYSNFNHQCGAPDCRIRPNDIYWVPRK-QQGEFQEAVCTTCFKE---------------M GNKDGWKQEKNINDRIEEVFKCQECSGLWHQCCSFFYGEPE--QFLCRTCAPEASGSSPDSNFIEERINGFLEEALEEDQEFQKISVRTFYSADDSVKTG DLASKFI---AKYGTVIKFASRAIHVFQRQNNVDQIVFSIFASEYRNL---RKTWMVIDCLDSIKMFEPSLRTKVYQELVLIYFDLARNIGLSHSYLWAD PPIQGDNYIFPVKPANQPSPTPTMLENWYLTVMEKGKSNGIVKEFRSFAEEK-EKKPTEIPIFHNSLWSSLMAVY----DVVG--KKLWKSLSAEWESHG SDNWFIEFNEVEKDEQSEGMQERLHEILLKKEELQYHCMENSWQFGNPRRARYASVGLI--- >tr|A8X3L2|A8X3L2_CAEBR Protein CBG07153 OX=6238 OS=Caenorhabditis briggsae. GN=CBG07153, CBG_07153 PE=4 SV=1 --------------------------------------------------DQDCRIKAMEPYMVEKK-IQGSIEINYCVECFSKA-------------TD LDKKKWKRMINVNEKIEKVLSCQECGMLWHEQCSFYYDD--PEEFICRSCDSDDKTRTRDSDFMERTLNEFLDSKLGGQKKPQKISVRIVRAQSDTINLA PGMFSNEFL-KKYKDKVQRSVRTIFVFQRQKGIEQLFFIIFTSEYPNHG-DGPSWFVLDYLDSVQHFQPHFKTEVYNEIVLTYFDWMRRTGFLKGYIWVD PPQPGDDYLLNIHPPQQKYPGPERLQNWYSNVLAKGRTRGLVREFRDFGAEKKLKKPTDLPFFHASLWTNLMALYSDDMEEKR---IPKSKFWTLMKGHV KDNIFIELTEPDSEKTQEDNRTRISQDLSDGSEFLYTLQEHDFEFGESRRALYASVGVV--- >tr|E3MUM5|E3MUM5_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_21377 PE=4 SV=1 -----PQDETVAKKPSSSGDNNNTPSGSGDNLSCCGLERLRYSTPIRNCDNDTCMIQVGEEYMCRKS---RPEADNICLDCFARA-------N------V KNQNLWIKKVNLNEKVEDTVDCSECGETSHKVCVFHFEE---APFLCGNCSGEPGFKCEINAFLAQKANNQLEDKK--------AKISVASTTQKSTTTK KLMPDLYLKKEKYGSKIDFVARAIYFFQIVDNISVAFFGLFTQEYQDF--GGKSWCVIDYLDSVPWMKVSSKSKIYLELILAYFEYMGLKGFKNGHLWAN PPVKGVDYIFNIHPETQRYLDKVQLIGWYHKILKQGKDTGVLAGYRNFEEEFKKGEFIDLPVFVDSLWHKILEWVNDQLKETNKENFKRM-LVGEYEDRA LDNFYFELRGSE----LPNPIPLQPEELGDRETFLDKCFVENWEFSSLRRAKYSSVGII--- >tr|E3MQK9|E3MQK9_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_10368 PE=4 SV=1 -------------------------------LACCPTDTLPYHAGNLRCVHADCRIPPDAEFWELEDSVT---ETFYCLKCFEE---------------K KPKGKFEKKKNDNDEVEPILECQKCRGSFHLCCSFYYEE-DLSKFICEVCSGVRVEKSPLAAFMTEKMMNFLASRLDG---VVEHPIRICGSAKSSTVVP PILRQFV---KKHSKTMNYVSRALHVYQRIDDVDVITFAIYTAEYELRG-GDEKWSLIDYIDSLPYFKKFQKKEIHHMLIHSYYEYMGGLGFRSAHLWSN PPQKGDDYVFNIHPFDQPFLNPTGLIGWYQSLLRAGQEKKILAGFSDFQGARRFAKPIDIPVFVGSLWSTLFQEVKKTTSMKVFE----SELEEKKAKHG DDNFFIEIAAPTTRQPQETA--HHTHELGDRVKLLEMCVEKNWQFENLRRAKYSSVALIR-- >tr|A8WNJ3|A8WNJ3_CAEBR Protein CBG00580 OX=6238 OS=Caenorhabditis briggsae. GN=CBG00580, CBG_00580 PE=4 SV=1 -------------------------------------QSLRYNASNIICDGDECRIHPNSSYFGGIS---EDEERSFCQKCFKKE-----LHAK-----N IKMDDFHEMINENEASEDVLKCIRCHDKWHRCCSFHLGSPE--SFVCKKHGKGHKEFEKGSKHMEVRLNTFLKRRIGV-KEALKSPIKVISFTAREAAIK EHIVQYYHEKAKFGTHIDYATRATYVFQRQEGVDQLFFVMFTETCWNHGKDGKSWFVIDYLDSVAHFQPHLKTKVYMEVIHSYMDYMRRIGYFYGHLYAN PPLQGDYYIFNVHPEWQKYPTKRRLQKWYHDMFKAGKDAGIIKSSRDFNAHK-IKSAADLPVFVDGLWANLMKQE-DTVDKEEF---EEAMAY-HFKQHG SDNFFIELEQPKGGVKKHD-IYLYAHPILENHLFLQECQKNNWEFGTRRRARFASAGVI--- >tr|G0NE10|G0NE10_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_24928 PE=4 SV=1 ----------------------------KSGFVCCKEKVTTFSSFTRFCEK-GCKISVGDKYRTNGI-------KVYCPDCFNEK---------NAQK-P LKEENWKELKNENEAKEETLTCERCPAKWHRCCSLELQKK---KFVCSSCTRLDESKNEFEKFMEQELNTFLKDHHPQLAKN---RLSVVSYKEKEADPP PLYSDFS---NKYGKTIKYDVRTFYLFQRQGDVDVLVFAMSCHEYKNI--LGKSWVVIDYLDSVPYVVPG--GAVFREAIIAYLAWAKKIGFNHAHFFSD PPQKGTDYILSIHPADQVYKTAEELLGFYVALLEKGVERRILVEWRTLEQELAYAQPSDLLPFNEGLWMNCLREFDEEITEKYGSKKISSLIKRKFKVNA KNNFFIDLNLTADKVNDK-DDPLKSHVLGDRDKFFEKCRKENWEFSSLRRAKYTSVAVI--- >tr|E3NMY8|E3NMY8_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_12449 PE=4 SV=1 -----------------------------------------------CSGKKECRIRPGASYMCADD-------DVYCMRCFGVEKKDNILGDI------ ---NNWRQLENVVETFEVLKECGDCGGLWHESCSMTLAT---TTFICYKCITGIKHECPLSQFMSERMNKLCGKPV-----TRNTGIAVVNVADRPDHLK EQFRN------KYGNTTNCTQRMIYVIQRTSKADVIFFSMICHEYENHA--GTKYCLIDTLDSVPYFTPTSRGAAHHEVMLSYFDFMRRVGFEKAHLWAN APVQGDNMIFTCHPMEQKYLSQVELEGYYEKMLAKGEKSGIFKKWRNFGKEDVESHPIHIPIFEGSQWEYFNQ----KYDPE-PEDKENSEAANFMRKFT RNTFWMDLKKPD---------EPMDPELLDKMSFLELCVENNWEFSSLRRAQFATMGII--- >tr|A8X3L3|A8X3L3_CAEBR Protein CBG07150 OX=6238 OS=Caenorhabditis briggsae. GN=CBG07150, CBG_07150 PE=4 SV=1 ---------------------------PKDLKCCCEILDLPYNVSNLYCGSEVCRIRPKERYFGLKRKGEGEERQYWCNNCFQEA--------KKDG--I VDVNQFVSMQNKILKSEDILECNGCQGSWHRCCTTHLGIRDV--FYGVIRKTEKPRIEMMASFIEQQLNQRICEEVRD---A--QKIKILCTTEKEALTR SLVPEFV---AKYGEKISYRTRATHAFQLQDDGDMIFFTMQTQEYIRPIRGDPKSFVIQILDSVNYLQGVNRTFVYHQLLLSYFDYMRSVGILHGHFQAD PPLKGDDYIFHVHPESQGYLDERSLIGWYRKMMDQGKAEWIIQKFEDFKKAM-PGTIQDLPLFDGELWSKAMLAS----EKEKNFEKAMKEQYSIH--AN DNFFKPKGKIGQ-----LDPRL-FRNKILNVDEFWKHLAMKNAEFVDRRRAVHSSLRVV--- >tr|E3MEC3|E3MEC3_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_22368 PE=4 SV=1 --------------------------------ACCNIDSLPYHTCNLRCDNPDCRIPPDAEFWEFKG---SKEQKVYCLKCYE----TIN----P----N LKRRKFKKKKNENEAVEPILECQKCRRNFHLCCSFFYGE-DLSKFVCEECSGVRVEKCQLAAFMTKKMIDFLAARMG----VVEHPIRICGYSAKTSTVF PPVLRQFL--KKYSNMMNYVSRALHVYQRVDEVDVITFAIYTQEYHFQK-EDDKWCVIDYIDSLPYFKKFNKKEIHRLLIHSYFEYMGGLGFRKAHLWSN PPPKGDDYVFNIHPFDQPFLDFTQLIDWYHKLLKDGKDKKILAKFVNFQEAKCQVKPIDIPVFVGSLWSYVFQEAKKIFKRELEEK---------KKEHG EDNFFIEIAPTTTRQPQSTHNT--HEILGDREKLWETCVDKNWQFETIRRAKYSSVGL---- >tr|G7KH97|G7KH97_MEDTR Histone acetyltransferase OX=3880 OS=Medicago truncatula (Barrel medic) (Medicago tribuloides). GN=MTR_5g085310 PE=4 SV=1 ------------------------------------------------------------------------------VQTSKGE--KITFNGT-----S ISKKNLEKRNNDEVLEEP-VECNKCERWQHQICALYNKKEDV---------------------------------------DSGLIMSFSYSYQKKFQLE KCYLSTTIPEENYPTELSYRSR----------------------------------------------------------IGYLDFCKKGGFSTCYIWAC APSKGDDYILYCHPEEQKTPKSDKLR------------QNIVVGLTNVYDRFFLSKVSRLPCFMGDCWCRNAMVVAETLEKESSGDYEK----------- -------------------------------------------------------------- >tr|A8X3L4|A8X3L4_CAEBR Protein CBG07149 OX=6238 OS=Caenorhabditis briggsae. GN=CBG07149, CBG_07149 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------------------------------------NSFKCKLCVEKDKGGSRLVTAMEEKLNSILRAKLSKDAERNRISVRSMAFE------- ----------KKYGQAICYKTRTIAVFQRQDGVDQVFFMMFVREYKSTF-------VIDYLDSVKYLEADLRKQIYPEILLAYFDFARTLGILHGYIWAK PPVKGDDFIFNIHPEDQPYLDLNRLIGWYRGILDKGVREKRIKKYEDFGEKK-IKKTEDLPLFIDSLWTKKMKE----V--EERPRTDKKQFDQDMDHHQ KDNFFIELVQGCELEDDDTPT-TSHAWIMDSLMFREHCRENNWEFGCRERARFI-------- >tr|G0MQY1|G0MQY1_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_05059 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- -------------DEEEPKIRCKQCRKRSHVVCAQWNPIHDASEFICKNCKPRPLQTTPSSDFIETGINEFMGTLDSL-ETPGRISCRIVACIRRNGRMS DENKKQYPGLSRLMDDIPHTYKLYAITQQIEGQDVLVYMLAVNEYSGKGVPAQGLVSILFLDTVKYLQPYFSRKINQKFLQLYLQNAQQRGFQKARIYAR PPTSSDDFLFHCHPEFKKNLTQQGLFAWYRQAVRELNSAGIQATFKNSFDPSHVKNLCEELYLEEGFWPNFLERNLKKLIKEESEDRDNCQFYIYFDDNK GK-----------TVDARQEDPFIHSEIYTRDAWMDAQIENDWQFDSARRATYSTMMLISS- >tr|L1IWB0|L1IWB0_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_142658 PE=4 SV=1 ------ERISTRNTKVDSNEDEWKDPCA--MCFNPAKKLGNAKTIALICSRCEGIIKQGWPFWQHCDPVSIVNMCIYCFQVVQRDGMYAKVAAPPDESKV ILATEFTS-KVVEQDVESMCECPFCGRKYHDRCVRYNREMYG--DIPPRCIARDMKESPLSQHMQSYVAKKVFKGLEK---SCPVIIRTVSNVKRMQEST PGFTARFGK-----QNFPYLLKNIMCFVECDETDVAFLGLVVAEFGSDA-PEPNKAYISYIDSVQLYHKAERKEVVRRVLLGYLDAVKKRGFEALFIWSM PPNDHHDYIFHMRPLHQHCPTPAQLDSWYAKLLNVAKQEGIITDWDSNARRDEDMSLRHVPQFPGGLMLRALEAAL------------------------ -------------------------------------------------------------- >tr|G0NI20|G0NI20_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_14035 PE=4 SV=1 ----------------------------MSLAFCCG-RFHTATQVYIECKKTTCNIRDGNSCYVKGK-------DHYCAGWCEGS--GN----------- --TRGMKKIIYRHFKPEEYEECGKCKIVWHPKCRQLRLVKKRNDHPCQNNNDMVEIPNFSQAILTTPFQRRLSQIYHHNR--GDL--PELTFIHSTCIKK IPDNRYTKNALPVREDFEYTANQIFVTVPIENVDTMIFSMNTQEYRSG--PRKGFVVIELLDSIAILKEQ-RSKIYKRIIWSYLKYARESGFKFAHLYSC AAEGGVDYIFGGHPLNQVFLKKAALNGWYKGLLEEGNKEFGKMAYEKFFDEALSSEMCRSLVFGNGFLNDEIEKILQKMPQNKEDHFFKKEIQKFIAASK ESLFFVKLQEDGV-----EEKEIMDSTFLNWEEFLDEQYDKDIEFTDLRQAQYATRQILMEL >tr|E3MLU6|E3MLU6_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_31319 PE=4 SV=1 -----------------------------IVCHCCNGGRARWNSPNLKCDKKSCRIKPNTKYFFRTE---DDVQKNLCEKCYNGI----S---------A RQKALYQPWTNETTDMEELYRCPTCENYFHIACALFLGE-DLSKFICRACGLRKLIPTKMAREMERVLNEHVWRNGKEDDKKNHIYIRVLHCVRQSYTTS DHAPATFSTEEKYGEEFQSVNRMICAFQEMDGVDTIFFTMFTQEYEKD--IKENVAILEFLDSVPFVQPASRKGIHRTIIASYYWYLSTIGFTRGHIFAN APVQGDDFGLPIHPSDQFYLSQGKLERFYGGALDLGVRNGLIGDFKTFEEKF-GNKLTDLPFFPEGCWPVKMARMAKVPTGDARKRKFLELLRPYLKSHK KDNFFIELQSALPPNAPIDQEVMMTSSTGNREEFLLFCYQIHLEFRDVQHAMFSSCVL---- >tr|G0MDL7|G0MDL7_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_05920 PE=4 SV=1 -------------------------------------VDLRYGANNMYCETPGCRISAGDRYMEQNT-----DEQVYCIKCYNEK--FVTEEER-----Q ENLQSWTEKMNLNTDKEETRTCSVCGIKFHQCCSL---EIWVAPFICPTCPKPQGRRIRMEKFMEDEINEFYQSTLKDQNGTGVVTYRNKTSVLTKEIVP ASYSAFC---EKYGDSIEYYERTILLFQRLGGVDVLVYVMYAQEYPRL--DGKQWLVVDYMDSVAYVERKISGFVFGELIISYFSYMSRFGYTNVHLFAD PPVQNDDYIFHIHPASQIYKEANSLIRYYGSVFVKGQSDGVISQIQNFEDTQKPKTPTDLIPFHGGLWTRVMEEIEDGLKKDQVEQ----HVKLTPKKYK EHF----------------------------------------------------------- >tr|G0N753|G0N753_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_19178 PE=4 SV=1 --------------------------------TCCPEAPLQYNSSNRTCHG-GCTIFIGDEYMRLKG-------IHYCMNCAATR--------------- -NYKRWTKVVNRNDSVEKIVRCVKCGKAYHRCCSLHVQCRSP--FYCPDCARDLKEPNQCDIHIRTMLNEYLKKTLNDGE-ELQNPISVASFKTKQELVP SLCLPAF--NTKYPSKINYDLRSIYVFQRIEDVDVLVFAMYCQEYIGL--DGNNWAVIHYMDSVPYVVSSSGGFVFGEAIIAYLSYVKSIGFNKAHFFAD PPAHNDDYIFRIHPSTHPYMTIERLIKWYNNVLKKAKEMGVVKSIRTFSQEQPYDGPTSLTLFDDGLWSRVMIEAAEEINFTEQTKIEKKRLEEKFEVNA ENNFFLDLSRIPFFVQ-EDSTRV-HYVLGNQDAFVKKCRRKNWEFSS--------------- >tr|D8LPU2|D8LPU2_ECTSI Putative uncharacterized protein OX=2880 OS=Ectocarpus siliculosus (Brown alga). GN=Esi_0053_0108 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ------------------------------DDKDVLVFIMSVYEFAKTC-PEQTRVYVNYIDSVAYVQPRYRREVFQEVLVSYLESAKNRGFYAAHIWAC PADRGHDYVLYCHPVDQMMPNKDRLQAWYNLALEKALVRGIVESNETMGSMFADNGCIDVPYIPGNIWLRQLDTWV------------------------ -------------------------------------------------------------- >tr|A8XPS7|A8XPS7_CAEBR Protein CBG16774 OX=6238 OS=Caenorhabditis briggsae. GN=CBG16774, CBG_16774 PE=4 SV=1 ---------------------------------CCD-KEPFYRLDNFSCAATNCRVNEGDTYFALKENGEIK-REWLCKKHFD-E--MRSKGEVGSS--G NNRRVWLEHLHDHREYELKKMCSLCSTWKHVVCENFCLDEDTKSFVCQSCDPLALHETYLSKHMEKYCKRLFDEL----RCSSTTSIRLIASGESDSIVP EGPLTDWFNELKIDLNHPFVYKHIGVFQQSTGPDMFMFSMMVREYVNDG-PKKGTTFLYYLDTNQRCPSQFRSKIYQKLLASYWDYARSIGFWKCFVYMS APRKGDDYLFSGHPANQNYLTDDHLYEWYQKAVESARREGTTYQPKKICVEETVAHLADRLFYENGLWPDMLKVYLDEREEETGEE----DFIKFVEGRK NVFFV-DFYPKPLNIIILDRDQLYPSKIAGTESWIDHQLEQHLQFDTERKTKYSTQIQLR-- >tr|G0MDL7|G0MDL7_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_05920 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ----SWTEKINLNTDKEETRTCSVCQIEFHQCCSLE---LWTAPFICPTCPKPQERRMRLEKFMEDKINEFFQSSLKDQAKFSVITYRNKTSVMTKDIVP AKYRNAFCK--KYGQKIEYYERTILLFQRLGGVDVLVYVMYAQEYPLL--DGKQWLVVDYMDSVAYVEPRMSGYVFGELIISYFSYMSRFGYKNVHLFAD PPVQNEDYIFHIHPASQVYKEVDSLIRYYGSVFAKGQRDGVIYRIQKFKDTPSSKKYTDFIPFDGGLWTRVMKEIEDDLEKDQE---------------- -------------------------------------------------------------- >tr|E3NEZ0|E3NEZ0_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_14319 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- -------------------ENCLTCGDQFHSVCREIGRRSRDRADRC-ECRIPQTPQDRCAKVIENRFKKWVKT-------DKKVSIRVLTADDMKAELG EKLEEFFAAMGKQLPLLKMRNRMILVFLEEDGEEILFFAMLANEYKER---KAELITFRYLETVSYINGELRRNVYDSIIFGYMAFAASIGYKKIHFWAC PPDPDDSYLFRGRPAYQTVLDKKKLIEWYTRLLELGKTRGVISNYSQKYNGPKPDELIEHLISDGGFWNKEIGKLFEEKYGCNEHQQVQKDLRRLIGKKD HPIFYITLAPSEEVEAEEENDPPISSKIKTGDVWREFLDDHGLDFTYRETAINATLVILWAL >tr|E3NEY8|E3NEY8_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_14318 PE=3 SV=1 ---------------------------------------------------------------------------------------------------- --------------DCDPIQTCATCGDQFHPVCRDLGRRSNGGENQCPCRIRKAMPLDSCATFLEKCVKNWVHT-------TRKIAIRVTTAENMEADFG EAFEAFYAKMKNLPVQ-KIRNKMIFVFTDDDEDEILFFAMLANEYKDA--AKSDLMTIRYLDSVSFVNGDLRTNVYNSVVLGYMGYTASIGYKKTHFWAC PPDPEDSFLFRGRPAWQPVPTEKRLIKWYTTLLDLGKTRGVVAEYGRDYEGPQEKKLIEHMICDGGYWNEIIG---KKLFIETYGQPILSDLRAEIKLKD KPIFYIDLISRIMI--AIEELPSITSELTTRDTWKKFLEDNELDFTYRETAKYATAFII--- >tr|E3MV02|E3MV02_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_20864 PE=4 SV=1 --------------------------TMNSATVCCDDRDLQHHAASSTCSNASHRIAPGQPCWKARN----GDMYWYCLKCFKSS--------------G ENKEEYFKTENINQAVEEILMCSRCSDRYHKCCAFFYGS-DSSRFVCHNCDKTAKQKCKLANFISKKLNDLLEQLVGK-ETASRHPFRVVGFSVKTTPIE DFTPNLFKE--EFETLLNRVSRALYVYQRIDEVDVISFTLFSSECDQN--KDRKLCQIDYIDSVPYFKNLKRGAIHEVIILAYYEYMTTIGYKHAPLWAE PATPHDDYALHIHPLVMKYLNGNELIGYYRRNHENGVRKGILQEFNNFEEENKDGKFDSPPILPGSLWSIVMKEIEEEVMRNKKDRIKLEQFWRKLKQLA ENNFFITLPVKAAIEE--RVEETNSHEFLDRIKFLNKCAKENWEFVNLRRAKYASVGV---- >tr|E3MAI9|E3MAI9_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_17283 PE=4 SV=1 --------------------------------VCCSERNLMFEIANLICANQKCRVKPGDKYQFNAQ-----KNKVFCLECFSNA-----KEDD------ --KKDCEERMHKKSEFEIVLTC-TCERKWHKVCARYIGNNKK--FVCQQCSKDKNATTALTDILEVVVNGALGLKG-----KELVYFREMGSAEAKKALK TLVPSLHHDDEIYNETIDYVKRALFAFQEREGGEVSFCAMYIQEYQDFANKKNNIIILHYLDTAPHSSPSG-GRLSGTIMNYTLFHASRTGFTQIELWAQ PPLQGDDYVFHNHPSEQMYKTASQLVDWYKNCLQIGVTKGLFESVT-MFGDKFPNGIQSPLELDGPFWADRARKSVENSNKGTKRERLMSHLEPYIKELK STYFYIIPNNKEIELL--PASPFIQRTFVENKDFLYKCMYKHWQWSSLRTAIHATSEMVNH- >tr|G0MQX8|G0MQX8_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_26034 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------EPFPEPEIRCRRCWKEFHVQCVQWNSIVDASEFVCRRCNPRKLQKTPSSIFIEDGINEFLGTLENVSEVPGKMSCRLFVCSDHEKKRF PGIY-------NLMDKIPRTFKVYGVTQEDEGQDVLVFLMAVNEYSKGV-GVPKQGYVEFIDTVKYLKPPQKREINQKLLRLYLQNAQQRGFTKARIHAS APSKEYDFMFNNRPQFKKNLNQKGLFGWYKENMEKLVRDGVKMTYEHVFDPTHHRSVKSILYMAEGFWPVFLERNHKPFKEKVAQE---------SRSRD DCQFYIHFEDLKEMQTVAEEDPFISSTIAARFAWMHVQRENTWLFDTLRHAKFSTMMLLTT- >tr|G0MWI9|G0MWI9_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_11153 PE=4 SV=1 ------------IRNVEAAIWHKFDEEAKRRGLCCGRKQETFTEK--RCSHNVNCLIRDGVYHEKED-----SLLVYCPTHFRQN---VGPRNTQ----- GWREVFVRDEKQ----PKLETCSKCAVQYHPTCTHFKPQSEPLPKMCSKCAGNLLDDTSLSKYMTRYVRHASGKI--------PITIKELYVGDWTEEEK EKNRDLY----EDIKGISYRYRIIGAFQHVAEKLILMALFSVQEYGVSNPPKNGQVCLEYFDTNHQYQQQERTNVYSYILLAYFQYAASLGYSEVHIWAC PPFEGQNYMFLGRSEIQLPTTVEQLHAWYMNMSKKSKLTHKEDDLKNVIGDK--SDIRSIRYYEGGRFMNELAQ--KSIGADEQKFKKKVGDILQRQSYE LCFFKIKRPAKPQM----KKESKIPSTLFSRQSFALWQKREQYDWTTPQKANYFTKKLIW-- >tr|G0N4F7|G0N4F7_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_19493 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- --------------GVEDLEECDECGRYFHSSCRTARLLEGR-GCKC---GVPTEQLISPEDLPETDESIAMQYQLKDLA-EGSIRIRIFMSDEEKLEVK KNITQFLTKR-EERIDLSYRRKLVLAFGKPAGDKLTFFAMMIREYGPNCHPAAGEIGLEYLDSVQFFDKQERSQIYHSILNSYFKCAKDRGFSVVKFWAC PPLPTTEYLFLGHPKQQLNSTEPRLIQFYQDMIASGA--------------------------------------------------------------- -------------------------------------------------------------- >tr|E3MAI8|E3MAI8_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_17282 PE=4 SV=1 ------------------------------ETDCCEKEGLMYEVSNLMCGNIECRIREGHEYYKSES-------SCYCAPCFNSQ--EINPSDKAEYRHE ------VHIKN---IKEEILRCVLCKRNWHKTCYFEKTGIANALFMCTCSIAESIKETRLSQDAEKDVNMKLQRMIPNLKKEEHVLIRELSYVQAEALIS RLFPDYTLKKELYGNSVKYRKRTLAAYQKKDGKVQLFLMIFLQQYRDLQKEGNNWQVLQYLDTVPHAQRPG-GMLSGTVMNSVMYQVSRMGYDKAFVWAK PPQQGDDYVFNKHPKSQEMPDLNKLLLFYNKSFEHAEAAGEIENVETYFKDKKPITLFDIPMFHNSLWDIMINWADYTMKKEKKGTPKHQQIMTVMQKHK EDNFFIVFKNNNDRIVEDEPEPKTASKISSRDEFLTLCVRNNWVFSDERHGAFSTAAIT--- >tr|L1JX38|L1JX38_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_161107 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------------------------------------------------------SDLPETALCVFLTQQAKSVSTEKDVRVRVVSES------- -----LLTTGGGFAPRYPYRVKSIFAFQKVDGEEVIFFGMYVHEFGIDA-PSAGRVYIECLDSIPLFGSTDRQRVLTAIVHGYLKFVRSQGFRHVHLRVP PPSAESSHIFAFRTAAVRLEATLRMAQWYKRLLEGAKEAGLITEYESS------SHLSRL-----EYFPPCIEETAARMQQEAQ-DATEASMLARIVAFK DRFFVANLHDTIDDAAIEDDSSLRLCSVARRRDFIAACDEEKLSFTTLEGAKAASSFL---- >tr|G0NL18|G0NL18_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_21317 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------------------------------------TTFTCTKCNERRRYEDKCSRFIEKRINNA------------NYCCRILSATEKPGTRF DV--PIL---SEL-PPVPYLEKIIGVFSNIGGFDILTFIMIVHEYTGVLSKKNKTVHIHFLDSVSLKEKKPKLTVNQVILRSYCLYAASVGFETIHFYAK PPRQGDDYLFHRHPPHQKYHTHKGLIEWYSTGLNRVKHEGLICEESDRL-----VDVLDRHCYEHGFYPSEMKEKLKNIAVVDEA-RTLEDLKKHFKGYK DPLFFLKLEKPEKMEQAK--EVFFPSSFVN-GNFLENQRLKALRFDSLESAQHATQRIVH-- >tr|F2E7E9|F2E7E9_HORVD Predicted protein OX=112509 OS=Hordeum vulgare var. distichum (Two-rowed barley). GN= PE=2 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ---------------AQKMPRSDKLRSWYQNLIKKAVKEGVVVERNTLYDFFLKAVISAAPYCENDFWPGEAE---------KDDN---TKLGEKMRTMK EDFIMLCLQQFPLP-ETDDGDPTMESKYFDSTDFLKHCQDNQYQFDTLRRAKHSTMMILYNL >tr|E3NEZ2|E3NEZ2_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_14311 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- --------------AKEPIISCTVCNMYYHYKCRKFNEGKSTAECECQKSEILTGPPTFLSKYLQKIVNTRKK-------SSREVIIREVYSEKIQEEFG ENVEKLYNTSGKAVPVIEYKDRIFTVYIKVRGCPINTFSLQVQEYKMG-----KKVYIGYLDSVMFIKDSERGQLYRSVLLAYFDFIRNIGYEDVQIYSQ APSNKKKYMFNGCPKTQRLISQTHLRGWYDRMLREGIEEKIVASYTHTYNIKPDADVFSLIEYIGGSWWKKLERLITSNKEIDFNASKSKKIVDMVKVED EKLYHVKLCCPTGPLMKFEDNYEIGCE-EPDDDWLQYQKRNGLNFETDIGAMFASAVMV--- >tr|L1JZU3|L1JZU3_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_100625 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ------------------------------------------------------------------------------AAGCSNVTVRVLSNVEKRLSLG KVDKSLKMGDDAARGEVVYMNKAVFAFQHVKGADVVIFGMYVNEYWAN--SAPNRVYVECFDSPPVWPGAERHAILTAIMQGYIDFAASNGFKFMHLHVP PPQDATKYIFVQRSLNFRLRVTMHLSCWYKRMLESATALGLISSFH--------------CGFSGDDVPQGLLDVRRREEKESSNSPILAESWQCHGCFN ERFFVAGLRPGRG--SLRDTIPVMPNKIVGSRDCCNFIKEKGLQFHTLSHAQYATMVLVH-- >tr|D7FGQ7|D7FGQ7_ECTSI Putative uncharacterized protein OX=2880 OS=Ectocarpus siliculosus (Brown alga). GN=Esi_0101_0020 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ---------------------------------------MFVHDYGNDC-PSANSVFVSYLDSVPFSAYPRRGDLYLEVLIAYLKDAGRRGFEFAYFWAA PPRNNGSYILNVRPTSMKMPNDEALVRWYTNGLKLALERGYIAGYGNLFERIGNEPKKVAPYLPGFLWPDKAETIFEEN--------------------- -------------------------------------------------------------- >tr|A8XNP3|A8XNP3_CAEBR Protein CBG16323 OX=6238 OS=Caenorhabditis briggsae. GN=CBG16323, CBG_16323 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- -------------VEKEELVKCDQCEHFYHETCRE---MGWGPTGEC-KCRRPEFLGTKYSLLPTNRMSEKLTNAMPE-AYKNQVEIRVVANEMRNGTIG ERLTDFCKKYNIAVPQHKFRYKMILVFLKPDEAEALIFIYTVHEFGEETTPKSKWTVLGYLDSNRYITDGKRQLIFQGIVLSYFLYAASAGYEMCHF-RC AAPGNDNYLFN-DPMLNKKPDQIRLLKWYSNILNSGKTRGIIKKWERKSVDSKGQDLLEKFYLDGGYWPKRIEKIL---EKEMKPEKLREELVKEISNDS DAFFV-DLSKQD---LIKDSTPVIPTMDMSKEDFLAVQVQSGLRYSTNRLLWYSTA------ >tr|E4Y211|E4Y211_OIKDI Whole genome shotgun assembly, reference scaffold set, scaffold scaffold_719 OX=34765 OS=Oikopleura dioica (Tunicate). GN=GSOID_T00016194001 PE=4 SV=1 -----------------------AEPAMRRNGFCCA-QSLTWTALPLCCGKTTCTISVGSWYYCYENDGTFSEKIYYCEKCGKGD--TIATSSDPDNPSL QPKSKFTKNKNDTRDFEPFQKCKRCGRKNHQICVLYKKEIWK-DFICDFCQDNTSPETELSKFIESKVNGFIT--------------------------- ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- -------------------------------------------------------------- >tr|G0NKT3|G0NKT3_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_10524 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- --------------------------------------------------------DTACSMFLQTQLNSLNRSS-----FSNELTVKEVFSDTKDYFVG EHLKSLLDKVGIDLGTVNYRFRMIIVSQIQEGDEVLFFIMTVKEYESLG-----CVAIDYLDSLFYYKPGSRGAVYDAVINSYLDYAVTLGYCKSHIFVC SPRDNGSFLFNKRPKDMVIYDQTKLTNWYKKVLDNNKES--VESYQSYFDDC-----------------KTIEEFIRNLLVSDLLRDMLITLQGVAYEGG KQLIFITLAKKQSSSTSTNQQRIYHSMTESRKSYMDNLERRKLEFDTLEKAKKSTAKIF--- >tr|A8XBK2|A8XBK2_CAEBR Protein CBG10682 OX=6238 OS=Caenorhabditis briggsae. GN=CBG10682, CBG_10682 PE=4 SV=1 ---------------------------DNRRCYKCREGMRLYGQTVFQCMGCEVKIIAKEYYYHYEKGALPV---KVCETCYRSG---KTC--------N KRKDKFVLKQNDR-QPVPFYECR-CGRVAHKYCTHPEDNV--ATFQCDHCTKEVKAKNELTKNLEEELLRRMRRFDENAM----FTVSQ-------QCYK ENMYMNIKENFTTVKKVKYAVRTISVSQKIEDVDCCIVQWIIHETTSADKNGRKWTFPQYLDSLRVFEESDGGVVAAIILEEYFNFMKSFGYTRGFLLAQ VPKPGTEYLFFMRPSWQEKLTQENLLGWYNKTFGEMKKNGSIIGYEVLPSSNVPSTNDSPTQTRADKEPN---NYLIQITEDGGNRQMNGT--------- ------------------DIFSPMSVEIASQDAFYDNLFDHSLEFSSLRWIRYSTAVLT--- >tr|L1IA64|L1IA64_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_120690 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- -----------------------------------------------------------------TPLGSFLTQKARAI-CSRPILIKVVADVLRQRNLR -----TNKGDVERQSGLPFRCKAIFAFQLVDSHMVLCFGMYVHEYGPGCAPESGRIYIEV-KSVSLSLNELVQALLSAIVLGYFDYVRNMGFQYAHMRVP PPTEENR---------------------YHRLLQSAMQARIIASFEAGQAGMFPLSLLDPAEMAAEIAF--QNADLRALDQQTAKLMEA----SRVQQLQ ERFFLIHLLPAGAKVNRPDIAPLLASSVSDRHEFVSICVNQRLRFASLQQAKYATIMLVHR- >tr|G0P2H3|G0P2H3_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_12618 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- --------------------------------------------------------DTPLAQLMHKNLDTLFEGK------RHNLTVKEVFSKELGREAK Q-ML---HRHGIQFIDYSYRYRMICVFQEQDGEEVLFFCMSVKEYRAQ-----KWCSVDYLDSVSLYSGVFRTKVYEELIYSYFEYARSIGYERVHIFVS ALGENEEYFFHKRPTTMRLPEQPML--------------------------------------------------------------------------- -------------------------------------------------------------- >tr|G0P2H2|G0P2H2_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_19526 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- -------------------------------------------------------KDTPLSQLMQKNLDDLFEGTRH------NLTVKEVYSKEDGREAK QMLQRHG----IQFIDYSFRYRMICVFQKHDGEEVLFFCMSVKEYRAQ-----KWCTIDYLDSVSLYSGVFRTKVYEELIYSYFEFARSIGYERVHIFVA ALGENKEYFFHKRPTTMRLPEQPML--------------------------------------------------------------------------- -------------------------------------------------------------- >tr|L1JG14|L1JG14_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_106997 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ---------------GYLHRSKAIMAFQEAGGKEILLFAMYTNEFGADA-QAPNRVYIECIDGLPLERGEEREELVRGIMYGYLEYMKLCGFSFIHLRVP PPHDSNCQIFSRRPADVRLQWSIRMSLWLKKLLRSAAAAKIIDAYQCGAHG-------SILNYPPTLLPAYLECTFAKVEAASANRI----SFASQQQLR ERLFVVRLHPGPRGVRAIDRSPLVPATTASRQDLIKTLTTRRWSFHTLAHATTTTTGLL--- >tr|L1IJS9|L1IJS9_GUITH Uncharacterized protein OX=905079 OS=Guillardia theta CCMP2712. GN=GUITHDRAFT_146008 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- --------------RSSPYRCKTIFLFMSSQAIEILIFAMYIQEHFGPA-KPDTQIVIECIDSTPLYADEERQDILTAVTLAYLEWAKMRGFLAVNIVVP APIEEKNYIFAYRTLNVRLRTSSHLAKWYKRLLEKGMASNIVTGIQ------------------------------------------------------ -------------------------------------------------------------- >tr|G0P2K9|G0P2K9_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_09931 PE=4 SV=1 ------------------------------IGYAENQGKQSFALNEACAGDSSCRIAPGAEYYRNIEDG-----------SLGMD----AVKGKKKK--E QMKSEHERLFYNDYPEEETIKCKRCERVVHKVCERYNPLIDLAEYLCSRCGGTRLKATPSSQAVEAELNATMRAK----GLTRDLYFRTLCCEERDGKIP EHEARYFSNVQKLARDCAHRFKMYGLFDHIGGNDVLIAVLAVDEYRGS--KLPEWKRIQYFDTSKYFEPNLSRELNQSPLRIYSNYARKQGFKKILVYAS APIPGSDYLINCPPLAKANLNQDGLFSYYQETFRG-LTPNFIFGYPN----------------------------------------------------- -------------------------------------------------------------- >tr|A8WRR4|A8WRR4_CAEBR Protein CBG01991 OX=6238 OS=Caenorhabditis briggsae. GN=CBG01991, CBG_01991 PE=4 SV=1 -------------------------PEENVACCSCPLQSRYNVPNRECSGGNESRIPPNADFMGLIE--PSSNGSDYCMECFA-------VEDR-----K IQKERYMKKKNENDKFEEVLRCSRCMKLWHICCSFHMKRSNG--FKCKFCVKNEKGRCRIVTMMEDKLNAILEEQ------------------------- ------------------------------QG--KIFFMLYTHEYKNPA-NQKSWFAIDYLDSVKYLEADRRKQINQEILS------------------- ---------------------------------------------------------------------------------------------------- -------------------------------------------------------------- >tr|A8WRR5|A8WRR5_CAEBR Protein CBG01993 OX=6238 OS=Caenorhabditis briggsae. GN=CBG01993, CBG_01993 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- ------------------------------------------------------------------------GRIYQQQSPSKHAVFKRSNYNLETLFVG HSLESDLFIFTIHPNHQVYLDIERLIKWYRGVLDEGVEKQRIKKYQDFGEKK-IKRIEDLPLFADSLWSKKMKEVVDRSDTD--KKLFNEDMVHHFGEHQ KDNFFIDLAGGCALEEDD--TPTTSHWILDSLTFLEHCRKHNWEIGEPARARLSSVAMMKK- >tr|E3N6D3|E3N6D3_CAERE Putative uncharacterized protein OX=31234 OS=Caenorhabditis remanei (Caenorhabditis vulgaris). GN=CRE_05158 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- --------------------------------------------------------------------------------------------GNAHLWSN TPQKGDDYVFNVHSFDQPFLNSTASI-----------------GARNQ-----FQKPINIPVFVGSLWSIAFQEAKKPNMKD-----FEIELEEKKAIHG ENNFFIEIAVPTSRQPQETA--HHTHEVGDQVKLFEICVDKNWQFEILRRDKYLSVAL---- >tr|G0NKV8|G0NKV8_CAEBE Putative uncharacterized protein OX=135651 OS=Caenorhabditis brenneri (Nematode worm). GN=CAEBREN_22671 PE=4 SV=1 ---------------------------------------------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------MNSYLDYAATLGYCKSHFFVC SLRDNGSFLFNKRPKDMVIYDQTKLTNWYKKVLDN--NDGSVESYQQNLPSYFDD-CKTIEEFIRNLVSDLLRDMLGTLQGVAYGDILRASIKLYFSESG KQLIFITLAKKQSSSTSTNQKRIYHLMTESRKSYMDNLERQKLEFDTLEKAKKSTAKIFE--