BLASTX nr result
ID: Astragalus24_contig00025887
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00025887 (1186 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

dbj|GAO14735.1| hypothetical protein UVI_02028530 [Ustilaginoide...    75   1e-10
gb|OEL28094.1| hypothetical protein BAE44_0010886 [Dichanthelium...    71   5e-10
ref|XP_019039068.1| hypothetical protein WICANDRAFT_62445 [Wicke...    68   1e-08
ref|XP_741839.2| lysophospholipase, putative [Plasmodium chabaud...    68   2e-08
ref|XP_010657700.1| PREDICTED: uncharacterized protein At1g10890...    65   3e-08
ref|XP_021285684.1| uncharacterized protein At1g10890 [Herrania ...    65   3e-08
gb|KDB16058.1| hypothetical protein UV8b_2988 [Ustilaginoidea vi...    67   4e-08
gb|OBZ72246.1| hypothetical protein A0H81_07924 [Grifola frondosa]     65   8e-08
ref|XP_007041495.1| PREDICTED: uncharacterized protein At1g10890...    64   1e-07
ref|WP_089674443.1| cell envelope integrity protein TolA [Halomo...    64   1e-07
emb|SEN03508.1| Cell division and transport-associated protein T...    64   1e-07
gb|KFV87962.1| FYVE and coiled-coil domain-containing protein 1 ...    65   2e-07
ref|XP_017881423.1| PREDICTED: trichohyalin [Ceratina calcarata]       65   2e-07
gb|ABV60383.1| pneumococcal surface protein A, partial [Streptoc...    63   3e-07
ref|XP_019638018.1| PREDICTED: trichohyalin-like [Branchiostoma ...    64   4e-07
gb|OFW85285.1| hypothetical protein A2W06_00195 [Alphaproteobact...    64   4e-07
gb|PAA89684.1| hypothetical protein BOX15_Mlig004592g1, partial ...
64 4e-07 emb|CCP29690.1| Pneumococcal surface protein A, partial [Strepto... 63 4e-07 ref|WP_050888657.1| choline-binding protein [Streptococcus pneum... 63 5e-07 gb|KIR88140.1| hypothetical protein I308_01198 [Cryptococcus gat... 64 5e-07 >dbj|GAO14735.1| hypothetical protein UVI_02028530 [Ustilaginoidea virens] Length = 1906 Score = 74.7 bits (182), Expect = 1e-10 Identities = 83/291 (28%), Positives = 132/291 (45%), Gaps = 41/291 (14%) Frame = -2 Query: 975 VPDSPSPVAHQADARVSEKRVRGEAGGEPRP---VKKSKTSRRPKHKDKVVPLEEKFLEQ 805 V DS + + A+ V+ +A GEP K +K+ + K KDK E+ Sbjct: 808 VDDSGDVITEKDQAKDDPAEVKEDATGEPEADPWEKPAKSKSKSKSKDKEA-------EK 860 Query: 804 TYAQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRA----------------EEKLELEKS 673 A+ +P + ++ E K EK++ EQ A EEK LE+ Sbjct: 861 DKAKKEKEPKLSERELKKLEKEKKKAEKERLEQEAKEAAEREAEEQASREAEEKARLEEE 920 Query: 672 KRAKLVEDMSLKLKELARVKQELQTWKEKQGETVRRAEVAEARVKELEVQVEEREGSVAE 493 +R + E+ + + +E AR++QE + E+ E R AE +AR+K+ E E E ++ + Sbjct: 921 ERIRAEEEAAREAEEQARIEQEEKIRAEE--EAAREAE-EQARIKKEEKIRAEEEAAILK 977 Query: 492 AKGQIRRLEDRIVVL---------------EDDLKRASEAGAGTSMDPDARNAELLA--A 364 + ++ LE++ ++ E+ KRA EA A + D AR AE A Sbjct: 978 EERELAALEEKKLLRGKLTKKDTDKYNRLKENSEKRAKEAEAHEAGDQAAREAEEATRKA 1037 Query: 363 LQQSRKNAKEAARIANE-----ATKEAREAAKSAIELYKEGFECALQQAAL 226 +Q+ A+EAAR A E A + AREA + A +E A +QAAL Sbjct: 1038 EEQAALEAEEAARKAEEQAALEAEEAAREAEEQAALEAEEAAREAEEQAAL 1088 >gb|OEL28094.1| hypothetical protein BAE44_0010886 [Dichanthelium oligosanthes] Length = 282 Score = 70.9 bits (172), Expect = 5e-10 Identities = 64/223 (28%), Positives = 108/223 (48%), Gaps = 9/223 (4%) Frame = -2 Query: 966 SPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTYAQWR 787 SPSPV ++ R S +R P P ++ S PK + P ++++ + Sbjct: 34 SPSPVRSRSPYRPSHRR----RSPSPSPRRRKSRSPSPKRRKSPSPSQKRYRRKRSPSVS 89 Query: 786 VDPVGASTQIFLNLAE-VHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELARVK- 613 P+ AS L LAE + +KQ+ E EEK +K +L+E+ + K E A K Sbjct: 90 SSPINASQSSRLGLAENKNATDKQRLE---EEKKRRQKEVELRLLEEETAKRVEQAIRKK 146 
Query: 612 -QELQTWKEKQGETVRRAEVAEARVK-ELEVQVE-EREGSVAEAKGQIRR----LEDRIV 454 +E +E + E RR E R++ E+ VQ+E E+E ++ EAK ++ R E+ Sbjct: 147 VEESLNREEIKHEIQRRLEEGRKRIREEVAVQIEKEKEAALNEAKQKVEREKQEREELEK 206 Query: 453 VLEDDLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAAR 325 LE++ K+A EA +M+ + E L++ +K +EA + Sbjct: 207 KLEEERKKAEEALMKEAMEQQQKELERYQELERLQKEREEAMK 249 >ref|XP_019039068.1| hypothetical protein WICANDRAFT_62445 [Wickerhamomyces anomalus NRRL Y-366-8] gb|ODQ59861.1| hypothetical protein WICANDRAFT_62445 [Wickerhamomyces anomalus NRRL Y-366-8] Length = 753 Score = 68.2 bits (165), Expect = 1e-08 Identities = 74/302 (24%), Positives = 143/302 (47%), Gaps = 8/302 (2%) Frame = -2 Query: 1014 VTQEDLAHDRIDFVPDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKV 835 +TQE+L V +PV A+ ++R + + + + ++ K R+ K + + Sbjct: 309 MTQEELDSIASKIV----NPVLEDISAKAKKQREK-DLEIQKKKEEQIKLHRQVKLQQQA 363 Query: 834 VPLEEKFLEQTYAQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLV 655 LEEK L++ + LN + + EK+K +Q KLELEK+K+ +L Sbjct: 364 KKLEEKRLKE--------------EAKLNRRKEMEQEKEKQKQA---KLELEKAKKEELS 406 Query: 654 EDMSL---KLKELARVKQELQTWKEKQGETVR--RAEVAEARVKELEVQVEEREGSVAEA 490 + + K KE R+K+EL K+ + E ++ + + R KEL+ EER+ +A Sbjct: 407 KHQDILTAKQKEEERLKKELLAKKQAEEERIQDESTKAEKQRSKELQDAKEERDLKLAPI 466 Query: 489 KGQIRRLEDRIVVLEDDLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEA 310 Q++ D++ VL ++ K+A + + + LL A Q NA+ + + Sbjct: 467 LDQLKIETDKLAVLNEE-KQAIQDITDSQVKNTENARNLLIASQTELTNAENQLELIKQD 525 Query: 309 TKEAREAAKSAI---ELYKEGFECALQQAALVNPSLCLDRALIDVDHEVDGNHIVKIDAK 139 ++ + +++ I EL K+ E AL+++ + L +A ID + + N +K++ + Sbjct: 526 IAKSNDESETLIKESELKKQEAEVALKKSNEIEAEALLKQAEIDKEKAIVENERLKLELE 585 Query: 138 TE 133 E Sbjct: 586 LE 587 >ref|XP_741839.2| lysophospholipase, putative [Plasmodium chabaudi chabaudi] emb|CDR12318.1| lysophospholipase, putative [Plasmodium chabaudi chabaudi] Length = 880 Score = 67.8 bits (164), Expect = 2e-08 Identities = 63/226 (27%), Positives = 107/226 (47%), Gaps = 1/226 (0%) Frame = -2 Query: 924 
EKRVRGEAGGEPRPVKKSKT-SRRPKHKDKVVPLEEKFLEQTYAQWRVDPVGASTQIFLN 748 EK+ + EA E KK K +++ K +++ EE E+ A+ + + + Sbjct: 641 EKKAKEEAKKEKEEAKKEKEEAKKAKEEEEKKAKEEAKKEKEEAKKEKEEAKKAKEEAKK 700 Query: 747 LAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELARVKQELQTWKEKQGETVR 568 E K K++ E++A+E+ + EK + AK ++ + K KE K + + KEK+ E + Sbjct: 701 EKEEAKKAKEEEEKKAKEEAKKEKEE-AKKEKEEAKKAKEEEEKKAKEEAKKEKE-EAKK 758 Query: 567 RAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLEDDLKRASEAGAGTSMDPDA 388 E E + KE E + +E +AK + ++ + E+D K+A E D Sbjct: 759 EKEEEEKKAKEEEKKAKE---DAKKAKEEAKKAK------EEDAKKAKEEEKKAKEDAKK 809 Query: 387 RNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYKEGFE 250 E A ++ K AKE A+ A E K+A+E AK A E K+ E Sbjct: 810 AKEEAKKAKEEEEKRAKEEAKKAKEDAKKAKEDAKKAKEDAKKAKE 855 >ref|XP_010657700.1| PREDICTED: uncharacterized protein At1g10890 isoform X1 [Vitis vinifera] emb|CBI23816.3| unnamed protein product, partial [Vitis vinifera] Length = 278 Score = 65.5 bits (158), Expect = 3e-08 Identities = 61/242 (25%), Positives = 113/242 (46%), Gaps = 18/242 (7%) Frame = -2 Query: 966 SPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKT---------SRRPKHKDKVVPLEEKF 814 SPSPV H+ R R R +KS++ S P+H+ P ++ Sbjct: 16 SPSPVGHRYGRRSRRDRSRSPYSSYSHSRRKSRSISPRRRKSRSPSPRHRKSRSPTPRRY 75 Query: 813 LEQTYAQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKL 634 Q + + P+ S+ L + HK +K + EEK ++ KL+E+ + + Sbjct: 76 KRQKSSATSLSPMHKSSSPSLGSVD-HKNASEKVRKEEEEKKRRQQEAELKLIEEETTRR 134 Query: 633 KELA---RVKQELQTWKEKQGETVRRAEVAEAR-VKELEVQVE-EREGSVAEAK---GQI 478 E A +V++ L + +E + E RR E R + E+ +Q+E E+E ++ EA+ Q Sbjct: 135 VEEAIRKKVEESLNS-EEIKLEIQRRLEEGRKRLLDEVAIQLEKEKEAALIEARQKEEQA 193 Query: 477 RRLEDRI-VVLEDDLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKE 301 RR ++ + +LE++ +R E+ +++ R E L+Q ++ +EA R + +E Sbjct: 194 RREKEELEKMLEENRRRVEESQKREALEQQRREEERYRELEQIQRQKEEALRRKKQEEEE 253 Query: 300 AR 295 R Sbjct: 254 ER 255 >ref|XP_021285684.1| uncharacterized protein At1g10890 [Herrania umbratica] ref|XP_021285685.1| uncharacterized protein At1g10890 [Herrania umbratica] 
ref|XP_021285686.1| uncharacterized protein At1g10890 [Herrania umbratica] Length = 282 Score = 65.5 bits (158), Expect = 3e-08 Identities = 67/243 (27%), Positives = 115/243 (47%), Gaps = 19/243 (7%) Frame = -2 Query: 966 SPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKT-------SRRP---KHKDKVVPLEEK 817 SPSPV H+ R R R +KS++ SR P +HK + P + Sbjct: 20 SPSPVGHRYGRRSRRDRSRSPYSSYSYSRRKSRSISPRRRKSRSPTARRHKSRS-PTPKH 78 Query: 816 FLEQTYAQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLK 637 F Q + P S+ L L E K +K +++ EEK ++ KL+E+ + K Sbjct: 79 FKRQRSRSSSLSPTHKSSSPSLGLIE-RKNASEKLKKQEEEKKRRQQEAELKLIEEETTK 137 Query: 636 LKELA---RVKQELQTWKEKQGETVRRAEVAEARVK-ELEVQVE-EREGSVAEAK---GQ 481 E A +V++ L + + KQ E RR E R+ E+E Q+E E+E ++ EA+ Q Sbjct: 138 RVEEAIQKKVEESLNSEELKQ-EIQRRLEEGRRRLNDEVEAQLEKEKEAALLEARRKEEQ 196 Query: 480 IRRLEDRI-VVLEDDLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATK 304 R+ ++ + +LE++ KR EA +++ R E L++ ++ +EA + + + Sbjct: 197 ARKEKEELEKMLEENRKRVEEAQRREALEQQRREEERYRELEELQRQKEEAMKRKKQQEE 256 Query: 303 EAR 295 E R Sbjct: 257 EER 259 >gb|KDB16058.1| hypothetical protein UV8b_2988 [Ustilaginoidea virens] Length = 1991 Score = 67.0 bits (162), Expect = 4e-08 Identities = 77/265 (29%), Positives = 121/265 (45%), Gaps = 41/265 (15%) Frame = -2 Query: 897 GEPRP---VKKSKTSRRPKHKDKVVPLEEKFLEQTYAQWRVDPVGASTQIFLNLAEVHKI 727 GEP K +K+ + K KDK E+ A+ +P + ++ E K Sbjct: 921 GEPEADPWEKPAKSKSKSKSKDKEA-------EKDKAKKEKEPKLSERELKKLEKEKKKA 973 Query: 726 EKQKFEQRA----------------EEKLELEKSKRAKLVEDMSLKLKELARVKQELQTW 595 EK++ EQ A EEK LE+ +R + E+ + + +E AR++QE + Sbjct: 974 EKERLEQEAKEAAEREAEEQASREAEEKARLEEEERIRAEEEAAREAEEQARIEQEEKIR 1033 Query: 594 KEKQGETVRRAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVL----------- 448 E+ E R AE +AR+K+ E E E ++ + + ++ LE++ ++ Sbjct: 1034 AEE--EAAREAE-EQARIKKEEKIRAEEEAAILKEERELAALEEKKLLRGKLTKKDTDKY 1090 Query: 447 ----EDDLKRASEAGAGTSMDPDARNAELLA--ALQQSRKNAKEAARIANE-----ATKE 301 E+ KRA EA A + D AR AE A +Q+ A+EAAR A E A + Sbjct: 1091 
NRLKENSEKRAKEAEAHEAGDQAAREAEEATRKAEEQAALEAEEAARKAEEQAALEAEEA 1150 Query: 300 AREAAKSAIELYKEGFECALQQAAL 226 AREA + A +E A +QAAL Sbjct: 1151 AREAEEQAALEAEEAAREAEEQAAL 1175 >gb|OBZ72246.1| hypothetical protein A0H81_07924 [Grifola frondosa] Length = 393 Score = 65.1 bits (157), Expect = 8e-08 Identities = 55/191 (28%), Positives = 91/191 (47%), Gaps = 7/191 (3%) Frame = -2 Query: 780 PVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELARVKQELQ 601 P ST + AE EK+ E+ ++KLE E+ +R +E + K+++ ++ L+ Sbjct: 85 PQPPSTVVDTAAAEKEAEEKRMKEEEEKKKLEEEEQRR---IETEAEKVRQEEEERKRLE 141 Query: 600 TWKEKQGETVRRAEVAEARVKELEVQ------VEEREGSVAEAKGQIRRLEDRIVVLEDD 439 K +Q E +R E A K LE + E + + A+ K R E+ V +E++ Sbjct: 142 AEKARQEEEKKREEAEAAEKKRLEDEEAAAEAAEAADAAEAQRKEDERLAEEARVKVEEE 201 Query: 438 LKRASEAGAGTSMDP-DARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYK 262 ++A EA + D R + +AA + + AKE R A E ++A E K+ E K Sbjct: 202 RRQAEEARQKAEEEAEDKRKMDEVAAAAAAAEEAKEEERRAQEEAEKAEEEQKAKEEEAK 261 Query: 261 EGFECALQQAA 229 +G E A Q A Sbjct: 262 KGEEAAKAQEA 272 >ref|XP_007041495.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] ref|XP_007041496.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] ref|XP_007041497.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] ref|XP_007041498.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] ref|XP_007041499.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] ref|XP_017971453.1| PREDICTED: uncharacterized protein At1g10890 [Theobroma cacao] gb|EOX97326.1| Uncharacterized protein TCM_006388 isoform 1 [Theobroma cacao] gb|EOX97327.1| Uncharacterized protein TCM_006388 isoform 1 [Theobroma cacao] gb|EOX97328.1| Uncharacterized protein TCM_006388 isoform 1 [Theobroma cacao] gb|EOX97329.1| Uncharacterized protein TCM_006388 isoform 1 [Theobroma cacao] gb|EOX97330.1| Uncharacterized protein TCM_006388 isoform 1 [Theobroma cacao] Length = 282 Score = 63.5 bits (153), Expect = 1e-07 Identities = 65/242 (26%), 
Positives = 113/242 (46%), Gaps = 18/242 (7%) Frame = -2 Query: 966 SPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKT-------SRRP--KHKDKVVPLEEKF 814 SPSPV H+ R R R +KS++ SR P +H P ++F Sbjct: 20 SPSPVGHRYGRRSRRDRSRSPYSSYSYSRRKSRSISPRRRKSRSPIARHHKSRSPTPKRF 79 Query: 813 LEQTYAQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKL 634 Q + P S+ L L E K +K +++ EEK ++ KL+E+ + K Sbjct: 80 KRQRSRSSSLSPTHKSSSPSLGLIE-RKNASEKLKKQEEEKKRRQQEAELKLIEEETAKR 138 Query: 633 KELA---RVKQELQTWKEKQGETVRRAEVAEARVK-ELEVQVE-EREGSVAEAK---GQI 478 E A +V++ L + + KQ E RR E R+ E+ Q+E E+EG++ EA+ Q Sbjct: 139 VEEAIQKKVEESLNSEELKQ-EIRRRLEEGRRRLNDEVAAQLEKEKEGALLEARRKEEQA 197 Query: 477 RRLEDRI-VVLEDDLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKE 301 R+ ++ + +LE++ KR EA +++ R E L++ ++ + A + + +E Sbjct: 198 RKEKEELEKMLEENRKRVEEAQRREAVEQQQREEERYRELEELQRQKEVAMKRKKQQEEE 257 Query: 300 AR 295 R Sbjct: 258 ER 259 >ref|WP_089674443.1| cell envelope integrity protein TolA [Halomonas aquamarina] Length = 378 Score = 64.3 bits (155), Expect = 1e-07 Identities = 66/255 (25%), Positives = 109/255 (42%), Gaps = 3/255 (1%) Frame = -2 Query: 972 PDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTYAQ 793 P+ P P + +E++ EAG K ++ +R LEQ A+ Sbjct: 79 PEEPDPEPPSDEPSAAEQQAAAEAGQREAEAKAAEQARA--------------LEQAQAE 124 Query: 792 WRVDPVGASTQIFLNLAEVHKIE---KQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELA 622 + + + LAE E ++ EQRA+ + E ++ + A+ E A Sbjct: 125 AEAEAQRRAEEA-ERLAEQQAAEAQAREAEEQRAQAEAEAQRQREAE---------AERA 174 Query: 621 RVKQELQTWKEKQGETVRRAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLED 442 R + E Q +E + + R E AR E + Q E E EA+ Q +R E+ E Sbjct: 175 RAEAEAQRQREAEEQRAREEEAQRAREAEAQRQREAEEQREREAEAQRQREEEARRQREA 234 Query: 441 DLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYK 262 + +RA EA A + + R A A R+ A EA AN ++A++AA S I + + Sbjct: 235 EEQRAREAEAQKQREAEERRAAEAAEAAMQRQLAGEAEAAAN--AQQAQQAANSFINIVR 292 Query: 261 EGFECALQQAALVNP 217 A++QA ++ P Sbjct: 293 R----AVEQAWVIPP 303 >emb|SEN03508.1| Cell division and transport-associated protein TolA 
[Halomonas aquamarina] Length = 385 Score = 64.3 bits (155), Expect = 1e-07 Identities = 66/255 (25%), Positives = 109/255 (42%), Gaps = 3/255 (1%) Frame = -2 Query: 972 PDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTYAQ 793 P+ P P + +E++ EAG K ++ +R LEQ A+ Sbjct: 86 PEEPDPEPPSDEPSAAEQQAAAEAGQREAEAKAAEQARA--------------LEQAQAE 131 Query: 792 WRVDPVGASTQIFLNLAEVHKIE---KQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELA 622 + + + LAE E ++ EQRA+ + E ++ + A+ E A Sbjct: 132 AEAEAQRRAEEA-ERLAEQQAAEAQAREAEEQRAQAEAEAQRQREAE---------AERA 181 Query: 621 RVKQELQTWKEKQGETVRRAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLED 442 R + E Q +E + + R E AR E + Q E E EA+ Q +R E+ E Sbjct: 182 RAEAEAQRQREAEEQRAREEEAQRAREAEAQRQREAEEQREREAEAQRQREEEARRQREA 241 Query: 441 DLKRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYK 262 + +RA EA A + + R A A R+ A EA AN ++A++AA S I + + Sbjct: 242 EEQRAREAEAQKQREAEERRAAEAAEAAMQRQLAGEAEAAAN--AQQAQQAANSFINIVR 299 Query: 261 EGFECALQQAALVNP 217 A++QA ++ P Sbjct: 300 R----AVEQAWVIPP 310 >gb|KFV87962.1| FYVE and coiled-coil domain-containing protein 1 [Struthio camelus australis] Length = 1505 Score = 65.1 bits (157), Expect = 2e-07 Identities = 52/194 (26%), Positives = 92/194 (47%), Gaps = 9/194 (4%) Frame = -2 Query: 765 TQIFLNLAEVHKIEKQKFEQRAE-EKLELEKSKRA--------KLVEDMSLKLKELARVK 613 TQ+ +LA+V +EK E R E EKLE E S+R L E + L+ L +V Sbjct: 512 TQVMGSLAQVGSLEKNLEEARKEKEKLEEECSRREGALKHKAHSLAEQLELQEGHLTKVS 571 Query: 612 QELQTWKEKQGETVRRAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLEDDLK 433 + + +E++ + E +VK+LE Q+E++ +V+E + ++L+ L+ K Sbjct: 572 HTVHSLEEQKQKISSEKEHLSQKVKQLEEQLEQQNSAVSEKDEENQKLKSENADLQQAKK 631 Query: 432 RASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYKEGF 253 + E G +D D +NA + + + R +A + + K+ EA K + L + Sbjct: 632 KMEEKGQNKQLDEDLQNARRQSQILEDRLDALHS---DYKELKQREEATKESCALLEGQL 688 Query: 252 ECALQQAALVNPSL 211 + A Q + SL Sbjct: 689 KRAKQDCLQMEKSL 702 >ref|XP_017881423.1| PREDICTED: trichohyalin [Ceratina calcarata] Length = 2857 Score = 65.1 bits (157), 
Expect = 2e-07 Identities = 57/242 (23%), Positives = 115/242 (47%), Gaps = 34/242 (14%) Frame = -2 Query: 858 RPKHKDKVVPLEEKFLEQTYAQWRVD------PVGASTQIFLNLAEVHKIEKQKFEQRAE 697 R KH +K+ ++EK LE Q +V+ + S ++ +N + + + E++ E Sbjct: 1849 REKHSEKIEKVKEK-LEAEAKQLQVERDQLIIQLEKSQEMLVNFQQELSTNEAELERQRE 1907 Query: 696 EKLELEKSKRAKL---------VEDMSLKLKELARVKQELQTWKEKQGETVRRAEVAEAR 544 E L++ ++ ++ E + +++E+ R+ Q++Q+ + Q + +RAE AE R Sbjct: 1908 EVCRLQQLQQQRVHAQTPDRAAKEALEAQMREVHRLSQQVQSLTQAQTKERQRAEQAEKR 1967 Query: 543 VKELEVQVEEREGSVAEAKGQIRRLEDRIVVLEDDLKRASEAGA-GTSMDPDARNAE--- 376 V+EL+ Q+ R+ S A ++E + E + +RA A + +N E Sbjct: 1968 VQELQKQITSRDASAAAGTANEAQVEQWRKLCEQEKQRADAAERQANELQKRIQNTERQL 2027 Query: 375 ---------LLAALQQSRK------NAKEAARIANEATKEAREAAKSAIELYKEGFECAL 241 + A+QQ ++ N +EAAR+ E + E ++A+E +E F+ L Sbjct: 2028 HAQQQQIQQMQVAMQQQQQQQPQQQNGQEAARLRKELERAREEVKQAAVE--RERFQAQL 2085 Query: 240 QQ 235 ++ Sbjct: 2086 EK 2087 >gb|ABV60383.1| pneumococcal surface protein A, partial [Streptococcus pneumoniae] Length = 392 Score = 63.2 bits (152), Expect = 3e-07 Identities = 78/290 (26%), Positives = 123/290 (42%), Gaps = 28/290 (9%) Frame = -2 Query: 978 FVPDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTY 799 FV +PVA Q+ A + +A ++ ++ +++ K+KD EEK ++ Sbjct: 6 FVRAEEAPVASQSKAEKDYDIAKRDAENAKEALENARRAQK-KYKDDQKRTEEKAEKERK 64 Query: 798 AQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELAR 619 A Q+ L +K E E+K ++EK A E+ K EL + Sbjct: 65 ASEEEQAANLKYQLLL--------QKYGSESDREKKKQIEKQADAAK-EENERKKAELNK 115 Query: 618 VKQELQTWKEKQGETVRR-AEVAEARVKELEVQVEEREGSVAEAK--------------G 484 ++QE+ +Q E RR AEVA+A+ L +VEE E V EAK Sbjct: 116 IRQEMVVPSSQQLEVTRRKAEVAKAKEPGLRKRVEEAEKKVTEAKQKLDAERAKEVALQA 175 Query: 483 QIRRLEDRIVVLEDDLKRASEAG----------AGTSMDPDARNAELLAALQQSRKNAKE 334 +I LE+ + LE LK E+ A + DA+ A+L + S K + Sbjct: 176 KIAELENEVHKLEQKLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEELSDKIDEL 235 Query: 333 AARIAN-EATKEAREAAKSAIELYKEGFE--CALQQAALVNPSLCLDRAL 193 A IA E +A E + + +KEG E A ++A L L 
+A+ Sbjct: 236 DAEIAKLEDQLKAAEENNNVEDYFKEGLEKTIAAKKAELEKTEADLKKAV 285 >ref|XP_019638018.1| PREDICTED: trichohyalin-like [Branchiostoma belcheri] ref|XP_019638019.1| PREDICTED: trichohyalin-like [Branchiostoma belcheri] Length = 577 Score = 63.5 bits (153), Expect = 4e-07 Identities = 56/216 (25%), Positives = 110/216 (50%), Gaps = 11/216 (5%) Frame = -2 Query: 741 EVHKIEK--QKFEQRAEEKLELEKSKRAKLVEDMSLKLKELARVKQELQTWKEKQGETVR 568 E H EK ++ EQR+ +K E+ ++ + E++ + +E R K+E + K+K+ E ++ Sbjct: 159 EQHSQEKFRRREEQRSAQKRRDEEERKKREAEELRKRTEEENRKKEEERNLKQKEEEKIK 218 Query: 567 RAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLED---------DLKRASEAG 415 E + E E + EE+E EA+ + +RLE++ + E+ ++KRA E Sbjct: 219 -----EKKRLEEEKEREEKERKRLEAEKEKQRLEEQRRLEEEEGRRQAELIEVKRAQEIQ 273 Query: 414 AGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYKEGFECALQQ 235 + AR + L+++ K+A+ AA+ A EA +A+ A+ A+E K+ ++ Sbjct: 274 LKLEAEAKARQEQRSKELEEA-KDAEVAAKEAEEAENQAKREAEEAMEAAKKAKTAEEKE 332 Query: 234 AALVNPSLCLDRALIDVDHEVDGNHIVKIDAKTEEK 127 AA ++A + + + N +K+ K++EK Sbjct: 333 AAEKARKKAEEKAKAERESRIRNN--LKVAVKSKEK 366 >gb|OFW85285.1| hypothetical protein A2W06_00195 [Alphaproteobacteria bacterium RBG_16_42_14] Length = 1770 Score = 63.9 bits (154), Expect = 4e-07 Identities = 70/282 (24%), Positives = 130/282 (46%), Gaps = 10/282 (3%) Frame = -2 Query: 945 QADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTYAQWRVDPVGAS 766 +A A V EKR +A E +K + K + E++ L + Q R D A Sbjct: 530 EAAAEVEEKRKEADAAVEVERKRKEAEAAVEVEKKRKEEAEKQHLVED--QKRKDAEVAE 587 Query: 765 TQIFLNLAEVHKIEKQKFEQRAE---EKLELEKSKRAKLVEDMSLKLKELA-----RVKQ 610 T+ AEV ++EK++ E A E+ E++++ +LVED K E+A R + Sbjct: 588 TERQRKEAEVAEVEKRRKETEAAAEAERKRKEEAEKQRLVEDQKRKDAEVAETERQRKEA 647 Query: 609 ELQTWKEKQGETVRRAEVAEARVKELEVQ--VEEREGSVAEAKGQIRRLEDRIVVLEDDL 436 E+ ++++ ET AE R +E E Q VE+++ AE R+ ++ +E + Sbjct: 648 EVAEVEKRRKETEAAAEAERKRKEEAEKQRLVEDQKRKDAEVAETERKRKEAEAAVEVEW 707 Query: 435 KRASEAGAGTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREAAKSAIELYKEG 256 KR A ++ + E A +++ RK A A + + 
R+ A++A+E+ K+ Sbjct: 708 KRKEAEVA--EVEKRRKETEAAAEVEEKRKEADAAVEV-----ERKRKEAEAAVEVEKKR 760 Query: 255 FECALQQAALVNPSLCLDRALIDVDHEVDGNHIVKIDAKTEE 130 E A +Q LV D + + + + + +++ + +E Sbjct: 761 KEEAEKQ-HLVEDQKRKDAEVAETERQRKEAEVAEVEKRRKE 801 >gb|PAA89684.1| hypothetical protein BOX15_Mlig004592g1, partial [Macrostomum lignano] Length = 1987 Score = 63.9 bits (154), Expect = 4e-07 Identities = 72/261 (27%), Positives = 116/261 (44%), Gaps = 19/261 (7%) Frame = -2 Query: 945 QADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTYAQWRVDPVGAS 766 +A+ V E+R R A E R + +R K + +V E +E+ + A Sbjct: 902 EAEDLVIEERKRA-AEAETRATEAE--TRATKAETRVTEAEALIIEEQKRATEAETRAAE 958 Query: 765 TQIFLNLAEVHKIEKQKFEQRAE------EKLELEKSKRAKLVEDMSLKLKELARVKQ-- 610 + + +AE E +K AE E L +E+ KRAK +++ L+ + A + Sbjct: 959 VETRVTVAESRAEEAEKRAVEAETCAEKAEALVIEERKRAKEIQNRLLESETRASEAESR 1018 Query: 609 --ELQTWKEKQGETVRRAEV----AEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVL 448 ELQT K AE +E R KE+E + E E EA+ + ++R Sbjct: 1019 LTELQTDVMKAETRATEAEARLTESETRAKEIEARTSEAEQRATEAEQRATEADNRATEA 1078 Query: 447 EDDL----KRASEAGAGTSMDPDARNAELLAAL-QQSRKNAKEAARIANEATKEAREAAK 283 E L KRA++AG+ S P+ +A L+ A +Q R+ A AR+ +A A AA+ Sbjct: 1079 EQRLAEAEKRATDAGSDASGSPNKSDATLIPATEEQQRRLAWTEARLL-DAEARAAAAAE 1137 Query: 282 SAIELYKEGFECALQQAALVN 220 + ++ E A +A L N Sbjct: 1138 ESDSMHARFTELAAAEAELRN 1158 >emb|CCP29690.1| Pneumococcal surface protein A, partial [Streptococcus pneumoniae] Length = 481 Score = 63.2 bits (152), Expect = 4e-07 Identities = 78/290 (26%), Positives = 123/290 (42%), Gaps = 28/290 (9%) Frame = -2 Query: 978 FVPDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTY 799 FV +PVA Q+ A + +A ++ ++ +++ K+KD EEK ++ Sbjct: 28 FVRAEEAPVASQSKAEKDYDIAKRDAENAKEALENARRAQK-KYKDDQKRTEEKAEKERK 86 Query: 798 AQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELAR 619 A Q+ L +K E E+K ++EK A E+ K EL + Sbjct: 87 ASEEEQAANLKYQLLL--------QKYGSESDREKKKQIEKQADAAK-EENERKKAELNK 137 Query: 618 VKQELQTWKEKQGETVRR-AEVAEARVKELEVQVEEREGSVAEAK--------------G 
484 ++QE+ +Q E RR AEVA+A+ L +VEE E V EAK Sbjct: 138 IRQEMVVPSSQQLEVTRRKAEVAKAKEPGLRKRVEEAEKKVTEAKQKLDAERAKEVALQA 197 Query: 483 QIRRLEDRIVVLEDDLKRASEAG----------AGTSMDPDARNAELLAALQQSRKNAKE 334 +I LE+ + LE LK E+ A + DA+ A+L + S K + Sbjct: 198 KIAELENEVHKLEQKLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEELSDKIDEL 257 Query: 333 AARIAN-EATKEAREAAKSAIELYKEGFE--CALQQAALVNPSLCLDRAL 193 A IA E +A E + + +KEG E A ++A L L +A+ Sbjct: 258 DAEIAKLEDQLKAAEENNNVEDYFKEGLEKTIAAKKAELEKTEADLKKAV 307 >ref|WP_050888657.1| choline-binding protein [Streptococcus pneumoniae] emb|COT26666.1| surface protein PspA [Streptococcus pneumoniae] Length = 551 Score = 63.2 bits (152), Expect = 5e-07 Identities = 78/290 (26%), Positives = 123/290 (42%), Gaps = 28/290 (9%) Frame = -2 Query: 978 FVPDSPSPVAHQADARVSEKRVRGEAGGEPRPVKKSKTSRRPKHKDKVVPLEEKFLEQTY 799 FV +PVA Q+ A + +A ++ ++ +++ K+KD EEK ++ Sbjct: 28 FVRAEEAPVASQSKAEKDYDIAKRDAENAKEALENARRAQK-KYKDDQKRTEEKAEKERK 86 Query: 798 AQWRVDPVGASTQIFLNLAEVHKIEKQKFEQRAEEKLELEKSKRAKLVEDMSLKLKELAR 619 A Q+ L +K E E+K ++EK A E+ K EL + Sbjct: 87 ASEEEQAANLKYQLLL--------QKYGSESDREKKKQIEKQADAAK-EENERKKAELNK 137 Query: 618 VKQELQTWKEKQGETVRR-AEVAEARVKELEVQVEEREGSVAEAK--------------G 484 ++QE+ +Q E RR AEVA+A+ L +VEE E V EAK Sbjct: 138 IRQEMVVPSSQQLEVTRRKAEVAKAKEPGLRKRVEEAEKKVTEAKQKLDAERAKEVALQA 197 Query: 483 QIRRLEDRIVVLEDDLKRASEAG----------AGTSMDPDARNAELLAALQQSRKNAKE 334 +I LE+ + LE LK E+ A + DA+ A+L + S K + Sbjct: 198 KIAELENEVHKLEQKLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEELSDKIDEL 257 Query: 333 AARIAN-EATKEAREAAKSAIELYKEGFE--CALQQAALVNPSLCLDRAL 193 A IA E +A E + + +KEG E A ++A L L +A+ Sbjct: 258 DAEIAKLEDQLKAAEENNNVEDYFKEGLEKTIAAKKAELEKTEADLKKAV 307 >gb|KIR88140.1| hypothetical protein I308_01198 [Cryptococcus gattii VGIV IND107] Length = 1123 Score = 63.5 bits (153), Expect = 5e-07 Identities = 72/231 (31%), Positives = 100/231 (43%), Gaps = 20/231 (8%) Frame = -2 Query: 921 KRVRGEAGGEPRPVKKSKTSRRPKHKD---KVVPLEEKFLEQTYAQWRVD---PVGASTQ 760 K +A E K S R K KD KV LEE+ E A + 
P G Sbjct: 860 KEAHEKASSELSLAKMSAKGREGKFKDLENKVKTLEEELGEAVKANKVTEAGPPAGTDVG 919 Query: 759 IFLNLAE--VHKIEKQKFEQRAE-EKLELEKSKR---AKLVEDMSLKLKELARVKQELQT 598 N AE + ++EK+ E++ E +KLE E K+ AK ED K +E A+ K E Sbjct: 920 DGANKAEEDLKRLEKENEEKKEELKKLEEEAKKQEEEAKKKEDEFRKKEEEAKKKDE--E 977 Query: 597 WKEKQGETVRRAEVAEARVKELEVQVEEREGSVAEAKGQIRRLEDRIVVLEDDLKRASEA 418 W K+ E + + E RVK+LE E A+ + LE +I LE+ L A+ A Sbjct: 978 WNTKEKEWEAKIKAGEDRVKQLEENSMSSEEKAKSAEEKTATLESKIKELEEKLATAASA 1037 Query: 417 GA--------GTSMDPDARNAELLAALQQSRKNAKEAARIANEATKEAREA 289 A G++ R AEL A +++ A +A E TK EA Sbjct: 1038 PAPVPAETTGGSNKQAKKRAAELDAKVKELE------ASLAEEKTKREEEA 1082