BLASTX nr result
ID: Cephaelis21_contig00008712
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00008712 (1393 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_001315960.1| hypothetical protein [Trichomonas vaginalis ... 75 5e-11 ref|XP_001311646.1| hypothetical protein [Trichomonas vaginalis ... 68 5e-09 emb|CBK20318.2| unnamed protein product [Blastocystis hominis] 68 6e-09 emb|CBH18182.1| hypothetical protein, conserved [Trypanosoma bru... 66 2e-08 gb|ELU11167.1| hypothetical protein CAPTEDRAFT_202309 [Capitella... 65 3e-08 >ref|XP_001315960.1| hypothetical protein [Trichomonas vaginalis G3] gi|121898653|gb|EAY03737.1| hypothetical protein TVAG_072290 [Trichomonas vaginalis G3] Length = 697 Score = 74.7 bits (182), Expect = 5e-11 Identities = 77/359 (21%), Positives = 152/359 (42%), Gaps = 23/359 (6%) Frame = +3 Query: 345 EEHELIDVAQSFEKQMLHEDKAIERPNESPIAEGIVNYSPEEEVME-ERTIKYANKKDAS 521 E H + + A E+ ++ I E+P + + +P EE+ + + + + NK Sbjct: 79 EVHSIGEKAAPSEQNDKENNENIVSEQETPNENQVRSENPNEEIKQYDENLPHDNKSSIE 138 Query: 522 KSEEDTEVNSDEIIYETTVNATESTQVK------DYSEDKKFP---NQDQYHSSLTVAED 674 ++ D NS I E + EST+ +++E+K+ N +Q + D Sbjct: 139 NAQSD---NSKGEITEEKSSPNESTEKSLQENSDEHTEEKENTPSNNSEQDEIENNLGND 195 Query: 675 PKREVQSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDLIQANNETRANH 854 ++++ S+P + P++ + S + + KD+ +D + + N N Sbjct: 196 EEKDLVSEPLSEETPSNDKQTNEDKSSNSDEKPQKESNVPDKDESVDSEVNSENPNENNE 255 Query: 855 MPFES--SIAASNLEITAKDPKTGTDAKIASADEEMSTMQKGPPGETLQQEIKELKPEDA 1028 +P E+ I S+ E+T K + G D + S Q+ P ET++Q+ E ED Sbjct: 256 IPTEAEEEIGKSSKEVTDKSNENGNDNNENPTSAQRSDPQEIP--ETIEQKDNE---EDQ 310 Query: 1029 FKAVAAAVSDLQQESRFKDDKMEERVT-------EQVKL-SKHTSIPKLHVDENEHKKIP 1184 + ++ +E+ + D EE +T EQ+ +K + V +N+ K+ P Sbjct: 311 NQTSNETPNESTEETPQEKDNKEELITDSPENNSEQINAQNKDREVSTNDVGKNDEKETP 370 Query: 1185 DQNEYHSSFTIDESAKDVQSELSFNKRTTEGMLPESEVTEATN---QVDNSNKVIDEKD 1352 +NE SS ++ D + +L+ + L E E N +++ SN+ E + Sbjct: 371 CENENKSS--NEQGGNDNKKDLALESEKSNETLSEKPSAEKENDDSEINPSNEKAAENE 427 >ref|XP_001311646.1| hypothetical protein [Trichomonas vaginalis G3] gi|121893464|gb|EAX98716.1| hypothetical protein TVAG_480920 [Trichomonas vaginalis G3] Length = 1996 Score = 68.2 bits (165), Expect = 5e-09 Identities = 80/373 (21%), Positives = 159/373 (42%), Gaps = 29/373 (7%) Frame = +3 Query: 345 EEHELIDVAQSFEKQMLHEDKAIERPNESPIAEGIVNYSPEEEVMEERTIKYANKKDASK 524 ++ EL + + +K+ LHE++ E+ E + E EEE +E+ + +K Sbjct: 1449 KKEELHEEEEEEKKEELHEEEEEEKKEE--LHEEEEKKEEEEEKIEKLHEEEEEEKKEEL 1506 Query: 525 SEEDTEVNSDEIIYETTVNATESTQVKDYSED----KKFPNQD-QYHSSLTVAEDPKREV 689 EE+ E +E+ E E +++ E+ K+ N++ Y+ ++ E P+ E Sbjct: 1507 HEEEEEEKKEELHEEEEKKEEEEEKIEKLHEEEEKKKEQTNEEINYNPAIKEVEGPESEE 1566 Query: 690 QSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDLIQANNETRANHMPFES 869 ++K K D K+E S+ +KQ + S + + +L + + E R N+ + Sbjct: 1567 ENK--YRKDEEDFEDEKNEESDYKKQKEES-------ESVAKELEETSTENRNNNQEETN 1617 Query: 870 SIAASNLEITAKDPKTGTDAKIASADE---EMSTMQKGPPGET---LQQEIKELKPEDAF 1031 N + + K T+ K+ D+ E + ET + QE ++++ +D Sbjct: 1618 QPKLENSSLNITEVKEETEEKVNITDDFETENQNEDRETDAETEKAIHQENEQIREKDFH 1677 Query: 1032 KAVAAAVSD--------------LQQESRFKDDKMEERVTEQVKL----SKHTSIPKLHV 1157 + QQE+ FKD++ EE+ E+ K+ K I KLH Sbjct: 1678 DEKQIENEEEKQEILKEPIEERNFQQENDFKDNEEEEKKEEEEKIEKLHEKEEKIEKLHE 1737 Query: 1158 DENEHKKIPDQNEYHSSFTIDESAKDVQSELSFNKRTTEGMLPESEVTEATNQVDNSNKV 1337 +E + +++ ++ E +E K + EL ++ E L E E E ++ + + Sbjct: 1738 EEEKKEELHEEEEKKEEQLHEEEEK--KEELHEEEKKEE--LHEEEKKEELHEEEKKEEQ 1793 Query: 1338 IDEKDDLIDDLTE 1376 + E++ + L E Sbjct: 1794 LHEEEKKEEQLHE 1806 >emb|CBK20318.2| unnamed protein product [Blastocystis hominis] Length = 863 Score = 67.8 bits (164), Expect = 6e-09 Identities = 78/355 (21%), Positives = 138/355 (38%), Gaps = 11/355 (3%) Frame = +3 Query: 324 LKNLCQLEEHELIDVAQSFEKQMLHEDKAIERPNESPIAEGIVNYSPEEEVMEERTIKYA 503 +K + + EE E + A E + + +++ E E+P + EEE EE+T K Sbjct: 326 VKKVAEKEEEE--EEAPKKEAKKVEKEEEEEEEEETPKKKETKKEEEEEEEEEEKTTKKE 383 Query: 504 NKKDASKSEEDTEVNSDEIIYETTVNATESTQVKDYSEDKKFPNQDQYHSSLTVAEDPKR 683 KK+ + EE+ E E E + K S KK +++ ED K+ Sbjct: 384 TKKEEEEEEEEEAPKKKETKKEEEEEEEEEEEEKTSS--KKETKKEE--------EDEKK 433 Query: 684 EVQSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDLIQANNE-------- 839 E + + PK E +A K+ D +E APK + + E Sbjct: 434 EEEEEEET---------PKKEVKKAEKEEDEEEEEEAPKKKEVKKAEEKEEEEEEKEEKV 484 Query: 840 -TRANHMPFESSIAASNLEITAKDPKTGTDAKIASADEEMSTMQKGPPGETLQQEIKELK 1016 T+ E E K+ K + +EE +K E ++E +E + Sbjct: 485 PTKKETKKEEEEEEEEEEEAPKKEVKKAEKEEEEEEEEEKVPTKKAEKKEEEEEEEEEKE 544 Query: 1017 PEDAFKAVAAAVSDLQQESRFKDDKMEERVTEQVKLSKHTSIPKLHVDENEHKKIPDQNE 1196 + K VA + ++E K++K+ + TE+ K +E E +K P + E Sbjct: 545 TKKETKKVAEKEEEEEEE---KEEKVPTKKTEE----------KEEEEEEEEEKAPKKKE 591 Query: 1197 YHSSFTIDESAKDVQSELSFNKRTTEGMLPESEVTEATNQVDNSNK--VIDEKDD 1355 + + K + + +K EG E++ TE ++ ++ K +KDD Sbjct: 592 EEEEEEEETATKKAEEDEDDDKEEEEGPQKETKKTEDDDEEEDDEKKSTTSKKDD 646 >emb|CBH18182.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] Length = 1520 Score = 66.2 bits (160), Expect = 2e-08 Identities = 78/350 (22%), Positives = 158/350 (45%), Gaps = 17/350 (4%) Frame = +3 Query: 372 QSFEKQMLH---EDKAIERPNESPIAEGIVNYSPEEEVMEER-------TIKYANKKDAS 521 Q F K + H +D + ++ + S + I+N E + + T++Y ++D Sbjct: 364 QQFRKVITHRVMKDHSRQQKSSSKWQKKIINLRSTEVEKKTQDPTSITATVQYTKQEDDI 423 Query: 522 KSEEDTEVNSDEIIYETTVNATES--TQVKDYSEDKKFPNQDQYHSSLTVAEDPKREVQS 695 K++++ ++ E+ T+ TE+ T +S D++ + D+ HS + + K+E+Q Sbjct: 424 KNKQNA-ISERELSTSTSQKQTETLITSNNQHSLDERHQSDDRQHSKRS---ESKKELQE 479 Query: 696 KPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDL-IQANNETRANHMPFESS 872 H NK P+ E S+ K+ S + I ++ + DDL Q + ++ + E Sbjct: 480 SLHSNKSAVKDGEPELEASQGSKKSQRSRESI--REGLADDLQSQGSQHSKRSESKKELQ 537 Query: 873 IAASNLEITAKDPKTGTDAKIASADEEMS--TMQKGPPGETLQQEIKELKPEDAFKAVAA 1046 + + + KD + +A S + S ++++G + Q + K ++ K + Sbjct: 538 ESLHSNKSAVKDGEPELEASQGSKKSQRSRESIREGLADDLQSQGSQHSKRSESKKELQE 597 Query: 1047 AVSDLQQESRFKDDKMEERVTEQVKLSKHT--SIPKLHVDENEHKKIPDQNEYHSSFTID 1220 ++ +S KD + E ++ K S+ + SI + VD+ + Q HS + Sbjct: 598 SLH--SNKSAVKDGEPELEASQGSKKSQRSRESIREGFVDD-----LQSQGSQHSKRS-- 648 Query: 1221 ESAKDVQSELSFNKRTTEGMLPESEVTEATNQVDNSNKVIDEKDDLIDDL 1370 ES K++Q L NK + PE E ++ + + S + I ++ L DDL Sbjct: 649 ESKKELQESLHSNKSAVKDGEPELEASQGSKKSQRSRESI--REGLADDL 696 >gb|ELU11167.1| hypothetical protein CAPTEDRAFT_202309 [Capitella teleta] Length = 891 Score = 65.5 bits (158), Expect = 3e-08 Identities = 78/362 (21%), Positives = 147/362 (40%), Gaps = 8/362 (2%) Frame = +3 Query: 327 KNLCQLEEHELIDVAQSFEKQMLHEDKAIERPNESPIAEGIVNYSPEEEVMEERTIKYAN 506 K + ++ E + S E++ E++ E +ES EG +EE + + +K Sbjct: 263 KKKAEKKKKEKDEDGSSSEEESSSEEETSE--HESSDEEG----DEKEEKKKVKKVKKV- 315 Query: 507 KKDASKSEEDTEVNSDEIIYETTVNATESTQVKDYSEDKKFPNQDQYHSSLTVAEDPKRE 686 KK+ K +ED + E + + E + + K+ +++ S + E E Sbjct: 316 KKEKEKKKEDDSSEEESSSEEESSSEEEEEGEEKKKKKKEKKKKEEKKGSSSSEEKSSSE 375 Query: 687 VQSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDLIQANNETRANHMPFE 866 + + G K EG E+ +S++E KDD + +A +T+ Sbjct: 376 EEEGDEKKAKAEKGKKKKEEGEESGSSEADSSEEEEDKDDKKKKVKKAEKKTKKEKEKSS 435 Query: 867 SSIAASNLEITAKDPKTGTDAKIASADEEMSTMQKGPPGETLQQE---IKELKPEDAFKA 1037 SS S+ E ++D K T K DE S ++ E +E KE K D K Sbjct: 436 SSEEESS-EEESEDEKKITKTKKEDKDESSSESEESSSEEESDEEKDKKKEKKETDKKKK 494 Query: 1038 VAAAVSDLQQE--SRFKDDKMEERVTEQVKLSKHTSIPKLHVDENEHKKIPD---QNEYH 1202 + S ++E S +D++ +++ + K + D+ E K D + + Sbjct: 495 EEESSSSSEEESSSESEDEEEDKKAKKDEKEEVKKGDKEAKKDDKEEVKKGDKEAKKDDK 554 Query: 1203 SSFTIDESAKDVQSELSFNKRTTEGMLPESEVTEATNQVDNSNKVIDEKDDLIDDLTEAN 1382 D+ K+V+ + +K+ E ES E +++ + S+ DE+D+ +D E Sbjct: 555 EEVKKDDKKKEVKETKTEDKKKAESSSDESSSEEESSEEEESS---DEEDEKVDKKKEVK 611 Query: 1383 ND 1388 D Sbjct: 612 KD 613 Score = 58.5 bits (140), Expect = 4e-06 Identities = 65/361 (18%), Positives = 152/361 (42%), Gaps = 24/361 (6%) Frame = +3 Query: 348 EHELIDVAQSFEKQMLHEDKAIERPNESPIAEGIVNYSPEEEVMEERTIKYANKK----- 512 E E + EK++ K + + S E +EE +++ K +KK Sbjct: 438 EEESSEEESEDEKKITKTKKEDKDESSSESEESSSEEESDEEKDKKKEKKETDKKKKEEE 497 Query: 513 DASKSEEDTEVNSDEIIYETTVNATESTQVKDYSEDKKFPNQDQYHSSLTVA-------- 668 +S SEE++ S++ + E +VK ++ K ++++ A Sbjct: 498 SSSSSEEESSSESEDEEEDKKAKKDEKEEVKKGDKEAKKDDKEEVKKGDKEAKKDDKEEV 557 Query: 669 --EDPKREVQSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDDLIQANNET 842 +D K+EV+ +K+ + +S E + + S+ E K D + ++ + E Sbjct: 558 KKDDKKKEVKETKTEDKKKAESSSDESSSEEESSEEEESSDEEDEKVDKKKE-VKKDEEK 616 Query: 843 RANHMPFESSIAASNLEITAKDPKTGTDAKIASADEEMSTMQKGPPGETLQQEIKELKPE 1022 + + S+ E ++ + +T ++ + +S +EE ++ E ++E+K+++ Sbjct: 617 KKEDEGGDKKKVESSSEESSSEEETSSEEESSSEEEEEKEKEEEKKKEVKKEEVKKVEES 676 Query: 1023 DAFKAVAAAVSDLQQESRFKDDKMEERVTEQVK---LSKHTSIPKLHVDENEHKKIPDQ- 1190 + + ++ +E +D+K E E+VK K + K +E + ++ Sbjct: 677 SSSEESSSEEESSSEEESSEDEKKPEVKKEEVKKEEAKKDEAKKKEESSSSEEESSSEEE 736 Query: 1191 --NEYHSSFTIDESAKDVQSELSFNKRTTEGMLPESEVT---EATNQVDNSNKVIDEKDD 1355 +E SS +E K+ + E ++ E E E + E++++ + K +++K + Sbjct: 737 SSSEEESSDEEEEPKKEEKKEAKKEEKKKEESSSEEESSSEEESSSEEEEEEKKVEKKKE 796 Query: 1356 L 1358 + Sbjct: 797 V 797 Score = 57.8 bits (138), Expect = 7e-06 Identities = 64/312 (20%), Positives = 131/312 (41%), Gaps = 13/312 (4%) Frame = +3 Query: 459 SPEEEVMEERTIKYANKKDASKSEEDTEVNSDEIIYETTVNATESTQVKDYSEDKKFPNQ 638 S EE+ EE + + ++ +++S EE++E DE E + K + KK ++ Sbjct: 216 SGEEDSEEESSTEESSSEESSSEEEESEEAEDEEEGEAKKKVKKEKSKKKAEKKKKEKDE 275 Query: 639 DQYHSSLTVAEDPKREVQSKPHLNKRPTDGMLPKSEGSEARKQVDNSNKEIAPKDDMLDD 818 D S E+ E ++ H + + + E +K+V K K+ +D Sbjct: 276 DGSSSE----EESSSEEETSEH------ESSDEEGDEKEEKKKVKKVKKVKKEKEKKKED 325 Query: 819 LIQANNETRANHMPFESSIAASNLEITAKDPKTGTDAKIASADEEMSTMQKGPPGETLQQ 998 + E + K+ K + K +S+ EE S+ ++ E + Sbjct: 326 DSSEEESSSEEESSSEEEEEGEEKKKKKKEKKKKEEKKGSSSSEEKSSSEEEEGDEKKAK 385 Query: 999 EIKELKPEDAFKAVAAAVSDLQQESRFKDDKME-----ERVTEQVKLSKHTSIPKLHVDE 1163 K K ++ + ++ +D +E KDDK + E+ T++ K +S + +E Sbjct: 386 AEKGKKKKEEGEESGSSEADSSEEEEDKDDKKKKVKKAEKKTKKEKEKSSSSEEESSEEE 445 Query: 1164 NE-HKKI-------PDQNEYHSSFTIDESAKDVQSELSFNKRTTEGMLPESEVTEATNQV 1319 +E KKI D++ S + E D + + K+ T+ E E + ++++ Sbjct: 446 SEDEKKITKTKKEDKDESSSESEESSSEEESDEEKDKKKEKKETDKKKKEEE-SSSSSEE 504 Query: 1320 DNSNKVIDEKDD 1355 ++S++ DE++D Sbjct: 505 ESSSESEDEEED 516