BLASTX nr result
ID: Cocculus23_contig00021461
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00021461
         (1693 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16839.3| unnamed protein product [Vitis vinifera]               84   2e-13
gb|EXB82454.1| hypothetical protein L484_027628 [Morus notabilis]      75   9e-11
ref|XP_002515032.1| hypothetical protein RCOM_1082870 [Ricinus c...    71   1e-09
ref|XP_629009.1| hypothetical protein DDB_G0293562 [Dictyosteliu...    65   1e-07
ref|XP_002668020.1| predicted protein [Naegleria gruberi] gi|284...    62   6e-07
ref|XP_004143053.1| PREDICTED: uncharacterized protein LOC101207...    62   1e-06
ref|WP_002830793.1| hypothetical protein [Pediococcus acidilacti...    59   5e-06
ref|WP_004166443.1| hypothetical protein [Pediococcus acidilacti...    59   5e-06
gb|EAR94175.2| THO complex subunit 1 transcription elongation fa...    59   7e-06
ref|XP_001014420.1| hypothetical protein TTHERM_00522420 [Tetrah...    59   7e-06
gb|EFN72799.1| hypothetical protein EAG_03738 [Camponotus florid...    59   7e-06
ref|XP_001315960.1| hypothetical protein [Trichomonas vaginalis ...    59   7e-06
ref|XP_002676077.1| predicted protein [Naegleria gruberi] gi|284...
59 9e-06 >emb|CBI16839.3| unnamed protein product [Vitis vinifera] Length = 1309 Score = 84.0 bits (206), Expect = 2e-13 Identities = 102/407 (25%), Positives = 179/407 (43%), Gaps = 24/407 (5%) Frame = +2 Query: 530 GMEVDETLPQK-EEKVSSKDMKKTRS----------RKKVLANKLIDTNLESSDQYRTPD 676 G EV + K + V SK +KT+S R K+ K T +ES + P Sbjct: 811 GPEVSTSSTHKIKVDVPSKIKRKTKSVKTSSTNQFKRSKLETEKDAATEVESLHPKQDPG 870 Query: 677 DKGSSQMITD-ALDDDK-GRPVIAG---------CEDKTDNHVDGDKVNFIDYFVSKKNH 823 SSQ+ + AL+++ GRP+ A C ++ ++H + + +++ Sbjct: 871 TFDSSQIPSSYALENNPVGRPLEANVDGNLLKLACINEANDHKEVSSCQSDMVNMPRQSL 930 Query: 824 HEVASSAKESLEEDNHLRKKVKETRAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKKK 1003 H+V + A+ +ED +++ + ++ K + N S A D +NS ++ +KK Sbjct: 931 HKVVAPAQVLADEDTKVKRSERVSKTNKNKKMSNVDSVATSRDLQNSLKTNKSQDVEKKS 990 Query: 1004 HLVGTLDSSQPRGSLKKDKQSELAVQPNEKGTAGARLVGQTFSQNAVLESLPRKDSKTKA 1183 +Q + L D ++L + K + +R ++ +++P + Sbjct: 991 E-----GDNQLQDPLSVDGHNKLMPESVSKFSKVSRNDLKSPHDIGKFDTIPEEIRWPNV 1045 Query: 1184 LDASEALGSRRRPDXXXXXXXXXXXXXXEHKANWETSGSSTENSQDRD-NKKKGQREILS 1360 ++AS + + S SS+++S+DR K+G+R+ S Sbjct: 1046 VNASGTSSTAHA--------------FLKENGKASLSTSSSDSSEDRTYQNKRGKRQ--S 1089 Query: 1361 QANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTST 1540 + R V K K G+ VN S++++SLLA+ +IF DG NS ST Sbjct: 1090 NLDRYRVTVRKAPRKNPGEVVNSSHQRKSLLATYGSIFNDGGSESSEDH-DGVENSDAST 1148 Query: 1541 RTPDNSS-SSDYSEGDNVTDPDTPRTGSNNTKRKENDGKHMKESKRS 1678 RTP +SS SSDY+EG+N D+ G +TKR E+ K + +S S Sbjct: 1149 RTPSDSSASSDYTEGENNQHLDSSH-GLYSTKRNESGAKSIGKSNSS 1194 >gb|EXB82454.1| hypothetical protein L484_027628 [Morus notabilis] Length = 1284 Score = 75.1 bits (183), Expect = 9e-11 Identities = 99/417 (23%), Positives = 162/417 (38%), Gaps = 16/417 (3%) Frame = +2 Query: 452 PTETVKKIKIRDGGSGDGIKNDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKVLANKL 631 P +VK + +G + Q VD+TL SSK K + +K+ + ++L Sbjct: 848 PKSSVKNPPVLQSEAGSVVH---QISPAPVDKTLVVG----SSKSKDKGKLKKRAVEDRL 900 Query: 632 IDTNLESSDQYRTPDDKGSSQMITDALDDDKGRPVIAGCEDKTDNHVDG---DKVNFIDY 802 + 
+ES + +G + + + +A N +D + D Sbjct: 901 NEEKMES-------ESRGVEKELVPT-QPAQSNSAVAKSTKVQSNSIDPKYKENAGEADP 952 Query: 803 FVSKKNHHEVASSAKESLEEDNHLRKKVKETRAVKKNQKINKHSGAALHDKE---NSFNS 973 E++ + E D V + K+ K K S + + + + +S Sbjct: 953 SDGANKDIEISGAGSEKPLPDTSSGGLVDKKTGANKDAKTPK-SKTNIENPDTYSDKISS 1011 Query: 974 SFDSRQQ-------KKKHLVGTLDSSQPRGSLKKDKQSELAVQPNEKGTAGARLVGQTFS 1132 +F S Q+ +KK G S+ P SL KD E AVQP EK Sbjct: 1012 AFQSSQKANRKQGIEKKAPAGK-SSTTPLQSLSKDNPDESAVQPTEK------------- 1057 Query: 1133 QNAVLESLPRKDSKTKALDASEALGSRRRPDXXXXXXXXXXXXXXEHKANW--ETSGSST 1306 L+ + ++K D S L S R+ K S S Sbjct: 1058 ----LQKASKTEAKASPTDVSGKLNSTRKETKMQHAVGVSGTNIQSEKNTGLASVSNSPM 1113 Query: 1307 ENSQDRDNKKKGQREILSQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXX 1486 E+S++ +K G + + R A K + K GK VN + L+A+P TIF+ Sbjct: 1114 ESSRNIISKDVGSNKHQPGMHSYRAANIKAAVKGDGKIVNSLEPTKKLIATPGTIFRDDD 1173 Query: 1487 XXXXXXXVDGEVNSGTSTRTP-DNSSSSDYSEGDNVTDPDTPRTGSNNTKRKENDGK 1654 G +S TSTRTP D S SSDYS+G++ ++ ++P GS + R ++ G+ Sbjct: 1174 SGESSEDEGGTDDSDTSTRTPSDYSQSSDYSDGESNSNFNSPERGSYASNRMKSGGR 1230 >ref|XP_002515032.1| hypothetical protein RCOM_1082870 [Ricinus communis] gi|223546083|gb|EEF47586.1| hypothetical protein RCOM_1082870 [Ricinus communis] Length = 1078 Score = 71.2 bits (173), Expect = 1e-09 Identities = 84/400 (21%), Positives = 153/400 (38%), Gaps = 16/400 (4%) Frame = +2 Query: 515 DVQADGMEVDETLPQKEEKVSS-KDMKKTRSRKKVLA----NKLIDTNLESSDQYRTPDD 679 +V A+ M+ K++ S KD+ + ++ + L+ NK+ + S+ ++ Sbjct: 684 EVSAENMDGKSRKKTKKKGTSDVKDLPELKNENEKLSAPAGNKIREAEYSSNGPLKSQSS 743 Query: 680 KGSSQMITDALDDDKGRPVIAGCEDKTDNHVDG----------DKVNFIDYFVSKKNHHE 829 +G + + G K+ + ++G +NF +YFV ++ ++ Sbjct: 744 QGQPHKTKSNREGRCLEAAVNGNPSKSGHAIEGTCNLDVSCESSGINFKNYFVPRQQSNK 803 Query: 830 VASSAKESLEEDNHLRKKVKETRAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKKKHL 1009 + S + +++ + E + + +K+ HS D +NS++ + D K Sbjct: 804 IVGSDEALVDKATKTMEAYGEMKGNENKKKLGAHSHGPSPDLQNSYSLTEDHGVGAKPLK 863 Query: 1010 VGTLDSSQPRGSLKKDKQSELAVQPNEKGTAGARLVGQTFSQNAVLESLPRKDSKTKALD 1189 
V + P S K DK + + NA+ S +K K Sbjct: 864 VSDSEVKAPLPS-KSDKLDSAS---------------ENTRSNALKPSATSTHAKNKKAG 907 Query: 1190 ASEALGSRRRPDXXXXXXXXXXXXXXEHKANWETSGSSTENSQDRDNKKKGQREILSQAN 1369 + +L S + + N +G +R N ++ Sbjct: 908 SVSSLESSKDTNFL----------------NRRVNGPQLHEDDNRMNSRR---------- 941 Query: 1370 HGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTSTRTP 1549 TS + VNGS K SL+ +IFK D NS STRTP Sbjct: 942 --------TSTINSREVVNGSQHKRSLIGVSDSIFKDVTDEASSTEDD---NSDASTRTP 990 Query: 1550 -DNSSSSDYSEGDNVTDPDTPRTGSNNTKRKENDGKHMKE 1666 D S SSDYS+G++ D ++P GSN+ KRK+ K +++ Sbjct: 991 SDKSLSSDYSDGESNADFNSPLNGSNSCKRKDGGQKTIRK 1030 >ref|XP_629009.1| hypothetical protein DDB_G0293562 [Dictyostelium discoideum AX4] gi|60462372|gb|EAL60593.1| hypothetical protein DDB_G0293562 [Dictyostelium discoideum AX4] Length = 527 Score = 64.7 bits (156), Expect = 1e-07 Identities = 86/426 (20%), Positives = 164/426 (38%), Gaps = 26/426 (6%) Frame = +2 Query: 407 DGYSASEANSENILSPTETVKKIKIRDGGSGDGIKNDVQADGMEVDETLPQKEEKVSSKD 586 D +S ++S + S E KK +I+ + IK +D + D + E++ KD Sbjct: 114 DSSDSSSSDSSSSESEDEKKKKKEIKKVETKKPIKKVESSDSSDSDSSDSSSEDEKKKKD 173 Query: 587 MKKTRSRK----KVLANKLIDTNLESSDQYRTPDDKGSSQMITDALDDDKGRPV------ 736 KK ++K KV K+ ESSD + D SS+ +++ D+ K + Sbjct: 174 NKKVETKKVETKKVETKKVETKKEESSDSDSSDSDSSSSE--SESEDEKKKKDTKKVEIK 231 Query: 737 ------IAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEEDNHLRKKVKETR 898 + + D + D KV K+ + SS+ ES ED +K K+ Sbjct: 232 KEESESESSESESEDENKDNKKVG-----TKKEESSDSDSSSSESESEDEKKKKNNKKVE 286 Query: 899 AVKKNQKINKHSGAALHDKENSFNSSFDSRQQKKK----------HLVGTLDSSQPRGSL 1048 A KK + + S + +S +SS +S + +K G+ DS S+ Sbjct: 287 A-KKEESSDSESESESESSSSSSSSSSESESEDEKKKKDSKKVETKKEGSSDSESSSESV 345 Query: 1049 KKDKQSELAVQPNEKGTAGARLVGQTFSQNAVLESLPRKDSKTKALDASEALGSRRRPDX 1228 + +K V+ ++ ++ + S++ E + + +TK ++S + Sbjct: 346 EDEKMDIEKVEIKKEESSDSESSSPASSESKEEEKMDIEKEETKKEESSSSSSESEEEQK 405 Query: 1229 XXXXXXXXXXXXXEHKANWETSGSSTENSQDRDNKKKGQREILSQANHGRGAVGKTSGKK 1408 E + E S SS+E+ D KKK + S+++ + Sbjct: 406 
KSKKEDSDSDESSEDEKKKEESSSSSES---EDEKKKEDSD--SESSEDEKKKEDSDSSS 460 Query: 1409 MGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTSTRTPDNSSSSDYSEGDN 1588 + + KK+ +S K D +S + + + +SSSS S ++ Sbjct: 461 SSESEDEDKKKKDSSSSESESEKESDSSSSSSESDSSSSSDSDSSSSSSSSSSSSSSSES 520 Query: 1589 VTDPDT 1606 ++ D+ Sbjct: 521 ESESDS 526 >ref|XP_002668020.1| predicted protein [Naegleria gruberi] gi|284081051|gb|EFC35276.1| predicted protein [Naegleria gruberi] Length = 449 Score = 62.4 bits (150), Expect = 6e-07 Identities = 96/436 (22%), Positives = 162/436 (37%), Gaps = 10/436 (2%) Frame = +2 Query: 416 SASEANSENILSPTETVKKIKIRDGGSGDGIKNDVQADGMEVDETLPQKEEKVSSKDMKK 595 S E E I+ + KI + K D + + ++ DE P + K K +K Sbjct: 8 SKEETKPEKIVEHKKESTSSKIEK--KSEESKPDTKVEQVKSDEKKPDIKSKTEKKKEEK 65 Query: 596 TRSRKKVLANKLIDTNLESSDQYRTPDDKGS---SQMITDALDDDKGRPVIAGCEDKTDN 766 T S K + N E DDK S S+ D LDD+K + + K D Sbjct: 66 TSSHKDDVKNS------EKKKSEAKKDDKKSEKKSEHKDDELDDNKIQIKSENLQKKVDE 119 Query: 767 H-----VDGDKVNFIDYFVSKKNHHEVASSAKESLEEDNHLRKKVKETRAVKKNQKINKH 931 + + DK+ + +K+ + SA + LEE N KK + ++ KK K H Sbjct: 120 NKKKSDMKDDKMLNEN---KEKSKTDTKKSAGKKLEESND--KKSNDKKSEKKEDKAG-H 173 Query: 932 SGAALHDKENSFNSSFDSRQQKKKHLVGTLDSSQPRGSLKKDKQSELAVQ-PNEKGTAGA 1108 ++ D++ S + D + KK + K+DK+SE + + NEK A Sbjct: 174 KDDSMKDEKKSEKKADDKEETNKKR-------DDEKSEHKEDKKSENSDKMTNEKNKAN- 225 Query: 1109 RLVGQTFSQNAVLESLPRKDSKTKALDASEALGSRRRPDXXXXXXXXXXXXXXEHKANWE 1288 + + + + +K SK + E++ ++ D K+N + Sbjct: 226 -------DKKSDKDDVKKKSSKKSEENEKESIEHKK--DSETSKPDSKMQVKSSEKSNLK 276 Query: 1289 TSGSSTENSQDRDNKKKGQREILSQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKT 1468 T +E+ D+ + KK S+AN K K S+ K+ K Sbjct: 277 TDEKKSEHKDDKKSDKKANPLDKSEANKDEKKTHDDENKSDKKDYKKSDHKDEKKTDKK- 335 Query: 1469 IFKXXXXXXXXXXVDGEVNSGTSTRTPDNSSSSDYSEGDNVTDPDTPRTGSNNTKRKEN- 1645 D E S ++ S D + D + DT + + K+ E+ Sbjct: 336 -----LDKDNKKTHDDEKKSEKDEKSEKKSYDEDEKKVDKKS--DTKKDDKKSDKKSEHK 388 Query: 1646 DGKHMKESKRSFDPKN 1693 D K + K+S D K+ Sbjct: 389 DDKKSDKEKKSDDKKD 404 >ref|XP_004143053.1| PREDICTED: 
uncharacterized protein LOC101207835 [Cucumis sativus] Length = 1107 Score = 61.6 bits (148), Expect = 1e-06 Identities = 100/397 (25%), Positives = 164/397 (41%), Gaps = 19/397 (4%) Frame = +2 Query: 512 NDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKVLANKLIDTNLES---SDQYRTPDDK 682 N QA ++D + +K +K MK T + A + + NL+S S+ +P Sbjct: 686 NKTQAVAKDMDGQVRKKTKKRPVASMKSTPDLQ---AESIEEENLDSTRFSEVEISPSYC 742 Query: 683 GSSQMITDALDDD------KGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSA 844 S+ + +L+ + R V A T + KV+ ++ S+ N + +A Sbjct: 743 KKSKTVRSSLNPSHISEGYEDRYVEANRFSNTTEDCNTGKVDDVEV-PSESNKVGIEENA 801 Query: 845 KE------SLEEDNHLRKKVKET--RAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKK 1000 L+ DN R+K T +A +K + + S AA +N+ S D + + Sbjct: 802 DRFQHESVKLQVDNLSREKSVNTLLKAKRKKKDPSACSSAASLSMQNAQKS--DENTENE 859 Query: 1001 KHLVGTLDSS-QPRGSLKKDKQSELAVQPNEKGTAGARLVGQTFSQNAVLESLPRKDSKT 1177 H + + S+ Q RGS KDK + N+ + S+ V +SLP + K Sbjct: 860 GHCLTSNSSALQLRGSSSKDKCDAMLHVDNKL---------KKISRGGV-KSLPSNEPKQ 909 Query: 1178 KALDASEALGSRRRP-DXXXXXXXXXXXXXXEHKANWETSGSSTENSQDRDNKKKGQREI 1354 K D+++A G R + D K + S+ N D K+KG + Sbjct: 910 KTSDSNQADGVRGKVVDSSRDSTEIYSETSSLPKTKPKMKKSA--NMVYHDQKRKGHQS- 966 Query: 1355 LSQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGT 1534 GR G+ S + K V S ++ LL S IFK G V+S Sbjct: 967 ---TGIGRPEGGRKSSQTGKKDVTQSQRRNVLLTSGG-IFKDASSDSSEDEA-GIVDSDA 1021 Query: 1535 STRTPDNSSSSDYSEGDNVTDPDTPRTGSNNTKRKEN 1645 ST++PDNS SD+S+G++ D RT ++RK + Sbjct: 1022 STKSPDNSQISDFSDGESNESVDLERTNIRRSRRKND 1058 >ref|WP_002830793.1| hypothetical protein [Pediococcus acidilactici] gi|357540561|gb|EHJ24574.1| subtilisin-like serine protease [Pediococcus acidilactici MA18/5M] Length = 3481 Score = 59.3 bits (142), Expect = 5e-06 Identities = 81/437 (18%), Positives = 154/437 (35%), Gaps = 40/437 (9%) Frame = +2 Query: 482 RDGGSGDGIKNDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKVLANKL-IDTNLESSD 658 RD S +D Q D ++ K++ S K S K ++K+ ++ S+ Sbjct: 1141 RDSQSRSTSTSDKQ-DSESKSASISDKQDSESKSTSDKQESESKSESDKVESESKSASTS 1199 Query: 659 
QYRTPDDKGSSQMITDALDDDKGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVAS 838 + D K +S D + R + +D +++ DK S +H + Sbjct: 1200 DKQDSDSKSASNSDNQDSRDSESRSISQSDKDDSESKSTSDKQESESKSASTSDHQDSQD 1259 Query: 839 SAKESLEE------------DNHLRKKVKETRAVKKNQKINKHSGAALHDKENSFNSSFD 982 S S+ + D H + + + + +N + ++ + DKE S + S Sbjct: 1260 SESRSISQSDKEESESKSTSDKHESESISASNSDNQNSQDSESRSISQSDKEESESKSTS 1319 Query: 983 SRQQKKKHLVGTLDSSQPRGSLKKDKQSELAVQPNEKGTAGARLVGQTFSQNAVLESLPR 1162 +Q + DS R S +S + + A + S++ + + Sbjct: 1320 DKQDSESRSASKSDSQDSRDS---QSRSTSTSDKQDSESKSASTSDKQDSESKSASTSDK 1376 Query: 1163 KDSKTKALDASEALGSRRRPDXXXXXXXXXXXXXXEHKANWETSGSSTENSQDRDNKKKG 1342 +DS +K+ S+ SR E+ S NS ++D++ Sbjct: 1377 QDSDSKSASTSDNQDSRDSESRSISQSDKDESESKSTSDKHESESISASNSDNQDSRDSE 1436 Query: 1343 QREILSQANHGRGAVGKTSGKK------------------MGKFVNGSNKKESLLASPKT 1468 R I SQ++ TS K+ + + S+K++S S T Sbjct: 1437 SRSI-SQSDKDDSESKSTSDKQDSESRSASKSDSQDSRDSQSRSTSTSDKQDSESKSAST 1495 Query: 1469 IFKXXXXXXXXXXVDGEVNSGTSTRTPDNSSSSDYSEGDNVTDPDTPRTGSNNTKRK--- 1639 K D + + S T DN S D SE +++ D + S +T K Sbjct: 1496 SDKQDSESKSASTSDKQDSDSKSASTSDNQDSRD-SESRSISQSDKDESESKSTSDKHES 1554 Query: 1640 ------ENDGKHMKESK 1672 E+D ++ ++S+ Sbjct: 1555 ESISASESDNQNSRDSE 1571 >ref|WP_004166443.1| hypothetical protein [Pediococcus acidilactici] gi|304328006|gb|EFL95229.1| KxYKxGKxW signal domain protein [Pediococcus acidilactici DSM 20284] Length = 3030 Score = 59.3 bits (142), Expect = 5e-06 Identities = 81/436 (18%), Positives = 167/436 (38%), Gaps = 15/436 (3%) Frame = +2 Query: 416 SASEANSENILS---PTETVKKIKIRDGGSGDGIKNDVQADGMEVDETLPQ--KEEKVSS 580 S S +NS+N S + ++ + D S +D Q ++ Q KEE S Sbjct: 921 SKSASNSDNQDSRDSESRSISQSDKHDSESKSASTSDHQDSQDSESRSISQSDKEESESK 980 Query: 581 KDMKKTRSRKKVLANKLIDTNLESSDQYRTPDDKGSSQMITDALDDDKGRPVIAGCEDKT 760 K S K ++K D+ S+ + + D + S T D A DK Sbjct: 981 STSDKHDSESKSESDKH-DSESRSASKSDSQDSRDSQSRSTSTSDKQDSESKSASTSDKQ 1039 Query: 761 DNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEEDNHLRKKVKETRAVKKNQKINKHSGA 940 ++ S+ + HE S +K + + D H + + + +N + ++ 
Sbjct: 1040 ESESKSTSDKQESESKSESDKHE--SESKSASDSDKHDSESRSASTSDNQNSQDSESRSI 1097 Query: 941 ALHDKENSFNSSFDSRQQKKKHLVGTLD-----SSQPRGSLKKDKQSELAVQPNEKGTAG 1105 + DK++S + S +Q+ + T D S+ R + DK+ + ++K + Sbjct: 1098 SQSDKDDSESKSTSDKQESESKSASTSDHQDSQDSESRSISQSDKEESESKSTSDKHESE 1157 Query: 1106 ARLVGQTFSQNAVLESLPRKDSKTKALDASEALGSRRRPDXXXXXXXXXXXXXXEHKANW 1285 + + +QN+ +DS+++++ S+ S + + + + Sbjct: 1158 SISASNSDNQNS-------QDSESRSISQSDKEESESKSTSDKQDSESRSASKSDSQDSR 1210 Query: 1286 ETSGSSTENSQDRDNKKKGQREILSQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPK 1465 ++ ST S +D++ K Q + + TS K+ + + S+K ES S Sbjct: 1211 DSQSRSTSASDKQDSESKSASTSDKQESESK----STSDKQESESKSESDKVESESKSAS 1266 Query: 1466 TIFKXXXXXXXXXXVDGEVNSGTSTRT-----PDNSSSSDYSEGDNVTDPDTPRTGSNNT 1630 T K D + + + +R+ D+S S S DN D+ + + Sbjct: 1267 TSDKQDSDSKSASNSDNQDSRDSESRSISQSDKDDSESKSASTSDNQDSRDSESRSISQS 1326 Query: 1631 KRKENDGKHMKESKRS 1678 +++++ K + + S Sbjct: 1327 DKEDSESKSTSDKQDS 1342 >gb|EAR94175.2| THO complex subunit 1 transcription elongation factor [Tetrahymena thermophila SB210] Length = 1181 Score = 58.9 bits (141), Expect = 7e-06 Identities = 96/529 (18%), Positives = 203/529 (38%), Gaps = 16/529 (3%) Frame = +2 Query: 146 PENIDGEDDVSFPIEKTKTSSVVDGNHDNIGTNLADETENKDAHXXXXXXXXXXXXXXXX 325 PE + ED I+KT+ SS N++N AD++ N ++ Sbjct: 606 PEQVQAEDLKKSKIDKTEKSS--RSNNENDENPQADQSRNTNSSGSNFSSRQDEKSKSQP 663 Query: 326 HALSSGVEHTDTDAVKPVGFNRVNAFVDGYSASEANSENILSPTETVKKIKIRDGGSGDG 505 +S+ E + + +P N N + S +N+E + P+ + K + G Sbjct: 664 QKISNQSE-SKQNQDQPKNNNSSN------NQSSSNAEKVNQPSNSNNISKNSNDNEQRG 716 Query: 506 IKNDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKVLANKLIDTNLESSDQYRTPDDKG 685 N+ + + +E E S D+K S+ N+ + + + + D+K Sbjct: 717 KHNEDKQKKDDKNERQNYNNEN-SKSDLKIDESKNSRQDNEK-ERRISQENNRQQGDEKA 774 Query: 686 SSQMITDALDDDKGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEED 865 + ++ + +++++ R +++ +++ K D S + S K Sbjct: 775 NIKVSDEQINNERPR------QNRQESNFSDSKNE--DVKSSSNKEDKSKSDDKNERSNQ 826 Query: 866 NHLRKKVKETRAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKK---KHLVGTLDSSQP 1036 N 
+K+V + + K K + ++ F SS D+R+ K K D+ + Sbjct: 827 NQSQKQVSDDKYKNKVDSKQKDEKQQIDEENRRFQSSEDNRKTSKDESKRFYNQEDNRKN 886 Query: 1037 RGSLKK---DKQSELAVQPNEKGTAGARLVGQTFS-QNAVLESLPRKDSKTKALDASEAL 1204 +K D + + Q +K + G S QN++ S + +++ ++ +S + Sbjct: 887 NDESRKNNEDGRKNIEDQGFDKNDNQKQFQGNNNSNQNSINISSSKNNNQQQSNASSSSS 946 Query: 1205 GSR----RRPDXXXXXXXXXXXXXXEHKANWETSG-----SSTENSQDRDNKKKGQREIL 1357 S+ ++ + + +N + SG ++ NS+ + N KK + Sbjct: 947 SSKNVDSQKNEPKQGDESNKSQNQVSNNSNNQGSGNFSNMNNNNNSKPQLNLKKNDPPNV 1006 Query: 1358 SQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTS 1537 SQ N G + G+T G+ K K+ + +S +SG+S Sbjct: 1007 SQVNSGNSSGGRTQGRSRSK-----EKQSNTYSS---------------------SSGSS 1040 Query: 1538 TRTPDNSSSSDYSEGDNVTDPDTPRTGSNNTKRKENDGKHMKESKRSFD 1684 +NS+++ YS G N +P+ + N+ + G + S++ Sbjct: 1041 RNNTNNSNNNQYSSGGN-NNPNGNNSNYNSNSNYQQGGNSQYSNSNSYN 1088 >ref|XP_001014420.1| hypothetical protein TTHERM_00522420 [Tetrahymena thermophila] Length = 1224 Score = 58.9 bits (141), Expect = 7e-06 Identities = 96/529 (18%), Positives = 203/529 (38%), Gaps = 16/529 (3%) Frame = +2 Query: 146 PENIDGEDDVSFPIEKTKTSSVVDGNHDNIGTNLADETENKDAHXXXXXXXXXXXXXXXX 325 PE + ED I+KT+ SS N++N AD++ N ++ Sbjct: 649 PEQVQAEDLKKSKIDKTEKSS--RSNNENDENPQADQSRNTNSSGSNFSSRQDEKSKSQP 706 Query: 326 HALSSGVEHTDTDAVKPVGFNRVNAFVDGYSASEANSENILSPTETVKKIKIRDGGSGDG 505 +S+ E + + +P N N + S +N+E + P+ + K + G Sbjct: 707 QKISNQSE-SKQNQDQPKNNNSSN------NQSSSNAEKVNQPSNSNNISKNSNDNEQRG 759 Query: 506 IKNDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKVLANKLIDTNLESSDQYRTPDDKG 685 N+ + + +E E S D+K S+ N+ + + + + D+K Sbjct: 760 KHNEDKQKKDDKNERQNYNNEN-SKSDLKIDESKNSRQDNEK-ERRISQENNRQQGDEKA 817 Query: 686 SSQMITDALDDDKGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEED 865 + ++ + +++++ R +++ +++ K D S + S K Sbjct: 818 NIKVSDEQINNERPR------QNRQESNFSDSKNE--DVKSSSNKEDKSKSDDKNERSNQ 869 Query: 866 NHLRKKVKETRAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKK---KHLVGTLDSSQP 1036 N +K+V + + K K + ++ F SS D+R+ K K D+ + Sbjct: 870 
NQSQKQVSDDKYKNKVDSKQKDEKQQIDEENRRFQSSEDNRKTSKDESKRFYNQEDNRKN 929 Query: 1037 RGSLKK---DKQSELAVQPNEKGTAGARLVGQTFS-QNAVLESLPRKDSKTKALDASEAL 1204 +K D + + Q +K + G S QN++ S + +++ ++ +S + Sbjct: 930 NDESRKNNEDGRKNIEDQGFDKNDNQKQFQGNNNSNQNSINISSSKNNNQQQSNASSSSS 989 Query: 1205 GSR----RRPDXXXXXXXXXXXXXXEHKANWETSG-----SSTENSQDRDNKKKGQREIL 1357 S+ ++ + + +N + SG ++ NS+ + N KK + Sbjct: 990 SSKNVDSQKNEPKQGDESNKSQNQVSNNSNNQGSGNFSNMNNNNNSKPQLNLKKNDPPNV 1049 Query: 1358 SQANHGRGAVGKTSGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTS 1537 SQ N G + G+T G+ K K+ + +S +SG+S Sbjct: 1050 SQVNSGNSSGGRTQGRSRSK-----EKQSNTYSS---------------------SSGSS 1083 Query: 1538 TRTPDNSSSSDYSEGDNVTDPDTPRTGSNNTKRKENDGKHMKESKRSFD 1684 +NS+++ YS G N +P+ + N+ + G + S++ Sbjct: 1084 RNNTNNSNNNQYSSGGN-NNPNGNNSNYNSNSNYQQGGNSQYSNSNSYN 1131 >gb|EFN72799.1| hypothetical protein EAG_03738 [Camponotus floridanus] Length = 3385 Score = 58.9 bits (141), Expect = 7e-06 Identities = 116/561 (20%), Positives = 199/561 (35%), Gaps = 48/561 (8%) Frame = +2 Query: 146 PENIDG--EDDVS-------FPIEKTKTSSVVDGNHDN-----IGTNLADETENKDAHXX 283 PE+ G EDD S P + S D +DN I N +D+ + KD H Sbjct: 102 PEDYSGSKEDDQSDEITIPKIPPDNQSNSEENDNKNDNQSDDNINDNQSDKNKKKDKHSH 161 Query: 284 XXXXXXXXXXXXXXHALSSGVEHT-----DTDAVKPVGFNRVNAFVDGYSASEANSENIL 448 SS +HT + DA K +GF S E + Sbjct: 162 EDS--------------SSNEKHTKKPKKEKDASKEMGFTTERITTQRVSPEEYSESKED 207 Query: 449 SPTETVKKIKIRDGGSGD----GIKNDVQADGMEVDETLPQKEEKVSSKDMKKTRSRKKV 616 ++ + KI G + +ND Q+D E + ++ + K KK + K Sbjct: 208 DQSDEITIPKIPPGNQSNSEEHSNENDNQSDKNEKKDKHSHEDSSSNEKHTKKPKKEKD- 266 Query: 617 LANKLIDTNLESSDQYRTPDDKGSSQMITDALDD---DKGRPVIAGCEDKTDNHVDGDKV 787 A+K I+ + R ++ S D D+ K P ++ DN D Sbjct: 267 -ASKEIEITTQKITTQRVSPEEYSGSKEDDQSDEITIPKIPPDNQSNSEENDNKNDNQSD 325 Query: 788 NFIDYFVSKKNHHEVASSAKESLEEDNHLRKKVKETRAVKK----NQKI-------NKHS 934 + I+ S KN + S ++S + H +K KE A K+ Q+I +S Sbjct: 326 DNINDNQSDKNKKKDKHSREDSGSNEKHTKKPKKENDASKEIEITTQQITTQHILPEDYS 385 Query: 935 
GAALHDKENSFN-SSFDSRQQKKKHLVGTLDSSQPRGSLKKDKQSELAVQPNEKGTA--- 1102 G+ D+ + Q + +Q + KKDK + NEK T Sbjct: 386 GSKEDDQSDEITIPKIPPGNQSNSEEHSNENDNQSDKNKKKDKHNHEDSSSNEKHTKKPK 445 Query: 1103 ----GARLVGQTFSQNAVLESLPRKDSKTKALDASEALGSRRRPDXXXXXXXXXXXXXXE 1270 ++ +G T + P + S++K D S+ + + P Sbjct: 446 KEKDASKEMGFTTERITTQRVSPEEYSESKEDDQSDEITIPKIPPG-------------- 491 Query: 1271 HKANWETSGSSTENSQDRDNKKKGQREILSQANHGRGAVGKTSGKKMGKFVNGSNKKESL 1450 +++N E + +N D++ KK S +N K K ++K+ Sbjct: 492 NQSNSEEHSNENDNQSDKNEKKDKHSHEDSNSNE--------KHTKKPKKEKDASKEMGF 543 Query: 1451 LASPKTIFKXXXXXXXXXXVDGEVNSGTSTRTP--DNSSSSDYS-EGDNVTDPDTPRTGS 1621 T + D + + T + P + S+S ++S E DN +D + + Sbjct: 544 TTERITTQRVSPEEYSESKEDDQSDEITIPKIPPGNQSNSEEHSNENDNQSDKNEKKDKH 603 Query: 1622 NNTKRKENDGKHMKESKRSFD 1684 ++ N+ KH K+ K+ D Sbjct: 604 SHEDSSSNE-KHTKKPKKEND 623 >ref|XP_001315960.1| hypothetical protein [Trichomonas vaginalis G3] gi|121898653|gb|EAY03737.1| hypothetical protein TVAG_072290 [Trichomonas vaginalis G3] Length = 697 Score = 58.9 bits (141), Expect = 7e-06 Identities = 116/570 (20%), Positives = 214/570 (37%), Gaps = 38/570 (6%) Frame = +2 Query: 71 AEPADVAGEVIPPRSMPEKAFDCVHPENIDGEDDVSFPIEKTKTSSVVDGNHDNIGTNLA 250 A+ + GE+ +S P ++ + EN D + EK T S + D I NL Sbjct: 140 AQSDNSKGEITEEKSSPNESTEKSLQENSDEHTE-----EKENTPSN-NSEQDEIENNLG 193 Query: 251 DETENKDAHXXXXXXXXXXXXXXXXHALSSGVEHTDTDAVKPVGFNRVNAFVDGYSASEA 430 ++ E KD SS ++D K + VD SE Sbjct: 194 ND-EEKDLVSEPLSEETPSNDKQTNEDKSS---NSDEKPQKESNVPDKDESVDSEVNSEN 249 Query: 431 NSENILSPTETVKKI-----KIRDGGSGDGIKNDV------QADGMEVDETLPQKEEKV- 574 +EN PTE ++I ++ D + +G N+ ++D E+ ET+ QK+ + Sbjct: 250 PNENNEIPTEAEEEIGKSSKEVTDKSNENGNDNNENPTSAQRSDPQEIPETIEQKDNEED 309 Query: 575 -----------SSKDMKKTRSRKKVLA-----NKLIDTNLESSDQYRTPDDKGSS-QMIT 703 S+++ + + K+ L N N ++ D+ + +D G + + T Sbjct: 310 QNQTSNETPNESTEETPQEKDNKEELITDSPENNSEQINAQNKDREVSTNDVGKNDEKET 369 Query: 704 DALDDDKGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEED-NHLRK 880 +++K G ++K D ++ +K N + E S+ KE+ + + N + Sbjct: 
370 PCENENKSSNEQGGNDNKKDLALESEKSN--------ETLSEKPSAEKENDDSEINPSNE 421 Query: 881 KVKETRAVKKNQKINKHSGAALHDKENSFNSSFDSRQQKKKHLVGTLDSSQPRGSLKKDK 1060 K E K N + D+ + + + + Q+ K+ G ++SS + Sbjct: 422 KAAENEPEMKQHDTNDIKPSDKEDENQIKSENSEEKPQQVKNAPGEVNSST-----SSTE 476 Query: 1061 QSELAVQPNEKGTAGARLVGQTFS-------QNAVLESLPRKDSKTKALDASEALGSRRR 1219 + E NE + V + S QNA LE+L +D+ K + SE S + Sbjct: 477 EKETPSDNNESNLSNTPAVNEKESNENSEENQNAKLENL-NEDNSIKDENNSEETPSETK 535 Query: 1220 PDXXXXXXXXXXXXXXEHKANWETSGSSTENSQ-DRDNKKKGQREILSQANHGRGAVGKT 1396 +++ E G+S ENS + D + Q E + + + + Sbjct: 536 ITSSNENETKEPDSEKQNEVKPENVGASPENSSTNEDGSEIKQPETNNSSTNEEDKSREN 595 Query: 1397 SGKKMGKFVNGSNKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTSTRTPDNSSSSDYS 1576 GK + N K+ K + E++S D S S+ Sbjct: 596 EGKPSNEQNNSEEKQSQESVRDKDEITPNMSSSKEENKENEISSN------DESKQSEKE 649 Query: 1577 EGDNVTDPDTPRTGSNNTKRKENDGKHMKE 1666 E + V++ +TP +++ EN+ K KE Sbjct: 650 E-EIVSEKETPNETEFKSQKGENEQKENKE 678 >ref|XP_002676077.1| predicted protein [Naegleria gruberi] gi|284089677|gb|EFC43333.1| predicted protein [Naegleria gruberi] Length = 2645 Score = 58.5 bits (140), Expect = 9e-06 Identities = 86/409 (21%), Positives = 147/409 (35%), Gaps = 26/409 (6%) Frame = +2 Query: 542 DETLPQKEEKVSSKDMKKTRSRKKVLANKLIDTNLESSDQYRTPDDKGSSQMITDALDDD 721 ++T+ K++ SSK KK+ K DT +E + PD K ++ Sbjct: 141 EKTVEHKKDSTSSKIEKKSEESKP-------DTKVEHKFDEKKPDIKSKTEKKK------ 187 Query: 722 KGRPVIAGCEDKTDNHVDGDKVNFIDYFVSKKNHHEVASSAKESLEEDNHLRKKVKETRA 901 E+KT +H D K + S+ + + + + E N + K + + Sbjct: 188 ---------EEKTSSHKDDVKNSEKKAQKSELKEDKKSENYNKMTNEKNKMNDKKSDVKK 238 Query: 902 VKKNQKINKHSGAALHDKEN--SFNSSFDSRQQKKKHLVGTLDSSQPRGSLKKDKQSELA 1075 KN + NK H+K N S S DS+ Q K L + + K+DK+S+ Sbjct: 239 SSKNLEENKKESVE-HNKANKDSETSKPDSKMQVKSSEKSNLKKDEKKSEHKEDKKSDKK 297 Query: 1076 VQPNEKGTAGARLVGQTFSQNAVLESLPRKDSKTKALD-ASEALGSRRRPDXXXXXXXXX 1252 +K ++ ++S K+ D SE GS D Sbjct: 298 FDKKKKSDNKKDEKKSDDKKSTEIKSDKTDHKKSNKDDKKSEKKGS---TDKKKTDGEKS 354 Query: 1253 
XXXXXEHKANWETSGSSTENSQDRDNKKKGQREILSQANHGRGAVGKTSGKKMGKFVNGS 1432 + K++ S +++++ + D KK ++ + +V K K K N + Sbjct: 355 DKKTKDVKSDNPDSKNTSKSKKSVDKKKTNTKKDKKTNKEEKKSVEKNEKKIGKKSTNTA 414 Query: 1433 NKKESLLASPKTIFKXXXXXXXXXXVDGEVNSGTSTRTPDNSSS---------------S 1567 KK++ ++ KT K E T++ P SS S Sbjct: 415 TKKDTTKSTKKTDKKSSENKKTSGSKKAEPKKKTNSTKPGKKSSTKKLSSTKSKSIGKKS 474 Query: 1568 DYSEGDNVTDPDTPRTG--------SNNTKRKENDGKHMKESKRSFDPK 1690 D EG+ P T TG S + K+ E +K+ K+S K Sbjct: 475 DIKEGNKNAKPKTTDTGKKSASDKKSKSDKKSEKSPTSVKKDKKSTKSK 523