BLASTX nr result
ID: Mentha25_contig00023571
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00023571
         (3140 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EQB59971.1| hypothetical protein NAPIS_ORF02494 [Nosema apis ...     65   3e-07
gb|EZA54964.1| hypothetical protein X777_04427 [Cerapachys biroi]       64   4e-07
ref|XP_001310306.1| hypothetical protein [Trichomonas vaginalis ...     62   1e-06
ref|XP_001308810.1| hypothetical protein [Trichomonas vaginalis ...     61   3e-06
ref|XP_001582487.1| viral A-type inclusion protein [Trichomonas ...     60   6e-06

>gb|EQB59971.1| hypothetical protein NAPIS_ORF02494 [Nosema apis BRL 01]
          Length = 954

 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 92/415 (22%), Positives = 164/415 (39%), Gaps = 11/415 (2%)
 Frame = +3

Query: 1119 TTEDQKYYEEAHNLDLDRNINIEDNSFGPSSISVDSDQGFDTLSDTQTTEPDTSTHLGHE 1298
            T +++ ++ ++L+ +I E++S +D T+ + TT+ + E
Sbjct: 12   TKKEESREDQEAPIELNESIKKEESSKESMKDKLDKS----TMKEKDTTKKENEESTKKE 67

Query: 1299 ---NEKKYAKKLQEKIKADLTKIAQMNRNVSSKNSPQNVEEEEKSCGITTSFNYDAKFDL 1469
            N+++ KL E K +M +++ + S + + EK T + +
Sbjct: 68   ETINKQESTNKLNESEKEK-----EMRKSILDQESTDKINKSEKEKEETEKSTTEKEESE 122

Query: 1470 KEKSREGQVSDILMSKQISNEKNEFKDEIDQELSEIIAEAKTKNRMGPEPNDCTYLPSTS 1649
            K +E +V I +KQ+SN K E + ID++ E+ KN EP T
Sbjct: 123  KSTIKEEEVESI--NKQVSNIKEEHEKSIDEQ------ESTIKNEKNIEPERSTINEEEH 174

Query: 1650 EVTQFIEIDIKKKNCGHDTNKTETLSENDHVLQEEKDPYIGFKDFSKTVGEXXXXXXXXX 1829
            E + + +++ + N++ E+ + + E I K+F+K E
Sbjct: 175  EKSTIKKESTNEESYSNKINESTKKDESINTINES----INNKEFNKEEEE--------- 221

Query: 1830 XXXXTLKESIQKKIIEETHTRDTEHNEE--IKSRSNFDSNINNPNRSLSHLTHGLPTQTH 2003
            E KK EET + NE+ IK+ N + P RS + L
Sbjct: 222  ------HEKSTKK--EETINKQESSNEKSIIKNEKNIE-----PERSTNKL--------- 259

Query: 2004 IYEMGLSKDLIKLQESIKSKEISIADSSHNQSSVVSRILT---ESILEHSVIEPEY---I 2165
            + IK +ES K SI + H +S++ I T +I + I+ E+ I
Sbjct: 260  -------NESIKEKESSNEKASSINEEEHEKSTIKKEISTMNERNIEQDKSIKSEFNKSI 312

Query: 2166 TSSILQEDMASYLKASNISKVQEETYGNIKQIPSNLITPTFETNFYDESDLKSKK 2330
            E S LK+ N S +EET+ + + N E + ESD+ +K
Sbjct: 313  KDEEENEKNESTLKSINKSNKEEETHSFVSEFVKNT-----ELSLNKESDINKEK 362

>gb|EZA54964.1| hypothetical protein X777_04427 [Cerapachys biroi]
          Length = 1200

 Score = 63.9 bits (154), Expect = 4e-07
 Identities = 171/870 (19%), Positives = 321/870 (36%), Gaps = 50/870 (5%)
 Frame = +3

Query: 438 NQMQGKSINQEDNSSLNNRKLPSTVSPASRDVECERLVSSSMDIENGNNKHNIGMETACV 617
           N + ++ QE+ ++ N T S+ +E E VS NG G +
Sbjct: 191 NLISEEADEQEEKTTKKNNSADETSQECSQSIEKEIDVSQEKSDRNGMEISQEGRSLSKS 250

Query: 618 GMIDTLAKTLENPQLLKLTKSLSTDDP------CCYSTPDDGNVKNGPHVIAKVLHENPR 779
           DT T E + K KSLS D C + + +K +++ V +
Sbjct: 251 RNQDT---TEERSEKKKKKKSLSNRDDSKIEEQACLNRSTEKELKAFANLLENVFDDESS 307

Query: 780 QEFCLPTATNWNVSSNEVIDERTKLEPTTPVSDYLNKNFDSSDESDAFMAETASIFQPDE 959
           + A + + + + + S+ LN +S+ S AE + DE
Sbjct: 308 DSETIIPAMDLGFENCSTVAHAGSIANDSDGSEKLNDTSNSNIVSSFLFAEADN----DE 363

Query: 960 ENTTLTPLDIQSNTTVAKETDSNAIDKPFSFTEMPFDW----------PNN------EFI 1091
           +N T D N+ V KE + + ++ F ++P D PN+ +FI
Sbjct: 364 DNDNDTDKDSSVNSDVRKEYNLDGTEQKFDDDDVPHDECRASESEYSDPNDNGSDLADFI 423

Query: 1092 VPPVATGIDTTEDQKY----YEEAHNLDLDRNINIEDNSFGPSSISVDSDQGFDTLSDTQ 1259
            V + E+++ E+ NL+ + +N EDN S + ++ DT + Q
Sbjct: 424  VDDDQVENEKNEEKEEGDSDEEQDKNLEEQKEMNDEDNQSEEEGESGEEEKKVDTKNKEQ 483

Query: 1260 -TTEPDTSTHLGHENEKKYAKKLQEKIKADLTKIAQMNRNVSSKNSPQNVEEEEKSCGIT 1436
            T + H+ ENE Q ++K D + + S ++S N E EK
Sbjct: 484  KKTRAKVNEHVMIENEDAE----QSEVKID-------SCDSSDESSDDNNELPEKIGENK 532

Query: 1437 TSFNYDAKFDLKEKSREGQVSDILMSKQISNEKNEFKDEIDQELSEIIAEAKTKNRMGPE 1616
            + + + ++++ Q++D S + NE+ D D +S + + K +
Sbjct: 533  KNDSVEIISSRINETKKEQITDPKKSCAMKNEETILLDTSDPNVS--VFDKSKKTHVAKR 590

Query: 1617 PNDCTYLPSTSEVTQFIEIDIKKKNCGHDTNKTETLSENDHVLQEEKDPYIGFKDFSKTV 1796
            + + S S + I IK+K+ ++T+ +E D + +E + SK
Sbjct: 591  EETISLISSESPSS----IKIKRKSLLR-CSETKVETEQDSLKLDESAKSRFSRKLSKLR 645

Query: 1797 GEXXXXXXXXXXXXXTLKESIQ-KKIIEETHTRDTEHNEEIKSRSNFDSNINNPNRSLSH 1973
            L+ +I+ K +E+T E + KS++N S + S +
Sbjct: 646  SPMDCSTPKLNSSKHKLEFNIETPKTVEKTRESKVELD---KSQTNTSSKKSKSKDSTTK 702

Query: 1974 LTHGLPTQTHIYEMGLSKDLIKLQESIKSKEISIADSSHNQSSVVSRILTESILEHSVIE 2153
            L H + + + L S+ S I + ++ VS+ + L S+
Sbjct: 703  KNTSLTDTFHQDKRFVER---PLNTSLPSDLREIIEKANLSKPTVSK---TAELHKSMSV 756

Query: 2154 PEYITSSILQEDMASYLKASNISKVQEETYGNIKQIPSNLITPTFETNFYDESDLKSKKK 2333
            T I K++ +SK+ E K+ L PT E ++ +K K K
Sbjct: 757  THTETPRIRHLGKEKLNKSAPVSKLHVEV-DQSKESKDEL--PTTEEKAVAKTKIKEKSK 813

Query: 2334 N--------ADYITRTRXXXXXXXXXXXXXMNSNIHNIFRHTD---------GEQALAPE 2462
            +D TR R +N NI + D ++ E
Sbjct: 814  ENVSNVDSFSDNTTRKRKRERKHKKQAEETLNENITDKVLSEDIMELEASRNQKRVKFSE 873

Query: 2463 LLIISKETLNHLHK--KTALMQAEILRLSTALKIQENVSESEPI---SKNTPTQSTQQNS 2627
            L + ++ + +T + + +L + +EN+ E++ T + QQ
Sbjct: 874  SLTVKDDSCKGTSEINETKKEKKKKKKLVNVSEQRENIKEADTYQNKKNETELEEQQQEQ 933

Query: 2628 ELLENDEIFATQLPMLAKTQSCPRTKRLWENLWPFGRRDTENSTTLSPQRVTRKGYPFPQ 2807
            E+ +N+E +L + + K++ +N +T+ S T S + K P Q
Sbjct: 934  EVAQNEETSPEKLSTKKR-----KKKKIQKNEQEVSLLETKISKTKSKNSDSDK--PQAQ 986

Query: 2808 ASIGESTESVTHKEVKRYKVVGRVRAKDIR 2897
            A + +S+ R K+ +RA ++R
Sbjct: 987  ADESNTIDSINAFAKARCKMQEAIRATEMR 1016

>ref|XP_001310306.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121892071|gb|EAX97376.1| hypothetical protein TVAG_374570 [Trichomonas vaginalis G3]
          Length = 1793

 Score = 62.4 bits (150), Expect = 1e-06
 Identities = 150/783 (19%), Positives = 315/783 (40%), Gaps = 41/783 (5%)
 Frame = +3

Query: 465 QEDNSSLNNRKLPSTVSPASRDVECERLVSSSMDIENGNNKHNIGM-ETACVGMIDTLAK 641
           QE+ + LP + S + E + + + +N +N + ET + + ++
Sbjct: 823 QEEENKEETPVLP-LIQSISENKENQEETEAEAETQNSEESNNEKLNETPSLSLTKSITD 881

Query: 642 TLENPQLLKLTKSLSTDDPCCYSTPDDGNVKNGPHVIAKVLHENPRQEFCLPTATNWNVS 821
           LE+ K ++ + D + + ++ + + + E +++ +
Sbjct: 882 NLES----KSSEQENEDKSPELKSEETPSLSLTASISSNITKEGEQEQSQEDSTNKAEEE 937

Query: 822 SNEVIDERTKLEPTTPVSDYLNKNFDSSD-------ESDAFMAETASIFQPDEE-----N 965
           +NE +E L T +SD + N S+ E ++ ++ T +P+E +
Sbjct: 938 TNETKEETPSLSLTQTISDSIEHNETSTSQQNEENKEPESNVSSTEPQEKPNESLFGSIS 997

Query: 966 TTLTPLDIQSNTTVAKETDSNAIDKPFSFTEMPFDWPNNEFIVPPVATGIDTTEDQKYYE 1145
           L P SN K+ N +++ + D PN E TE++K E
Sbjct: 998 DKLLPQTEISNEK--KQEGENPLEEHKDNQDTNQDKPNEESESTSDKQS-PITEEKK--E 1052

Query: 1146 EAHNLDLDRNI--NIEDNSFGPSSISVDSDQGFDTLSDTQTTEPDTSTHLGHENEKKYAK 1319
            E +L L ++I NI++N + +LS T+ + ++G +
Sbjct: 1053 ETPSLSLTKSIAENIQENKDEEKIEETPKENETPSLSLTKFI----AENIGEREVPTQEE 1108

Query: 1320 KLQEKIKADLTKIAQMNRNVSSKNSPQNVEEEEK---SCGITTSFNYDAKFDLKEKSREG 1490
            K E+ L+ + N+ SK + +E+++ + +T + + + + K E
Sbjct: 1109 KKDEEETPSLSLTKSIEENIESKQENKELEQKKDDVPTLSLTPTIEENIESKQENKELEQ 1168

Query: 1491 QVSDIL-MSKQISNEKNEFKDEIDQELSEIIAEAKTKN-RMGPEPNDCTYLPSTSEVTQF 1664
            + D+L ++ I+ E KDE +E +EI + + TK+ + E N P E ++
Sbjct: 1169 KKDDVLPLTPTIAENTQENKDEEKKEETEIPSLSLTKSIQENIEENKEENEPPKDENSEQ 1228

Query: 1665 IEIDIKKKNCGHDTNKTETLSENDHV----LQEEKDPYIGFKDFSKTVGEXXXXXXXXXX 1832
            + + K+N + T++++EN V QEEK + S T
Sbjct: 1229 EKEETPKENESPTLSLTKSIAENIEVRELPTQEEKKDELETPSLSLT------------- 1275

Query: 1833 XXXTLKESIQKKIIEETHTRDTEHNEEIKSRSNFDSNINNPNRSLSHLTHGLPTQTHIYE 2012
            T++ +I++K +EE EE K + P+ SL+ + + E
Sbjct: 1276 --KTIENNIEEKTVEEKPV------EEKKEETQKQEKEGTPSLSLTKSIEQNIEEKQVDE 1327

Query: 2013 MGLSKDLIKLQESIKSKEISIADS----------SHNQSSVVSRILTESILEHSVIEPEY 2162
            K K E +++ +S+A + ++ S LT+SI E+++ +
Sbjct: 1328 KNEDKHEEKKDE-VETPSLSLAKTIAENIEEKPQEPDKEETPSLSLTKSI-ENNIESKQG 1385

Query: 2163 ITSSILQEDMASYLK---ASNISKVQEETYGNIKQIPSNLITPTFETNFYDESDLKSKKK 2333
            ++D L A NI + ++E +IPS +T + + N D+++ K ++
Sbjct: 1386 DKELEQKKDDVLPLTPTIAENIQENKDEEKNEETEIPSLSLTKSIQENIEDKTEEKEREI 1445

Query: 2334 NA---DYITRTRXXXXXXXXXXXXXMNSNIHNIFRHTDGEQALAPELLIISKETLNHLHK 2504
            + D + +T+ NS++ + T +L + + +K
Sbjct: 1446 STSINDNLLQTKEES-----------NSSLASNESETPS-LSLTKSIADNIETKQEEENK 1493

Query: 2505 KTALMQAEILRLST-ALKIQENVSESEPISKNTPTQSTQQNSELLENDEIFATQLPMLAK 2681
            + A ++E + T L + +++SE++ + T ++ QNSE N+++ + P L+
Sbjct: 1494 EIAANESEEKKEETPVLPLIQSISENKENQEETEAEAETQNSEESNNEKL--NETPSLSL 1551

Query: 2682 TQS 2690
            T+S
Sbjct: 1552 TKS 1554

>ref|XP_001308810.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121890508|gb|EAX95880.1| hypothetical protein TVAG_008910 [Trichomonas vaginalis G3]
          Length = 2263

 Score = 61.2 bits (147), Expect = 3e-06
 Identities = 130/632 (20%), Positives = 250/632 (39%), Gaps = 29/632 (4%)
 Frame = +3

Query: 462 NQEDNSSLNNRKLPSTVSPASRDVECERLVSSSMDIEN--GNNKHNIGMET--ACVGMID 629
           N++++ +NN++ T +R E LVSS+ D N N + N M++ + +
Sbjct: 595 NKQNDLLVNNKEENETYKSINRTQE---LVSSNKDNSNIIDNREENESMKSNEKEISSLS 651

Query: 630 TLAKTLENPQLLKLTKSLSTDDPCCYSTPDDGNVKNGPHVIAKVLHENPRQEFCLPTATN 809
           K+ + +L + +S S D S + N + V K L EN N
Sbjct: 652 NKEKSSISNKLTENIESKSHDKEISKSVNQEENDISNKSV-EKTLEENE---------IN 701

Query: 810 WNVSSNEVIDERTKLEPTTPVSDYLNKNFDSSD----ESDAFMAETASIFQPDEENTTLT 977
           SN+ +D++ + +S+ +NK S+ + ++ +E + ++E ++
Sbjct: 702 KEEKSNKSVDKKLQ---KNEISNSVNKEEKPSNNKLQKKESIDSEEVNESVVNKEEKDIS 758

Query: 978 PLDIQSNTTVAKETDSNAIDKPFSFTEMPFDWPNNEFIVPPVATGIDTTEDQKYYEEAHN 1157
           + SN+ +E + ++DK E+ E ++ T + + + E +
Sbjct: 759 NKSVDSNSVKKEENSTKSVDKKLRENEISNSVNKEENVI----TNQSSDDKLQQKESIDS 814

Query: 1158 LDLDRN-INIEDNSFGPSSIS---VDSDQGFDTLSDTQTTEPDTSTHLGHENEKKYAKKL 1325
            +++ + IN E+ S+ V+ ++ + ++ E D S + + KL
Sbjct: 815  KEVNESVINKEEKDISNKSVDSNLVEKEKNSTKSVEKKSQEEDISNSVNKKENDISNNKL 874

Query: 1326 QEKIKADLTKIAQMNRNVSSKNSPQ-NVEEEEKSCGITTSFNYDAKFD---LKEKSREGQ 1493
            EK+ NRN S N + +V + K I + +K + ++E +
Sbjct: 875  NEKVVESQEINKSNNRNESIVNEVEKDVSKSVKEESINKQKDLLSKNEQNSVEENKLNDE 934

Query: 1494 VSDILMSKQISNEKNEFKDEIDQELSEIIAEAKTKNRMGPEPNDCTYLPSTSEVTQFIEI 1673
            + S +E DEI++EL+E+ K + E+++ +E
Sbjct: 935  SKSSIKGSNQSKSVDERNDEINKELNELNENTKKST-------------NEEEISKSVEE 981

Query: 1674 DIKKKNCGH---DTNKTETLSEN--DHVLQEEKDPYIGFKDFSKTVGEXXXXXXXXXXXX 1838
            K N D ++ ET++++ D ++ + +K++ E
Sbjct: 982  SNNKLNNEEKSIDNHREETIAKSIEDKQVKNKSVDENQINSNNKSIDENNIVGIVVVSHK 1041

Query: 1839 XTLKESIQKKII---EETHTRDTEHNEEIKSRSNFDSNINNPNRSLSHLTHGLPTQTHIY 2009
            KE + I EE T+D + NE KS + + +I N S+ ++
Sbjct: 1042 DKSKEENKLTDINNKEEKVTKDIKENE--KSIVDKEKSIKENNSSVKNV----------- 1088

Query: 2010 EMGLSKDLIKLQESI-KSKEISIADSSHNQSSVVSRILTESILEHSVIEPEYITSSILQE 2186
            KD KL+E I KSKE + + ++ N+S + I+ E S L+E
Sbjct: 1089 -----KDKEKLEEQISKSKEENKSINNKNESIDEENKYSNQTESVQNIKEENSKSKELKE 1143

Query: 2187 DMASYLKASNISKVQEE----TYGNIKQIPSN 2270
            D S L ISK +EE + N K I +N
Sbjct: 1144 DEKSILSDEQISKSKEEISKSSKENTKSISNN 1175

>ref|XP_001582487.1| viral A-type inclusion protein [Trichomonas vaginalis G3]
 gi|121916722|gb|EAY21501.1| viral A-type inclusion protein, putative [Trichomonas vaginalis G3]
          Length = 1794

 Score = 60.1 bits (144), Expect = 6e-06
 Identities = 142/727 (19%), Positives = 285/727 (39%), Gaps = 39/727 (5%)
 Frame = +3

Query: 519 ASRDVECERLVSSSMDIENGNNKHNIGMETACVGMIDTL------AKTLENPQLLKLTKS 680
           AS+ + + L+ S + N N + + + M+ L A +N + +L
Sbjct: 200 ASKPAQSQDLLDFSSNQNNSNFNQSSNQQNSQQNMLLDLFGEQQPANQSQNTDIQRLNDK 259

Query: 681 LSTDDPCCYSTPDDGNVKNGPHVIAKVLHENPRQEFCLPTATNWNVSSNEVIDERTKLEP 860
           +S + D N +A ++ EN +++ T N N++ N+ ++ +
Sbjct: 260 ISQLEKELAEKDDQINE------LANLIEENDKKQ---GTQQNQNLNQNDEDAIQSLVTK 310

Query: 861 TTPVSDYLNKNFDSSDESDAFMAETASIFQPDEENTTLTPLDIQSNTTVAKETDSNAIDK 1040
           D + KN + E+ I Q +E +L N ++ E D N +
Sbjct: 311 YEEEIDDIKKNNQNEKEN--------LINQINELKNSL------KNKEISSENDLNEMKI 356

Query: 1041 PFSFTEMPFDWPNNEFIVPPVATGIDTTEDQKYYEEAHNL--DLDRNINIEDNSFGPSSI 1214
            T + E + + T ++ QK E + L ++N + + S +S
Sbjct: 357  IIEQTSKDY-----ETKIQDLMTNLEENS-QKLNEMSQKLKESEEKNQKLNEMSMLQASN 410

Query: 1215 SVDSDQGFDTLSDTQTTEPDTSTHLGHENEKKYAKKLQ-----EKIKADLTKIAQMNRNV 1379
            + ++ +S+ T L +ENEK + + +K+ +DL +I ++N+N+
Sbjct: 411  DAEKEKFIKEISNLTKENEKLQTVL-NENEKNRTENERLVAENQKLNSDLHEIGEVNKNL 469

Query: 1380 SSK--------NSPQNVEEEEKSCGITTSFNY----DAKFDLKEKSREGQVSDILMS-KQ 1520
            ++ S QN +E E ++ K + + +E Q D+ KQ
Sbjct: 470  QTEIEKLTEIMKSEQNNKENEMMSLLSQKEEQVQALQVKLNQTNQEKEKQFEDLSQKLKQ 529

Query: 1521 ISNEKNEFKDEIDQELSEIIAEAKTKNRMGPEPNDCTYLPSTSEVTQFIEIDIKKKNCGH 1700
            + EK + D+ + +++EI + N + +E Q +++K +
Sbjct: 530  LEAEKQKLNDDYESKINEI--QQNDNETFTNYQNQIKEMMINNENLQNENKSLQEKISLN 587

Query: 1701 DTNKTETLSENDHVLQEEKDPYIGFKDFSKTVGEXXXXXXXXXXXXXTLKESIQKKIIEE 1880
            + + E + + L+E K+ ++ K+ + E+++K I E+
Sbjct: 588  EKSDNEKVLSLEEQLKESKNSISSLQEQLKSSQQTI--------------ENLEKNISEK 633

Query: 1881 THTRDTEHNEEIKSRSNFDSNINNPNRSLSHLTHGLPTQTHIYEMGLSKDLIKLQESIKS 2060
            + T +NE+IKS ++ S I N N +L + L + E ++ ++ L+E +K+
Sbjct: 634  SET----YNEKIKSLTDELSTIQNTNENLQNEIKSLQEKLSNNEKNDNEKILNLEEQLKN 689

Query: 2061 KEISIADSSHNQSSVVSRILTESILEHSVIEPEYITSSILQEDM----------ASYLKA 2210
            + + S + + + S++E E TS ++E + S +
Sbjct: 690  SQNEVRIGQEKLSKFENE-YDQMRSKLSLMEKELSTSQKMKESLQKEKESLQEKISLSEK 748

Query: 2211 SNISKV--QEETYGNIKQIPSNLITPTFETNFYDESDLKSKKKNADYITRTRXXXXXXXX 2384
            S+ KV EE N K N+IT +E N E +L+S+ + T
Sbjct: 749  SDNEKVLSLEEQLNNSK----NMIT-NYEQN---EKELQSQLSTLNEELST-------SK 793

Query: 2385 XXXXXMNSNIHNIFRHTDGEQALAPELLIISKETLNHLHKKTALMQAEILRLSTALK-IQ 2561
            + I N ++ D + E L + T+N L + T + +I L + K +Q
Sbjct: 794  KMIETLEEKISNNEKNGDEKVKSYEEQLNSYRNTINELQQITQSNEEKIKSLESQNKDLQ 853

Query: 2562 ENVSESE 2582
            E +S SE
Sbjct: 854  EKISLSE 860