BLASTX nr result
ID: Mentha22_contig00042454
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00042454 (662 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006848744.1| hypothetical protein AMTR_s04155p00003130 [A...  140   4e-31
ref|XP_006830015.1| hypothetical protein AMTR_s04836p00002630, p...  135   9e-30
gb|EXB36258.1| hypothetical protein L484_013693 [Morus notabilis]    133   6e-29
gb|AAC13582.1| similar to maize transposon MuDR (GB:M76978) [Ara...  130   3e-28
ref|XP_002275053.2| PREDICTED: uncharacterized protein LOC100256...  130   5e-28
gb|AAG10809.1|AC018460_3 Similar to mutator transposase [Arabido...  129   1e-27
gb|AAG50597.1|AC079605_2 hypothetical protein [Arabidopsis thali...  128   1e-27
ref|XP_003633694.1| PREDICTED: uncharacterized protein LOC100241...  128   1e-27
ref|XP_006494886.1| PREDICTED: uncharacterized protein LOC102629...  126   5e-27
dbj|BAA96881.1| mutator-like transposase [Arabidopsis thaliana]      126   7e-27
emb|CAB51203.1| putative protein [Arabidopsis thaliana]              125   9e-27
ref|XP_006847242.1| hypothetical protein AMTR_s04652p00001680, p...  125   2e-26
gb|AAF87140.1|AC002423_5 T23E23.9 [Arabidopsis thaliana]             124   2e-26
emb|CAB43911.1| putative protein [Arabidopsis thaliana] gi|72697...  124   2e-26
gb|AAD49098.1|AF177535_2 contains similarity to maize transposon...  124   3e-26
ref|XP_006470974.1| PREDICTED: uncharacterized protein LOC102620...  124   3e-26
emb|CAB78061.1| putative protein [Arabidopsis thaliana]              124   3e-26
gb|AAD31079.1| Mutator-like transposase [Arabidopsis thaliana]       122   1e-25
ref|XP_007014883.1| Uncharacterized protein TCM_040459 [Theobrom...  122   1e-25
ref|XP_007052371.1| Uncharacterized protein TCM_005764 [Theobrom...  120   5e-25

>ref|XP_006848744.1| hypothetical protein AMTR_s04155p00003130 [Amborella trichopoda] gi|548852169|gb|ERN10325.1| hypothetical protein AMTR_s04155p00003130 [Amborella trichopoda] Length = 590 Score = 140 bits (353), Expect = 4e-31 Identities = 78/211 (36%), Positives = 122/211 (57%) Frame = +2 Query: 29 WHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSV 208 + V++F EHTC+ + HR+ VVG + + +D K++ IQ ++ ++ + Sbjct: 79 FEVTVFHNEHTCNLNARRSDHRQAAPW-VVGHLIKGKYTQDGTKYKAKDIQRDMFDNYGI 137 Query: 209 DTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFV 388 SY + R + G + S+ GYLYML + NP T+T+ + D RFK+ F+ Sbjct: 138 SMSYVKAWRCREMGLTYARGTPEFSYMKLPGYLYMLEQKNPGTITDLYLE-DERFKYCFI 196 Query: 389 ALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHES 568 +LGA R + RPV+ +DGT LK K G+M VAV DAN QL P+AYGI EN++S Sbjct: 197 SLGACRRGF--SFCRPVLSIDGTFLKTKYGGIMLVAVAYDANNQLLPVAYGIVDSENNDS 254 Query: 569 WIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 W +F+Q L+ + G++ + VSD+H+SI +A Sbjct: 255 WTYFLQKLRVAIGVVENLVFVSDRHQSIDHA 285 >ref|XP_006830015.1| hypothetical protein AMTR_s04836p00002630, partial [Amborella trichopoda] gi|548835784|gb|ERM97431.1| hypothetical protein AMTR_s04836p00002630, partial [Amborella trichopoda] Length = 547 Score = 135 bits (341), Expect = 9e-30 Identities = 76/211 (36%), Positives = 119/211 (56%) Frame = +2 Query: 29 WHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSV 208 + V++F EHTCS + R VVG + + D K++ IQ ++ ++ + Sbjct: 339 FEVTVFHNEHTCS-LDSRHADNRQAAPWVVGHLIKNKFKSDGTKYKAKDIQRDMFQEYGI 397 Query: 209 DTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFV 388 SY+ + R K + G +++ GY Y+L + NP T+T+ I+ D RFK+ F Sbjct: 398 KMSYEKAWRCREKGLMYSRGTPAAAYSQLPGYFYVLEQKNPGTITDI-ITEDNRFKYCFW 456 Query: 389 ALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHES 568
SLAACRRGFK--FCRPVISIDGTFLKTRFGGTMLVAVAYDANNQLFPIAFAIVDSENHDS 514 Query: 569 WIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 W +F+Q L+++ G + + VSD+H+SI++A Sbjct: 515 WKYFLQKLKEAIGEVENLVFVSDRHQSIEHA 545 >gb|EXB36258.1| hypothetical protein L484_013693 [Morus notabilis] Length = 821 Score = 133 bits (334), Expect = 6e-29 Identities = 78/212 (36%), Positives = 122/212 (57%), Gaps = 4/212 (1%) Frame = +2 Query: 29 WHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDS--EIIKSRIIQNELLSDF 202 + V+ + + HTCS L HR+ SS + S + K+ +S + I +L ++F Sbjct: 319 FQVTKYNETHTCSLDLLQCDHRQA--SSQIISEHIKKKYEESGRTVYAPNNIIEDLKNEF 376 Query: 203 SVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHV 382 +D SY+ + R +E + G D+S+ + +LY L KTNP +V + + + +FK++ Sbjct: 377 GIDVSYEKAWRARKIALEKIGGDVDKSYEELASFLYTLNKTNPGSVADIVLDDENKFKYM 436 Query: 383 FVALGASRLAYVNG--HIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIE 556 F+A+ AS ++G H RPVIVVD L+ K+ G + A +DAN Q+FPLA+GIG E Sbjct: 437 FMAVAAS----IHGWKHCRPVIVVDEIFLECKHRGSLLCACAEDANNQIFPLAFGIGESE 492 Query: 557 NHESWIWFMQCLQKSYGLLTGHLIVSDQHKSI 652 N ESW +F + L +++ G IVSDQH SI Sbjct: 493 NDESWEYFFKRLSEAFSERDGMWIVSDQHPSI 524 >gb|AAC13582.1| similar to maize transposon MuDR (GB:M76978) [Arabidopsis thaliana] gi|8843876|dbj|BAA97402.1| mutator-like transposase [Arabidopsis thaliana] Length = 806 Score = 130 bits (328), Expect = 3e-28 Identities = 75/208 (36%), Positives = 110/208 (52%) Frame = +2 Query: 29 WHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSV 208 +HV I+ EHTCS + R+ V+G +Y + +K + + + +F V Sbjct: 321 FHVYIYDSEHTCSVTERSGRSRQATPD-VLGVLYRDYLGDVGPDVKPKSVGIIITKNFRV 379 Query: 209 DTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFV 388 SY K E+ G +D SF + YLYM+R+ NP TV +I GRF ++F+ Sbjct: 380 KMSYSKSYKMLRFARELTLGTHDSSFEELPSYLYMIRRANPGTVARLQIDESGRFNYMFI 439 Query: 389 ALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHES 568 A GAS + ++R V+VVDGT L G G + A+ +D N Q+FPLA+G+ EN +S Sbjct: 440 AFGASIAGF--HYMRRVVVVDGTFLHGSYKGTLLTALAQDGNFQIFPLAFGVVDTENDDS 497 Query: 569 
WIWFMQCLQKSYGLLTGHLIVSDQHKSI 652 W WF L+ T I+SD+HKSI Sbjct: 498 WRWFFTQLKVVIPDATDLAIISDRHKSI 525 >ref|XP_002275053.2| PREDICTED: uncharacterized protein LOC100256986 [Vitis vinifera] Length = 1111 Score = 130 bits (326), Expect = 5e-28 Identities = 71/206 (34%), Positives = 111/206 (53%) Frame = +2 Query: 44 FRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYD 223 F HTC + + R S ++G + + + I ++ + + SYD Sbjct: 331 FYSTHTC-RLDMMSRDNRHASSWLIGESIRETYQGIGCEFRPKDIVADIRKQYGIPISYD 389 Query: 224 ACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFVALGAS 403 + + + + G +ES+N Y Y+L + NP T+T+ D +FK+ F+++GAS Sbjct: 390 KAWRAKELALGSIRGSPEESYNTLPSYCYVLEQKNPGTITDIVTDCDNQFKYFFMSIGAS 449 Query: 404 RLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFM 583 LA + IRPV+VVDGT LK K G +F+A KD N Q++PLA+GIG EN SW WF+ Sbjct: 450 -LAGFHTSIRPVVVVDGTFLKAKYLGTLFIAACKDGNNQIYPLAFGIGDSENDASWEWFL 508 Query: 584 QCLQKSYGLLTGHLIVSDQHKSIKYA 661 Q L + G + ++SD+H SI+ A Sbjct: 509 QKLHDALGHIDDLFVISDRHGSIEKA 534 >gb|AAG10809.1|AC018460_3 Similar to mutator transposase [Arabidopsis thaliana] Length = 884 Score = 129 bits (323), Expect = 1e-27 Identities = 79/226 (34%), Positives = 122/226 (53%), Gaps = 10/226 (4%) Frame = +2 Query: 8 LKSDG-GPWHVSIFRKEHTCSEFGLTNVHRRTIK---SSVVGSIYAKRMLRDSEIIKSRI 175 + DG G + + + +HTCS + R+ +K S V+ S++ + S Sbjct: 362 IDDDGKGYYEIKKAQLQHTCS----VDTRRQYMKKATSKVIASVFKAKYSEASAGPVPMD 417 Query: 176 IQNELLSDFSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEI 355 +Q +L D V SY C + R + V G +ES+++ YL++L+ TNP T+T E Sbjct: 418 LQQLVLEDLRVSASYKKCWRARESALTDVGGSDEESYSNLAEYLHLLKLTNPGTITHIET 477 Query: 356 SPD------GRFKHVFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANE 517 PD RF ++F+A GAS + H+R V+VVDGTHLKGK GV+ A +DAN Sbjct: 478 EPDIEDERKERFLYMFLAFGASIQGFK--HLRRVLVVDGTHLKGKYKGVLLTASGQDANF 535 Query: 518 QLFPLAYGIGPIENHESWIWFMQCLQKSYGLLTGHLIVSDQHKSIK 655 Q++PLA+ + EN ++W WF L++ I+SD+H+SIK Sbjct: 536 QVYPLAFAVVDSENDDAWTWFFTKLERIIADNNTLTILSDRHESIK 581 >gb|AAG50597.1|AC079605_2 
hypothetical protein [Arabidopsis thaliana] Length = 873 Score = 128 bits (322), Expect = 1e-27 Identities = 77/220 (35%), Positives = 119/220 (54%), Gaps = 9/220 (4%) Frame = +2 Query: 23 GPWHVSIFRKEHTCSEFGLTNVHRRTIK---SSVVGSIYAKRMLRDSEIIKSRIIQNELL 193 G + + + +HTCS + R+ +K S V+ S++ + S +Q +L Sbjct: 384 GYYEIKKAQLQHTCS----VDTRRQYMKKATSKVIASVFKAKYSEASAGPVPMDLQQLVL 439 Query: 194 SDFSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPD--- 364 D V SY C + R + V G +ES+++ YL++L+ TNP T+T E PD Sbjct: 440 EDLRVSASYKKCWRARESALTDVGGSDEESYSNLAEYLHLLKLTNPGTITHIETEPDIED 499 Query: 365 ---GRFKHVFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLA 535 RF ++F+A GAS + H+R V+VVDGTHLKGK GV+ A +DAN Q++PLA Sbjct: 500 ERKERFLYMFLAFGASIQGFK--HLRRVLVVDGTHLKGKYKGVLLTASGQDANFQVYPLA 557 Query: 536 YGIGPIENHESWIWFMQCLQKSYGLLTGHLIVSDQHKSIK 655 + + EN ++W WF L++ I+SD+H+SIK Sbjct: 558 FAVVDSENDDAWTWFFTKLERIIADNNTLTILSDRHESIK 597 >ref|XP_003633694.1| PREDICTED: uncharacterized protein LOC100241533 [Vitis vinifera] Length = 734 Score = 128 bits (322), Expect = 1e-27 Identities = 70/206 (33%), Positives = 110/206 (53%) Frame = +2 Query: 44 FRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYD 223 F HTC + + R S ++G + + + I ++ + + SYD Sbjct: 237 FYSTHTC-RLDMMSRDNRHASSWLIGESIRETYQGIGCEFRLKDIVADIRKQYGIQISYD 295 Query: 224 ACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFVALGAS 403 + + + + G +ES+N Y Y+L + NP T+T+ D +FK+ F+++GAS Sbjct: 296 KAWRAKELALGSIRGSPEESYNTLPSYCYVLEQKNPGTITDIVTDCDNQFKYFFMSIGAS 355 Query: 404 RLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFM 583 LA + IRPV+ VDGT LK K G +F+A KD N Q++PLA+GIG EN SW WF+ Sbjct: 356 -LAGFHTSIRPVVAVDGTFLKAKYFGTLFIAACKDGNNQIYPLAFGIGDSENDASWEWFL 414 Query: 584 QCLQKSYGLLTGHLIVSDQHKSIKYA 661 Q L + G + ++SD+H SI+ A Sbjct: 415 QKLHDAIGHIDDLFVISDRHGSIEKA 440 >ref|XP_006494886.1| PREDICTED: uncharacterized protein LOC102629840 [Citrus sinensis] Length = 772 Score = 126 bits (317), Expect = 5e-27 Identities = 77/202 (38%), Positives = 111/202 
(54%) Frame = +2 Query: 56 HTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYDACLK 235 HTC L + R ++S VVG I A++ ++D I I+ ++ ++ V +Y + Sbjct: 290 HTCHNEVLVD-GRHQVRSRVVGHIIAEKYIQDKRIYTPNDIRADMQQEYGVQLTYQQAYR 348 Query: 236 GRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFVALGASRLAY 415 R +E+V G ES+N Y ++L N TVT E DG F + FVALG+S + Sbjct: 349 AREVGLEIVRGNAAESYNLLPKYSHVLTTANEGTVTHLEQDGDGNFLYYFVALGSSIKGF 408 Query: 416 VNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFMQCLQ 595 + +IRPVI VDGTHLKG G MFVA D N QL+PLA GI EN+++ WFM L Sbjct: 409 MQ-YIRPVIAVDGTHLKGLYRGSMFVATCLDGNNQLYPLAIGIMDSENNDALEWFMMKLH 467 Query: 596 KSYGLLTGHLIVSDQHKSIKYA 661 G + +I+ D+ +I+ A Sbjct: 468 GVIGDRSELVIIFDRCTAIRRA 489 >dbj|BAA96881.1| mutator-like transposase [Arabidopsis thaliana] Length = 875 Score = 126 bits (316), Expect = 7e-27 Identities = 75/202 (37%), Positives = 110/202 (54%), Gaps = 9/202 (4%) Frame = +2 Query: 77 LTNVHRRTIK---SSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYDACLKGRNK 247 L V T+K S V+ S++ + S +Q +L D V SY C + R Sbjct: 400 LATVMNGTVKKATSKVIASVFKAKYSEASAGPVPMDLQQLVLEDLRVSASYKKCWRARES 459 Query: 248 TIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPD------GRFKHVFVALGASRL 409 I V G +ES+++ YL++L+ TNP T+T E PD RF ++F+A GAS Sbjct: 460 AITDVGGSDEESYSNLAEYLHLLKLTNPGTITHIETEPDIEDERKERFLYMFLAFGASIQ 519 Query: 410 AYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFMQC 589 + H+R V+VVDGTHLKGK GV+ A +DAN Q++PLA+ + EN ++W WF Sbjct: 520 GFK--HLRRVLVVDGTHLKGKYKGVLLTASGQDANFQVYPLAFAVVDSENDDAWTWFFTK 577 Query: 590 LQKSYGLLTGHLIVSDQHKSIK 655 L++ I+SD+H+SIK Sbjct: 578 LERIIADSNTLTILSDRHESIK 599 >emb|CAB51203.1| putative protein [Arabidopsis thaliana] Length = 735 Score = 125 bits (315), Expect = 9e-27 Identities = 76/212 (35%), Positives = 113/212 (53%), Gaps = 3/212 (1%) Frame = +2 Query: 29 WHVSIFRKEHTCS---EFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSD 199 W V + RK HTCS E ++ R T + ++ S+ + E I + + Sbjct: 224 WSVRVHRKMHTCSRSVETTSNSIQRGTPR--LIASVLHCDYPGNLETPTPNNIMSIVRGR 281 Query: 200 
FSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKH 379 V SY L+G+ + V G + S+ YLYML K NP TVT E+ + +FK+ Sbjct: 282 LGVHCSYSTALRGKMLHVSDVRGTPERSYTMLFSYLYMLEKVNPGTVTYVELEGEKKFKY 341 Query: 380 VFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIEN 559 +F+ALGA + +R VIVVD THLK G++ +A +D N +PLA+GI EN Sbjct: 342 LFIALGACIEGFRT--MRKVIVVDATHLKTVYGGMLVIATAQDPNHHHYPLAFGIIDSEN 399 Query: 560 HESWIWFMQCLQKSYGLLTGHLIVSDQHKSIK 655 SWIWF++ L+ Y + G + +SD+H+SIK Sbjct: 400 DVSWIWFLEKLKTVYSDVPGLVFISDRHQSIK 431 >ref|XP_006847242.1| hypothetical protein AMTR_s04652p00001680, partial [Amborella trichopoda] gi|548850309|gb|ERN08823.1| hypothetical protein AMTR_s04652p00001680, partial [Amborella trichopoda] Length = 607 Score = 125 bits (313), Expect = 2e-26 Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 5/224 (2%) Frame = +2 Query: 5 LLKSDGGPWHVSIFRKE-----HTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKS 169 L S GP I RK HTC + + +R + ++G+ R Sbjct: 156 LTASRNGPTKSFIIRKYDRKVIHTC-DLNIRFADKRQATTKLIGNYIKPRFTNIKTTQTP 214 Query: 170 RIIQNELLSDFSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEH 349 + I+ E+ + V +Y + + E + GK +ES+ G+L+ML+KTNP T+ Sbjct: 215 QDIRGEMKHKYGVRMNYMKAWRSKEHAQEELRGKANESYRLLPGFLHMLQKTNPGTIVHM 274 Query: 350 EISPDGRFKHVFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFP 529 E D FK++FVAL AS + +P+IVVDGT LK G + A T+DAN +FP Sbjct: 275 ETEDDNSFKYLFVALDASIKGWKK--CKPIIVVDGTFLKSTYGGTLLSACTQDANGHIFP 332 Query: 530 LAYGIGPIENHESWIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 LA+ + EN+ SW WF ++++YG+ ++SD+H+SI A Sbjct: 333 LAFSVVDSENNNSWQWFFTKVRETYGIREEQCLISDRHESISKA 376 >gb|AAF87140.1|AC002423_5 T23E23.9 [Arabidopsis thaliana] Length = 972 Score = 124 bits (312), Expect = 2e-26 Identities = 76/214 (35%), Positives = 114/214 (53%), Gaps = 3/214 (1%) Frame = +2 Query: 29 WHVSIFRKEHTCS---EFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSD 199 W V + RK HTCS E ++ R T + ++ S+ + E + I + + Sbjct: 275 WSVRVHRKMHTCSRSVETTSNSIQRGTPR--LIASVLHCDYPGNLETPTPKNIMSIVRGR 332 Query: 200 
FSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKH 379 V SY L+G+ + V G + S+ YLYML K NP TVT E+ + +FK+ Sbjct: 333 LGVHCSYSTALRGKMLHVSDVRGTPERSYTMLFSYLYMLEKVNPGTVTYVELEGEKKFKY 392 Query: 380 VFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIEN 559 +F+ALGA + +R VIVVD THLK G++ +A +D N +PLA+GI E Sbjct: 393 LFIALGACIEGF--RAMRKVIVVDATHLKTVYGGMLVIATAQDPNHHHYPLAFGIIDSEK 450 Query: 560 HESWIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 SWIWF++ L+ Y + G + +SD+H+SIK A Sbjct: 451 DVSWIWFLEKLKTVYSDVPGLVFISDRHQSIKKA 484 >emb|CAB43911.1| putative protein [Arabidopsis thaliana] gi|7269795|emb|CAB79655.1| putative protein [Arabidopsis thaliana] Length = 914 Score = 124 bits (312), Expect = 2e-26 Identities = 76/214 (35%), Positives = 114/214 (53%), Gaps = 3/214 (1%) Frame = +2 Query: 29 WHVSIFRKEHTCS---EFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSD 199 W V + RK HTCS E ++ R T + ++ S+ + E + I + + Sbjct: 275 WSVRVHRKMHTCSRSVETTSNSIQRGTPR--LIASVLHCDYPGNLETPTPKNIMSIVRGR 332 Query: 200 FSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKH 379 V SY L+G+ + V G + S+ YLYML K NP TVT E+ + +FK+ Sbjct: 333 LGVHCSYSTALRGKMLHVSDVRGTPERSYTMLFSYLYMLEKVNPGTVTYVELEGEKKFKY 392 Query: 380 VFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIEN 559 +F+ALGA + +R VIVVD THLK G++ +A +D N +PLA+GI E Sbjct: 393 LFIALGACIEGF--RAMRKVIVVDATHLKTVYGGMLVIATAQDPNHHHYPLAFGIIDSEK 450 Query: 560 HESWIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 SWIWF++ L+ Y + G + +SD+H+SIK A Sbjct: 451 DVSWIWFLEKLKTVYSDVPGLVFISDRHQSIKKA 484 >gb|AAD49098.1|AF177535_2 contains similarity to maize transposon MuDR (GB:M76978) [Arabidopsis thaliana] Length = 872 Score = 124 bits (311), Expect = 3e-26 Identities = 70/189 (37%), Positives = 105/189 (55%), Gaps = 6/189 (3%) Frame = +2 Query: 107 SSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYDACLKGRNKTIEMVYGKYDESF 286 S V+ S++ + S +Q +L D V SY C + R + V G +ES+ Sbjct: 413 SKVIASVFKAKYSEASAGPVPMDLQQLVLEDLRVSASYKKCWRARESALTDVGGSDEESY 472 Query: 287 
NDFLGYLYMLRKTNPETVTEHEISPD------GRFKHVFVALGASRLAYVNGHIRPVIVV 448 ++ YL++L+ TNP T+T E PD RF ++F+A GAS + H+R V+VV Sbjct: 473 SNLAEYLHLLKLTNPGTITHIETEPDIEDERKERFLYMFLAFGASIQGFK--HLRRVLVV 530 Query: 449 DGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFMQCLQKSYGLLTGHLI 628 DGTHLKGK GV+ A +DAN Q++PLA+ + EN ++W WF L++ I Sbjct: 531 DGTHLKGKYKGVLLTASGQDANFQVYPLAFAVVDSENDDAWTWFFTKLERIIADSNTLTI 590 Query: 629 VSDQHKSIK 655 +SD+H+SIK Sbjct: 591 LSDRHESIK 599 >ref|XP_006470974.1| PREDICTED: uncharacterized protein LOC102620129 [Citrus sinensis] Length = 805 Score = 124 bits (310), Expect = 3e-26 Identities = 70/200 (35%), Positives = 104/200 (52%) Frame = +2 Query: 56 HTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDTSYDACLK 235 H CS ++ + R S ++ + + K I+ ++L F V+ SYD + Sbjct: 304 HNCS-LDISQRNHRQASSKLISKFIQSKCDGVARSYKPGSIREDILKQFGVNISYDKAWR 362 Query: 236 GRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFVALGASRLAY 415 R + V G +ESF+ Y ML K NP T+T E + F+ F+ALG+S + Sbjct: 363 AREYALHSVKGSLEESFSLLSAYCEMLEKKNPGTITHIETDLENHFQSFFMALGSSIRGF 422 Query: 416 VNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWIWFMQCLQ 595 IRPVI VD LKGK G+MF+A KD N+Q +PLA+GIG E+ SW WF+ L+ Sbjct: 423 -RSSIRPVIAVDRALLKGKYQGIMFLAACKDGNDQTYPLAFGIGDSESDSSWDWFLTKLR 481 Query: 596 KSYGLLTGHLIVSDQHKSIK 655 G + + +SD + SI+ Sbjct: 482 DLMGEVDDLVFISDDNVSIR 501 >emb|CAB78061.1| putative protein [Arabidopsis thaliana] Length = 960 Score = 124 bits (310), Expect = 3e-26 Identities = 76/214 (35%), Positives = 114/214 (53%), Gaps = 3/214 (1%) Frame = +2 Query: 29 WHVSIFRKEHTCS---EFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSD 199 W V + RK HTCS E ++ R T + ++ S+ + E + I + + Sbjct: 275 WSVRMHRKMHTCSRSVETTSNSIQRGTPR--LIASVLHCDYPGNLETPTPKNIMSIVRGR 332 Query: 200 FSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKH 379 V SY L+G+ + V G + S+ YLYML K NP TVT E+ + +FK+ Sbjct: 333 LGVHCSYSTALRGKMLHVSDVRGTPERSYTMLFSYLYMLEKVNPGTVTYVELEGEKKFKY 392 Query: 380 VFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIEN 559 +F+ALGA + 
+R VIVVD THLK G++ +A +D N +PLA+GI E Sbjct: 393 LFIALGACIEGF--RAMRKVIVVDATHLKTVYGGMLVIATAQDPNHHHYPLAFGIIDSEK 450 Query: 560 HESWIWFMQCLQKSYGLLTGHLIVSDQHKSIKYA 661 SWIWF++ L+ Y + G + +SD+H+SIK A Sbjct: 451 DVSWIWFLENLKTVYSDVPGLVFISDRHQSIKKA 484 >gb|AAD31079.1| Mutator-like transposase [Arabidopsis thaliana] Length = 819 Score = 122 bits (306), Expect = 1e-25 Identities = 73/208 (35%), Positives = 106/208 (50%) Frame = +2 Query: 29 WHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSV 208 ++V I+ EHTCS + R+ V+G +Y + +K + I + F V Sbjct: 316 FYVYIYDSEHTCSVRERSGRSRQATPD-VLGVLYRDYLGDVGPDVKPKSIGIIITKHFRV 374 Query: 209 DTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFV 388 SY K E+ G D SF + YLYM+R+ NP TV +I GRF ++F+ Sbjct: 375 KMSYSKSYKTLRFARELTLGTPDSSFEELPSYLYMIRRANPGTVARLQIDESGRFNYMFI 434 Query: 389 ALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHES 568 GAS + ++R V+VVDGT L G G + A+ +D N Q+FPLA+G+ EN +S Sbjct: 435 VFGASIAGF--HYMRRVVVVDGTFLHGSYKGTLLTALAQDGNFQIFPLAFGVVDTENDDS 492 Query: 569 WIWFMQCLQKSYGLLTGHLIVSDQHKSI 652 W W L+ T I+SD+HKSI Sbjct: 493 WRWLFTQLKVVIPDATDLAIISDRHKSI 520 >ref|XP_007014883.1| Uncharacterized protein TCM_040459 [Theobroma cacao] gi|508785246|gb|EOY32502.1| Uncharacterized protein TCM_040459 [Theobroma cacao] Length = 715 Score = 122 bits (305), Expect = 1e-25 Identities = 67/216 (31%), Positives = 111/216 (51%) Frame = +2 Query: 8 LKSDGGPWHVSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNE 187 L G W V F K HTC+ GL T + ++G + + ++ + ++ + I E Sbjct: 309 LPDRGEYWQVRTFHKVHTCTVDGLQRWFPTT-SAKMIGELMSHKLRANGVALRPKDIICE 367 Query: 188 LLSDFSVDTSYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDG 367 + + ++ Y + + +V+ +ESF Y YML + P TVT + Sbjct: 368 MRVQWGLECLYGKVWQAKEYAERLVFSPLEESFQLLPSYFYMLEQEIPGTVTVMATDEEE 427 Query: 368 RFKHVFVALGASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIG 547 RFK+ F + GA + + + P + +D THLKG+ NGV+FV V KDANE ++P+ +GI Sbjct: 428 RFKYCFWSYGACIRGF-SDVMHPTVAIDATHLKGRFNGVLFVTVCKDANECVYPVGFGID 486 Query: 548 
PIENHESWIWFMQCLQKSYGLLTGHLIVSDQHKSIK 655 +E+ +SW WF+ L+ G + +S+QH IK Sbjct: 487 HVEDEDSWTWFLSKLRDVVGCHENTMFISNQHLIIK 522 >ref|XP_007052371.1| Uncharacterized protein TCM_005764 [Theobroma cacao] gi|508704632|gb|EOX96528.1| Uncharacterized protein TCM_005764 [Theobroma cacao] Length = 458 Score = 120 bits (300), Expect = 5e-25 Identities = 67/207 (32%), Positives = 112/207 (54%) Frame = +2 Query: 35 VSIFRKEHTCSEFGLTNVHRRTIKSSVVGSIYAKRMLRDSEIIKSRIIQNELLSDFSVDT 214 V F K HTC+ GL T + ++ + + ++ + ++ + I ++ + ++ Sbjct: 201 VQKFHKVHTCTVDGLQGWFP-TKSAKMITELMSHKIRANGVALRPKDIICDMRVQWGLEC 259 Query: 215 SYDACLKGRNKTIEMVYGKYDESFNDFLGYLYMLRKTNPETVTEHEISPDGRFKHVFVAL 394 Y + + +V+G +ESF Y YML + NP VT + + RFK+ + Sbjct: 260 LYGKAWQVKEYAKRLVFGPPEESFQLLPSYFYMLEQENPGIVTAVATNEEKRFKYCLWSY 319 Query: 395 GASRLAYVNGHIRPVIVVDGTHLKGKNNGVMFVAVTKDANEQLFPLAYGIGPIENHESWI 574 GA +++ +RP I +D THLKG+ GV+FVAV KDANE ++P+A+GI IE+ +SW Sbjct: 320 GACIRGFMDV-MRPTIAIDATHLKGRFKGVLFVAVCKDANECVYPIAFGIDHIEDEDSWT 378 Query: 575 WFMQCLQKSYGLLTGHLIVSDQHKSIK 655 WF+ L+ + G L + + DQH IK Sbjct: 379 WFLSKLRDAVGCLENTMFIFDQHLGIK 405