BLASTX nr result
ID: Ephedra27_contig00026656
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00026656 (1294 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera]  185   3e-50
emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]  180   1e-48
emb|CBI37296.3| unnamed protein product [Vitis vinifera]             171   2e-45
gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...  174   4e-44
emb|CAB75932.1| putative protein [Arabidopsis thaliana]              163   7e-44
emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera]  162   2e-43
emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]  182   2e-43
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...  168   3e-43
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...  167   6e-43
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...  166   1e-42
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...  166   2e-42
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...  172   4e-42
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]        166   4e-42
dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana] gi...  162   4e-42
gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768...  162   4e-42
gb|AAF25964.2|AC017118_1 F6N18.1 [Arabidopsis thaliana]              162   4e-42
ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898...  168   5e-42
gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Ar... 
164 6e-42 gb|AGW47867.1| polyprotein [Phaseolus vulgaris] 166 4e-41 pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster... 167 5e-41 >emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera] Length = 1274 Score = 185 bits (470), Expect(2) = 3e-50 Identities = 110/288 (38%), Positives = 166/288 (57%), Gaps = 9/288 (3%) Frame = +2 Query: 458 ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQK 628 + +N G+ + +L + ++ L +NL+S+ +L+ Y++ +G TC +Q Sbjct: 350 MAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQI 405 Query: 629 IVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPK 793 IV N LF ++ IE + T SNLWH R GH+N + L + + G Sbjct: 406 IVNVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLP 465 Query: 794 KLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTD 970 K+ +C I K KKPF KG +R ++ CLEIIH+DLCGP+ + G Y L FTD Sbjct: 466 KIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYFLLFTD 525 Query: 971 DHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKG 1150 DHS+M+W + L+ K+E FE+F KF V+ + I L+ D G EF S +FK +C ++G Sbjct: 526 DHSRMSWVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVFCEEEG 585 Query: 1151 IKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 + +ELT PY+ +QNGVAERKNRT++EM RS++++ LS +W E T Sbjct: 586 LHRELTTPYSPEQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 633 Score = 41.6 bits (96), Expect(2) = 3e-50 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420 +AN VE +E+QV +F + + + WFLDS C++HMTG K +F ++ K Sbjct: 278 QANYVEQEEDQVKLFMXYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 337 Query: 421 VEERLLVKGVG 453 ++++ V+G G Sbjct: 338 DDKQVXVEGKG 348 >emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera] Length = 1472 Score = 180 bits (456), Expect(2) = 1e-48 Identities = 110/282 (39%), Positives = 161/282 (57%), Gaps = 8/282 (2%) Frame = +2 Query: 473 GNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQKIVAEGT 646 
GN K+ L + ++ L +NL+S+ +L+ Y++ +G TC +Q IV Sbjct: 342 GNVKL-LYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQIIVBVRM 396 Query: 647 EQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPKKLLPTK 811 N LF ++ IE + T SNLWH R GH+N + L + + G K+ Sbjct: 397 AANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLPKIDSVN 456 Query: 812 MCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMT 988 +C I K KKPF KG +R ++ CLEIIH+DLCGP+ + G Y L FTDDHS+M+ Sbjct: 457 VCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYFLLFTDDHSRMS 516 Query: 989 WTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELT 1168 W + L+ K+E FE+F KF V+ + I L+ D G EF S +FK + ++G+ +ELT Sbjct: 517 WVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVFXEEEGLHRELT 576 Query: 1169 IPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 PY+ QNGVAERKNRT++EM RS++++ LS +W E T Sbjct: 577 TPYSPXQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 618 Score = 41.6 bits (96), Expect(2) = 1e-48 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420 +AN VE +E+QV +F + + + WFLDS C++HMTG K +F ++ K Sbjct: 263 QANYVEQEEDQVKLFMAYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 322 Query: 421 VEERLLVKGVG 453 ++++ V+G G Sbjct: 323 DDKQVQVEGKG 333 >emb|CBI37296.3| unnamed protein product [Vitis vinifera] Length = 3048 Score = 171 bits (434), Expect(2) = 2e-45 Identities = 106/277 (38%), Positives = 157/277 (56%), Gaps = 11/277 (3%) Frame = +2 Query: 488 ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQ--NGL 661 I+T YV L+ NL+S+ +L K FQ C++ ++QK + T+ N + Sbjct: 374 IITGVFYVPELKNNLLSIGQLQE-KGLTILFQHGK----CKVFHSQKGLIMDTKMSSNRM 428 Query: 662 FVM----KPMLIECFLTNTE-ISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLPTKMC 817 F++ +P+ CF T TE I LWH R GH++ + L + V+ P+ P+K+C Sbjct: 429 FMLYALSQPISSTCFNTVTEDILQLWHCRYGHLSFQGLKTLQQRKMVNGLPQFQPPSKLC 488 Query: 818 SS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWT 994 + K H+ K + + + L+++H+D+CGPI P + K Y+LTFTDD S+ TW Sbjct: 489 
KDCLVGKQHRSSIPKKSNWRAAEILQLVHADICGPINPISNSKKRYLLTFTDDFSRKTWV 548 Query: 995 FILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIP 1174 + L KSE F F F V+ E S + CL+ D G EFTS EF +C GI+++LT Sbjct: 549 YFLVEKSEAFAVFKSFKTYVEKETSSFLRCLRTDRGGEFTSQEFAIFCDVHGIRRQLTAA 608 Query: 1175 YNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 Y QQNGVAERKNRT++ MVRS+L + +L +W EA Sbjct: 609 YTPQQNGVAERKNRTIMNMVRSMLSAKKLPKTFWPEA 645 Score = 39.7 bits (91), Expect(2) = 2e-45 Identities = 15/29 (51%), Positives = 20/29 (68%) Frame = +1 Query: 301 NKDFNRDGWFLDSDCNSHMTGNKEMFTNF 387 NK D WFLDS C++HM G K+ F++F Sbjct: 312 NKTSREDTWFLDSGCSNHMCGKKDYFSDF 340 >gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1333 Score = 174 bits (442), Expect(2) = 4e-44 Identities = 107/287 (37%), Positives = 154/287 (53%), Gaps = 13/287 (4%) Frame = +2 Query: 473 GNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQ-------KDNGQTCCRIPYNQKI 631 GN K L D YV L NL+S+ +L+T Y++ + K++G+T R+P Q Sbjct: 362 GNVKF-LYDVQYVPTLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGRTIARVPMTQNK 420 Query: 632 VAE---GTEQNGLFVMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKVGAVSH--GPKK 796 + N V+K +NLWH R GH+N +L + G Sbjct: 421 MFPLDISNVGNSALVVK---------EKNETNLWHLRYGHLNVNWLKLLVQKDMVIGLPN 471 Query: 797 LLPTKMCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDD 973 + +C I K +K F G +T CLE++H+DLCGP+ + G Y L FTDD Sbjct: 472 IKELDLCEGCIYGKQTRKSFPVGKSWRATTCLELVHADLCGPMKMESLGGSRYFLMFTDD 531 Query: 974 HSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGI 1153 +S+ +W + L+ KSE FE+F KF V+ + +KI L+ D G EF S +F +C + GI Sbjct: 532 YSRFSWVYFLKFKSETFETFKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGI 591 Query: 1154 KKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 ++ELT PY +QNGVAERKNRT++EM RS L++ L +WGEA T Sbjct: 592 RRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGEAVAT 638 Score = 32.0 bits (71), Expect(2) = 4e-44 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 3/45 (6%) Frame = +1 Query: 253 ANMVEIKENQVYVFFTNKDFNRDG---WFLDSDCNSHMTGNKEMF 378 AN + E + +F + 
WF+DS C++HM+ +K +F Sbjct: 284 ANFTQNVEEESKLFMASSQITESANAVWFIDSGCSNHMSSSKSLF 328 >emb|CAB75932.1| putative protein [Arabidopsis thaliana] Length = 1339 Score = 163 bits (413), Expect(2) = 7e-44 Identities = 103/279 (36%), Positives = 151/279 (54%), Gaps = 13/279 (4%) Frame = +2 Query: 488 ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNG--- 658 ++ + YV LR NL+SL +L + + +D C++ + K T +G Sbjct: 354 VIPEVYYVPELRNNLLSLGQLQ--ERGLAILIRDG---TCKVYHPSKGAIMETNMSGNRM 408 Query: 659 --LFVMKPMLIECFLTNTEI----SNLWHNRLGHINNEYLWKVG--AVSHGPKKLLPTK- 811 L KP L E+ ++LWH R GH+N E L + + G L TK Sbjct: 409 FFLLASKPQKNSLCLQTEEVMDKENHLWHCRFGHLNQEGLKLLAHKKMVIGLPILKATKE 468 Query: 812 MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMT 988 +C+ +T K H++ +K T S+ L+++HSD+CGPITP + GK YIL+F DD ++ T Sbjct: 469 ICAICLTGKQHRESMSKKTSWKSSTQLQLVHSDICGPITPISHSGKRYILSFIDDFTRKT 528 Query: 989 WTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELT 1168 W + L KSE F +F F V+ E + + CL+ D G EFTS EF +C GI ++LT Sbjct: 529 WVYFLHEKSEAFATFKIFKASVEKEIGAFLTCLRTDRGGEFTSNEFGEFCRSHGISRQLT 588 Query: 1169 IPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 + QQNGVAERKNRT++ VRS+L Q+ +W EA Sbjct: 589 AAFTPQQNGVAERKNRTIMNAVRSMLSERQVPKMFWSEA 627 Score = 42.4 bits (98), Expect(2) = 7e-44 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 5/77 (6%) Frame = +1 Query: 253 ANMVEIKENQ---VYVFFTNKDFNRDG-WFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420 AN E++E + + + NRD WFLDS C++HMTG+KE F+ + + VK+ Sbjct: 272 ANYAELEEEEELLLMAYVEQNQANRDEVWFLDSGCSNHMTGSKEWFSELEEGFN-RTVKL 330 Query: 421 VEE-RLLVKGVGRYYTK 468 + R+ V G G K Sbjct: 331 GNDTRMSVVGKGSVKVK 347 >emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera] Length = 1109 Score = 162 bits (409), Expect(2) = 2e-43 Identities = 101/288 (35%), Positives = 154/288 (53%), Gaps = 9/288 (3%) Frame = +2 Query: 458 ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQK 628 + +N G+ + +L + ++ L +NL+S+ +L+ Y++ +G TC ++Q Sbjct: 336 
VAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGSTCVIKDKKFDQI 391 Query: 629 IVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPK 793 IV N LF ++ IE + T SNLWH R GH+N + L + + G Sbjct: 392 IVDVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLP 451 Query: 794 KLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTD 970 K+ +C I K KKPF KG +R ++ CLEIIH+DLCGP+ + G Y L FTD Sbjct: 452 KIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQIASFGGSRYFLLFTD 511 Query: 971 DHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKG 1150 DHS+M+W + L+ K L+ D G EF S +FK +C ++G Sbjct: 512 DHSRMSWVYFLQSK-----------------------VLRTDRGGEFLSNDFKVFCEEEG 548 Query: 1151 IKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 + +ELT PY+ +QNGV ERKNRT++EM RS++++ LS +W E T Sbjct: 549 LHRELTTPYSPEQNGVVERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 596 Score = 42.4 bits (98), Expect(2) = 2e-43 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420 +AN VE +E+QV +F + + + WFLDS C++HMTG K +F ++ K Sbjct: 264 QANYVEQEEDQVKLFMAYNEEVVXSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 323 Query: 421 VEERLLVKGVG 453 ++++ V+G G Sbjct: 324 DDKQVXVEGKG 334 >emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera] Length = 1183 Score = 182 bits (463), Expect = 2e-43 Identities = 108/293 (36%), Positives = 169/293 (57%), Gaps = 14/293 (4%) Frame = +2 Query: 458 ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC-------RI 613 + +N G+ + +L + ++ L +NL+S+ +L+ Y++ +G TC +I Sbjct: 284 VAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQI 339 Query: 614 PYNQKIVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--V 778 ++ ++ A N LF ++ IE + T SNLWH R GH+N + L + + Sbjct: 340 IFDVRMAA-----NKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEM 394 Query: 779 SHGPKKLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYI 955 G K+ +C I K KKPF KG +R ++ CLEIIH+DLCGP+ + G Y Sbjct: 395 
VFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYF 454 Query: 956 LTFTDDHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNY 1135 L FT+DHS+M+W + L+ K+E FE+F KF V+ + I L+ D G EF S +FK + Sbjct: 455 LLFTNDHSRMSWVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVF 514 Query: 1136 CVKKGIKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 C ++G+ +ELT PY+ +QNGVAERKNRT++EM RS++++ LS +W E T Sbjct: 515 CEEEGLHRELTTPYSPEQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 567 >gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1352 Score = 168 bits (426), Expect(2) = 3e-43 Identities = 108/283 (38%), Positives = 159/283 (56%), Gaps = 7/283 (2%) Frame = +2 Query: 458 ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637 I L G+++ I ++ Y+ ++ N++SL +LL Y++ KDN + R + I Sbjct: 381 IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436 Query: 638 EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799 +N +FV+ + + +C + E S LWH R GH+N E L + V P Sbjct: 437 VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496 Query: 800 LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 P ++C + K K F K + + K LE+IH+D+CGPI P + +Y L F DD Sbjct: 497 HPNQVCEGCLLGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ TW + L+ KSEVFE F KF V+ E I ++ D G EFTS EF YC GI+ Sbjct: 557 SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT+P + QQNGVAERKNRT++EM RS+L+S +L + W EA Sbjct: 617 RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659 Score = 35.0 bits (79), Expect(2) = 3e-43 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%) Frame = +1 Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393 S+ + KAN VE K E+ + + KD + W+LDS ++HM G K MF Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357 Query: 394 AYG*KFVKIVEERLLVKGVG 453 
+ E ++ VKG G Sbjct: 358 SVRGNVALGDESKMEVKGKG 377 >gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana] gi|12321387|gb|AAG50765.1|AC079131_10 copia-type polyprotein, putative [Arabidopsis thaliana] Length = 1320 Score = 167 bits (424), Expect(2) = 6e-43 Identities = 108/283 (38%), Positives = 159/283 (56%), Gaps = 7/283 (2%) Frame = +2 Query: 458 ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637 I L G+++ I ++ Y+ ++ N++SL +LL Y++ KDN + R + I Sbjct: 381 IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436 Query: 638 EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799 +N +FV+ + + +C + E S LWH R GH+N E L + V P Sbjct: 437 VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496 Query: 800 LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 P ++C + K K F K + + K LE+IH+D+CGPI P + +Y L F DD Sbjct: 497 HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ TW + L+ KSEVFE F KF V+ E I ++ D G EFTS EF YC GI+ Sbjct: 557 SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT+P + QQNGVAERKNRT++EM RS+L+S +L + W EA Sbjct: 617 RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659 Score = 35.0 bits (79), Expect(2) = 6e-43 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%) Frame = +1 Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393 S+ + KAN VE K E+ + + KD + W+LDS ++HM G K MF Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357 Query: 394 AYG*KFVKIVEERLLVKGVG 453 + E ++ VKG G Sbjct: 358 SVRGNVALGDESKMEVKGKG 377 >gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana] Length = 1352 Score = 166 bits (420), Expect(2) = 1e-42 Identities = 107/283 (37%), Positives = 158/283 (55%), Gaps = 7/283 (2%) Frame = +2 Query: 458 
ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637 I L G+++ I ++ Y+ ++ N++SL +LL Y++ KDN + R + I Sbjct: 381 IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436 Query: 638 EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799 +N +FV+ + + +C + E S LWH R GH+N E L + V P Sbjct: 437 VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496 Query: 800 LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 P ++C + K K F K + + K LE+IH+D+CGPI P + +Y L F DD Sbjct: 497 HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ TW + L+ KSEVFE F KF V+ E I ++ D G EFTS EF YC GI+ Sbjct: 557 SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT+P + QQNGV ERKNRT++EM RS+L+S +L + W EA Sbjct: 617 RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEA 659 Score = 35.8 bits (81), Expect(2) = 1e-42 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 5/80 (6%) Frame = +1 Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393 S+ + KAN VE K E+ + + KD ++ W+LDS ++HM G K MF Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDE 357 Query: 394 AYG*KFVKIVEERLLVKGVG 453 + E ++ VKG G Sbjct: 358 SVRGNVALGDESKMEVKGKG 377 >emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1272 Score = 166 bits (420), Expect(2) = 2e-42 Identities = 107/283 (37%), Positives = 159/283 (56%), Gaps = 7/283 (2%) Frame = +2 Query: 458 ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637 I L G+++ I ++ Y+ ++ N++SL +LL Y++ KDN + R + I Sbjct: 381 IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDKESNLITK 436 Query: 638 EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799 +N +FV+ + + +C + E S LWH R GH+N E L + V P Sbjct: 437 VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496 Query: 800 
LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 P ++C + K F K + + K LE+IH+D+CGPI P + +Y L F DD Sbjct: 497 HPNQVCEGCLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ TW + L+ KSEVFE F KF V+ E I ++ D+G EFTS EF YC GI+ Sbjct: 557 SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCEDNGIR 616 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT+P + QQNGVAERKNRT++EM RS+L+S +L + W EA Sbjct: 617 RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659 Score = 35.0 bits (79), Expect(2) = 2e-42 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%) Frame = +1 Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393 S+ + KAN VE K E+ + + KD + W+LDS ++HM G K MF Sbjct: 298 SNKKFKEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357 Query: 394 AYG*KFVKIVEERLLVKGVG 453 + E ++ VKG G Sbjct: 358 SVRGNVALGDESKMEVKGKG 377 >dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1499 Score = 172 bits (436), Expect(2) = 4e-42 Identities = 101/287 (35%), Positives = 161/287 (56%), Gaps = 7/287 (2%) Frame = +2 Query: 455 DITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIV 634 DI ++ ++ D LYV L +NL+S+ ++++ Y + +DN + + + Sbjct: 367 DIRVSTNKGDHVIKDVLYVPELARNLLSVSQMISNGYRVIF--EDNKCVIQDLKGRKILD 424 Query: 635 AEGTEQNGLFVMKPMLIECFLT---NTEISNLWHNRLGHINN---EYLWKVGAVSHGPKK 796 + +++ + K E ++ E ++LWH R GH+N E + + V PK Sbjct: 425 IKMKDRSFPIIWKKSREETYMAFEEKEEQTDLWHKRFGHVNYDKIETMQTLKIVEKLPKF 484 Query: 797 LLPTKMCSS*ITAKLHKKPFNKGTRIST-KCLEIIHSDLCGPITPPTTHGKSYILTFTDD 973 + +C++ K ++ F K ++ +T K LE+IHSD+CGP+ + +G Y LTF DD Sbjct: 485 EVIKGICAACEMGKQSRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDD 544 Query: 974 HSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGI 1153 S+MTW + L++KSEV F F V+ + S+I L+ D G EF S EF C + GI Sbjct: 545 FSRMTWVYFLKNKSEVITKFKIFKPYVENQSESRIKRLRTDGGGEFLSREFIKLCQESGI 604 Query: 1154 
KKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 E+T PY+ QQNGVAER+NRTL+EM RS+++ +LS K+W EA T Sbjct: 605 HHEITTPYSPQQNGVAERRNRTLVEMARSMIEEKKLSNKFWAEAIAT 651 Score = 27.7 bits (60), Expect(2) = 4e-42 Identities = 16/89 (17%), Positives = 37/89 (41%), Gaps = 2/89 (2%) Frame = +1 Query: 118 CYICDMNNHETKYYFFNAKGTNYSPNRGSKPRST*QISSNQATSKANMV--EIKENQVYV 291 CY+CD H + + +G + + + S ++ + +M+ ++E ++ Sbjct: 268 CYVCDKQGHIAR---------DCKLRKGERAHLSIEESEDEKEDECHMLFSAVEEKEI-- 316 Query: 292 FFTNKDFNRDGWFLDSDCNSHMTGNKEMF 378 + W +DS C +HM+ + F Sbjct: 317 ----STIGEETWLVDSGCTNHMSKDVRHF 341 >emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] Length = 1352 Score = 166 bits (420), Expect(2) = 4e-42 Identities = 107/283 (37%), Positives = 158/283 (55%), Gaps = 7/283 (2%) Frame = +2 Query: 458 ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637 I L G+++ I ++ Y+ ++ N++SL +LL Y++ KDN + R + I Sbjct: 381 IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436 Query: 638 EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799 +N +FV+ + + +C + E S LWH R GH+N E L + V P Sbjct: 437 VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496 Query: 800 LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 P ++C + K K F K + + K LE+IH+D+CGPI P + +Y L F DD Sbjct: 497 HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ TW + L+ KSEVFE F KF V+ E I ++ D G EFTS EF YC GI+ Sbjct: 557 SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT+P + QQNGV ERKNRT++EM RS+L+S +L + W EA Sbjct: 617 RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEA 659 Score = 33.9 bits (76), Expect(2) = 4e-42 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 5/80 (6%) Frame = +1 Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393 S+ + KA+ VE K E+ + + KD ++ W+LDS ++HM G K MF Sbjct: 298 
SNKKFEEKAHYVEEKIQEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDE 357 Query: 394 AYG*KFVKIVEERLLVKGVG 453 + E ++ VKG G Sbjct: 358 SVRGNVALGDESKMEVKGKG 377 >dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana] gi|13872710|emb|CAC37622.1| polyprotein [Arabidopsis thaliana] Length = 1334 Score = 162 bits (409), Expect(2) = 4e-42 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%) Frame = +2 Query: 488 ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658 +++D +V GL+ NL S+ +L K F + D C + + +++V T +N Sbjct: 351 VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 405 Query: 659 LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805 +FV+ + C + +N+WH R GH+N++ L + V PK L Sbjct: 406 MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 465 Query: 806 TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 + +C + K ++ K + ST+ L+++H+D+CGPI P +T GK YIL F DD Sbjct: 466 EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 525 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ WT++L KSE F+ F +F +V+ E K++CL+ D G E+ S EF YC + GIK Sbjct: 526 SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 585 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT Y QQNGVAERKNR+++ M R +L + K+W EA Sbjct: 586 RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 628 Score = 38.1 bits (87), Expect(2) = 4e-42 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423 +AN VE++E+ + + + D + WFLDS C++HM G +E F + Sbjct: 270 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 329 Query: 424 EERLLVKGVGR 456 + R+ V+G G+ Sbjct: 330 DRRMAVEGKGK 340 >gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768-32772 [Arabidopsis thaliana] Length = 1334 Score = 162 bits (409), Expect(2) = 4e-42 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%) Frame = +2 Query: 488 
ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658 +++D +V GL+ NL S+ +L K F + D C + + +++V T +N Sbjct: 351 VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 405 Query: 659 LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805 +FV+ + C + +N+WH R GH+N++ L + V PK L Sbjct: 406 MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 465 Query: 806 TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 + +C + K ++ K + ST+ L+++H+D+CGPI P +T GK YIL F DD Sbjct: 466 EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 525 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ WT++L KSE F+ F +F +V+ E K++CL+ D G E+ S EF YC + GIK Sbjct: 526 SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 585 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT Y QQNGVAERKNR+++ M R +L + K+W EA Sbjct: 586 RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 628 Score = 38.1 bits (87), Expect(2) = 4e-42 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423 +AN VE++E+ + + + D + WFLDS C++HM G +E F + Sbjct: 270 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 329 Query: 424 EERLLVKGVGR 456 + R+ V+G G+ Sbjct: 330 DRRMAVEGKGK 340 >gb|AAF25964.2|AC017118_1 F6N18.1 [Arabidopsis thaliana] Length = 1207 Score = 162 bits (409), Expect(2) = 4e-42 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%) Frame = +2 Query: 488 ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658 +++D +V GL+ NL S+ +L K F + D C + + +++V T +N Sbjct: 256 VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 310 Query: 659 LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805 +FV+ + C + +N+WH R GH+N++ L + V PK L Sbjct: 311 MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 370 Query: 806 TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976 + +C + K ++ K + 
ST+ L+++H+D+CGPI P +T GK YIL F DD Sbjct: 371 EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 430 Query: 977 SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156 S+ WT++L KSE F+ F +F +V+ E K++CL+ D G E+ S EF YC + GIK Sbjct: 431 SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 490 Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 ++LT Y QQNGVAERKNR+++ M R +L + K+W EA Sbjct: 491 RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 533 Score = 38.1 bits (87), Expect(2) = 4e-42 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%) Frame = +1 Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423 +AN VE++E+ + + + D + WFLDS C++HM G +E F + Sbjct: 175 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 234 Query: 424 EERLLVKGVGR 456 + R+ V+G G+ Sbjct: 235 DRRMAVEGKGK 245 >ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898|gb|EDW75799.1| GK15001 [Drosophila willistoni] Length = 1249 Score = 168 bits (426), Expect(2) = 5e-42 Identities = 99/285 (34%), Positives = 152/285 (53%), Gaps = 6/285 (2%) Frame = +2 Query: 458 ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFA-FQKDNGQTCCRIPYNQKIV 634 +T+ G K+ + + LYV GL N +S+ +++ +YN F+K +I N + + Sbjct: 312 VTIRTGICKLTMNNVLYVPGLAGNFMSVARVI--EYNSVVHFEKH----MAKIIQNGECI 365 Query: 635 AEGTEQNGLFVMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKVGAVSH----GPKKLL 802 + + LFV + F E +LWH R GH+N + L ++ + Sbjct: 366 LKAKKIGNLFVFEAESENLFAAVGEDVSLWHKRFGHLNYKSLTQIASKGLVRGLSVTNFA 425 Query: 803 PTKMCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHS 979 P C + + +K+H +PF K T S++ L+++HSD+CGP + G Y LTF DD S Sbjct: 426 PNTPCKTCMVSKIHVQPFPKMTESRSSELLQLVHSDVCGPFGTKSLGGSRYFLTFIDDKS 485 Query: 980 KMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKK 1159 + + + L+ K EVF F++F V+ + K+ C++ DNG E+ + F +Y K GI + Sbjct: 486 RRIFVYFLKGKDEVFGKFLEFKSLVERQTGKKLKCIRSDNGREYVNNAFDDYLKKNGILR 545 Query: 1160 ELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 +LTI Y QQNGVAER NRTL+EM R LL S 
L W EA T Sbjct: 546 QLTIAYTPQQNGVAERANRTLVEMSRCLLAQSGLCEALWAEAIFT 590 Score = 31.2 bits (69), Expect(2) = 5e-42 Identities = 12/30 (40%), Positives = 19/30 (63%) Frame = +1 Query: 304 KDFNRDGWFLDSDCNSHMTGNKEMFTNFRI 393 ++ R+ W LDS SHM +K MF++F + Sbjct: 262 ENMKREKWCLDSGATSHMCCDKSMFSDFSV 291 >gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana] Length = 1356 Score = 164 bits (415), Expect(2) = 6e-42 Identities = 97/275 (35%), Positives = 140/275 (50%), Gaps = 6/275 (2%) Frame = +2 Query: 488 ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNGLFV 667 IL + YV LR+NLIS L + ++ + G+ R N K G+ NGL+V Sbjct: 363 ILENVKYVPHLRRNLISTGTL-----DKLGYRHEGGEGKVRYFKNNKTALRGSLSNGLYV 417 Query: 668 MKPMLIECFLTNTEISN----LWHNRLGHIN-NEYLWKVGAVSHGPKKLLPTKMCSS*IT 832 + + L N E LWH+RLGH++ N G K++ + C + Sbjct: 418 LDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELEFCEHCVM 477 Query: 833 AKLHKKPFNKGTRISTKCLEIIHSDLCG-PITPPTTHGKSYILTFTDDHSKMTWTFILRH 1009 K K FN G S L +H+DL G P P+ GK Y L+ DD ++ W + L+ Sbjct: 478 GKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKVWLYFLKS 537 Query: 1010 KSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPYNTQQ 1189 K E F+ F ++ V+ + K+ CL+ DNG+EF + F +YC + GI++ T Y QQ Sbjct: 538 KDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRTCTYTPQQ 597 Query: 1190 NGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 NGVAER NRT++E VR LL S + +W EAA T Sbjct: 598 NGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAAT 632 Score = 35.0 bits (79), Expect(2) = 6e-42 Identities = 13/30 (43%), Positives = 19/30 (63%) Frame = +1 Query: 301 NKDFNRDGWFLDSDCNSHMTGNKEMFTNFR 390 N+ +D W LDS C SHMT ++ F +F+ Sbjct: 300 NEQMVKDLWILDSGCTSHMTSRRDWFISFQ 329 >gb|AGW47867.1| polyprotein [Phaseolus vulgaris] Length = 1471 Score = 166 bits (421), Expect(2) = 4e-41 Identities = 101/276 (36%), Positives = 153/276 (55%), Gaps = 11/276 (3%) Frame = +2 Query: 491 LTDALYVEGLRKNLISLYKLLT*KYNMFAFQK----DNGQTCCRIPYNQKIVAEGTEQNG 658 L D YV L+ N++S+ +L Y++F + N Q C + +N Sbjct: 412 
LQDVYYVPDLKTNILSMGQLTEKGYSIFLKDRFLHLKNKQGCL-------VARIEMARNR 464 Query: 659 LFVMKPMLI--ECFLTNTEI-SNLWHNRLGHINNEYLWKVGAVS--HG-PKKLLPTKMCS 820 ++ + I +C N E ++LWH R GH+++ L ++ + HG P K C Sbjct: 465 MYKLNLRSIREKCLQVNIEDKASLWHLRFGHLHHGGLKELAKKNMVHGLPNMDYEGKFCE 524 Query: 821 S*ITAKLHKKPFNKGTRISTKC-LEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWTF 997 + +K + F K + K LE+IH+D+CGPITP + GK Y +TF DD S+ TW + Sbjct: 525 ECVLSKHVRTSFPKKAQYWAKQPLELIHTDICGPITPESFSGKRYFITFIDDFSRKTWVY 584 Query: 998 ILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPY 1177 L+ KSE FE F KF V+ +I ++ D G E+TS F YC ++GI++ LT PY Sbjct: 585 FLKEKSEAFEVFKKFKVMVERTTDKQIKAVRSDRGGEYTSTTFMEYCEEQGIRRFLTAPY 644 Query: 1178 NTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285 QQNGVAERKNRT+++MVRS+L+S ++ ++W EA Sbjct: 645 TPQQNGVAERKNRTILDMVRSMLKSKKMPKEFWAEA 680 Score = 30.0 bits (66), Expect(2) = 4e-41 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 1/56 (1%) Frame = +1 Query: 226 ISSNQATSKANMVEIKENQVYVFFTNKDFNRDG-WFLDSDCNSHMTGNKEMFTNFR 390 I + T+ A VE E + + + N D W+LDS ++HM G++ +F + + Sbjct: 322 IKIEETTNLALEVETNEGVLLMAQDEVNINNDTLWYLDSGASNHMCGHEYLFKDMQ 377 >pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster) transposon 1731 gi|8702|emb|CAA30503.1| unnamed protein product [Drosophila melanogaster] Length = 982 Score = 167 bits (422), Expect(2) = 5e-41 Identities = 98/275 (35%), Positives = 150/275 (54%), Gaps = 5/275 (1%) Frame = +2 Query: 485 IILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNGLF 664 ++L + L+V L N +S+ + +Y F + G + + + L+ Sbjct: 65 LVLNNVLFVPDLNGNFMSVSRAA--QYKCFV---NFGPHYADVIQEGERILRVMRAGNLY 119 Query: 665 VMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKV--GAVSHGPKKLL--PTKMCSS*IT 832 + + CF +LWH R GH+N L ++ + +G +K++ P +C + + Sbjct: 120 MFQGKHNSCFAAVDADGSLWHKRNGHLNTSSLQEMVRKKMVYGVEKVVFKPDAVCKTCML 179 Query: 833 AKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWTFILRH 1009 AK+H +PF K TR + + L++IHSDLCGP + P+ G Y LTF DD S+ + + LR Sbjct: 180 
AKIHVQPFPKTTRSRAEELLDMIHSDLCGPFSTPSLAGSKYFLTFIDDKSRRIFVYFLRK 239 Query: 1010 KSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPYNTQQ 1189 K EVF F++F K V+ + KI C++ DNG EF + F +Y GI ++LTIP+ QQ Sbjct: 240 KDEVFTKFVEFKKLVERQTGRKIKCIRSDNGGEFVNNVFDDYLKAHGIARQLTIPHTPQQ 299 Query: 1190 NGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294 NGVAER NRTL+EM R +L S+L W EA T Sbjct: 300 NGVAERANRTLVEMARCMLLQSELGEALWAEAINT 334 Score = 29.3 bits (64), Expect(2) = 5e-41 Identities = 17/48 (35%), Positives = 23/48 (47%) Frame = +1 Query: 310 FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIVEERLLVKGVG 453 F + W LDS SHM ++ +FT F + K LL KG+G Sbjct: 8 FGKTQWCLDSGATSHMCCDRSVFTEFE-EHTEKISLAGNGFLLAKGIG 54