BLASTX nr result

ID: Ephedra27_contig00026656 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00026656
         (1294 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera]   185   3e-50
emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]   180   1e-48
emb|CBI37296.3| unnamed protein product [Vitis vinifera]              171   2e-45
gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum ...   174   4e-44
emb|CAB75932.1| putative protein [Arabidopsis thaliana]               163   7e-44
emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera]   162   2e-43
emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]   182   2e-43
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   168   3e-43
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   167   6e-43
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   166   1e-42
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   166   2e-42
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...   172   4e-42
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         166   4e-42
dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana] gi...   162   4e-42
gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768...   162   4e-42
gb|AAF25964.2|AC017118_1 F6N18.1 [Arabidopsis thaliana]               162   4e-42
ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898...   168   5e-42
gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Ar...   164   6e-42
gb|AGW47867.1| polyprotein [Phaseolus vulgaris]                       166   4e-41
pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster...   167   5e-41

>emb|CAN60366.1| hypothetical protein VITISV_031870 [Vitis vinifera]
          Length = 1274

 Score =  185 bits (470), Expect(2) = 3e-50
 Identities = 110/288 (38%), Positives = 166/288 (57%), Gaps = 9/288 (3%)
 Frame = +2

Query: 458  ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQK 628
            + +N G+  + +L +  ++  L +NL+S+ +L+   Y++      +G TC       +Q 
Sbjct: 350  MAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQI 405

Query: 629  IVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPK 793
            IV      N LF ++   IE     +  T  SNLWH R GH+N + L  +    +  G  
Sbjct: 406  IVNVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLP 465

Query: 794  KLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTD 970
            K+    +C   I  K  KKPF KG +R ++ CLEIIH+DLCGP+   +  G  Y L FTD
Sbjct: 466  KIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYFLLFTD 525

Query: 971  DHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKG 1150
            DHS+M+W + L+ K+E FE+F KF   V+ +    I  L+ D G EF S +FK +C ++G
Sbjct: 526  DHSRMSWVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVFCEEEG 585

Query: 1151 IKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            + +ELT PY+ +QNGVAERKNRT++EM RS++++  LS  +W E   T
Sbjct: 586  LHRELTTPYSPEQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 633



 Score = 41.6 bits (96), Expect(2) = 3e-50
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420
           +AN VE +E+QV +F    +    + + WFLDS C++HMTG K +F     ++  K    
Sbjct: 278 QANYVEQEEDQVKLFMXYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 337

Query: 421 VEERLLVKGVG 453
            ++++ V+G G
Sbjct: 338 DDKQVXVEGKG 348


>emb|CAN71759.1| hypothetical protein VITISV_020777 [Vitis vinifera]
          Length = 1472

 Score =  180 bits (456), Expect(2) = 1e-48
 Identities = 110/282 (39%), Positives = 161/282 (57%), Gaps = 8/282 (2%)
 Frame = +2

Query: 473  GNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQKIVAEGT 646
            GN K+ L +  ++  L +NL+S+ +L+   Y++      +G TC       +Q IV    
Sbjct: 342  GNVKL-LYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQIIVBVRM 396

Query: 647  EQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPKKLLPTK 811
              N LF ++   IE     +  T  SNLWH R GH+N + L  +    +  G  K+    
Sbjct: 397  AANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLPKIDSVN 456

Query: 812  MCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMT 988
            +C   I  K  KKPF KG +R ++ CLEIIH+DLCGP+   +  G  Y L FTDDHS+M+
Sbjct: 457  VCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYFLLFTDDHSRMS 516

Query: 989  WTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELT 1168
            W + L+ K+E FE+F KF   V+ +    I  L+ D G EF S +FK +  ++G+ +ELT
Sbjct: 517  WVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVFXEEEGLHRELT 576

Query: 1169 IPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
             PY+  QNGVAERKNRT++EM RS++++  LS  +W E   T
Sbjct: 577  TPYSPXQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 618



 Score = 41.6 bits (96), Expect(2) = 1e-48
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420
           +AN VE +E+QV +F    +    + + WFLDS C++HMTG K +F     ++  K    
Sbjct: 263 QANYVEQEEDQVKLFMAYNEEVVSSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 322

Query: 421 VEERLLVKGVG 453
            ++++ V+G G
Sbjct: 323 DDKQVQVEGKG 333


>emb|CBI37296.3| unnamed protein product [Vitis vinifera]
          Length = 3048

 Score =  171 bits (434), Expect(2) = 2e-45
 Identities = 106/277 (38%), Positives = 157/277 (56%), Gaps = 11/277 (3%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQ--NGL 661
            I+T   YV  L+ NL+S+ +L   K     FQ       C++ ++QK +   T+   N +
Sbjct: 374  IITGVFYVPELKNNLLSIGQLQE-KGLTILFQHGK----CKVFHSQKGLIMDTKMSSNRM 428

Query: 662  FVM----KPMLIECFLTNTE-ISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLPTKMC 817
            F++    +P+   CF T TE I  LWH R GH++ + L  +     V+  P+   P+K+C
Sbjct: 429  FMLYALSQPISSTCFNTVTEDILQLWHCRYGHLSFQGLKTLQQRKMVNGLPQFQPPSKLC 488

Query: 818  SS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWT 994
               +  K H+    K +   + + L+++H+D+CGPI P +   K Y+LTFTDD S+ TW 
Sbjct: 489  KDCLVGKQHRSSIPKKSNWRAAEILQLVHADICGPINPISNSKKRYLLTFTDDFSRKTWV 548

Query: 995  FILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIP 1174
            + L  KSE F  F  F   V+ E  S + CL+ D G EFTS EF  +C   GI+++LT  
Sbjct: 549  YFLVEKSEAFAVFKSFKTYVEKETSSFLRCLRTDRGGEFTSQEFAIFCDVHGIRRQLTAA 608

Query: 1175 YNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            Y  QQNGVAERKNRT++ MVRS+L + +L   +W EA
Sbjct: 609  YTPQQNGVAERKNRTIMNMVRSMLSAKKLPKTFWPEA 645



 Score = 39.7 bits (91), Expect(2) = 2e-45
 Identities = 15/29 (51%), Positives = 20/29 (68%)
 Frame = +1

Query: 301 NKDFNRDGWFLDSDCNSHMTGNKEMFTNF 387
           NK    D WFLDS C++HM G K+ F++F
Sbjct: 312 NKTSREDTWFLDSGCSNHMCGKKDYFSDF 340


>gb|AAT38758.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1333

 Score =  174 bits (442), Expect(2) = 4e-44
 Identities = 107/287 (37%), Positives = 154/287 (53%), Gaps = 13/287 (4%)
 Frame = +2

Query: 473  GNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQ-------KDNGQTCCRIPYNQKI 631
            GN K  L D  YV  L  NL+S+ +L+T  Y++  +        K++G+T  R+P  Q  
Sbjct: 362  GNVKF-LYDVQYVPTLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGRTIARVPMTQNK 420

Query: 632  VAE---GTEQNGLFVMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKVGAVSH--GPKK 796
            +         N   V+K             +NLWH R GH+N  +L  +       G   
Sbjct: 421  MFPLDISNVGNSALVVK---------EKNETNLWHLRYGHLNVNWLKLLVQKDMVIGLPN 471

Query: 797  LLPTKMCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDD 973
            +    +C   I  K  +K F  G    +T CLE++H+DLCGP+   +  G  Y L FTDD
Sbjct: 472  IKELDLCEGCIYGKQTRKSFPVGKSWRATTCLELVHADLCGPMKMESLGGSRYFLMFTDD 531

Query: 974  HSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGI 1153
            +S+ +W + L+ KSE FE+F KF   V+ +  +KI  L+ D G EF S +F  +C + GI
Sbjct: 532  YSRFSWVYFLKFKSETFETFKKFKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGI 591

Query: 1154 KKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            ++ELT PY  +QNGVAERKNRT++EM RS L++  L   +WGEA  T
Sbjct: 592  RRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGEAVAT 638



 Score = 32.0 bits (71), Expect(2) = 4e-44
 Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 3/45 (6%)
 Frame = +1

Query: 253 ANMVEIKENQVYVFFTNKDFNRDG---WFLDSDCNSHMTGNKEMF 378
           AN  +  E +  +F  +          WF+DS C++HM+ +K +F
Sbjct: 284 ANFTQNVEEESKLFMASSQITESANAVWFIDSGCSNHMSSSKSLF 328


>emb|CAB75932.1| putative protein [Arabidopsis thaliana]
          Length = 1339

 Score =  163 bits (413), Expect(2) = 7e-44
 Identities = 103/279 (36%), Positives = 151/279 (54%), Gaps = 13/279 (4%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNG--- 658
            ++ +  YV  LR NL+SL +L   +  +    +D     C++ +  K     T  +G   
Sbjct: 354  VIPEVYYVPELRNNLLSLGQLQ--ERGLAILIRDG---TCKVYHPSKGAIMETNMSGNRM 408

Query: 659  --LFVMKPMLIECFLTNTEI----SNLWHNRLGHINNEYLWKVG--AVSHGPKKLLPTK- 811
              L   KP      L   E+    ++LWH R GH+N E L  +    +  G   L  TK 
Sbjct: 409  FFLLASKPQKNSLCLQTEEVMDKENHLWHCRFGHLNQEGLKLLAHKKMVIGLPILKATKE 468

Query: 812  MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMT 988
            +C+  +T K H++  +K T   S+  L+++HSD+CGPITP +  GK YIL+F DD ++ T
Sbjct: 469  ICAICLTGKQHRESMSKKTSWKSSTQLQLVHSDICGPITPISHSGKRYILSFIDDFTRKT 528

Query: 989  WTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELT 1168
            W + L  KSE F +F  F   V+ E  + + CL+ D G EFTS EF  +C   GI ++LT
Sbjct: 529  WVYFLHEKSEAFATFKIFKASVEKEIGAFLTCLRTDRGGEFTSNEFGEFCRSHGISRQLT 588

Query: 1169 IPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
              +  QQNGVAERKNRT++  VRS+L   Q+   +W EA
Sbjct: 589  AAFTPQQNGVAERKNRTIMNAVRSMLSERQVPKMFWSEA 627



 Score = 42.4 bits (98), Expect(2) = 7e-44
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
 Frame = +1

Query: 253 ANMVEIKENQ---VYVFFTNKDFNRDG-WFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420
           AN  E++E +   +  +      NRD  WFLDS C++HMTG+KE F+     +  + VK+
Sbjct: 272 ANYAELEEEEELLLMAYVEQNQANRDEVWFLDSGCSNHMTGSKEWFSELEEGFN-RTVKL 330

Query: 421 VEE-RLLVKGVGRYYTK 468
             + R+ V G G    K
Sbjct: 331 GNDTRMSVVGKGSVKVK 347


>emb|CAN79116.1| hypothetical protein VITISV_002093 [Vitis vinifera]
          Length = 1109

 Score =  162 bits (409), Expect(2) = 2e-43
 Identities = 101/288 (35%), Positives = 154/288 (53%), Gaps = 9/288 (3%)
 Frame = +2

Query: 458  ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC--RIPYNQK 628
            + +N G+  + +L +  ++  L +NL+S+ +L+   Y++      +G TC      ++Q 
Sbjct: 336  VAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGSTCVIKDKKFDQI 391

Query: 629  IVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--VSHGPK 793
            IV      N LF ++   IE     +  T  SNLWH R GH+N + L  +    +  G  
Sbjct: 392  IVDVRMAANKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEMVFGLP 451

Query: 794  KLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTD 970
            K+    +C   I  K  KKPF KG +R ++ CLEIIH+DLCGP+   +  G  Y L FTD
Sbjct: 452  KIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQIASFGGSRYFLLFTD 511

Query: 971  DHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKG 1150
            DHS+M+W + L+ K                        L+ D G EF S +FK +C ++G
Sbjct: 512  DHSRMSWVYFLQSK-----------------------VLRTDRGGEFLSNDFKVFCEEEG 548

Query: 1151 IKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            + +ELT PY+ +QNGV ERKNRT++EM RS++++  LS  +W E   T
Sbjct: 549  LHRELTTPYSPEQNGVVERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 596



 Score = 42.4 bits (98), Expect(2) = 2e-43
 Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNKD---FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKI 420
           +AN VE +E+QV +F    +    + + WFLDS C++HMTG K +F     ++  K    
Sbjct: 264 QANYVEQEEDQVKLFMAYNEEVVXSNNIWFLDSGCSNHMTGIKSLFKELDESHKLKVKLG 323

Query: 421 VEERLLVKGVG 453
            ++++ V+G G
Sbjct: 324 DDKQVXVEGKG 334


>emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]
          Length = 1183

 Score =  182 bits (463), Expect = 2e-43
 Identities = 108/293 (36%), Positives = 169/293 (57%), Gaps = 14/293 (4%)
 Frame = +2

Query: 458  ITLNFGNNKI-ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCC-------RI 613
            + +N G+  + +L +  ++  L +NL+S+ +L+   Y++      +G TC        +I
Sbjct: 284  VAVNNGHGNVKLLYNVYFIPSLTQNLLSVGQLMVSGYSILF----DGATCVIKDKKSDQI 339

Query: 614  PYNQKIVAEGTEQNGLFVMKPMLIE---CFLTNTEISNLWHNRLGHINNEYLWKVGA--V 778
             ++ ++ A     N LF ++   IE     +  T  SNLWH R GH+N + L  +    +
Sbjct: 340  IFDVRMAA-----NKLFPLEVSSIEKHALVVKETSESNLWHLRYGHLNVKGLKLLSKKEM 394

Query: 779  SHGPKKLLPTKMCSS*ITAKLHKKPFNKG-TRISTKCLEIIHSDLCGPITPPTTHGKSYI 955
              G  K+    +C   I  K  KKPF KG +R ++ CLEIIH+DLCGP+   +  G  Y 
Sbjct: 395  VFGLPKIDSVNVCEGCIYGKQSKKPFPKGRSRRASSCLEIIHADLCGPMQTASFGGSRYF 454

Query: 956  LTFTDDHSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNY 1135
            L FT+DHS+M+W + L+ K+E FE+F KF   V+ +    I  L+ D G EF S +FK +
Sbjct: 455  LLFTNDHSRMSWVYFLQSKAETFETFKKFKAFVEKQSGKCIKVLRTDRGGEFLSNDFKVF 514

Query: 1136 CVKKGIKKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            C ++G+ +ELT PY+ +QNGVAERKNRT++EM RS++++  LS  +W E   T
Sbjct: 515  CEEEGLHRELTTPYSPEQNGVAERKNRTVVEMARSMMKAKNLSNHFWAEGVAT 567


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  168 bits (426), Expect(2) = 3e-43
 Identities = 108/283 (38%), Positives = 159/283 (56%), Gaps = 7/283 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637
            I L  G+++ I ++  Y+  ++ N++SL +LL   Y++    KDN  +  R   +  I  
Sbjct: 381  IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436

Query: 638  EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799
                +N +FV+  +  + +C  +   E S LWH R GH+N    E L +   V   P   
Sbjct: 437  VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496

Query: 800  LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             P ++C   +  K  K  F K  +  + K LE+IH+D+CGPI P +    +Y L F DD 
Sbjct: 497  HPNQVCEGCLLGKQFKMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+ TW + L+ KSEVFE F KF   V+ E    I  ++ D G EFTS EF  YC   GI+
Sbjct: 557  SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT+P + QQNGVAERKNRT++EM RS+L+S +L  + W EA
Sbjct: 617  RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659



 Score = 35.0 bits (79), Expect(2) = 3e-43
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
 Frame = +1

Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393
           S+ +   KAN VE K   E+ + +    KD   +   W+LDS  ++HM G K MF     
Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357

Query: 394 AYG*KFVKIVEERLLVKGVG 453
           +         E ++ VKG G
Sbjct: 358 SVRGNVALGDESKMEVKGKG 377


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  167 bits (424), Expect(2) = 6e-43
 Identities = 108/283 (38%), Positives = 159/283 (56%), Gaps = 7/283 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637
            I L  G+++ I ++  Y+  ++ N++SL +LL   Y++    KDN  +  R   +  I  
Sbjct: 381  IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436

Query: 638  EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799
                +N +FV+  +  + +C  +   E S LWH R GH+N    E L +   V   P   
Sbjct: 437  VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496

Query: 800  LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             P ++C   +  K  K  F K  +  + K LE+IH+D+CGPI P +    +Y L F DD 
Sbjct: 497  HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+ TW + L+ KSEVFE F KF   V+ E    I  ++ D G EFTS EF  YC   GI+
Sbjct: 557  SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT+P + QQNGVAERKNRT++EM RS+L+S +L  + W EA
Sbjct: 617  RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659



 Score = 35.0 bits (79), Expect(2) = 6e-43
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
 Frame = +1

Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393
           S+ +   KAN VE K   E+ + +    KD   +   W+LDS  ++HM G K MF     
Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357

Query: 394 AYG*KFVKIVEERLLVKGVG 453
           +         E ++ VKG G
Sbjct: 358 SVRGNVALGDESKMEVKGKG 377


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  166 bits (420), Expect(2) = 1e-42
 Identities = 107/283 (37%), Positives = 158/283 (55%), Gaps = 7/283 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637
            I L  G+++ I ++  Y+  ++ N++SL +LL   Y++    KDN  +  R   +  I  
Sbjct: 381  IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436

Query: 638  EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799
                +N +FV+  +  + +C  +   E S LWH R GH+N    E L +   V   P   
Sbjct: 437  VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496

Query: 800  LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             P ++C   +  K  K  F K  +  + K LE+IH+D+CGPI P +    +Y L F DD 
Sbjct: 497  HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+ TW + L+ KSEVFE F KF   V+ E    I  ++ D G EFTS EF  YC   GI+
Sbjct: 557  SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT+P + QQNGV ERKNRT++EM RS+L+S +L  + W EA
Sbjct: 617  RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEA 659



 Score = 35.8 bits (81), Expect(2) = 1e-42
 Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 5/80 (6%)
 Frame = +1

Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393
           S+ +   KAN VE K   E+ + +    KD  ++   W+LDS  ++HM G K MF     
Sbjct: 298 SNKKFEEKANYVEEKIQEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDE 357

Query: 394 AYG*KFVKIVEERLLVKGVG 453
           +         E ++ VKG G
Sbjct: 358 SVRGNVALGDESKMEVKGKG 377


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  166 bits (420), Expect(2) = 2e-42
 Identities = 107/283 (37%), Positives = 159/283 (56%), Gaps = 7/283 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637
            I L  G+++ I ++  Y+  ++ N++SL +LL   Y++    KDN  +  R   +  I  
Sbjct: 381  IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDKESNLITK 436

Query: 638  EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799
                +N +FV+  +  + +C  +   E S LWH R GH+N    E L +   V   P   
Sbjct: 437  VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496

Query: 800  LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             P ++C   +     K  F K  +  + K LE+IH+D+CGPI P +    +Y L F DD 
Sbjct: 497  HPNQVCEGCLLGNQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+ TW + L+ KSEVFE F KF   V+ E    I  ++ D+G EFTS EF  YC   GI+
Sbjct: 557  SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCEDNGIR 616

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT+P + QQNGVAERKNRT++EM RS+L+S +L  + W EA
Sbjct: 617  RQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAEA 659



 Score = 35.0 bits (79), Expect(2) = 2e-42
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 5/80 (6%)
 Frame = +1

Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393
           S+ +   KAN VE K   E+ + +    KD   +   W+LDS  ++HM G K MF     
Sbjct: 298 SNKKFKEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFAELDE 357

Query: 394 AYG*KFVKIVEERLLVKGVG 453
           +         E ++ VKG G
Sbjct: 358 SVRGNVALGDESKMEVKGKG 377


>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1499

 Score =  172 bits (436), Expect(2) = 4e-42
 Identities = 101/287 (35%), Positives = 161/287 (56%), Gaps = 7/287 (2%)
 Frame = +2

Query: 455  DITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIV 634
            DI ++      ++ D LYV  L +NL+S+ ++++  Y +    +DN      +   + + 
Sbjct: 367  DIRVSTNKGDHVIKDVLYVPELARNLLSVSQMISNGYRVIF--EDNKCVIQDLKGRKILD 424

Query: 635  AEGTEQNGLFVMKPMLIECFLT---NTEISNLWHNRLGHINN---EYLWKVGAVSHGPKK 796
             +  +++   + K    E ++      E ++LWH R GH+N    E +  +  V   PK 
Sbjct: 425  IKMKDRSFPIIWKKSREETYMAFEEKEEQTDLWHKRFGHVNYDKIETMQTLKIVEKLPKF 484

Query: 797  LLPTKMCSS*ITAKLHKKPFNKGTRIST-KCLEIIHSDLCGPITPPTTHGKSYILTFTDD 973
             +   +C++    K  ++ F K ++ +T K LE+IHSD+CGP+   + +G  Y LTF DD
Sbjct: 485  EVIKGICAACEMGKQSRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDD 544

Query: 974  HSKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGI 1153
             S+MTW + L++KSEV   F  F   V+ +  S+I  L+ D G EF S EF   C + GI
Sbjct: 545  FSRMTWVYFLKNKSEVITKFKIFKPYVENQSESRIKRLRTDGGGEFLSREFIKLCQESGI 604

Query: 1154 KKELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
              E+T PY+ QQNGVAER+NRTL+EM RS+++  +LS K+W EA  T
Sbjct: 605  HHEITTPYSPQQNGVAERRNRTLVEMARSMIEEKKLSNKFWAEAIAT 651



 Score = 27.7 bits (60), Expect(2) = 4e-42
 Identities = 16/89 (17%), Positives = 37/89 (41%), Gaps = 2/89 (2%)
 Frame = +1

Query: 118 CYICDMNNHETKYYFFNAKGTNYSPNRGSKPRST*QISSNQATSKANMV--EIKENQVYV 291
           CY+CD   H  +         +    +G +   + + S ++   + +M+   ++E ++  
Sbjct: 268 CYVCDKQGHIAR---------DCKLRKGERAHLSIEESEDEKEDECHMLFSAVEEKEI-- 316

Query: 292 FFTNKDFNRDGWFLDSDCNSHMTGNKEMF 378
                    + W +DS C +HM+ +   F
Sbjct: 317 ----STIGEETWLVDSGCTNHMSKDVRHF 341


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  166 bits (420), Expect(2) = 4e-42
 Identities = 107/283 (37%), Positives = 158/283 (55%), Gaps = 7/283 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVA 637
            I L  G+++ I ++  Y+  ++ N++SL +LL   Y++    KDN  +  R   +  I  
Sbjct: 381  IRLKNGDHQFI-SNVYYIPSMKTNILSLGQLLEKGYDIRL--KDNNLSI-RDQESNLITK 436

Query: 638  EGTEQNGLFVM--KPMLIECF-LTNTEISNLWHNRLGHINN---EYLWKVGAVSHGPKKL 799
                +N +FV+  +  + +C  +   E S LWH R GH+N    E L +   V   P   
Sbjct: 437  VPMSKNRMFVLNIRNDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCIN 496

Query: 800  LPTKMCSS*ITAKLHKKPFNK-GTRISTKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             P ++C   +  K  K  F K  +  + K LE+IH+D+CGPI P +    +Y L F DD 
Sbjct: 497  HPNQVCEGCLLGKQFKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDF 556

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+ TW + L+ KSEVFE F KF   V+ E    I  ++ D G EFTS EF  YC   GI+
Sbjct: 557  SRKTWVYFLKEKSEVFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIR 616

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT+P + QQNGV ERKNRT++EM RS+L+S +L  + W EA
Sbjct: 617  RQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAEA 659



 Score = 33.9 bits (76), Expect(2) = 4e-42
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 5/80 (6%)
 Frame = +1

Query: 229 SSNQATSKANMVEIK---ENQVYVFFTNKDFNRDG--WFLDSDCNSHMTGNKEMFTNFRI 393
           S+ +   KA+ VE K   E+ + +    KD  ++   W+LDS  ++HM G K MF     
Sbjct: 298 SNKKFEEKAHYVEEKIQEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFAELDE 357

Query: 394 AYG*KFVKIVEERLLVKGVG 453
           +         E ++ VKG G
Sbjct: 358 SVRGNVALGDESKMEVKGKG 377


>dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana]
            gi|13872710|emb|CAC37622.1| polyprotein [Arabidopsis
            thaliana]
          Length = 1334

 Score =  162 bits (409), Expect(2) = 4e-42
 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658
            +++D  +V GL+ NL S+ +L   K   F  + D     C + +   +++V   T  +N 
Sbjct: 351  VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 405

Query: 659  LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805
            +FV+   +          C     + +N+WH R GH+N++ L  +     V   PK  L 
Sbjct: 406  MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 465

Query: 806  TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             +  +C   +  K  ++   K +   ST+ L+++H+D+CGPI P +T GK YIL F DD 
Sbjct: 466  EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 525

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+  WT++L  KSE F+ F +F  +V+ E   K++CL+ D G E+ S EF  YC + GIK
Sbjct: 526  SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 585

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT  Y  QQNGVAERKNR+++ M R +L    +  K+W EA
Sbjct: 586  RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 628



 Score = 38.1 bits (87), Expect(2) = 4e-42
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423
           +AN VE++E+ + +    +  D  +  WFLDS C++HM G +E F      +        
Sbjct: 270 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 329

Query: 424 EERLLVKGVGR 456
           + R+ V+G G+
Sbjct: 330 DRRMAVEGKGK 340


>gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768-32772 [Arabidopsis thaliana]
          Length = 1334

 Score =  162 bits (409), Expect(2) = 4e-42
 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658
            +++D  +V GL+ NL S+ +L   K   F  + D     C + +   +++V   T  +N 
Sbjct: 351  VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 405

Query: 659  LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805
            +FV+   +          C     + +N+WH R GH+N++ L  +     V   PK  L 
Sbjct: 406  MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 465

Query: 806  TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             +  +C   +  K  ++   K +   ST+ L+++H+D+CGPI P +T GK YIL F DD 
Sbjct: 466  EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 525

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+  WT++L  KSE F+ F +F  +V+ E   K++CL+ D G E+ S EF  YC + GIK
Sbjct: 526  SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 585

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT  Y  QQNGVAERKNR+++ M R +L    +  K+W EA
Sbjct: 586  RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 628



 Score = 38.1 bits (87), Expect(2) = 4e-42
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423
           +AN VE++E+ + +    +  D  +  WFLDS C++HM G +E F      +        
Sbjct: 270 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 329

Query: 424 EERLLVKGVGR 456
           + R+ V+G G+
Sbjct: 330 DRRMAVEGKGK 340


>gb|AAF25964.2|AC017118_1 F6N18.1 [Arabidopsis thaliana]
          Length = 1207

 Score =  162 bits (409), Expect(2) = 4e-42
 Identities = 99/283 (34%), Positives = 156/283 (55%), Gaps = 17/283 (6%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYN--QKIVAEGT-EQNG 658
            +++D  +V GL+ NL S+ +L   K   F  + D     C + +   +++V   T  +N 
Sbjct: 256  VISDVYFVPGLKNNLFSVGQLQQ-KGLRFIIEGD----VCEVWHKTEKRMVMHSTMTKNR 310

Query: 659  LFVMKPML--------IECFLTNTEISNLWHNRLGHINNEYLWKVGA---VSHGPKKLLP 805
            +FV+   +          C     + +N+WH R GH+N++ L  +     V   PK  L 
Sbjct: 311  MFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLNHQGLRSLAEKEMVKGLPKFDLG 370

Query: 806  TK--MCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDH 976
             +  +C   +  K  ++   K +   ST+ L+++H+D+CGPI P +T GK YIL F DD 
Sbjct: 371  EEEAVCDICLKGKQIRESIPKESAWKSTQVLQLVHTDICGPINPASTSGKRYILNFIDDF 430

Query: 977  SKMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIK 1156
            S+  WT++L  KSE F+ F +F  +V+ E   K++CL+ D G E+ S EF  YC + GIK
Sbjct: 431  SRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCLRSDRGGEYNSREFDEYCKEFGIK 490

Query: 1157 KELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
            ++LT  Y  QQNGVAERKNR+++ M R +L    +  K+W EA
Sbjct: 491  RQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPRKFWPEA 533



 Score = 38.1 bits (87), Expect(2) = 4e-42
 Identities = 21/71 (29%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
 Frame = +1

Query: 250 KANMVEIKENQVYVFFTNK--DFNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIV 423
           +AN VE++E+ + +    +  D  +  WFLDS C++HM G +E F      +        
Sbjct: 175 EANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQNVRLGD 234

Query: 424 EERLLVKGVGR 456
           + R+ V+G G+
Sbjct: 235 DRRMAVEGKGK 245


>ref|XP_002064813.1| GK15001 [Drosophila willistoni] gi|194160898|gb|EDW75799.1| GK15001
            [Drosophila willistoni]
          Length = 1249

 Score =  168 bits (426), Expect(2) = 5e-42
 Identities = 99/285 (34%), Positives = 152/285 (53%), Gaps = 6/285 (2%)
 Frame = +2

Query: 458  ITLNFGNNKIILTDALYVEGLRKNLISLYKLLT*KYNMFA-FQKDNGQTCCRIPYNQKIV 634
            +T+  G  K+ + + LYV GL  N +S+ +++  +YN    F+K       +I  N + +
Sbjct: 312  VTIRTGICKLTMNNVLYVPGLAGNFMSVARVI--EYNSVVHFEKH----MAKIIQNGECI 365

Query: 635  AEGTEQNGLFVMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKVGAVSH----GPKKLL 802
             +  +   LFV +      F    E  +LWH R GH+N + L ++ +             
Sbjct: 366  LKAKKIGNLFVFEAESENLFAAVGEDVSLWHKRFGHLNYKSLTQIASKGLVRGLSVTNFA 425

Query: 803  PTKMCSS*ITAKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHS 979
            P   C + + +K+H +PF K T   S++ L+++HSD+CGP    +  G  Y LTF DD S
Sbjct: 426  PNTPCKTCMVSKIHVQPFPKMTESRSSELLQLVHSDVCGPFGTKSLGGSRYFLTFIDDKS 485

Query: 980  KMTWTFILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKK 1159
            +  + + L+ K EVF  F++F   V+ +   K+ C++ DNG E+ +  F +Y  K GI +
Sbjct: 486  RRIFVYFLKGKDEVFGKFLEFKSLVERQTGKKLKCIRSDNGREYVNNAFDDYLKKNGILR 545

Query: 1160 ELTIPYNTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            +LTI Y  QQNGVAER NRTL+EM R LL  S L    W EA  T
Sbjct: 546  QLTIAYTPQQNGVAERANRTLVEMSRCLLAQSGLCEALWAEAIFT 590



 Score = 31.2 bits (69), Expect(2) = 5e-42
 Identities = 12/30 (40%), Positives = 19/30 (63%)
 Frame = +1

Query: 304 KDFNRDGWFLDSDCNSHMTGNKEMFTNFRI 393
           ++  R+ W LDS   SHM  +K MF++F +
Sbjct: 262 ENMKREKWCLDSGATSHMCCDKSMFSDFSV 291


>gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
          Length = 1356

 Score =  164 bits (415), Expect(2) = 6e-42
 Identities = 97/275 (35%), Positives = 140/275 (50%), Gaps = 6/275 (2%)
 Frame = +2

Query: 488  ILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNGLFV 667
            IL +  YV  LR+NLIS   L     +   ++ + G+   R   N K    G+  NGL+V
Sbjct: 363  ILENVKYVPHLRRNLISTGTL-----DKLGYRHEGGEGKVRYFKNNKTALRGSLSNGLYV 417

Query: 668  MKPMLIECFLTNTEISN----LWHNRLGHIN-NEYLWKVGAVSHGPKKLLPTKMCSS*IT 832
            +    +   L N E       LWH+RLGH++ N      G      K++   + C   + 
Sbjct: 418  LDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELEFCEHCVM 477

Query: 833  AKLHKKPFNKGTRISTKCLEIIHSDLCG-PITPPTTHGKSYILTFTDDHSKMTWTFILRH 1009
             K  K  FN G   S   L  +H+DL G P   P+  GK Y L+  DD ++  W + L+ 
Sbjct: 478  GKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKVWLYFLKS 537

Query: 1010 KSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPYNTQQ 1189
            K E F+ F ++   V+ +   K+ CL+ DNG+EF +  F +YC + GI++  T  Y  QQ
Sbjct: 538  KDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRTCTYTPQQ 597

Query: 1190 NGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            NGVAER NRT++E VR LL  S +   +W EAA T
Sbjct: 598  NGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAAT 632



 Score = 35.0 bits (79), Expect(2) = 6e-42
 Identities = 13/30 (43%), Positives = 19/30 (63%)
 Frame = +1

Query: 301 NKDFNRDGWFLDSDCNSHMTGNKEMFTNFR 390
           N+   +D W LDS C SHMT  ++ F +F+
Sbjct: 300 NEQMVKDLWILDSGCTSHMTSRRDWFISFQ 329


>gb|AGW47867.1| polyprotein [Phaseolus vulgaris]
          Length = 1471

 Score =  166 bits (421), Expect(2) = 4e-41
 Identities = 101/276 (36%), Positives = 153/276 (55%), Gaps = 11/276 (3%)
 Frame = +2

Query: 491  LTDALYVEGLRKNLISLYKLLT*KYNMFAFQK----DNGQTCCRIPYNQKIVAEGTEQNG 658
            L D  YV  L+ N++S+ +L    Y++F   +     N Q C        +      +N 
Sbjct: 412  LQDVYYVPDLKTNILSMGQLTEKGYSIFLKDRFLHLKNKQGCL-------VARIEMARNR 464

Query: 659  LFVMKPMLI--ECFLTNTEI-SNLWHNRLGHINNEYLWKVGAVS--HG-PKKLLPTKMCS 820
            ++ +    I  +C   N E  ++LWH R GH+++  L ++   +  HG P      K C 
Sbjct: 465  MYKLNLRSIREKCLQVNIEDKASLWHLRFGHLHHGGLKELAKKNMVHGLPNMDYEGKFCE 524

Query: 821  S*ITAKLHKKPFNKGTRISTKC-LEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWTF 997
              + +K  +  F K  +   K  LE+IH+D+CGPITP +  GK Y +TF DD S+ TW +
Sbjct: 525  ECVLSKHVRTSFPKKAQYWAKQPLELIHTDICGPITPESFSGKRYFITFIDDFSRKTWVY 584

Query: 998  ILRHKSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPY 1177
             L+ KSE FE F KF   V+     +I  ++ D G E+TS  F  YC ++GI++ LT PY
Sbjct: 585  FLKEKSEAFEVFKKFKVMVERTTDKQIKAVRSDRGGEYTSTTFMEYCEEQGIRRFLTAPY 644

Query: 1178 NTQQNGVAERKNRTLIEMVRSLLQSSQLSTKWWGEA 1285
              QQNGVAERKNRT+++MVRS+L+S ++  ++W EA
Sbjct: 645  TPQQNGVAERKNRTILDMVRSMLKSKKMPKEFWAEA 680



 Score = 30.0 bits (66), Expect(2) = 4e-41
 Identities = 16/56 (28%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
 Frame = +1

Query: 226 ISSNQATSKANMVEIKENQVYVFFTNKDFNRDG-WFLDSDCNSHMTGNKEMFTNFR 390
           I   + T+ A  VE  E  + +     + N D  W+LDS  ++HM G++ +F + +
Sbjct: 322 IKIEETTNLALEVETNEGVLLMAQDEVNINNDTLWYLDSGASNHMCGHEYLFKDMQ 377


>pir||S00954 pol polyprotein - fruit fly (Drosophila melanogaster) transposon 1731
            gi|8702|emb|CAA30503.1| unnamed protein product
            [Drosophila melanogaster]
          Length = 982

 Score =  167 bits (422), Expect(2) = 5e-41
 Identities = 98/275 (35%), Positives = 150/275 (54%), Gaps = 5/275 (1%)
 Frame = +2

Query: 485  IILTDALYVEGLRKNLISLYKLLT*KYNMFAFQKDNGQTCCRIPYNQKIVAEGTEQNGLF 664
            ++L + L+V  L  N +S+ +    +Y  F    + G     +    + +        L+
Sbjct: 65   LVLNNVLFVPDLNGNFMSVSRAA--QYKCFV---NFGPHYADVIQEGERILRVMRAGNLY 119

Query: 665  VMKPMLIECFLTNTEISNLWHNRLGHINNEYLWKV--GAVSHGPKKLL--PTKMCSS*IT 832
            + +     CF       +LWH R GH+N   L ++    + +G +K++  P  +C + + 
Sbjct: 120  MFQGKHNSCFAAVDADGSLWHKRNGHLNTSSLQEMVRKKMVYGVEKVVFKPDAVCKTCML 179

Query: 833  AKLHKKPFNKGTRI-STKCLEIIHSDLCGPITPPTTHGKSYILTFTDDHSKMTWTFILRH 1009
            AK+H +PF K TR  + + L++IHSDLCGP + P+  G  Y LTF DD S+  + + LR 
Sbjct: 180  AKIHVQPFPKTTRSRAEELLDMIHSDLCGPFSTPSLAGSKYFLTFIDDKSRRIFVYFLRK 239

Query: 1010 KSEVFESFMKF*KKVQTEKVSKILCLKIDNGIEFTSIEFKNYCVKKGIKKELTIPYNTQQ 1189
            K EVF  F++F K V+ +   KI C++ DNG EF +  F +Y    GI ++LTIP+  QQ
Sbjct: 240  KDEVFTKFVEFKKLVERQTGRKIKCIRSDNGGEFVNNVFDDYLKAHGIARQLTIPHTPQQ 299

Query: 1190 NGVAERKNRTLIEMVRSLLQSSQLSTKWWGEAALT 1294
            NGVAER NRTL+EM R +L  S+L    W EA  T
Sbjct: 300  NGVAERANRTLVEMARCMLLQSELGEALWAEAINT 334



 Score = 29.3 bits (64), Expect(2) = 5e-41
 Identities = 17/48 (35%), Positives = 23/48 (47%)
 Frame = +1

Query: 310 FNRDGWFLDSDCNSHMTGNKEMFTNFRIAYG*KFVKIVEERLLVKGVG 453
           F +  W LDS   SHM  ++ +FT F   +  K        LL KG+G
Sbjct: 8   FGKTQWCLDSGATSHMCCDRSVFTEFE-EHTEKISLAGNGFLLAKGIG 54

