BLASTX nr result

ID: Catharanthus22_contig00012811 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012811
         (1396 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN71427.1| hypothetical protein VITISV_027864 [Vitis vinifera]   343   1e-91
emb|CAN73532.1| hypothetical protein VITISV_012827 [Vitis vinifera]   304   5e-80
emb|CAN63563.1| hypothetical protein VITISV_003097 [Vitis vinifera]   263   2e-67
emb|CAN75363.1| hypothetical protein VITISV_026292 [Vitis vinifera]   150   1e-43
gb|ABR16307.1| unknown [Picea sitchensis]                             176   2e-41
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   166   2e-38
emb|CBI37296.3| unnamed protein product [Vitis vinifera]              165   5e-38
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]         165   5e-38
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   165   5e-38
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A...   164   6e-38
gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsi...   162   4e-37
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   161   5e-37
gb|AAP46257.1| putative polyprotein [Oryza sativa Japonica Group...   161   5e-37
gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]             160   2e-36
gb|ABA99612.1| retrotransposon protein, putative, unclassified [...   159   3e-36
dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsi...   156   3e-36
emb|CAB75932.1| putative protein [Arabidopsis thaliana]               159   3e-36
emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]   158   6e-36
dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana] gi...   157   8e-36
gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768...   157   8e-36

>emb|CAN71427.1| hypothetical protein VITISV_027864 [Vitis vinifera]
          Length = 1300

 Score =  343 bits (880), Expect = 1e-91
 Identities = 192/466 (41%), Positives = 264/466 (56%), Gaps = 8/466 (1%)
 Frame = +1

Query: 22   YAATEGSTSLLVNTRPTSD---KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMR 192
            + A  G ++ +  T    D    WII+SG + +M G+ EKL   +EYKG  MVV A+N +
Sbjct: 298  FFAAIGESAFIATTSEQIDYEKDWIIDSGCSNHMTGDKEKLQDLSEYKGRHMVVTANNSK 357

Query: 193  FPIKSTGDSDCATEESTQGGVTK--CHALGKDKEEFVVC-VTTNNLRYLRGIRPDDIKIY 363
             PI   G++  +++ +T     +   H  G  K    V  +T++    L G  P D+K+Y
Sbjct: 358  LPIAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNLLSVAQLTSSGHSVLFG--PQDVKVY 415

Query: 364  RNVKIISTPILEGEKQEFVYVMSVETAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXX 543
             +++++  P+++G + E VYVMS ETAY+DKTRK E  DLWH R  H SY          
Sbjct: 416  HDLEVMEEPVIKGRRLESVYVMSAETAYVDKTRKNETADLWHMRLSHISYSKLTMMMKKS 475

Query: 544  XXXXXPNLEVRDETICAGCQNGKAHQVPYYESKFRARQPLELVHSDVFGPVRCLSVGGAR 723
                 P LEVR  TICA CQ GKAHQ+PY ESK++A+ PLEL+HSDVFGPV+  S+ G +
Sbjct: 476  MLKGLPQLEVRKXTICAXCQYGKAHQLPYEESKWKAKGPLELIHSDVFGPVKQASLSGMK 535

Query: 724  W*LSSTIIKGMCGHIL*KKNQKLLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSE 903
            + ++                   ++  SR                     R ++   F  
Sbjct: 536  YMVT------------------FIDDFSR---------------------RVYLQMSFFT 556

Query: 904  FLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVIN 1077
              +        TC+ TP+Q  V +R NR LAE C SMLHAKN    +W   ++T A VIN
Sbjct: 557  SSENXEYAISFTCANTPQQNGVXERKNRHLAEICRSMLHAKNVPGXFWAEXMKTAAFVIN 616

Query: 1078 RMPQAKFILKSTLEKLWNIRPTVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSG 1257
            R+PQ +    S  EKLWNI+PTV++FRVFG V YVFVP+HLR K DKKA+RC+ VGYDS 
Sbjct: 617  RLPQQRLNFSSPFEKLWNIKPTVSYFRVFGCVCYVFVPNHLRSKMDKKAVRCVLVGYDSQ 676

Query: 1258 RKGWRCSDPITGRGYVSRDVVFNEASSWWPLEKMRTEDIEKIEEKV 1395
            RK WRC DP TG+ Y SR+VVF+E+SSWW  EK    D +  ++++
Sbjct: 677  RKXWRCCDPTTGKCYTSRNVVFDESSSWWSSEKEILXDSBVFKDEL 722


>emb|CAN73532.1| hypothetical protein VITISV_012827 [Vitis vinifera]
          Length = 1194

 Score =  304 bits (779), Expect = 5e-80
 Identities = 170/443 (38%), Positives = 239/443 (53%), Gaps = 5/443 (1%)
 Frame = +1

Query: 82   WIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVTK 261
            WII+SG + +M G+ EKL   +EYKG  MVV  +N + PI   G++  +++ +T     +
Sbjct: 321  WIIDSGCSNHMTGDKEKLXDLSEYKGRHMVVTXNNSKJPIAHIGNTVVSSQYNTNDVSLQ 380

Query: 262  --CHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSV 435
               H  G  K    V   T++  ++    P D+K+YR+++I+  P++   + E VYVMS 
Sbjct: 381  NVYHVPGMKKNLLSVAQLTSSGHFVL-FGPQDVKVYRDLEIMEEPVIXRWRLESVYVMSA 439

Query: 436  ETAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKA 615
            ETA +DKTRK E TDL H R  H SY               P LEVR +TICAGC  GKA
Sbjct: 440  ETAXVDKTRKNETTDLXHMRLSHVSYSKLTVMMKKSMLKGLPQLEVRKDTICAGCXYGKA 499

Query: 616  HQVPYYESKFRARQPLELVHSDVFGPVRCLSVGGARW*LSSTIIKGMCGHI---L*KKNQ 786
            HQ+PY ESK++ + PLEL+HSDVFGPV+  S+ G ++    T I     ++     K+  
Sbjct: 500  HQLPYEESKWKTKGPLELIHSDVFGPVKXASLSGMKY--MXTFIDDFSRYVWVHFMKEKS 557

Query: 787  KLLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQVA 966
            +        ++    + +++ RCLR DN  E                             
Sbjct: 558  ETFSKFKEFKEMTEAEVDKRIRCLRXDNGGE----------------------------- 588

Query: 967  KRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTV 1146
                        SMLH KN    +W   ++T A VINR+PQ +    S  EKLWNI+P V
Sbjct: 589  ------------SMLHXKNVPGRFWVEAMKTAAFVINRLPQQRLNFSSPFEKLWNIKPIV 636

Query: 1147 NHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFN 1326
            ++FRVFG V Y FVP+H R K DKK +RC+ VGYDS  K WRC +P TG+ Y SR+VVF+
Sbjct: 637  SYFRVFGCVCYAFVPNHXRSKMDKKXVRCVLVGYDSQXKRWRCCNPTTGKYYTSRNVVFD 696

Query: 1327 EASSWWPLEKMRTEDIEKIEEKV 1395
            E+SSWW  EK    D +  ++++
Sbjct: 697  ESSSWWSSEKEILPDSDVFKDEL 719


>emb|CAN63563.1| hypothetical protein VITISV_003097 [Vitis vinifera]
          Length = 1052

 Score =  263 bits (671), Expect = 2e-67
 Identities = 152/377 (40%), Positives = 214/377 (56%), Gaps = 7/377 (1%)
 Frame = +1

Query: 118  GNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVTK--CHALGKDKEE 291
            G+ EKL   +EYKG  MV+  +N + PI   G++  +++ +T     +   H  G  K  
Sbjct: 237  GDKEKLQDLSEYKGRHMVITTNNSKLPIAHIGNTVVSSQYNTNDVSLQNVYHVPGMKKNL 296

Query: 292  FVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVETAYIDKTRKKE 471
              V   T++  ++    P D+K+ R+++I+  P+++G + E +YVM VETAY+DKTRK E
Sbjct: 297  LSVAQLTSSGHFVL-FSPQDVKVXRDLEIMEEPVIKGWRLESIYVMFVETAYVDKTRKNE 355

Query: 472  ITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQVPYYESKFRA 651
            I DLWH R  H SY               P LE            GKAHQ+ Y ESK++A
Sbjct: 356  IADLWHMRLSHVSYSKLTVMMKKSMLKGLPQLE------------GKAHQLSYEESKWKA 403

Query: 652  RQPLELVHSDVFGPVRCLSVGGARW*LSSTIIKGMCGHI---L*KKNQKLLEYLSRSRKK 822
            + PLEL+HSDVFGPV+   + G ++ +  T I     ++     K+  +        ++ 
Sbjct: 404  KGPLELIHSDVFGPVKQAXLSGMKYMV--TFIDDFSRYVWVYFMKEKSETFSKFKEFKEM 461

Query: 823  LNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAET 996
               + +++  CLR DN   + S EF  FL+E R R Q TC+ T +Q  VA+R NR LAE 
Sbjct: 462  TEIEVDKRIHCLRTDNGXXYTSNEFFYFLRECRVRHQFTCANTLQQNGVAERKNRHLAEI 521

Query: 997  C*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSVY 1176
            C SMLHAKN    +W   ++T A VINR+PQ +    S  EKLWNI+PTV++FRVFG V 
Sbjct: 522  CRSMLHAKNVPGRFWAEAMKTXAFVINRLPQQRLNFSSPFEKLWNIKPTVSYFRVFGCVC 581

Query: 1177 YVFVPDHLRMKFDKKAI 1227
            YVFVP HLR K DKKA+
Sbjct: 582  YVFVPKHLRNKMDKKAV 598


>emb|CAN75363.1| hypothetical protein VITISV_026292 [Vitis vinifera]
          Length = 1161

 Score =  150 bits (380), Expect(2) = 1e-43
 Identities = 76/134 (56%), Positives = 93/134 (69%), Gaps = 2/134 (1%)
 Frame = +1

Query: 832  K*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*S 1005
            K +++ RCLR DN  E+ S EF  FL+E R R Q TC+ TP+Q  VA+R NR LAE C S
Sbjct: 507  KVDKRIRCLRTDNGGEYTSDEFFYFLRECRVRHQFTCANTPQQNSVAERKNRHLAEICRS 566

Query: 1006 MLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSVYYVF 1185
            MLHAKN    +W   ++T A VINR+PQ K    S  EKLWNI+PT+++FRVFG V YVF
Sbjct: 567  MLHAKNVPRRFWAEAMKTVAFVINRLPQQKLNFSSPFEKLWNIKPTISYFRVFGCVCYVF 626

Query: 1186 VPDHLRMKFDKKAI 1227
            VP+HLR K DKK I
Sbjct: 627  VPNHLRSKMDKKEI 640



 Score = 54.7 bits (130), Expect(2) = 1e-43
 Identities = 24/45 (53%), Positives = 33/45 (73%)
 Frame = +3

Query: 723 VVTFIDNY*RYVWAYFMKEKSETFGIFKSFKEEVEREIGRKISML 857
           +VTFI+++  YVW YFMKEKSETF  +K FKE  E ++ ++I  L
Sbjct: 471 MVTFINDFSNYVWVYFMKEKSETFSKYKEFKEMTEVKVDKRIRCL 515



 Score = 78.2 bits (191), Expect = 8e-12
 Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
 Frame = +1

Query: 37  GSTSLLVNTRPTSD---KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKS 207
           G ++ +  T    D    WII+SG + +M G+ EKL    EYKG  MVV  +N + PI  
Sbjct: 303 GESAFIATTSEQIDYEKDWIIDSGCSNHMTGDKEKLXDLXEYKGRHMVVTXNNSKLPIAH 362

Query: 208 TGDSDCATEESTQGGVTK--CHALGKDKEEFVVC-VTTNNLRYLRGIRPDDIKIYRNVKI 378
            G++  +++ +T     +   H  G  K    +  +T++    L G  P D+K+YR+++I
Sbjct: 363 IGNTVVSSQYNTNDVSLQNVYHVXGMKKNLLSIAQLTSSGXSVLFG--PQDVKVYRDLEI 420

Query: 379 ISTPILEGEKQEFVYVMSVETAYIDKTRKK 468
           +   +++G + E VYVMS ETAY   T K+
Sbjct: 421 MEELVIKGRRLESVYVMSAETAYDVSTAKR 450


>gb|ABR16307.1| unknown [Picea sitchensis]
          Length = 407

 Score =  176 bits (446), Expect = 2e-41
 Identities = 108/331 (32%), Positives = 170/331 (51%), Gaps = 7/331 (2%)
 Frame = +1

Query: 358  IYRNVKIISTPILEGEKQEFVYVMSVETAYIDKTRKKEITDLWHARFEHFSYXXXXXXXX 537
            I  N K++++  +E    +F    S E A + +     ++ LWH R  H +Y        
Sbjct: 35   INSNYKVVASGYVENGLYKFGRFTSNEKALVAEV--DNVSRLWHERMGHLNYKSLSLMKR 92

Query: 538  XXXXXXXPNLEVRDETICAGCQNGKAHQVPYYESK-FRARQPLELVHSDVFGPVRCLSVG 714
                   P +  + + +C GC +GK H   + + K +RA+ PL LVHSD+ GP+   S+ 
Sbjct: 93   FEMVYGLPKIS-QSQGVCEGCMSGKQHMEKFIKGKSWRAKTPLHLVHSDLMGPLEHPSIS 151

Query: 715  GARW*LSSTIIKGMCGHI---L*KKNQKLLEYLSRSRKKLNGK*EEKFRCLRPDNKREFI 885
            G+R+ L  T I      I     K   ++ E     +  +  +  +  + LR DN +E++
Sbjct: 152  GSRYVL--TFIDDYSRRIWVYFLKNKDEVFEKFKEFKAFVEKQSGKSIKILRTDNGKEYV 209

Query: 886  SQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*SMLHAKNALPEYWD*CIRT 1059
            ++EF  + + N  +R+ T   TP+Q  VA+R NR L E    MLHA+N  P++W   I T
Sbjct: 210  NKEFDHYCKYNGIKREHTVPYTPQQNGVAERKNRTLMEMARCMLHARNMDPKFWAEAINT 269

Query: 1060 TAHVINRMPQAKFILKSTLEKLWNIR-PTVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCI 1236
              +++NR P    +   T E+ W+ R PTV+HF+VFG   YV +PD  R K D+K+ +CI
Sbjct: 270  ATYIVNRTPTIA-VKHKTPEEAWSRRKPTVSHFKVFGCDVYVHIPDEKRKKLDRKSHKCI 328

Query: 1237 FVGYDSGRKGWRCSDPITGRGYVSRDVVFNE 1329
             VGY    K +R  DP      + RDV+F+E
Sbjct: 329  MVGYSETSKAYRVYDPEKNEILIRRDVIFDE 359


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  166 bits (421), Expect = 2e-38
 Identities = 115/427 (26%), Positives = 207/427 (48%), Gaps = 6/427 (1%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 334  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 392

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 393  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRND 452

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  GK  
Sbjct: 453  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGKQF 511

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQK 789
            ++ +  ES  RA++PLEL+H+DV GP++  S+G + + L       +    + L K+  +
Sbjct: 512  KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFL-KEKSE 570

Query: 790  LLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--V 963
            + E   + +  +  +     + +R D   EF S+EF ++ ++N  RRQLT  ++P+Q  V
Sbjct: 571  VFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630

Query: 964  AKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-P 1140
            A+R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R P
Sbjct: 631  AERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKP 689

Query: 1141 TVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVV 1320
             V+H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++V
Sbjct: 690  GVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIV 749

Query: 1321 FNEASSW 1341
            F+E   W
Sbjct: 750  FDEEGEW 756


>emb|CBI37296.3| unnamed protein product [Vitis vinifera]
          Length = 3048

 Score =  165 bits (417), Expect = 5e-38
 Identities = 135/471 (28%), Positives = 217/471 (46%), Gaps = 26/471 (5%)
 Frame = +1

Query: 7    ETQTKYAATEGSTSLL----VNTRPTSDKWIINSGVATYMIGNGEKLLI*TEYKGSWM-- 168
            ET   YA  +    L+    +N     D W ++SG + +M G  +     +++ G++   
Sbjct: 291  ETGAYYAKNQEEMLLMAYVDLNKTSREDTWFLDSGCSNHMCGKKDYF---SDFDGTFRDS 347

Query: 169  VVIADNMRFPIKSTGDSDCATEESTQ--GGVTKCHALGKDKEEFVVCVTTNNLRYLRGIR 342
            V + +N    +   G+      E TQ   GV            F V    NNL  +  ++
Sbjct: 348  VKLGNNTSMSVLGKGNVRLKVNEMTQIITGV------------FYVPELKNNLLSIGQLQ 395

Query: 343  PDDI---------KIYRNVK-IISTPILEGEKQEFVYVMSVE-TAYIDKTRKKEITDLWH 489
               +         K++ + K +I    +   +   +Y +S   ++    T  ++I  LWH
Sbjct: 396  EKGLTILFQHGKCKVFHSQKGLIMDTKMSSNRMFMLYALSQPISSTCFNTVTEDILQLWH 455

Query: 490  ARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQ--VPYYESKFRARQPL 663
             R+ H S+               P  +   + +C  C  GK H+  +P  +S +RA + L
Sbjct: 456  CRYGHLSFQGLKTLQQRKMVNGLPQFQPPSK-LCKDCLVGKQHRSSIPK-KSNWRAAEIL 513

Query: 664  ELVHSDVFGPVRCLSVGGARW*LSST--IIKGMCGHIL*KKNQKLLEYLSRSRKKLNGK* 837
            +LVH+D+ GP+  +S    R+ L+ T    +    + L +K++    +  +S K    K 
Sbjct: 514  QLVHADICGPINPISNSKKRYLLTFTDDFSRKTWVYFLVEKSEAFAVF--KSFKTYVEKE 571

Query: 838  EEKF-RCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*SM 1008
               F RCLR D   EF SQEF+ F   +  RRQLT + TP+Q  VA+R NR +     SM
Sbjct: 572  TSSFLRCLRTDRGGEFTSQEFAIFCDVHGIRRQLTAAYTPQQNGVAERKNRTIMNMVRSM 631

Query: 1009 LHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSVYYVFV 1188
            L AK     +W   +  T HV+NR P      K+  E    ++P+V++FRVFG + +V V
Sbjct: 632  LSAKKLPKTFWPEAVNWTVHVLNRSPTFAVQNKTPEEAWGKLKPSVDYFRVFGCLSHVHV 691

Query: 1189 PDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSW 1341
            PD  R K D K+  C+ +G     K +R  DPI+ +  +SRDVVF E  +W
Sbjct: 692  PDSKRTKLDDKSFSCVLLGVSEESKAYRLYDPISQKIIISRDVVFEEDKNW 742


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
          Length = 1352

 Score =  165 bits (417), Expect = 5e-38
 Identities = 114/427 (26%), Positives = 206/427 (48%), Gaps = 6/427 (1%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 334  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 392

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 393  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRND 452

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  GK  
Sbjct: 453  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGKQF 511

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQK 789
            ++ +  ES  RA++PLEL+H+DV GP++  S+G + + L       +    + L K+  +
Sbjct: 512  KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFL-KEKSE 570

Query: 790  LLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--V 963
            + E   + +  +  +     + +R D   EF S+EF ++ ++N  RRQLT  ++P+Q  V
Sbjct: 571  VFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630

Query: 964  AKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-P 1140
             +R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R P
Sbjct: 631  VERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKP 689

Query: 1141 TVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVV 1320
             V+H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++V
Sbjct: 690  GVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIV 749

Query: 1321 FNEASSW 1341
            F+E   W
Sbjct: 750  FDEEGEW 756


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  165 bits (417), Expect = 5e-38
 Identities = 114/427 (26%), Positives = 206/427 (48%), Gaps = 6/427 (1%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 334  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 392

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 393  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRND 452

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  GK  
Sbjct: 453  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGKQF 511

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQK 789
            ++ +  ES  RA++PLEL+H+DV GP++  S+G + + L       +    + L K+  +
Sbjct: 512  KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFL-KEKSE 570

Query: 790  LLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--V 963
            + E   + +  +  +     + +R D   EF S+EF ++ ++N  RRQLT  ++P+Q  V
Sbjct: 571  VFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630

Query: 964  AKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-P 1140
             +R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R P
Sbjct: 631  VERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKP 689

Query: 1141 TVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVV 1320
             V+H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++V
Sbjct: 690  GVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIV 749

Query: 1321 FNEASSW 1341
            F+E   W
Sbjct: 750  FDEEGEW 756


>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
          Length = 1272

 Score =  164 bits (416), Expect = 6e-38
 Identities = 114/427 (26%), Positives = 207/427 (48%), Gaps = 6/427 (1%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 334  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 392

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 393  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDKESNLITKVPMSKNRMFVLNIRND 452

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  G   
Sbjct: 453  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGNQF 511

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQK 789
            ++ +  ES  RA++PLEL+H+DV GP++  S+G + + L       +    + L K+  +
Sbjct: 512  KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFL-KEKSE 570

Query: 790  LLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--V 963
            + E   + +  +  +     + +R D+  EF S+EF ++ ++N  RRQLT  ++P+Q  V
Sbjct: 571  VFEIFKKFKAHVEKESGLVIKTMRSDSGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630

Query: 964  AKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-P 1140
            A+R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R P
Sbjct: 631  AERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKP 689

Query: 1141 TVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVV 1320
             V+H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++V
Sbjct: 690  GVSHLRVFGSIAHAHVPDEKRNKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIV 749

Query: 1321 FNEASSW 1341
            F+E   W
Sbjct: 750  FDEEGEW 756


>gb|AAD32906.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 822

 Score =  162 bits (409), Expect = 4e-37
 Identities = 126/428 (29%), Positives = 204/428 (47%), Gaps = 8/428 (1%)
 Frame = +1

Query: 82   WIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQG--GV 255
            W+I+SG   +M  N EKL           + + +      +  GD +  T +  +G   V
Sbjct: 30   WLIDSGCTNHMTPN-EKLFTKINRDFKVPIRVGNGAVMMSEGKGDIEVMTRKDKRGIRDV 88

Query: 256  TKCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNV-KIISTPILEGEKQEFVYVMS 432
                 LGK+       +      Y   ++ +   I+ +  K I    +  +     ++ +
Sbjct: 89   LLVPKLGKNLLSVPQMIING---YQVTLKNNYCTIHDSARKKIGEVEMVNKSFHLRWLSN 145

Query: 433  VETAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGK 612
             ETA +    K E T+LWH R  H  +               P   V +E  C  C   K
Sbjct: 146  EETAMV---AKDEATELWHKRLGHTGHSNLKILQSKEMVTGLPKFNV-EEGKCESCILSK 201

Query: 613  AHQVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKN 783
              + P+  ES+ RA+  LEL+HSDV GP++  S+ G+R+ L+      + +  + L K  
Sbjct: 202  HSRDPFPKESETRAKHKLELIHSDVCGPMQNSSINGSRYILTFIDDATRMVWVYFL-KAK 260

Query: 784  QKLLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ- 960
             ++ +   + +  +      + + LR D   E++S+EFSEFL+ N   RQLT + +P+Q 
Sbjct: 261  SEVFQTFKKFKNLVENNANCRIKKLRIDRGTEYLSKEFSEFLEGNGIERQLTAAYSPQQN 320

Query: 961  -VAKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR 1137
             V++R NR L E   +M+ AK+   + W   +   A+  NR P      K+ LE   + +
Sbjct: 321  EVSERRNRSLVEMARAMIKAKDLPLKLWAEAVHVAAYAQNRTPTRTLKNKTPLEAWSDSK 380

Query: 1138 PTVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDV 1317
            P+V+H +VFGS+ YV +PD  R K+D K+ R IFVGY S  KG+R       +  +SRDV
Sbjct: 381  PSVSHMKVFGSICYVHIPDEKRRKWDDKSKRAIFVGYSSQTKGYRVYLLKENKIDISRDV 440

Query: 1318 VFNEASSW 1341
            +F+E S W
Sbjct: 441  IFDEDSKW 448


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  161 bits (408), Expect = 5e-37
 Identities = 113/427 (26%), Positives = 206/427 (48%), Gaps = 6/427 (1%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 334  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 392

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 393  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRND 452

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  GK  
Sbjct: 453  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGKQF 511

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQK 789
            ++ +  ES  RA++ LEL+H+DV GP++  S+G + + L       +    + L K+  +
Sbjct: 512  KMSFPKESSSRAQKSLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFL-KEKSE 570

Query: 790  LLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--V 963
            + E   + +  +  +     + +R D   EF S+EF ++ ++N  RRQLT  ++P+Q  V
Sbjct: 571  VFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630

Query: 964  AKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPT 1143
            A+R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R +
Sbjct: 631  AERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKS 689

Query: 1144 -VNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVV 1320
             V+H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++V
Sbjct: 690  GVSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIV 749

Query: 1321 FNEASSW 1341
            F+E   W
Sbjct: 750  FDEEGEW 756


>gb|AAP46257.1| putative polyprotein [Oryza sativa Japonica Group]
            gi|108711922|gb|ABF99717.1| retrotransposon protein,
            putative, unclassified [Oryza sativa Japonica Group]
          Length = 1335

 Score =  161 bits (408), Expect = 5e-37
 Identities = 96/296 (32%), Positives = 149/296 (50%), Gaps = 5/296 (1%)
 Frame = +1

Query: 469  EITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQVPY-YESKF 645
            +I+DLWH R  H +Y               P + ++ +  C GC  GK  +  + +   +
Sbjct: 453  DISDLWHKRMGHLNYRALKLLRTKGMVQGLPFITLKSDP-CEGCVFGKQIRASFPHSGAW 511

Query: 646  RARQPLELVHSDVFGPVRCLSVGGARW*LSSTI--IKGMCGHIL*KKNQKLLEYLSRSRK 819
            RA  PLELVH+D+ G V  +S GG  W   + I     M      K+    LE   + + 
Sbjct: 512  RASAPLELVHADIVGKVPTISEGG-NWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKA 570

Query: 820  KLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAE 993
             +  +   K + LR D  RE+IS+EF ++ +    RRQLT   + +Q  VA+R NR + +
Sbjct: 571  MVENQSNRKIKVLRSDQGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTIND 630

Query: 994  TC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSV 1173
               SML  K     +W   + T  +++NR P      ++  E  +  +P + H RVFG +
Sbjct: 631  MANSMLQDKGMPKSFWAEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCI 690

Query: 1174 YYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSW 1341
             Y  VP   R+KFD K+ RCIFVGY  G KG+R  +    +  +SRD +F+E+++W
Sbjct: 691  CYAQVPAQKRVKFDNKSDRCIFVGYADGIKGYRLYNLEKKKIIISRDAIFDESATW 746


>gb|AAF16534.1|AC013482_8 T26F17.17 [Arabidopsis thaliana]
          Length = 1291

 Score =  160 bits (404), Expect = 2e-36
 Identities = 115/425 (27%), Positives = 200/425 (47%), Gaps = 4/425 (0%)
 Frame = +1

Query: 79   KWIINSGVATYMIGNGEKLLI*TEYKGSWMVVIADNMRFPIKSTGDSDCATEESTQGGVT 258
            KW ++SG + +M G  + +    +      V + D  +  +K  G+     +      ++
Sbjct: 296  KWYLDSGASNHMCGR-KSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFIS 354

Query: 259  KCHALGKDKEEFVVCVTTNNLRYLRGIRPDDIKIYRNVKIISTPILEGEKQEFVYVMSVE 438
              + +   K   +         Y   ++ +++ I      + T +   + + FV  +  +
Sbjct: 355  NVYYIPSMKTNILSLGQLLEKGYDIRLKDNNLSIRDQESNLITKVPMSKNRMFVLNIRND 414

Query: 439  TAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAH 618
             A   K   KE + LWH RF H ++               P +   ++ +C GC  GK  
Sbjct: 415  IAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQ-VCEGCLLGKQF 473

Query: 619  QVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LSSTIIKGMCGHIL*KKNQKLL 795
            ++ +  ES  RA++PLEL+H+DV GP++  S+  +       I K    H+  +K   L+
Sbjct: 474  KMSFPKESSSRAQKPLELIHTDVCGPIKPKSLEKSE---VFKIFKKFKAHV--EKESGLV 528

Query: 796  EYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAK 969
                              + +R D   EF S+EF ++ ++N  RRQLT  ++P+Q  VA+
Sbjct: 529  -----------------IKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAE 571

Query: 970  RTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-PTV 1146
            R NR + E   SML +K    E W   +    +++NR P  K +   T ++ W+ R P V
Sbjct: 572  RKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSP-TKSVSGKTPQEAWSGRKPGV 630

Query: 1147 NHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFN 1326
            +H RVFGS+ +  VPD  R K D K+ + IF+GYD+  KG++  +P T +  +SR++VF+
Sbjct: 631  SHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFD 690

Query: 1327 EASSW 1341
            E   W
Sbjct: 691  EEGEW 695


>gb|ABA99612.1| retrotransposon protein, putative, unclassified [Oryza sativa
            Japonica Group]
          Length = 1198

 Score =  159 bits (402), Expect = 3e-36
 Identities = 96/296 (32%), Positives = 149/296 (50%), Gaps = 5/296 (1%)
 Frame = +1

Query: 469  EITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQVPY-YESKF 645
            +I+DLWH R  H +Y               P + ++ +  C GC  GK  +  + +   +
Sbjct: 404  DISDLWHKRMGHLNYRALKLLRTKGMVQGLPFITLKSDP-CEGCVFGKQIRASFPHSGAW 462

Query: 646  RARQPLELVHSDVFGPVRCLSVGGARW*LSSTI--IKGMCGHIL*KKNQKLLEYLSRSRK 819
            RA  PLELVH+D+ G V  +S GG  W   + I     M      K+    LE   + + 
Sbjct: 463  RASAPLELVHTDIVGKVPTISEGG-NWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKA 521

Query: 820  KLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAE 993
             +  +   K + LR D   E+IS+EF ++ +    RRQLT   + +Q  VA+R NR + +
Sbjct: 522  MVENQSNRKIKVLRSDQGGEYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTIND 581

Query: 994  TC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSV 1173
               SML  K     +W   + T  +++NR P      ++  E  +  +P + H RVFG +
Sbjct: 582  MANSMLQDKGMPKSFWAEAVNTAIYILNRSPTKAVPNRTPFEAWYGKKPVIGHMRVFGCI 641

Query: 1174 YYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSW 1341
             Y  VP   R+KFD K+ RCIFVGY  G KG+R  +    +  +SRDV+F+E+++W
Sbjct: 642  CYAQVPAQKRVKFDNKSDRCIFVGYADGIKGYRLYNLEKKKIIISRDVIFDESATW 697


>dbj|BAB01972.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1499

 Score =  156 bits (395), Expect(2) = 3e-36
 Identities = 103/318 (32%), Positives = 162/318 (50%), Gaps = 5/318 (1%)
 Frame = +1

Query: 418  VYVMSVETAYIDKTRKKEITDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAG 597
            ++  S E  Y+    K+E TDLWH RF H +Y               P  EV  + ICA 
Sbjct: 435  IWKKSREETYMAFEEKEEQTDLWHKRFGHVNYDKIETMQTLKIVEKLPKFEVI-KGICAA 493

Query: 598  CQNGKAHQVPY-YESKFRARQPLELVHSDVFGPVRCLSVGGARW*LSSTI-IKGMCGHIL 771
            C+ GK  +  +  +S+    + LEL+HSDV GP++  S+ G+R+ L+       M     
Sbjct: 494  CEMGKQSRRSFPKKSQSNTNKTLELIHSDVCGPMQTESINGSRYFLTFIDDFSRMTWVYF 553

Query: 772  *KKNQKLLEYLSRSRKKLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKT 951
             K   +++      +  +  + E + + LR D   EF+S+EF +  QE+    ++T   +
Sbjct: 554  LKNKSEVITKFKIFKPYVENQSESRIKRLRTDGGGEFLSREFIKLCQESGIHHEITTPYS 613

Query: 952  PKQ--VAKRTNRQLAETC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKL 1125
            P+Q  VA+R NR L E   SM+  K    ++W   I T+ ++ NR+P        T  ++
Sbjct: 614  PQQNGVAERRNRTLVEMARSMIEEKKLSNKFWAEAIATSTYLQNRLPSKSLEKGVTPMEI 673

Query: 1126 WN-IRPTVNHFRVFGSVYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGY 1302
            W+  +P+V+H +VFG V Y+ +PD  R K D KA + IFVGY +  KG+R       +  
Sbjct: 674  WSGKKPSVDHLKVFGCVCYIHIPDEKRRKLDTKAKQGIFVGYSNESKGYRVFLLNEEKIE 733

Query: 1303 VSRDVVFNEASSWWPLEK 1356
            VS+DV F+E  +W   EK
Sbjct: 734  VSKDVTFDEKKTWSHDEK 751



 Score = 23.9 bits (50), Expect(2) = 3e-36
 Identities = 9/28 (32%), Positives = 20/28 (71%)
 Frame = +2

Query: 257 QNVMHLERIKKNLLSVLQLTTSDIYVVF 340
           ++V+++  + +NLLSV Q+ ++   V+F
Sbjct: 380 KDVLYVPELARNLLSVSQMISNGYRVIF 407


>emb|CAB75932.1| putative protein [Arabidopsis thaliana]
          Length = 1339

 Score =  159 bits (401), Expect = 3e-36
 Identities = 142/489 (29%), Positives = 213/489 (43%), Gaps = 33/489 (6%)
 Frame = +1

Query: 7    ETQTKYAATEGSTSLLV------NTRPTSDKWIINSGVATYMIGNGEKLLI*TEYKGSWM 168
            E    YA  E    LL+      N     + W ++SG + +M G+ E           W 
Sbjct: 269  EKNANYAELEEEEELLLMAYVEQNQANRDEVWFLDSGCSNHMTGSKE-----------WF 317

Query: 169  VVIADNMRFPIKSTGDSDCATEESTQG-GVTKCHALGKDK---EEFVVCVTTNNLRYL-- 330
              + +     +K   D    T  S  G G  K    G  +   E + V    NNL  L  
Sbjct: 318  SELEEGFNRTVKLGND----TRMSVVGKGSVKVKVNGVTQVIPEVYYVPELRNNLLSLGQ 373

Query: 331  ---RG----IRPDDIKIYRNVK-IISTPILEGEKQEFVYVMSVETAYIDKTRKKEITD-- 480
               RG    IR    K+Y   K  I    + G +  F+     +   +   + +E+ D  
Sbjct: 374  LQERGLAILIRDGTCKVYHPSKGAIMETNMSGNRMFFLLASKPQKNSLC-LQTEEVMDKE 432

Query: 481  --LWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQVPYYE-SKFRA 651
              LWH RF H +                P L+   E ICA C  GK H+    + + +++
Sbjct: 433  NHLWHCRFGHLNQEGLKLLAHKKMVIGLPILKATKE-ICAICLTGKQHRESMSKKTSWKS 491

Query: 652  RQPLELVHSDVFGPVRCLSVGGARW*LS--STIIKGMCGHIL*KKNQKLLEY--LSRSRK 819
               L+LVHSD+ GP+  +S  G R+ LS      +    + L +K++    +     S +
Sbjct: 492  STQLQLVHSDICGPITPISHSGKRYILSFIDDFTRKTWVYFLHEKSEAFATFKIFKASVE 551

Query: 820  KLNGK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAE 993
            K  G       CLR D   EF S EF EF + +   RQLT + TP+Q  VA+R NR +  
Sbjct: 552  KEIGA---FLTCLRTDRGGEFTSNEFGEFCRSHGISRQLTAAFTPQQNGVAERKNRTIMN 608

Query: 994  TC*SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIR-PTVNHFRVFGS 1170
               SML  +     +W    + + H+ NR P A  +   T E+ W+ R P V +FRVFG 
Sbjct: 609  AVRSMLSERQVPKMFWSEATKWSVHIQNRSPTAA-VEGMTPEEAWSGRKPVVEYFRVFGC 667

Query: 1171 VYYVFVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSW-WP 1347
            + YV +PD  R K D K+ +C+F+G     K WR  DP+  +  +S+DVVF+E  SW W 
Sbjct: 668  IGYVHIPDQKRSKLDDKSKKCVFLGVSEESKAWRLYDPVMKKIVISKDVVFDEDKSWDWD 727

Query: 1348 LEKMRTEDI 1374
               +  +++
Sbjct: 728  QADVEAKEV 736


>emb|CAN72676.1| hypothetical protein VITISV_020406 [Vitis vinifera]
          Length = 1183

 Score =  158 bits (399), Expect = 6e-36
 Identities = 90/293 (30%), Positives = 156/293 (53%), Gaps = 4/293 (1%)
 Frame = +1

Query: 475  TDLWHARFEHFSYXXXXXXXXXXXXXXXPNLEVRDETICAGCQNGKAHQVPYYESKFR-A 651
            ++LWH R+ H +                P ++     +C GC  GK  + P+ + + R A
Sbjct: 371  SNLWHLRYGHLNVKGLKLLSKKEMVFGLPKID--SVNVCEGCIYGKQSKKPFPKGRSRRA 428

Query: 652  RQPLELVHSDVFGPVRCLSVGGARW*LSSTIIKGMCGHIL*KKNQ-KLLEYLSRSRKKLN 828
               LE++H+D+ GP++  S GG+R+ L  T        +   +++ +  E   + +  + 
Sbjct: 429  SSCLEIIHADLCGPMQTASFGGSRYFLLFTNDHSRMSWVYFLQSKAETFETFKKFKAFVE 488

Query: 829  GK*EEKFRCLRPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC* 1002
             +  +  + LR D   EF+S +F  F +E    R+LT   +P+Q  VA+R NR + E   
Sbjct: 489  KQSGKCIKVLRTDRGGEFLSNDFKVFCEEEGLHRELTTPYSPEQNGVAERKNRTVVEMAR 548

Query: 1003 SMLHAKNALPEYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNIRPTVNHFRVFGSVYYV 1182
            SM+ AKN    +W   + T  +++N  P    + ++  E  +  +P V+H +VFGSV Y 
Sbjct: 549  SMMKAKNLSNHFWAEGVATAVYLLNISPTKAVLNRTPYEAWYGKKPWVSHLKVFGSVAYT 608

Query: 1183 FVPDHLRMKFDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSW 1341
             +  H R K D+K+++CIF+GY S  KG++  +P++G+  VSR+VVF+E +SW
Sbjct: 609  LIESHNRSKLDEKSVKCIFIGYCSQSKGYKLYNPVSGKIIVSRNVVFDEKASW 661


>dbj|BAB11200.1| copia-type polyprotein [Arabidopsis thaliana]
            gi|13872710|emb|CAC37622.1| polyprotein [Arabidopsis
            thaliana]
          Length = 1334

 Score =  157 bits (398), Expect = 8e-36
 Identities = 121/475 (25%), Positives = 220/475 (46%), Gaps = 19/475 (4%)
 Frame = +1

Query: 7    ETQTKYAATEGSTSLLVNTRPTSDK----WIINSGVATYMIGNGEKLL-I*TEYKGSWMV 171
            E +  Y   E    L+ +     D+    W ++SG + +M G  E  L + + +K +  V
Sbjct: 268  EKEANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQN--V 325

Query: 172  VIADNMRFPIKSTGDSDCATEESTQGGVTKCHALGKDKEEFVVC-VTTNNLRYLRGIRPD 348
             + D+ R  ++  G      +   Q         G     F V  +    LR++  I  D
Sbjct: 326  RLGDDRRMAVEGKGKLRLEVDGRIQVISDVYFVPGLKNNLFSVGQLQQKGLRFI--IEGD 383

Query: 349  DIKIYRNV--KIISTPILEGEKQEFVYVMSVETAYIDKTRKKEI----TDLWHARFEHFS 510
              +++     +++    +   +   V+    ++   ++TR  ++     ++WH RF H +
Sbjct: 384  VCEVWHKTEKRMVMHSTMTKNRMFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLN 443

Query: 511  YXXXXXXXXXXXXXXXPNLEV-RDETICAGCQNGKA--HQVPYYESKFRARQPLELVHSD 681
            +               P  ++  +E +C  C  GK     +P  ES +++ Q L+LVH+D
Sbjct: 444  HQGLRSLAEKEMVKGLPKFDLGEEEAVCDICLKGKQIRESIPK-ESAWKSTQVLQLVHTD 502

Query: 682  VFGPVRCLSVGGARW*LSSTI-IKGMCGHIL*KKNQKLLEYLSRSRKKLNGK*EEKFRCL 858
            + GP+   S  G R+ L+        C   L  +  +  ++    + ++  +  +K  CL
Sbjct: 503  ICGPINPASTSGKRYILNFIDDFSRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCL 562

Query: 859  RPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*SMLHAKNALP 1032
            R D   E+ S+EF E+ +E   +RQLT + TP+Q  VA+R NR +      ML   +   
Sbjct: 563  RSDRGGEYNSREFDEYCKEFGIKRQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPR 622

Query: 1033 EYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNI-RPTVNHFRVFGSVYYVFVPDHLRMK 1209
            ++W   ++   +++NR P +K +   T E+ W+  +P+V H R+FGS+ Y  VP   R+K
Sbjct: 623  KFWPEAVQYAVYILNRSP-SKALNDITPEEKWSSWKPSVEHLRIFGSLAYALVPYQKRIK 681

Query: 1210 FDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSWWPLEKMRTEDI 1374
             D+K+I+C+  G     K +R  DP TG+  +SRDV F+E   W   +K   E++
Sbjct: 682  LDEKSIKCVMFGVSKESKAYRLYDPATGKILISRDVQFDEERGWEWEDKSLEEEL 736


>gb|AAG51247.1|AC055769_6 copia-type polyprotein, putative; 28768-32772 [Arabidopsis thaliana]
          Length = 1334

 Score =  157 bits (398), Expect = 8e-36
 Identities = 121/475 (25%), Positives = 220/475 (46%), Gaps = 19/475 (4%)
 Frame = +1

Query: 7    ETQTKYAATEGSTSLLVNTRPTSDK----WIINSGVATYMIGNGEKLL-I*TEYKGSWMV 171
            E +  Y   E    L+ +     D+    W ++SG + +M G  E  L + + +K +  V
Sbjct: 268  EKEANYVEMEEDLLLMAHVEQIGDEEKQIWFLDSGCSNHMCGTREWFLELDSGFKQN--V 325

Query: 172  VIADNMRFPIKSTGDSDCATEESTQGGVTKCHALGKDKEEFVVC-VTTNNLRYLRGIRPD 348
             + D+ R  ++  G      +   Q         G     F V  +    LR++  I  D
Sbjct: 326  RLGDDRRMAVEGKGKLRLEVDGRIQVISDVYFVPGLKNNLFSVGQLQQKGLRFI--IEGD 383

Query: 349  DIKIYRNV--KIISTPILEGEKQEFVYVMSVETAYIDKTRKKEI----TDLWHARFEHFS 510
              +++     +++    +   +   V+    ++   ++TR  ++     ++WH RF H +
Sbjct: 384  VCEVWHKTEKRMVMHSTMTKNRMFVVFAAVKKSKETEETRCLQVIGKANNMWHKRFGHLN 443

Query: 511  YXXXXXXXXXXXXXXXPNLEV-RDETICAGCQNGKA--HQVPYYESKFRARQPLELVHSD 681
            +               P  ++  +E +C  C  GK     +P  ES +++ Q L+LVH+D
Sbjct: 444  HQGLRSLAEKEMVKGLPKFDLGEEEAVCDICLKGKQIRESIPK-ESAWKSTQVLQLVHTD 502

Query: 682  VFGPVRCLSVGGARW*LSSTI-IKGMCGHIL*KKNQKLLEYLSRSRKKLNGK*EEKFRCL 858
            + GP+   S  G R+ L+        C   L  +  +  ++    + ++  +  +K  CL
Sbjct: 503  ICGPINPASTSGKRYILNFIDDFSRKCWTYLLSEKSETFQFFKEFKAEVERESGKKLVCL 562

Query: 859  RPDNKREFISQEFSEFLQENRKRRQLTCSKTPKQ--VAKRTNRQLAETC*SMLHAKNALP 1032
            R D   E+ S+EF E+ +E   +RQLT + TP+Q  VA+R NR +      ML   +   
Sbjct: 563  RSDRGGEYNSREFDEYCKEFGIKRQLTAAYTPQQNGVAERKNRSVMNMTRCMLMEMSVPR 622

Query: 1033 EYWD*CIRTTAHVINRMPQAKFILKSTLEKLWNI-RPTVNHFRVFGSVYYVFVPDHLRMK 1209
            ++W   ++   +++NR P +K +   T E+ W+  +P+V H R+FGS+ Y  VP   R+K
Sbjct: 623  KFWPEAVQYAVYILNRSP-SKALNDITPEEKWSSWKPSVEHLRIFGSLAYALVPYQKRIK 681

Query: 1210 FDKKAIRCIFVGYDSGRKGWRCSDPITGRGYVSRDVVFNEASSWWPLEKMRTEDI 1374
             D+K+I+C+  G     K +R  DP TG+  +SRDV F+E   W   +K   E++
Sbjct: 682  LDEKSIKCVMFGVSKESKAYRLYDPATGKILISRDVQFDEERGWEWEDKSLEEEL 736

