BLASTX nr result

ID: Papaver25_contig00005631 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00005631
         (2243 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN81355.1| hypothetical protein VITISV_039158 [Vitis vinifera]   213   2e-53
gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana]           199   5e-53
emb|CAN61322.1| hypothetical protein VITISV_012106 [Vitis vinifera]   207   1e-51
gb|AAK62788.1|AC027036_9 polyprotein, putative [Arabidopsis thal...   202   2e-50
gb|AAK62793.1|AC027036_14 polyprotein, putative [Arabidopsis tha...   202   2e-50
gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Ar...   194   5e-50
dbj|BAK41511.1| polyprotein [Arabidopsis thaliana]                    201   7e-50
emb|CDH30699.1| putative Ty1-copia-like retrotransposon [Cercis ...   194   2e-49
emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera]   200   3e-49
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi...   188   3e-49
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   197   2e-47
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi...   185   3e-47
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   184   9e-47
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         194   1e-46
dbj|BAK41512.1| C-end truncated polyprotein [Arabidopsis thaliana]    189   3e-46
emb|CAN77295.1| hypothetical protein VITISV_005638 [Vitis vinifera]   193   3e-46
emb|CAB40035.1| retrotransposon like protein [Arabidopsis thalia...   184   1e-43
emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera]   181   2e-42
emb|CAN63649.1| hypothetical protein VITISV_037657 [Vitis vinifera]   172   5e-40
dbj|BAA78423.1| polyprotein [Arabidopsis thaliana]                    167   2e-38

>emb|CAN81355.1| hypothetical protein VITISV_039158 [Vitis vinifera]
          Length = 1402

 Score =  213 bits (543), Expect(2) = 2e-53
 Identities = 161/581 (27%), Positives = 260/581 (44%), Gaps = 5/581 (0%)
 Frame = -1

Query: 1730 FVRFVDG-SIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGKSI 1554
            F  F+DG S+ P+ +         + P ++  R  D+ + S I ++++P +  +++G + 
Sbjct: 58   FEDFIDGTSVCPEKELRPGE----INPAFVAXRRQDRTILSWIYSSLTPGIMAQIIGHNS 113

Query: 1553 ARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1374
            +   W  L KIF+    AR  QL  +  S ++G  S+ DY+ ++K  +DSLA IGEPV +
Sbjct: 114  SHSAWNALEKIFSSCSRARIMQLXLEFQSTKKGSMSMIDYIMKVKGAADSLAAIGEPVSE 173

Query: 1373 DDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI--- 1203
             D +M  L GLGS+Y   V  +  RE  +S   + S L+  EQ L+ Q +    P +   
Sbjct: 174  QDQIMNLLGGLGSDYNAVVTAINIREDKISLEAVHSMLLAFEQRLEQQGSIEQLPAMSAN 233

Query: 1202 -ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKK 1026
             A  SN+    RK                                         G   + 
Sbjct: 234  YASXSNNRGGGRKYNG----------------------GRGPNFMMTNSNFRGRGRGXRY 271

Query: 1025 GEFNPNLTVDYGEIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALRFHSWS 846
            G+     +        Q+C K GH     Y+R+  +  ++       ++  N+    +  
Sbjct: 272  GQSGRQNSSSSERPQCQLCGKFGHTVQVCYHRFDITFQSTQNNTTGVSNSGNS----NXM 327

Query: 845  PAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQA 666
            PA                    S N     W  DSGA+ H+T N A +T+A  + G ++ 
Sbjct: 328  PA----------------MVAXSNNXADDNWYLDSGASHHLTQNVANLTNATPYTGADKV 371

Query: 665  MVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFP 486
             +G+GK                   SF L  V +VP I  NL S+A F  +N+   EF  
Sbjct: 372  TIGNGKHLTISNTXFTRLFS--NPHSFQLKKVFHVPFISANLISVAKFCSDNNALIEFHS 429

Query: 485  WGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLW 306
             G+ +K L +++VLA+G   + LY     + ++   +    ++N +T           LW
Sbjct: 430  NGFFLKDLHTKRVLAQGKLENGLYKFPVISNKKTAYVG---ITNDSTFQCSNIENKRELW 486

Query: 305  NQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVH 126
            + RLGH  + ++  +     +    K   +VC+SCQL KSH LP  +S   A +PL LV+
Sbjct: 487  HHRLGHAATDIVTRIMHNCNVSCG-KYKATVCSSCQLAKSHRLPTHLSSFHASKPLELVY 545

Query: 125  CDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSVSL 3
             DIWGPA   S  G KYFI+FVDD+S+Y W++ ++ +  +L
Sbjct: 546  TDIWGPASVTSTSGAKYFILFVDDYSRYTWLYLLQSKDQAL 586



 Score = 25.0 bits (53), Expect(2) = 2e-53
 Identities = 6/20 (30%), Positives = 16/20 (80%)
 Frame = -2

Query: 1795 KLDGSNFLVWKDQLSSILIS 1736
            KLD +N+++W+ Q+ +++ +
Sbjct: 36   KLDRTNYILWRSQIDNVIFA 55


>gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  199 bits (507), Expect(2) = 5e-53
 Identities = 154/586 (26%), Positives = 255/586 (43%), Gaps = 11/586 (1%)
 Frame = -1

Query: 1736 YQFVRFVDGSIEPQPQFLN----HNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATEL 1569
            ++ + FV+G I P P+ LN      +V V  P Y  W   DQ + S +  T+S  +   +
Sbjct: 40   HKLIGFVNGGITPPPRTLNVVTGDTSVDVANPQYESWFCTDQLIRSWLFGTLSEEVLGYV 99

Query: 1568 LGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIG 1389
                 +RD W+ L++ F +   AR+  LR  L  + +   ++  Y  +   + D+L+ IG
Sbjct: 100  HNLQTSRDIWISLAENFNKSSVAREFTLRRTLQLLSKKDKTLSAYCREFIAVCDALSSIG 159

Query: 1388 EPVQDDDLVMYTLSGLGSEYAHFVITMQN---REVPLSFAKLRSRLINHEQWLKDQENAI 1218
            +PV +   +   L+GLG EY      +Q+   +  P +F  + S +   +  L+  E ++
Sbjct: 160  KPVDESMKIFGFLNGLGREYDPITTVIQSSLSKISPPTFRDVISEVKGFDVKLQSYEESV 219

Query: 1217 YSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGF 1038
                 A+P + AF  ++ + T                                     G 
Sbjct: 220  ----TANP-HMAFNTQRSEYT----------DNYTSGNRGKGRGGYGQNRGRSGYSTRGR 264

Query: 1037 QFKKGEFNPNLTVDYGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALR 861
             F + + N N T   GE P  QIC + GH A   Y R+             + S + A  
Sbjct: 265  GFSQHQTNSNNT---GERPVCQICGRTGHTALKCYNRF----------DHNYQSVDTAQA 311

Query: 860  FHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFI 681
            F S   ++S                       G  W+PDS AT+H+T++T  +  A  + 
Sbjct: 312  FSSLRVSDS----------------------SGKEWVPDSAATAHVTSSTNNLQAASPYN 349

Query: 680  GDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCS 501
            G +  +VGDG                +G  +  L+ VL  P I+ +L S++    +  C 
Sbjct: 350  GSDTVLVGDGAYLPITHVGSTTISSDSG--TLPLNEVLVCPDIQKSLLSVSKLCDDYPCG 407

Query: 500  YEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNTXXXXX 330
              F      I  + ++KV+++GP ++ LY +++    A   N   +AS            
Sbjct: 408  VYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQCAAS------------ 455

Query: 329  XXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRA 150
                  +W+ RLGH  S ++  L ++  I  +      VC  CQ+GKS  L F  S SR 
Sbjct: 456  ----EEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFSSNSRE 511

Query: 149  CQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
               L  +HCD+WGP+P +S  GFKY++VFVDD+S+Y+W +P+K +S
Sbjct: 512  LDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKS 557



 Score = 37.7 bits (86), Expect(2) = 5e-53
 Identities = 19/43 (44%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
 Frame = -2

Query: 1834 PLPFPN---VSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            P PFP+   VS+ V+ KL+ SN+L+WK Q  S+L    L G +
Sbjct: 4    PYPFPDNVHVSSSVTLKLNDSNYLLWKTQFESLLSCHKLIGFV 46


>emb|CAN61322.1| hypothetical protein VITISV_012106 [Vitis vinifera]
          Length = 1432

 Score =  207 bits (527), Expect(2) = 1e-51
 Identities = 166/581 (28%), Positives = 262/581 (45%), Gaps = 5/581 (0%)
 Frame = -1

Query: 1730 FVRFVDG-SIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGKSI 1554
            F  F+DG SI P+       +  V+ P ++ WR  D+ + S I ++++P +  +++G + 
Sbjct: 61   FEDFIDGTSICPEKDL----SPGVMNPAFVAWRRQDRTILSWIYSSLTPGIMAQIIGHNT 116

Query: 1553 ARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1374
            +   W  L  IF+    AR  QLR +L S ++G  S+ DY+ ++K  +D+LA IGEPV +
Sbjct: 117  SHSAWNALESIFSSSSRARIMQLRLELQSTKKGSMSMIDYIMKIKGAADNLAAIGEPVSE 176

Query: 1373 DDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAI--YSPLIA 1200
             D VM  L GLGS+Y   V  +  R+  +S   + S L+  E  L +Q+++I   S   A
Sbjct: 177  QDQVMNLLGGLGSDYNAVVTAINIRDDKISLEAIHSMLLAFEHRL-EQQSSIEQMSANYA 235

Query: 1199 DPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGE 1020
              SN+    RK                                         G   + G+
Sbjct: 236  SSSNNRGGGRKFNG--------------------GRGQGYSPNNNNYTYRGRGRGGRNGQ 275

Query: 1019 FNPNLTVDYGEIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALRFHSWSPA 840
                 +    +   Q+C K GH A   Y+R+  S       Q    +  ++L   + +  
Sbjct: 276  GGRQNSSPSEKPQCQLCGKFGHTAQICYHRFDISF------QGGQTTISHSLNNGNQNNI 329

Query: 839  ESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMV 660
             +M                 S N     W  DSGA+ H+T N   +T    + G ++  +
Sbjct: 330  PAMVASA-------------SNNPADESWYLDSGASHHLTQNLGNLTSTSPYTGTDKVTI 376

Query: 659  GDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWG 480
            G+GK               T   SF L  V +VP I  NL S+A F  EN+   EF    
Sbjct: 377  GNGKHLSISNIGSKQLHSHTH--SFRLKKVFHVPFISANLISVAKFCSENNALIEFHSNA 434

Query: 479  YEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLW 306
            + +K L ++ VLA+G   + LY  P+ S+    ++  +AS     ++           LW
Sbjct: 435  FFVKDLHTKMVLAQGKLENGLYKFPVFSNLKPYSSINNASAF---HSQFSSTVENKAELW 491

Query: 305  NQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVH 126
            + RLGH    +++ +  T  +  S K    VC+ CQL KSH LP  +S   A +PL LV+
Sbjct: 492  HNRLGHASFDIVSKVMNTCNVA-SGKYKSFVCSDCQLAKSHRLPTQLSNFHASKPLELVY 550

Query: 125  CDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSVSL 3
             DIWGPA   S  G +YFI+FVDD+S+Y W + ++ +  +L
Sbjct: 551  TDIWGPASIKSTSGARYFILFVDDYSRYTWFYSLQTKDQAL 591



 Score = 25.4 bits (54), Expect(2) = 1e-51
 Identities = 6/27 (22%), Positives = 20/27 (74%)
 Frame = -2

Query: 1816 VSNFVSTKLDGSNFLVWKDQLSSILIS 1736
            +++ +  KLD +N+++W+ Q+ +++ +
Sbjct: 32   LNHTLPVKLDRTNYILWRSQIDNVIFA 58


>gb|AAK62788.1|AC027036_9 polyprotein, putative [Arabidopsis thaliana]
            gi|18265373|dbj|BAB84015.1| polyprotein [Arabidopsis
            thaliana]
          Length = 1466

 Score =  202 bits (515), Expect(2) = 2e-50
 Identities = 150/583 (25%), Positives = 251/583 (43%), Gaps = 6/583 (1%)
 Frame = -1

Query: 1742 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1563
            D Y+   F+DGS    P  +  +  P V P Y  W+  D+ + S +   IS S+   +  
Sbjct: 43   DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSR 102

Query: 1562 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1383
             + A   W  L KI+    +   +QLR QL    +G  +I DY+  L T  D LA +G+P
Sbjct: 103  ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKP 162

Query: 1382 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1203
            +  D+ V   L  L  EY   +  +  ++ P +  ++  RL+NHE  +    +A   P+ 
Sbjct: 163  MDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPIT 222

Query: 1202 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1023
            A+  +         N                                       +Q    
Sbjct: 223  ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 263

Query: 1022 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 852
             F+PN       +   QIC  +GH A   +    ++ S+N+  P             F  
Sbjct: 264  NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP----------FTP 313

Query: 851  WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 672
            W P  ++A               +S N+    W+ DSGAT H+T++   ++    + G +
Sbjct: 314  WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 358

Query: 671  QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 492
              MV DG                      +LHN+LYVP+I  NL S+      N  S EF
Sbjct: 359  DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 416

Query: 491  FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 318
            FP  +++K L +   L +G    ELY  PI SS   +  +L AS  S +           
Sbjct: 417  FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 463

Query: 317  XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 141
               W+ RLGHP  +++N++ +  ++ + N   + + C+ C + KS+ +PF  S   + +P
Sbjct: 464  HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 523

Query: 140  LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            L  ++ D+W  +P +SH  ++Y+++FVD F++Y W++P+K +S
Sbjct: 524  LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKS 565



 Score = 25.8 bits (55), Expect(2) = 2e-50
 Identities = 13/35 (37%), Positives = 20/35 (57%)
 Frame = -2

Query: 1819 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            N+SN   TKL  +N+L+W  Q+ ++     L G L
Sbjct: 19   NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51


>gb|AAK62793.1|AC027036_14 polyprotein, putative [Arabidopsis thaliana]
            gi|338746561|dbj|BAK41510.1| polyprotein [Arabidopsis
            thaliana]
          Length = 1466

 Score =  202 bits (515), Expect(2) = 2e-50
 Identities = 150/583 (25%), Positives = 251/583 (43%), Gaps = 6/583 (1%)
 Frame = -1

Query: 1742 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1563
            D Y+   F+DGS    P  +  +  P V P Y  W+  D+ + S +   IS S+   +  
Sbjct: 43   DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSR 102

Query: 1562 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1383
             + A   W  L KI+    +   +QLR QL    +G  +I DY+  L T  D LA +G+P
Sbjct: 103  ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKP 162

Query: 1382 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1203
            +  D+ V   L  L  EY   +  +  ++ P +  ++  RL+NHE  +    +A   P+ 
Sbjct: 163  MDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPIT 222

Query: 1202 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1023
            A+  +         N                                       +Q    
Sbjct: 223  ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 263

Query: 1022 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 852
             F+PN       +   QIC  +GH A   +    ++ S+N+  P             F  
Sbjct: 264  NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP----------FTP 313

Query: 851  WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 672
            W P  ++A               +S N+    W+ DSGAT H+T++   ++    + G +
Sbjct: 314  WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 358

Query: 671  QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 492
              MV DG                      +LHN+LYVP+I  NL S+      N  S EF
Sbjct: 359  DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 416

Query: 491  FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 318
            FP  +++K L +   L +G    ELY  PI SS   +  +L AS  S +           
Sbjct: 417  FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 463

Query: 317  XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 141
               W+ RLGHP  +++N++ +  ++ + N   + + C+ C + KS+ +PF  S   + +P
Sbjct: 464  HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 523

Query: 140  LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            L  ++ D+W  +P +SH  ++Y+++FVD F++Y W++P+K +S
Sbjct: 524  LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKS 565



 Score = 25.8 bits (55), Expect(2) = 2e-50
 Identities = 13/35 (37%), Positives = 20/35 (57%)
 Frame = -2

Query: 1819 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            N+SN   TKL  +N+L+W  Q+ ++     L G L
Sbjct: 19   NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51


>gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Arabidopsis thaliana]
          Length = 1522

 Score =  194 bits (494), Expect(2) = 5e-50
 Identities = 151/584 (25%), Positives = 253/584 (43%), Gaps = 14/584 (2%)
 Frame = -1

Query: 1721 FVDGSIEP--QPQFLNHNNVPVVYPI--YLEWRTLDQFVGSCINATISPSLATELLGKSI 1554
            FV GSI    Q + + HNNV    P   +  W   DQ V S +  + +  + + ++    
Sbjct: 43   FVTGSISAPAQTRSVTHNNVTSEEPNPEFYTWHQTDQVVKSWLLGSFAEDILSVVVNCFT 102

Query: 1553 ARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1374
            +   WL L+  F +   +R  +L+ +L ++++   ++  +L  LK I D LA +G PV +
Sbjct: 103  SHQVWLTLANHFNRVSSSRLFELQRRLQTLEKKDNTMEVFLKDLKHICDQLASVGSPVPE 162

Query: 1373 DDLVMYTLSGLGSEYAHFVITMQNR---EVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1203
               +   L+GLG EY     T++N       LS  ++ S+L  ++  L   ++ +  P I
Sbjct: 163  KMKIFSALNGLGREYEPIKTTIENSVDSNPSLSLDEVASKLRGYDDRL---QSYVTEPTI 219

Query: 1202 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1023
            +   + AF V    + +                                     F  +  
Sbjct: 220  SP--HVAFNVTHSDSGYYHNNNRGKGRSNSGSGKS------------------SFSTRGR 259

Query: 1022 EFNPNLTVDYGE------IPSQICNKKGHFANTFYYRYVPSMNNSP-PMQKAFASFENAL 864
             F+  ++   G       +  QIC K GH A   ++R+  S  +   PM  A     +  
Sbjct: 260  GFHQQISPTSGSQAGNSGLVCQICGKAGHHALKCWHRFDNSYQHEDLPMALATMRITDVT 319

Query: 863  RFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEF 684
              H                              G  WIPDS A++H+TNN  ++  +  +
Sbjct: 320  DHH------------------------------GHEWIPDSAASAHVTNNRHVLQQSQPY 349

Query: 683  IGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSC 504
             G +  MV DG                +G     L  VL  P I  +L S++  T +  C
Sbjct: 350  HGSDSIMVADGNFLPITHTGSGSIASSSG--KIPLKEVLVCPDIVKSLLSVSKLTSDYPC 407

Query: 503  SYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXXXX 324
            S EF      I    ++K+L  G N   LY ++   L+    +  S   NS +       
Sbjct: 408  SVEFDADSVRINDKATKKLLVMGRNRDGLYSLEEPKLQ----VLYSTRQNSASSEV---- 459

Query: 323  XXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQ 144
                 W++RLGH  + V++ L ++ +I + NKV ++VC +C LGKS  LPF++S   A +
Sbjct: 460  -----WHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASR 514

Query: 143  PLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            PL  +HCD+WGP+P+ S  GF+Y++VF+D +S++ W +P+K +S
Sbjct: 515  PLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKS 558



 Score = 32.7 bits (73), Expect(2) = 5e-50
 Identities = 14/39 (35%), Positives = 22/39 (56%)
 Frame = -2

Query: 1831 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            +P  N+SN V+  L+  N+++WK Q  S L    L G +
Sbjct: 6    VPPLNISNCVTVTLNQQNYILWKSQFESFLSGQGLLGFV 44


>dbj|BAK41511.1| polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  201 bits (511), Expect(2) = 7e-50
 Identities = 149/583 (25%), Positives = 250/583 (42%), Gaps = 6/583 (1%)
 Frame = -1

Query: 1742 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1563
            D Y+   F+DGS    P  +  +  P V P Y  W+  D+ + S +   IS S+   +  
Sbjct: 43   DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSR 102

Query: 1562 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1383
             + A   W  L KI+    +   +QLR QL    +G  +I DY+    T  D LA +G+P
Sbjct: 103  ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGFVTRFDQLALLGKP 162

Query: 1382 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1203
            +  D+ V   L  L  EY   +  +  ++ P +  ++  RL+NHE  +    +A   P+ 
Sbjct: 163  MDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPIT 222

Query: 1202 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1023
            A+  +         N                                       +Q    
Sbjct: 223  ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 263

Query: 1022 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 852
             F+PN       +   QIC  +GH A   +    ++ S+N+  P             F  
Sbjct: 264  NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSP----------FTP 313

Query: 851  WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 672
            W P  ++A               +S N+    W+ DSGAT H+T++   ++    + G +
Sbjct: 314  WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 358

Query: 671  QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 492
              MV DG                      +LHN+LYVP+I  NL S+      N  S EF
Sbjct: 359  DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 416

Query: 491  FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 318
            FP  +++K L +   L +G    ELY  PI SS   +  +L AS  S +           
Sbjct: 417  FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 463

Query: 317  XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 141
               W+ RLGHP  +++N++ +  ++ + N   + + C+ C + KS+ +PF  S   + +P
Sbjct: 464  HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 523

Query: 140  LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            L  ++ D+W  +P +SH  ++Y+++FVD F++Y W++P+K +S
Sbjct: 524  LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKS 565



 Score = 25.8 bits (55), Expect(2) = 7e-50
 Identities = 13/35 (37%), Positives = 20/35 (57%)
 Frame = -2

Query: 1819 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            N+SN   TKL  +N+L+W  Q+ ++     L G L
Sbjct: 19   NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51


>emb|CDH30699.1| putative Ty1-copia-like retrotransposon [Cercis chinensis]
          Length = 646

 Score =  194 bits (492), Expect(2) = 2e-49
 Identities = 162/569 (28%), Positives = 259/569 (45%), Gaps = 10/569 (1%)
 Frame = -1

Query: 1679 HNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFA 1500
            H +V ++ P Y+ W+  ++ V S I ++++  + T+++  + A + W  L + +     A
Sbjct: 95   HGSV-ILNPEYVLWQRQNRLVMSWIYSSLTEQMMTQIMAYNSACEIWTALRESYASASRA 153

Query: 1499 RKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHF 1320
            R  +LR QL + ++G  S+ DY+ +++ I D L  IGE V  DD VM  L+GLGSEY   
Sbjct: 154  RIMELRLQLQTTRKGGLSVMDYMLRIQHICDHLRAIGESVSIDDQVMAVLAGLGSEYNPI 213

Query: 1319 VITMQNREVPLSFAKLRSRLINHEQWLKDQENAI-YSPLIADPS--NSAFFVRK--QQNT 1155
            V ++ +R   +S   L+S L  +E+ L+ Q +   + PL A+ +  NS    RK  QQN 
Sbjct: 214  VASITSRLDSISVQALQSYLETYEKRLEIQNSVEQHIPLQANAAMYNSNRGKRKPYQQN- 272

Query: 1154 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPS- 978
                                                 G    KG  NP      G  P  
Sbjct: 273  ------------FHHSQTPVTATHNFNHRNSSQGGFRGGHSNKGRANP------GSRPQC 314

Query: 977  QICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXX 798
            QIC K GH A   ++R+    N   P     AS  N     +  P   MA          
Sbjct: 315  QICCKIGHVATECWHRFD---NQFQPK----ASHSNQFSSSAQDPQALMAAPGLLGETP- 366

Query: 797  XXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXX 618
                          W  D+GAT H+T++ A ++    F GD++ +VG+GK          
Sbjct: 367  --------------WFLDTGATHHVTSDLANLSLHNPFSGDDKVIVGNGKGLYVLHTGHS 412

Query: 617  XXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLAR 438
                  G  +  L NVL+VP I  NL S+     +N+   EF+P  + +K   ++ +L +
Sbjct: 413  SIPTSQG--ALLLKNVLHVPKIAANLVSVQKLCHDNNAYVEFYPSYFAVKDQKTQAILLK 470

Query: 437  GPNASELYPIKSS-ALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVIN-- 267
            G     LY + S+ +     A    I S S++            W+ RLGHP   ++N  
Sbjct: 471  GGLDKGLYSVPSAYSSHAPQARVFQIFSTSSSTSIDSMRL----WHNRLGHPSLAIVNKV 526

Query: 266  -NLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISH 90
             N +      L NK+   +C SCQL KSH LPF+ + S+A +P  LVH D+WG    +S 
Sbjct: 527  LNHYNLPVFSLQNKL---LCDSCQLAKSHKLPFVRNYSKAMKPFDLVHADLWGSPSCLSV 583

Query: 89   LGFKYFIVFVDDFSKYNWIFPMKCRSVSL 3
             G  YF++ +DD+S+++W++ ++ +  +L
Sbjct: 584  NGACYFLLLIDDYSRFSWLYLLQSKDETL 612



 Score = 32.0 bits (71), Expect(2) = 2e-49
 Identities = 12/36 (33%), Positives = 23/36 (63%)
 Frame = -2

Query: 1822 PNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            P     ++ KLD +NFL+W++QL +I+++     +L
Sbjct: 39   PTFGQTLTIKLDRNNFLIWRNQLLNIVVANGYEDIL 74


>emb|CAN68489.1| hypothetical protein VITISV_037543 [Vitis vinifera]
          Length = 1449

 Score =  200 bits (508), Expect(2) = 3e-49
 Identities = 140/556 (25%), Positives = 248/556 (44%), Gaps = 5/556 (0%)
 Frame = -1

Query: 1655 PIYLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQ 1476
            P ++ WR  D+ + S I ++++P +  +++G   +   W  L   F     AR  QLR +
Sbjct: 144  PDFVMWRRFDRMILSWIYSSLTPEIMGQIVGYQSSHAXWFALEXXFXASSRARVMQLRLE 203

Query: 1475 LHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNRE 1296
              + ++G  ++ +Y+ +LK+++D+LA IGEPV D D ++  L GLG++Y   V ++  RE
Sbjct: 204  FQTTRKGSLTMMEYILKLKSLADNLAAIGEPVTDRDQILQLLGGLGADYNSIVASLTARE 263

Query: 1295 VPLSFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXX 1116
                           ++     E+ + S  +A P    F  ++                 
Sbjct: 264  ---------------DEDNSVAEDNVISANLATPQYQHFNNKRSSGQ------------- 295

Query: 1115 XXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANTFY 936
                                    GF  ++G               Q+C K GH     Y
Sbjct: 296  --------------------NRQSGFNTRRGTNGGRSQSSQHRPQCQLCGKFGHTVVRCY 335

Query: 935  YRYVPSMNNSPP----MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCND 768
            +R+  +     P    +Q    + +N ++    SP+                    + +D
Sbjct: 336  HRFDINFQGYNPNMDTVQTNKPNAKNQVQAMMASPS--------------------TISD 375

Query: 767  QGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTS 588
            +   W  D+GAT H++ +   ++D   ++G+++ +VG+GK                   +
Sbjct: 376  EA--WFFDTGATHHLSQSIDPLSDVQPYMGNDKVIVGNGKHLRILHTGTTFFPS--SSKT 431

Query: 587  FDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPI 408
            F L  VL+VP I  NL S++ F  +N+  +EF P  + +K   ++K+L +G     LY  
Sbjct: 432  FQLRQVLHVPDIATNLISVSQFCADNNTFFEFHPRFFFVKDQVTKKILLQGSLEHGLYRF 491

Query: 407  KSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNK 228
             +  +    A  +S    S+             W+ RLGHP   ++ ++ T+    +S++
Sbjct: 492  PARFVPSPAAFVSSSYDRSSNLSLTTTTTL---WHSRLGHPADNILKHILTS--CNISHQ 546

Query: 227  VFES-VCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDF 51
              ++ VC +CQ  KSH LPF V  SRA  PL+L+H D+WGP    S  G +YFI+FVDDF
Sbjct: 547  CHKNNVCCACQFAKSHKLPFNVXVSRASHPLALLHADLWGPXSIPSTTGARYFILFVDDF 606

Query: 50   SKYNWIFPMKCRSVSL 3
            S+++WI+P+  +  +L
Sbjct: 607  SRFSWIYPLHSKDQAL 622



 Score = 24.6 bits (52), Expect(2) = 3e-49
 Identities = 9/34 (26%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
 Frame = -2

Query: 1795 KLDGSNFLVWKDQLSSILIST----NLYGLLMVP 1706
            KLD +N+++W+ Q+ +++ +     ++ GL + P
Sbjct: 100  KLDRNNYILWRTQMENVVFANGFEDHIEGLKICP 133


>gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1149

 Score =  188 bits (478), Expect(2) = 3e-49
 Identities = 149/553 (26%), Positives = 245/553 (44%), Gaps = 5/553 (0%)
 Frame = -1

Query: 1655 PIYLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQ 1476
            P Y  W   DQ V       +S  + + ++G   + + W++L+K F +   +R  +L+ +
Sbjct: 74   PDYQAWFRSDQVV-------MSEDILSVVVGSKTSHEVWMNLAKHFNRISSSRIFELQRR 126

Query: 1475 LHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNRE 1296
            LHS+ +   ++ +YL  LKTI D LA +G PV +   +   + GL  EY   + +++   
Sbjct: 127  LHSLSKEGKTMEEYLRYLKTICDQLASVGSPVAEKMKIFAMVHGLTREYEPLITSLEGTL 186

Query: 1295 VPL---SFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXX 1125
                  S+  +  RL N +  L+       SP +A             NTF         
Sbjct: 187  DAFPGPSYEDVVYRLKNFDDRLQGYTVTDVSPHLAF------------NTFRSSNRGRGG 234

Query: 1124 XXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPS-QICNKKGHFA 948
                                       G  F++   + + +V   E P  QIC K+GH+A
Sbjct: 235  RNNRGKGNFSTR---------------GRGFQQQFSSSSSSVSASEKPMCQICGKRGHYA 279

Query: 947  NTFYYRYVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCND 768
               ++R+  S  +S     AF+    AL     S                        +D
Sbjct: 280  LQCWHRFDDSYQHSEAAAAAFS----ALHITDVS------------------------DD 311

Query: 767  QGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTS 588
             G  W+PDS AT+H+TNN++ +     ++G++  M  DG                +G+  
Sbjct: 312  SG--WVPDSAATAHITNNSSRLQQMQPYLGNDTVMASDGNFLPITHIGSANLPSTSGN-- 367

Query: 587  FDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASE-LYP 411
              L +VL  P+I  +L S++  T +  CS+ F   G  +K   + KVL +G + SE LY 
Sbjct: 368  LPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTFDADGVLVKDKATCKVLTKGSSTSEGLYK 427

Query: 410  IKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSN 231
            +++   +   +      ++              +W+ RLGHP   V+  L    AIQ+ N
Sbjct: 428  LENPKFQMFYSTRQVKATDE-------------VWHMRLGHPNPQVLQLLANKKAIQI-N 473

Query: 230  KVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDF 51
            K    +C SC+LGKS  LPF+ S   A +PL  VHCD+WGPAP  S  GF+Y+++F+D+ 
Sbjct: 474  KSTSKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPVSSIQGFQYYVIFIDNR 533

Query: 50   SKYNWIFPMKCRS 12
            S++ W +P+K +S
Sbjct: 534  SRFCWFYPLKHKS 546



 Score = 36.2 bits (82), Expect(2) = 3e-49
 Identities = 16/39 (41%), Positives = 22/39 (56%)
 Frame = -2

Query: 1831 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            LP  N+SN V+ KL   N+++WK Q  S L    L G +
Sbjct: 10   LPSLNISNCVTVKLTDRNYILWKSQFESFLSGQGLLGFV 48


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078 [Arabidopsis
            thaliana]
          Length = 1415

 Score =  197 bits (501), Expect = 2e-47
 Identities = 155/589 (26%), Positives = 251/589 (42%), Gaps = 11/589 (1%)
 Frame = -1

Query: 1745 IDQYQFVRFVDGSIEPQPQFLNHNNVPVVY----PIYLEWRTLDQFVGSCINATISPSLA 1578
            +   + + FV+G++    Q     N  V      P+Y  W   DQ V S +  T+S  + 
Sbjct: 37   LSSQKLIGFVNGAVNAPSQSRLVVNGEVTSEEPNPLYESWFCTDQLVRSWLFGTLSEEVL 96

Query: 1577 TELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLA 1398
              +   S +R  W+ L++ F +   AR+  LR  L  + +       Y  + KTI D+L+
Sbjct: 97   GHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQLLSKKEKPFSVYCREFKTICDALS 156

Query: 1397 EIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINH---EQWLKDQE 1227
             IG+PV +   +   L+GLG +Y      +Q+     S +KL +   N    E    D +
Sbjct: 157  SIGKPVDESMKIFGFLNGLGRDYDPITTVIQS-----SLSKLPTPTFNDVVSEVQGFDSK 211

Query: 1226 NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1047
               Y    +   + AF + + ++                                     
Sbjct: 212  LQSYEEAASVTPHLAFNIERSES-----------GSPQYNPNQKGRGRSGQNKGRGGYST 260

Query: 1046 PGFQFKKGEFNPNLTVDYGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFEN 870
             G  F + + +P ++   G  P  QIC + GH A   Y R+    NN     +AF++   
Sbjct: 261  RGRGFSQHQSSPQVS---GPRPVCQICGRTGHTALKCYNRFD---NNYQAEIQAFSTLRV 314

Query: 869  ALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAV 690
            +                               +D G  W PDS AT+H+T++T  +  A 
Sbjct: 315  S-------------------------------DDTGKEWHPDSAATAHVTSSTNGLQSAT 343

Query: 689  EFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLEN 510
            E+ GD+  +VGDG                 G     L+ VL VP+I+ +L S++    + 
Sbjct: 344  EYEGDDAVLVGDGTYLPITHTGSTTIKSSNG--KIPLNEVLVVPNIQKSLLSVSKLCDDY 401

Query: 509  SCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNTXX 339
             C   F      I  L ++KV+  GP  + LY +++    AL  N   +A+         
Sbjct: 402  PCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAAT--------- 452

Query: 338  XXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSP 159
                     +W+ RLGH  S  + +L  + AIQ++      VC  CQ+GKS  LPFL+S 
Sbjct: 453  -------EEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISD 505

Query: 158  SRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            SR   PL  +HCD+WGP+P +S+ G KY+ +FVDD+S+Y+W +P+  +S
Sbjct: 506  SRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKS 554


>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  185 bits (469), Expect(2) = 3e-47
 Identities = 149/588 (25%), Positives = 242/588 (41%), Gaps = 12/588 (2%)
 Frame = -1

Query: 1739 QYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLGK 1560
            Q   V  +DGS    P            P Y  W   D+ V S +  +    + + ++  
Sbjct: 53   QTSVVSDIDGSTSASPN-----------PEYYTWFKTDRVVKSWLLGSFLEDILSVVVNC 101

Query: 1559 SIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPV 1380
            + + + W+ ++  F +   +R  +L+ +L ++ +   S+ +YL  LKTI D LA +G PV
Sbjct: 102  NTSHEVWISVANHFNRVSSSRLFELQRRLQNVSKRDKSMDEYLKDLKTICDQLASVGSPV 161

Query: 1379 QDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRS---RLINHEQWLKDQ-ENAIYS 1212
             +   +   L+GLG EY     T++N    L    L     +L  ++  L+   E    S
Sbjct: 162  TEKMKIFAALNGLGREYEPIKTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVS 221

Query: 1211 PLIA------DPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1050
            P +A      D SN++ +                                          
Sbjct: 222  PHVAFNITTSDDSNASGYFNAYNR---------------------GKGKSNRGRNSFSTR 260

Query: 1049 XPGFQFKKGEFNPNLTVDYG--EIPSQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASF 876
              GF  +    N +     G   +  QIC K GH                 P  K +  F
Sbjct: 261  GRGFHQQISSTNSSSGSQSGGTSVVCQICGKMGH-----------------PALKCWHRF 303

Query: 875  ENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTD 696
             N+ ++     A +                    +  G  W+PDS AT+H+TN+   +  
Sbjct: 304  NNSYQYEELPRALAAMRITDIT------------DQHGNEWLPDSAATAHVTNSPRSLQQ 351

Query: 695  AVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTL 516
            +  + G +  MV DG                +G+    L +VL  P I  +L S++  T 
Sbjct: 352  SQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVP--LTDVLVCPSITKSLLSVSKLTQ 409

Query: 515  ENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXX 336
            +  C+ EF   G  I    ++K+L  G     LY +K  + +     S    S S+    
Sbjct: 410  DYPCTVEFDSDGVRINDKATKKLLIMGSTCDGLYCLKDDS-QFKAFFSTRQQSASDEV-- 466

Query: 335  XXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPS 156
                     W++RLGHP   V+  L  T +I + NK  +S+C +CQLGKS  LPF+ S  
Sbjct: 467  ---------WHRRLGHPHPQVLQQLVKTNSISI-NKTSKSLCEACQLGKSTRLPFVSSSF 516

Query: 155  RACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
             + +PL  VHCD+WGP+P  S  GF+Y+ VF+D +S+++WI+P+K +S
Sbjct: 517  TSNRPLERVHCDLWGPSPITSVQGFRYYAVFIDHYSRFSWIYPLKLKS 564



 Score = 33.1 bits (74), Expect(2) = 3e-47
 Identities = 14/39 (35%), Positives = 21/39 (53%)
 Frame = -2

Query: 1831 LPFPNVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            +P  N+SN V+  L   N+++WK Q  S L    L G +
Sbjct: 6    VPSLNISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFV 44


>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
          Length = 1392

 Score =  184 bits (468), Expect(2) = 9e-47
 Identities = 144/551 (26%), Positives = 244/551 (44%), Gaps = 5/551 (0%)
 Frame = -1

Query: 1649 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1470
            +L+W  +DQ V + I  ++S      ++G + A++ WL L++ F +    RK  L+ +L 
Sbjct: 72   FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 131

Query: 1469 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1290
            +  +   ++  YL ++K I D L  IG PV + + +   L+GLG EY      +++    
Sbjct: 132  TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 191

Query: 1289 LS---FAKLRSRLINHEQWLKDQE-NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXX 1122
                 F  +  +L   +  L     N+  +P +A  ++ ++  R   N+           
Sbjct: 192  YPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNS----------- 240

Query: 1121 XXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANT 942
                                      GF  + G  + N + +  +   QIC K GH A  
Sbjct: 241  -------RGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFK 293

Query: 941  FYYRYVPSMNNSPP-MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ 765
             Y R+    N  P  +  AFA    A+R    + A S                       
Sbjct: 294  CYTRF--EENYLPEDLPNAFA----AMRVSDQNQASSHE--------------------- 326

Query: 764  GPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSF 585
               W+PDS AT+H+TN T  + ++  + GD+  +VG+G                 G  + 
Sbjct: 327  ---WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQG--TL 381

Query: 584  DLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIK 405
             L +VL  P I  +L S++  T +  CS+ F      IK   ++++L +G     LY +K
Sbjct: 382  PLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLK 441

Query: 404  SSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKV 225
                +  T  S    S+ +             W+QRLGHP   V+ +L  T AI + NK 
Sbjct: 442  DVPFQ--TYYSTRQQSSDDEV-----------WHQRLGHPNKEVLQHLIKTKAIVV-NKT 487

Query: 224  FESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSK 45
              ++C +CQ+GK   LPF+ S   + +PL  +HCD+WGPAP  S  GF+Y+++F+D++S+
Sbjct: 488  SSNMCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSR 547

Query: 44   YNWIFPMKCRS 12
            + W +P+K +S
Sbjct: 548  FTWFYPLKLKS 558



 Score = 32.0 bits (71), Expect(2) = 9e-47
 Identities = 15/35 (42%), Positives = 21/35 (60%)
 Frame = -2

Query: 1819 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            N+S  V+ KL  +N+L+WK Q  S L S  L G +
Sbjct: 11   NISQVVTLKLTPTNYLLWKTQFESYLSSHLLLGFV 45


>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  194 bits (494), Expect = 1e-46
 Identities = 159/591 (26%), Positives = 260/591 (43%), Gaps = 13/591 (2%)
 Frame = -1

Query: 1745 IDQYQFVRFVDGSIEP--QPQFLNHNNVP--VVYPIYLEWRTLDQFVGSCINATISPSLA 1578
            +   + + FV+G + P  Q + + +++V   V  P Y +W   DQ V S +  T+S  + 
Sbjct: 37   LSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPNPQYEDWFCTDQLVRSWLFGTLSEEVL 96

Query: 1577 TELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLA 1398
              +   + +R  W+ L++ F +   AR+  LR  L  + +   S+  Y    K I DSL+
Sbjct: 97   GHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKDKSLSVYCRDFKIICDSLS 156

Query: 1397 EIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINH---EQWLKDQE 1227
             IG+PV++   +   L+GLG EY      +Q+     S +KL +   N    E    D +
Sbjct: 157  SIGKPVEESMKIFGFLNGLGREYDPITTVIQS-----SLSKLPAPTFNDVISEVQGFDSK 211

Query: 1226 NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1047
               Y   ++   + AF   +  +                                     
Sbjct: 212  LQSYDDTVSVNPHLAFNTERSNS----------------GAPQYNSNSRGRGRSGQNRGR 255

Query: 1046 PGFQFKKGEFNPNLTVD--YGEIP-SQICNKKGHFANTFYYRYVPSMNNSPPMQKAFASF 876
             G+  +   F+ + +     G+ P  QIC + GH A   Y R+  +  +  P Q AF+  
Sbjct: 256  GGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNYQSEVPTQ-AFS-- 312

Query: 875  ENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTD 696
              ALR                             ++ G  W PDS AT+H+T +T+ + +
Sbjct: 313  --ALRVS---------------------------DETGKEWYPDSAATAHITASTSGLQN 343

Query: 695  AVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTL 516
            A  + G++  +VGDG                 G  +  L+ VL  P I+ +L S++    
Sbjct: 344  ATTYEGNDAVLVGDGTYLPITHVGSTTISSSKG--TIPLNEVLVCPAIQKSLLSVSKLCD 401

Query: 515  ENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS---ALRRNTALSASIMSNSNT 345
            +  C   F      I  L ++KV+++GP  + LY +++S   AL  N   +AS+ +    
Sbjct: 402  DYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMET---- 457

Query: 344  XXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLV 165
                        W+ RLGH  S ++  L T   IQ++      VC  CQ+GKS  L F  
Sbjct: 458  ------------WHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFS 505

Query: 164  SPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            S  RA +PL  VHCD+WGP+P +S+ GFKY+ VFVDDFS+++W FP++ +S
Sbjct: 506  SDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKS 556


>dbj|BAK41512.1| C-end truncated polyprotein [Arabidopsis thaliana]
          Length = 1048

 Score =  189 bits (480), Expect(2) = 3e-46
 Identities = 147/583 (25%), Positives = 245/583 (42%), Gaps = 6/583 (1%)
 Frame = -1

Query: 1742 DQYQFVRFVDGSIEPQPQFLNHNNVPVVYPIYLEWRTLDQFVGSCINATISPSLATELLG 1563
            D Y+   F+DGS    P  +  +  P V P Y  W+  D+ + + +   IS S+   +  
Sbjct: 43   DGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIYNAVLGAISMSVQPAVSR 102

Query: 1562 KSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEP 1383
             + A   W  L KI+    +   +QLR QL    +G  +I DY+    T  D LA +G+P
Sbjct: 103  ATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGFVTRFDQLALLGKP 162

Query: 1382 VQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLI 1203
            +  D+ V   L  L  EY            P +  ++  RL+N E  +    +A   P+ 
Sbjct: 163  MDHDEQVERVLENLPEEYKPVKAC-----TPPTLTEIHERLLNQESKILAVSSATVIPIT 217

Query: 1202 ADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKG 1023
            A+  +         N                                       +Q    
Sbjct: 218  ANAVSHRNTTTTNNNN-------------------NGNRNNRYDNRNNNNNSKPWQQSST 258

Query: 1022 EFNPNLTVDYGEIPS-QICNKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHS 852
             F+PN       +   QIC  +GH A   +    ++ S+N+  P             F  
Sbjct: 259  NFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPSSP----------FTP 308

Query: 851  WSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDE 672
            W P  ++A               +S N+    W+ DSGAT H+T++   ++    + G +
Sbjct: 309  WQPRANLALGSP-----------YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGD 353

Query: 671  QAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEF 492
              MV DG                      +LHN+LYVP+I  NL S+      N  S EF
Sbjct: 354  DVMVADGSTIPISHTGSTSLSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEF 411

Query: 491  FPWGYEIKSLPSRKVLARGPNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXX 318
            FP  +++K L +   L +G    ELY  PI SS   +  +L AS  S +           
Sbjct: 412  FPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---QPVSLFASPSSKAT---------- 458

Query: 317  XXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQP 141
               W+ RLGHP  +++N++ +  ++ + N   + + C+ C + KS+ +PF  S   + +P
Sbjct: 459  HSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRP 518

Query: 140  LSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRS 12
            L  ++ D+W  +P +SH  ++Y+++FVD F++Y W++P+K +S
Sbjct: 519  LEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKS 560



 Score = 25.8 bits (55), Expect(2) = 3e-46
 Identities = 13/35 (37%), Positives = 20/35 (57%)
 Frame = -2

Query: 1819 NVSNFVSTKLDGSNFLVWKDQLSSILISTNLYGLL 1715
            N+SN   TKL  +N+L+W  Q+ ++     L G L
Sbjct: 19   NMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFL 51


>emb|CAN77295.1| hypothetical protein VITISV_005638 [Vitis vinifera]
          Length = 1198

 Score =  193 bits (490), Expect = 3e-46
 Identities = 153/536 (28%), Positives = 235/536 (43%), Gaps = 12/536 (2%)
 Frame = -1

Query: 1574 ELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAE 1395
            +++G + +   W  L K F+    AR  QLR +L S ++G  S+ DY+ ++K  +DSLA 
Sbjct: 3    QIIGHNSSHSAWNALEKTFSSSSRARIMQLRLELQSTKKGSLSMIDYIMKVKGAADSLAA 62

Query: 1394 IGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQEN--- 1224
            IGEPV + D VM  L GLGS+Y   V  +  ++  +S   + S L+  E  L+ Q +   
Sbjct: 63   IGEPVSEQDQVMNLLGGLGSDYNAVVTAINIKDDKISIEVVHSMLLAFEHRLEQQSSIEQ 122

Query: 1223 -AIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1047
             +  S   A  SNS    R+                                        
Sbjct: 123  FSSISANYASSSNSRGSGRRYNG-----------------GRGQNHTPNISNYTYRGRGR 165

Query: 1046 PGFQFKKGEFNPNLTVDYGEIPS-QICNKKGHFANTFYYRYVPSMNNSPPMQKAFASFEN 870
             G   + G  N N +    E P  Q+C K GH     Y+++  S  +S   Q +  S  N
Sbjct: 166  GGRYGQNGRHNSNSS----EKPQCQLCGKFGHTVQICYHKFDISYQSS---QSSNTSPSN 218

Query: 869  ALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAV 690
            A   +S  PA   +                S N     W  DSGA  H+T +   +T + 
Sbjct: 219  ASNPNS-IPAMVAS----------------SNNLAEDTWYLDSGANHHLTQSVGNLTSSS 261

Query: 689  EFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLEN 510
             + G ++  +G+GK                   SF L  V +V  I  NL S+A F L+N
Sbjct: 262  PYTGIDKVTIGNGKHLSISNTGSHRLLSD--SRSFHLKKVFHVHFISANLISVAKFYLDN 319

Query: 509  SCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSSALRRNTALSASIMSNSNTXXXXX 330
            +  +EF    + +K L ++KVLA+G   + LY       ++   + A   S   +     
Sbjct: 320  NALFEFRSNSFFVKDLHTKKVLAQGKLENGLYRFPVLNSKKVAFVGAINSSTFYSHNSSI 379

Query: 329  XXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNK-------VFESVCTSCQLGKSHSLPF 171
                  LW+ RLGH  + ++  +  +  +            V  +VC+SCQL KSH LP 
Sbjct: 380  FDNKVKLWHHRLGHASTNIVTQIMQSCNVSFEKNKNTVCSTVCSTVCSSCQLAKSHRLPT 439

Query: 170  LVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSVSL 3
             +S S A +PL LVH D+WGPA   S  G +YFI+F+DD+S+Y W +P++ +  +L
Sbjct: 440  HLSLSCASKPLELVHTDLWGPASVKSTSGARYFILFLDDYSRYTWFYPLQTKDQAL 495


>emb|CAB40035.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|7267767|emb|CAB81170.1| retrotransposon like protein
            [Arabidopsis thaliana]
          Length = 1515

 Score =  184 bits (468), Expect = 1e-43
 Identities = 144/551 (26%), Positives = 244/551 (44%), Gaps = 5/551 (0%)
 Frame = -1

Query: 1649 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1470
            +L+W  +DQ V + I  ++S      ++G + A++ WL L++ F +    RK  L+ +L 
Sbjct: 69   FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 128

Query: 1469 SIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREVP 1290
            +  +   ++  YL ++K I D L  IG PV + + +   L+GLG EY      +++    
Sbjct: 129  TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 188

Query: 1289 LS---FAKLRSRLINHEQWLKDQE-NAIYSPLIADPSNSAFFVRKQQNTFXXXXXXXXXX 1122
                 F  +  +L   +  L     N+  +P +A  ++ ++  R   N+           
Sbjct: 189  YPGPCFDDVVYKLTTFDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNS----------- 237

Query: 1121 XXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPSQICNKKGHFANT 942
                                      GF  + G  + N + +  +   QIC K GH A  
Sbjct: 238  -------RGGRYGNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFK 290

Query: 941  FYYRYVPSMNNSPP-MQKAFASFENALRFHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQ 765
             Y R+    N  P  +  AFA    A+R    + A S                       
Sbjct: 291  CYTRF--EENYLPEDLPNAFA----AMRVSDQNQASSHE--------------------- 323

Query: 764  GPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSF 585
               W+PDS AT+H+TN T  + ++  + GD+  +VG+G                 G  + 
Sbjct: 324  ---WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIGTIPLNISQG--TL 378

Query: 584  DLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIK 405
             L +VL  P I  +L S++  T +  CS+ F      IK   ++++L +G     LY +K
Sbjct: 379  PLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLK 438

Query: 404  SSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKV 225
                +  T  S    S+ +             W+QRLGHP   V+ +L  T AI + NK 
Sbjct: 439  DVPFQ--TYYSTRQQSSDDEV-----------WHQRLGHPNKEVLQHLIKTKAIVV-NKT 484

Query: 224  FESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSK 45
              ++C +CQ+GK   LPF+ S   + +PL  +HCD+WGPAP  S  GF+Y+++F+D++S+
Sbjct: 485  SSNMCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDNYSR 544

Query: 44   YNWIFPMKCRS 12
            + W +P+K +S
Sbjct: 545  FTWFYPLKLKS 555


>emb|CAN81099.1| hypothetical protein VITISV_017741 [Vitis vinifera]
          Length = 1455

 Score =  181 bits (458), Expect = 2e-42
 Identities = 145/541 (26%), Positives = 241/541 (44%), Gaps = 29/541 (5%)
 Frame = -1

Query: 1538 LHLSKIFTQQFFARKS-----QLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQD 1374
            L LS+ F +Q+FA ++     Q + QL   ++G  +I +YL ++K   DSLA +G  +  
Sbjct: 102  LFLSQYFLEQYFASQTRAKAKQFKTQLQHTKKGGSTIDEYLAKIKVCVDSLASVGVSLST 161

Query: 1373 DDLVMYTLSGLGSEYAHFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLIADP 1194
             D V   L GL ++Y  FV ++  R    S  ++ + L+ HE  ++   N++ S   A  
Sbjct: 162  KDHVESILDGLPNDYESFVTSVILRNDDFSVEEIEALLMAHESRVEKNNNSLDSSPSAHV 221

Query: 1193 SNSA-----------FFVRKQQNTFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1047
            ++S            ++    Q +                                    
Sbjct: 222  ASSNAVEKGNRFKQDYYAANSQGSHSGYNGGFGRGGDFGRRGGFYGGRGFNWNYNGRSNR 281

Query: 1046 PGFQFK--KGEFN---PNLTVDYGEIPS-QICNKKGHFANTFYYRYVPSMNNSPPMQKAF 885
             GF+ +  KG F    P  + +  E P+ Q+C K GH     YYR+    +++  + +  
Sbjct: 282  GGFRGRGNKGSFQARPPWNSDNQNEKPACQLCGKIGHVVAQCYYRF----DHTFQVPQNL 337

Query: 884  ASFENALR-FHSWSPAESMAYXXXXXXXXXXXXSHWSCNDQGPIWIPDSGATSHMTNNTA 708
            +S  ++ R ++S+SP  +                          W PDSGA++H+T N  
Sbjct: 338  SSRNSSPRAYYSFSPQVNGVIPTSEVFSDDN-------------WYPDSGASNHVTPNPE 384

Query: 707  IMTDAVEFIGDEQAMVGDGKXXXXXXXXXXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIA 528
             +  + EF G  Q  VG+G                       L+++L+VP I  NL S++
Sbjct: 385  NLMKSAEFAGQNQVHVGNGTGLSIKHIGQSEFLSPFSSKPLLLNHLLHVPSITKNLLSVS 444

Query: 527  NFTLENSCSYEFFPWGYEIKSLPSRKVLARGPNASELYPIKSS--ALRRNTALSAS---- 366
             F  +N   +EF      +K   ++ VL  G     LY   SS  ALR   +LS S    
Sbjct: 445  KFAKDNKVFFEFHSDSCFVKDQVTQAVLMVGKVRDGLYAFDSSHLALRPTQSLSKSPSVV 504

Query: 365  IMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKS 186
              S S+            LW++RLGHP +  I N+ +   +   NK+  + C+SC LGK 
Sbjct: 505  ASSFSSKVCTTSLSSTFDLWHKRLGHPSAATIKNVLSKCNVAHINKMDSNFCSSCCLGKI 564

Query: 185  HSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLGFKYFIVFVDDFSKYNWIFPMKCRSVS 6
            H  PF +S +   +PL L+H D+WGP   +S+ G++Y+I FVD FS+++WIF ++ +S +
Sbjct: 565  HRFPFSLSHTTYTKPLELIHLDLWGPTLVLSNSGYRYYIHFVDAFSRFSWIFLLRNKSEA 624

Query: 5    L 3
            +
Sbjct: 625  I 625


>emb|CAN63649.1| hypothetical protein VITISV_037657 [Vitis vinifera]
          Length = 1131

 Score =  172 bits (437), Expect = 5e-40
 Identities = 150/570 (26%), Positives = 226/570 (39%), Gaps = 30/570 (5%)
 Frame = -1

Query: 1649 YLEWRTLDQFVGSCINATISPSLATELLGKSIARDKWLHLSKIFTQQFFARKSQLRGQLH 1470
            YL WRT        +N  I+  L   + GK  A  ++L  S+    ++            
Sbjct: 38   YLLWRT------QMLNIIIANGLEEMIHGKIXAPSRFLGDSENINPEY------------ 79

Query: 1469 SI-QRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYAHFVITMQNREV 1293
            SI QR +  +  +++   T  D+L  IGE + + D ++Y L+GL +EY  FVIT+ +R  
Sbjct: 80   SIWQRQNRLVMCWIYSSLT-EDNLLAIGENITEQDRILYLLAGLRAEYNSFVITVTSRHE 138

Query: 1292 PLSFAKLRSRLINHEQWLKDQENAIYSPLIA---DPSNSAFFVRKQQNTFXXXXXXXXXX 1122
            PLS  ++ S L+ HE  L+ Q     + L+       N     +K Q +           
Sbjct: 139  PLSLEEIHSMLLTHENRLEQQHTTEETNLLQANITTMNIQGHNKKNQKSGQFKTQGRGNQ 198

Query: 1121 XXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDY----------GEIPSQI 972
                                      G    +G FN +               G+   Q+
Sbjct: 199  NQQQFNHQNFGRGRGRGHYNNNGGNFGHGPGRGSFNNHSYSSRNFNNVSGGSNGKPQCQV 258

Query: 971  CNKKGHFANTFYYR----YVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXX 804
            C K GH A   Y+R    Y P+MNN      A  S                         
Sbjct: 259  CGKYGHIAINCYHRFDQTYQPTMNNHLAAMVATPS------------------------- 293

Query: 803  XXXXXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXX 624
                    +  D+   W  D+GAT H+T N   +     F G ++ MVG+G         
Sbjct: 294  --------TVGDES--WYMDTGATHHLTPNLNKLNSHTPFAGSDKVMVGNGNRLNISNIG 343

Query: 623  XXXXXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVL 444
                       S +L N+L+VP +  NL S+     +N+ + EFF  G+ +K   S+K L
Sbjct: 344  HSTISSV--SRSLNLKNILHVPQLTTNLISVNRLCTDNNVTVEFFTNGFVVKDQASKKAL 401

Query: 443  ARGPNASELYPIKSSALRRN------------TALSASIMSNSNTXXXXXXXXXXXLWNQ 300
             +G     LY + SS   R             T+L+  +   S+T            W+ 
Sbjct: 402  LQGNLNYGLYKLSSSTPSRRYQDPDDNKLAGRTSLTTEVPCMSSTLQLSNKADL---WHF 458

Query: 299  RLGHPISTVINNLHTTGAIQLSNKVFESVCTSCQLGKSHSLPFLVSPSRACQPLSLVHCD 120
            RLGHP                       VC  CQ+ KSH LPF +S SRA QP +LVH D
Sbjct: 459  RLGHPAR---------------------VCEPCQMAKSHRLPFTLSESRASQPFALVHSD 497

Query: 119  IWGPAPSISHLGFKYFIVFVDDFSKYNWIF 30
            +WGPAP +   G +YF++FVDD ++++W++
Sbjct: 498  LWGPAPVVGTNGARYFVLFVDDHTRFSWLY 527


>dbj|BAA78423.1| polyprotein [Arabidopsis thaliana]
          Length = 1431

 Score =  167 bits (423), Expect = 2e-38
 Identities = 128/504 (25%), Positives = 217/504 (43%), Gaps = 6/504 (1%)
 Frame = -1

Query: 1505 FARKSQLRGQLHSIQRGHYSIFDYLHQLKTISDSLAEIGEPVQDDDLVMYTLSGLGSEYA 1326
            +   +QLR QL    +G  +I DY+  L T  D LA +G+P+  D+ V   L  L  EY 
Sbjct: 87   YGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYK 146

Query: 1325 HFVITMQNREVPLSFAKLRSRLINHEQWLKDQENAIYSPLIADPSNSAFFVRKQQNTFXX 1146
              +  +  ++ P +  ++  RL+NHE  +    +A   P+ A+  +         N    
Sbjct: 147  PVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTTNNN--- 203

Query: 1145 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFQFKKGEFNPNLTVDYGEIPS-QIC 969
                                               +Q     F+PN       +   QIC
Sbjct: 204  ----------------NGNRNNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQIC 247

Query: 968  NKKGHFAN--TFYYRYVPSMNNSPPMQKAFASFENALRFHSWSPAESMAYXXXXXXXXXX 795
              +GH A   +    ++ S+N+  P             F  W P  ++A           
Sbjct: 248  GVQGHSAKRCSQLQHFLSSVNSQQPPSP----------FTPWQPRANLALGSP------- 290

Query: 794  XXSHWSCNDQGPIWIPDSGATSHMTNNTAIMTDAVEFIGDEQAMVGDGKXXXXXXXXXXX 615
                +S N+    W+ DSGAT H+T++   ++    + G +  MV DG            
Sbjct: 291  ----YSSNN----WLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTS 342

Query: 614  XXXXTGDTSFDLHNVLYVPHIKHNLFSIANFTLENSCSYEFFPWGYEIKSLPSRKVLARG 435
                      +LHN+LYVP+I  NL S+      N  S EFFP  +++K L +   L +G
Sbjct: 343  LSTK--SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQG 400

Query: 434  PNASELY--PIKSSALRRNTALSASIMSNSNTXXXXXXXXXXXLWNQRLGHPISTVINNL 261
                ELY  PI SS   +  +L AS  S +              W+ RLGHP  +++N++
Sbjct: 401  KTKDELYEWPIASS---QPVSLFASPSSKAT----------HSSWHARLGHPAPSILNSV 447

Query: 260  HTTGAIQLSNKVFESV-CTSCQLGKSHSLPFLVSPSRACQPLSLVHCDIWGPAPSISHLG 84
             +  ++ + N   + + C+ C + KS+ +PF  S   + +PL  ++ D+W  +P +SH  
Sbjct: 448  ISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDN 506

Query: 83   FKYFIVFVDDFSKYNWIFPMKCRS 12
            ++Y+++FVD F++Y W++P+K +S
Sbjct: 507  YRYYVIFVDHFTRYTWLYPLKQKS 530


Top