BLASTX nr result

ID: Mentha29_contig00034777 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00034777
         (1135 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]   500   e-139
ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prun...   481   e-133
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                 479   e-133
ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prun...   477   e-132
ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [The...   469   e-130
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...   468   e-129
ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prun...   467   e-129
ref|XP_007049888.1| DNA/RNA polymerases superfamily protein, par...   461   e-127
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                 455   e-125
gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]             449   e-124
emb|CAN64427.1| hypothetical protein VITISV_029384 [Vitis vinifera]   449   e-124
gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subc...   448   e-123
gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              445   e-122
gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sa...   445   e-122
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...   444   e-122
dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]       444   e-122
ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prun...   443   e-122
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...   443   e-122
gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]                  443   e-122
emb|CAD39843.2| OSJNBb0072N21.12 [Oryza sativa Japonica Group]        442   e-121

>emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]
          Length = 1521

 Score =  500 bits (1287), Expect = e-139
 Identities = 236/380 (62%), Positives = 299/380 (78%), Gaps = 2/380 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            AEL+RQV+ELL +G IRESLSPC VP LL PKKD +WRMC+DSRAINKIT++YRFPIPRL
Sbjct: 637  AELKRQVDELLTKGFIRESLSPCGVPALLTPKKDGSWRMCVDSRAINKITIKYRFPIPRL 696

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            +D+LD + G+ +FSK+DLRSG  QIR+RPGDEWK +FKT++GL+EWLVMPFGL+NAP  S
Sbjct: 697  DDMLDMMVGSVIFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYEWLVMPFGLTNAP--S 754

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMR+M Q  +  IG  VVVYFDDIL+YSRS  DH  HL+ V+  LR EKFY    KC F
Sbjct: 755  TFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEEHLKQVMRTLRAEKFYINLKKCTF 814

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
              P+++FLG++VS  G+  D  K+  I +WP P+ I E RSFHG+A+FY  FI +FS+IM
Sbjct: 815  MSPSVVFLGFVVSSKGVETDPEKIKAIVDWPVPTNIHEVRSFHGMATFYRRFIRNFSSIM 874

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            A IT+C+    F+WT AA  AF+ IK ++   P+L LPDFE++FE++CDAS  GIGAVLS
Sbjct: 875  APITECMKPGLFIWTKAANKAFEEIKSKMVNPPILRLPDFEKVFEVACDASHVGIGAVLS 934

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q  HPVAFFSEKL+G   +Y+TYD+EFYAVV+A+RHW+HYL + EF+LYSDHEALR+L +
Sbjct: 935  QEGHPVAFFSEKLNGAKKKYSTYDLEFYAVVQAIRHWQHYLSYKEFVLYSDHEALRYLNS 994

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L +RHA W+++LQ FTF
Sbjct: 995  QKKLNSRHAKWSSFLQLFTF 1014


>ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
            gi|462405925|gb|EMJ11389.1| hypothetical protein
            PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score =  481 bits (1239), Expect = e-133
 Identities = 230/378 (60%), Positives = 290/378 (76%), Gaps = 2/378 (0%)
 Frame = +1

Query: 7    LRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLED 186
            LR Q+EELL++G IRESLSPCAVP LL+PKKD TWRMC+DSRAINKITV+YRFPIPRLED
Sbjct: 642  LREQIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAINKITVKYRFPIPRLED 701

Query: 187  LLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPSTF 360
            +LD LSG+ VFSK+DLRSG  QIR+RPGDEWK AFK+++GLFEWLVMPFGLSN P  STF
Sbjct: 702  MLDVLSGSKVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNTP--STF 759

Query: 361  MRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFCE 540
            MR+MNQ  R  IG+ VVVYFDDIL+YS +  +H  HLR VL +LR  K +    KC FC 
Sbjct: 760  MRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHLVHLRQVLDVLRENKLFVNLKKCTFCT 819

Query: 541  PAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMAL 720
              +LFLG++V   GI+VD  K+  I +WP+P T++E RSFHGLA+FY  F+ HFS+I+A 
Sbjct: 820  NKLLFLGFVVGEHGIQVDDEKIKAILDWPAPKTVSEVRSFHGLATFYRRFVRHFSSIVAP 879

Query: 721  ITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQS 900
            IT+C+    F W      +F  IK++L  APVL LP+FE++FE+ CDAS  G+GAVLSQ 
Sbjct: 880  ITECLKKGRFSWGEEQERSFADIKEKLCTAPVLALPNFEKVFEVECDASGVGVGAVLSQD 939

Query: 901  AHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLATQD 1080
              PVAFFSEKLS    +++TYD EFYAVVRA++ W HYL   EF+L++DH+AL+++ +Q 
Sbjct: 940  KRPVAFFSEKLSDARQKWSTYDQEFYAVVRALKQWEHYLIQKEFVLFTDHQALKYINSQK 999

Query: 1081 NLPARHASWTTYLQQFTF 1134
            N+   HA W T+LQ+F+F
Sbjct: 1000 NIDKMHARWVTFLQKFSF 1017


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score =  479 bits (1233), Expect = e-133
 Identities = 229/379 (60%), Positives = 285/379 (75%), Gaps = 2/379 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            EL+ Q+EEL+ +G +RESLSPCAVP LL+PKKD TWRMC DSRAIN ITV+YRFPIPRL+
Sbjct: 631  ELQHQIEELMAKGFVRESLSPCAVPALLVPKKDGTWRMCTDSRAINNITVKYRFPIPRLD 690

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSGAS+FSK+DLR G  Q+R+R GDEWK AFKT+ GL+EWLVMPFGLSNAP  ST
Sbjct: 691  DMLDELSGASIFSKIDLRQGYHQVRIREGDEWKTAFKTKHGLYEWLVMPFGLSNAP--ST 748

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+M +  R  +G   VVYFDDILVYS++  +H +HL  V  ILR +K Y K  KC F 
Sbjct: 749  FMRLMTEVLRPCLGKFAVVYFDDILVYSKTKGEHLKHLEVVFKILREQKLYGKLEKCTFM 808

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGYL+SG GI VD   +A ++ WP+P+T+TE RSFHGLASFY  FI +FST++A
Sbjct: 809  VEEVAFLGYLISGRGISVDQENIAAMQSWPTPTTVTEVRSFHGLASFYRRFIKNFSTVVA 868

Query: 718  LITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQ 897
             IT+C+    F WT  A  +F+ IKQ +   P+L LPDF+QLFE+ CDAS  GIGAVL Q
Sbjct: 869  PITECMRKGEFQWTEQAQQSFEKIKQLMCNTPILKLPDFDQLFEVECDASGVGIGAVLIQ 928

Query: 898  SAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLATQ 1077
            S  PVA+FSEKL+G   +Y+TYD EFYA++RA+ HW HYL    F+L+SDHEAL+++  Q
Sbjct: 929  SQKPVAYFSEKLNGAKLKYSTYDKEFYAIIRALMHWNHYLKPKPFVLHSDHEALKYINGQ 988

Query: 1078 DNLPARHASWTTYLQQFTF 1134
              L  RHA W  +LQ FTF
Sbjct: 989  HKLNFRHAKWVEFLQSFTF 1007


>ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
            gi|462402874|gb|EMJ08431.1| hypothetical protein
            PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score =  477 bits (1228), Expect = e-132
 Identities = 228/378 (60%), Positives = 288/378 (76%), Gaps = 2/378 (0%)
 Frame = +1

Query: 7    LRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLED 186
            LR Q+EELL++G IRESLSPCAVP LL+PKKD TWRMC+DSRA+NKI V+YRF IPRLED
Sbjct: 650  LREQIEELLRKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAVNKIKVKYRFSIPRLED 709

Query: 187  LLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPSTF 360
            +LD LSG+ VFSK+DLRSG  QIR+RPGDEWK AFK+++GLFEWLVMPFGLSNAP  STF
Sbjct: 710  ILDVLSGSKVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAP--STF 767

Query: 361  MRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFCE 540
            MR+MNQ  R  IG+ VVVYFDDIL+YS +  +H  HLR VL +LR  K Y    KC FC 
Sbjct: 768  MRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHLVHLRQVLDVLRENKLYVNLKKCTFCT 827

Query: 541  PAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMAL 720
              +LFLG++V  +GI+VD  K+  I +WP+P T++E RSFHGLA+FY  F+ HFS+I A 
Sbjct: 828  NKLLFLGFVVGENGIQVDDEKIKAILDWPAPKTVSEVRSFHGLATFYMRFVRHFSSIAAP 887

Query: 721  ITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQS 900
            IT+C+    F W      +F  IK++L  APVL LP+FE++FE+ CDAS  G+GAVL Q 
Sbjct: 888  ITECLKKGRFSWGEEQERSFADIKEKLCTAPVLALPNFEKVFEVECDASGVGVGAVLLQD 947

Query: 901  AHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLATQD 1080
              PVAFFSEKLS    +++TYD EFYAVVRA++ W HYL   EF+L++DH+AL+++ +Q 
Sbjct: 948  KRPVAFFSEKLSDARQKWSTYDQEFYAVVRALKQWEHYLIQKEFVLFTDHQALKYINSQK 1007

Query: 1081 NLPARHASWTTYLQQFTF 1134
            N+   HA W T+LQ+F+F
Sbjct: 1008 NIDKMHARWVTFLQKFSF 1025


>ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508709261|gb|EOY01158.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 786

 Score =  469 bits (1207), Expect = e-130
 Identities = 223/380 (58%), Positives = 293/380 (77%), Gaps = 2/380 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            AE++RQVEELL++G++RES SPCA P LL PKKD +WRMC+DSRAINKIT++YRFPIPRL
Sbjct: 7    AEVQRQVEELLEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRL 66

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            +++LDQL G+ VFSK+DL+SG  QIR+R GDEWK AFKT +GLFEWLVMPFGLSNAP  S
Sbjct: 67   DEMLDQLVGSRVFSKIDLKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAP--S 124

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMRVM +  +  + + VVVYFDDIL+YS +   H +HLR VL +L+ E+ Y    KC F
Sbjct: 125  TFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKHLRQVLEVLQKEQLYINLKKCSF 184

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
             +P ++FLG++VS +G++ D  K+  I EWP+P++I E RSFHGLASFY  FI +FS+IM
Sbjct: 185  MQPEVVFLGFIVSAEGLKPDPEKIRAISEWPAPTSIKEVRSFHGLASFYRRFIRNFSSIM 244

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            + IT+ +    F W+ +A  AF+ +K  ++ APVL LPDFE+LF + CDAS  GIGAVLS
Sbjct: 245  SPITESLKKDGFEWSHSAQKAFERVKALMTEAPVLALPDFEKLFVVECDASYVGIGAVLS 304

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   P+ FFSEKL+    RY+TYD+EFYA+VRA+RHW+HYL + EF +YSDH+ALR+L +
Sbjct: 305  QDGRPIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYREFAVYSDHQALRYLHS 364

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  +HA W+++L +F F
Sbjct: 365  QKKLSNQHAKWSSFLNEFNF 384


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  468 bits (1203), Expect = e-129
 Identities = 222/380 (58%), Positives = 292/380 (76%), Gaps = 2/380 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            AE++RQVEEL ++G++RES SPCA P LL PKKD +WRMC+DSRAINKIT++YRFPIPRL
Sbjct: 555  AEVQRQVEELFEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRL 614

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            +++LDQL G+ VFSK+DL+SG  QIR+R GDEWK AFKT +GLFEWLVMPFGLSNAP  S
Sbjct: 615  DEMLDQLVGSRVFSKIDLKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAP--S 672

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMRVM +  +  + + VVVYFDDIL+YS +   H +HLR VL +L+ E+ Y    KC F
Sbjct: 673  TFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKHLRQVLEVLQKEQLYINLKKCSF 732

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
             +P ++FLG++VS +G++ D  K+  I EWP+P++I E RSFHGLASFY  FI +FS+IM
Sbjct: 733  MQPEVVFLGFIVSAEGLKPDPEKIRAISEWPAPTSIKEVRSFHGLASFYRRFIRNFSSIM 792

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            + IT+ +    F W+ +A  AF+ +K  ++ APVL LPDFE+LF + CDAS  GIGAVLS
Sbjct: 793  SPITESLKKDGFEWSHSAQKAFERVKALMTEAPVLALPDFEKLFVVECDASYVGIGAVLS 852

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   P+ FFSEKL+    RY+TYD+EFYA+VRA+RHW+HYL + EF +YSDH+ALR+L +
Sbjct: 853  QDGRPIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYREFAVYSDHQALRYLHS 912

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  +HA W+++L +F F
Sbjct: 913  QKKLSNQHAKWSSFLNEFNF 932


>ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica]
            gi|462417202|gb|EMJ21939.1| hypothetical protein
            PRUPE_ppa023598mg [Prunus persica]
          Length = 1457

 Score =  467 bits (1201), Expect = e-129
 Identities = 229/378 (60%), Positives = 282/378 (74%), Gaps = 2/378 (0%)
 Frame = +1

Query: 7    LRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLED 186
            LR Q+EELLQ+G IRESLSPCAVP LL+PKKD TWRMC+DSRAINKITV+ RFPIPRLED
Sbjct: 632  LREQIEELLQKGFIRESLSPCAVPVLLVPKKDKTWRMCVDSRAINKITVKSRFPIPRLED 691

Query: 187  LLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPSTF 360
            +LD LSG+ VFSK+DLRSG  QIR+RPGDEWK AFK+++GLFEWLVMPFGLSNAP  STF
Sbjct: 692  MLDVLSGSRVFSKIDLRSGYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAP--STF 749

Query: 361  MRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFCE 540
            MR+MNQ  R  IG+ VVVYFDDIL+YS +  +H  HLR VL +LR  K Y    KC FC 
Sbjct: 750  MRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHLVHLRQVLDVLRENKLYMNLKKCTFCT 809

Query: 541  PAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMAL 720
              +LFLG++V  +GI+VD  K+  I +WP+P  ++E RSFHGLA+FY  F+ HFS+I A 
Sbjct: 810  NKLLFLGFVVGENGIQVDDEKIKAILDWPTPKIVSEVRSFHGLATFYRRFVRHFSSITAP 869

Query: 721  ITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQS 900
            IT+C+    F W      +F  IK++L  APVL LP+FE++FE+ CDAS  G+GAVLSQ 
Sbjct: 870  ITECLKKGRFSWGDEQERSFADIKEKLCTAPVLALPNFEKVFEVECDASGVGVGAVLSQD 929

Query: 901  AHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLATQD 1080
              PVAFFSEKLS  C +++TYD EFYAVVRA++ W HYL   EF+L++DH+ALR      
Sbjct: 930  KRPVAFFSEKLSDACQKWSTYDQEFYAVVRALKQWEHYLIQKEFVLFTDHQALR------ 983

Query: 1081 NLPARHASWTTYLQQFTF 1134
                    W T+LQ+F+F
Sbjct: 984  --------WVTFLQKFSF 993


>ref|XP_007049888.1| DNA/RNA polymerases superfamily protein, partial [Theobroma cacao]
            gi|508702149|gb|EOX94045.1| DNA/RNA polymerases
            superfamily protein, partial [Theobroma cacao]
          Length = 624

 Score =  461 bits (1185), Expect = e-127
 Identities = 219/380 (57%), Positives = 289/380 (76%), Gaps = 2/380 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            AE++RQVEELL++G++RES SPCA P LL PKKD +WRMC+DSRAINKIT++YRFPIPRL
Sbjct: 7    AEVQRQVEELLEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRL 66

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            +++LDQL G+ VFSK+DL+SG  QIR+R GDEWK AFKT +GLFEWLVMPFGLSNAP  S
Sbjct: 67   DEMLDQLVGSRVFSKIDLKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAP--S 124

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMRVM +  +  + + VVVYFDDIL+YS +   H +HLR VL +L+ E+ Y    KC F
Sbjct: 125  TFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKHLRQVLEVLQKEQLYINLKKCSF 184

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
             +P ++FLG++VS +G++ D  K+  I EWP+P++I E RSFHGLASFY  FI +FS+IM
Sbjct: 185  MQPEVVFLGFIVSAEGLKPDPEKIRAISEWPAPTSIKEVRSFHGLASFYRRFIRNFSSIM 244

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            + IT+ +    F W+ +A  AF+ +K  ++ APVL LPDFE+LF + CDAS  G    LS
Sbjct: 245  SPITESLKKDGFEWSHSAQKAFERVKALMTEAPVLALPDFEKLFVVECDASYVGXXXXLS 304

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   P+ FFSEKL+    RY+TYD+EFYA+VRA+RHW+HYL + EF +YSDH+ALR+L +
Sbjct: 305  QDGRPIEFFSEKLTDSRRRYSTYDLEFYALVRAIRHWQHYLAYREFAVYSDHQALRYLHS 364

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  +HA W+++L +F F
Sbjct: 365  QKKLSNQHAKWSSFLNEFNF 384


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score =  455 bits (1170), Expect = e-125
 Identities = 218/379 (57%), Positives = 278/379 (73%), Gaps = 2/379 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            EL++Q+ EL+ +G +RESLSPC+VP LL+PKKD +WRMC DSRAIN IT++YRFPIPRL+
Sbjct: 619  ELQQQIGELVSKGFVRESLSPCSVPALLVPKKDGSWRMCTDSRAINNITIKYRFPIPRLD 678

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSGA +FSK+DLR G  Q+R++ GDEWK AFKT+ GL+EWLVMPFGLSNAP  ST
Sbjct: 679  DILDELSGAQLFSKIDLRQGYHQVRIKEGDEWKTAFKTKHGLYEWLVMPFGLSNAP--ST 736

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+M +  R  +G  VVVYFDDILVYS S  +H +HL+ +   LR  K Y K  KC F 
Sbjct: 737  FMRLMTEVLRPYLGRFVVVYFDDILVYSPSKEEHLKHLQVLFETLREHKLYGKLEKCSFM 796

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
            +  + FLG+++S  GI VD  KV  I+ WP P  IT+ RSFHGLASFY  FI  FST+MA
Sbjct: 797  QNEVQFLGFIISDRGILVDQEKVKAIKSWPIPKNITDVRSFHGLASFYRRFIKDFSTLMA 856

Query: 718  LITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQ 897
             IT+C+    F W   A ++F++IK++L  +P+L LP+F +LFE+ CDAS  GIGAVL Q
Sbjct: 857  PITECMKKGEFKWGDKAESSFNIIKEKLCESPILTLPNFNKLFEVECDASGIGIGAVLVQ 916

Query: 898  SAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLATQ 1077
               P+A+FSEKLSG    Y+TYD EFYA+VRA+ HW HYL    F+L+SDHEAL+++  Q
Sbjct: 917  EHKPIAYFSEKLSGAKLNYSTYDKEFYAIVRALNHWSHYLKPRPFVLHSDHEALKYINGQ 976

Query: 1078 DNLPARHASWTTYLQQFTF 1134
              L  RHA W  +LQ F F
Sbjct: 977  HKLNHRHAKWVEFLQSFNF 995


>gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum]
          Length = 1004

 Score =  449 bits (1156), Expect = e-124
 Identities = 214/380 (56%), Positives = 277/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV EL+ +G +RESLSPCAVP +L+PKKD +WRMC D RAI+ IT++YR PIPRL+
Sbjct: 623  EIQRQVAELISKGWVRESLSPCAVPIILVPKKDGSWRMCTDCRAISNITIKYRHPIPRLD 682

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            DLLD+L GA +FSK+DL+SG  QIR+R GDEWK AFKT+ GL+EW+VMPFGL+NAP  ST
Sbjct: 683  DLLDELFGACLFSKIDLKSGYHQIRIREGDEWKTAFKTKFGLYEWMVMPFGLTNAP--ST 740

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN   R  +G  VVVYFDDIL+YS++L DH  HL+AVL +LR E  YA   KCVFC
Sbjct: 741  FMRLMNHVLREFLGKFVVVYFDDILIYSKNLDDHCIHLKAVLQVLRYENLYANLEKCVFC 800

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               ++FLG++VS  G+ VD  KV  IREWP P  ++E RSFHGLASFY  F+  FST+ A
Sbjct: 801  TDHVIFLGFIVSSKGVHVDEEKVKAIREWPPPKNVSEVRSFHGLASFYRRFVKDFSTLAA 860

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + + +     F W      AF  +K++L+ APVL LP+F + FE+ CDAS  GIGAVL 
Sbjct: 861  PLNEIVEKDVGFKWGEKQEQAFAALKEKLTQAPVLALPNFSKSFEIECDASNVGIGAVLL 920

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q  HP+A+FSEKL G    Y+TYD E Y++VRA++ W+HYL   EF+++SDHE+L+HL  
Sbjct: 921  QEGHPLAYFSEKLKGAALNYSTYDKELYSLVRALQTWQHYLLPKEFVIHSDHESLKHLKG 980

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RH  W  +L+QF +
Sbjct: 981  QGKLNKRHVKWVEFLEQFPY 1000


>emb|CAN64427.1| hypothetical protein VITISV_029384 [Vitis vinifera]
          Length = 1392

 Score =  449 bits (1156), Expect = e-124
 Identities = 220/380 (57%), Positives = 277/380 (72%), Gaps = 2/380 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            AEL+RQV+ELL +G IRESLSPC VP LL PKKD +WRMC+DSRAINKIT++YRFPIPRL
Sbjct: 596  AELKRQVDELLTKGFIRESLSPCGVPALLTPKKDGSWRMCVDSRAINKITIKYRFPIPRL 655

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            +D+LD + G+ +FSK+DLRSG  QIR+RPGDEWK +FKT++GL+EWLVMPFGL+N  APS
Sbjct: 656  DDMLDMMVGSVIFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYEWLVMPFGLTN--APS 713

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMR+M Q  +  IG  VVVYFDDIL+YSRS  DH  HL+                    
Sbjct: 714  TFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEEHLK-------------------- 753

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
                           G+  D  K+  I +WP P+ I E RSFHG+A+FY  FI +FS+IM
Sbjct: 754  --------------QGVETDPEKIKAIVDWPVPTNIHEVRSFHGMATFYRRFIRNFSSIM 799

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            A IT+C+    F+WT AA  AF+ IK ++   P+L LPDFE++FE++CDAS  GIGAVLS
Sbjct: 800  APITECMKPGLFIWTKAANKAFEEIKSKMVNPPILRLPDFEKVFEVACDASHVGIGAVLS 859

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q  HPVAFFSEKL+G   +Y+TYD+EFYAVV+A+RHW+HYL + EF+LYSDHEALR+L +
Sbjct: 860  QEGHPVAFFSEKLNGAKKKYSTYDLEFYAVVQAIRHWQHYLSYKEFVLYSDHEALRYLNS 919

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L +RHA W+++LQ FTF
Sbjct: 920  QKKLNSRHAKWSSFLQLFTF 939


>gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 889

 Score =  448 bits (1152), Expect = e-123
 Identities = 216/380 (56%), Positives = 277/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV+ELL +G +RESLSPC+VP LL+PKKD +WRMC+D RAIN IT+RYR PIPRL+
Sbjct: 74   EIQRQVQELLDKGYVRESLSPCSVPVLLVPKKDGSWRMCVDCRAINNITIRYRHPIPRLD 133

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ VF K+DLRSG  QIR++ GDEWK AFKT+ GL+EWLVMPFGL+NAP  ST
Sbjct: 134  DMLDELSGSLVFCKIDLRSGYHQIRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAP--ST 191

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN+  R+ IG  VVVYFDDIL+YSRS+ DH  HLRAV   LR+ + +    KC FC
Sbjct: 192  FMRLMNEVLRAFIGRFVVVYFDDILIYSRSIEDHHGHLRAVFDALRDARLFGNLEKCTFC 251

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +KV  I  WP P+TIT+ RSF GLA FY HF+  FSTI A
Sbjct: 252  TDRVSFLGYVVTPQGIEVDQAKVEAIHSWPVPTTITQVRSFLGLAGFYRHFVKDFSTIAA 311

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + +       F W  A   AFD +K +L+ AP+L LPDF + FEL CDAS  G+G VL 
Sbjct: 312  PLHELTKRNVTFTWAAAQRNAFDTLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLL 371

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   PVA+FSEKLSGP   Y+TYD E +A+VR +  W+HYL+  EF+++SDHE+L+H+ +
Sbjct: 372  QEGKPVAYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRS 431

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RHA W  +++ F +
Sbjct: 432  QAKLNRRHAKWVEFIESFPY 451


>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score =  445 bits (1144), Expect = e-122
 Identities = 217/380 (57%), Positives = 279/380 (73%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            EL +QV EL++RG IRES+SPCAVP LL+PKKD +WRMC+D RAIN ITV+YR PIPRL+
Sbjct: 925  ELEKQVTELMERGHIRESMSPCAVPVLLVPKKDGSWRMCVDCRAINNITVKYRHPIPRLD 984

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+L G+S+FSK+DL+SG  QIR++ GDEWK AFKT +GL+EWLVMPFGL+NAP  ST
Sbjct: 985  DMLDELHGSSIFSKVDLKSGYHQIRMKEGDEWKTAFKTIQGLYEWLVMPFGLTNAP--ST 1042

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN   R+ IG  V+VYFDDILVYS+SL +H  HL+ VL +LR EK YA   KC F 
Sbjct: 1043 FMRLMNHVLRAFIGRFVIVYFDDILVYSKSLEEHVEHLKMVLEVLRKEKLYANLKKCTFG 1102

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               ++FLG++VS DG++VD  KV  IREWPSP ++ E RSFHGLA FY  F+  FST+ A
Sbjct: 1103 TDNLVFLGFVVSTDGVKVDEEKVKAIREWPSPKSVGEVRSFHGLAGFYRRFVKDFSTLAA 1162

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             +T+ I     F W  A   AF  +K++L+ APVL LPDF + FE+ CDAS  GIG VL 
Sbjct: 1163 PLTEVIKKNVGFKWEQAQEDAFQALKEKLTHAPVLSLPDFLKTFEIECDASGVGIGVVLM 1222

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   P+A+FSEKL G    Y TYD E YA+VRA++  +HYL+  EF++++DHE+L+HL  
Sbjct: 1223 QDKKPIAYFSEKLGGATLNYPTYDKELYALVRALQTGQHYLWPKEFVIHTDHESLKHLKG 1282

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RHA W  +++ F +
Sbjct: 1283 QQKLNKRHARWVEFIETFPY 1302


>gb|AAK91332.1|AC090441_14 Putative gag-pol polyprotein [Oryza sativa Japonica Group]
            gi|15217296|gb|AAK92640.1|AC079634_1 Putative
            retroelement [Oryza sativa Japonica Group]
            gi|31431373|gb|AAP53161.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 1708

 Score =  445 bits (1144), Expect = e-122
 Identities = 213/380 (56%), Positives = 277/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV+ELL +G +RESLSPC++P LL+PKKD +WRMC+D RAIN IT+RYR PIPRL+
Sbjct: 761  EIQRQVQELLDKGYVRESLSPCSIPVLLVPKKDGSWRMCVDCRAINNITIRYRHPIPRLD 820

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ VFSK+DLRSG  QIR++ GDEWK AFKT+ GL+EWLVMPFGL+NAP  ST
Sbjct: 821  DMLDELSGSLVFSKIDLRSGYHQIRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAP--ST 878

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            F+R+MN+  R+ IG  VVVYFDDIL+YSRS+ DH  HLRAV   LR+E+ +    KC FC
Sbjct: 879  FIRLMNEVLRAFIGRFVVVYFDDILIYSRSIEDHHGHLRAVFDALRDERLFGNLEKCTFC 938

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +KV  I  WP P+TIT+ RSF GLA FY  F+  FSTI A
Sbjct: 939  TDRVSFLGYVVTPQGIEVDQAKVEAIHSWPVPTTITQVRSFLGLAGFYRRFVKDFSTIAA 998

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + +       F W  A   AFD +K +L+ AP+L LPDF + FEL CDAS  G+G VL 
Sbjct: 999  PLHELTKRNVTFTWAAAQRNAFDTLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLL 1058

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   P+ +FSEKLSGP   Y+TYD E +A+VR +  W+HYL+  EF+++SDHE+L+H+ +
Sbjct: 1059 QEGKPIEYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRS 1118

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RHA W  +++ F +
Sbjct: 1119 QAKLNRRHAKWVEFIESFPY 1138


>ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca
            subsp. vesca]
          Length = 1034

 Score =  444 bits (1143), Expect = e-122
 Identities = 213/356 (59%), Positives = 273/356 (76%), Gaps = 2/356 (0%)
 Frame = +1

Query: 7    LRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLED 186
            L+ ++EELL++G IRES+SPCAVP LL+PKKD +WRMC+DSRAINKIT++YRFPIP+LED
Sbjct: 679  LKEKIEELLRKGHIRESMSPCAVPVLLVPKKDRSWRMCVDSRAINKITIKYRFPIPQLED 738

Query: 187  LLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPSTF 360
            +LD L G+ VFSK+DLRSG  QIR++ GDEWK AFK+++GL+EWLVMPFGLSNAP  STF
Sbjct: 739  MLDVLGGSVVFSKIDLRSGYHQIRIKLGDEWKTAFKSKDGLYEWLVMPFGLSNAP--STF 796

Query: 361  MRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFCE 540
            MRVMNQ  +  IGTCVVVYFDDIL+YS+S  +H +HLR VL +L+  K Y    KC F  
Sbjct: 797  MRVMNQVLKPYIGTCVVVYFDDILIYSKSKEEHLQHLRKVLEVLQENKLYVNLKKCSFMT 856

Query: 541  PAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMAL 720
              +LFLGY+VS +GI VD  KV  I+EWP+P T+ + RSFHGLA+FY HF+ +FS I A 
Sbjct: 857  KKLLFLGYVVSSEGINVDQDKVKAIQEWPTPKTVGDVRSFHGLATFYRHFVPNFSAITAP 916

Query: 721  ITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLSQS 900
            IT+C+    F W      +F +IK +LS APVL L DF+++FE+  D    GIGAVLSQ 
Sbjct: 917  ITECMKKGRFQWGEEHEKSFAMIKYKLSTAPVLALSDFDKIFEIETDVCGVGIGAVLSQE 976

Query: 901  AHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHL 1068
              PVA+FSEKL+    +++TYD EFYAVVRA++ W HYL   EF+LY+DH+AL+++
Sbjct: 977  RKPVAYFSEKLNEARQKWSTYDQEFYAVVRALKQWEHYLVQREFVLYTDHQALKYI 1032


>dbj|BAA89466.1| gag-pol polyprotein [Oryza sativa Indica Group]
          Length = 1587

 Score =  444 bits (1142), Expect = e-122
 Identities = 215/380 (56%), Positives = 276/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV+ELL +G +RESLSPC+VP LL+PKKD +WRMC+D RAIN IT+RYR PIPRL+
Sbjct: 761  EIQRQVQELLDKGYVRESLSPCSVPVLLVPKKDGSWRMCVDCRAINNITIRYRHPIPRLD 820

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ VFSK+DLRSG  QIR++ GDEWK AFKT+ GL+EWLVMPFGL+NAP  ST
Sbjct: 821  DMLDELSGSLVFSKIDLRSGYHQIRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAP--ST 878

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN+  R+ IG  VVVYFDDIL+YSRS+ DH  HLRAV   LR+ + +    KC FC
Sbjct: 879  FMRLMNEVLRAFIGRFVVVYFDDILIYSRSIEDHHGHLRAVFDALRDARLFGNLEKCTFC 938

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +KV  I  WP P+TIT+ RSF GLA FY  F+  FSTI A
Sbjct: 939  TDRVSFLGYVVTPQGIEVDQAKVEAIHSWPVPTTITQVRSFLGLAGFYRRFVKDFSTIAA 998

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + +       F W  A   AFD +K +L+ AP+L LPDF + FEL CDAS  G+G VL 
Sbjct: 999  PLHELTKRNVTFTWAAAQRNAFDTLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLL 1058

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   PVA+FSEKLSGP   Y+TYD E +A+VR +  W+HYL+  EF+++SDHE+L+H+ +
Sbjct: 1059 QEGKPVAYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRS 1118

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q     RHA W  +++ F +
Sbjct: 1119 QAKHNRRHAKWVEFIESFPY 1138


>ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica]
            gi|462402465|gb|EMJ08022.1| hypothetical protein
            PRUPE_ppa025991mg [Prunus persica]
          Length = 1274

 Score =  443 bits (1140), Expect = e-122
 Identities = 216/361 (59%), Positives = 273/361 (75%), Gaps = 2/361 (0%)
 Frame = +1

Query: 1    AELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRL 180
            ++L RQ++ LL +G IR SLSPCAVP LL PKKD +W MC+DS A+NKI V+YRFPIPRL
Sbjct: 374  SKLNRQIQGLLAKGFIRHSLSPCAVPVLLTPKKDCSWGMCVDSCAVNKIIVKYRFPIPRL 433

Query: 181  EDLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPS 354
            ED+LD L+G+  FSK+DLRSG  QI +R GDEWK AFKT +GL+EWLVMPFG+SNAP  S
Sbjct: 434  EDMLDDLAGSQWFSKIDLRSGYHQISIREGDEWKTAFKTPDGLYEWLVMPFGMSNAP--S 491

Query: 355  TFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVF 534
            TFMRVM    R  IG  +VVYFDDIL+YSRS  +H +HLR +   LR EK YA   KC F
Sbjct: 492  TFMRVMTHVLRPYIGKFLVVYFDDILIYSRSREEHIQHLRTIFSTLRKEKLYANLKKCSF 551

Query: 535  CEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIM 714
             +P +LFLG+ +S  G+  D +KV  I  WP+P+T+TEARSFHGL SFY  FI  FS IM
Sbjct: 552  LQPQVLFLGFNISAAGVSTDPAKVEAIINWPTPTTLTEARSFHGLTSFYRRFIPGFSIIM 611

Query: 715  ALITDCIGLKPFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
            ALITDC+    F+WT AAA AF ++KQ+++ APV   PD  ++FE++CDAS  GIG VLS
Sbjct: 612  ALITDCMKQGAFLWTHAAAKAFTILKQKMTQAPVFRHPDLTKVFEVTCDASGVGIGGVLS 671

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q  HPVA+FSEKL+    RY+T+D EFY VV+A+R+W++YL  +EF+LYSDH+AL++L +
Sbjct: 672  QEGHPVAYFSEKLNEAKQRYSTHDKEFYDVVQALRYWQYYLLPNEFVLYSDHQALKYLHS 731

Query: 1075 Q 1077
            Q
Sbjct: 732  Q 732


>gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group]
            gi|31431012|gb|AAP52850.1| retrotransposon protein,
            putative, Ty3-gypsy subclass [Oryza sativa Japonica
            Group]
          Length = 2447

 Score =  443 bits (1139), Expect = e-122
 Identities = 213/381 (55%), Positives = 280/381 (73%), Gaps = 4/381 (1%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV ELL +G +RESLSPCAVP +L+PKKD +WRMC+D RAIN IT+RYR PIPRL+
Sbjct: 733  EIQRQVHELLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLD 792

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ VFSK+DLRSG  QIR++ GDEWK AFKT+ GL+EWLVMPFGL+NAP  ST
Sbjct: 793  DMLDELSGSIVFSKVDLRSGYHQIRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAP--ST 850

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN+  R  IG  VVVYFDDIL+YS+S+ +H+ HLRAV   LR+ + +    KC FC
Sbjct: 851  FMRLMNEVLRPFIGKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFC 910

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +KV  I+ WP+P T+++ RSF GLA FY  F+  FSTI A
Sbjct: 911  TDRVSFLGYVVTPQGIEVDQAKVEAIQSWPTPKTVSQVRSFLGLAGFYRRFVQDFSTIAA 970

Query: 718  LITDCIGLK--PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVL 891
             + + +  K  PF W  +   AF ++K +L+ AP+L LPDF + FEL CDAS  G+G VL
Sbjct: 971  PL-NVLTKKGVPFTWGTSQENAFHMLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVL 1029

Query: 892  SQSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLA 1071
             Q   PVA+FSEKLSGP   Y+TYD E YA+VR +  W+HYL+  EF+++SDHE+L+H+ 
Sbjct: 1030 LQEGKPVAYFSEKLSGPVLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIR 1089

Query: 1072 TQDNLPARHASWTTYLQQFTF 1134
            +Q  L  RHA W  +++ F +
Sbjct: 1090 SQGKLNRRHAKWVEFIESFPY 1110



 Score =  226 bits (575), Expect = 2e-56
 Identities = 118/278 (42%), Positives = 170/278 (61%), Gaps = 1/278 (0%)
 Frame = +1

Query: 298  GLFEWLVMPFGLSNAPAPSTFMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRA 477
            GL+E+ VM FGL+NAPA   FM +MN+     +   VVV+ DDILVYS+S  DH  HLR 
Sbjct: 1625 GLYEFTVMSFGLTNAPA--FFMNLMNKVFMEYLDKFVVVFIDDILVYSQSEEDHQHHLRL 1682

Query: 478  VLLILRNEKFYAKPNKCVFCEPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARS 657
            VL  LR  + YAK +KC F    + FLG+++S  G+ VD   V  + +W  P T+T+ RS
Sbjct: 1683 VLGKLREHQLYAKLSKCEFWLSEVKFLGHVISAKGVAVDPETVTAVTDWKQPKTVTQIRS 1742

Query: 658  FHGLASFYCHFIAHFSTIMALITDCIGL-KPFVWTPAAAAAFDVIKQRLSAAPVLLLPDF 834
            F GLA +Y  FI +FS I   +T  +   + FVW+P    AF  +K++L ++PVL+LPD 
Sbjct: 1743 FLGLAGYYRRFIENFSKIARPMTQLLKKEEKFVWSPQCEKAFQTLKEKLVSSPVLILPDT 1802

Query: 835  EQLFELSCDASKAGIGAVLSQSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHY 1014
             + F + CDAS  G+G VL Q  H VA+ S +L      Y T+D+E  AVV A++ WRHY
Sbjct: 1803 RKDFMVYCDASPQGLGCVLMQEGHVVAYASRQLWPHEGNYPTHDLELAAVVHALKIWRHY 1862

Query: 1015 LFHHEFILYSDHEALRHLATQDNLPARHASWTTYLQQF 1128
            L  +   +Y+DH++L+++ TQ +L  R   W   ++ +
Sbjct: 1863 LIGNRCEIYTDHKSLKYIFTQSDLNLRQRRWLELIKDY 1900


>gb|AAK94517.1| gag-pol polyprotein [Hordeum vulgare]
          Length = 1717

 Score =  443 bits (1139), Expect = e-122
 Identities = 215/380 (56%), Positives = 274/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E+ RQV+ELL +G IRESLSPCAVP +L+PKKD T RMC+D R IN IT+RYR PIPRL+
Sbjct: 789  EIMRQVQELLDKGYIRESLSPCAVPIILVPKKDGTSRMCVDCRGINNITIRYRHPIPRLD 848

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ +FSK+DLRSG  QIR++ GDEWK AFKT+ GL+EWLVMPFGL+NAP  ST
Sbjct: 849  DMLDELSGSIIFSKVDLRSGYHQIRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAP--ST 906

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN+  R+ IG  VVVYFDDIL+YSRSL DH  HLRAV   LR+ + +    KC FC
Sbjct: 907  FMRLMNEVLRAFIGRFVVVYFDDILIYSRSLEDHLDHLRAVFTALRDARLFGNLGKCTFC 966

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +K+  I  WP P T+T+ RSF GLA FY  F+  FSTI A
Sbjct: 967  TDRVSFLGYVVTPQGIEVDKAKIEAIESWPQPKTVTQVRSFLGLAGFYRRFVRDFSTIAA 1026

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + +      P+ W  A   AF V+K +L+ AP+L LPDF + FEL CDAS  G+G VL 
Sbjct: 1027 PLNELTKKDVPYSWGTAQEEAFTVLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLL 1086

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   PVA+FSEKLSGP   Y+TYD E YA+VR +  W+HYL+  EF+++SDHE+L+H+ +
Sbjct: 1087 QDGKPVAYFSEKLSGPSLNYSTYDKELYALVRTLETWQHYLWPKEFVIHSDHESLKHIKS 1146

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RHA W  +++ F +
Sbjct: 1147 QAKLNRRHAKWVEFIETFPY 1166


>emb|CAD39843.2| OSJNBb0072N21.12 [Oryza sativa Japonica Group]
          Length = 1239

 Score =  442 bits (1137), Expect = e-121
 Identities = 215/380 (56%), Positives = 275/380 (72%), Gaps = 3/380 (0%)
 Frame = +1

Query: 4    ELRRQVEELLQRGVIRESLSPCAVPTLLIPKKDSTWRMCIDSRAINKITVRYRFPIPRLE 183
            E++RQV+ELL +G +RESLSPCAVP LL+PKKD +WRMC+D RAIN IT+RYR PIPRL+
Sbjct: 539  EIQRQVQELLDKGYVRESLSPCAVPVLLVPKKDGSWRMCVDCRAINNITIRYRHPIPRLD 598

Query: 184  DLLDQLSGASVFSKLDLRSG--QIRVRPGDEWKIAFKTREGLFEWLVMPFGLSNAPAPST 357
            D+LD+LSG+ VFSK+DLRSG  QIR++ GDEWK AFKT+  L+EWLVMPFGL+NAP  ST
Sbjct: 599  DMLDELSGSLVFSKIDLRSGYHQIRMKLGDEWKTAFKTKFSLYEWLVMPFGLTNAP--ST 656

Query: 358  FMRVMNQA*RSLIGTCVVVYFDDILVYSRSLTDHWRHLRAVLLILRNEKFYAKPNKCVFC 537
            FMR+MN+  R+ IG  VVVYFD IL+YSRS+ DH  HLRAV   LR+ + +    KC FC
Sbjct: 657  FMRLMNEVLRAFIGRFVVVYFDGILIYSRSIEDHHGHLRAVFDALRDARLFGNLEKCTFC 716

Query: 538  EPAILFLGYLVSGDGIRVDVSKVAVIREWPSPSTITEARSFHGLASFYCHFIAHFSTIMA 717
               + FLGY+V+  GI VD +KV  I  WP P+TIT+ RSF GLA FY  F+  FSTI A
Sbjct: 717  TDRVSFLGYVVTPQGIEVDQAKVEAIHSWPVPTTITQVRSFLGLAGFYRRFVKDFSTIAA 776

Query: 718  LITDCIGLK-PFVWTPAAAAAFDVIKQRLSAAPVLLLPDFEQLFELSCDASKAGIGAVLS 894
             + +       F W  A   AFD +K +L+ AP+L LPDF + FEL CDAS  G+G VL 
Sbjct: 777  PLHELTKRNVTFTWAAAQRNAFDTLKDKLTHAPLLQLPDFNKTFELECDASGIGLGGVLL 836

Query: 895  QSAHPVAFFSEKLSGPCSRYNTYDVEFYAVVRAVRHWRHYLFHHEFILYSDHEALRHLAT 1074
            Q   PVA+FSEKLSGP   Y+TYD E +A+VR +  W+HYL+  EF+++SDHE+L+H+ +
Sbjct: 837  QEDKPVAYFSEKLSGPSLNYSTYDKELFALVRTLETWQHYLWPKEFVIHSDHESLKHIRS 896

Query: 1075 QDNLPARHASWTTYLQQFTF 1134
            Q  L  RHA W  +++ F +
Sbjct: 897  QAKLNRRHAKWVEFIESFPY 916

