BLASTX nr result

ID: Rheum21_contig00020082 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00020082
         (231 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                 146   2e-33
gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao]   146   3e-33
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   146   3e-33
gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]   146   3e-33
emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]   145   4e-33
gb|EMJ11440.1| hypothetical protein PRUPE_ppa014973mg, partial [...   145   7e-33
ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205...   144   9e-33
gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus pe...   144   1e-32
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   143   2e-32
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   143   2e-32
gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]   143   3e-32
gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]    143   3e-32
gb|EMJ26157.1| hypothetical protein PRUPE_ppa021114mg, partial [...   143   3e-32
ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...   142   4e-32
gb|EOX94089.1| Uncharacterized protein TCM_003206 [Theobroma cacao]   142   4e-32
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...   142   5e-32
gb|EMJ25340.1| hypothetical protein PRUPE_ppa016115mg [Prunus pe...   142   5e-32
ref|WP_016971918.1| hypothetical protein [Pseudomonas tolaasii]       142   6e-32
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...   142   6e-32
gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]             142   6e-32

>gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]
          Length = 923

 Score =  146 bits (369), Expect = 2e-33
 Identities = 67/76 (88%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYRELNKVT+KN+YPLPRIDDL DQLQGA VFSKIDLRSGY
Sbjct: 29  WGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY 88

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRI++ED+P+TAFR
Sbjct: 89  HQLRIKDEDVPKTAFR 104


>gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao]
          Length = 1052

 Score =  146 bits (368), Expect = 3e-33
 Identities = 68/76 (89%), Positives = 72/76 (94%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYR+LNKVT+KNKYPLPRIDDL DQLQGA+ FSKIDLRSGY
Sbjct: 540 WGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGY 599

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR EDIP+TAFR
Sbjct: 600 HQLRIRNEDIPKTAFR 615


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  146 bits (368), Expect = 3e-33
 Identities = 68/76 (89%), Positives = 72/76 (94%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYR+LNKVT+KNKYPLPRIDDL DQLQGA+ FSKIDLRSGY
Sbjct: 582 WGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGY 641

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR EDIP+TAFR
Sbjct: 642 HQLRIRNEDIPKTAFR 657


>gb|EOX93842.1| Uncharacterized protein TCM_002794 [Theobroma cacao]
          Length = 509

 Score =  146 bits (368), Expect = 3e-33
 Identities = 68/76 (89%), Positives = 72/76 (94%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYR+LNKVT+KNKYPLPRIDDL DQLQGA+ FSKIDLRSGY
Sbjct: 344 WGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGY 403

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR EDIP+TAFR
Sbjct: 404 HQLRIRNEDIPKTAFR 419


>emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]
          Length = 1797

 Score =  145 bits (367), Expect = 4e-33
 Identities = 67/76 (88%), Positives = 72/76 (94%)
 Frame = +3

Query: 3    WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
            WGAPVLFVKKKDGS RLCIDYRELNKVT++NKYPLPRIDDL DQLQGA VFSKIDLRSGY
Sbjct: 977  WGAPVLFVKKKDGSMRLCIDYRELNKVTVRNKYPLPRIDDLFDQLQGACVFSKIDLRSGY 1036

Query: 183  HQLRIREEDIPRTAFR 230
            HQLR+R ED+P+TAFR
Sbjct: 1037 HQLRVRSEDVPKTAFR 1052


>gb|EMJ11440.1| hypothetical protein PRUPE_ppa014973mg, partial [Prunus persica]
          Length = 747

 Score =  145 bits (365), Expect = 7e-33
 Identities = 63/76 (82%), Positives = 74/76 (97%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLC+DYR+LNK+T++N+YPLPRIDDL DQL+GAKVFSKIDLRSGY
Sbjct: 404 WGAPVLFVKKKDGTMRLCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGY 463

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+REED+P+TAFR
Sbjct: 464 HQLRVREEDVPKTAFR 479


>ref|XP_004153733.1| PREDICTED: uncharacterized protein LOC101205308, partial [Cucumis
           sativus]
          Length = 768

 Score =  144 bits (364), Expect = 9e-33
 Identities = 69/76 (90%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYRELNKVTIKN YPLPRIDDL DQLQGA VFSKIDLRSGY
Sbjct: 569 WGAPVLFVKKKDGSMRLCIDYRELNKVTIKNIYPLPRIDDLFDQLQGATVFSKIDLRSGY 628

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR+ DIP+TAFR
Sbjct: 629 HQLRIRDSDIPKTAFR 644


>gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica]
          Length = 1194

 Score =  144 bits (363), Expect = 1e-32
 Identities = 63/76 (82%), Positives = 74/76 (97%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLC+DYR+LNK+T++N+YPLPRIDDL DQL+GAKVFSKIDLRSGY
Sbjct: 301 WGAPVLFVKKKDGTMRLCVDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGY 360

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+REED+P+TAFR
Sbjct: 361 HQLRVREEDMPKTAFR 376


>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  143 bits (361), Expect = 2e-32
 Identities = 66/76 (86%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFV+KKDGS R+CIDYR+LNKVTIKNKYPLPRIDDL DQLQGA  FSKIDLRSGY
Sbjct: 700 WGAPVLFVRKKDGSLRICIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGY 759

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+RE DIP+TAFR
Sbjct: 760 HQLRVRERDIPKTAFR 775


>gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1515

 Score =  143 bits (361), Expect = 2e-32
 Identities = 66/76 (86%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFV+KKDGS R+CIDYR+LNKVTIKNKYPLPRIDDL DQLQGA  FSKIDLRSGY
Sbjct: 694 WGAPVLFVRKKDGSLRMCIDYRQLNKVTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGY 753

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+RE DIP+TAFR
Sbjct: 754 HQLRVRERDIPKTAFR 769


>gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]
          Length = 649

 Score =  143 bits (360), Expect = 3e-32
 Identities = 64/76 (84%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLCIDYR+LN++TIKNKYPLPRIDDL DQLQGA VFSK+DLRSGY
Sbjct: 386 WGAPVLFVKKKDGTLRLCIDYRQLNRMTIKNKYPLPRIDDLFDQLQGATVFSKVDLRSGY 445

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRI+E+D+P+TAFR
Sbjct: 446 HQLRIKEQDVPKTAFR 461


>gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1263

 Score =  143 bits (360), Expect = 3e-32
 Identities = 67/76 (88%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYR+LNKVT+KNKYPLPRIDDL DQLQ A+ FSKIDLRSGY
Sbjct: 391 WGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQRAQCFSKIDLRSGY 450

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR EDIP+TAFR
Sbjct: 451 HQLRIRNEDIPKTAFR 466


>gb|EMJ26157.1| hypothetical protein PRUPE_ppa021114mg, partial [Prunus persica]
          Length = 177

 Score =  143 bits (360), Expect = 3e-32
 Identities = 63/76 (82%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLCIDYR+LNK+T++N+YPLPRIDDL DQL+GAKVFSKIDLR GY
Sbjct: 72  WGAPVLFVKKKDGTMRLCIDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRFGY 131

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+REED+P+TAFR
Sbjct: 132 HQLRVREEDVPKTAFR 147


>ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum]
          Length = 823

 Score =  142 bits (359), Expect = 4e-32
 Identities = 64/76 (84%), Positives = 72/76 (94%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS R+CIDYR+LNKVTI+NKYP+PRIDDL DQLQGA +FSKIDLRSGY
Sbjct: 155 WGAPVLFVKKKDGSMRMCIDYRQLNKVTIRNKYPIPRIDDLFDQLQGASIFSKIDLRSGY 214

Query: 183 HQLRIREEDIPRTAFR 230
           HQL++R EDIP+TAFR
Sbjct: 215 HQLKVRVEDIPKTAFR 230


>gb|EOX94089.1| Uncharacterized protein TCM_003206 [Theobroma cacao]
          Length = 694

 Score =  142 bits (359), Expect = 4e-32
 Identities = 63/76 (82%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAP+LFVKKKDG+ RLCIDYR+LN++TIKNKYPLPRIDDL DQLQGA VFSK+DLRSGY
Sbjct: 343 WGAPILFVKKKDGTLRLCIDYRQLNRMTIKNKYPLPRIDDLFDQLQGATVFSKVDLRSGY 402

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRI+E+D+P+TAFR
Sbjct: 403 HQLRIKEQDVPKTAFR 418


>gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1502

 Score =  142 bits (358), Expect = 5e-32
 Identities = 66/76 (86%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLCIDYR+LNKVT+KNKYPLPRIDDL DQLQGA+ FSKIDLRSGY
Sbjct: 674 WGAPVLFVKKKDGSLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGY 733

Query: 183 HQLRIREEDIPRTAFR 230
           HQLRIR EDIP+ AF+
Sbjct: 734 HQLRIRNEDIPKIAFQ 749


>gb|EMJ25340.1| hypothetical protein PRUPE_ppa016115mg [Prunus persica]
          Length = 1269

 Score =  142 bits (358), Expect = 5e-32
 Identities = 63/76 (82%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLCIDYR+LNK+T++N+YPLPRIDDL DQL+GAKVFSKIDLRSGY
Sbjct: 411 WGAPVLFVKKKDGTMRLCIDYRQLNKITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGY 470

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+REED+ +TAFR
Sbjct: 471 HQLRVREEDVTKTAFR 486


>ref|WP_016971918.1| hypothetical protein [Pseudomonas tolaasii]
          Length = 137

 Score =  142 bits (357), Expect = 6e-32
 Identities = 63/76 (82%), Positives = 73/76 (96%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDG+ RLCIDYR+LNKVT+KNKYPLPRIDDL DQL+GA+VFSKIDLRSGY
Sbjct: 29  WGAPVLFVKKKDGTLRLCIDYRQLNKVTVKNKYPLPRIDDLFDQLKGAEVFSKIDLRSGY 88

Query: 183 HQLRIREEDIPRTAFR 230
           HQLR+++ D+P+TAFR
Sbjct: 89  HQLRVKDADVPKTAFR 104


>gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa]
           gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa
           kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica
           Group] gi|31431495|gb|AAP53268.1| retrotransposon
           protein, putative, Ty3-gypsy subclass [Oryza sativa
           Japonica Group]
          Length = 1230

 Score =  142 bits (357), Expect = 6e-32
 Identities = 66/76 (86%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVKKKDGS RLC DYRELNKVTIKNKYPLPRIDDL DQLQGA+VFSKIDL+SGY
Sbjct: 532 WGAPVLFVKKKDGSLRLCTDYRELNKVTIKNKYPLPRIDDLFDQLQGARVFSKIDLQSGY 591

Query: 183 HQLRIREEDIPRTAFR 230
           HQL+I+  DIP+TAFR
Sbjct: 592 HQLKIKPSDIPKTAFR 607


>gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 624

 Score =  142 bits (357), Expect = 6e-32
 Identities = 66/76 (86%), Positives = 71/76 (93%)
 Frame = +3

Query: 3   WGAPVLFVKKKDGSFRLCIDYRELNKVTIKNKYPLPRIDDLLDQLQGAKVFSKIDLRSGY 182
           WGAPVLFVK KDGSFR+CIDYR+LNKVTIKNKYPLPRIDDL DQLQGA VFSKIDLRSGY
Sbjct: 77  WGAPVLFVKTKDGSFRMCIDYRQLNKVTIKNKYPLPRIDDLFDQLQGACVFSKIDLRSGY 136

Query: 183 HQLRIREEDIPRTAFR 230
           HQL+IR  D+P+TAFR
Sbjct: 137 HQLKIRATDVPKTAFR 152

