BLASTX nr result

ID: Rehmannia25_contig00028514

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00028514
         (608 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABN08587.1| Polynucleotidyl transferase, Ribonuclease H fold ...   226   5e-57
gb|ABM55240.1| retrotransposon protein [Beta vulgaris]                218   1e-54
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                   217   2e-54
gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]              217   2e-54
emb|CAN77801.1| hypothetical protein VITISV_031477 [Vitis vinifera]   216   5e-54
emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]   214   1e-53
emb|CAA73042.1| polyprotein [Ananas comosus]                          212   6e-53
gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative...   210   2e-52
gb|AAP43917.1| integrase [Gossypium hirsutum]                         209   6e-52
gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom...   206   4e-51
gb|ADB85337.1| putative retrotransposon protein [Phyllostachys e...   206   4e-51
emb|CBL94133.1| putative polyprotein (retrotransposon protein) [...   206   5e-51
emb|CAC44142.1| putative polyprotein [Cicer arietinum]                202   4e-50
gb|ABA95071.1| retrotransposon protein, putative, Ty3-gypsy subc...   202   8e-50
gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus pe...   201   1e-49
emb|CAN79387.1| hypothetical protein VITISV_000074 [Vitis vinifera]   201   2e-49
gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobrom...   199   5e-49
gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]                  199   6e-49
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   199   6e-49
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   198   8e-49

>gb|ABN08587.1| Polynucleotidyl transferase, Ribonuclease H fold [Medicago
           truncatula]
          Length = 482

 Score =  226 bits (575), Expect = 5e-57
 Identities = 116/207 (56%), Positives = 148/207 (71%), Gaps = 7/207 (3%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQ GKV+AYASR+L+ YE NYPTHDLELAA+VF LK WRHYLYG+ F VF+DHKSL
Sbjct: 77  GGVLMQDGKVVAYASRQLRIYEKNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSL 136

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKA-YVARLMIREWKSLE 248
           KYLF QK+LN+RQRRW+EL++DY F + YHPGKANVVADALSRK  +++ LM+RE++ LE
Sbjct: 137 KYLFDQKELNMRQRRWLELLKDYDFGLNYHPGKANVVADALSRKTLHMSALMVREFELLE 196

Query: 247 E------LSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDF 86
           +      +  W+P+ V   +    L +  E L  I  +Q+ D KF +      Q   SDF
Sbjct: 197 QFRDLSLVCEWSPQSVKLGM----LKIDSEFLKSIKEAQKVDVKFVDLLVASNQTEDSDF 252

Query: 85  KLGDDSILRFRNRICIPHDEDLKRKIL 5
           K+ D  +LRFR RICIP D ++K+ IL
Sbjct: 253 KVDDHGVLRFRGRICIPDDAEMKKMIL 279


>gb|ABM55240.1| retrotransposon protein [Beta vulgaris]
          Length = 1501

 Score =  218 bits (554), Expect = 1e-54
 Identities = 118/208 (56%), Positives = 144/208 (69%), Gaps = 8/208 (3%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VL Q+GKVIAYAS +LKPYE NYPTHDLELAAIVFALK WRHYLYGA   +FTDHKSL
Sbjct: 868  GCVLQQNGKVIAYASCQLKPYEANYPTHDLELAAIVFALKIWRHYLYGATCKIFTDHKSL 927

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKA-------YVARLMIR 266
            KY+F+QKDLN+RQRRW+ELI+DY  DIQYH GKANVVADALSRK+        V   + R
Sbjct: 928  KYIFTQKDLNMRQRRWLELIKDYDLDIQYHEGKANVVADALSRKSSHSLSTLIVPEELCR 987

Query: 265  EWKSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDF 86
            + K L  L   NP +   +L  +NLS+   +  EI+  Q  D    + K KM Q  + DF
Sbjct: 988  DMKRL-NLEILNPGESEARL--SNLSLGVSIFDEIIEGQVGDEHLDKIKEKMKQGKEIDF 1044

Query: 85   KLGDDSILRFRNRICIPHD-EDLKRKIL 5
            K+ +D  LRF+ R C+P    DLKR+++
Sbjct: 1045 KIHEDGSLRFKGRWCVPQKCNDLKRRLM 1072


>gb|AEV42258.1| hypothetical protein [Beta vulgaris]
          Length = 1553

 Score =  217 bits (552), Expect = 2e-54
 Identities = 116/206 (56%), Positives = 145/206 (70%), Gaps = 5/206 (2%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ+GKVIAYASR+LKPYEVNYPTHDLELAAIVFALK WRHYLYG    +FTDHKSL
Sbjct: 900  GCVLMQNGKVIAYASRQLKPYEVNYPTHDLELAAIVFALKIWRHYLYGVTCRIFTDHKSL 959

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK-AYVARLMIREWKSLE 248
            KY+F+QKDLN+RQRRW+ELI+DY  DIQYH GKANVVADALSRK ++    ++   K  E
Sbjct: 960  KYIFTQKDLNMRQRRWLELIKDYDLDIQYHEGKANVVADALSRKSSHSLNTLVVADKLCE 1019

Query: 247  ELSYWNPKKV---GTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLG 77
            E S    + V     + L + L+++P  L EI  SQ  D K    K K+ +     F + 
Sbjct: 1020 EFSRLQIEVVHEGEVERLLSALTIEPNFLEEIRASQPGDVKLERVKAKLKEGKAEGFAIH 1079

Query: 76   DDSILRFRNRICIPHD-EDLKRKILS 2
            +D  +R++ R C+P   E+LK+KI+S
Sbjct: 1080 EDGSIRYKGRWCVPQKCEELKQKIMS 1105


>gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]
          Length = 2037

 Score =  217 bits (552), Expect = 2e-54
 Identities = 110/209 (52%), Positives = 148/209 (70%), Gaps = 9/209 (4%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQHGKVIAYASR+LK +E NYPTHDLE+ A++FALK WRHYLYG    +FTDHKSL
Sbjct: 1800 GCVLMQHGKVIAYASRQLKKHEQNYPTHDLEMTAVIFALKIWRHYLYGETCEIFTDHKSL 1859

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKA--------YVARLMI 269
            KY+F Q+DLNLRQRRWMEL++DY   I YHPGKANVVADALSRK+         V R +I
Sbjct: 1860 KYIFQQRDLNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRKSSGSLAHIQEVRRPLI 1919

Query: 268  REWKSL-EELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKS 92
            RE   L +E   ++  + G  +  A+  VK +L  +I  +Q+KD      +N++ Q   +
Sbjct: 1920 RELHELVDEGVRFDLSEAGAMI--AHFQVKSDLFDKIKAAQKKDDSLLRIRNEVEQGKAA 1977

Query: 91   DFKLGDDSILRFRNRICIPHDEDLKRKIL 5
             F +GDD +LR+++R+C+P  +DL+R+++
Sbjct: 1978 GFVIGDDDVLRYKDRLCVPDVDDLRRELM 2006


>emb|CAN77801.1| hypothetical protein VITISV_031477 [Vitis vinifera]
          Length = 855

 Score =  216 bits (549), Expect = 5e-54
 Identities = 108/201 (53%), Positives = 146/201 (72%)
 Frame = -2

Query: 607 FGAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKS 428
           +G VLMQH KV+A AS++LKPYE NYPTHDLELAA+VFALK WRH+L+G  + +FTDHKS
Sbjct: 302 WGCVLMQHXKVVACASKQLKPYERNYPTHDLELAAVVFALKIWRHFLFGETYEIFTDHKS 361

Query: 427 LKYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIREWKSLE 248
           LKYLFSQK+LN+R  RW+EL++DY   IQYHPGKANVVADALSRK   +R ++ + +SL+
Sbjct: 362 LKYLFSQKELNMRXGRWIELLKDYDCIIQYHPGKANVVADALSRK---SRQLLEDLRSLQ 418

Query: 247 ELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLGDDS 68
                + + + +  L AN  V+P+L+  I   Q+ D    +   ++ +  K DF L DD 
Sbjct: 419 V----HMRVLDSGALVANFKVQPDLVGRIKALQKNDLNLVQLMEEVKKGSKPDFVLSDDG 474

Query: 67  ILRFRNRICIPHDEDLKRKIL 5
           ILRF  R+C+P+D DL+R++L
Sbjct: 475 ILRFMTRLCVPNDGDLRRELL 495


>emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]
          Length = 984

 Score =  214 bits (546), Expect = 1e-53
 Identities = 111/204 (54%), Positives = 146/204 (71%), Gaps = 4/204 (1%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQHG+V+AYASR+LKPYE NYPTHD ELA +VFALK WRH+L+G    +FTDHKSL
Sbjct: 367 GCVLMQHGRVVAYASRQLKPYERNYPTHDSELADVVFALKIWRHFLFGETCEIFTDHKSL 426

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIR--EWKSL 251
           KYLFSQK LN+RQRRW+EL++DY + IQYH  KANVVADALSRK+  +   IR  + + L
Sbjct: 427 KYLFSQKKLNMRQRRWIELLKDYDYIIQYHSRKANVVADALSRKSVGSLTAIRGCQRQLL 486

Query: 250 EELS--YWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLG 77
           E+L     + + + +  L AN  V+P+L+  I   Q+ D    +   ++ +  K DF L 
Sbjct: 487 EDLRSLQVHMRVLDSGALIANFRVQPDLVGRIKALQKNDLNLVQLMEEVKKGSKLDFVLS 546

Query: 76  DDSILRFRNRICIPHDEDLKRKIL 5
           DD ILRF  R+C+P+DEDL+R++L
Sbjct: 547 DDGILRFGTRLCVPNDEDLRRELL 570


>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score =  212 bits (540), Expect = 6e-53
 Identities = 112/204 (54%), Positives = 139/204 (68%), Gaps = 4/204 (1%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQ  KVIAYASR+LK YE NYPTHDLELAA+VFALK WRHYLYG    V+TDHKSL
Sbjct: 325 GCVLMQDDKVIAYASRQLKEYEKNYPTHDLELAAVVFALKLWRHYLYGERCEVYTDHKSL 384

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAY--VARLMIREWKSL 251
           KYLF+QK+LNLRQRRW+EL++DY   I YHPGKANVVADALSRK+   +A  ++ + + +
Sbjct: 385 KYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLI 444

Query: 250 EELSYWNPKKV--GTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLG 77
           E++     + V   T +    L V+P LL  I   Q  D +  + K KM+     DF L 
Sbjct: 445 EQMKRLELEIVTPDTPMRLMTLVVQPTLLDRIKEKQASDVELQKIKGKMVDGCTGDFTLD 504

Query: 76  DDSILRFRNRICIPHDEDLKRKIL 5
            D ++RFR RIC+P D  +K  IL
Sbjct: 505 GDGLMRFRGRICVPADSGIKEDIL 528


>gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao]
          Length = 1347

 Score =  210 bits (535), Expect = 2e-52
 Identities = 111/209 (53%), Positives = 144/209 (68%), Gaps = 9/209 (4%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ  KV+AYASR+LK +E NYPTHDLELAA+VFALK WRHYLYG    +FTDHKSL
Sbjct: 739  GCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYLYGEHCRIFTDHKSL 798

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK--AYVARLMIREWKSL 251
            KYL +QK+LNLRQRRW+ELI+DY   I YHPGKANVVADALSRK  + +A L    + +L
Sbjct: 799  KYLLTQKELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSSSSLAALQSCYFSAL 858

Query: 250  EELSYWNPKKVGTQL-------LCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKS 92
             E+     K +G QL       + AN  V+P LL +I   Q+ D +  +   K+   G S
Sbjct: 859  IEM-----KSLGVQLRNGEDGSVLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVS 913

Query: 91   DFKLGDDSILRFRNRICIPHDEDLKRKIL 5
            +F+ G+D++L FR+R+C+P    L++ I+
Sbjct: 914  EFRFGEDNVLMFRDRVCVPEGNQLRQTIM 942


>gb|AAP43917.1| integrase [Gossypium hirsutum]
          Length = 355

 Score =  209 bits (531), Expect = 6e-52
 Identities = 108/202 (53%), Positives = 141/202 (69%), Gaps = 1/202 (0%)
 Frame = -2

Query: 607 FGAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKS 428
           +G VLMQ GKV+AYASR+LKP++ NYPTHDLELAA+VFALK WRHYLYG +  V+TDHKS
Sbjct: 1   WGCVLMQEGKVVAYASRQLKPHKKNYPTHDLELAAMVFALKIWRHYLYGEKCRVYTDHKS 60

Query: 427 LKYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK-AYVARLMIREWKSL 251
           LKYL SQKDLNLRQRRW+EL++DY   I YHPGKANVVADAL+RK  +  R+M  + K  
Sbjct: 61  LKYLMSQKDLNLRQRRWLELLKDYELVIDYHPGKANVVADALNRKLLFALRVMNTQLKIS 120

Query: 250 EELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLGDD 71
           ++ S           + A L  +P  L EI  +Q+ D      + +     +SDF++G +
Sbjct: 121 DDGS-----------ILAELRARPMFLQEISEAQKNDQNLLAKRKQCEADTRSDFRIGSN 169

Query: 70  SILRFRNRICIPHDEDLKRKIL 5
             L F+NRIC+P +++L +KIL
Sbjct: 170 GCLMFKNRICVPKNDELIQKIL 191


>gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1515

 Score =  206 bits (524), Expect = 4e-51
 Identities = 110/209 (52%), Positives = 143/209 (68%), Gaps = 9/209 (4%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ  KV+AYASR+LK +E NYPTHDLELAA+VFALK WRHYLYG    +FTDHKSL
Sbjct: 885  GCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYLYGEHCRIFTDHKSL 944

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK--AYVARLMIREWKSL 251
            KYL +QK+LNLRQRRW+ELI+DY   I YH GKANVVADALSRK  + +A L    + +L
Sbjct: 945  KYLLTQKELNLRQRRWLELIKDYDLVIDYHLGKANVVADALSRKSSSSLAALQSCYFPAL 1004

Query: 250  EELSYWNPKKVGTQL-------LCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKS 92
             E+     K +G QL       L AN  V+P LL +I   Q+ D +  +   K+   G S
Sbjct: 1005 IEM-----KSLGVQLRNGEDGSLLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVS 1059

Query: 91   DFKLGDDSILRFRNRICIPHDEDLKRKIL 5
            +F+ G+D++L F++R+C+P    L++ I+
Sbjct: 1060 EFRFGEDNVLMFKDRVCVPEGNQLRQAIM 1088


>gb|ADB85337.1| putative retrotransposon protein [Phyllostachys edulis]
          Length = 1053

 Score =  206 bits (524), Expect = 4e-51
 Identities = 112/204 (54%), Positives = 143/204 (70%), Gaps = 4/204 (1%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ GKV+AYASR+L+P+E NYPTHDLELAAIV ALK WRHYL G    +FTDHKSL
Sbjct: 432  GCVLMQEGKVVAYASRQLRPHEGNYPTHDLELAAIVHALKIWRHYLIGNRCEIFTDHKSL 491

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIR--EWKSL 251
            KY+F+Q +LNLRQRRW+ELI+DY   I YHPGKANVVADALSRKAY   ++++  + +  
Sbjct: 492  KYIFTQSELNLRQRRWLELIKDYDLGIHYHPGKANVVADALSRKAYCNTILVQKNQPELY 551

Query: 250  EELSYWNPKKVGTQLLCAN-LSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKS-DFKLG 77
            EEL + N + V     C N L V+P L ++I   Q +D    E K  M +RGK+  F   
Sbjct: 552  EELKHLNLEIVNQG--CVNALEVQPTLQSQIREKQLEDEDIKEIKKNM-RRGKAPGFSED 608

Query: 76   DDSILRFRNRICIPHDEDLKRKIL 5
            +   + F NRIC+P+ ++LK+ IL
Sbjct: 609  EQGTVWFGNRICVPNQQELKQSIL 632


>emb|CBL94133.1| putative polyprotein (retrotransposon protein) [Malus domestica]
          Length = 362

 Score =  206 bits (523), Expect = 5e-51
 Identities = 110/209 (52%), Positives = 143/209 (68%), Gaps = 9/209 (4%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQH +VIAYASR+LK +E NYPTHDLELAAIVFALK WRHYLYG +  +FTDHKSL
Sbjct: 98  GCVLMQHSRVIAYASRQLKTHERNYPTHDLELAAIVFALKIWRHYLYGEKCKIFTDHKSL 157

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK-------AYVARL-MI 269
           +YLF+Q DLNLRQRRW+EL+ DY   I+YHPG+ANVVADALSRK        Y +R+ ++
Sbjct: 158 QYLFTQHDLNLRQRRWLELLSDYDCTIEYHPGRANVVADALSRKPQGRLNALYTSRVPLL 217

Query: 268 REWKSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSD 89
            E +S      W  +   ++   AN  VKP L+  +L +Q  D +  E  N   +  K D
Sbjct: 218 AELRSTGVELEWEEQ---SEAFLANFQVKPILIDRVLAAQSLDEEIQELINLRNEGKKKD 274

Query: 88  FKL-GDDSILRFRNRICIPHDEDLKRKIL 5
            K+ G D +L   NR+ +P++E+LK++IL
Sbjct: 275 LKIRGSDGMLMQENRMYVPNNEELKKEIL 303


>emb|CAC44142.1| putative polyprotein [Cicer arietinum]
          Length = 655

 Score =  202 bits (515), Expect = 4e-50
 Identities = 106/205 (51%), Positives = 141/205 (68%), Gaps = 5/205 (2%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQH KV+AYASR+LK +E NYPTHDLELAA+VFALK WRHYLYG  FTVF+DHKSL
Sbjct: 128 GCVLMQHKKVVAYASRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGCTFTVFSDHKSL 187

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIRE-----W 260
           KYLF QK+LN+RQRRW+E ++D+ F +QYHPGKANVVADALSR++     +I       W
Sbjct: 188 KYLFDQKELNMRQRRWIETLKDFDFTLQYHPGKANVVADALSRRSVSVSSLIMARQQELW 247

Query: 259 KSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKL 80
           ++  +L + N +     L    + +   LL +I  SQ  D    E +N ++Q   ++FK+
Sbjct: 248 EAFRDL-HLNVEFAPGILKFGMIKISSGLLEDIANSQD-DVLIQEKRNLIVQGKTTEFKI 305

Query: 79  GDDSILRFRNRICIPHDEDLKRKIL 5
           G D++LR   RIC+P    +++ IL
Sbjct: 306 GADNVLRCNGRICVPEITAMRKTIL 330


>gb|ABA95071.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1122

 Score =  202 bits (513), Expect = 8e-50
 Identities = 105/203 (51%), Positives = 139/203 (68%), Gaps = 3/203 (1%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ  KV+AYASR+L+P+EVNYPTHDLELAA+V ALK WRHYL G    ++TDHKSL
Sbjct: 610  GCVLMQDRKVVAYASRQLRPHEVNYPTHDLELAAVVHALKIWRHYLIGNRCEIYTDHKSL 669

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIREWKSLEE 245
            KY+F+Q +LN+RQRRW+ELI+DY+  I YHPGKAN+VADALSRK Y A  ++  W + + 
Sbjct: 670  KYIFTQSELNMRQRRWLELIKDYNLGIHYHPGKANMVADALSRKTYCATALV--WPTQDN 727

Query: 244  LSYWNPKKVGTQLLCAN---LSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLGD 74
            L     K   T +   N   L+V+P L T+I  +QQ+D    E K K+ ++    F + D
Sbjct: 728  LCREIEKLKLTMVPTGNLASLTVQPTLETQIREAQQEDEGMKEMKGKIRKKRLKGFTIDD 787

Query: 73   DSILRFRNRICIPHDEDLKRKIL 5
               + F +RIC+P  ++LK  IL
Sbjct: 788  QDTVWFGSRICVPARKELKDLIL 810


>gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica]
          Length = 1194

 Score =  201 bits (511), Expect = 1e-49
 Identities = 106/205 (51%), Positives = 142/205 (69%), Gaps = 5/205 (2%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQHG+VIAYASR+LK +E+NYP HDLELAA+VFALK WRHYLYG    +FTDHKSL
Sbjct: 573  GCVLMQHGRVIAYASRQLKKHELNYPVHDLELAAVVFALKIWRHYLYGETCQIFTDHKSL 632

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIREWKSLEE 245
            KYLF+QK+LNLRQRRW+ELI+DY   I++HPG+ANVVADALSRK+  +   +R  + L  
Sbjct: 633  KYLFTQKELNLRQRRWLELIKDYDCTIEHHPGRANVVADALSRKSSGSIAYLR-GRYLPL 691

Query: 244  LSYWNPKKVGTQL-----LCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKL 80
            +      ++G  +     L A L V+P L+  IL +Q +D      + ++    ++D  +
Sbjct: 692  MVEMRKLRIGLDVDNQGALLATLHVRPVLVERILAAQSQDPLICTLRVEVANGDRTDCSV 751

Query: 79   GDDSILRFRNRICIPHDEDLKRKIL 5
             +D  L   NR+ +P+DE LKR+IL
Sbjct: 752  RNDGALMVGNRLYVPNDEALKREIL 776


>emb|CAN79387.1| hypothetical protein VITISV_000074 [Vitis vinifera]
          Length = 757

 Score =  201 bits (510), Expect = 2e-49
 Identities = 103/200 (51%), Positives = 139/200 (69%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQHGKV+AYASR+L  YE NYPTHDLEL  +VF LK WRH+L+G    +FT+HKSL
Sbjct: 232 GCVLMQHGKVVAYASRQLTSYERNYPTHDLELVVVVFELKIWRHFLFGETCEIFTNHKSL 291

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVARLMIREWKSLEE 245
           KYLFSQK+LN+RQRRW+EL++DY   IQYHP KANV+AD LSRK    R ++   +SL+ 
Sbjct: 292 KYLFSQKELNMRQRRWIELLKDYDCIIQYHPRKANVMADVLSRK---FRQLLEVLRSLQV 348

Query: 244 LSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKSDFKLGDDSI 65
               + + + +  L AN  V+P L+  I   Q+K+ +  +   ++ +  K +F L DD I
Sbjct: 349 ----HIRVLDSGALVANFRVRPNLVRIIKTLQKKNMQLVQLMEEVKRGSKPNFVLSDDGI 404

Query: 64  LRFRNRICIPHDEDLKRKIL 5
           LRF  R+C+P D DL+R++L
Sbjct: 405 LRFGTRLCVPKDGDLRRELL 424


>gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1537

 Score =  199 bits (506), Expect = 5e-49
 Identities = 111/209 (53%), Positives = 137/209 (65%), Gaps = 9/209 (4%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQ  KVIAYASR+LK +E NYPTHDLELA +VFALK WRHYLYG    +F DHKSL
Sbjct: 898  GCVLMQDEKVIAYASRQLKKHETNYPTHDLELATVVFALKIWRHYLYGERCRIFYDHKSL 957

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK--AYVARLMIREWKSL 251
            KYL +QK+LNLRQR+W+ELI+DY   I YHP KANVVADALSRK  + +A L    +  L
Sbjct: 958  KYLLTQKELNLRQRQWLELIKDYDLVIDYHPRKANVVADALSRKSSSSLATLRSSYFSML 1017

Query: 250  EELSYWNPKKVGTQL-------LCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQRGKS 92
             E+     K +G QL       L A+  V+P LL +I   Q+ D    +   K+     S
Sbjct: 1018 LEM-----KSLGIQLNNGEDGTLLASFVVRPSLLNQIRELQKSDDWLKQEVQKLQDGKAS 1072

Query: 91   DFKLGDDSILRFRNRICIPHDEDLKRKIL 5
            +F+L DD  L  R+RIC+P D+ L+R IL
Sbjct: 1073 EFRLSDDGTLMLRDRICVPKDDQLRRAIL 1101


>gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]
          Length = 395

 Score =  199 bits (505), Expect = 6e-49
 Identities = 113/213 (53%), Positives = 144/213 (67%), Gaps = 13/213 (6%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQHGKVIAYASR+LK +E NYP HDLE+AAIVFALK WRHYLYG    ++TDHKSL
Sbjct: 57  GCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSL 116

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK-----AYVA---RLMI 269
           KY+F Q+DLNLRQRRWMEL++DY   I YHPGKANVVADALSRK     A+++   R ++
Sbjct: 117 KYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV 176

Query: 268 REWKSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQ----- 104
           RE  SL ++     +   T  L A+  V+P L+  I  +Q KD    EF  K ++     
Sbjct: 177 REIHSLGDIGV-RLEVAETNALLAHFRVRPILMDRIKEAQSKD----EFVIKALEDPQGR 231

Query: 103 RGKSDFKLGDDSILRFRNRICIPHDEDLKRKIL 5
           +GK  F  G D +LR+  R+ +P  + L+R+IL
Sbjct: 232 KGKM-FTKGTDGVLRYGTRLYVPDGDGLRREIL 263


>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score =  199 bits (505), Expect = 6e-49
 Identities = 113/213 (53%), Positives = 145/213 (68%), Gaps = 13/213 (6%)
 Frame = -2

Query: 604 GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
           G VLMQHGKVIAYASR+LK +E NYP H+LE+AAIVFALK WRHYLYG    ++TDHKSL
Sbjct: 148 GCVLMQHGKVIAYASRQLKRHEQNYPIHNLEMAAIVFALKIWRHYLYGETCEIYTDHKSL 207

Query: 424 KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRK-----AYVA---RLMI 269
           KY+F Q+DLNLRQRRWMEL++DY   I YHPGKANVVADALSRK     A+++   R ++
Sbjct: 208 KYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV 267

Query: 268 REWKSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQ----- 104
           RE  SL ++     +   T  L A+  V+P L+ +I  +Q KD    EF  K ++     
Sbjct: 268 REIHSLGDIGV-RLEVAETNALLAHFRVRPILMDKIKEAQSKD----EFVIKALEDPQGR 322

Query: 103 RGKSDFKLGDDSILRFRNRICIPHDEDLKRKIL 5
           +GK  F  G D +LR+  R+ +P  + L+RKIL
Sbjct: 323 KGKM-FTKGTDGVLRYGTRLYVPDGDGLRRKIL 354


>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score =  198 bits (504), Expect = 8e-49
 Identities = 112/213 (52%), Positives = 142/213 (66%), Gaps = 13/213 (6%)
 Frame = -2

Query: 604  GAVLMQHGKVIAYASRKLKPYEVNYPTHDLELAAIVFALKKWRHYLYGAEFTVFTDHKSL 425
            G VLMQHGKVIAYASR+LK +E NYP HDLE+AAIVFALK WRHYLYG    ++TDHKSL
Sbjct: 431  GCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSL 490

Query: 424  KYLFSQKDLNLRQRRWMELIEDYHFDIQYHPGKANVVADALSRKAYVA--------RLMI 269
            KY+F Q+DLNLRQRRWMEL++DY   I YHPGKANVVADALSRK+  +        R ++
Sbjct: 491  KYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHIFIGRRSLV 550

Query: 268  REWKSLEELSYWNPKKVGTQLLCANLSVKPELLTEILLSQQKDHKFGEFKNKMIQ----- 104
            RE  SL ++     +   T  L A+  V+P L+  I  +Q KD    EF  K ++     
Sbjct: 551  REIHSLGDIGV-RLEVAETNALLAHFRVRPILMDRIKEAQSKD----EFVIKALEDPQGR 605

Query: 103  RGKSDFKLGDDSILRFRNRICIPHDEDLKRKIL 5
            +GK  F  G D +LR+  R+ +P  + L+R+IL
Sbjct: 606  KGKM-FTKGTDGVLRYGTRLYVPDGDGLRREIL 637

