BLASTX nr result

ID: Rehmannia22_contig00035257

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00035257
         (732 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ11865.1| hypothetical protein PRUPE_ppa022462mg [Prunus pe...   176   5e-42
ref|XP_002329042.1| predicted protein [Populus trichocarpa]           163   5e-38
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...   157   3e-36
ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part...   154   2e-35
ref|XP_006293129.1| hypothetical protein CARUB_v10019435mg [Caps...   149   9e-34
ref|XP_006299377.1| hypothetical protein CARUB_v10015536mg [Caps...   139   9e-31
ref|XP_006300423.1| hypothetical protein CARUB_v10021967mg, part...   134   2e-29
gb|AAM15062.1| putative retroelement integrase [Arabidopsis thal...   114   4e-23
ref|XP_004140476.1| PREDICTED: uncharacterized protein LOC101221...   108   2e-21
ref|XP_004161393.1| PREDICTED: uncharacterized protein LOC101232...   108   2e-21
ref|XP_004134253.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   107   4e-21
gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao]        106   7e-21
gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao]   106   9e-21
gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]   103   8e-20
gb|EMJ01412.1| hypothetical protein PRUPE_ppa015697mg [Prunus pe...   100   6e-19
ref|XP_004295592.1| PREDICTED: uncharacterized protein LOC101291...   100   8e-19
ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306...    99   1e-18
gb|EMJ21583.1| hypothetical protein PRUPE_ppa021778mg [Prunus pe...    98   2e-18
gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus pe...    98   2e-18
gb|EMS54598.1| Transposon Ty3-G Gag-Pol polyprotein [Triticum ur...    97   4e-18

>gb|EMJ11865.1| hypothetical protein PRUPE_ppa022462mg [Prunus persica]
          Length = 606

 Score =  176 bits (447), Expect = 5e-42
 Identities = 104/257 (40%), Positives = 148/257 (57%), Gaps = 14/257 (5%)
 Frame = -2

Query: 731 LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
           LVYQ+LQNLRQG+ TV  Y+ EFY+L+AR D+ E D+QL SRYIGG+R+ FQD LNLF P
Sbjct: 102 LVYQQLQNLRQGNHTVGEYTTEFYELVARSDLAETDEQLESRYIGGMRVQFQDTLNLFDP 161

Query: 551 VTVSEAHQRALLLERQQNRRT------SPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAX 390
            +V++A QRAL LE+  +R+       S    +  G      P+  S TP V +  P++ 
Sbjct: 162 FSVAKAQQRALQLEKHMSRKANSGGAWSGNSPNNRGGGSNSAPFRAS-TPLVQN--PKSF 218

Query: 389 XXXXXXXXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNFLVDELDSS------E 228
                      G ++ + R   CF CG++GH  + C K       L  E D +      +
Sbjct: 219 VSDPLGKAQTVGPKRTAFR---CFKCGETGHCMAECKKSDRVGKGLFIEHDENQLQEYHD 275

Query: 227 FSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPD--QRHNIFESTCTVNG 54
           F   PVYD+  P+D  EE +  D G  L++R+ C TPR  +  D   R+N+F+S CT+ G
Sbjct: 276 FEHGPVYDN-EPNDVVEEYMTEDDGPLLMVRKTCFTPRETEGSDGWLRNNVFQSICTIGG 334

Query: 53  KVCRFIIDSGSSENVVA 3
           KVC+ +ID GS EN+++
Sbjct: 335 KVCKLVIDPGSCENIIS 351


>ref|XP_002329042.1| predicted protein [Populus trichocarpa]
          Length = 442

 Score =  163 bits (413), Expect = 5e-38
 Identities = 98/257 (38%), Positives = 143/257 (55%), Gaps = 15/257 (5%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +YQ+LQNLRQG+R+VD Y+ EFY+L++R  + E ++  V RYIG LRI FQD+LN+F  +
Sbjct: 178 LYQRLQNLRQGNRSVDDYTTEFYQLVSRDAIAEDEESRVVRYIGRLRIQFQDVLNMFDVL 237

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADR--------QVPYTDSRTPGVPSVQPRA 393
           +VS+AHQRA+ LE+Q  RR +         A+          + +  +      S   R 
Sbjct: 238 SVSDAHQRAVQLEKQLVRRNTGGLNFGGSGANTSNNSGRTGSMNFGGTGAGSASSSSTRT 297

Query: 392 XXXXXXXXXPHSGSRQNSGRPG-ACFSCGDSGHKQSACPKFLGARNFLVDELD------S 234
                    P   +   +   G  CF+CG+ GH+ + C K  G R  L  +++       
Sbjct: 298 AIPPSPITKPTMPTHVTTPNTGFRCFNCGELGHRFAECKK--GQRRGLFSDVEEINREQE 355

Query: 233 SEFSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNG 54
            +    PVYD        EE L GD G  L++RR+CL P V +    R N+F+STCT++G
Sbjct: 356 GDVEAEPVYDE-------EERLEGDAGPMLMIRRSCLAPHVVEDDWLRTNVFQSTCTISG 408

Query: 53  KVCRFIIDSGSSENVVA 3
           K+CRFI+DSGS EN+V+
Sbjct: 409 KICRFIVDSGSCENIVS 425


>ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
           gi|557089351|gb|ESQ30059.1| hypothetical protein
           EUTSA_v10012229mg [Eutrema salsugineum]
          Length = 382

 Score =  157 bits (397), Expect = 3e-36
 Identities = 101/249 (40%), Positives = 137/249 (55%), Gaps = 7/249 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           ++ +LQNLRQGSRTVD Y+EEFY LL R ++ +   QLVSR+IGGLR   Q+ L  F P 
Sbjct: 1   MFTRLQNLRQGSRTVDEYAEEFYLLLTRNELNDTQIQLVSRFIGGLRPQLQNSLTQFDPS 60

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQ-PRAXXXXXXX 372
           TV+EAH+RAL  E Q    +S       G    ++  TD+      S +  ++       
Sbjct: 61  TVAEAHRRALAFETQSKAGSS---WTNSGNWRPRLTGTDTENSSHDSPEVSKSQTAPRNS 117

Query: 371 XXPHSGSRQNSGRPGA--CFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEPPVYDSP 198
                 + + S RP A  C+SCG+ GH+Q+ACP     R  L+++ +        VY+S 
Sbjct: 118 TTLDESTLRRSTRPPALKCYSCGEPGHRQTACPN-QQRRGLLLEDTEG-------VYNSA 169

Query: 197 SPHDT---PEEILIGDIGTS-LILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCRFIID 30
              DT    E +  GD     L+LRR CL P   + P  R NIF STCT+ GK+C  +ID
Sbjct: 170 DEEDTGIYEETLTSGDSNAPVLMLRRICLAPVGYEEPWLRTNIFRSTCTIKGKLCNLVID 229

Query: 29  SGSSENVVA 3
           SGSS NVV+
Sbjct: 230 SGSSRNVVS 238


>ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema
           salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical
           protein EUTSA_v10015409mg, partial [Eutrema salsugineum]
          Length = 367

 Score =  154 bits (390), Expect = 2e-35
 Identities = 98/260 (37%), Positives = 135/260 (51%), Gaps = 18/260 (6%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +Y + QNLRQG+RT+D Y+EEF  LL R ++ + + QLVSR+I GLR   Q  +  F P 
Sbjct: 2   MYTRHQNLRQGTRTIDEYAEEFSLLLTRTEIYDSEVQLVSRFISGLRPQLQSAMAQFDPD 61

Query: 548 TVSEAHQRALLLERQ-------------QNRRTSPAFQHPPGRADRQVPYTDSRTPGVPS 408
           TVSEAH+RA+  E+Q             ++R T  A          +   T++ T     
Sbjct: 62  TVSEAHRRAVAFEQQFKSSVTGWNSGFSRSRMTGTATSEGSHGQAHKKDTTEATTSNTLP 121

Query: 407 VQPRAXXXXXXXXXPHSGSR---QNSGRPGA--CFSCGDSGHKQSACPKFLGARNFLVDE 243
           V              +SG+    + S +P A  CF+CG+ GH Q+ACPK    R    DE
Sbjct: 122 VA-------------NSGTEPTLRRSSQPNALRCFACGEPGHLQTACPKQT-RRGLFGDE 167

Query: 242 LDSSEFSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCT 63
               +       +     + PE+   GD   SL+LR  CL P V + P  R NIF+STCT
Sbjct: 168 TKWDKDDAADDNEDEFDSEVPEDHHHGDTSPSLMLRHVCLAPVVLEEPWLRTNIFQSTCT 227

Query: 62  VNGKVCRFIIDSGSSENVVA 3
           + GKVCRF++DSGS  NV+A
Sbjct: 228 IKGKVCRFVVDSGSCRNVIA 247


>ref|XP_006293129.1| hypothetical protein CARUB_v10019435mg [Capsella rubella]
            gi|482561836|gb|EOA26027.1| hypothetical protein
            CARUB_v10019435mg [Capsella rubella]
          Length = 595

 Score =  149 bits (376), Expect = 9e-34
 Identities = 98/251 (39%), Positives = 133/251 (52%), Gaps = 9/251 (3%)
 Frame = -2

Query: 728  VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
            +Y KLQNLRQGSRTV+ Y+ +F++++AR  + E +DQLVSR+IGGLR   Q  L  F+P 
Sbjct: 326  LYNKLQNLRQGSRTVEDYATDFFEMVARTTLLEAEDQLVSRFIGGLRTQLQLPLQQFNPT 385

Query: 548  TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAXXXXXXXX 369
            +VSEAHQ AL +  Q  +           R   Q     + T    S   R         
Sbjct: 386  SVSEAHQCALPMGVQYRQNWGSTGSR--SRFQSQPQSEIANTSNTESTSTRKIVSKTGAN 443

Query: 368  XPH-SGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEPPVY----- 207
                + SRQ       CFSCG++GH+Q+ACP     R  L  E   +EF++ P +     
Sbjct: 444  VDSIAASRQPRTSALRCFSCGENGHRQTACPN-QTRRGLLAQE---TEFTDEPRFDEYLS 499

Query: 206  DSPSPHDTPEEILIGDIGTS---LILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCRFI 36
            DS   HDT  + + GD G     L+LRR CL PR       R ++F S  T+ GK+C+ I
Sbjct: 500  DSNQEHDT--DCIGGDTGHGSQILVLRRNCLLPRSTKESWLRTSLFRSISTIKGKICKLI 557

Query: 35   IDSGSSENVVA 3
            IDSGS  NV++
Sbjct: 558  IDSGSCTNVIS 568


>ref|XP_006299377.1| hypothetical protein CARUB_v10015536mg [Capsella rubella]
           gi|482568086|gb|EOA32275.1| hypothetical protein
           CARUB_v10015536mg [Capsella rubella]
          Length = 483

 Score =  139 bits (350), Expect = 9e-31
 Identities = 91/248 (36%), Positives = 128/248 (51%), Gaps = 6/248 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +Y  LQNL+Q SR+VD Y+EEFY LL R +V +   QLVS +IGGLR   Q +L  F P 
Sbjct: 157 MYNILQNLKQDSRSVDEYAEEFYVLLTRTEVADSQFQLVSCFIGGLRSQLQSLLAQFDPT 216

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVP--SVQPRAXXXXXX 375
           ++SEAH+RA   E QQ+R  S    + P    R +   +S +   P  S           
Sbjct: 217 SLSEAHRRAASFE-QQHRSAS---WNTPASRPRPIEQHNSTSASQPRDSKDQTKQEPKFG 272

Query: 374 XXXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEPPVYDS-- 201
                +G ++++      FSCG+ GH+Q+A         +  D  D        VYDS  
Sbjct: 273 FREDENGMKRSTRNALKFFSCGEPGHRQNA---------YTGDPQDD-------VYDSTK 316

Query: 200 --PSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCRFIIDS 27
                H      + GD G SL+ R+ C+ P +      R+ IF+STCT++ +VC FIIDS
Sbjct: 317 ELDDDHHKDNHAIFGDKGVSLVSRQTCIAPPLPHDNWLRYKIFKSTCTIHDRVCTFIIDS 376

Query: 26  GSSENVVA 3
           GSS NV++
Sbjct: 377 GSSRNVIS 384


>ref|XP_006300423.1| hypothetical protein CARUB_v10021967mg, partial [Capsella rubella]
           gi|482569133|gb|EOA33321.1| hypothetical protein
           CARUB_v10021967mg, partial [Capsella rubella]
          Length = 454

 Score =  134 bits (338), Expect = 2e-29
 Identities = 88/245 (35%), Positives = 121/245 (49%), Gaps = 3/245 (1%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +Y KLQNL+QGSR+VD Y +EFY L+ R D+ +   QLVSR+IG LR+  Q+ ++ F P 
Sbjct: 179 IYNKLQNLKQGSRSVDEYVKEFYLLVTRNDIFDSPIQLVSRFIGVLRVQLQNAMSQFDPT 238

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAXXXXXXXX 369
           ++SEAH+RA   E Q     SP++  P  +     PY  S T    +++           
Sbjct: 239 SISEAHRRAASFELQFR---SPSWSTPSAKTR---PYNQSTTTTSTAIKELGTANEVTNK 292

Query: 368 XPHSGS-RQNSGRPGA--CFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEPPVYDSP 198
                   + S RP A  C+S G++GH+Q+ CP      N   D  D             
Sbjct: 293 AAREEQPLRRSTRPNALRCYSFGEAGHRQTTCP------NQTQDGRDEDNVE-------- 338

Query: 197 SPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCRFIIDSGSS 18
             H T      GD G  L+ RR C+ P        RHNI  S+C +  +VC FIID GSS
Sbjct: 339 GLHTT------GDTGRLLVARRLCIAPPSRTDSWLRHNIIRSSCIIQDRVCTFIIDLGSS 392

Query: 17  ENVVA 3
            N +A
Sbjct: 393 RNTMA 397


>gb|AAM15062.1| putative retroelement integrase [Arabidopsis thaliana]
          Length = 1215

 Score =  114 bits (284), Expect = 4e-23
 Identities = 79/249 (31%), Positives = 115/249 (46%), Gaps = 7/249 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           ++ KL+NL QG+R+V+ Y +E   L+ R D+ E  +  +SR++G L    QD L     V
Sbjct: 18  LHLKLRNLTQGNRSVEEYYKEMETLMLRADISEDREATLSRFLGDLNRDIQDRLETQYYV 77

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGV---PSVQPRAXXXXX 378
            + E   +A+L E+Q  R++S    +  G   +     + RT      P V PRA     
Sbjct: 78  QIEEMLHKAILFEQQVKRKSSSRSSYGSGTIAKPTYQREERTSSYHNKPIVSPRAESKPY 137

Query: 377 XXXXPHSGSRQNSG---RPGACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFS-EPPV 210
                H G  + S    R   C+ C   GH  + CP        ++  LD+ E   E  +
Sbjct: 138 AAVQDHKGKAEISTSRVRDVRCYKCQGKGHYANECP-----NKRVMILLDNGEIEPEEEI 192

Query: 209 YDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCRFIID 30
            DSPS     EE+     G  L+ RR        D  +QR N+F + C V+GKVC  IID
Sbjct: 193 PDSPSSLKENEELPAQ--GELLVARRTLSVQTKTDEQEQRKNLFHTRCHVHGKVCSLIID 250

Query: 29  SGSSENVVA 3
            GS  NV +
Sbjct: 251 GGSCTNVAS 259


>ref|XP_004140476.1| PREDICTED: uncharacterized protein LOC101221994 [Cucumis sativus]
          Length = 1544

 Score =  108 bits (270), Expect = 2e-21
 Identities = 80/254 (31%), Positives = 125/254 (49%), Gaps = 12/254 (4%)
 Frame = -2

Query: 728  VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
            +Y + QN RQG RTV  Y EEF++L AR ++ E +   V+R++GGLR   ++ + L    
Sbjct: 333  LYNQYQNCRQGVRTVAEYIEEFHRLSARTNLSENEQHQVARFVGGLRFDIKEKVRLQPFR 392

Query: 548  TVSEAHQRALLLE-------RQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAX 390
             +SEA   A  +E       +  NRR++        + + Q P T ++  G   +  +  
Sbjct: 393  FLSEAISFAETVEEMIAIRSKNLNRRSAWETNSTKSKTNDQ-PSTSTKAKG-KEIDNQEV 450

Query: 389  XXXXXXXXPHSGSRQNS---GRPGACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSE 219
                        S QNS      G CF CG +GH  + CP+    R  +    +  + SE
Sbjct: 451  AVERKKEQTFKPSGQNSYSRSSLGKCFRCGQTGHLSNNCPQ----RKTIAIAEEGGQTSE 506

Query: 218  PPVYDSPSPHDTPEEILIGDIG--TSLILRRACLTPRVNDSPDQRHNIFESTCTVNGKVC 45
              +       +   E++  D G   S +++R  +TP+  +   QRH +F++ CT+NG+VC
Sbjct: 507  DSI-----EAEEETELIEADDGERVSCVIQRLLITPK-EEKNLQRHCLFKTRCTINGRVC 560

Query: 44   RFIIDSGSSENVVA 3
              IIDSGSSEN VA
Sbjct: 561  DVIIDSGSSENFVA 574


>ref|XP_004161393.1| PREDICTED: uncharacterized protein LOC101232776 [Cucumis sativus]
          Length = 282

 Score =  108 bits (269), Expect = 2e-21
 Identities = 77/253 (30%), Positives = 121/253 (47%), Gaps = 11/253 (4%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +Y + QN RQG RTV  Y EEF++L AR ++ E +    +R++GGLR + ++ + L    
Sbjct: 18  LYNQYQNCRQGVRTVAEYIEEFHRLSARTNLSENEQHQAARFVGGLRFNIKEKVRLQPFR 77

Query: 548 TVSEAHQRALLLE-------RQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAX 390
            +SEA   A  +E       +  NRR++        + + Q P T ++  G         
Sbjct: 78  FLSEAISFAETVEEMIAIRSKNLNRRSAWETNSTKSKTNDQ-PSTSTKAKGKEIDNQEVA 136

Query: 389 XXXXXXXXPHSGSRQNSGRP--GACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEP 216
                        + N  RP  G CF CG +GH  + CP+    R  +       + SE 
Sbjct: 137 VERKKEQTFKPSGQNNYSRPSLGKCFRCGQTGHLSNNCPQ----RRTIATAEGGGQTSED 192

Query: 215 PVYDSPSPHDTPEEILIGDIG--TSLILRRACLTPRVNDSPDQRHNIFESTCTVNGKVCR 42
            +       +   E++  D G   S +++R  +TP+  +   QRH +F++ CT+NG+VC 
Sbjct: 193 SI-----EAEEETELIEADDGERVSCVIQRLLITPK-EEKNLQRHCLFKTRCTINGRVCD 246

Query: 41  FIIDSGSSENVVA 3
            IIDS SSEN VA
Sbjct: 247 VIIDSSSSENFVA 259


>ref|XP_004134253.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101214124 [Cucumis sativus]
          Length = 586

 Score =  107 bits (267), Expect = 4e-21
 Identities = 79/264 (29%), Positives = 125/264 (47%), Gaps = 21/264 (7%)
 Frame = -2

Query: 731 LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
           L+Y + Q   QGSR++  Y+EEFY+L AR ++ E + Q +SR+I GLR   +D+++L   
Sbjct: 182 LLYNQYQQCHQGSRSIMDYTEEFYRLGARNNLLETEHQQISRFIHGLRDEIKDIVHLHPL 241

Query: 551 VTVSEAHQRALLLE------------RQQN----RRTSPA-----FQHPPGRADRQVPYT 435
             +S+A   A  +E            R+ N    +RT+       FQ        Q+   
Sbjct: 242 TFLSDAISLASKIEDSEEIKKTKNSQRKNNWDKQQRTNLTNSFRNFQQGSSSTTSQLAKK 301

Query: 434 DSRTPGVPSVQPRAXXXXXXXXXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNF 255
           D  +  +P+ +             +     N    G CF CG  GH  + CP+    R  
Sbjct: 302 DENSSKIPATKQGENNTMKKVDNIY-----NRPTLGKCFRCGQQGHLSNECPQ----RRT 352

Query: 254 LVDELDSSEFSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFE 75
           L  E    +     +++  +P +       GD   S +++R   TP     P QR+++F 
Sbjct: 353 LTIEEGQEDNDSDDIFEISTPDE-------GD-QLSCVIQRILFTPTAGQIP-QRNSLFR 403

Query: 74  STCTVNGKVCRFIIDSGSSENVVA 3
           + CT+NGKVC+ IIDSGSSEN+V+
Sbjct: 404 TRCTINGKVCQVIIDSGSSENLVS 427


>gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao]
          Length = 794

 Score =  106 bits (265), Expect = 7e-21
 Identities = 81/247 (32%), Positives = 120/247 (48%), Gaps = 5/247 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           ++ K  NLRQ + TV+ Y+ EF +L  + DV E ++Q V+RY+GGL +   D++ L    
Sbjct: 169 IFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVGIADVVQLQPYW 228

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSR-TPGVPSVQPRAXXXXXXX 372
            +++  + AL +E+QQ R++S +       + RQ   T +R      ++ P         
Sbjct: 229 NLNDVIRLALKVEKQQLRKSSMS-------SSRQKDSTSNRGRQSSATIPPPKVNSSKTI 281

Query: 371 XXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNF--LVDELDSSEFSEPPVYDSP 198
               + S +       CF C   GH  S CP     R    L++E    E S   V D  
Sbjct: 282 NHKETTSTRAPNVNKKCFKCQGFGHIASDCPN----RRIISLIEEEVMEEPSLEEVDDEL 337

Query: 197 SPHDTPE-EILIGDIGTSLILRRACLTPRV-NDSPDQRHNIFESTCTVNGKVCRFIIDSG 24
              +  E E +  D G +L++RR   T  +  D    RHNIF + CT  GKVC  IIDSG
Sbjct: 338 EIFNNEEIEEVSADHGEALVVRRNLNTAMLTEDESWLRHNIFHTRCTSQGKVCNVIIDSG 397

Query: 23  SSENVVA 3
           S ENV+A
Sbjct: 398 SCENVIA 404


>gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
          Length = 505

 Score =  106 bits (264), Expect = 9e-21
 Identities = 79/247 (31%), Positives = 125/247 (50%), Gaps = 5/247 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           ++ K  NLRQ + TV+ Y+ EF +L  + DV E ++Q V+RY+GGL +   D++ L    
Sbjct: 18  IFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVEIADVVQLQPYW 77

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTP-GVPSVQPRAXXXXXXX 372
            +++  + AL +E+Q++R+ S +      R    +   +S++   +P  +  +       
Sbjct: 78  NLNDVIRLALKVEKQRSRKRSMS----SSRQQESISNDESQSSVTIPPPKVNSSKTASSN 133

Query: 371 XXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNF--LVDELDSSEFSE-PPVYDS 201
               + +R ++     CF C   GH    CP     R    LV+E D + + +  PVYD 
Sbjct: 134 DKETTFTRASNVNK-KCFKCQGFGHIAFDCPN----RRIISLVEEEDYANWEKLEPVYDE 188

Query: 200 PSPHDTPEEILIGDIGTSLILRRACLTPRV-NDSPDQRHNIFESTCTVNGKVCRFIIDSG 24
               +  E  +  D G +LI+RR   T  +  D    RHNIF + CT  GKVC  IIDSG
Sbjct: 189 YDDEEIEE--VSADHGEALIVRRNLNTAMMTKDESWLRHNIFYTRCTSQGKVCNVIIDSG 246

Query: 23  SSENVVA 3
           S ENV+A
Sbjct: 247 SCENVIA 253


>gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
          Length = 625

 Score =  103 bits (256), Expect = 8e-20
 Identities = 78/247 (31%), Positives = 122/247 (49%), Gaps = 5/247 (2%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           ++ K  NLRQ + TV+ Y+ EF +L  + DV E ++Q ++RY+GGL +   D++ L    
Sbjct: 138 IFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTLARYLGGLNVEIADVVQLQPYW 197

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTP-GVPSVQPRAXXXXXXX 372
            +++  +  L +E+QQ+R+ S +      R    +   +S++   +P  +  +       
Sbjct: 198 NLNDVIRLTLKVEKQQSRKRSMS----SSRQQESISNDESQSSVTIPPPKVNSSKTASSN 253

Query: 371 XXPHSGSRQNSGRPGACFSCGDSGHKQSACPKFLGARNF--LVDELDSSEFSE-PPVYDS 201
               + +R ++     CF C   GH  S CP    +R    LV+E D   + +  PVYD 
Sbjct: 254 DKETTFTRASNVNK-KCFKCQRFGHIASDCP----SRRIISLVEEEDYVNWEKLEPVYDE 308

Query: 200 PSPHDTPEEILIGDIGTSLILRRACLTP-RVNDSPDQRHNIFESTCTVNGKVCRFIIDSG 24
               +  E  +  D G + I+RR   T     D    RHNIF + CT  G VC  IIDSG
Sbjct: 309 YDDEEIEE--VSADHGEAFIVRRNLNTALMTKDESCLRHNIFYTRCTSQGNVCNVIIDSG 366

Query: 23  SSENVVA 3
           S ENVVA
Sbjct: 367 SCENVVA 373


>gb|EMJ01412.1| hypothetical protein PRUPE_ppa015697mg [Prunus persica]
          Length = 983

 Score =  100 bits (248), Expect = 6e-19
 Identities = 77/256 (30%), Positives = 115/256 (44%), Gaps = 14/256 (5%)
 Frame = -2

Query: 728 VYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPV 549
           +Y++  NL+Q   +V  Y+ EF  L  RV + E ++ + SRY+ GL  + +D L +    
Sbjct: 3   LYERFYNLKQRDMSVQEYTSEFDNLSLRVGLNETNEHMTSRYLSGLNQTIRDELGVVRLS 62

Query: 548 TVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAXXXXXXXX 369
            + +A Q AL+++RQQ RR    F    GR D       +   GV S Q           
Sbjct: 63  NLEDARQYALMVKRQQLRRGGRRFVF--GRTDNYWQRNTTTVHGVRSKQ----------- 109

Query: 368 XPHSGSRQNSG--RPGACFSCGDSGHKQSACPKFL------GARNFLVDELDSSEFSEPP 213
              +G R   G  R          G + +A P  L        R +  DE   + +    
Sbjct: 110 GARTGGRNMVGVDRSEKWKEIVKFGSQNTAVPSNLRGDSTSQVRCYTCDEKGHTSYVRSE 169

Query: 212 VYDSPSP----HDTPEEI--LIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNGK 51
           V D P P        EE+  L+   G SL++RR   TP+V +   + HNIF +     GK
Sbjct: 170 VTDFPEPTYDDFGNEEEVINLLPVEGESLVVRRVMTTPKVEEEDWRHHNIFRTRVLCGGK 229

Query: 50  VCRFIIDSGSSENVVA 3
           VC  I+D GSSEN+++
Sbjct: 230 VCNVILDGGSSENIIS 245


>ref|XP_004295592.1| PREDICTED: uncharacterized protein LOC101291324 [Fragaria vesca
            subsp. vesca]
          Length = 2122

 Score = 99.8 bits (247), Expect = 8e-19
 Identities = 80/263 (30%), Positives = 120/263 (45%), Gaps = 20/263 (7%)
 Frame = -2

Query: 731  LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
            ++Y+   +  QG++TV  Y+ EF +L  R D+ E + Q V+RYI  LR S Q+ + L + 
Sbjct: 904  ILYRMYLDCVQGAKTVTEYTAEFVRLSERNDLGESEGQKVARYISRLRPSIQEKIRLQTM 963

Query: 551  VTVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSR-TPGVPSVQPRAXXXXXX 375
              V+EA   A+  E  + +    +FQ P   + R      S    G    Q         
Sbjct: 964  WYVTEAASLAIKAELME-KSPRVSFQFPRFTSQRSTEVRSSMGDQGKTVSQNTGGMATRA 1022

Query: 374  XXXPHSGSRQNSGR-------------PGACFSCGDSGHKQSAC---PKFLGARNFLVDE 243
                 S SR                  PG C+ C   GH+ + C   PK + A   LV+ 
Sbjct: 1023 FGAVGSTSRATRAAPVQRPFNPYARPFPGTCYKCLQPGHRSNECTAPPKVVNAVQALVEA 1082

Query: 242  LDSSEFSE-PPVYDSP--SPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFES 72
             +  E  E    Y+    +  D+PE++       +++L+R  L+P+  D   QR NIF S
Sbjct: 1083 CEEDETEEGGDDYEGAEFAVEDSPEKV-------NIVLQRILLSPKEEDG--QRRNIFRS 1133

Query: 71   TCTVNGKVCRFIIDSGSSENVVA 3
             C+VN KVC  I+D+GS EN VA
Sbjct: 1134 YCSVNNKVCNMIVDNGSCENFVA 1156


>ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306407 [Fragaria vesca
           subsp. vesca]
          Length = 1300

 Score = 99.4 bits (246), Expect = 1e-18
 Identities = 74/232 (31%), Positives = 106/232 (45%), Gaps = 7/232 (3%)
 Frame = -2

Query: 725 YQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSPVT 546
           + KL N+RQGSRTVD +++EF  L  R  + E ++Q V+RY+ GLR    D++ L    +
Sbjct: 306 FLKLHNIRQGSRTVDDFTKEFDLLTMRCGLAEEEEQTVARYLAGLRREIHDVVVLQPCWS 365

Query: 545 VSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPRAXXXXXXXXX 366
            SE +Q A+ +E+Q   R    ++              S TP +  +             
Sbjct: 366 YSEVYQLAIQVEKQLQSR----YKRGASEDYEAKKIASSSTPKITPMLDANIREPLKNQA 421

Query: 365 PHSGS--RQNSGRPGACFSCGDSGHKQSACPKFLGARNFLVDELDSSEFSEPPVYDSPSP 192
            H       N G+   CF C   GH  S CP        LV+EL   E S   + D P+ 
Sbjct: 422 EHKAEARESNKGKNVKCFKCSGLGHIASDCPNRRVVN--LVEEL--GESSSAGLDDMPTS 477

Query: 191 HD----TPEEILIGDIGTSLILRRACLTPRVNDSPD-QRHNIFESTCTVNGK 51
            D      EEI   D G SL++R+     +V D  +  +HNIF + CT NGK
Sbjct: 478 DDYGDQDEEEITWSDHGESLVIRQTMSASKVEDDSEWLKHNIFHTKCTSNGK 529


>gb|EMJ21583.1| hypothetical protein PRUPE_ppa021778mg [Prunus persica]
          Length = 1384

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 76/264 (28%), Positives = 119/264 (45%), Gaps = 21/264 (7%)
 Frame = -2

Query: 731 LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
           ++Y+      QG+R+V  Y+EEF +L  R  + E D+Q V+RY  GL+IS Q+ + + + 
Sbjct: 198 ILYRLYLGCAQGTRSVSEYTEEFMRLAERNHLTETDNQKVARYNNGLKISIQEKIGMQNI 257

Query: 551 VTVSEAHQRAL---LLERQQN-----RRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPR 396
            T+ EA   AL   LLE+++      R T+ A  +  G +        ++      +   
Sbjct: 258 WTLQEAINMALKAELLEKEKRQPNFRRNTTEASDYTAGASSGAGDKGKAQQQNSGGMTKP 317

Query: 395 AXXXXXXXXXPHSGSRQNSGRP-------------GACFSCGDSGHKQSACPKFLGARNF 255
           A           S    N G+P               C+ C   GH+ + CP+   A NF
Sbjct: 318 ATVGQNKNFNEGSSRNYNRGQPRNQSQNPYAKPMTDICYRCQKPGHRSNVCPERKQA-NF 376

Query: 254 LVDELDSSEFSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFE 75
           + +  +  E  E    D        EE   G    +L+L+R  L P+      QRH+IF 
Sbjct: 377 IEEADEDEENDEVGENDYAGAEFAVEE---GMEKITLVLQRVLLAPK---EEGQRHSIFR 430

Query: 74  STCTVNGKVCRFIIDSGSSENVVA 3
           S C++  KVC  I+D+GS EN V+
Sbjct: 431 SLCSIKNKVCDVIVDNGSCENFVS 454


>gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 75/264 (28%), Positives = 112/264 (42%), Gaps = 21/264 (7%)
 Frame = -2

Query: 731 LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
           ++Y+      QG+R+V  Y+EEF +L  R  + E D+Q V+RY  GL+ S Q+ + + + 
Sbjct: 205 ILYRMYLGCAQGTRSVSEYTEEFMRLAERNHLTETDNQKVARYNNGLKSSIQEKIGMQNI 264

Query: 551 VTVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPSVQPR-------- 396
            T+ EA   AL  E  +  +  P F+     A        S        Q +        
Sbjct: 265 WTLQEAINMALKAELLEKEKRQPNFRRNKTEASDYTAGASSGAGDKEKAQQQNSGGMTKP 324

Query: 395 AXXXXXXXXXPHSGSRQNSGRP-------------GACFSCGDSGHKQSACPKFLGARNF 255
           A           S    N G+P               C+ C   GH+ + CP+   A NF
Sbjct: 325 ATVGQNKNFNEGSSRNYNRGQPRNQSQNPYAKPMTDICYRCQKPGHRSNVCPERKQA-NF 383

Query: 254 LVDELDSSEFSEPPVYDSPSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFE 75
           + +  +  E  E    D        EE   G    +L+L+R  L P+      QRHNIF 
Sbjct: 384 IEEADEDEEKDEVGENDYAGAEFAVEE---GIEKITLVLQRVLLAPK---EEGQRHNIFR 437

Query: 74  STCTVNGKVCRFIIDSGSSENVVA 3
           S C++  KVC  I+D+GS EN V+
Sbjct: 438 SLCSIKNKVCDVIVDNGSCENFVS 461


>gb|EMS54598.1| Transposon Ty3-G Gag-Pol polyprotein [Triticum urartu]
          Length = 1704

 Score = 97.4 bits (241), Expect = 4e-18
 Identities = 72/230 (31%), Positives = 108/230 (46%), Gaps = 3/230 (1%)
 Frame = -2

Query: 731 LVYQKLQNLRQGSRTVDAYSEEFYKLLARVDVREIDDQLVSRYIGGLRISFQDMLNLFSP 552
           +++ + QN  QG+RTV  Y+EEF +L  R ++ E ++Q V+RYI GL  + QD L +   
Sbjct: 205 ILFIQFQNCAQGNRTVSDYTEEFLRLQVRCNLAETEEQQVARYINGLNDAIQDRLMMQQI 264

Query: 551 VTVSEAHQRALLLERQQNRRTSPAFQHPPGRADRQVPYTDSRTPGVPS-VQPRAXXXXXX 375
            +V +A   AL  ER    R +    +PP R      +T+  +   P+ V+ +A      
Sbjct: 265 WSVDQAQALALKAERFVRMRKTTKAPYPPYR------HTEGSSRSQPNRVEEKATPPKTK 318

Query: 374 XXXPHSGSRQNSGRPG-ACFSCGDSGHKQSACPKFLGARNFLVD-ELDSSEFSEPPVYDS 201
              P     +     G  C+ CG  GH  S CP        + D E D  E+    V   
Sbjct: 319 QPIPKQTRGKGKANEGPKCYKCGKEGHISSGCPLRKFVNTTIHDGESDEEEYKSKDVDGQ 378

Query: 200 PSPHDTPEEILIGDIGTSLILRRACLTPRVNDSPDQRHNIFESTCTVNGK 51
               +  EE++       +I R  C TP+++D+  QR  IFE  CTVNGK
Sbjct: 379 EVCQEEGEEVV------CVIQRLLCSTPQLDDT--QRKKIFERKCTVNGK 420

