BLASTX nr result

ID: Akebia25_contig00030146 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00030146
         (819 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prun...    94   9e-35
ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prun...    98   2e-34
ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prun...    94   2e-33
emb|CAN70290.1| hypothetical protein VITISV_019345 [Vitis vinifera]    99   2e-30
ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223...   100   3e-30
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...   110   1e-29
ref|XP_006385239.1| hypothetical protein POPTR_0003s02020g [Popu...    87   1e-29
ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr...    94   1e-29
ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...   108   4e-29
ref|XP_007200213.1| hypothetical protein PRUPE_ppa015697mg [Prun...    98   4e-28
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...   100   3e-27
ref|XP_007207970.1| hypothetical protein PRUPE_ppa016339mg, part...    81   4e-27
ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [The...    95   2e-26
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...    96   3e-26
ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ...    98   1e-25
ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom...   100   1e-25
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                  92   3e-25
ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The...    90   9e-25
ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom...    94   1e-24
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                  92   1e-24

>ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica]
           gi|462402874|gb|EMJ08431.1| hypothetical protein
           PRUPE_ppa026856mg [Prunus persica]
          Length = 1493

 Score = 94.0 bits (232), Expect(2) = 9e-35
 Identities = 40/80 (50%), Positives = 56/80 (70%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E H + Y++GW+KK    +V E  ++  SIGK Y+D V  D++DMDACH+LLGRPWQ+DV
Sbjct: 473 EPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDVIDMDACHILLGRPWQFDV 532

Query: 181 DSTHKGKSNTFMFYKDGIKI 122
           D+T KG+ N  +F  +  KI
Sbjct: 533 DATFKGRDNVILFSWNNRKI 552



 Score = 80.5 bits (197), Expect(2) = 9e-35
 Identities = 52/154 (33%), Positives = 73/154 (47%), Gaps = 5/154 (3%)
 Frame = -1

Query: 819 GNTQPNVTN*DSGSSQAKSIAKANTNRGS----SSNPYAHLQLNKCYHCNQPGHRSNKCL 652
           G T+P     +   ++  S    N NRG     S NPYA    + CY C +PGHRSN C 
Sbjct: 320 GMTKPATVGQNKNFNEGSS---RNYNRGQPRNQSQNPYAKPMTDICYRCQKPGHRSNVCP 376

Query: 651 *HPMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVC-RLMCIPRQKIDPQKHN 475
               AN    A                         E + LV  R++  P++  + Q+HN
Sbjct: 377 ERKQANFIEEADEDEEKDEVGENDYAGAEFAVEEGIEKITLVLQRVLLAPKE--EGQRHN 434

Query: 474 IFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           IFR+ C+I  ++CD+I+D GS EN VSK +V+ L
Sbjct: 435 IFRSLCSIKNKVCDVIVDNGSCENFVSKKLVEYL 468


>ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica]
           gi|462405925|gb|EMJ11389.1| hypothetical protein
           PRUPE_ppa017790mg [Prunus persica]
          Length = 1485

 Score = 97.8 bits (242), Expect(2) = 2e-34
 Identities = 48/116 (41%), Positives = 67/116 (57%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E H + Y++GW+KK    +V E  ++  SIGK Y+D V  D++DMDACH+LLGRPWQ+DV
Sbjct: 462 EPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDVIDMDACHILLGRPWQFDV 521

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVPMKGENCPKITKVEG*SFLIVSDILQESKETGK 14
           D+T KG+ N  +F  +  KI +   +        K    SFL +    QE  E  K
Sbjct: 522 DATFKGRDNVILFSWNNRKIAMTTTQPSKPSVEVKTRSSSFLTLISNEQELNEAVK 577



 Score = 75.5 bits (184), Expect(2) = 2e-34
 Identities = 51/154 (33%), Positives = 72/154 (46%), Gaps = 5/154 (3%)
 Frame = -1

Query: 819 GNTQPNVTN*DSGSSQAKSIAKANTNRGS----SSNPYAHLQLNKCYHCNQPGHRSNKCL 652
           G T+P     +   ++  S    N NRG     S N YA    + CY C +PGHRSN C 
Sbjct: 309 GMTKPTTVGQNKNFNEGSS---RNYNRGQPRNQSQNLYAKPMTDICYRCQKPGHRSNVCP 365

Query: 651 *HPMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVC-RLMCIPRQKIDPQKHN 475
               AN    A                         E + LV  R++  PR+  + Q+H+
Sbjct: 366 ELKQANFIEEADEDEENDEVGENDYAGAEFAVEEGMEKITLVLQRVLLAPRE--EGQRHS 423

Query: 474 IFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           IFR+ C+I  ++CD+I+D GS EN VSK +V+ L
Sbjct: 424 IFRSLCSIKNKVCDVIVDNGSCENFVSKKLVEYL 457


>ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica]
           gi|462417202|gb|EMJ21939.1| hypothetical protein
           PRUPE_ppa023598mg [Prunus persica]
          Length = 1457

 Score = 94.0 bits (232), Expect(2) = 2e-33
 Identities = 47/116 (40%), Positives = 65/116 (56%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E H   Y++GW+KK    +V E Y +  SIGK Y D V  D++DMDACH+LLG+ WQ+DV
Sbjct: 437 EPHVRPYSLGWVKKGPSVRVAETYSVPLSIGKHYIDDVLCDVIDMDACHILLGQLWQFDV 496

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVPMKGENCPKITKVEG*SFLIVSDILQESKETGK 14
           D+T+KG+ N  +F  +  KI +   K        K    SFL +    QE  +  K
Sbjct: 497 DATYKGRDNVILFSWNNRKIAMATTKPSKQSVEPKTRSSSFLTLISSEQELNKVVK 552



 Score = 76.3 bits (186), Expect(2) = 2e-33
 Identities = 48/152 (31%), Positives = 74/152 (48%), Gaps = 3/152 (1%)
 Frame = -1

Query: 819 GNTQPNVT--N*DSGSSQAKSIAKANTNRGSSSNPYAHLQLNKCYHCNQPGHRSNKCL*H 646
           G T+P  T  N +   S +++  +  + R  S NPYA  + + CY C +PGHRSN C   
Sbjct: 284 GTTKPATTVQNKNFNESSSRTFNRGQS-RNQSQNPYAKPRTDICYRCQKPGHRSNVCPEW 342

Query: 645 PMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVC-RLMCIPRQKIDPQKHNIF 469
             AN                              E + LV  R++  P++  + Q+H+I 
Sbjct: 343 TQANFIEEVDEDEEKDEVGEDDYAGAEFAIEERMERIILVLQRVLLAPKE--EGQRHSIC 400

Query: 468 RTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           R+ C+I  ++CD+I+D GS EN VSK +V+ L
Sbjct: 401 RSLCSIKNKVCDVIVDNGSCENFVSKKLVEHL 432


>emb|CAN70290.1| hypothetical protein VITISV_019345 [Vitis vinifera]
          Length = 1464

 Score = 99.4 bits (246), Expect(2) = 2e-30
 Identities = 44/87 (50%), Positives = 64/87 (73%)
 Frame = -3

Query: 370  VRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQ 191
            ++ ++HP+SY I WI K  + +V+E  +I  SIGK+YKD +  D++DMDAC++LLGR W 
Sbjct: 782  LKTKEHPSSYKIAWINKGMKVQVLEVCKIPLSIGKYYKDEIVCDVLDMDACYILLGRSWH 841

Query: 190  YDVDSTHKGKSNTFMFYKDGIKIFLVP 110
            YDVD T+KG+ NTF+F+    KI L+P
Sbjct: 842  YDVDVTYKGQDNTFVFWWFDKKIVLMP 868



 Score = 60.8 bits (146), Expect(2) = 2e-30
 Identities = 36/121 (29%), Positives = 55/121 (45%)
 Frame = -1

Query: 735  SSSNPYAHLQLNKCYHCNQPGHRSNKCL*HPMANLTTHAXXXXXXXXXXXXXXXXXXXXX 556
            +S++P+    + + +   +PGH SN C      N                          
Sbjct: 660  ASNDPFPWSIIWRSWAPMRPGHLSNNCPNRQFVNFLEEDGSEEERVLEEDIYEGVEFAEG 719

Query: 555  XXXXEGVFLVCRLMCIPRQKIDPQKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDK 376
                E   +V RL+   ++  D Q+H IFRT CTI  ++C++I+D GSSEN VSK +V  
Sbjct: 720  DVGEEVTCIVQRLLLTLKKSDDSQRHKIFRTQCTIRNKVCNVIIDSGSSENFVSKALVKA 779

Query: 375  L 373
            L
Sbjct: 780  L 780


>ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223713 [Cucumis sativus]
          Length = 645

 Score =  100 bits (249), Expect(2) = 3e-30
 Identities = 43/92 (46%), Positives = 59/92 (64%)
 Frame = -3

Query: 370 VRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQ 191
           ++ E HP SY IGW++K GE  V E   +  SI   YKD +  D+++MD CH+LLGRPWQ
Sbjct: 326 LKAEAHPTSYKIGWVRKEGEATVSEICTVPLSIENAYKDQIVCDVIEMDVCHLLLGRPWQ 385

Query: 190 YDVDSTHKGKSNTFMFYKDGIKIFLVPMKGEN 95
           YD  S HKG+ NT+     G K+ L+P+  +N
Sbjct: 386 YDTQSLHKGRENTYELQLMGRKVVLLPITRKN 417



 Score = 58.5 bits (140), Expect(2) = 3e-30
 Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 8/156 (5%)
 Frame = -1

Query: 816 NTQPNVTN*DSGS---SQAKSIAKANTN--RGSSSNPYAHLQLNKCYHCNQPGHRSNKCL 652
           N QP+ +    G    +Q   + + N    + SS N Y+   L K + C Q  H SN C 
Sbjct: 174 NDQPSTSTKGKGKEVENQEVVVERKNEQAFKTSSQNNYSRPLLGKFFRCGQTEHLSNNC- 232

Query: 651 *HPMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVC---RLMCIPRQKIDPQK 481
                   T A                         +G  + C   R++  P+++   Q+
Sbjct: 233 ----PQRKTIAIAEEGRQMSEDSKGAEDEIELIEADDGERVSCVIQRVLITPKEEKKQQR 288

Query: 480 HNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           H +F+  CTIN R+CD+I+D  SS+N V+K +V  L
Sbjct: 289 HCLFKARCTINGRVCDVIIDNDSSKNFVAKKLVTVL 324


>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
           gi|508704828|gb|EOX96724.1| Gag-pol polyprotein,
           putative [Theobroma cacao]
          Length = 794

 Score =  110 bits (275), Expect(2) = 1e-29
 Identities = 49/95 (51%), Positives = 64/95 (67%)
 Frame = -3

Query: 370 VRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQ 191
           ++ E HP  Y + W++K  E KV +R  + FSIG  Y+D V  D++ MDACH+LLGRPWQ
Sbjct: 413 LQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQ 472

Query: 190 YDVDSTHKGKSNTFMFYKDGIKIFLVPMKGENCPK 86
           YD  + H G  NT+ F KDG KI L P+K E+CPK
Sbjct: 473 YDRRAHHDGYKNTYSFIKDGAKIMLTPLKPEDCPK 507



 Score = 47.0 bits (110), Expect(2) = 1e-29
 Identities = 18/37 (48%), Positives = 27/37 (72%)
 Frame = -1

Query: 483 KHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           +HNIF T CT   ++C++I+D GS EN+++  MV KL
Sbjct: 375 RHNIFHTRCTSQGKVCNVIIDSGSCENVIANYMVKKL 411


>ref|XP_006385239.1| hypothetical protein POPTR_0003s02020g [Populus trichocarpa]
           gi|550342179|gb|ERP63036.1| hypothetical protein
           POPTR_0003s02020g [Populus trichocarpa]
          Length = 567

 Score = 86.7 bits (213), Expect(2) = 1e-29
 Identities = 49/124 (39%), Positives = 66/124 (53%), Gaps = 4/124 (3%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E H   Y +GW+K            +  SIGK YK  +  D++DMDA H+LLGRPWQ+DV
Sbjct: 370 EMHKNPYMLGWVK------------VPLSIGKHYKHEIWCDVIDMDASHVLLGRPWQFDV 417

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVPMKGENCPKITKVEG*SFLIVSDILQE----SKETGK 14
           D+THKG+ N F+F     KI L P+      +  +V   +FL +S    E     KE G 
Sbjct: 418 DATHKGRDNVFIFEWVSHKIALAPVDQSRKLEKPQVGSSNFLAISKNSHEFEDIIKEVGC 477

Query: 13  MYAL 2
           MY +
Sbjct: 478 MYPI 481



 Score = 70.9 bits (172), Expect(2) = 1e-29
 Identities = 45/151 (29%), Positives = 64/151 (42%), Gaps = 14/151 (9%)
 Frame = -1

Query: 783 GSSQAKSIAKANTNRGSSSNPYAHLQLNKCYHCNQPGHRSNKCL*HPMANLTTHAXXXXX 604
           G S   +    N N+  +  PYA    + CY C QPGHRSN C     ANL         
Sbjct: 215 GESSQNNDINQNRNQRPNHGPYARATGDVCYRCFQPGHRSNNCPKRKQANLVEGTEEADD 274

Query: 603 XXXXXXXXXXXXXXXXXXXXEGVFLVCRLMCIPRQKI--------------DPQKHNIFR 466
                               E V L+     I   ++              + Q+++IFR
Sbjct: 275 HSGNYDDDYDGAEFAYEDNNEVVNLMMNRTAIEEDEVLSMVLQRALLSPKQEGQRNHIFR 334

Query: 465 TSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           + C+++ ++C LI+DGGS EN VSK +VD L
Sbjct: 335 SLCSVDNKVCTLIVDGGSCENFVSKKLVDYL 365


>ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum]
           gi|557089351|gb|ESQ30059.1| hypothetical protein
           EUTSA_v10012229mg [Eutrema salsugineum]
          Length = 382

 Score = 94.4 bits (233), Expect(2) = 1e-29
 Identities = 41/84 (48%), Positives = 54/84 (64%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E HPA Y + WI +  + K+  R  +SFSIG FYKD +  DI  MD  H++LGRPWQ+D 
Sbjct: 250 EDHPAPYALAWITEGTDVKITHRALVSFSIGAFYKDTIYCDIAPMDVSHLILGRPWQFDR 309

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVP 110
           D+ H GK NT+ F  +  KI L+P
Sbjct: 310 DTCHNGKKNTYSFVFENRKIVLLP 333



 Score = 62.8 bits (151), Expect(2) = 1e-29
 Identities = 47/160 (29%), Positives = 74/160 (46%), Gaps = 11/160 (6%)
 Frame = -1

Query: 819 GNTQPNVTN*DSGSSQAKS--IAKANTN-RGSSSNPYAHLQLN------KCYHCNQPGHR 667
           GN +P +T  D+ +S   S  ++K+ T  R S++   + L+ +      KCY C +PGHR
Sbjct: 86  GNWRPRLTGTDTENSSHDSPEVSKSQTAPRNSTTLDESTLRRSTRPPALKCYSCGEPGHR 145

Query: 666 SNKCL*HPMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVCRLMCI-PRQKID 490
              C       L                                 L+ R +C+ P    +
Sbjct: 146 QTACPNQQRRGLLLEDTEGVYNSADEEDTGIYEETLTSGDSNAPVLMLRRICLAPVGYEE 205

Query: 489 PQ-KHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           P  + NIFR++CTI  ++C+L++D GSS N+VS+T V KL
Sbjct: 206 PWLRTNIFRSTCTIKGKLCNLVIDSGSSRNVVSETAVKKL 245


>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
           gi|508718388|gb|EOY10285.1| Uncharacterized protein
           TCM_025656 [Theobroma cacao]
          Length = 505

 Score =  108 bits (269), Expect(2) = 4e-29
 Identities = 50/95 (52%), Positives = 63/95 (66%)
 Frame = -3

Query: 370 VRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQ 191
           ++ E HP  Y + W++K  E KV +R  + FSIG  Y+D V  DI+ MDACH+LLGRPWQ
Sbjct: 262 LQTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDIIPMDACHLLLGRPWQ 321

Query: 190 YDVDSTHKGKSNTFMFYKDGIKIFLVPMKGENCPK 86
           YD  + H G  NT+ F KDG KI L P+K EN PK
Sbjct: 322 YDRRAHHDGYKNTYSFIKDGAKIMLTPLKPENRPK 356



 Score = 47.4 bits (111), Expect(2) = 4e-29
 Identities = 18/37 (48%), Positives = 28/37 (75%)
 Frame = -1

Query: 483 KHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           +HNIF T CT   ++C++I+D GS EN+++  MV+KL
Sbjct: 224 RHNIFYTRCTSQGKVCNVIIDSGSCENVIANYMVEKL 260


>ref|XP_007200213.1| hypothetical protein PRUPE_ppa015697mg [Prunus persica]
           gi|462395613|gb|EMJ01412.1| hypothetical protein
           PRUPE_ppa015697mg [Prunus persica]
          Length = 983

 Score = 98.2 bits (243), Expect(2) = 4e-28
 Identities = 56/122 (45%), Positives = 72/122 (59%), Gaps = 1/122 (0%)
 Frame = -3

Query: 364 IEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYD 185
           IEKHP  Y + W +K  E      YQ+           V  D+V MDACH+LLGRPW +D
Sbjct: 256 IEKHPNPYKVAWFRKGNEV-----YQLLQ---------VWCDVVPMDACHILLGRPWSFD 301

Query: 184 VDSTHKGKSNTFMFYKDGIKIFLVPMKG-ENCPKITKVEG*SFLIVSDILQESKETGKMY 8
            D  H  K+NT++F++DG K+ L P+K  +N PK+TKV    FL   +  QESKE G MY
Sbjct: 302 KDMIHYTKANTYVFHQDGKKLSLQPLKEVKNTPKVTKVS--RFLTCHNFEQESKEMGIMY 359

Query: 7   AL 2
           AL
Sbjct: 360 AL 361



 Score = 53.9 bits (128), Expect(2) = 4e-28
 Identities = 27/54 (50%), Positives = 39/54 (72%), Gaps = 1/54 (1%)
 Frame = -1

Query: 531 LVCRLMCIPR-QKIDPQKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           +V R+M  P+ ++ D + HNIFRT      ++C++ILDGGSSENI+SK  V+KL
Sbjct: 199 VVRRVMTTPKVEEEDWRHHNIFRTRVLCGGKVCNVILDGGSSENIISKEAVEKL 252


>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
           gi|508726763|gb|EOY18660.1| Uncharacterized protein
           TCM_043155 [Theobroma cacao]
          Length = 625

 Score = 99.8 bits (247), Expect(2) = 3e-27
 Identities = 46/92 (50%), Positives = 60/92 (65%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E HP  Y + W++K  E KV +R  I F I   Y+D V  D++ MDACH+LLGRPWQYD 
Sbjct: 385 EVHPHPYKLQWLRKGNEVKVTKRCCIQFFIRNKYEDEVWCDVIPMDACHLLLGRPWQYDR 444

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVPMKGENCPK 86
            + + G  NT+ F KDG+KI L P+K E+ PK
Sbjct: 445 RAHYDGYKNTYSFIKDGVKIMLTPLKPEDRPK 476



 Score = 49.3 bits (116), Expect(2) = 3e-27
 Identities = 41/156 (26%), Positives = 66/156 (42%), Gaps = 10/156 (6%)
 Frame = -1

Query: 810 QPNVTN*DSGSSQAKSIAKANTNRGSSSNPY------AHLQLNKCYHCNQPGHRSNKCL* 649
           Q +++N +S SS      K N+++ +SSN        A     KC+ C + GH ++ C  
Sbjct: 225 QESISNDESQSSVTIPPPKVNSSKTASSNDKETTFTRASNVNKKCFKCQRFGHIASDCPS 284

Query: 648 HPMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEG--VFLVCRLMCIPRQKIDPQ--K 481
             + +L                             +    F+V R +       D    +
Sbjct: 285 RRIISLVEEEDYVNWEKLEPVYDEYDDEEIEEVSADHGEAFIVRRNLNTALMTKDESCLR 344

Query: 480 HNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           HNIF T CT    +C++I+D GS EN+V+  MV+KL
Sbjct: 345 HNIFYTRCTSQGNVCNVIIDSGSCENVVANYMVEKL 380


>ref|XP_007207970.1| hypothetical protein PRUPE_ppa016339mg, partial [Prunus persica]
           gi|462403612|gb|EMJ09169.1| hypothetical protein
           PRUPE_ppa016339mg, partial [Prunus persica]
          Length = 566

 Score = 81.3 bits (199), Expect(2) = 4e-27
 Identities = 51/152 (33%), Positives = 74/152 (48%), Gaps = 3/152 (1%)
 Frame = -1

Query: 819 GNTQPNVT--N*DSGSSQAKSIAKANTNRGSSSNPYAHLQLNKCYHCNQPGHRSNKCL*H 646
           G T+P  T  N +   S ++S  +  + R  S NPYA    + CY C +PGHRSN C   
Sbjct: 225 GTTKPATTVQNKNFNESSSRSFNRGQS-RNQSQNPYAKPMTDICYRCQKPGHRSNVCPER 283

Query: 645 PMANLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVC-RLMCIPRQKIDPQKHNIF 469
             AN                              E + LV  R++  P++  + Q+H+IF
Sbjct: 284 KQANFIEEVDEDEEKDEVGEDDYAWAEFAIEEGMERITLVLQRVLLAPKE--EGQRHSIF 341

Query: 468 RTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           R+ C+I  ++CD+I+D GS EN VSK +VD L
Sbjct: 342 RSLCSIKNKVCDVIVDNGSCENFVSKKLVDYL 373



 Score = 67.4 bits (163), Expect(2) = 4e-27
 Identities = 40/117 (34%), Positives = 55/117 (47%)
 Frame = -3

Query: 373 IVRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPW 194
           ++ IE H   Y++GW+KK     V E Y++  SI K Y D V             L RPW
Sbjct: 373 LLSIEPHVRPYSLGWVKKGPSVCVAETYRVPLSISKHYSDDV-------------LWRPW 419

Query: 193 QYDVDSTHKGKSNTFMFYKDGIKIFLVPMKGENCPKITKVEG*SFLIVSDILQESKE 23
           Q+DVD+T+KG+ N  +F  +  KI +   K        K    SFL +    QE  E
Sbjct: 420 QFDVDATYKGRDNVILFSWNNQKITMATTKPSKQSVEPKTRSSSFLTLISSEQELNE 476


>ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508716479|gb|EOY08376.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 558

 Score = 94.7 bits (234), Expect(2) = 2e-26
 Identities = 52/123 (42%), Positives = 67/123 (54%), Gaps = 4/123 (3%)
 Frame = -3

Query: 358 KHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDVD 179
           KHP  Y IGW+KK  E  V  +  + F++G    D    D+V MD  H+L+GRPW YD D
Sbjct: 369 KHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNLDDEALCDVVPMDVGHILVGRPWLYDHD 428

Query: 178 STHKGKSNTFMFYKDGIKIFLVPMKGEN----CPKITKVEG*SFLIVSDILQESKETGKM 11
             HK K NT+ FYKD  +  L P+K E       KI+K+ G  +L   +   E  E G M
Sbjct: 429 MVHKTKPNTYSFYKDNKRYTLYPLKEETKKSANSKISKITG--YLSAENFEAEGSEMGIM 486

Query: 10  YAL 2
           YAL
Sbjct: 487 YAL 489



 Score = 52.0 bits (123), Expect(2) = 2e-26
 Identities = 35/126 (27%), Positives = 53/126 (42%), Gaps = 2/126 (1%)
 Frame = -1

Query: 744 NRGSSSNPYAHLQLNKCYHCNQPGHRSNKCL*HPMANLTTHAXXXXXXXXXXXXXXXXXX 565
           N GSS+N        +C+ C + GH S  C   P   +                      
Sbjct: 241 NSGSSTNKGGSNSHIRCFTCGEKGHTSFAC---PQRRVNLAELGEELEPVYDEYEEEVEE 297

Query: 564 XXXXXXXEGVFLVCRLMC--IPRQKIDPQKHNIFRTSCTINQRICDLILDGGSSENIVSK 391
                      +V R+M   +  +  D ++ +IFRT      ++CDL++DGGS ENI+SK
Sbjct: 298 IDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISK 357

Query: 390 TMVDKL 373
             V+KL
Sbjct: 358 EAVNKL 363


>ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca
           subsp. vesca]
          Length = 1034

 Score = 96.3 bits (238), Expect(2) = 3e-26
 Identities = 52/115 (45%), Positives = 74/115 (64%)
 Frame = -3

Query: 358 KHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDVD 179
           KH A Y IGWIKK  E ++ E  ++S SIGKFY+D V  D+VDMDA H+LLG+PWQ+DV+
Sbjct: 485 KHRAPYAIGWIKKGLEVRITETCKVSISIGKFYQDEVECDVVDMDASHVLLGKPWQHDVN 544

Query: 178 STHKGKSNTFMFYKDGIKIFLVPMKGENCPKITKVEG*SFLIVSDILQESKETGK 14
           + H G+ NT  F  +   I L P K +    ++  E  +FLIV++  ++ +E  K
Sbjct: 545 TIHNGRENTVSFIWEKHHITLKP-KTKPTNLVSPKES-NFLIVAEPCEKVEELVK 597



 Score = 49.7 bits (117), Expect(2) = 3e-26
 Identities = 23/48 (47%), Positives = 35/48 (72%)
 Frame = -1

Query: 522 RLMCIPRQKIDPQKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVD 379
           RL+C  +Q  + Q+H+IFR++CTI ++   LI+D GS EN VSK +V+
Sbjct: 432 RLLCSTKQ--ENQRHSIFRSTCTIKEKPMSLIIDSGSCENFVSKKVVE 477


>ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao]
           gi|508712364|gb|EOY04261.1| Uncharacterized protein
           TCM_019516, partial [Theobroma cacao]
          Length = 215

 Score = 97.8 bits (242), Expect(2) = 1e-25
 Identities = 45/95 (47%), Positives = 60/95 (63%)
 Frame = -3

Query: 370 VRIEKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQ 191
           ++ E  P  Y + W++K  E KV +   + FSIG  Y+D V  D++ MDAC +LLGRPWQ
Sbjct: 89  LQTEVLPHPYKLQWLRKGNEVKVTKHCCVQFSIGNKYEDEVWCDVIPMDACQLLLGRPWQ 148

Query: 190 YDVDSTHKGKSNTFMFYKDGIKIFLVPMKGENCPK 86
           YD  + H G  NT+ F KDG KI L P+K E+ PK
Sbjct: 149 YDRRAHHDGYKNTYSFIKDGAKIMLTPLKSEDYPK 183



 Score = 46.2 bits (108), Expect(2) = 1e-25
 Identities = 17/37 (45%), Positives = 27/37 (72%)
 Frame = -1

Query: 483 KHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           +HNIF   CT   ++C++I+D GS EN+++  MV+KL
Sbjct: 51  RHNIFHARCTSQGKVCNVIIDSGSCENVIANYMVEKL 87


>ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao]
           gi|508716797|gb|EOY08694.1| Uncharacterized protein
           TCM_023754 [Theobroma cacao]
          Length = 440

 Score =  100 bits (249), Expect(2) = 1e-25
 Identities = 46/92 (50%), Positives = 60/92 (65%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           E HP  Y + W++K  E KV +R  + FSIG  Y+D V  D++ MDACH+LLGRPWQYD 
Sbjct: 200 EVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGSKYEDEVWCDVIPMDACHLLLGRPWQYDR 259

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVPMKGENCPK 86
            + + G  N   F KDG+KI L P+K E+ PK
Sbjct: 260 RAHYDGYKNISSFIKDGVKIMLTPLKPEDRPK 291



 Score = 43.1 bits (100), Expect(2) = 1e-25
 Identities = 17/37 (45%), Positives = 27/37 (72%)
 Frame = -1

Query: 483 KHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           +HNIF T  T   ++C++I+D GS EN+++  MV+KL
Sbjct: 159 RHNIFYTRYTSQGKVCNVIIDSGSCENVIANYMVEKL 195


>gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]
          Length = 1475

 Score = 91.7 bits (226), Expect(2) = 3e-25
 Identities = 48/125 (38%), Positives = 72/125 (57%), Gaps = 5/125 (4%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDV 182
           + HP+ Y + W+ K  E +V ++  ++FSIGK Y D    D++ MDACH+LLGRPW++D 
Sbjct: 426 QDHPSPYKLRWLNKGAEVRVDKQCLVTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFDR 485

Query: 181 DSTHKGKSNTFMFYKDGIKIFLVP----MKGENCPKITKVEG*SFLI-VSDILQESKETG 17
           DS H G+ NT+ F     K+ L P    +K    P + +      LI  +++LQE K   
Sbjct: 486 DSVHHGRDNTYTFKFRSRKVILTPLPPVLKHTTPPSMLEPSKEVLLINEAEMLQELKGDE 545

Query: 16  KMYAL 2
            +YAL
Sbjct: 546 DVYAL 550



 Score = 50.8 bits (120), Expect(2) = 3e-25
 Identities = 20/38 (52%), Positives = 29/38 (76%)
 Frame = -1

Query: 486 QKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           Q+  IFR+ CTI  R+C+LI+DGGS  N+ S T+++KL
Sbjct: 384 QRQQIFRSRCTIKGRVCNLIIDGGSCTNVASSTLIEKL 421


>ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508702148|gb|EOX94044.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 546

 Score = 89.7 bits (221), Expect(2) = 9e-25
 Identities = 49/123 (39%), Positives = 66/123 (53%), Gaps = 4/123 (3%)
 Frame = -3

Query: 358 KHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDVD 179
           KHP  Y IGW+KK  E  V  +  + F++G    D    D+V MD  H+L+GRPW YD D
Sbjct: 373 KHPYPYKIGWLKKGHEVPVTTQCLVKFTMGNNLDDEALCDVVPMDVGHILVGRPWLYDHD 432

Query: 178 STHKGKSNTFMFYKDGIKIFLVPMKGENCP----KITKVEG*SFLIVSDILQESKETGKM 11
             HK K NT+ FYK+  +  L P++ E       KI+K+ G  +L   +   E  E G  
Sbjct: 433 MVHKTKPNTYSFYKNNKRYTLYPLREETKKSANNKISKITG--YLSAENFEAEGSEMGIT 490

Query: 10  YAL 2
           YAL
Sbjct: 491 YAL 493



 Score = 51.2 bits (121), Expect(2) = 9e-25
 Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 6/150 (4%)
 Frame = -1

Query: 804 NVTN*DSGSS----QAKSIAKANTNRGSSSNPYAHLQLNKCYHCNQPGHRSNKCL*HPMA 637
           NV   D G S      ++ + ++TN+G S++   H+   +C+ C + GH S  C   P  
Sbjct: 227 NVEKNDKGKSIMPYGGQNSSGSSTNKGGSNS---HI---RCFTCGEKGHISFAC---PQR 277

Query: 636 NLTTHAXXXXXXXXXXXXXXXXXXXXXXXXXEGVFLVCRLMC--IPRQKIDPQKHNIFRT 463
            +                                 +V R+M   +  +  D ++ +IFRT
Sbjct: 278 RVNLAELGEELEPVYDEYEEEVEEIDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRT 337

Query: 462 SCTINQRICDLILDGGSSENIVSKTMVDKL 373
                 ++CDL++DGGS ENI+SK  V+KL
Sbjct: 338 RVVCEGKVCDLVIDGGSMENIISKEAVNKL 367


>ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao]
           gi|508727408|gb|EOY19305.1| Uncharacterized protein
           TCM_044370 [Theobroma cacao]
          Length = 1306

 Score = 93.6 bits (231), Expect(2) = 1e-24
 Identities = 51/123 (41%), Positives = 66/123 (53%), Gaps = 4/123 (3%)
 Frame = -3

Query: 358 KHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIVDMDACHMLLGRPWQYDVD 179
           KHP  Y IGW+KK  E  V  +Y + F++G    D    D+V MD  H+L+GRPW YD D
Sbjct: 365 KHPYPYKIGWLKKGHEVPVTTQYLVKFTMGDNLDDEALCDVVPMDVGHILVGRPWLYDHD 424

Query: 178 STHKGKSNTFMFYKDGIKIFLVPMKGEN----CPKITKVEG*SFLIVSDILQESKETGKM 11
             HK + NT+ FY D  +    P+K E       KI K+ G  +L V +   E  E G M
Sbjct: 425 MVHKTEPNTYSFYNDNKRYTSYPLKEETKKSANSKINKITG--YLSVENFEAEGSEMGIM 482

Query: 10  YAL 2
           YAL
Sbjct: 483 YAL 485



 Score = 47.0 bits (110), Expect(2) = 1e-24
 Identities = 20/40 (50%), Positives = 29/40 (72%)
 Frame = -1

Query: 492 DPQKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           D ++ +IFRT      ++CDL++DGGS ENI+SK  V+KL
Sbjct: 320 DWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVNKL 359


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score = 92.4 bits (228), Expect(2) = 1e-24
 Identities = 42/86 (48%), Positives = 59/86 (68%), Gaps = 1/86 (1%)
 Frame = -3

Query: 361 EKHPASYTIGWIKKVGETKVIERYQISFSIGKFYKDIVTYDIV-DMDACHMLLGRPWQYD 185
           ++HP  Y + W+ K    +V ++  ISFSIGK YKD V  D+V  MDACH+LLGRPW+YD
Sbjct: 437 QEHPNPYKLRWLSKDSGVRVDKQCIISFSIGKMYKDEVLCDVVVPMDACHLLLGRPWEYD 496

Query: 184 VDSTHKGKSNTFMFYKDGIKIFLVPM 107
            ++TH+GK N ++F   G K+ L P+
Sbjct: 497 RNTTHQGKDNVYIFKHQGKKVTLTPL 522



 Score = 47.8 bits (112), Expect(2) = 1e-24
 Identities = 20/38 (52%), Positives = 28/38 (73%)
 Frame = -1

Query: 486 QKHNIFRTSCTINQRICDLILDGGSSENIVSKTMVDKL 373
           Q+  IFR+ CT+  R+C+LI++GGS  N+ S TMV KL
Sbjct: 395 QRSMIFRSRCTVQGRVCNLIINGGSCTNVASTTMVSKL 432

