BLASTX nr result

ID: Akebia26_contig00033121 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00033121
         (835 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   179   1e-42
ref|XP_004253277.1| PREDICTED: uncharacterized protein LOC101244...   175   2e-41
ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262...   174   3e-41
ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260...   174   3e-41
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   172   1e-40
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   171   2e-40
ref|XP_004237272.1| PREDICTED: uncharacterized protein LOC101266...   170   5e-40
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   170   7e-40
ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein A...   170   7e-40
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   168   2e-39
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   168   3e-39
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   166   8e-39
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   165   2e-38
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   165   2e-38
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   165   2e-38
ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobrom...   164   4e-38
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   164   5e-38
ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261...   164   5e-38
ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein A...   164   5e-38
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   162   1e-37

>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  179 bits (453), Expect = 1e-42
 Identities = 101/278 (36%), Positives = 152/278 (54%), Gaps = 7/278 (2%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            + + K     SHL +ADD I+F    + S+  +   L  YEK S Q I+L KS  ++ ++
Sbjct: 537  YGMPKWSPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQMINLDKSMIYLHKQ 596

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
            V +R   LVK I G  +GS P  YLG P+F G+  +     L+ K+  +M+ W++KL+S 
Sbjct: 597  VPNRVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLKKVSNRMNTWQNKLMSF 656

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            G R  L++HVL SIPVYL++ +  PK+I   L   FA FFW ++ G    H VAW K+C 
Sbjct: 657  GERYILIAHVLQSIPVYLLAAMNPPKSIIDQLHKLFAIFFWSNSSGARNKHWVAWDKMCY 716

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTE-KGLWIEFLKKKYARDGNWWNPTNSRG--- 121
            PK+EG LG R++ +V K    K+ W   T+   LW  F+  KY +     +PT +RG   
Sbjct: 717  PKVEGGLGFRSLHDVSKAFFAKLWWNFRTDTSSLWASFMWNKYCKK---MHPTVARGQGA 773

Query: 120  SKLWR---SIRSFLQPTIRLSKRLIGDGASSSLFLHNW 16
            S +WR   ++R  ++  I    +      +SS +  NW
Sbjct: 774  SHVWRKMITVREEVEHNIWWQIK----AGNSSFWFDNW 807


>ref|XP_004253277.1| PREDICTED: uncharacterized protein LOC101244169 [Solanum
            lycopersicum]
          Length = 764

 Score =  175 bits (443), Expect = 2e-41
 Identities = 92/276 (33%), Positives = 157/276 (56%), Gaps = 2/276 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F+++KKG   +HL FADD+IIFT   + SL  I   +E YE  S+Q+++ +KS F V  +
Sbjct: 226  FNMNKKGPQVNHLSFADDIIIFTSTDNTSLQLIMKVIEDYEAVSDQKVNKEKSYFMVTPK 285

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
             ++  ++ +K I GF   + P NYLG PL++G         +V K+++K+ GW+SK+L+ 
Sbjct: 286  TSNGIIDNIKRITGFSMKNSPINYLGCPLYIGGQRIIYFSEVVDKVIKKISGWQSKILNF 345

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL SIP++ ++ +   KT    ++   A FFWG  +   K H  +W  +  
Sbjct: 346  GGKITLIKHVLQSIPIHTLAAISPHKTTINHIKKLMADFFWGIDKEGKKYHWASWDTMAY 405

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112
            P  EG +G+R + ++ K    K  W   T+  LW  FLK KY +  +      N+  S +
Sbjct: 406  PTNEGGIGVRLLDDICKAFQYKHWWDFRTKNSLWSNFLKSKYCQRAHPVAKKYNTGDSLM 465

Query: 111  WRSI-RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7
            WR + R+ ++  +++  ++     +S+L+  NW G+
Sbjct: 466  WRYLTRNRIEVEVQIRWQI--QSGTSNLWWDNWTGN 499


>ref|XP_004253436.1| PREDICTED: uncharacterized protein LOC101262707 [Solanum
            lycopersicum]
          Length = 764

 Score =  174 bits (442), Expect = 3e-41
 Identities = 93/276 (33%), Positives = 155/276 (56%), Gaps = 2/276 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F+++KKG   +HL FADD+IIFT   + SL  I   +E YE  S+Q+++ +KS F V  +
Sbjct: 226  FNMNKKGPQVNHLSFADDIIIFTSTDNTSLQLIMKVIEDYEAVSDQKVNKEKSYFMVTLK 285

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
             ++  ++ +K I GF   + P NYLG PL++G         +V K+++K+ GW+SK+L+ 
Sbjct: 286  TSNGIIDNIKRITGFSMKNSPINYLGCPLYIGGQRIIYFFEVVDKVIKKISGWQSKILNF 345

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL SIP++ ++ +  PKT    ++   A FFWG  +   K H  +W  +  
Sbjct: 346  GGKITLIKHVLQSIPIHTLAAISPPKTTINHIKKLMADFFWGIDKEGKKYHWASWDTMAY 405

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112
            P  EG +G+R + ++ K    K  W   T+  LW  FL  KY +  +      N+  S +
Sbjct: 406  PTNEGGIGVRLLDDICKAFQYKHWWDFRTKHSLWSNFLMSKYCQRAHPVAKKYNTGDSLM 465

Query: 111  WRSI-RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7
            WR + R+ ++  + +   +     +SSL+  NW G+
Sbjct: 466  WRYLTRNRIEVEVHIRWHI--QSGTSSLWWDNWTGN 499


>ref|XP_004233579.1| PREDICTED: uncharacterized protein LOC101260201 [Solanum
            lycopersicum]
          Length = 1531

 Score =  174 bits (442), Expect = 3e-41
 Identities = 94/278 (33%), Positives = 154/278 (55%), Gaps = 4/278 (1%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            FS+ K G   +HL FADD IIFT    +SL+ I   ++ YE+  +Q+++  KS F V  +
Sbjct: 961  FSMEKNGPQTNHLSFADDCIIFTSTDRRSLTLIMRIIDDYERVFDQKVNKDKSFFMVTRK 1020

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
             +   +E +K++ GF   + P NYLG PL++G         +V K+++++ GW+SK+L+ 
Sbjct: 1021 TSHEIIEDIKVVTGFGMKNSPINYLGCPLYIGGQRIIYFSEVVEKVIKRISGWQSKILNF 1080

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL ++P++ ++++  PKT    ++   A FFWG  +   K H  +W  +  
Sbjct: 1081 GGKVTLVKHVLQAMPIHTLAVMSPPKTTLNYIKRAIADFFWGVDKDGKKYHWASWDTLAY 1140

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGSKL- 112
            P  EG +G+R + ++ K    K  W   T+K LW +FLK KY +  N        G  L 
Sbjct: 1141 PTNEGGIGVRLLDDICKAFQYKHWWEFRTKKSLWSQFLKAKYCQRANPVAKKYDTGDSLV 1200

Query: 111  WRSI---RSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7
            WR +   RS ++  IR +   I  G +S  +  NW G+
Sbjct: 1201 WRYLTRNRSEMEAYIRWN---INSG-TSKFWWDNWLGN 1234


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  172 bits (436), Expect = 1e-40
 Identities = 97/269 (36%), Positives = 144/269 (53%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT  +  +L  I  FL+ YE+ S Q+I+ +KS F     V+  R +++ 
Sbjct: 1730 SHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQRINAQKSCFVTHTNVSSSRRQIIA 1789

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF    LP  YLG PL+ G     +   LV+KI  ++ GW++K+LS GGR+TLL  V
Sbjct: 1790 QTTGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLKSV 1849

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            LTS+P+YL  +L  P  +   +   F  F WG +    K H  +W K+  P  EG L IR
Sbjct: 1850 LTSLPIYLFQVLKPPVCVLERINRIFNSFLWGGSAASKKIHWTSWAKISLPVKEGGLDIR 1909

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRSFLQ 85
            +++EV +   MK+ WR  T   LW  F++ KY R G     T  +   S+ W+ + +   
Sbjct: 1910 SLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR-GQLPMHTQPKLHDSQTWKRMVASSA 1968

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
             T +  +  +G G  +  F H+ W G  P
Sbjct: 1969 ITEQNMRWRVGQG--NLFFWHDCWMGETP 1995


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  171 bits (434), Expect = 2e-40
 Identities = 82/212 (38%), Positives = 125/212 (58%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT  +   L  I  FL+ YE+ S Q+++ +KS F  A  +   R +++ 
Sbjct: 644  SHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQKSCFVTANNMPSSRRQIIS 703

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
              +GF   +LP  YLG PLF G     +   L++KI  ++ GW++K+LS GGR+TLL  V
Sbjct: 704  QTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKIRERITGWENKILSPGGRITLLRSV 763

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P  +   +E  F  F WGS+   T+ H  AWH +  P  EG LGIR
Sbjct: 764  LSSMPIYLLQVLKPPACVIQKIERLFNSFLWGSSMDSTRIHWTAWHNITFPSSEGGLGIR 823

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163
            ++ +       K+ WR  T + LW+ +++ KY
Sbjct: 824  SLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKY 855


>ref|XP_004237272.1| PREDICTED: uncharacterized protein LOC101266714 [Solanum
           lycopersicum]
          Length = 584

 Score =  170 bits (431), Expect = 5e-40
 Identities = 82/227 (36%), Positives = 136/227 (59%)
 Frame = -2

Query: 828 FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
           FS+ KKG   +HL FADD+IIFT    +SL+ I   +E YEK S+Q+++  KS F V  +
Sbjct: 72  FSMEKKGPQINHLSFADDIIIFTSTDRRSLNLIMRIIEDYEKVSDQKVNKDKSFFMVTSK 131

Query: 648 VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
            +   +  +K++  F   + P +YLG PL++G         +V K+++++ GW+SK+L+ 
Sbjct: 132 TSQYIIGDIKLVTSFCMKNSPIHYLGCPLYIGGQRIIYFSEVVEKVIKRISGWQSKILNY 191

Query: 468 GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
           GG++TL+ HVL ++P+++++ +  PKT  + ++   A FFWG  +   K H  +W+ +  
Sbjct: 192 GGKVTLVKHVLQAMPIHILAAMSPPKTTLMYIKREIAAFFWGVDKDGKKYHWASWNTLDY 251

Query: 288 PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN 148
           P MEG +G+R + +V K    K  W   T+  LW +FLK KY +  N
Sbjct: 252 PTMEGGIGVRLLDDVCKAFQYKHWWEFRTKGSLWSQFLKAKYCQRAN 298


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  170 bits (430), Expect = 7e-40
 Identities = 97/269 (36%), Positives = 149/269 (55%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT     +L  I  FL+ YE+ S QQ++ +KS F  A      R +++ 
Sbjct: 1263 SHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQQVNHQKSCFITANGCPMTRRQIIA 1322

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF   +LP  YLG PL  G     +   L++KI  ++ GW++K LS GGR+TLL  V
Sbjct: 1323 HTTGFQHKTLPVIYLGAPLHKGPKKVTLFDSLITKIRDRISGWENKTLSPGGRITLLRSV 1382

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P  +   +E  F  F WG +    + H  AWHK+  P  EG L IR
Sbjct: 1383 LSSLPLYLLQVLKPPVVVIEKIERLFNSFLWGDSTNDKRIHWAAWHKLTFPCSEGGLDIR 1442

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85
             ++++     +K+ WR +T +GLW +FLK KY   +  ++ +P     S++W+ +    +
Sbjct: 1443 RLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHP-KLHDSQVWKRMVRGRE 1501

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
              I+ ++  IG G  S  F H+ W G  P
Sbjct: 1502 VAIQNTRWRIGKG--SLFFWHDCWMGDQP 1528


>ref|XP_004233578.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 955

 Score =  170 bits (430), Expect = 7e-40
 Identities = 98/275 (35%), Positives = 143/275 (52%), Gaps = 1/275 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F +   G   +HL FA+D+IIFT    +SL  I   +E YE  S+QQ++  KS F V  +
Sbjct: 239  FQMDSNGPQINHLSFANDIIIFTSTDRQSLQLIVKTIEEYELISDQQVNKDKSFFMVTTK 298

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
                 +  +K+  GF   + P  YLG PL+VG        G+V KI+RK+ GW +K+L+ 
Sbjct: 299  TNQAIINSIKIETGFGIQNSPITYLGCPLYVGGQRIIYFSGIVEKIIRKISGWHAKILNF 358

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL SIP++L++ +  PKT    +++  A FFWG  +   K H  +W  +  
Sbjct: 359  GGKITLVKHVLQSIPIHLLAAVSPPKTTLKYIKNVIADFFWGMDKDGKKYHWASWETLAY 418

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGSKL- 112
            P  EG +G+R + +V      K  W   T+  LW +FLK KY +  N        G+ L 
Sbjct: 419  PTNEGGIGVRNLEDVCIAFQYKQWWEFRTKNSLWSKFLKAKYCKRANPVAKKYDTGNSLV 478

Query: 111  WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNWRGS 7
            WR      Q      K  I  G SSS +  NW G+
Sbjct: 479  WRYFTRNRQAVESYIKWNIHSG-SSSFWWDNWLGN 512


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  168 bits (426), Expect = 2e-39
 Identities = 98/269 (36%), Positives = 142/269 (52%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT     +L  I  FL+ YE+ S QQ++ +KS F  A      R +++ 
Sbjct: 1524 SHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQQVNHQKSCFITANGCPLSRRQIIA 1583

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
             + GF   +LP  YLG PL  G     +   L+SKI  ++ GW++K+LS G R+TLL  V
Sbjct: 1584 QVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGSRITLLRSV 1643

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P  +   +E  F  F WG +    + H  AW+K+  P  EG L IR
Sbjct: 1644 LSSLPMYLLQVLKPPAIVIEKIERLFNSFLWGDSNEGKRMHWAAWNKINFPCSEGGLDIR 1703

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85
             + +V     +K+ WR  T   LW  FLK KY   R  ++  P     S +W+ I     
Sbjct: 1704 NLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQP-KIHSSSIWKRITGGRD 1762

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
             TI+ ++  IG G     F H+ W G  P
Sbjct: 1763 VTIQNTRWKIGRG--ELFFWHDCWMGDQP 1789


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  168 bits (425), Expect = 3e-39
 Identities = 94/269 (34%), Positives = 144/269 (53%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT  +  +L  I  FL+ YE+ S Q+I+ +KS F     + + R +++ 
Sbjct: 1560 SHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTNIPNSRRQIIA 1619

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF    LP  YLG PL+ G     +   LV+KI  ++ GW++K+LS GGR+TLL  V
Sbjct: 1620 QATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRSV 1679

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L S+P+YL+ +L  P  +   +   F  F WG +    + H  +W K+  P  EG L IR
Sbjct: 1680 LASLPIYLLQVLKPPVCVLERVNRLFNSFLWGGSAASKRIHWASWAKIALPVTEGGLDIR 1739

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRSFLQ 85
            +++EV +   MK+ WR  T   LW  F++ KY R G     T  +   S+ W+ + +   
Sbjct: 1740 SLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCR-GQLPMQTQPKLHDSQTWKRMLTSST 1798

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
             T +  +  +G G  +  F H+ W G  P
Sbjct: 1799 ITEQHMRWRVGQG--NVFFWHDCWMGEAP 1825


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  166 bits (421), Expect = 8e-39
 Identities = 98/269 (36%), Positives = 140/269 (52%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT     SL  I  FL+ YE+ S QQ++ +KS F  A   A  R +++ 
Sbjct: 1260 SHLSFADDIVIFTNGCRSSLQKILNFLQEYEQVSGQQVNHQKSCFITANGCALSRRQIIS 1319

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF   +LP  YLG PL  G     +   L+SKI  ++ GW++K+LS GGR+TLL  V
Sbjct: 1320 HTTGFHHKTLPVTYLGAPLHKGPKKVFLFDSLISKIRDRISGWENKILSPGGRITLLRSV 1379

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S P+YL+ +L  P T+   +E  F  F WG +    K H   W K+  P  EG L IR
Sbjct: 1380 LSSQPMYLLQVLKPPVTVIEKIERIFNSFLWGDSNDGKKLHWTVWSKITFPVSEGGLDIR 1439

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85
             + +V +   +K+ WR  T   LW +FL+ KY   R  ++  P     S++W+  R  + 
Sbjct: 1440 NLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLGRIPHFVQP-KLHDSQVWK--RMIVG 1496

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
              + L       G     F H+ W G  P
Sbjct: 1497 RDVALQNIRWRIGKGELFFWHDCWMGDQP 1525


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  165 bits (417), Expect = 2e-38
 Identities = 96/269 (35%), Positives = 143/269 (53%), Gaps = 3/269 (1%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT     +L  I  FL+ YE+ S Q+++ +KS F  A   +  R +++ 
Sbjct: 1437 SHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKSCFITANGCSLSRRQIIS 1496

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF   +LP  YLG PL  G     +   L+SKI  ++ GW++K+LS GGR+TLL  V
Sbjct: 1497 HTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKIRDRISGWENKILSPGGRITLLRSV 1556

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P T+   ++  F  F WG +    K H   W K+  P  EG LGIR
Sbjct: 1557 LSSLPMYLLQVLKPPVTVIERIDRLFNSFLWGDSTECKKMHWAEWAKISFPCAEGGLGIR 1616

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY--ARDGNWWNPTNSRGSKLWRSIRSFLQ 85
             + +V     +K+ WR  T   LW +FL+ KY   R  +   P     S +W+ + S  +
Sbjct: 1617 KLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQP-KLHDSHVWKRMISGRE 1675

Query: 84   PTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
              ++  +  IG G     F H+ W G  P
Sbjct: 1676 MALQNIRWKIGKG--DLFFWHDCWMGDKP 1702


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  165 bits (417), Expect = 2e-38
 Identities = 92/265 (34%), Positives = 141/265 (53%), Gaps = 1/265 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F +   G   +HL FADD+IIF+   + SL+ I   ++ YE+ S+Q+++  KS F V   
Sbjct: 739  FHMESNGPKINHLSFADDIIIFSSTDNNSLNLIMKTIDQYEEVSDQKVNKDKSFFMVTSN 798

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
             +   +E +  I GF + + P NYLG PL+VG         +V K+++K+ GW  K+L+ 
Sbjct: 799  TSHDIIEEISRITGFSRKNSPINYLGCPLYVGGQRIIYYSEIVEKVIKKIAGWHLKILNF 858

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL S+P++ +S +  PKTI  S++   A FFWG  +   K H  +W+ +  
Sbjct: 859  GGKVTLVKHVLQSMPIHTLSAISPPKTILNSIKKVIADFFWGIEKDGKKYHWSSWNNMAF 918

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112
            P  EG +G+R I ++      K  W   T   LW +FLK KY +  N      N+  S +
Sbjct: 919  PTNEGGIGVRLIEDMCTAFQYKQWWAFRTNNSLWSKFLKAKYNQRANPVAKKYNTGDSIV 978

Query: 111  WRSIRSFLQPTIRLSKRLIGDGASS 37
            WR +    Q    L K  I  G  S
Sbjct: 979  WRYLTRNRQKVESLIKWHIQSGTCS 1003


>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1231

 Score =  165 bits (417), Expect = 2e-38
 Identities = 94/276 (34%), Positives = 154/276 (55%), Gaps = 6/276 (2%)
 Frame = -2

Query: 825  SVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERV 646
            +VS  G   SH+ FADDLI+F +A+   +  I+  LE + +AS Q++SL+KS  F +  V
Sbjct: 531  AVSCGGSKLSHVCFADDLILFAEASVAQIRIIRRVLERFCEASGQKVSLEKSKIFFSHNV 590

Query: 645  ADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEG 466
            +    +L+    G         YLG+P+   ++ +E    ++ ++  ++ GWK + LS  
Sbjct: 591  SREMEQLISEESGIGCTKELGKYLGMPILQKRMNKETFGEVLERVSARLAGWKGRSLSLA 650

Query: 465  GRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKP 286
            GR+TL   VL+SIPV+++S + +P +   +L+     F WGST  K K HL++W K+CKP
Sbjct: 651  GRITLTKAVLSSIPVHVMSAILLPVSTLDTLDRYSRTFLWGSTMEKKKQHLLSWRKICKP 710

Query: 285  KMEGELGIRTISEVHKTGLMKMCWRLTTEK-GLWIEFLKKKY----ARDGNWWNPTNSRG 121
            K EG +G+R+  +++K  + K+ WRL  +K  LW   ++KKY     +D +W  P   R 
Sbjct: 711  KAEGGIGLRSARDMNKALVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKP-QPRW 769

Query: 120  SKLWRSIRSFLQPTIRLSKRLI-GDGASSSLFLHNW 16
            S  WRS+   L+  +      + GDG +   +L  W
Sbjct: 770  SSTWRSVAVGLREVVVKGVGWVPGDGCTIRFWLDRW 805


>ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobroma cacao]
            gi|508778193|gb|EOY25449.1| Uncharacterized protein
            TCM_016755 [Theobroma cacao]
          Length = 1245

 Score =  164 bits (415), Expect = 4e-38
 Identities = 85/212 (40%), Positives = 120/212 (56%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD++IFT     +L  I  FL+ YE  S QQ++ +KS F  +      R +++ 
Sbjct: 1012 SHLAFADDIVIFTNGCRPALQKILIFLQEYEAVSGQQVNHQKSCFITSNGCPMTRRQIIA 1071

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF   +LP  YLG PL  G     +   L++KI  ++ GW++K LS GGR+TLL  V
Sbjct: 1072 HTTGFQHKTLPVIYLGAPLHKGPKKVALFDSLITKIRDRISGWENKTLSPGGRITLLRSV 1131

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P  +   +E  F  F WG +    + H VAWHK+  P  EG + IR
Sbjct: 1132 LSSMPMYLLQVLKPPVVVIEKIERLFNSFLWGDSTTDKRMHWVAWHKLTFPCSEGGIDIR 1191

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163
             +++V     MK+ WR  T  GLW  FLK KY
Sbjct: 1192 RLNDVSDAFTMKLWWRFQTCDGLWTNFLKTKY 1223


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  164 bits (414), Expect = 5e-38
 Identities = 94/272 (34%), Positives = 144/272 (52%), Gaps = 3/272 (1%)
 Frame = -2

Query: 807  VAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVE 628
            ++ SHL FADD++IFT  +  +L  I  FL+ Y++ S Q+I+++KS F     V+  R +
Sbjct: 1555 ISVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTNVSSSRRQ 1614

Query: 627  LVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLL 448
            ++    GF    L   YLG PL+ G     +   LV+KI  ++ GW++K+LS GGR+TLL
Sbjct: 1615 IIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLL 1674

Query: 447  SHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGEL 268
              VL S+P+YL+ +L  P  +   +   F  F WG +    K H  +W K+  P  EG L
Sbjct: 1675 RSVLASLPIYLLQVLKPPICVLERVNRIFNSFLWGGSAASKKIHWASWAKISLPIKEGGL 1734

Query: 267  GIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSR--GSKLWRSIRS 94
             IR ++EV +   MK+ WR  T   LW  F++ KY R G     T  +   S+ W+ + +
Sbjct: 1735 DIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCR-GQLPMHTQPKLHDSQTWKRMVA 1793

Query: 93   FLQPTIRLSKRLIGDGASSSLFLHN-WRGSCP 1
                T +  +  +G G     F H+ W G  P
Sbjct: 1794 NSAITEQNMRWRVGQG--KLFFWHDCWMGETP 1823


>ref|XP_004248595.1| PREDICTED: uncharacterized protein LOC101261371 [Solanum
            lycopersicum]
          Length = 1246

 Score =  164 bits (414), Expect = 5e-38
 Identities = 92/272 (33%), Positives = 141/272 (51%), Gaps = 1/272 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F + + G   +HL FADD+IIF    S SL  I   +E YE+ S+QQ++  KS F V   
Sbjct: 545  FHMERNGPKINHLSFADDIIIFASTDSNSLHLIMKTIELYEEVSDQQVNKHKSFFMVTSN 604

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
                 +E +K   G+ + + P NYLG PL++G         +V K+++++ GW SK+L+ 
Sbjct: 605  TGHDIIEEIKRATGYSRKNSPINYLGCPLYIGGQRIIYYSEVVEKVIKRIAGWHSKILNF 664

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL SIP++ ++ +  PKT    ++   A FFWG  +   K H  +W  +  
Sbjct: 665  GGKITLVKHVLQSIPIHTLAAISPPKTTLNCIKKLIADFFWGIDKDGKKYHWSSWENMAY 724

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGN-WWNPTNSRGSKL 112
            P  EG +G+R + +V         W   T+  LW +FLK KY +  N      +S  S +
Sbjct: 725  PTSEGGIGVRLLEDVCTAFQYMQWWDFRTKNSLWSQFLKAKYCQRANPLAKKYDSGDSLV 784

Query: 111  WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNW 16
            WR +         L K  I  G +SS +  NW
Sbjct: 785  WRYLTRNRLKVESLIKWQIHSG-TSSFWWDNW 815


>ref|XP_004244918.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            lycopersicum]
          Length = 1010

 Score =  164 bits (414), Expect = 5e-38
 Identities = 94/274 (34%), Positives = 146/274 (53%), Gaps = 1/274 (0%)
 Frame = -2

Query: 828  FSVSKKGVAPSHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAER 649
            F+++KKG   +HL FADD+IIFT    KSL  I   ++ YE  S+Q+++  KS       
Sbjct: 294  FNMNKKGPQINHLSFADDIIIFTSTDLKSLQLIMHTIKEYEGVSDQRVNKDKSFCMATVN 353

Query: 648  VADRRVELVKMILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSE 469
                  E+VK + G+   + P NYLG PL++G  +      LV K+++++ GW+SK+L+ 
Sbjct: 354  TRTDIQEIVKSVTGYHMKTSPINYLGCPLYIGGKSIIYYSELVDKVIKRITGWQSKILNF 413

Query: 468  GGRLTLLSHVLTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCK 289
            GG++TL+ HVL SIP++ ++ +  PKTI  ++    A FFWGS     K H  +   +  
Sbjct: 414  GGKITLVKHVLQSIPIHTLATISPPKTIIKNINKVIADFFWGSDSVGKKYHWASLETMAY 473

Query: 288  PKMEGELGIRTISEVHKTGLMKMCWRLTTEKGLWIEFLKKKYARDGNWWNPTNSRGS-KL 112
            P  EG +G+R + +V ++   K  W   T+  LW +FLK KY +  N        G   +
Sbjct: 474  PISEGGIGVRLLDDVCRSFQYKHWWEFRTKDTLWSKFLKAKYCQRSNIVAKKFDTGDYVV 533

Query: 111  WRSIRSFLQPTIRLSKRLIGDGASSSLFLHNWRG 10
            WR +    Q   +  K  I  G + S +  NW G
Sbjct: 534  WRYLTRIRQEVEKYIKWNIHTG-NCSFWWDNWIG 566


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  162 bits (411), Expect = 1e-37
 Identities = 81/212 (38%), Positives = 119/212 (56%)
 Frame = -2

Query: 798  SHLLFADDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVK 619
            SHL FADD+IIF   +  +L  I  FL+ YE+ S Q+I+ +KS       +A  R +++ 
Sbjct: 2811 SHLAFADDVIIFANGSKSALQRILAFLQEYEELSGQRINPQKSCVVTHTNMASSRRQIIL 2870

Query: 618  MILGFPKGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHV 439
               GF    LP  YLG PLF G     +   LV+KI  ++ GW++K+LS GGR+TLL   
Sbjct: 2871 QATGFSHRPLPITYLGAPLFKGHKKVILFNDLVAKIEERITGWENKILSPGGRITLLRST 2930

Query: 438  LTSIPVYLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIR 259
            L+S+P+YL+ +L  P  +   +   F  F WG +    + H  +W K+  P  EG L IR
Sbjct: 2931 LSSLPIYLLQVLKPPIIVLERINRLFNNFLWGGSASSKRIHWASWGKIALPIAEGGLDIR 2990

Query: 258  TISEVHKTGLMKMCWRLTTEKGLWIEFLKKKY 163
             + +V K   MK+ WR  T   LW++F++ KY
Sbjct: 2991 NLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKY 3022



 Score =  155 bits (393), Expect = 1e-35
 Identities = 81/206 (39%), Positives = 115/206 (55%)
 Frame = -2

Query: 780  DDLIIFTKATSKSLSNIKGFLESYEKASEQQISLKKSNFFVAERVADRRVELVKMILGFP 601
            DD++IFT     SL  I  FL+ YE+ S QQ++ +KS F      A  R +++    GF 
Sbjct: 1023 DDIVIFTNGCRSSLQKILNFLQEYEQVSGQQVNHQKSCFITTNGCALSRRQIISHTTGFH 1082

Query: 600  KGSLPTNYLGVPLFVGKVTREMCRGLVSKILRKMDGWKSKLLSEGGRLTLLSHVLTSIPV 421
              +LP  YLG PL  G+    +   L+SKI  ++ GW++K+LS GGR+TLL  VL+S P+
Sbjct: 1083 HKTLPVTYLGAPLHKGQKKVILFDSLISKIRDRISGWENKILSPGGRITLLRSVLSSQPM 1142

Query: 420  YLISILPIPKTISISLESCFAKFFWGSTEGKTKGHLVAWHKVCKPKMEGELGIRTISEVH 241
            YL+ +L  P T+   +E  F  F WG +    K H  AW K+  P  EG L IR + +V 
Sbjct: 1143 YLLQVLKPPVTVIEKIERLFNSFLWGDSCDGKKLHWTAWSKITFPVSEGGLDIRNLRDVF 1202

Query: 240  KTGLMKMCWRLTTEKGLWIEFLKKKY 163
            +   +K+ WR  T   LW  FL+ KY
Sbjct: 1203 EAFSLKLWWRFQTCNSLWTRFLRTKY 1228


Top