BLASTX nr result

ID: Sinomenium22_contig00028559 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00028559
         (633 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   165   9e-39
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   165   1e-38
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   165   1e-38
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   164   2e-38
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   162   1e-37
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   161   1e-37
ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobrom...   159   5e-37
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   159   9e-37
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   157   3e-36
gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas...   153   5e-35
ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268...   151   2e-34
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   150   2e-34
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...   149   7e-34
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...   149   9e-34
emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulga...   148   2e-33
emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulga...   147   2e-33
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   146   4e-33
ref|XP_006357717.1| PREDICTED: uncharacterized protein LOC102595...   145   1e-32
gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis ...   145   1e-32
ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein A...   144   2e-32

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  165 bits (418), Expect = 9e-39
 Identities = 87/209 (41%), Positives = 127/209 (60%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  + +  +I  C+T     WFS++IN  S G+FKS+RGLRQGD +SP LFI+  E L 
Sbjct: 564  FGFNDMWIDMIRRCITN---CWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLS 620

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + I  ++  + I L  H   +  +SHL +ADDI++F N SK  +  +++ L++YE  +GQ
Sbjct: 621  RGIN-ELFSRYISLHYHSGCSLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQ 679

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            +VN +KS    +  +   R + I    GF     P  YLGAP+  G  KV  FD L+ K+
Sbjct: 680  RVNHQKSCFVTANNMPSSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLINKI 739

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              ++ GW+N++LS GGR+ LLR VLSSMP
Sbjct: 740  RERITGWENKILSPGGRITLLRSVLSSMP 768


>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  165 bits (417), Expect = 1e-38
 Identities = 87/209 (41%), Positives = 127/209 (60%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  E    +I+  ++    +W+S+++N  S GFF+S RGL+QGDPLSP LFII  EVL 
Sbjct: 466  FGFAERIIDMIVRLISN---NWYSVLMNGQSFGFFQSTRGLKQGDPLSPTLFIIAAEVLS 522

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +     +     +  P  +PVVSHL YADD +LF +    ++R +++IL+ YE  +GQ
Sbjct: 523  RGLNSLFEDPDYIGYGMPKWSPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQ 582

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
             +N +KS IY  K+V       +   TG  QG FPF YLG PI  GR    HF+ L+ K+
Sbjct: 583  MINLDKSMIYLHKQVPNRVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLKKV 642

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
             +++  W+N+L+S G R IL+ HVL S+P
Sbjct: 643  SNRMNTWQNKLMSFGERYILIAHVLQSIP 671


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  165 bits (417), Expect = 1e-38
 Identities = 83/209 (39%), Positives = 130/209 (62%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  + +  +I  C++     WFS+++N  ++G+FK +RGLRQGDP+SP LF+I  E L 
Sbjct: 1650 FGFNDQWIGMIQKCISN---CWFSLLLNGRTEGYFKFERGLRQGDPISPQLFLIAAEYLS 1706

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +     +     +S  V+ PV SHL +ADD+++F N SK  ++ ++  L++YE  + Q
Sbjct: 1707 RGLNALYEQYPSLHYSTGVSIPV-SHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQ 1765

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            ++N +KS       VS  R + I   TGF+    P  YLGAP+  G  KV  F+ LVAK+
Sbjct: 1766 RINAQKSCFVTHTNVSSSRRQIIAQTTGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKI 1825

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
            E ++ GW+N++LS GGR+ LL+ VL+S+P
Sbjct: 1826 EERITGWENKILSPGGRITLLKSVLTSLP 1854


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  164 bits (415), Expect = 2e-38
 Identities = 84/208 (40%), Positives = 128/208 (61%)
 Frame = +2

Query: 8    GLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFK 187
            G    +  +I  C++     WFS+++N  + G+FKS+RGLRQGD +SP LFI+  E L +
Sbjct: 1444 GFNAQWIGMIQKCISN---CWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILAAEYLAR 1500

Query: 188  MIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQK 367
             +    ++     +S   +  V SHL +ADD+++FAN SK  ++ +M  L++YE  +GQ+
Sbjct: 1501 GLNALYDQYPSLHYSSGCSLSV-SHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQR 1559

Query: 368  VNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLE 547
            +N +KS +     ++  R + IL  TGFS    P  YLGAP+  G  KV  F+ LVAK+E
Sbjct: 1560 INPQKSCVVTHTNMASSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLVAKIE 1619

Query: 548  SKLAGWKNRLLSQGGRLILLRHVLSSMP 631
             ++ GW+N+ LS GGR+ LLR  LSS+P
Sbjct: 1620 ERITGWENKTLSPGGRITLLRSTLSSLP 1647


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  162 bits (409), Expect = 1e-37
 Identities = 83/209 (39%), Positives = 128/209 (61%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  E +  +I  C++     WFS+++N   +G+FKS+RGLRQGD +SP LFI+  E L 
Sbjct: 1480 FGFNEQWIGMIQKCISN---CWFSLLLNGRIEGYFKSERGLRQGDSISPQLFILAAEYLS 1536

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +    ++     +S  V   V SHL +ADD+++F N SK  ++ ++  L++YE  +GQ
Sbjct: 1537 RGLNALYDQYPSLHYSSGVPLSV-SHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQ 1595

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            ++N +KS       +   R + I   TGF+    P  YLGAP+  G  KV  F+ LVAK+
Sbjct: 1596 RINAQKSCFVTHTNIPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLVAKI 1655

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
            E ++ GW+N++LS GGR+ LLR VL+S+P
Sbjct: 1656 EERITGWENKILSPGGRITLLRSVLASLP 1684


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  161 bits (408), Expect = 1e-37
 Identities = 84/209 (40%), Positives = 129/209 (61%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG    +  +I  C++     WFS+++N  ++G+FKS+RGLRQGD +SP LFII  E L 
Sbjct: 1478 FGFNGQWIKMIQKCISN---CWFSLLLNGRTEGYFKSERGLRQGDSISPQLFIIAAEYLS 1534

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +    ++     +S  V+  V SHL +ADD+++F N SK  ++ ++  L++Y+  +GQ
Sbjct: 1535 RGLNALYDQYPSLHYSSGVSISV-SHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQ 1593

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            ++N +KS       VS  R + I   TGFS       YLGAP+  G  KV  F+ LVAK+
Sbjct: 1594 RINVQKSCFVTHTNVSSSRRQIIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLVAKI 1653

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
            E ++ GW+N++LS GGR+ LLR VL+S+P
Sbjct: 1654 EERITGWENKILSPGGRITLLRSVLASLP 1682


>ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobroma cacao]
            gi|508778193|gb|EOY25449.1| Uncharacterized protein
            TCM_016755 [Theobroma cacao]
          Length = 1245

 Score =  159 bits (403), Expect = 5e-37
 Identities = 85/209 (40%), Positives = 126/209 (60%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  + + ++I  C++     WFS++IN +  G+FKS+RGLRQGD +SP LFI+  E L 
Sbjct: 932  FGFNDRWISMIKACISN---CWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAAEYLS 988

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + I    ++ K   +      P+ SHL +ADDIV+F N  +  ++ ++  L++YE  +GQ
Sbjct: 989  RGINQLFSDHKSLHYLSGCFMPI-SHLAFADDIVIFTNGCRPALQKILIFLQEYEAVSGQ 1047

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            +VN +KS    S    + R + I   TGF     P  YLGAP+  G  KV  FD L+ K+
Sbjct: 1048 QVNHQKSCFITSNGCPMTRRQIIAHTTGFQHKTLPVIYLGAPLHKGPKKVALFDSLITKI 1107

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              +++GW+N+ LS GGR+ LLR VLSSMP
Sbjct: 1108 RDRISGWENKTLSPGGRITLLRSVLSSMP 1136


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  159 bits (401), Expect = 9e-37
 Identities = 83/209 (39%), Positives = 126/209 (60%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  + + ++I  C++     WFS++IN +  G+FKS+RGLRQGD +SP LF++  + L 
Sbjct: 1183 FGFNDRWISMIKACISN---CWFSLLINGSLVGYFKSERGLRQGDSISPLLFVLAADYLS 1239

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + I    N  K  L+      P+ SHL +ADDIV+F N  +  ++ ++  L++YE  +GQ
Sbjct: 1240 RGINQLFNRHKSLLYLSGCFMPI-SHLAFADDIVIFTNGCRPALQKILVFLQEYEEVSGQ 1298

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            +VN +KS    +    + R + I   TGF     P  YLGAP+  G  KV  FD L+ K+
Sbjct: 1299 QVNHQKSCFITANGCPMTRRQIIAHTTGFQHKTLPVIYLGAPLHKGPKKVTLFDSLITKI 1358

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              +++GW+N+ LS GGR+ LLR VLSS+P
Sbjct: 1359 RDRISGWENKTLSPGGRITLLRSVLSSLP 1387


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  157 bits (396), Expect = 3e-36
 Identities = 82/209 (39%), Positives = 127/209 (60%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG   ++  +I +C++     WFS++IN +  G+FKS+RGLRQGD +SP LFI+  + L 
Sbjct: 1357 FGFNAHWINMIKSCISN---CWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADYLS 1413

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +    +      +      P+ SHL +ADDIV+F N  +  ++ ++  L++YE  +GQ
Sbjct: 1414 RGLNHLFSCYSSLQYLSGCQMPI-SHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQ 1472

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            KVN +KS    +   S+ R + I   TGF     P  YLGAP+  G  KV  FD L++K+
Sbjct: 1473 KVNHQKSCFITANGCSLSRRQIISHTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLISKI 1532

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              +++GW+N++LS GGR+ LLR VLSS+P
Sbjct: 1533 RDRISGWENKILSPGGRITLLRSVLSSLP 1561


>gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase);
           Polynucleotidyl transferase, Ribonuclease H fold
           [Medicago truncatula]
          Length = 729

 Score =  153 bits (386), Expect = 5e-35
 Identities = 88/207 (42%), Positives = 124/207 (59%), Gaps = 1/207 (0%)
 Frame = +2

Query: 14  PENFCALIMNCVTTPWFSWFSIMIN-DTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKM 190
           P     +I +C++TP +    IM N D S+ F+ S RG+RQGDPLSPYLF+I  E L  +
Sbjct: 29  PSKLINIIHHCISTPSYK---IMWNGDKSESFYPS-RGIRQGDPLSPYLFVICMERLSHI 84

Query: 191 IKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKV 370
           I  +V     K        P +SHLL+ADD++LFA AS     CV+  L  +   +GQK+
Sbjct: 85  IADQVEADYWKPMRAGRYGPPISHLLFADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKI 144

Query: 371 NFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLES 550
           N EK+ +YFSK V  H   +I+  TGF+Q      YLGA I  GR    HF+ ++ K+++
Sbjct: 145 NREKTQVYFSKNVDNHLREDIIQHTGFNQVNSLGKYLGANITPGRTSRGHFNHIINKIQN 204

Query: 551 KLAGWKNRLLSQGGRLILLRHVLSSMP 631
           KL+GWK + LS  GR+ L + V+SS+P
Sbjct: 205 KLSGWKQQCLSLAGRITLSKFVISSIP 231


>ref|XP_004253275.1| PREDICTED: uncharacterized protein LOC101268853 [Solanum
            lycopersicum]
          Length = 1333

 Score =  151 bits (381), Expect = 2e-34
 Identities = 73/189 (38%), Positives = 115/189 (60%)
 Frame = +2

Query: 65   SWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKMIKLKVNEKKIKLFSHPVN 244
            +W+SI+IN    GFF+SKRGL+QGDPLSP LF++  E+L + + L     + K F     
Sbjct: 563  NWYSIVINGKRHGFFQSKRGLKQGDPLSPALFVLGAEILSRQLNLLYQNHQYKGFHMERK 622

Query: 245  APVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKVNFEKSAIYFSKRVSVHRS 424
             P ++HL +ADDI++F +    +I  +M  ++ YE  + Q+VN EKS    +        
Sbjct: 623  GPKINHLSFADDIIIFTSTDTNSIHIIMKTIELYEAVSDQQVNKEKSFFMVTANTGYDII 682

Query: 425  REILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLESKLAGWKNRLLSQGGRLIL 604
             EI   TGF++   P  YLG P+  G  ++ ++  LV K+  K++GW ++LL+ GG++IL
Sbjct: 683  EEIKTATGFNRKNSPINYLGCPLYSGGQRIIYYSELVEKVIKKISGWHSKLLNFGGKIIL 742

Query: 605  LRHVLSSMP 631
            ++HVL S+P
Sbjct: 743  VKHVLQSIP 751


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  150 bits (380), Expect = 2e-34
 Identities = 75/189 (39%), Positives = 114/189 (60%)
 Frame = +2

Query: 65   SWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKMIKLKVNEKKIKLFSHPVN 244
            +W+SI+IN    GFF SKRGL+QGDPLSP LF++  EV  + + L    +  K F    N
Sbjct: 685  NWYSIVINGKRHGFFHSKRGLKQGDPLSPALFVLGAEVFSRQLSLLYQNQLYKGFHMESN 744

Query: 245  APVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKVNFEKSAIYFSKRVSVHRS 424
             P ++HL +ADDI++F++    ++  +M  + QYE  + QKVN +KS    +   S    
Sbjct: 745  GPKINHLSFADDIIIFSSTDNNSLNLIMKTIDQYEEVSDQKVNKDKSFFMVTSNTSHDII 804

Query: 425  REILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLESKLAGWKNRLLSQGGRLIL 604
             EI   TGFS+   P  YLG P+  G  ++ ++  +V K+  K+AGW  ++L+ GG++ L
Sbjct: 805  EEISRITGFSRKNSPINYLGCPLYVGGQRIIYYSEIVEKVIKKIAGWHLKILNFGGKVTL 864

Query: 605  LRHVLSSMP 631
            ++HVL SMP
Sbjct: 865  VKHVLQSMP 873


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  149 bits (376), Expect = 7e-34
 Identities = 81/209 (38%), Positives = 123/209 (58%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG  + + ++I  C++     WFS++IN +  G+FKS+RGLRQGD +SP LFI+  + L 
Sbjct: 805  FGFNDRWISMIKACISN---CWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAADYLS 861

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + I    +  K   +      P+ S L +ADDIV+F N  +  ++ ++  L++YE   GQ
Sbjct: 862  RGINQLFSHHKSLHYLSGCFMPI-SRLAFADDIVIFTNGCRPALQKILVFLQEYEKMFGQ 920

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            +VN +KS    +   S+ R + I   TGF     P  YLGAP+     KV  FD L+ K+
Sbjct: 921  QVNHQKSCFITANGCSMTRRQIIAHTTGFQHKILPIIYLGAPLHKVPKKVALFDSLITKI 980

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              +++GW+N+ LS GGR+ LLR VLSS+P
Sbjct: 981  RDRISGWENKTLSPGGRITLLRSVLSSLP 1009


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  149 bits (375), Expect = 9e-34
 Identities = 82/208 (39%), Positives = 124/208 (59%)
 Frame = +2

Query: 8    GLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFK 187
            G    +  LIM+CV++  +S+   +IN    G     RGLR GDPLSPYLFI++ +   K
Sbjct: 601  GFDGRWVNLIMSCVSSVSYSF---IINGGVCGSVTPARGLRHGDPLSPYLFILIADAFSK 657

Query: 188  MIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQK 367
            MI+ KV EK++       + PV+SHL +AD  +LF  AS+     +++IL  YE  +GQK
Sbjct: 658  MIQKKVQEKQLHGAKASRSGPVISHLFFADVSLLFTRASRQECAIIVEILNLYEQASGQK 717

Query: 368  VNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLE 547
            +N++KS + FSK VS+ +  E+       Q +    YLG P + GR +   FD L+ ++ 
Sbjct: 718  INYDKSEVSFSKGVSIAQKEELSNILQMKQVERHMKYLGIPSITGRSRTAIFDSLMDRIW 777

Query: 548  SKLAGWKNRLLSQGGRLILLRHVLSSMP 631
             KL GWK +LLS+ G+ ILL+ V+ ++P
Sbjct: 778  KKLQGWKEKLLSRAGKEILLKSVIQAIP 805


>emb|CCA66050.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1357

 Score =  148 bits (373), Expect = 2e-33
 Identities = 81/208 (38%), Positives = 123/208 (59%)
 Frame = +2

Query: 8    GLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFK 187
            G    +  L+M+CV T  +S+   +IN    G     RGLRQGDPLSP+LFI++ +   +
Sbjct: 604  GFDGRWVNLVMSCVATVSYSF---IINGRVCGSVTPSRGLRQGDPLSPFLFILVADAFSQ 660

Query: 188  MIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQK 367
            M+K KV  K+I       N P +SHLL+ADD +LF  A++     ++DIL +YE  +GQK
Sbjct: 661  MVKQKVVSKEIHGAKASRNGPEISHLLFADDSLLFTRATRQECLTIVDILNKYEAASGQK 720

Query: 368  VNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLE 547
            +N+EKS + FS+ VS  +  E++      Q      YLG P + GR K   F  L+ ++ 
Sbjct: 721  INYEKSEVSFSRGVSCEKKEELITLLHMRQVDRHQKYLGIPALCGRSKKVLFRELLDRMW 780

Query: 548  SKLAGWKNRLLSQGGRLILLRHVLSSMP 631
             KL GWK +LLS+ G+ +L++ V+ ++P
Sbjct: 781  KKLRGWKEKLLSRAGKEVLIKAVIQALP 808


>emb|CCA66054.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score =  147 bits (372), Expect = 2e-33
 Identities = 82/208 (39%), Positives = 124/208 (59%)
 Frame = +2

Query: 8    GLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFK 187
            G    +  LIM  V++  +S+   +IN +  G     RGLRQGDPLSPYLFI++ +   K
Sbjct: 601  GFDGRWVNLIMEFVSSVTYSF---IINGSVCGSVVPARGLRQGDPLSPYLFIMVADAFSK 657

Query: 188  MIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQK 367
            MI+ KV +K++       + P +SHL +ADD +LF  A++     ++DIL QYE  +GQK
Sbjct: 658  MIQRKVQDKQLHGAKASRSGPEISHLFFADDSLLFTRANRQECTIIVDILNQYELASGQK 717

Query: 368  VNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLE 547
            +N+EKS + +S+ VSV +  E+       Q      YLG P + GR K   FD L+ ++ 
Sbjct: 718  INYEKSEVSYSRGVSVSQKDELTNILNMRQVDRHEKYLGIPSISGRSKKAIFDSLIDRIW 777

Query: 548  SKLAGWKNRLLSQGGRLILLRHVLSSMP 631
             KL GWK +LLS+ G+ +LL+ V+ ++P
Sbjct: 778  KKLQGWKEKLLSRAGKEVLLKSVIQAIP 805


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  146 bits (369), Expect = 4e-33
 Identities = 81/209 (38%), Positives = 124/209 (59%)
 Frame = +2

Query: 5    FGLPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLF 184
            FG    +  +I  C++     WFS++IN +  G+FKS+RGLRQGD +SP LFI+  E L 
Sbjct: 1444 FGFNALWINMIKACISN---CWFSLLINGSLVGYFKSERGLRQGDSISPSLFILAAEYLS 1500

Query: 185  KMIKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQ 364
            + +  ++  +   L      +  VSHL +ADDIV+F N     ++ ++  L++YE  +GQ
Sbjct: 1501 RGLN-QLFSRYNSLHYLSGCSMSVSHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQ 1559

Query: 365  KVNFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKL 544
            +VN +KS    +    + R + I   TGF     P  YLGAP+  G  KV  FD L++K+
Sbjct: 1560 QVNHQKSCFITANGCPLSRRQIIAQVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLISKI 1619

Query: 545  ESKLAGWKNRLLSQGGRLILLRHVLSSMP 631
              +++GW+N++LS G R+ LLR VLSS+P
Sbjct: 1620 RDRISGWENKILSPGSRITLLRSVLSSLP 1648


>ref|XP_006357717.1| PREDICTED: uncharacterized protein LOC102595469 [Solanum tuberosum]
          Length = 1079

 Score =  145 bits (365), Expect = 1e-32
 Identities = 68/188 (36%), Positives = 113/188 (60%)
 Frame = +2

Query: 65  SWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKMIKLKVNEKKIKLFSHPVN 244
           +W+S+++N     FF+SKRGLRQGDP+SP LF+I  E L   +    N      FS    
Sbjct: 404 NWYSLIVNGNRHDFFQSKRGLRQGDPISPALFVISAEYLSLKLNELNNNTDFSSFSMNKK 463

Query: 245 APVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKVNFEKSAIYFSKRVSVHRS 424
            P ++HL +A+D++LF++  + ++  +M+ L  YE  +GQK+N  KS++  S + +    
Sbjct: 464 GPRINHLAFANDVILFSSGCRRSLDLLMETLNNYERVSGQKINKSKSSVSLSSKENEQAR 523

Query: 425 REILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLESKLAGWKNRLLSQGGRLIL 604
           + +   TG +    P  YLG P+ +GR     F  +++K+  K+ GW+N+ LS GGR++L
Sbjct: 524 QRVQEITGMTYRSLPIKYLGCPLYEGRKDYALFSEMMSKILHKIGGWQNKFLSIGGRVVL 583

Query: 605 LRHVLSSM 628
           ++HVL S+
Sbjct: 584 IKHVLMSL 591


>gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis cebennensis]
          Length = 799

 Score =  145 bits (365), Expect = 1e-32
 Identities = 84/207 (40%), Positives = 120/207 (57%)
 Frame = +2

Query: 11  LPENFCALIMNCVTTPWFSWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKM 190
           LP ++   IM CV  P  +   ++ N      F   RGLRQGDPLSPYLF++  E L   
Sbjct: 53  LPSSWINWIMKCVKEPAMT---VLWNGEKTESFIPSRGLRQGDPLSPYLFVLCLERLCHQ 109

Query: 191 IKLKVNEKKIKLFSHPVNAPVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKV 370
           I L V  K+ K  S     P++SH+ +ADD++LFA AS   IR V  +L+++   +GQKV
Sbjct: 110 IDLAVGTKEWKPISMSRGGPLLSHICFADDLILFAEASVAQIRVVRKVLEKFCIASGQKV 169

Query: 371 NFEKSAIYFSKRVSVHRSREILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLES 550
           + EKS I+FS+ V     + I  E+G    +    YLG P++  R+    F  ++ ++ S
Sbjct: 170 SLEKSKIFFSQNVHRDLEKFISDESGIKSTKELGKYLGMPVLQKRINKDTFGEILLRVSS 229

Query: 551 KLAGWKNRLLSQGGRLILLRHVLSSMP 631
           +LAGWK R+LS  GRL L + VLSS+P
Sbjct: 230 RLAGWKGRMLSLAGRLTLTKSVLSSIP 256


>ref|XP_004243111.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 927

 Score =  144 bits (363), Expect = 2e-32
 Identities = 72/189 (38%), Positives = 119/189 (62%)
 Frame = +2

Query: 65  SWFSIMINDTSKGFFKSKRGLRQGDPLSPYLFIILEEVLFKMIKLKVNEKKIKLFSHPVN 244
           +W+SI+IN    GFF+S RGL+QGDPLS  LFII  EVL K I L  N K  + F+   N
Sbjct: 209 NWYSIVINGKRHGFFQSTRGLKQGDPLSLALFIIGAEVLSKNINLLYNNKVYRGFNMEKN 268

Query: 245 APVVSHLLYADDIVLFANASKGTIRCVMDILKQYEGWTGQKVNFEKSAIYFSKRVSVHRS 424
            P ++HL +ADDI++F +  + ++  +M I++ YE  + ++VN +KS    + + + +  
Sbjct: 269 GPQINHLSFADDIIIFTSTDRRSLNLIMKIIEDYERVSDEQVNKDKSFCMVTSKTNYNII 328

Query: 425 REILMETGFSQGQFPFYYLGAPIVDGRLKVKHFDPLVAKLESKLAGWKNRLLSQGGRLIL 604
            +I + TGF     P  YLG P+  G  ++ +F  +V K+  +++GW++++LS GG+  L
Sbjct: 329 EDIKIVTGFGIKYSPISYLGCPLYIGGQRISYFSEVVEKVIRRISGWQSKILSFGGKDTL 388

Query: 605 LRHVLSSMP 631
           +++VL S+P
Sbjct: 389 VKNVLQSIP 397

