BLASTX nr result

ID: Lithospermum23_contig00002314 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00002314
         (1046 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

EOY34748.1 Uncharacterized protein TCM_042328 [Theobroma cacao]       219   9e-61
EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]       218   5e-60
EOY06960.1 Uncharacterized protein TCM_021522 [Theobroma cacao]       214   2e-58
EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]       210   5e-57
EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]       209   9e-57
XP_011085143.1 PREDICTED: uncharacterized protein LOC105167219 [...   206   1e-55
EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma...   204   3e-55
EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]       203   9e-55
EOY06959.1 Uncharacterized protein TCM_021521 [Theobroma cacao]       203   1e-54
EOY02234.1 Uncharacterized protein TCM_011921 [Theobroma cacao]       202   1e-54
EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]       201   5e-54
EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]       201   7e-54
KZV18919.1 hypothetical protein F511_17825 [Dorcoceras hygrometr...   199   2e-53
EOY34747.1 Uncharacterized protein TCM_042327 [Theobroma cacao]       198   3e-53
EOY25451.1 Uncharacterized protein TCM_016759 [Theobroma cacao]       196   8e-53
KZV46870.1 hypothetical protein F511_08631 [Dorcoceras hygrometr...   187   7e-52
EOY25454.1 Uncharacterized protein TCM_026877 [Theobroma cacao]       179   2e-46
KZV33338.1 hypothetical protein F511_18219 [Dorcoceras hygrometr...   172   4e-46
XP_012858045.1 PREDICTED: uncharacterized protein LOC105977287 [...   177   1e-45
XP_017972650.1 PREDICTED: uncharacterized protein LOC18606969 [T...   168   6e-45

>EOY34748.1 Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  219 bits (557), Expect = 9e-61
 Identities = 114/353 (32%), Positives = 181/353 (51%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----N 169
            CR + P       H S+ W+         E ++ W +G G+  FW D W+   P+     
Sbjct: 431  CRGQLPMQTQPKLHDSQTWKRMLTSSATTEQHMRWRVGQGNLFFWHDCWMGDAPLISSNQ 490

Query: 170  QHAKSGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            +   S  +V +++ NN+WN+ KL+ +++ +      ++ E+ +I +    +D   W P  
Sbjct: 491  EFTSSMVQVCDFFMNNSWNVEKLKTVLQQE------VVDEIAKIPIDTMSKDEAYWTPTP 544

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ++R+ +  + +F  +WH  +P   SF +WRL H W+PVE   + +G  L
Sbjct: 545  NGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQL 604

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSN--TV 703
            AS+C CC + E+I HV + N +A  +W++FA LF +  +    +N +I  W  + +    
Sbjct: 605  ASRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGAWFHSGDYCKP 664

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ ILW LW  RN  KH  L      +  R+  LI  L   + L    WKGD 
Sbjct: 665  GHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDK 724

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
             +A  +GI +       P + SWH+PT G  KLN+DGS K   +A  GGILR+
Sbjct: 725  QIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGILRD 777


>EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  218 bits (556), Expect = 5e-60
 Identities = 113/353 (32%), Positives = 180/353 (50%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----N 169
            CR + P       H S+ W+         E ++ W +G G+  FW D W+   P+     
Sbjct: 1772 CRGQLPMQTQPKLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQ 1831

Query: 170  QHAKSGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            +   S  +V +++ NN+WN+ KL+ +++ +      ++ E+ +I +    +D   W P  
Sbjct: 1832 EFTSSMVQVCDFFTNNSWNIEKLKTVLQQE------VVDEIAKIPIDTMNKDEAYWTPTP 1885

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ++R+ +  + +F  +WH  +P   SF +WRL H W+PVE   + +G  L
Sbjct: 1886 NGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQL 1945

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSN--TV 703
            AS+C CC + E+I HV + N +A  +W++FA LF +  +    +N +I  W  + +    
Sbjct: 1946 ASRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGAWFYSGDYCKP 2005

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ ILW LW  RN  KH  L      +  R+  LI  L   + L    WKGD 
Sbjct: 2006 GHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDK 2065

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
             +A  +GI         P + SWH+P+ G  KLN+DGS K   +A  GGILR+
Sbjct: 2066 QIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRD 2118


>EOY06960.1 Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  214 bits (544), Expect = 2e-58
 Identities = 112/354 (31%), Positives = 182/354 (51%), Gaps = 8/354 (2%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----N 169
            C  + P       H S+ W+      +  E N+ W +G G   FW D W+   P+     
Sbjct: 3023 CGGQLPTHVQPKLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPLVIRNQ 3082

Query: 170  QHAKSGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            + A S  +V +++ NN+W++ KL+ +++ +      +++E+ +I +     D   W P  
Sbjct: 3083 EFASSMAQVSDFFLNNSWDIEKLKSVLQQE------VVEEIAKIPINASSNDRAYWTPTP 3136

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ+ R+ +  +  +  +WH  +P   SF +WRL H W+PVE   + +G  L
Sbjct: 3137 NGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQL 3196

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHW--SLNSNTV 703
            AS+C CC + E++ HV + N +A  +W +FA +F +  +    +NH+I  W  S + +  
Sbjct: 3197 ASRCRCCKSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISAWFYSGDYSKP 3256

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ ILW LW  RN  KH  L      I  +I  LI  L + + LQ   W+GD 
Sbjct: 3257 GHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDK 3316

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGS--YKLGDAGLGGILRN 1039
             +A  +GI +       P +L W++P+ G  KLN+DGS  Y L  A  GG+LR+
Sbjct: 3317 QIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRD 3370



 Score =  203 bits (516), Expect = 1e-54
 Identities = 110/353 (31%), Positives = 181/353 (51%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK 181
            C  + P       H S++W+   V R+ A  N+ W +G G+  FW D W+   P+     
Sbjct: 1229 CLGRIPHLVQPKLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFP 1288

Query: 182  SG----TKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            S     + V +++  + W++ KL      ++   + ++ E+LQI     +ED+  W    
Sbjct: 1289 SFHNDMSHVHKFYNGDEWDIVKL------NSYLPTSLVDEILQIPFDRSQEDVAYWALTS 1342

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AW+++RQ +  +++ +  WH  IP  +SF +WR+ + W+PVE   + +G  L
Sbjct: 1343 NGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHL 1402

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV-- 703
            ASKCVCC + E++ HV + N +A  +W+ FA  F +     K+++ +I  W  + +    
Sbjct: 1403 ASKCVCCRSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRN 1462

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ I W LW  RN  KH  +      +  RI  L+  L    LL+   WKGD 
Sbjct: 1463 GHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDT 1522

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
            ++A  +G K    +   P I+SW +P  G  KLN+DGS K   +A  GG+LR+
Sbjct: 1523 DIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAGGGVLRD 1575


>EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  210 bits (534), Expect = 5e-57
 Identities = 112/353 (31%), Positives = 178/353 (50%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI---NQ 172
            CR + P       H S+ W+         E N+ W +G G   FW D W+   P+   NQ
Sbjct: 1770 CRGQLPMHTQPKLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTSSNQ 1829

Query: 173  HAK-SGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
                S  +V +++ NN+W++ KL+ +++ +      ++ E+ +I +    +D   W P  
Sbjct: 1830 ELSLSMVQVCDFFMNNSWDIEKLKTVLQQE------VVDEIAKIPIDAMSKDEAYWAPTP 1883

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ++R+    + +F  +WH  +P  +SF +WRL H W+PVE   + +G  L
Sbjct: 1884 NGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQL 1943

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSN--TV 703
            AS+C CC + E+I HV + N +A  +W++F+  F +  +    +N ++  W  + +    
Sbjct: 1944 ASRCRCCKSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWFYSGDYCKP 2003

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+  LW LW  RN  KH  L      I  RI  LI  L   + L    WKGD 
Sbjct: 2004 GHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDK 2063

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
             +A  +GI         P +  WH+P+ G  KLN+DGS KL  +A  GG+LR+
Sbjct: 2064 QIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRD 2116


>EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  209 bits (532), Expect = 9e-57
 Identities = 111/354 (31%), Positives = 182/354 (51%), Gaps = 8/354 (2%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI---NQ 172
            C  + P       H S+ W+      +  E N+ W +G G+  FW D W+   P+   NQ
Sbjct: 1735 CGGQLPTDVQPKLHDSQTWKRMVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNRNQ 1794

Query: 173  -HAKSGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
              A S  +V +++ NN+WN+ KL+ +++ +      +++E+++I +     D   W    
Sbjct: 1795 AFASSMAQVSDFFLNNSWNVEKLKTVLQQE------VVEEIVKIPIDTSSNDKAYWTTTP 1848

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ++R  +  + +F  +WH  +P   SF +WRL H W+PVE   + +G  L
Sbjct: 1849 NGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQL 1908

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHW--SLNSNTV 703
            AS+C CC + E++ HV + N +A  +W +FA +F +  +    +N +I  W  S + +  
Sbjct: 1909 ASRCRCCKSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKP 1968

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+  LW LW  RN  KH  L      +  +I  L+  L + + LQ   W+GD 
Sbjct: 1969 GHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDK 2028

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG--DAGLGGILRN 1039
             +A  +GI +       P +L W +P+ G LKLN+DGS K     A  GG+LR+
Sbjct: 2029 QIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRD 2082


>XP_011085143.1 PREDICTED: uncharacterized protein LOC105167219 [Sesamum indicum]
          Length = 1203

 Score =  206 bits (523), Expect = 1e-55
 Identities = 108/351 (30%), Positives = 175/351 (49%), Gaps = 6/351 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK 181
            C   HP  A ++   S  W+     R +A+  + W LG G   FW D+W+   P+ +   
Sbjct: 852  CTGSHPVPAKLSYIASPNWKRMCRHRKEADRQIFWSLGKGHISFWFDNWIGEKPLFEIMP 911

Query: 182  ----SGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
                + T V  YW NN+WN++KL+ ++       +DM+ ++ QI    D  D  LWK   
Sbjct: 912  DFEWNTTPVNNYWENNSWNVAKLREVL------TADMVHQICQIPFDVDTSDTPLWKLSG 965

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                     W  +RQ R    +   +W   +   MS  +WRL +  LPV++  Q +G  L
Sbjct: 966  DGIFSMKATWNSLRQTRATQQLVKEIWSPFVTPTMSVFMWRLINDKLPVDEKLQKKGIQL 1025

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTVGH 709
            ASKC CC++ E++ HVF        +W+HFA  F +      N+  ++ +W +++    H
Sbjct: 1026 ASKCSCCNHVESLQHVFIEGNGIRCVWEHFARKFNMNLPNTDNIVLLLNYWRISALGQNH 1085

Query: 710  IRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNV 889
            IR ++P++ILW  W  RN  KH    +    IK ++   I+   K++  +  +WKGD  V
Sbjct: 1086 IRMIVPMLILWFGWLERNDVKHRNKNFNSDRIKWKVHQHIVTTFKSKTTKRINWKGDRFV 1145

Query: 890  AVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYK--LGDAGLGGILR 1036
            A   G+++   ++ +  I+ W +P  G +K+N DG+ K   G AG GGI R
Sbjct: 1146 AKFMGLELGSQYKPKIKIVKWTKPELGWIKINTDGASKGNPGRAGAGGIAR 1196


>EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  204 bits (520), Expect = 3e-55
 Identities = 107/341 (31%), Positives = 181/341 (53%), Gaps = 8/341 (2%)
 Frame = +2

Query: 41   HHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----NQHAKSGTKVKEYW 208
            H S  W+     R  A   + W +G GD  FW D+W+   P+       ++S  KV  ++
Sbjct: 869  HDSATWKPLLAGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQSMMKVNYFF 928

Query: 209  ANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMV 388
             ++ W++ KL+  + +       +++E+L+I +  ++EDI  W            AW+++
Sbjct: 929  NDDAWDVDKLKTFIPNA------IVEEILKIPISREKEDIAYWALTANGDFSIKSAWELL 982

Query: 389  RQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETI 568
            RQ +  + +  L+WH  IP  +SF +WR  H WLPVE   + +G  LASKC+CC + E++
Sbjct: 983  RQRKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSEESL 1042

Query: 569  SHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMPVIILW 742
             HV + + +A  +W++F+  F +     +N+  ++  W  + +    GHIR ++ + I W
Sbjct: 1043 LHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFW 1102

Query: 743  VLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKP 922
             +W  RN  KH +L      I  RI  ++  L +  LL    WKGD ++A+ +G    + 
Sbjct: 1103 FVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQE 1162

Query: 923  FQIRPSILSWHRPTQGNLKLNIDGSYK--LGDAGLGGILRN 1039
             Q RP I++W +P  G LKLN+DGS K    +A  GG+LR+
Sbjct: 1163 RQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRD 1203


>EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  203 bits (517), Expect = 9e-55
 Identities = 108/342 (31%), Positives = 176/342 (51%), Gaps = 9/342 (2%)
 Frame = +2

Query: 41   HHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPIN----QHAKSGTKVKEYW 208
            H S +W+     R  A  N+ W +G GD  FW D W+   P+     +     +    ++
Sbjct: 1662 HDSHVWKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQNDMSHGYHFY 1721

Query: 209  ANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMV 388
              + W++ KL+  + +       +++E+LQ+     RED+  W            AW+M+
Sbjct: 1722 NGDTWDVDKLRSFLPTI------LVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMI 1775

Query: 389  RQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETI 568
            RQ +  +++ + +WH  IP  +SF +W+  H W+PVE   + +G  LASKCVCC++ E++
Sbjct: 1776 RQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSEESL 1835

Query: 569  SHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMPVIILW 742
             HV + N +A  +W+ FA LF +     ++V+ +I  W ++ + V  GH R ++P+ I W
Sbjct: 1836 IHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICW 1895

Query: 743  VLWEFRNKTKHGEL-TYEFQYIKKRIDHL-IIYLGKTELLQYKHWKGDFNVAVAFGIKVY 916
             LW  RN  KH     Y  + I + + H   +Y G   LLQ   WKGD ++A   G    
Sbjct: 1896 FLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDG--SLLQQWQWKGDTDIATMLGFSFT 1953

Query: 917  KPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
                  P I+ W +P+ G  KLN+DGS + G  A  GG+LR+
Sbjct: 1954 HKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRD 1995


>EOY06959.1 Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  203 bits (516), Expect = 1e-54
 Identities = 108/340 (31%), Positives = 176/340 (51%), Gaps = 7/340 (2%)
 Frame = +2

Query: 41   HHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAKSG----TKVKEYW 208
            H S++W+   V R+ A  N+ W +G G+  FW D W+   P+     S     + V +++
Sbjct: 1485 HDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFHNDMSHVHKFY 1544

Query: 209  ANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMV 388
              + W++ KL   + +       ++ E+LQI     +ED+  W            AW+ +
Sbjct: 1545 NGDVWDIEKLSSCLPTS------LVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAI 1598

Query: 389  RQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETI 568
            RQ +  +++F+L+WH  IP  +SF +WR+ + W+PVE   + +G  LASKCVCC + E++
Sbjct: 1599 RQRQTPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESL 1658

Query: 569  SHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMPVIILW 742
             HV + N +A  +W  FA  F +      +++ +I  W  + +    GHIR ++P+ I W
Sbjct: 1659 IHVLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICW 1718

Query: 743  VLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKP 922
             LW  RN  KH  +      +  RI  L+  L    LL+   WKGD ++A  +G K    
Sbjct: 1719 FLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPK 1778

Query: 923  FQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
            +   P I+ W +P  G  KLN+DGS K   +A  GG+LR+
Sbjct: 1779 YCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAGGGVLRD 1818


>EOY02234.1 Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  202 bits (513), Expect = 1e-54
 Identities = 104/340 (30%), Positives = 176/340 (51%), Gaps = 7/340 (2%)
 Frame = +2

Query: 41   HHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----NQHAKSGTKVKEYW 208
            H+S IW+     R+    N  W +G G+  FW D W+   P+           + V +++
Sbjct: 461  HNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSLVHKFY 520

Query: 209  ANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMV 388
              ++W++ KL+  +  +      ++ E+L I     ++D+  W            AW+ +
Sbjct: 521  KGDSWDVDKLRLFLPVN------LVDEILLIPFDRTQQDVAYWILTSNGEFSTRSAWETI 574

Query: 389  RQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETI 568
            R+ +P +++ +L+WH  IP  +SF +WR  + W+PVE   + +G  LASKCVCC++ E++
Sbjct: 575  RKRQPHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCCNSEESL 634

Query: 569  SHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMPVIILW 742
             HV + N +A  +W  FA+ F +     ++V+H++  W  + + V  GHIR ++P+ I W
Sbjct: 635  MHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHILWAWFYSGDYVKRGHIRTLLPIFICW 694

Query: 743  VLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKP 922
             LW  RN  KH         +  RI  L+  L    LLQ   WKGD ++A  +   +   
Sbjct: 695  FLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLK 754

Query: 923  FQIRPSILSWHRPTQGNLKLNIDGSYKLGD-AGLGGILRN 1039
             +  P I+ W +P+ G  KLN+DGS + G  A  GG+LR+
Sbjct: 755  LRAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGVLRD 794


>EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  201 bits (511), Expect = 5e-54
 Identities = 112/354 (31%), Positives = 184/354 (51%), Gaps = 8/354 (2%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQ--- 172
            C  + P   +   H S++W+     R  A  N  W +G G   FW D W+   P+     
Sbjct: 1475 CMGQIPHYVHPKLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFP 1534

Query: 173  HAKSG-TKVKEYWANNNWNLSKLQHLVESDNLF-NSDMLQEMLQITLYPDREDILLWKPX 346
            H ++  + V  ++  +NW++ KL       NL+   +++ E+LQI +   ++D+  W   
Sbjct: 1535 HFRNDMSTVHNFFNGHNWDVDKL-------NLYLPMNLVDEILQIPIDRSQDDVAYWSLT 1587

Query: 347  XXXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTS 526
                     AW+ +R  +  + + +L+WH  IP  +SF +WR+FH W+PV+   + +G  
Sbjct: 1588 SNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFH 1647

Query: 527  LASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV- 703
            LASKC+CC++ E++ HV + N IA  +W+ FA+ F +     +NV+ ++  W L+ + V 
Sbjct: 1648 LASKCICCNSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQILWTWYLSGDYVR 1707

Query: 704  -GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGD 880
             GHIR ++P+ I W LW  RN  KH  L      +  +I  L+  L    LL+   WKGD
Sbjct: 1708 KGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGD 1767

Query: 881  FNVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLGD-AGLGGILRN 1039
             + A  +G+      +  P IL W +P  G  KLN+DGS +    A +GG+LR+
Sbjct: 1768 KDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQNQTAAIGGVLRD 1821


>EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  201 bits (510), Expect = 7e-54
 Identities = 105/340 (30%), Positives = 174/340 (51%), Gaps = 7/340 (2%)
 Frame = +2

Query: 41   HHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----NQHAKSGTKVKEYW 208
            H S IW+     R+    N  W +G G+  FW D W+   P+           + V +++
Sbjct: 1749 HSSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSFVHKFY 1808

Query: 209  ANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMV 388
              ++W++ KL+  +  +      ++ E+L I     ++D+  W            AW+ +
Sbjct: 1809 KGDSWDVDKLRLFLPVN------LIYEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETI 1862

Query: 389  RQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETI 568
            RQ++  +++ +L+WH  IP  +SF +WR  + W+PVE   + +G  LASKCVCC++ E++
Sbjct: 1863 RQQQSHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCCNSEESL 1922

Query: 569  SHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMPVIILW 742
             HV + N +A  +W  FA  F +  +  K+V+H++  W  + + V  GHIR ++P+ I W
Sbjct: 1923 MHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICW 1982

Query: 743  VLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKP 922
             LW  RN  K+         I  RI  L+  L    LLQ   WKGD ++A  +       
Sbjct: 1983 FLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLK 2042

Query: 923  FQIRPSILSWHRPTQGNLKLNIDGSYKLGD-AGLGGILRN 1039
             +  P I+ W +P+ G  KLN+DGS + G  A  GG+LR+
Sbjct: 2043 LRAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGVLRD 2082


>KZV18919.1 hypothetical protein F511_17825 [Dorcoceras hygrometricum]
          Length = 1288

 Score =  199 bits (507), Expect = 2e-53
 Identities = 105/349 (30%), Positives = 169/349 (48%), Gaps = 7/349 (2%)
 Frame = +2

Query: 17   PATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----NQHAKS 184
            PA  ++    S  WR     R  AE ++ W +G GD  FW D WL SGP+    +     
Sbjct: 813  PAAVDLKVRISPNWRRLIKIRQLAESHICWSIGRGDLSFWYDLWLPSGPLYTLCDIVGPK 872

Query: 185  GTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXX 364
              KV        WN ++L+ L+       +D+++++LQ+ + P   D L+WKP       
Sbjct: 873  DRKVAWLIDEGRWNRARLELLI------GADLIEQVLQVPISPFMVDRLIWKPSSHGKFS 926

Query: 365  XXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCV 544
               AW+++R       IF   W   +   MS  VWR   + +P + + Q RG ++ SKC 
Sbjct: 927  SKSAWELLRHRDLTKDIFTACWSKLLTPTMSLFVWRWIQRKIPTDDVLQSRGVAMGSKCQ 986

Query: 545  CCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNS--NTVGHIRQ 718
            CC   E+  HVFFT+ IA ++W HF ++ G+ +         + +W L +     GHI++
Sbjct: 987  CCAQEESFDHVFFTSHIAFHVWSHFGNILGIQQAT------QVFNWRLENLWKLSGHIQE 1040

Query: 719  VMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVA 898
             +P +ILW LW  RN +KH  +      + + I + I     + LL+ +HWKG   +A  
Sbjct: 1041 CIPFLILWFLWTGRNDSKHRNIKLRSAAVIRHIRYYIFAASSSGLLKLEHWKGCLALAQE 1100

Query: 899  FGIKVYKPFQIRPSILSWHRPTQGNLKLNIDG-SYKLGDAGLGGILRNC 1042
            F +++    +   +I+ W +P     KLN DG     G    GG++R+C
Sbjct: 1101 FNVRIKGFRRTSIAIIKWTKPPSHWFKLNTDGCRSNQGMISSGGLIRDC 1149


>EOY34747.1 Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  198 bits (504), Expect = 3e-53
 Identities = 108/353 (30%), Positives = 181/353 (51%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK 181
            C  + P       H S++W+     R+ A  N  W +G G+  FW D W+ + P+     
Sbjct: 535  CMGQIPHYVQSKLHDSQVWKRMVRGRDVAIQNTRWRIGKGNLFFWHDCWMGNKPLVTSFP 594

Query: 182  SG----TKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            S     T V +++  +NW+++ L+  +  +      ++ E+LQI     ++DI  W    
Sbjct: 595  SFRNDMTFVHKFYNGDNWDVNTLKLYLPMN------LIDEILQIPFDRSQDDIAYWALTS 648

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AW+ VRQ +  +++ + +WH  IP  +SF +WR+ + W+PVE   + +G  L
Sbjct: 649  DGEFSTWSAWEAVRQRQSPNTLCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHL 708

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV-- 703
            ASKCVCC++ E++ HV + N +A  +W+ FA  F +     ++V+ +I  W  + + V  
Sbjct: 709  ASKCVCCNSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRK 768

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ I W LW  RN  KH  L      +  +I  ++  L    LL+   WKGD 
Sbjct: 769  GHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDT 828

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
            ++A  +G  +    +  P I+ W +P  G  KLN+DGS +    A  GG+LR+
Sbjct: 829  DIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQSAATGGLLRD 881


>EOY25451.1 Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  196 bits (499), Expect = 8e-53
 Identities = 106/353 (30%), Positives = 174/353 (49%), Gaps = 7/353 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK 181
            C  + P   +   H S +W+     R  A  N+ W +G GD  FW D W+ + P+     
Sbjct: 400  CLGRIPHYVHPKLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPLVMSFP 459

Query: 182  S----GTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            S     + V  ++  + W++ KL+  +  +      ++ E+L I     ++D+  W    
Sbjct: 460  SLRNDMSLVHNFYNGDTWDVDKLKAYLPMN------LIDEILLIPFNRTQQDVAYWTLTS 513

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AW+ +RQ +  +++ + +WH  IP  +SF +WR  + W+PVE   + +G  L
Sbjct: 514  NGEFATWSAWETIRQRKSSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQL 573

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV-- 703
            ASKCVCC++ E++ HV + N +A  +W  F   F +  +  ++V+ ++  W  + + V  
Sbjct: 574  ASKCVCCNSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKK 633

Query: 704  GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDF 883
            GHIR ++P+ I W LW  RN  KH         +  RI  L+  L    LL    WKGD 
Sbjct: 634  GHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDT 693

Query: 884  NVAVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLGD-AGLGGILRN 1039
            ++A  +G       +  P I+ W +P  G  KLN+DGS + G  A  GGILR+
Sbjct: 694  DIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRNGHLAASGGILRD 746


>KZV46870.1 hypothetical protein F511_08631 [Dorcoceras hygrometricum]
          Length = 459

 Score =  187 bits (475), Expect = 7e-52
 Identities = 104/328 (31%), Positives = 164/328 (50%), Gaps = 8/328 (2%)
 Frame = +2

Query: 86   AEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK----SGTKVKEYWANNNWNLSKLQ-HLV 250
            AE  + W++G GD  FW DSWL +GP++   +       KV  +     WN  +L+ +L+
Sbjct: 7    AESQIRWNIGKGDLSFWYDSWLDAGPLHTLCEIVGPKDRKVDWFLEQGGWNKDRLELYLI 66

Query: 251  ESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMVRQERPFSSIFALVW 430
             S       +L++++Q  + P  +D L+WKP           W++VR       IF   W
Sbjct: 67   PS-------VLEQVIQTPISPYLDDRLIWKPTSHGRFTSKSVWELVRHRNTPRDIFTACW 119

Query: 431  HSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETISHVFFTNKIAANIW 610
               +   MS   WR   + +P + I + RG +LASKC CC   ET+ H+FF++ +A  +W
Sbjct: 120  SRILSPTMSLFAWRWTQRKIPTDDILKSRGVALASKCQCCEQEETLDHLFFSSSVATQVW 179

Query: 611  DHFAHLFGLPRVLFKNVNHVILHWSLNSN--TVGHIRQVMPVIILWVLWEFRNKTKHGEL 784
             HF  LFG+ +    +      +W +N N    GH+R+ +P +ILW LW  RN +KH  +
Sbjct: 180  THFGSLFGVAQPKQAS------NWKININWRARGHLRECIPFLILWFLWIGRNDSKHRLI 233

Query: 785  TYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKPFQIRPSILSWHRPT 964
                  I +RI + I     + LL+ +HW+G   +A  F ++V    +   S + W +P 
Sbjct: 234  CLRPAVIIRRIRYYIFTAASSGLLKAEHWQGVHALAQNFLVQVRGIRRTTVSTIYWIKPP 293

Query: 965  QGNLKLNIDGS-YKLGDAGLGGILRNCQ 1045
                KLN DGS    G    GG++R+ Q
Sbjct: 294  TTWFKLNTDGSRSNQGMTSTGGLVRDSQ 321


>EOY25454.1 Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  179 bits (455), Expect = 2e-46
 Identities = 101/351 (28%), Positives = 161/351 (45%), Gaps = 5/351 (1%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPI----N 169
            CR + P       H S+ W+         E N+ W +G G+  FW D W+   P+    +
Sbjct: 1942 CRGQLPMHTQPKLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGETPLISSNH 2001

Query: 170  QHAKSGTKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
            + + S  +V +++ NN+W++ KL+ +++ +      ++ E+ +I +    +D   W P  
Sbjct: 2002 EFSLSMVQVCDFFMNNSWDIEKLKTVLQQE------VVDEIAKIPIDAMSKDEAYWAPTP 2055

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AWQ++R+    + +F  +WH  IP   SF +WRL H W+PVE   + +G  L
Sbjct: 2056 NGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQL 2115

Query: 530  ASKCVCCHNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTVGH 709
            AS+C CC + E+I HV + N +A                                   GH
Sbjct: 2116 ASRCRCCRSEESIIHVMWDNPVAVQ--------------------------------PGH 2143

Query: 710  IRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNV 889
            IR ++P+  LW LW  RN  KH  L                     +LL+++ WKGD  +
Sbjct: 2144 IRTLIPIFTLWFLWVERNDAKHRNL-------------------GQQLLEWQ-WKGDKQI 2183

Query: 890  AVAFGIKVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
            A  +GI         P +  WH+P+ G  KLN+DGS KL  +A  GG+LR+
Sbjct: 2184 AQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAGGGVLRD 2234


>KZV33338.1 hypothetical protein F511_18219 [Dorcoceras hygrometricum]
          Length = 459

 Score =  172 bits (436), Expect = 4e-46
 Identities = 97/326 (29%), Positives = 161/326 (49%), Gaps = 6/326 (1%)
 Frame = +2

Query: 77   RNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK----SGTKVKEYWANNNWNLSKLQ- 241
            +N AE ++ W +G GD  FW D+WLS GP+    +       K+  +  +  WN  +L+ 
Sbjct: 4    KNLAEGHIRWSIGKGDISFWYDAWLSDGPLFNRCEIIGPKERKIDWFIEHGCWNRDRLEL 63

Query: 242  HLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXXAWQMVRQERPFSSIFA 421
            HL  +       ML ++L   + P  +D L+WKP          AW+++R       I+ 
Sbjct: 64   HLYPA-------MLDQVLLTPISPYLDDRLIWKPSSHGNFTTRSAWELIRLRNTPRDIYT 116

Query: 422  LVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCHNTETISHVFFTNKIAA 601
              W   +   MS   WR     +P + I ++RG  LASKC CC + E++ H+FF+  +A 
Sbjct: 117  ACWSKILSPTMSLFDWRCTQHKIPTDDILKIRGVPLASKCQCCDHEESLEHLFFSGSVAI 176

Query: 602  NIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTVGHIRQVMPVIILWVLWEFRNKTKHGE 781
             +W+HF  +FG+ +    +   +   W     + GHIR+ MP +ILW +   RN +KH  
Sbjct: 177  RVWEHFGRIFGVQQASQISNWRIFNSW----RSRGHIRECMPFLILWFICIGRNDSKHRG 232

Query: 782  LTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGIKVYKPFQIRPSILSWHRP 961
            +      I ++I + I     + L++++HW+G  ++A  F + +    +I  S +SW +P
Sbjct: 233  IMIRPAAIIRKIRYYITTAFTSGLMKHEHWQGLQSLARNFDVVLRGFKRITVSTISWIKP 292

Query: 962  TQGNLKLNIDG-SYKLGDAGLGGILR 1036
                 KLN DG     G    GG++R
Sbjct: 293  PDPFYKLNSDGCRSNTGMISTGGLIR 318


>XP_012858045.1 PREDICTED: uncharacterized protein LOC105977287 [Erythranthe guttata]
          Length = 1237

 Score =  177 bits (448), Expect = 1e-45
 Identities = 106/374 (28%), Positives = 176/374 (47%), Gaps = 29/374 (7%)
 Frame = +2

Query: 2    CRLKHPATANINKHHSKIWRGFAVFRNQAEDNLTWHLGTGDKDFWIDSWLSSGPINQHAK 181
            CR + P ++ ++  HS +W+     R + +  + W +G G   FW D W   GP++    
Sbjct: 732  CRNRFPGSSVVSSLHSTVWKRMCRVRERVQAQIFWRIGPGHVYFWHDHWFGDGPLSGIID 791

Query: 182  SG----TKVKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXX 349
             G     +V+ Y  N  W+ +KL   +  +       +  +  + +     D+ +W+   
Sbjct: 792  GGRLTSVRVEYYLVNGQWDRNKLAEDIPFE------WIDRICSVPISGASGDLPIWRASS 845

Query: 350  XXXXXXXXAWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSL 529
                    AW ++RQ+   + +  + W S +   +S  +WRL  + LPV+   Q RGTSL
Sbjct: 846  DGKFSLTSAWALIRQQHTPTPLLRIFWGSCLTPTISIFLWRLLLQRLPVDTKLQSRGTSL 905

Query: 530  ASKCVCC-----------------HNTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKN 658
            AS+C CC                  + E+I H+F  +  A  +W HF +LFG       +
Sbjct: 906  ASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHTTH 965

Query: 659  VNHVILHWS-LNSNTV---GHIRQVMPVIILWVLWEFRNKTKHGELTYEFQYIKKRIDHL 826
            +  ++L+W    S+T+    HI  ++P +ILW LW  RN +KH ++T     I  R+   
Sbjct: 966  IPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVIQH 1025

Query: 827  IIYLGKTELLQYKHWKGDFNVAVAFGI--KVYKPFQIRPSILSWHRPTQGNLKLNIDGSY 1000
            I  L +T LL    W G  +VA + G+  +V  P  +RP  + W  P  G +KLN DG+ 
Sbjct: 1026 IKILHQTNLLSADSWTGIPHVAESLGLYYRVRTP-TLRPHRVVWLPPDPGWVKLNTDGAR 1084

Query: 1001 KLGD--AGLGGILR 1036
            +     A +GGI+R
Sbjct: 1085 RASTQIAAIGGIIR 1098


>XP_017972650.1 PREDICTED: uncharacterized protein LOC18606969 [Theobroma cacao]
          Length = 431

 Score =  168 bits (426), Expect = 6e-45
 Identities = 89/285 (31%), Positives = 150/285 (52%), Gaps = 3/285 (1%)
 Frame = +2

Query: 194  VKEYWANNNWNLSKLQHLVESDNLFNSDMLQEMLQITLYPDREDILLWKPXXXXXXXXXX 373
            V +++  + W++ KL      ++   + ++ E+LQI     +ED+  W            
Sbjct: 20   VHKFYNGDVWDIEKL------NSCLPTSLVDEILQIPFDRSQEDVAYWALTSNGEFSFWS 73

Query: 374  AWQMVRQERPFSSIFALVWHSHIPKKMSFLVWRLFHKWLPVEQIQQMRGTSLASKCVCCH 553
            AW+ +RQ +  +++F+ +WH  IP  +SF +WR+ + W+PVE   + +G  LASKCVCC 
Sbjct: 74   AWEAIRQRQTPNALFSFIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCR 133

Query: 554  NTETISHVFFTNKIAANIWDHFAHLFGLPRVLFKNVNHVILHWSLNSNTV--GHIRQVMP 727
            + E++ HV + N +A  +W  FA  F +      +++ +I  W  + +    GHIR ++P
Sbjct: 134  SEESLIHVLWENPVAKQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIP 193

Query: 728  VIILWVLWEFRNKTKHGELTYEFQYIKKRIDHLIIYLGKTELLQYKHWKGDFNVAVAFGI 907
            + I W LW  RN  KH  +      +  RI  L+  L    LL+   WKGD ++A  +G 
Sbjct: 194  LFICWFLWLERNDAKHRHMGMYPDRVIWRIMKLLNQLYAGSLLKRWQWKGDTDIATMWGF 253

Query: 908  KVYKPFQIRPSILSWHRPTQGNLKLNIDGSYKLG-DAGLGGILRN 1039
            K    +   P I+ W +P+ G  KLN+ GS +   +A  GG+LR+
Sbjct: 254  KFPPKYYTSPQIIYWIKPSIGEYKLNVYGSSESNQNAAGGGVLRD 298

