BLASTX nr result

ID: Mentha23_contig00012186 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00012186
         (736 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   102   1e-19
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   102   1e-19
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   102   1e-19
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...   102   2e-19
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   101   2e-19
ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobrom...   100   9e-19
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    99   2e-18
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    99   2e-18
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    98   3e-18
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    98   3e-18
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...    97   7e-18
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    91   4e-16
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    90   7e-16
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    88   3e-15
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    88   3e-15
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    84   7e-14
ref|XP_007010293.1| Uncharacterized protein TCM_043836 [Theobrom...    83   9e-14
ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobrom...    81   4e-13
ref|XP_007052624.1| Uncharacterized protein TCM_005952 [Theobrom...    80   6e-13
ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobrom...    80   7e-13

>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  102 bits (255), Expect = 1e-19
 Identities = 68/232 (29%), Positives = 101/232 (43%), Gaps = 3/232 (1%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +    + ++ ++++ L  L   K+L   QW                  P  
Sbjct: 1986 ERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSP 2045

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               +FW  P    +KL+ DGS       AAGGGL+RDH  S++  FS       S  AEL
Sbjct: 2046 PKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAEL 2105

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A++ GLL+  +H+ S +WIE               GS                +  RIS
Sbjct: 2106 MALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRIS 2165

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQ--LGYPNFR 689
            HI REGN+ AD+++  GH  Q +     S A      ++R+++  L Y  F+
Sbjct: 2166 HIFREGNQAADHLSNQGHTHQNLQVI--SQAEGQLRGILRLEKINLAYVRFK 2215


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  102 bits (255), Expect = 1e-19
 Identities = 69/230 (30%), Positives = 104/230 (45%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  K+R        I+ ++++ LR L     L   QW   +       Y+  ++    
Sbjct: 1987 ERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAP 2046

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               V+WR P     KL+ DGS   GQ  AA GG++RDH   L+  FS  +   +S  AEL
Sbjct: 2047 PQIVYWRKPSTGEYKLNVDGSSRHGQ-HAASGGVLRDHTGKLIFGFSENIGTCNSLQAEL 2105

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + H   +WIE             + GS                +  RIS
Sbjct: 2106 RALLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRIS 2165

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HIHREGN+ AD+++  GH  Q +  F  + A      ++++D+L  P  R
Sbjct: 2166 HIHREGNQVADFLSNEGHNHQNLHVF--TEAQGKLHGMLKLDRLNLPYVR 2213


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
            gi|508710337|gb|EOY02234.1| Uncharacterized protein
            TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  102 bits (254), Expect = 1e-19
 Identities = 69/231 (29%), Positives = 104/231 (45%), Gaps = 1/231 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR        ++ ++++ LR L     L   QW   +       Y+  ++    
Sbjct: 699  ERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAP 758

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               V+WR P     KL+ DGS   GQ  AA GG++RDH   L+  FS  +   +S  AEL
Sbjct: 759  PQIVYWRKPSTGEYKLNVDGSSRHGQ-HAASGGVLRDHTGKLIFGFSENIGNCNSLQAEL 817

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + H   +WIE             + GS                +  RIS
Sbjct: 818  RALLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRIS 877

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFRL 692
            HI REGN+ AD+++  GH  Q +  F  + A      ++++D+L  P  RL
Sbjct: 878  HILREGNQVADFLSNEGHNHQNLRVF--TEAQGKLHGMLKLDRLNLPYVRL 926


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  102 bits (253), Expect = 2e-19
 Identities = 65/230 (28%), Positives = 102/230 (44%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR        +I + ++H R L     L   QW   +     + +S P ++   
Sbjct: 904  ERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFPPQQHAS 963

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               ++W+ P     KL+ DGS  R  + AA GG++RDH   L+  FS  +   +S  AEL
Sbjct: 964  PQIIYWKKPSIGEYKLNVDGS-SRNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAEL 1022

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + H   +WIE             + G                    R+S
Sbjct: 1023 RALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLS 1082

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            H  REGN+ ADY++  GH+ Q +  F  + A      ++++D+L  P  R
Sbjct: 1083 HTFREGNKAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLDRLNLPYVR 1130


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  101 bits (252), Expect = 2e-19
 Identities = 66/230 (28%), Positives = 103/230 (44%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR     A  +I + ++H R L     L   QW   +     + +S   ++   
Sbjct: 1900 ERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAP 1959

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               ++W+ P     KL+ DGS  R  + AA GG++RDH   L+  FS  +   +S  AEL
Sbjct: 1960 PQIIYWKKPSIGEYKLNVDGS-SRNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAEL 2018

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + H   +WIE             + G                    R+S
Sbjct: 2019 RALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLS 2078

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HI REGN+ ADY++  GH+ Q +  F  + A      ++++D+L  P  R
Sbjct: 2079 HILREGNQAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLDRLNLPYVR 2126


>ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
            gi|508778191|gb|EOY25447.1| Uncharacterized protein
            TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 99.8 bits (247), Expect = 9e-19
 Identities = 64/212 (30%), Positives = 96/212 (45%), Gaps = 1/212 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR        ++ +++  LR L     L   QW   +       Y+  +++   
Sbjct: 903  ERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDIAAMWRYNFQLKQRAP 962

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               V+WR P     KL+ DGS   GQ  AA GG++RDH + L+  FS  +   +S  AEL
Sbjct: 963  PQIVYWRKPFTGEYKLNVDGSSRNGQ-HAASGGVLRDHTSKLIFCFSENIGTYNSLQAEL 1021

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A++ GLL+  + H   +WIE             + GS                +  RIS
Sbjct: 1022 RALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLESIKKCLNSISYRIS 1081

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAP 635
            HI REGN+ AD+++  GH  Q +  F  +  P
Sbjct: 1082 HIFREGNQAADFLSNEGHNHQNLRVFTKAQGP 1113


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 67/232 (28%), Positives = 100/232 (43%), Gaps = 3/232 (1%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +    + I+ ++++ +  L   K+L   QW                  P  
Sbjct: 3274 ERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSP 3333

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               +FW  P     KL+ DGS       AAGGGL+RDH  S++  FS    +  S  AEL
Sbjct: 3334 PKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAEL 3393

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A++ GLL+   H+ + +WIE               GS                +  RIS
Sbjct: 3394 MALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRIS 3453

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQ--LGYPNFR 689
            HI REGN+ AD+++  G+  Q +     S A      ++R+D+  L Y  F+
Sbjct: 3454 HIFREGNQAADHLSNQGYTHQNLQVI--SQAEGQLRGILRLDKINLAYVRFK 3503



 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 66/224 (29%), Positives = 96/224 (42%), Gaps = 1/224 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +    + +I ++++ L  L     L   QW   +       +  P +    
Sbjct: 1480 ERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQS 1539

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               + W  P     KL+ DGS  +    AAGGG++RDH   L  AFS  L    S  AEL
Sbjct: 1540 PQIISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAEL 1598

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A+  GLL+  + + +++WIE             + GS                   RIS
Sbjct: 1599 HALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRIS 1658

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQL 671
            HI+REGN+ AD+++  G   QT  +    S  + F SL  M  L
Sbjct: 1659 HIYREGNQAADFLSNKG---QTHQSLCVVSEAQEFPSLPTMHGL 1699


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 99.0 bits (245), Expect = 2e-18
 Identities = 66/230 (28%), Positives = 104/230 (45%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +    + +I ++++ L  L     L   QW   +       +  P +    
Sbjct: 1723 ERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTS 1782

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               ++W  P     KL+ DGS  +  + AAGGG++RDH   L  AFS  L    S  AEL
Sbjct: 1783 PQIIYWIKPFIGEYKLNVDGS-SKSNLNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAEL 1841

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A+  GLL+  + + +++WIE             + GS                   RIS
Sbjct: 1842 HALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRIS 1901

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HI+REGN+ AD+++  G   Q++  F  S A    + ++++D+L  P  R
Sbjct: 1902 HIYREGNQAADFLSNKGQTHQSLCVF--SEAQGELIGILKLDKLNLPYVR 1949


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 97.8 bits (242), Expect = 3e-18
 Identities = 66/230 (28%), Positives = 103/230 (44%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR        ++ ++++ LR L+    L   QW   +       ++   +    
Sbjct: 651  ERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAP 710

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               ++WR P     KL+ DGS   G + AA GG++RDH   L+  FS  +   +S  AEL
Sbjct: 711  PQIIYWRKPFTGEYKLNVDGSSRNGHL-AASGGILRDHTGKLIFGFSENIGLCNSLQAEL 769

Query: 363  QAVYHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + H  ++WIE             + GS                +  RIS
Sbjct: 770  RALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRIS 829

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HI REGN+ ADY+A  GH  Q +     + A      ++++D+L  P  R
Sbjct: 830  HIFREGNQAADYLANEGHSHQNLCVI--TEAQGELHGMLKLDRLNLPYVR 877


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 97.8 bits (242), Expect = 3e-18
 Identities = 65/206 (31%), Positives = 93/206 (45%), Gaps = 1/206 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +      II ++++ LR L     L   QW           ++    R  R
Sbjct: 1107 ERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQAR 1166

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               + W  P    +KL+ DGS       AAGGG++RDH  +L+  FS      +S  AEL
Sbjct: 1167 PKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAEL 1226

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A++ GL +  +++ S VWIE               GS                + VRIS
Sbjct: 1227 LALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRIS 1286

Query: 540  HIHREGNRPADYMARLGHRLQTMTTF 617
            HIHREGN+ AD++++ GH  Q +  F
Sbjct: 1287 HIHREGNQAADFLSKHGHTHQNLHVF 1312


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
            gi|508787491|gb|EOY34747.1| Uncharacterized protein
            TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 96.7 bits (239), Expect = 7e-18
 Identities = 68/234 (29%), Positives = 107/234 (45%), Gaps = 3/234 (1%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +   +  ++ ++++ LR L     L   QW   +       ++ P++  IR
Sbjct: 786  ERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLK--IR 843

Query: 183  STP--VFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDA 356
             +P  + W  P     KL+ DGS  R    AA GGL+RDH  +L+  FS  +  ++S  A
Sbjct: 844  ESPQIIHWVKPVTGEYKLNVDGS-SRHNQSAATGGLLRDHTGTLVFGFSENIGPSNSLQA 902

Query: 357  ELQAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVR 533
            EL+A+  GLL+    +   +WIE             + GS                   R
Sbjct: 903  ELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFR 962

Query: 534  ISHIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFRLH 695
            ISHI REGN+ AD+++  GH  Q +     S A      ++++D+L  P  + H
Sbjct: 963  ISHIFREGNQAADFLSNKGHTHQNLQVI--SEAQGKLHGMLKLDRLNLPYVKFH 1014


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 90.9 bits (224), Expect = 4e-16
 Identities = 67/235 (28%), Positives = 104/235 (44%), Gaps = 6/235 (2%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSP-----QVDFMPYSAPV 167
            ERN  KHR +    + I+ ++++ ++ L + ++L   QW           + F   S P 
Sbjct: 2021 ERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLP- 2079

Query: 168  RRPIRSTPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASS 347
              P +  P  W  P     KL+ DGS    Q  AAGGG++RDH   ++  FS  L   +S
Sbjct: 2080 --PPKVFP--WHKPSIGEFKLNVDGSAKLSQ-NAAGGGVLRDHAGVMVFGFSENLGIQNS 2134

Query: 348  FDAELQAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXEL 524
              AEL A+Y GL++   ++   +WIE             + G                  
Sbjct: 2135 LQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHF 2194

Query: 525  QVRISHIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
              R+SHI REGN+ AD++A  GH  Q++     + A      ++R+DQ   P  R
Sbjct: 2195 SFRLSHIFREGNQAADFLANRGHEHQSLQV--VTVAQGKLRGMLRLDQTSLPYVR 2247


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 90.1 bits (222), Expect = 7e-16
 Identities = 62/230 (26%), Positives = 95/230 (41%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +    + ++ +V++ ++ L + ++L   QW                     
Sbjct: 682  ERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAP 741

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
                 W  P     KL+ DGS       AAGGG++RDH   ++  FS  L   +S  AEL
Sbjct: 742  PKVFSWHKPTTGEFKLNVDGSAKHSH-NAAGGGILRDHAGVMVFGFSENLGIQNSLQAEL 800

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
             A+Y GL++   ++   +WIE               G                    R S
Sbjct: 801  LALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFS 860

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HI REGN+ AD++A  GH  Q +  F  + A      ++R+DQ  +P  R
Sbjct: 861  HIFREGNQAADFLANRGHEHQNLQVF--TVAQGKLRGMLRLDQTSFPYVR 908


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 66/233 (28%), Positives = 103/233 (44%), Gaps = 4/233 (1%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSP--QVDFMPYSAPVRRP 176
            ERN  KHR +    + ++ +V++ ++ L + ++L   QW       Q   + + A    P
Sbjct: 2023 ERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAP 2082

Query: 177  IRSTPVF-WRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFD 353
             +   VF W  P     KL+ DGS  +    AAGGG++RDH   ++  FS  L   +S  
Sbjct: 2083 PK---VFSWHKPSLGEFKLNVDGSAKQSH-NAAGGGILRDHAGEMVFGFSENLGTQNSLQ 2138

Query: 354  AELQAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQV 530
            AEL A+Y GL++   ++   +WIE               G                    
Sbjct: 2139 AELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSF 2198

Query: 531  RISHIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            R SHI REGN+ AD++A  GH  Q +  F  + A      ++ +DQ  +P  R
Sbjct: 2199 RFSHIFREGNQAADFLANRGHEHQNLQVF--TVAQGKLRGMLCLDQTSFPYVR 2249


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 63/230 (27%), Positives = 102/230 (44%), Gaps = 1/230 (0%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  KHR +   +  ++ ++++ LR L     L   QW             +P +    
Sbjct: 1726 ERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAA 1785

Query: 183  STPVFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAEL 362
               + W  P     KL+ DGS  R    AA GG++RDH  +L+  FS  +  ++S  AEL
Sbjct: 1786 PQILHWVKPVPGEHKLNVDGS-SRQNQTAAIGGVLRDHTGTLVFDFSENIGPSNSLQAEL 1844

Query: 363  QAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRIS 539
            +A+  GLL+  + +   +W+E             + GS                   RIS
Sbjct: 1845 RALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRIS 1904

Query: 540  HIHREGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            HI REGN+ AD+++  GH  Q++  F  + A      ++++D+L  P  R
Sbjct: 1905 HIFREGNQAADFLSNKGHTHQSLHVF--TEAQGKLYGMLKLDRLNLPYVR 1952


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 83.6 bits (205), Expect = 7e-14
 Identities = 51/167 (30%), Positives = 77/167 (46%), Gaps = 1/167 (0%)
 Frame = +3

Query: 192  VFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAELQAV 371
            ++W  P     KL+ DG        AA GG+ RDH ++++  FS      +S  AEL A+
Sbjct: 1536 IYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELMAL 1595

Query: 372  YHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRISHIH 548
            + GLL+ ++++ S VWIE               G                 +  RISHIH
Sbjct: 1596 HRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISHIH 1655

Query: 549  REGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            RE N+ ADY++  GH  Q++  F  S A      ++R+D+   P  R
Sbjct: 1656 RESNQAADYLSNQGHTHQSLQVF--SKAEGELRGMIRLDKSNLPYVR 1700



 Score = 77.0 bits (188), Expect = 6e-12
 Identities = 61/213 (28%), Positives = 94/213 (44%), Gaps = 8/213 (3%)
 Frame = +3

Query: 3    ERNSHKHRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIR 182
            ERN  K R +   +  ++ ++++ LR L     L   QW           ++   +  I+
Sbjct: 1306 ERNDAKQRHLGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPK--IQ 1363

Query: 183  STPVFWRPPPALWVKL-------STDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAA 341
            +TP  +      WVKL       + DGS  R    AA GGL+RDH  +L+  FS  +  +
Sbjct: 1364 ATPQIFH-----WVKLVSGEHKLNVDGS-SRQNQSAAIGGLLRDHTGTLVFGFSENIGPS 1417

Query: 342  SSFDAELQAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXX 518
            +S  AEL+A+  GLL+  + +   +WIE             + GS               
Sbjct: 1418 NSLQAELRALLRGLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQYLLASIRKCLS 1477

Query: 519  ELQVRISHIHREGNRPADYMARLGHRLQTMTTF 617
                RISHI REGN+ AD+++  GH  Q +  F
Sbjct: 1478 FFSFRISHIFREGNQVADFLSNKGHTQQNLLVF 1510


>ref|XP_007010293.1| Uncharacterized protein TCM_043836 [Theobroma cacao]
           gi|508727206|gb|EOY19103.1| Uncharacterized protein
           TCM_043836 [Theobroma cacao]
          Length = 228

 Score = 83.2 bits (204), Expect = 9e-14
 Identities = 52/145 (35%), Positives = 71/145 (48%), Gaps = 3/145 (2%)
 Frame = +3

Query: 171 RPIRSTP--VFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAAS 344
           R + S P  + W  P     KL+ DGS       A GGGL+RDH ++L+  FS  L A +
Sbjct: 39  RKVISLPKVISWHKPSTGEFKLNVDGSSINNFQNAGGGGLLRDHTSTLVFVFSENLGAKN 98

Query: 345 SFDAELQAVYHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXE 521
           S  AEL A++ GLL+  +++ S +WIE               GS                
Sbjct: 99  SLQAELLALHRGLLLCQENNISRLWIEMDAMIVIQMLKEGHIGSHDSRYLWASIRQQLKL 158

Query: 522 LQVRISHIHREGNRPADYMARLGHR 596
              RISHIHREGN+ AD++A  GH+
Sbjct: 159 FSFRISHIHREGNQAADWLANRGHQ 183


>ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobroma cacao]
           gi|508716897|gb|EOY08794.1| Uncharacterized protein
           TCM_023960 [Theobroma cacao]
          Length = 303

 Score = 80.9 bits (198), Expect = 4e-13
 Identities = 52/164 (31%), Positives = 76/164 (46%), Gaps = 1/164 (0%)
 Frame = +3

Query: 192 VFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAELQAV 371
           V WR    L  +L  DGS       AA GG++RDH ++++  F       SS  AEL A+
Sbjct: 138 VIWRIMRMLR-QLYQDGSSKEAFQNAASGGVLRDHTSTMIFGFFENFGPYSSIQAELMAL 196

Query: 372 YHGLLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRISHIH 548
           + GLL+ ++++ S VWIE               GS                +  RISHIH
Sbjct: 197 HRGLLLCNEYNISRVWIEMDAKAIVQMLHKGHKGSSRTRYLLSSIHQCLSGISYRISHIH 256

Query: 549 REGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYP 680
           R+GN+  DY++  GH  Q +  F  S A      ++R+D+   P
Sbjct: 257 RQGNQAVDYLSNKGHTHQNLQVF--SEAEGELKGMIRLDKSNLP 298


>ref|XP_007052624.1| Uncharacterized protein TCM_005952 [Theobroma cacao]
           gi|508704885|gb|EOX96781.1| Uncharacterized protein
           TCM_005952 [Theobroma cacao]
          Length = 445

 Score = 80.5 bits (197), Expect = 6e-13
 Identities = 56/200 (28%), Positives = 85/200 (42%), Gaps = 1/200 (0%)
 Frame = +3

Query: 21  HRGVPFLASHIISQVIQHLRLLVMAKKLAPSQWSDCSPQVDFMPYSAPVRRPIRSTPVFW 200
           HR +      II ++++ LR L     L   QW           ++    R  R   + W
Sbjct: 2   HRDLGMYPDRIIWRIMKMLRQLFQGGLLCKWQWKTDLDIAIHWGFNFAQERQARPKIIHW 61

Query: 201 RPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAELQAVYHG 380
             P    +KL+ DGS       A GGG++RDH  +L+  FS      +S  AEL A++ G
Sbjct: 62  TKPLIGELKLNVDGSSKDEFQNAVGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHKG 121

Query: 381 LLIASQHS-SHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRISHIHREG 557
           L +  +++ S VWIE                                 + VRISHIH+EG
Sbjct: 122 LCLCMEYNVSRVWIEMDAQV----------------------------ISVRISHIHKEG 153

Query: 558 NRPADYMARLGHRLQTMTTF 617
           N+  D++++ GH  Q +  F
Sbjct: 154 NQATDFLSKCGHTHQNLHVF 173


>ref|XP_007008705.1| Uncharacterized protein TCM_042331 [Theobroma cacao]
            gi|508725618|gb|EOY17515.1| Uncharacterized protein
            TCM_042331 [Theobroma cacao]
          Length = 1176

 Score = 80.1 bits (196), Expect = 7e-13
 Identities = 51/167 (30%), Positives = 76/167 (45%), Gaps = 1/167 (0%)
 Frame = +3

Query: 192  VFWRPPPALWVKLSTDGSFDRGQMRAAGGGLVRDHRASLLSAFSLPLQAASSFDAELQAV 371
            V+WR P     KL+  GS   GQ  AA GG++RDH   L+  FS  +   +S   EL+A+
Sbjct: 1012 VYWRKPFTGEYKLNVGGSSRNGQ-HAASGGVLRDHTGKLIFGFSENIGTYNSLQGELRAL 1070

Query: 372  YHGLLIASQ-HSSHVWIEXXXXXXXXXXXXXRHGSGXXXXXXXXXXXXXXELQVRISHIH 548
            + GLL+    H   +WIE             + GS                +  RI HI 
Sbjct: 1071 HRGLLLCKDCHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNNISYRILHIF 1130

Query: 549  REGNRPADYMARLGHRLQTMTTFDASSAPRPFLSLVRMDQLGYPNFR 689
            REGN+  D+++  GH  Q +  F  + A      ++++D+L  P  R
Sbjct: 1131 REGNQTVDFLSNRGHNHQNLRVF--TEAQGKLHGMLKLDRLNLPYVR 1175