BLASTX nr result

ID: Rehmannia22_contig00016810 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00016810
         (2429 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao]   166   6e-38
gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao]   160   2e-36
gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao]   159   5e-36
gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao]   159   7e-36
gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]   157   2e-35
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   157   2e-35
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   156   3e-35
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   152   5e-34
gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao]   152   7e-34
gb|EOY02235.1| Uncharacterized protein TCM_011922 [Theobroma cacao]   151   1e-33
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   150   3e-33
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   147   3e-32
gb|EOY02243.1| Uncharacterized protein TCM_016768 [Theobroma cacao]   146   4e-32
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   145   8e-32
gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]   143   4e-31
gb|EOY26479.1| Uncharacterized protein TCM_028230 [Theobroma cacao]   142   7e-31
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   142   7e-31
gb|EOY19056.1| Uncharacterized protein TCM_043721 [Theobroma cacao]   135   6e-29
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   134   1e-28
gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]   127   2e-26

>gb|EOY29076.1| Uncharacterized protein TCM_030494 [Theobroma cacao]
          Length = 876

 Score =  166 bits (419), Expect = 6e-38
 Identities = 87/223 (39%), Positives = 126/223 (56%), Gaps = 7/223 (3%)
 Frame = -3

Query: 2136 IVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGP 1957
            + P   T     +  + +   E   LA  +K  ++GKFS  +P+  EI     SL L G 
Sbjct: 103  VPPTRPTFRYKDKPAVRFFEDEIEALAQPFKFAIVGKFS-KMPRLTEIRQSFVSLGLSGV 161

Query: 1956 HSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIR 1777
            ++ ++ N  HI+I +  E+DFNR+W    W I +  MRVFKWTP F + +E+P+ PVWI 
Sbjct: 162  YNIRWMNYKHILIHLSNEQDFNRIWTKQTWFITNQKMRVFKWTPDFETDKESPIVPVWIS 221

Query: 1776 LPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEE--IVL 1603
             P L  H F+  AL  IAK IG PL +D  TA  TR ++AR+CIE D LK  ++   IV+
Sbjct: 222  FPNLKAHLFEKSALLMIAKAIGNPLYIDEATANGTRPSVARVCIEYDCLKPPVDSVWIVV 281

Query: 1602 CSDGAV-----HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1489
               G+      ++QK+ +  +PEYC  C HVGHN+  C + G+
Sbjct: 282  SKRGSEDMSGGYLQKVEFAPMPEYCNHCCHVGHNVSKCLILGS 324


>gb|EOY06957.1| Uncharacterized protein TCM_021519 [Theobroma cacao]
          Length = 667

 Score =  160 bits (405), Expect = 2e-36
 Identities = 100/327 (30%), Positives = 165/327 (50%), Gaps = 17/327 (5%)
 Frame = -3

Query: 2205 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2026
            KS+ ++        I L+ DP          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSIAAGSKPPVIPLNRDP--------AVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2025 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1846
            F+  +P+ +E+ S    + L G +  K+ +  H++I +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1845 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1666
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD PTAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEPTAKGSRP 225

Query: 1665 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1507
            ++AR+C+E D  +  ++++ + +     G V   + QK+ + ++P+YCE C HVGHN   
Sbjct: 226  SVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETT 285

Query: 1506 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLKQKQGGTSDISKDINTKQ--GEQTNPVHEQ 1333
            C + GNN                   +  LK +   T ++SK    ++  GE+ +     
Sbjct: 286  CLVLGNN------------SKSSGSMKAQLKGQTKQTLNMSKTQTREKTDGEKEDKAKGI 333

Query: 1332 VVEKGEPNPKE--------WNIIRKKG 1276
            +VE+  P  K+        W ++ K G
Sbjct: 334  MVEEIRPATKQTDMSKQSIWRVVGKAG 360


>gb|EOY02245.1| Uncharacterized protein TCM_016772 [Theobroma cacao]
          Length = 1296

 Score =  159 bits (402), Expect = 5e-36
 Identities = 86/224 (38%), Positives = 130/224 (58%), Gaps = 8/224 (3%)
 Frame = -3

Query: 2136 IVPLGTT-SFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKG 1960
            ++PL    S+   R   ++   E   LA ++K ++IGKF+  +P+ +EI +    + L G
Sbjct: 75   VIPLNREPSWYRDRPAASFFDNEIATLALSFKFSMIGKFT-RMPKLQEIRTAFKGIGLVG 133

Query: 1959 PHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWI 1780
             ++ ++ +  HI+I +  E D NR+WM   W I +  MRVFKWTP F+ ++E+ L PVWI
Sbjct: 134  AYNIRWLDYKHILIHLSNEHDLNRIWMKQNWFIVNKKMRVFKWTPEFHPEKESSLVPVWI 193

Query: 1779 RLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLC 1600
              P L  HF++   L  IAK +G PL VD  TA  TR  +ARIC+E D  K  L++I + 
Sbjct: 194  SFPNLRAHFYEKSTLMMIAKSVGRPLFVDEATANGTRPNVARICVEYDCQKSLLDQIWIV 253

Query: 1599 S----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1489
            +     G V    +QK+ + ++P+YC  C HVGHN  AC + GN
Sbjct: 254  TRSRQTGEVTGGFIQKVEFVKMPDYCTHCCHVGHNASACLVLGN 297


>gb|EOY19201.1| Uncharacterized protein TCM_044158 [Theobroma cacao]
          Length = 830

 Score =  159 bits (401), Expect = 7e-36
 Identities = 88/317 (27%), Positives = 163/317 (51%), Gaps = 9/317 (2%)
 Frame = -3

Query: 2148 DPKKIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSL 1972
            +   ++PL    F    R   ++   E + LA  +K +++GKFS  + + +EI      +
Sbjct: 65   EKSSLIPLDREPFWYKDRPAASFFDDEISTLAQPFKFSMVGKFSRML-RMQEIRVAFKGI 123

Query: 1971 KLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLA 1792
             L G +  ++ +  HI+I++  E D NR+W+  +W I +  MRVFKW+P F  ++E+ + 
Sbjct: 124  GLIGAYEIRWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMRVFKWSPEFQPEKESSMV 183

Query: 1791 PVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEE 1612
            PVWI  P L  H ++  AL +I K +G PL VD  TA  TR ++AR+C+E D  +  +++
Sbjct: 184  PVWISFPNLKAHLYEKSALSAIVKTVGRPLMVDEATANGTRPSVARVCVEFDCQQPPIDQ 243

Query: 1611 IVLCS----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNVXXXXXXXXXX 1453
            + + +     G+V   ++QK+ + R+ E+C  C HVGH + +C + GN            
Sbjct: 244  VWIVTRNRQSGSVMGGYMQKVEFARLSEFCTHCSHVGHGVSSCMVIGNRPEKNKQPM--- 300

Query: 1452 XXXRGAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVVEKGEPNPKEWNIIRKKGP 1273
                  G ++  K+ +  T+    D+  ++ ++T P+  +     +     W ++ + GP
Sbjct: 301  -----GGKKQLKKEDKDRTNARKGDLKPQEEKETEPIQAE----QQKQSTRWQVMARPGP 351

Query: 1272 RET-GFISQEVLKLAKK 1225
                G   +E++  A+K
Sbjct: 352  SSAKGTRGEELVLNAQK 368


>gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
          Length = 2606

 Score =  157 bits (398), Expect = 2e-35
 Identities = 99/322 (30%), Positives = 157/322 (48%), Gaps = 12/322 (3%)
 Frame = -3

Query: 2205 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2026
            KS+ ++        I LS DP          F+D R    +   E   LA   KL+L+GK
Sbjct: 1688 KSFLSIVSGDKPPVIPLSRDP--------LVFKD-RPAAAFFEDEIQTLAQPLKLSLVGK 1738

Query: 2025 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1846
            FS  +P+ +++ S    + L G +  ++ +  H++I +  E+D NR+W   +W I +  M
Sbjct: 1739 FS-RMPKLQDVRSAFKGIGLTGAYEVRWLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKM 1797

Query: 1845 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1666
            RVFKWTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R 
Sbjct: 1798 RVFKWTPEFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRP 1857

Query: 1665 TMARICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1507
            ++AR+CIE D  +  ++++ +       G V   + Q++ + ++P YC+ C HVGH    
Sbjct: 1858 SVARVCIEFDCRRPPIDQVWIVVQNRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKEND 1917

Query: 1506 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLK-----QKQGGTSDISKDINTKQGEQTNPV 1342
            C + GN                G    +SL+     +K G      K++  ++    NP 
Sbjct: 1918 CIVLGNK-----------DKSLGLSKSQSLRTLAVEKKTGYGGGSEKNLEKRK----NPE 1962

Query: 1341 HEQVVEKGEPNPKEWNIIRKKG 1276
             E++V   EP    W  + K G
Sbjct: 1963 KEKIVRPEEPASLRWQQVSKAG 1984



 Score =  148 bits (374), Expect = 1e-32
 Identities = 81/227 (35%), Positives = 124/227 (54%), Gaps = 8/227 (3%)
 Frame = -3

Query: 2139 KIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLK 1963
            +I+P     F    R  + +   E   LA  +K +++GKFS  +P+  +I +    + L 
Sbjct: 74   QIIPTNREPFWYRDRPAVAFFEDEIVALAQPFKHSMVGKFS-RMPKLNDIRAAFKGISLV 132

Query: 1962 GPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVW 1783
            G +  ++ +  HI+I +  E+D NRLWM   W I +  MRVFKWTP F  ++E+ L PVW
Sbjct: 133  GVYEIRWLDYKHILIHLSNEQDLNRLWMRQAWFIANQKMRVFKWTPDFQPEKESSLVPVW 192

Query: 1782 IRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEI-V 1606
            I  P L  H ++  AL  IAK +G PL VD  TA  TR ++AR+C+E D  +  LE+I +
Sbjct: 193  ISFPNLRAHLYEKSALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQIWI 252

Query: 1605 LCSDGAV------HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNV 1483
            +  D           QK+ + ++P YC  C HVGH+   C + G+ +
Sbjct: 253  VTRDRRTGDITGGFQQKVDFAKLPNYCTHCCHVGHSASTCLVMGHRM 299


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  157 bits (397), Expect = 2e-35
 Identities = 100/321 (31%), Positives = 156/321 (48%), Gaps = 7/321 (2%)
 Frame = -3

Query: 2205 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2026
            KS+ ++        + LS DP          F+D R    +   E   LA   KL+L+GK
Sbjct: 91   KSFLSIVSGQKPPVVPLSRDP--------FVFKD-RPAAAFYEDEIQTLAQPLKLSLVGK 141

Query: 2025 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1846
            FS  +P+ +++ S    + L G +  ++ +  HI+I +  E D NR+W   +W I +  M
Sbjct: 142  FS-RMPKLQDVRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDCNRVWTKQVWFIANQKM 200

Query: 1845 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1666
            RVFKWTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R 
Sbjct: 201  RVFKWTPEFEPEKESAMVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRP 260

Query: 1665 TMARICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1507
            ++AR+CIE D  K  ++++ +       G V   + QK+ + ++P YC+ C HVGH    
Sbjct: 261  SVARVCIEYDCRKPPIDQVWIVVQNRETGTVTSGYPQKVEFSQMPAYCDHCCHVGHKEID 320

Query: 1506 CYMNGNNVXXXXXXXXXXXXXRGAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVV 1327
            C + GN                 A      K+  GG+S+ + +      +  NP  E++ 
Sbjct: 321  CIVLGNKDKPLGSSKSQFLRVLEA----EKKKGYGGSSEKNLE------KSKNPEKEKIA 370

Query: 1326 EKGEPNPKEWNIIRKKGPRET 1264
             + EP  + W  + K G   T
Sbjct: 371  RQEEPVSQRWQPVNKAGTSGT 391


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  156 bits (395), Expect = 3e-35
 Identities = 85/247 (34%), Positives = 137/247 (55%), Gaps = 7/247 (2%)
 Frame = -3

Query: 2205 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2026
            KS+ ++        I L+ DP          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSIAAGSKPPVIPLNRDP--------AVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2025 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1846
            F+  +P+ +E+ S    + L G +  K+ +  H++I +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVLIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1845 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1666
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD  TAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKSALLLIAKTIGKPLFVDEATAKGSRP 225

Query: 1665 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1507
            ++AR+C+E D  +  ++++ + +     G V   + QK+ + ++P+YCE C HVGHN   
Sbjct: 226  SVARVCVEYDCREPPIDQVWIVTQKRETGMVTNGYAQKVEFSQMPDYCEHCCHVGHNETT 285

Query: 1506 CYMNGNN 1486
            C + GNN
Sbjct: 286  CLVLGNN 292


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  152 bits (385), Expect = 5e-34
 Identities = 84/231 (36%), Positives = 130/231 (56%), Gaps = 8/231 (3%)
 Frame = -3

Query: 2157 LSFDPKKIVPLGTTSF-QDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGL 1981
            +S +   +VPL    F    R    +   E + LA  +KL+L+GKFS  +P+ +E+ S  
Sbjct: 91   VSGEKPSVVPLTRDPFVYKDRPAAAFFEDEIHILAQPFKLSLVGKFS-RMPKLQEVRSAF 149

Query: 1980 SSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEA 1801
              + L G +  ++ +  HI+I +  E+DFNR W    W I +  MRVFKWTP F  ++E+
Sbjct: 150  KGIGLAGSYEIRWLDYKHILIHLSNEQDFNRFWTKQAWFIANQKMRVFKWTPEFEPEKES 209

Query: 1800 PLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKER 1621
             + PVWI  P L  H F+  AL  IAK +G PL +D  TA  +R ++AR+CIE D  +  
Sbjct: 210  AVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFIDEATANGSRPSVARVCIEYDCREPP 269

Query: 1620 LEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1489
            ++++ +     + GAV   + QK+ + ++P YC+ C HVGH    C + GN
Sbjct: 270  VDQVWIVVQNRATGAVTSGYPQKVEFAQMPAYCDHCCHVGHKEINCIVLGN 320


>gb|EOY25452.1| Uncharacterized protein TCM_016760 [Theobroma cacao]
          Length = 1109

 Score =  152 bits (384), Expect = 7e-34
 Identities = 81/225 (36%), Positives = 124/225 (55%), Gaps = 7/225 (3%)
 Frame = -3

Query: 2145 PKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKL 1966
            P  I P    S    R    +   E   LA  +  +L+GKFS  +P+ +EI      + L
Sbjct: 71   PPVIPPSRDPSVYKDRPAAIFYEDEIQTLARPFSHSLVGKFS-RMPKLQEIRHAFKGIGL 129

Query: 1965 KGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPV 1786
             G +  ++ +  H++I +  E+DFNR+W+   W I +  MRVFKW P F +++E+ + PV
Sbjct: 130  SGAYEIRWMDYKHVLIHLSNEQDFNRVWVKQQWFIVNQKMRVFKWAPDFEAEKESAMVPV 189

Query: 1785 WIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIV 1606
            WI  P L  H ++  AL  IAK +G PL VD  TA  +R ++AR+C+E D  K+ +EEI 
Sbjct: 190  WISFPNLKAHLYEKSALLLIAKTVGKPLYVDEATANGSRPSVARVCVEYDCRKQPVEEIW 249

Query: 1605 LC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNG 1492
            +       GAV   + Q++ + R+P+YC +C HVGH    C + G
Sbjct: 250  IVIRNRETGAVTGGYSQRVEFARMPDYCGYCSHVGHKENECIVLG 294


>gb|EOY02235.1| Uncharacterized protein TCM_011922 [Theobroma cacao]
          Length = 928

 Score =  151 bits (382), Expect = 1e-33
 Identities = 85/246 (34%), Positives = 134/246 (54%), Gaps = 7/246 (2%)
 Frame = -3

Query: 2205 KSYANVTGXXXXXSINLSFDPKKIVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGK 2026
            KS+ ++T       I L+ +P          ++D    + Y   E   LA  + L L+GK
Sbjct: 56   KSFLSITAGSKPPVIPLNRNP--------VVYKDRPAAVFYED-EICILAKPFSLCLVGK 106

Query: 2025 FSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPM 1846
            F+  +P+ +E+ S    + L G +  K+ +  H+II +  ++DFNR+W    W I    M
Sbjct: 107  FT-RMPKLQEVRSAFKGIGLSGAYEIKWLDYKHVIIHLSNDQDFNRIWTRQQWFIVGQKM 165

Query: 1845 RVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRL 1666
            R+FKW+P F +++E+P+ PVWI  P L  H ++  AL  IAK IG PL VD  TAK +R 
Sbjct: 166  RIFKWSPEFEAEKESPVVPVWISFPNLKAHLYEKFALLLIAKTIGRPLFVDEATAKGSRP 225

Query: 1665 TMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYCEFCKHVGHNIQA 1507
            ++AR+C E D  K  + ++ + +     G V   + QK+ + ++P YC+ C HVGHN   
Sbjct: 226  SVARVCAEYDCRKPPINQVWIVTQKRETGTVTNGYAQKVEFSQMPAYCDHCCHVGHNETN 285

Query: 1506 CYMNGN 1489
            C + GN
Sbjct: 286  CLVLGN 291


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  150 bits (379), Expect = 3e-33
 Identities = 95/313 (30%), Positives = 155/313 (49%), Gaps = 9/313 (2%)
 Frame = -3

Query: 2136 IVPLGTTSFQ-DGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKG 1960
            +VPL    F    R    +   E   LA  +KL+L+GKFS  +P+ +++ +    + L G
Sbjct: 1770 VVPLTRDPFVFKDRPAAAFFEDEIQTLAKPFKLSLVGKFS-RMPKLQDVRAAFKGIGLAG 1828

Query: 1959 PHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWI 1780
             +  ++ +  H++I +  E+DFNR+W    W I    MRVFKWTP F  ++E+ + PVWI
Sbjct: 1829 AYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRVFKWTPEFEPEKESAVVPVWI 1888

Query: 1779 RLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLC 1600
              P L  H F+  AL  IAK +G PL VD  TA  +R ++AR+C+E D  +  L+++ + 
Sbjct: 1889 SFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVARVCVEFDCRQPPLDQVWIV 1948

Query: 1599 ----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMNGNNVXXXXXXXXXXXXXR 1441
                  G +   + Q++ + ++P YC+ C HVGH    C + GN                
Sbjct: 1949 VQNRKTGEITNGYSQRVEFAQMPAYCDHCCHVGHKETDCILLGNKARPPGITKQPNSRLE 2008

Query: 1440 GAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVVEKGEPNPKEWNIIRKKG-PRET 1264
              G +   K+    T++  K+I   +     P +++++   EP PK     +K+G P   
Sbjct: 2009 DGGRRVGSKEDGEFTTEKRKNIENSK----KPQNDKILYPEEP-PKH----QKRGQPANK 2059

Query: 1263 GFISQEVLKLAKK 1225
            G  S   +   KK
Sbjct: 2060 GSTSGTKIWQGKK 2072



 Score =  144 bits (364), Expect = 1e-31
 Identities = 66/182 (36%), Positives = 107/182 (58%), Gaps = 7/182 (3%)
 Frame = -3

Query: 2013 IPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFK 1834
            +P+ +EI      + L G +  ++ +  HI+I +  E+DFNR+W    W I +  MRVFK
Sbjct: 1    MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60

Query: 1833 WTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMAR 1654
            W+P F +++E+P+ PVWI  P L  H ++  AL  IAK +G PL +D  T+  +R ++AR
Sbjct: 61   WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120

Query: 1653 ICIELDLLKERLEEIVLCSDGAV-------HVQKIVYERVPEYCEFCKHVGHNIQACYMN 1495
            +C+E +     +EEI +     V       + QK+ + ++P+YCE C HVGH++  C + 
Sbjct: 121  VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180

Query: 1494 GN 1489
            GN
Sbjct: 181  GN 182


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  147 bits (370), Expect = 3e-32
 Identities = 78/213 (36%), Positives = 120/213 (56%), Gaps = 7/213 (3%)
 Frame = -3

Query: 2100 RKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHII 1921
            R  + +   E   LA  +K +++GKFS  +P+  +I +    + L G +  ++ +  HI+
Sbjct: 88   RPAVAFFEDEIVALAQPFKHSMVGKFS-RMPKLNDIRAAFKGIGLVGVYEIRWLDYKHIL 146

Query: 1920 IKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYH 1741
            I +  E+D NRLWM   W I +  MRVFKW+P F  ++E+ L PVWI  P L  H ++  
Sbjct: 147  IHLSNEQDLNRLWMRQAWFIANQKMRVFKWSPDFQPEKESSLVPVWISFPNLRAHLYEKS 206

Query: 1740 ALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCS----DGAV---H 1582
            AL  IAK +G PL VD  TA  TR ++AR+C+E D  +  LE+I + S     G +    
Sbjct: 207  ALLMIAKSVGRPLFVDEATANGTRPSVARVCVEYDCQQPPLEQIWIVSRDRRTGDITGGF 266

Query: 1581 VQKIVYERVPEYCEFCKHVGHNIQACYMNGNNV 1483
             QK+ + ++P YC  C HVGH+   C + G+ +
Sbjct: 267  QQKVDFAKLPNYCTHCCHVGHSASTCLVMGHRM 299


>gb|EOY02243.1| Uncharacterized protein TCM_016768 [Theobroma cacao]
          Length = 351

 Score =  146 bits (369), Expect = 4e-32
 Identities = 89/260 (34%), Positives = 136/260 (52%), Gaps = 17/260 (6%)
 Frame = -3

Query: 2019 FAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRV 1840
            F +P+  EI      + L G +  K+ +  HI+I++  E D NR+W+  +W I +  M V
Sbjct: 83   FWMPRINEIRMAFKGIDLVGAYEIKWLDYKHILIQLSNEHDLNRIWLKQVWFISNQKMCV 142

Query: 1839 FKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTM 1660
            FKWTP+F  ++E+ L PVWI  P L  H ++  AL  IAK +G PL VD  TAK TR ++
Sbjct: 143  FKWTPNFQPEKESSLVPVWISFPNLRAHLYEKFALLVIAKTVGRPLMVDEATAKGTRPSV 202

Query: 1659 ARICIELDLLKERLEEIVLCS----DGAV---HVQKIVYERVPEYCEFCKHVGHNIQACY 1501
            AR+CIE D  K  ++++ + +     G+V   ++QK+ + ++ EYC  C HVGH +  C 
Sbjct: 203  ARVCIEYDCQKPPIDQVWIVTRDRKTGSVIGGYMQKVDFAKLLEYCSHCCHVGHGVSTCI 262

Query: 1500 MNGNNVXXXXXXXXXXXXXRG--AGNQESLKQKQ----GGTSDISKDINTKQGEQTNPVH 1339
            M G+                G   G ++ ++ +Q    G  +D  + I  KQ  +     
Sbjct: 263  MLGHRPEKRLQPTKTRMKRNGDDEGKEKPIEGEQGMRDGNGTDRVQFIEPKQSTKW---- 318

Query: 1338 EQVVEK----GEPNPKEWNI 1291
             QVVEK    G  +PK  NI
Sbjct: 319  -QVVEKPGTSGVNDPKPINI 337


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  145 bits (366), Expect = 8e-32
 Identities = 84/265 (31%), Positives = 133/265 (50%), Gaps = 5/265 (1%)
 Frame = -3

Query: 2043 LTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWS 1864
            ++L+GKFS  +P+ ++I S    + L G +  ++ +  HI+I +  E D NR+W   +W 
Sbjct: 1    MSLVGKFS-RMPKLQDIRSAFKGIGLAGAYEVRWLDYKHILIHLTNEHDCNRVWTKQVWF 59

Query: 1863 IRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPT 1684
            I +  MRVFKWTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  T
Sbjct: 60   IANQKMRVFKWTPDFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEAT 119

Query: 1683 AKKTRLTMARICIELDLLKERLEEIVLCSDGAVHVQKIVYERVPEYCEFCKHVGHNIQAC 1504
            A  +R ++AR+CIE D  +  ++            Q++ + ++P YC+ C HVGH    C
Sbjct: 120  ANGSRPSVARVCIEYDCRRPPID------------QRVEFSQMPAYCDHCCHVGHKEIDC 167

Query: 1503 YMNGNNVXXXXXXXXXXXXXRGAGNQESLK-----QKQGGTSDISKDINTKQGEQTNPVH 1339
             + GN                G+   + L+     +K+G      K++     +  NP  
Sbjct: 168  IVLGNK-----------DKPLGSSKSQYLRVLEAEKKKGYGGGSEKNLE----KSKNPEK 212

Query: 1338 EQVVEKGEPNPKEWNIIRKKGPRET 1264
            E++V   EP  + W  + K G   T
Sbjct: 213  EKIVRPEEPLTQRWQPVSKAGTSGT 237


>gb|EOY17515.1| Uncharacterized protein TCM_042331 [Theobroma cacao]
          Length = 1176

 Score =  143 bits (360), Expect = 4e-31
 Identities = 74/199 (37%), Positives = 112/199 (56%), Gaps = 7/199 (3%)
 Frame = -3

Query: 2061 LAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLW 1882
            LA  + L L+GKF+  +P+ +E+      + L G +  K+ +  H+II +  ++DFNR+W
Sbjct: 74   LAKPFSLYLVGKFT-RMPKLQEVKFAFKGIDLLGAYEIKWLDYKHVIIHLSNDQDFNRIW 132

Query: 1881 MGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPL 1702
                W I    MR+FKW   F +K E+P+ PVWI  P L  H ++  AL  I K IG PL
Sbjct: 133  TRQQWFIAGQKMRIFKWPLEFEAKTESPIVPVWISFPNLKAHLYEKFALLLIVKTIGKPL 192

Query: 1701 QVDSPTAKKTRLTMARICIELDLLKERLEEIVLCSD----GAV---HVQKIVYERVPEYC 1543
             VD  T K +R TMAR+C++ D  K  ++++ + +     G V   + QK+ +  +P Y 
Sbjct: 193  FVDEATTKGSRPTMARVCVKYDCRKLPIDQVWIVTQKRNTGIVTNGYAQKVEFSHMPNYW 252

Query: 1542 EFCKHVGHNIQACYMNGNN 1486
            + C HVGHN   C + GNN
Sbjct: 253  DHCCHVGHNETNCLVLGNN 271


>gb|EOY26479.1| Uncharacterized protein TCM_028230 [Theobroma cacao]
          Length = 748

 Score =  142 bits (358), Expect(2) = 7e-31
 Identities = 79/255 (30%), Positives = 130/255 (50%), Gaps = 7/255 (2%)
 Frame = -3

Query: 2007 QPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWT 1828
            +P EI +    + L G +  ++ +  HI I +  E+D NR+W+  +W I +  +RVFKWT
Sbjct: 89   RPTEIRNAFKGIGLAGAYDIRWLDYKHIHIGLSNEQDMNRIWLKQVWFISNQKLRVFKWT 148

Query: 1827 PSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARIC 1648
              F  ++E+ L PVWI  P L  H ++  A+  IAK +G PL VD  T   TR ++AR+C
Sbjct: 149  KDFQPEKESSLVPVWISFPNLRAHLYEKSAVLVIAKTVGRPLFVDEATDNGTRPSLARVC 208

Query: 1647 IELDLLKERLEEI-VLCSDGAV------HVQKIVYERVPEYCEFCKHVGHNIQACYMNGN 1489
            IE D LK  L+++ ++  D          +QK+ +ER+P+YC  C HVGH++  C + GN
Sbjct: 209  IEYDCLKPPLDQVWIVMRDRRTGEITGGFMQKVDFERMPDYCTHCCHVGHSVSTCIVMGN 268

Query: 1488 NVXXXXXXXXXXXXXRGAGNQESLKQKQGGTSDISKDINTKQGEQTNPVHEQVVEKGEPN 1309
                           +   N E + + +       + + T+ G ++  ++  V ++G   
Sbjct: 269  KRVMQGPERAKPSDEKNKINTEEIGKDKQPVERRERLVRTENGNESIDIN--VKKQG--- 323

Query: 1308 PKEWNIIRKKGPRET 1264
              EW  + K G   T
Sbjct: 324  -MEWREVMKAGKSGT 337



 Score = 21.2 bits (43), Expect(2) = 7e-31
 Identities = 9/13 (69%), Positives = 10/13 (76%)
 Frame = -1

Query: 2165 LSICPSTQRKSFL 2127
            L I P TQ+KSFL
Sbjct: 54   LPISPRTQKKSFL 66


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  142 bits (358), Expect = 7e-31
 Identities = 80/262 (30%), Positives = 131/262 (50%), Gaps = 12/262 (4%)
 Frame = -3

Query: 2013 IPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFK 1834
            +P+ +++ S    + L G +  ++ +  H++I +  E+D NR+W   +W I +  MRVFK
Sbjct: 1    MPKLQDVRSAFKGIGLTGAYEVRWLDYKHVLIHLSNEQDCNRVWTKQVWFIANQKMRVFK 60

Query: 1833 WTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMAR 1654
            WTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R ++AR
Sbjct: 61   WTPDFEPEKESAVVPVWIAFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVAR 120

Query: 1653 ICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMN 1495
            +CIE D  +  ++++ +       G V   + Q++ + ++P YC+ C HVGH    C + 
Sbjct: 121  VCIEYDCRRSPIDQVWIVVQNRETGTVTSGYPQRVEFSQMPAYCDHCCHVGHKEIDCIVL 180

Query: 1494 GNNVXXXXXXXXXXXXXRGAGNQESLK-----QKQGGTSDISKDINTKQGEQTNPVHEQV 1330
            GN                G    +SL+     +K G      K++  ++    NP  E++
Sbjct: 181  GNK-----------DKSLGRSKSQSLRALTVEKKTGYGGGSEKNLEKRK----NPEKEKI 225

Query: 1329 VEKGEPNPKEWNIIRKKGPRET 1264
            V   EP    W  + K G   T
Sbjct: 226  VRPEEPASLRWKQVSKAGTSGT 247


>gb|EOY19056.1| Uncharacterized protein TCM_043721 [Theobroma cacao]
          Length = 359

 Score =  135 bits (341), Expect = 6e-29
 Identities = 68/159 (42%), Positives = 94/159 (59%), Gaps = 7/159 (4%)
 Frame = -3

Query: 1947 KFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIRLPG 1768
            K+ N  HI+I +  E+DFNR+W    W I +  MRVFKWTP F +++E    PVWI  P 
Sbjct: 20   KWMNYKHILIHLSNEQDFNRIWTKQTWFIANQKMRVFKWTPEFETEKEPSTVPVWISFPN 79

Query: 1767 LPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEE--IVLCSD 1594
            L  H F+  AL  IAK IG PL +D  TA  TR ++AR+CIE D LK  ++   IV+   
Sbjct: 80   LKAHLFEKSALLLIAKAIGNPLWIDEATANGTRPSVARVCIEYDCLKLPVDSVWIVVSKR 139

Query: 1593 GAV-----HVQKIVYERVPEYCEFCKHVGHNIQACYMNG 1492
            G+      ++QK+ +  + EYC  C HVGH++  C + G
Sbjct: 140  GSKDMLGGYLQKVEFSPMSEYCNHCCHVGHSVSECLIVG 178


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  134 bits (338), Expect = 1e-28
 Identities = 64/182 (35%), Positives = 104/182 (57%), Gaps = 7/182 (3%)
 Frame = -3

Query: 2013 IPQPKEITSGLSSLKLKGPHSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFK 1834
            +P+ +++ +    + L G +  ++ +  H++I +  E+DFNR+W    W I    MRVFK
Sbjct: 1    MPKLQDVRAAFKGIALTGAYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKMRVFK 60

Query: 1833 WTPSFNSKEEAPLAPVWIRLPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMAR 1654
            WTP F  ++E+ + PVWI  P L  H F+  AL  IAK +G PL VD  TA  +R ++AR
Sbjct: 61   WTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRPSVAR 120

Query: 1653 ICIELDLLKERLEEIVLC----SDGAV---HVQKIVYERVPEYCEFCKHVGHNIQACYMN 1495
            +C+E D  K  ++++ +       G V   + Q++ + ++P YC+ C HVGH    C + 
Sbjct: 121  VCVEYDCRKSPVDQVWIVVQNRKTGEVMNGYSQRVEFAQMPAYCDHCCHVGHKETDCILL 180

Query: 1494 GN 1489
            GN
Sbjct: 181  GN 182


>gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
          Length = 1707

 Score =  127 bits (319), Expect = 2e-26
 Identities = 70/188 (37%), Positives = 103/188 (54%)
 Frame = -3

Query: 2136 IVPLGTTSFQDGRKVLNYSSVETNRLAGAWKLTLIGKFSFAIPQPKEITSGLSSLKLKGP 1957
            I P   T     +  + +   E   LA +++ +++GKFS   P+  EI      L L G 
Sbjct: 95   IPPTRATFRYKDKPAVRFYEDEIETLAKSFRFSIVGKFS-RTPRLVEIRQAFVGLGLSGA 153

Query: 1956 HSWKFANPSHIIIKVQLEEDFNRLWMGTIWSIRDCPMRVFKWTPSFNSKEEAPLAPVWIR 1777
            ++ ++ +  H++I +  E+DFNR+W    W I    MRVFK TP+F S +E+ + PVWI 
Sbjct: 154  YNIRWMDYKHVLIHLSNEQDFNRIWTKQTWFIAKQKMRVFKGTPNFESDKESSIVPVWIS 213

Query: 1776 LPGLPIHFFDYHALYSIAKEIGTPLQVDSPTAKKTRLTMARICIELDLLKERLEEIVLCS 1597
             P L  H F+  AL  IAK IG PL VD  TA  TR ++AR+CIE D LK  ++ + + +
Sbjct: 214  FPNLRAHLFEKSALLLIAKAIGNPLGVDEATANGTRPSVARVCIEYDCLKSPIKSVWIVT 273

Query: 1596 DGAVHVQK 1573
               V  QK
Sbjct: 274  SKRVLGQK 281


Top