BLASTX nr result

ID: Mentha28_contig00026791 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00026791
         (991 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...   110   1e-21
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   108   3e-21
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   108   3e-21
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...   107   6e-21
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   106   2e-20
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...   105   2e-20
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   105   4e-20
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   104   5e-20
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   102   2e-19
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   101   4e-19
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   100   9e-19
ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao...   100   2e-18
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...   100   2e-18
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    99   3e-18
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    96   2e-17
ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobrom...    92   3e-16
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    88   6e-15
ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobrom...    85   5e-14
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...    80   1e-12
ref|XP_007022459.1| RNase H family protein [Theobroma cacao] gi|...    79   3e-12

>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  110 bits (274), Expect = 1e-21
 Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAV-MR 771
            W HS       HI  L+P  I WF+ +ERN  KHR      + ++W+V   +  L++  +
Sbjct: 655  WFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQ 714

Query: 770  LLPVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            LL   W+G +       I          ++  W  P     KLN +G             
Sbjct: 715  LLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGI 774

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L RDH G ++  F   +   +S + EL      L L   ++   + IEMDA +++ LL  
Sbjct: 775  L-RDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQG 833

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
               G   IR+LM  +R  L     RFSHI  EGN+ A+F+A RG +  ++ +F V     
Sbjct: 834  NHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQGK- 892

Query: 233  YFLALVRMDQLGYP 192
                ++R+DQ  +P
Sbjct: 893  -LRGMLRLDQTSFP 905


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  108 bits (271), Expect = 3e-21
 Identities = 77/248 (31%), Positives = 115/248 (46%), Gaps = 2/248 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQP 738
            HI  LIP  I WF+ +ERN  KHR      + +IW++   L+ L    LL    W+G   
Sbjct: 1707 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTD 1766

Query: 737  QVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLR 558
                     P       +I+ WI P     KLN +G            G++RDH G L  
Sbjct: 1767 IATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGS-SKSNLNAAGGGVLRDHTGKLAF 1825

Query: 557  AFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
            AF   +    S + EL   L  L L    + + + IEMDA   V ++   + GS DIR+L
Sbjct: 1826 AFSENLGPLPSLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYL 1885

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQL 201
            +  IRL L+    R SHI+ EGN+ A+F++ +G     + +F    A    + ++++D+L
Sbjct: 1886 LESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVFS--EAQGELIGILKLDKL 1943

Query: 200  GYPNFMLR 177
              P    R
Sbjct: 1944 NLPYVRFR 1951


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  108 bits (270), Expect = 3e-21
 Identities = 77/247 (31%), Positives = 114/247 (46%), Gaps = 2/247 (0%)
 Frame = -3

Query: 989  HAHTSTDIARSLRWWRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIW 810
            + H   +I + L  W +S       HI  LI   I WF+ +ERN  KHR        IIW
Sbjct: 1066 YVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIW 1125

Query: 809  QVKHHLHTLAVMRLL-PVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTN 633
            ++   L  L    LL    W+G               R  R +I+ WI P    +KLN +
Sbjct: 1126 RIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVD 1185

Query: 632  GLFDXXXXXXXXXGLVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V* 456
            G            G++RDH G L+  F       +S + EL      L L   ++ S V 
Sbjct: 1186 GSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYNVSRVW 1245

Query: 455  IEMDAAAIVTLLASRKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQ 276
            IE+DA  ++ ++ +   GS  I++L+  IR  LQ I VR SHIH EGN+ A+F+++ G  
Sbjct: 1246 IEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSKHGHT 1305

Query: 275  TDDMALF 255
              ++ +F
Sbjct: 1306 HQNLHVF 1312


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
            gi|508787491|gb|EOY34747.1| Uncharacterized protein
            TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  107 bits (268), Expect = 6e-21
 Identities = 74/254 (29%), Positives = 115/254 (45%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
            W +S       HI  LIP  I WF+ +ERN  KHR     S  ++W++   L  L    L
Sbjct: 759  WYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSL 818

Query: 767  LPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            L    W+G            P+      +I+ W+ P     KLN +G            G
Sbjct: 819  LKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGS-SRHNQSAATGG 877

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L+RDH G L+  F   +  ++S + EL   L  L L    +   + IEMDA  ++ ++  
Sbjct: 878  LLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQ 937

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
             K GS DIR+L+  IR  L     R SHI  EGN+ A+F++ +G    ++ +        
Sbjct: 938  SKKGSHDIRYLLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQNLQVISEAQGKL 997

Query: 233  YFLALVRMDQLGYP 192
            +   ++++D+L  P
Sbjct: 998  H--GMLKLDRLNLP 1009


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  106 bits (264), Expect = 2e-20
 Identities = 74/262 (28%), Positives = 119/262 (45%), Gaps = 2/262 (0%)
 Frame = -3

Query: 971  DIARSLRWWRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHL 792
            ++++ L  W  S       HI  LIP  I WF+ +ERN  KHR     S  ++W++   L
Sbjct: 1691 NVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLL 1750

Query: 791  HTLAVMRLLPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXX 615
              L    LL    W+G +       +  P       +I+ W+ P     KLN +G     
Sbjct: 1751 RQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGS-SRQ 1809

Query: 614  XXXXXXXGLVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAA 438
                   G++RDH G L+  F   +  ++S + EL   L  L L    +   + +EMDA 
Sbjct: 1810 NQTAAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDAL 1869

Query: 437  AIVTLLASRKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMAL 258
              + ++   + GS DIR+L+  IR  L     R SHI  EGN+ A+F++ +G     + +
Sbjct: 1870 VAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHV 1929

Query: 257  FDVVSAPCYFLALVRMDQLGYP 192
            F       Y   ++++D+L  P
Sbjct: 1930 FTEAQGKLY--GMLKLDRLNLP 1949


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  105 bits (263), Expect = 2e-20
 Identities = 71/248 (28%), Positives = 110/248 (44%), Gaps = 2/248 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQP 738
            H   L+P  I WF+ +ERN  KHR T      +IW+   H   L    LL    W+G   
Sbjct: 888  HFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 947

Query: 737  QVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLR 558
                +  + P  +    +I+ W  P     KLN +G            G++RDH G L+ 
Sbjct: 948  IAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGS-SRNGLHAATGGVLRDHTGKLIF 1006

Query: 557  AFCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
             F   +   +S + EL   L  L L        + IEMDA A + L+   K G  DIR+L
Sbjct: 1007 GFSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYL 1066

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQL 201
            +  IR+ L     R SH   EGN+ A++++  G +  ++ +F       +   ++++D+L
Sbjct: 1067 LESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVFTEAQGQLH--GMLKLDRL 1124

Query: 200  GYPNFMLR 177
              P    R
Sbjct: 1125 NLPYVRFR 1132


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  105 bits (261), Expect = 4e-20
 Identities = 76/254 (29%), Positives = 114/254 (44%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
            W +S       HI  L+P  I WF+ +ERN  K+R +   +  I+W++   L  L    L
Sbjct: 1960 WFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSL 2019

Query: 767  LPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            L    W+G             +      +IV W  P     KLN +G            G
Sbjct: 2020 LQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGS-SRHGQHAASGG 2078

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLAS 414
            ++RDH G L+  F   +   +S + EL   L  L L        + IEMDA A + LL  
Sbjct: 2079 VLRDHTGKLIFGFSENIGTCNSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLLPH 2138

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
             + GS DIR+L+  IR  L  I  R SHIH EGN+ A+F++  G    ++ +F       
Sbjct: 2139 SQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVFTEAQGKL 2198

Query: 233  YFLALVRMDQLGYP 192
            +   ++++D+L  P
Sbjct: 2199 H--GMLKLDRLNLP 2210


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  104 bits (260), Expect = 5e-20
 Identities = 72/254 (28%), Positives = 113/254 (44%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAV-MR 771
            W +S       HI  L+P  I WF+ +ERN  KHR      + ++W+V   +  L++  +
Sbjct: 1996 WFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQ 2055

Query: 770  LLPVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            LL   W+G +       I          ++  W  P     KLN +G             
Sbjct: 2056 LLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGI 2115

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L RDH G ++  F   +   +S + EL      L L   ++   + IEMDA +++ LL  
Sbjct: 2116 L-RDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQG 2174

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
               G   IR+LM  +R  L     RFSHI  EGN+ A+F+A RG +  ++ +F V     
Sbjct: 2175 NHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQGK- 2233

Query: 233  YFLALVRMDQLGYP 192
                ++ +DQ  +P
Sbjct: 2234 -LRGMLCLDQTSFP 2246


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  102 bits (255), Expect = 2e-19
 Identities = 78/240 (32%), Positives = 110/240 (45%), Gaps = 2/240 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQP 738
            HI  LIP  I WF+ +ERN  KHR      + +IW++   L+ L    LL    W+G   
Sbjct: 1464 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTD 1523

Query: 737  QVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLR 558
                     P       +I+ WI P     KLN +G            G++RDH G L  
Sbjct: 1524 IATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAF 1582

Query: 557  AFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
            AF   +    S + EL   L  L L    + + + IEMDA   V ++   + GS DIR+L
Sbjct: 1583 AFSENLGPLPSLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYL 1642

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQL 201
            +  IRL L+    R SHI+ EGN+ A+F++ +G     +    VVS    F +L  M  L
Sbjct: 1643 LESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLC---VVSEAQEFPSLPTMHGL 1699



 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 69/251 (27%), Positives = 112/251 (44%), Gaps = 2/251 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
            W +S   +   HI  L+P  I WF+ +ERN  KHR      + I+W++   +H L   + 
Sbjct: 3247 WFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQ 3306

Query: 767  LPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            L    WQG +       I          +++ W  P     KLN +G            G
Sbjct: 3307 LQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGG 3366

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L+RDH G+++  F     +  S + EL      L L    + + + IEMDA   V ++  
Sbjct: 3367 LLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINE 3426

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
               GSS  R+L+  I   L GI  R SHI  EGN+ A+ ++ +G    ++ +  +  A  
Sbjct: 3427 GHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGYTHQNLQV--ISQAEG 3484

Query: 233  YFLALVRMDQL 201
                ++R+D++
Sbjct: 3485 QLRGILRLDKI 3495


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
            gi|508710337|gb|EOY02234.1| Uncharacterized protein
            TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  101 bits (252), Expect = 4e-19
 Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
            W +S       HI  L+P  I WF+ +ERN  KHR +   +  ++W++   L  L    L
Sbjct: 672  WFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSL 731

Query: 767  LPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            L    W+G             +      +IV W  P     KLN +G            G
Sbjct: 732  LQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGS-SRHGQHAASGG 790

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLAS 414
            ++RDH G L+  F   +   +S + EL   L  L L        + IEMDA A++ L+  
Sbjct: 791  VLRDHTGKLIFGFSENIGNCNSLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPH 850

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
             + GS DIR+L+  IR  L  I  R SHI  EGN+ A+F++  G    ++ +F       
Sbjct: 851  SQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVFTEAQGKL 910

Query: 233  YFLALVRMDQLGYP 192
            +   ++++D+L  P
Sbjct: 911  H--GMLKLDRLNLP 922


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  100 bits (249), Expect = 9e-19
 Identities = 71/254 (27%), Positives = 111/254 (43%), Gaps = 2/254 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAV-MR 771
            W +S       HI  L+P    WF+ +ERN  KHR      + I+W++   +  L++  +
Sbjct: 1994 WFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQ 2053

Query: 770  LLPVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            LL   W+G +       I          ++  W  P     KLN +G             
Sbjct: 2054 LLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGV 2113

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L RDH G ++  F   +   +S + EL      L L   ++   + IEMDAA+++ LL  
Sbjct: 2114 L-RDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQG 2172

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
             + G   IR+L+  IR  L     R SHI  EGN+ A+F+A RG +   + +  V     
Sbjct: 2173 NQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQGK- 2231

Query: 233  YFLALVRMDQLGYP 192
                ++R+DQ   P
Sbjct: 2232 -LRGMLRLDQTSLP 2244


>ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao]
           gi|508779273|gb|EOY26529.1| Ribonuclease H-like protein
           [Theobroma cacao]
          Length = 458

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 72/242 (29%), Positives = 108/242 (44%), Gaps = 2/242 (0%)
 Frame = -3

Query: 911 ISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAV-MRLLPVHWQGCQPQ 735
           IS LIP  I WF+ +ERN  KHR        ++W+    L  L     L    W+  +  
Sbjct: 218 ISALIPLFICWFLWLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDI 277

Query: 734 VPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLRA 555
                   P       +I+ W+ P     KLN +G            GL+RDH G L+  
Sbjct: 278 AAMWSFLFPPKHGTTPQIIHWVKPFTGEYKLNVDGS-SRNCQSATSGGLLRDHIGKLVFG 336

Query: 554 FCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLASRKHGSSDIRHLM 378
           F   +   +S + EL   L RL L        + IEMDA  ++ ++   + GS DIR+L+
Sbjct: 337 FSENIGRCNSLQAELRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYLL 396

Query: 377 TRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQLG 198
           T IR  L  I  R  HI  EGN+ A F++ +G    ++ L  +  A      ++++D+L 
Sbjct: 397 TSIRKGLSSISYRILHIFREGNQAAYFLSNQGYTHQNLCL--ITEAQGELHGMLKLDRLN 454

Query: 197 YP 192
            P
Sbjct: 455 LP 456


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 72/243 (29%), Positives = 109/243 (44%), Gaps = 2/243 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQP 738
            HI  L+P  I WF+ +ERN  KHR T      ++W++   L  L    LL    W+G   
Sbjct: 635  HIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTD 694

Query: 737  QVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLR 558
                             +I+ W  P     KLN +G            G++RDH G L+ 
Sbjct: 695  IASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGS-SRNGHLAASGGILRDHTGKLIF 753

Query: 557  AFCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
             F   +   +S + EL   L  L L        + IEMDA A++ L+   + GS DIR+L
Sbjct: 754  GFSENIGLCNSLQAELRALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYL 813

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQL 201
            +  IR  L  I  R SHI  EGN+ A+++A  G    ++ +  +  A      ++++D+L
Sbjct: 814  LESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLCV--ITEAQGELHGMLKLDRL 871

Query: 200  GYP 192
              P
Sbjct: 872  NLP 874


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 99.0 bits (245), Expect = 3e-18
 Identities = 67/243 (27%), Positives = 109/243 (44%), Gaps = 2/243 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQP 738
            H   L+P  I WF+ +ERN  KHR T   +  +IW+   H   L    LL    W+G   
Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 1943

Query: 737  QVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLR 558
                +  +    +    +I+ W  P     KLN +G            G++RDH G L+ 
Sbjct: 1944 IATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGS-SRNGLHAATGGVLRDHTGKLIF 2002

Query: 557  AFCSPVKAASSFETELSTFLHRLDLATS-FSS*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
             F   +   +S + EL   L  L L        + IEMDA   + L+   K G  ++R+L
Sbjct: 2003 GFSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYL 2062

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQL 201
            +  IR+ L     R SHI  EGN+ A++++  G +  ++ +F       +   ++++D+L
Sbjct: 2063 LESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVFTEAQGQLH--GMLKLDRL 2120

Query: 200  GYP 192
              P
Sbjct: 2121 NLP 2123


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 69/256 (26%), Positives = 108/256 (42%), Gaps = 2/256 (0%)
 Frame = -3

Query: 947  WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
            W +S   +   HI  L+P    WF+ +ERN  KHR      + ++W++   LH L   + 
Sbjct: 1959 WFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQ 2018

Query: 767  LPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXG 591
            L    WQG +       I          +++ W+ P    +KLN +G            G
Sbjct: 2019 LQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGG 2078

Query: 590  LVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLAS 414
            L+RDH G+++  F        S + EL      L L    + S + IEMDA   V ++  
Sbjct: 2079 LLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKE 2138

Query: 413  RKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPC 234
               GSS  R+L+  I   L GI  R SHI  EGN+ A+ ++ +G    ++ +        
Sbjct: 2139 GHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHTHQNLQVISQAEGQL 2198

Query: 233  YFLALVRMDQLGYPNF 186
              +  +    L Y  F
Sbjct: 2199 RGILRLEKINLAYVRF 2214


>ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
            gi|508778191|gb|EOY25447.1| Uncharacterized protein
            TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 92.0 bits (227), Expect = 3e-16
 Identities = 68/228 (29%), Positives = 103/228 (45%), Gaps = 3/228 (1%)
 Frame = -3

Query: 911  ISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPV-HWQGCQPQ 735
            I  L+P  I WF+ +ERN  KHR +   +  ++W++   L  L    LL    W+G    
Sbjct: 888  IRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDI 947

Query: 734  VPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLRA 555
                     + +    +IV W  P     KLN +G            G++RDH   L+  
Sbjct: 948  AAMWRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGS-SRNGQHAASGGVLRDHTSKLIFC 1006

Query: 554  FCSPVKAASSFETELSTFLHR--LDLATSFSS*V*IEMDAAAIVTLLASRKHGSSDIRHL 381
            F   +   +S + EL   LHR  L         + IEMDA A++ L+   + GS DIR+L
Sbjct: 1007 FSENIGTYNSLQAELRA-LHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYL 1065

Query: 380  MTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAP 237
            +  I+  L  I  R SHI  EGN+ A+F++  G    ++ +F     P
Sbjct: 1066 LESIKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVFTKAQGP 1113


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 87.8 bits (216), Expect = 6e-15
 Identities = 67/241 (27%), Positives = 106/241 (43%), Gaps = 2/241 (0%)
 Frame = -3

Query: 971  DIARSLRWWRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHL 792
            ++++ L  W  S       HI  LIP  I WF+ +ERN  K R     S  ++W++   L
Sbjct: 1271 NVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLL 1330

Query: 791  HTLAVMRLLPV-HWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXX 615
              L    +L    W+G                    +I  W+   +   KLN +G     
Sbjct: 1331 RQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGS-SRQ 1389

Query: 614  XXXXXXXGLVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAA 438
                   GL+RDH G L+  F   +  ++S + EL   L  L L    +   + IEMDA 
Sbjct: 1390 NQSAAIGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDAL 1449

Query: 437  AIVTLLASRKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMAL 258
              + ++   + GS DI++L+  IR  L     R SHI  EGN+ A+F++ +G    ++ +
Sbjct: 1450 VAIQMIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKGHTQQNLLV 1509

Query: 257  F 255
            F
Sbjct: 1510 F 1510



 Score = 64.7 bits (156), Expect = 6e-08
 Identities = 48/166 (28%), Positives = 76/166 (45%), Gaps = 1/166 (0%)
 Frame = -3

Query: 686  RIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLRAFCSPVKAASSFETELS 507
            +I+ W  P     KLN +G            G+ RDH   ++  F       +S + EL 
Sbjct: 1534 KIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELM 1593

Query: 506  TFLHRLDLATSFS-S*V*IEMDAAAIVTLLASRKHGSSDIRHLMTRIRLRLQGIQVRFSH 330
                 L L   ++ S V IE+DA AIV +L     G S  ++L++ I   L GI  R SH
Sbjct: 1594 ALHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISH 1653

Query: 329  IHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQLGYP 192
            IH E N+ A++++ +G     + +F    A      ++R+D+   P
Sbjct: 1654 IHRESNQAADYLSNQGHTHQSLQVFS--KAEGELRGMIRLDKSNLP 1697


>ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobroma cacao]
           gi|508716897|gb|EOY08794.1| Uncharacterized protein
           TCM_023960 [Theobroma cacao]
          Length = 303

 Score = 84.7 bits (208), Expect = 5e-14
 Identities = 69/267 (25%), Positives = 109/267 (40%), Gaps = 1/267 (0%)
 Frame = -3

Query: 989 HAHTSTDIARSLRWWRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIW 810
           + H   ++   L  W +S       HI  L+P LI WF+ +ERN  KH+      + +IW
Sbjct: 81  YVHNPQNVLHILHPWYYSGDYVKPGHIRILLPLLIMWFLWVERNDAKHKELKMYPNRVIW 140

Query: 809 QVKHHLHTLAVMRLLPVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNG 630
           ++         MR+L   +Q    +  F   A                            
Sbjct: 141 RI---------MRMLRQLYQDGSSKEAFQNAAS--------------------------- 164

Query: 629 LFDXXXXXXXXXGLVRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*I 453
                       G++RDH   ++  F       SS + EL      L L   ++ S V I
Sbjct: 165 -----------GGVLRDHTSTMIFGFFENFGPYSSIQAELMALHRGLLLCNEYNISRVWI 213

Query: 452 EMDAAAIVTLLASRKHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQT 273
           EMDA AIV +L     GSS  R+L++ I   L GI  R SHIH +GN+  ++++ +G   
Sbjct: 214 EMDAKAIVQMLHKGHKGSSRTRYLLSSIHQCLSGISYRISHIHRQGNQAVDYLSNKGHTH 273

Query: 272 DDMALFDVVSAPCYFLALVRMDQLGYP 192
            ++ +F    A      ++R+D+   P
Sbjct: 274 QNLQVFS--EAEGELKGMIRLDKSNLP 298


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 80.5 bits (197), Expect = 1e-12
 Identities = 65/242 (26%), Positives = 96/242 (39%), Gaps = 1/242 (0%)
 Frame = -3

Query: 914  HISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRLLPVHWQGCQPQ 735
            HI  LIP    WF+ +ERN  KHR                       +LL   W+G +  
Sbjct: 2143 HIRTLIPIFTLWFLWVERNDAKHRNLG-------------------QQLLEWQWKGDKQI 2183

Query: 734  VPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGLVRDHHGALLRA 555
                 I          ++  W  P     KLN +G             L RDH G ++  
Sbjct: 2184 AQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAGGGVL-RDHAGVMIFG 2242

Query: 554  FCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLASRKHGSSDIRHLM 378
            F   +   +S + EL      L L   ++   + IEMDA +++ LL     G   IR+L+
Sbjct: 2243 FSENLGIQNSLKAELLALYRGLILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRYLL 2302

Query: 377  TRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVVSAPCYFLALVRMDQLG 198
              IR  L     R +HI  EGN+ A+F+A RG +   + +  V         ++R+DQ  
Sbjct: 2303 GSIRQLLSHFSFRLTHIFREGNQAADFLANRGHEHQSLQVITVAQGK--LRGMLRLDQTS 2360

Query: 197  YP 192
             P
Sbjct: 2361 LP 2362


>ref|XP_007022459.1| RNase H family protein [Theobroma cacao]
           gi|508722087|gb|EOY13984.1| RNase H family protein
           [Theobroma cacao]
          Length = 429

 Score = 79.0 bits (193), Expect = 3e-12
 Identities = 62/235 (26%), Positives = 93/235 (39%), Gaps = 1/235 (0%)
 Frame = -3

Query: 947 WRHSFPRATCTHISFLIPCLITWFI*MERNSHKHRGTPFRSSNIIWQVKHHLHTLAVMRL 768
           W  S       HI  LIP  I WF+ +ERN  KHR      +                  
Sbjct: 204 WLFSSDYTKKGHIHILIPLFIFWFLWVERNDAKHRNLGMYPNR----------------- 246

Query: 767 LPVHWQGCQPQVPFIPIAEPVSRSLRSRIVRWIPPEAPWIKLNTNGLFDXXXXXXXXXGL 588
                   +P +P            + ++  W  P     KLN +G             L
Sbjct: 247 --------KPSLP------------KPKVFSWQKPLTGEFKLNVDGGSKYDCQSAAGGRL 286

Query: 587 VRDHHGALLRAFCSPVKAASSFETELSTFLHRLDLATSFS-S*V*IEMDAAAIVTLLASR 411
           +RDH G L+ +F       +S + EL      L L    +   + IEMDA  ++ ++   
Sbjct: 287 LRDHTGTLIFSFVENFGPYNSLQAELMALYRGLLLCIEHNVRRLWIEMDAKVVIQMIHRG 346

Query: 410 KHGSSDIRHLMTRIRLRLQGIQVRFSHIHMEGNRPANFMARRGSQTDDMALFDVV 246
             GS+ IR+L+  IR  L  I  R SHIH EGN+ A+ ++ +G    ++ +F  V
Sbjct: 347 HKGSAQIRYLLASIRKCLSVISFRISHIHREGNQAADLLSNQGYMHQNLHVFSQV 401


Top