BLASTX nr result

ID: Mentha28_contig00015872 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00015872
         (731 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   117   3e-24
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   112   1e-22
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   112   1e-22
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...   107   4e-21
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...   107   5e-21
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...   106   7e-21
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   105   2e-20
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   103   8e-20
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   102   2e-19
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...   101   3e-19
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...   100   9e-19
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    96   1e-17
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...    96   2e-17
ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao...    94   6e-17
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    91   5e-16
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    90   9e-16
ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobrom...    87   4e-15
ref|XP_007014629.1| Uncharacterized protein TCM_039895 [Theobrom...    87   6e-15
ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobrom...    84   5e-14
ref|XP_007022459.1| RNase H family protein [Theobroma cacao] gi|...    80   9e-13

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  117 bits (294), Expect = 3e-24
 Identities = 79/226 (34%), Positives = 108/226 (47%), Gaps = 3/226 (1%)
 Frame = +3

Query: 3    HAHTSTDIAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIW 182
            + H   +I   L  W +S       HI  LI   I WF+  ERN  KH  +      IIW
Sbjct: 1066 YVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIW 1125

Query: 183  QVKHHLHTLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTD 362
            ++   L  L     L    W+G               R  R +I+ WI P    +KLN D
Sbjct: 1126 RIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVD 1185

Query: 363  GSFDAASESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIW 536
            GS     ++ AGGG++ DH G L+  FS     ++  + EL AL  GL L + ++ S +W
Sbjct: 1186 GSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYNVSRVW 1245

Query: 537  IEMDAAIVTLLTSGKH-GSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            IE+DA +V  +    H GS  I++L+  IR  LQ I VR SHIHRE
Sbjct: 1246 IEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHRE 1291


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  112 bits (281), Expect = 1e-22
 Identities = 69/212 (32%), Positives = 103/212 (48%), Gaps = 3/212 (1%)
 Frame = +3

Query: 45   WRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARK 224
            W +S   +   HI  L+     WF+  ERN  KH  +    + ++W++   LH L   ++
Sbjct: 1959 WFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQ 2018

Query: 225  LLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGG 404
            L    WQG         +          +++ W+ P    +KLN DGS     +S AGGG
Sbjct: 2019 LQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGG 2078

Query: 405  LVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAI-VTLLTS 575
            L+ DH G+++  FS     +   + EL AL  GL L +  + S +WIEMDA + V ++  
Sbjct: 2079 LLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKE 2138

Query: 576  GKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            G  GS+  R+L+  I   L GI  R SHI RE
Sbjct: 2139 GHQGSSRTRYLLASIHRCLSGISFRISHIFRE 2170


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  112 bits (281), Expect = 1e-22
 Identities = 71/226 (31%), Positives = 107/226 (47%), Gaps = 3/226 (1%)
 Frame = +3

Query: 3    HAHTSTDIAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIW 182
            H      I H +  W +S   +   HI  L+   I WF+  ERN  KH  +    + I+W
Sbjct: 3233 HIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVW 3292

Query: 183  QVKHHLHTLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTD 362
            ++   +H L   ++L    WQG         +          +++ W  P     KLN D
Sbjct: 3293 KILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVD 3352

Query: 363  GSFDAASESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIW 536
            GS     ++ AGGGL+ DH G+++  FS    ++   + EL AL  GL L +  + + +W
Sbjct: 3353 GSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLLCIDHNVTRLW 3412

Query: 537  IEMDAAI-VTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            IEMDA + V ++  G  GS+  R+L+  I   L GI  R SHI RE
Sbjct: 3413 IEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFRE 3458



 Score =  109 bits (273), Expect = 8e-22
 Identities = 73/201 (36%), Positives = 103/201 (51%), Gaps = 3/201 (1%)
 Frame = +3

Query: 78   HISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARKLLPVHWQGCHP 257
            HI  LI   I WF+  ERN  KH  +    + +IW++   L+ L     L    W+G   
Sbjct: 1464 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTD 1523

Query: 258  QVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDH-GALLG 434
              +      P       +I+ WI P     KLN DGS   +S++ AGGG++ DH G L  
Sbjct: 1524 IATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAF 1582

Query: 435  AFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAI-VTLLTSGKHGSADIRHL 608
            AFS  +      + EL ALL GL L    + +++WIEMDA + V ++   + GS DIR+L
Sbjct: 1583 AFSENLGPLPSLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYL 1642

Query: 609  MTRIRLRLQGIQVRFSHIHRE 671
            +  IRL L+    R SHI+RE
Sbjct: 1643 LESIRLCLRSFSYRISHIYRE 1663


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  107 bits (267), Expect = 4e-21
 Identities = 71/215 (33%), Positives = 100/215 (46%), Gaps = 3/215 (1%)
 Frame = +3

Query: 36   LGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVT 215
            +G W HS       HI  L+   I WF+  ERN  KH  +    + ++W+V   +  L  
Sbjct: 652  IGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 711

Query: 216  ARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVA 395
             ++LL   W+G         +          ++  W  P     KLN DGS    S + A
Sbjct: 712  GQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGS-AKHSHNAA 770

Query: 396  GGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAIVTLL 569
            GGG++ DH G ++  FS  +  ++  + EL AL  GL L   ++   +WIEMDA  V  L
Sbjct: 771  GGGILRDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRL 830

Query: 570  TSGKH-GSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
              G H G   IR+LM  +R  L     RFSHI RE
Sbjct: 831  LQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFRE 865


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  107 bits (266), Expect = 5e-21
 Identities = 72/201 (35%), Positives = 102/201 (50%), Gaps = 3/201 (1%)
 Frame = +3

Query: 78   HISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARKLLPVHWQGCHP 257
            HI  LI   I WF+  ERN  KH  +    + +IW++   L+ L     L    W+G   
Sbjct: 1707 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTD 1766

Query: 258  QVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDH-GALLG 434
              +      P       +I+ WI P     KLN DGS   ++ + AGGG++ DH G L  
Sbjct: 1767 IATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGS-SKSNLNAAGGGVLRDHTGKLAF 1825

Query: 435  AFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAI-VTLLTSGKHGSADIRHL 608
            AFS  +      + EL ALL GL L    + +++WIEMDA + V ++   + GS DIR+L
Sbjct: 1826 AFSENLGPLPSLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYL 1885

Query: 609  MTRIRLRLQGIQVRFSHIHRE 671
            +  IRL L+    R SHI+RE
Sbjct: 1886 LESIRLCLRSFSYRISHIYRE 1906


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
            gi|508787491|gb|EOY34747.1| Uncharacterized protein
            TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  106 bits (265), Expect = 7e-21
 Identities = 72/212 (33%), Positives = 100/212 (47%), Gaps = 3/212 (1%)
 Frame = +3

Query: 45   WRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARK 224
            W +S       HI  LI   I WF+  ERN  KH  +   S  ++W++   L  L     
Sbjct: 759  WYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSL 818

Query: 225  LLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGG 404
            L    W+G     +      P+      +I+ W+ P     KLN DGS    ++S A GG
Sbjct: 819  LKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGS-SRHNQSAATGG 877

Query: 405  LVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDA-AIVTLLTS 575
            L+ DH G L+  FS  +   +  + EL ALL GL L    +   +WIEMDA  ++ ++  
Sbjct: 878  LLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQ 937

Query: 576  GKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
             K GS DIR+L+  IR  L     R SHI RE
Sbjct: 938  SKKGSHDIRYLLASIRKCLSFFSFRISHIFRE 969


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  105 bits (262), Expect = 2e-20
 Identities = 70/215 (32%), Positives = 100/215 (46%), Gaps = 3/215 (1%)
 Frame = +3

Query: 36   LGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVT 215
            +G W +S       HI  L+   I WF+  ERN  KH  +    + ++W+V   +  L  
Sbjct: 1993 IGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 2052

Query: 216  ARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVA 395
             ++LL   W+G         +          ++  W  P     KLN DGS    S + A
Sbjct: 2053 GQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGS-AKQSHNAA 2111

Query: 396  GGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAIVTLL 569
            GGG++ DH G ++  FS  +  ++  + EL AL  GL L   ++   +WIEMDA  V  L
Sbjct: 2112 GGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRL 2171

Query: 570  TSGKH-GSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
              G H G   IR+LM  +R  L     RFSHI RE
Sbjct: 2172 LQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFRE 2206


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  103 bits (256), Expect = 8e-20
 Identities = 74/219 (33%), Positives = 100/219 (45%), Gaps = 3/219 (1%)
 Frame = +3

Query: 24   IAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLH 203
            ++H L  W +S       HI  L+   I WF+  ERN  K+      +  I+W++   L 
Sbjct: 1953 VSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLR 2012

Query: 204  TLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAAS 383
             L     L    W+G     +       +      +IV W  P     KLN DGS     
Sbjct: 2013 QLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGS-SRHG 2071

Query: 384  ESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDA-A 554
            +  A GG++ DH G L+  FS  +   +  + EL ALL GL L        +WIEMDA A
Sbjct: 2072 QHAASGGVLRDHTGKLIFGFSENIGTCNSLQAELRALLRGLLLCKERHIEKLWIEMDALA 2131

Query: 555  IVTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
             + LL   + GS DIR+L+  IR  L  I  R SHIHRE
Sbjct: 2132 AIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHRE 2170


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  102 bits (253), Expect = 2e-19
 Identities = 68/215 (31%), Positives = 101/215 (46%), Gaps = 3/215 (1%)
 Frame = +3

Query: 36   LGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVT 215
            LG W +S       HI  L+     WF+  ERN  KH  +    + I+W++   +  L  
Sbjct: 1991 LGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSL 2050

Query: 216  ARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVA 395
             ++LL   W+G         +          ++  W  P     KLN DGS    S++ A
Sbjct: 2051 GQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGS-AKLSQNAA 2109

Query: 396  GGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAA-IVTL 566
            GGG++ DH G ++  FS  +  ++  + EL AL  GL L   ++   +WIEMDAA ++ L
Sbjct: 2110 GGGVLRDHAGVMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRL 2169

Query: 567  LTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            L   + G   IR+L+  IR  L     R SHI RE
Sbjct: 2170 LQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFRE 2204


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  101 bits (251), Expect = 3e-19
 Identities = 69/220 (31%), Positives = 103/220 (46%), Gaps = 3/220 (1%)
 Frame = +3

Query: 21   DIAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHL 200
            +++  L  W  S       HI  LI   I WF+  ERN  KH  +   S  ++W++   L
Sbjct: 1691 NVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLL 1750

Query: 201  HTLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAA 380
              L     L    W+G     +   +  P       +I+ W+ P     KLN DGS    
Sbjct: 1751 RQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGS-SRQ 1809

Query: 381  SESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAA 554
            +++ A GG++ DH G L+  FS  +   +  + EL ALL GL L    +   +W+EMDA 
Sbjct: 1810 NQTAAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDAL 1869

Query: 555  I-VTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            + + ++   + GS DIR+L+  IR  L     R SHI RE
Sbjct: 1870 VAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFRE 1909


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
            gi|508710337|gb|EOY02234.1| Uncharacterized protein
            TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 99.8 bits (247), Expect = 9e-19
 Identities = 72/219 (32%), Positives = 100/219 (45%), Gaps = 3/219 (1%)
 Frame = +3

Query: 24   IAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLH 203
            ++H L  W +S       HI  L+   I WF+  ERN  KH      +  ++W++   L 
Sbjct: 665  VSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLR 724

Query: 204  TLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAAS 383
             L     L    W+G     +       +      +IV W  P     KLN DGS     
Sbjct: 725  QLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGS-SRHG 783

Query: 384  ESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDA-A 554
            +  A GG++ DH G L+  FS  +   +  + EL ALL GL L        +WIEMDA A
Sbjct: 784  QHAASGGVLRDHTGKLIFGFSENIGNCNSLQAELRALLRGLLLCKERHIEQLWIEMDALA 843

Query: 555  IVTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            ++ L+   + GS DIR+L+  IR  L  I  R SHI RE
Sbjct: 844  VIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHILRE 882


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 95.9 bits (237), Expect = 1e-17
 Identities = 71/219 (32%), Positives = 97/219 (44%), Gaps = 3/219 (1%)
 Frame = +3

Query: 24   IAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLH 203
            ++  L  W  S       HI  L+   I WF+  ERN  KH         ++W++   L 
Sbjct: 617  VSQILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLR 676

Query: 204  TLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAAS 383
             L+    L    W+G     S              +I+ W  P     KLN DGS     
Sbjct: 677  QLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGS-SRNG 735

Query: 384  ESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDA-A 554
               A GG++ DH G L+  FS  +   +  + EL ALL GL L       ++WIEMDA A
Sbjct: 736  HLAASGGILRDHTGKLIFGFSENIGLCNSLQAELRALLRGLLLCKERHIENLWIEMDALA 795

Query: 555  IVTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            ++ L+   + GS DIR+L+  IR  L  I  R SHI RE
Sbjct: 796  VIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFRE 834


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 95.5 bits (236), Expect = 2e-17
 Identities = 64/200 (32%), Positives = 86/200 (43%), Gaps = 2/200 (1%)
 Frame = +3

Query: 78   HISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARKLLPVHWQGCHP 257
            H   L+   I WF+  ERN  KH         +IW+   H   L     L    W+G   
Sbjct: 888  HFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 947

Query: 258  QVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDHGALLGA 437
              + +  + P  +    +I+ W  P     KLN DGS      +  GG L    G L+  
Sbjct: 948  IAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRDHTGKLIFG 1007

Query: 438  FSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDA-AIVTLLTSGKHGSADIRHLM 611
            FS  +   +  + EL ALL GL L        +WIEMDA A + L+   K G  DIR+L+
Sbjct: 1008 FSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYLL 1067

Query: 612  TRIRLRLQGIQVRFSHIHRE 671
              IR+ L     R SH  RE
Sbjct: 1068 ESIRMCLSSFSYRLSHTFRE 1087


>ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao]
           gi|508779273|gb|EOY26529.1| Ribonuclease H-like protein
           [Theobroma cacao]
          Length = 458

 Score = 93.6 bits (231), Expect = 6e-17
 Identities = 66/200 (33%), Positives = 90/200 (45%), Gaps = 3/200 (1%)
 Frame = +3

Query: 81  ISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARKLLPVHWQGCHPQ 260
           IS LI   I WF+  ERN  KH  +      ++W+    L  L     L    W+     
Sbjct: 218 ISALIPLFICWFLWLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDI 277

Query: 261 VSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDH-GALLGA 437
            +      P       +I+ W+ P     KLN DGS     +S   GGL+ DH G L+  
Sbjct: 278 AAMWSFLFPPKHGTTPQIIHWVKPFTGEYKLNVDGS-SRNCQSATSGGLLRDHIGKLVFG 336

Query: 438 FSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDA-AIVTLLTSGKHGSADIRHLM 611
           FS  +   +  + EL ALL  L L        +WIEMDA  ++ ++   + GS DIR+L+
Sbjct: 337 FSENIGRCNSLQAELRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYLL 396

Query: 612 TRIRLRLQGIQVRFSHIHRE 671
           T IR  L  I  R  HI RE
Sbjct: 397 TSIRKGLSSISYRILHIFRE 416


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 90.5 bits (223), Expect = 5e-16
 Identities = 61/200 (30%), Positives = 87/200 (43%), Gaps = 2/200 (1%)
 Frame = +3

Query: 78   HISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARKLLPVHWQGCHP 257
            H   L+   I WF+  ERN  KH      +  +IW+   H   L     L    W+G   
Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 1943

Query: 258  QVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDHGALLGA 437
              + +  +    +    +I+ W  P     KLN DGS      +  GG L    G L+  
Sbjct: 1944 IATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRDHTGKLIFG 2003

Query: 438  FSSPVEARSIFEVELSALLHGLDLA-VTFSSHIWIEMDAAI-VTLLTSGKHGSADIRHLM 611
            FS  +   +  + EL ALL GL L        +WIEMDA + + L+   K G  ++R+L+
Sbjct: 2004 FSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLL 2063

Query: 612  TRIRLRLQGIQVRFSHIHRE 671
              IR+ L     R SHI RE
Sbjct: 2064 ESIRMCLSSFSYRLSHILRE 2083


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 89.7 bits (221), Expect = 9e-16
 Identities = 68/220 (30%), Positives = 99/220 (45%), Gaps = 3/220 (1%)
 Frame = +3

Query: 21   DIAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHL 200
            +++  L  W  S       HI  LI   I WF+  ERN  K   +   S  ++W++   L
Sbjct: 1271 NVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLL 1330

Query: 201  HTLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAA 380
              L     L    W+G     +              +I  W+   +   KLN DGS    
Sbjct: 1331 RQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGS-SRQ 1389

Query: 381  SESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAA 554
            ++S A GGL+ DH G L+  FS  +   +  + EL ALL GL L    +   +WIEMDA 
Sbjct: 1390 NQSAAIGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDAL 1449

Query: 555  I-VTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
            + + ++   + GS DI++L+  IR  L     R SHI RE
Sbjct: 1450 VAIQMIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFRE 1489



 Score = 68.9 bits (167), Expect = 2e-09
 Identities = 47/124 (37%), Positives = 66/124 (53%), Gaps = 3/124 (2%)
 Frame = +3

Query: 309  RIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELS 485
            +I+ W  P     KLN DG    A ++ A GG+  DH   ++  FS      +  + EL 
Sbjct: 1534 KIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELM 1593

Query: 486  ALLHGLDLAVTFS-SHIWIEMDA-AIVTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSH 659
            AL  GL L   ++ S +WIE+DA AIV +L  G  G +  ++L++ I   L GI  R SH
Sbjct: 1594 ALHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISH 1653

Query: 660  IHRE 671
            IHRE
Sbjct: 1654 IHRE 1657


>ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
            gi|508778191|gb|EOY25447.1| Uncharacterized protein
            TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 87.4 bits (215), Expect = 4e-15
 Identities = 67/213 (31%), Positives = 97/213 (45%), Gaps = 3/213 (1%)
 Frame = +3

Query: 42   LWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTAR 221
            LW +S  +     I  L+   I WF+  ERN  KH      +  ++W++   L  L    
Sbjct: 877  LWGNSVAKQG--RIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDS 934

Query: 222  KLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGG 401
             L    W+G     +       + +    +IV W  P     KLN DGS     +  A G
Sbjct: 935  LLQQWQWKGDTDIAAMWRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGS-SRNGQHAASG 993

Query: 402  GLVCDHGA-LLGAFSSPVEARSIFEVELSALLHGLDLAVT-FSSHIWIEMDA-AIVTLLT 572
            G++ DH + L+  FS  +   +  + EL AL  GL L        +WIEMDA A++ L+ 
Sbjct: 994  GVLRDHTSKLIFCFSENIGTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIP 1053

Query: 573  SGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
              + GS DIR+L+  I+  L  I  R SHI RE
Sbjct: 1054 HSQKGSHDIRYLLESIKKCLNSISYRISHIFRE 1086


>ref|XP_007014629.1| Uncharacterized protein TCM_039895 [Theobroma cacao]
           gi|508784992|gb|EOY32248.1| Uncharacterized protein
           TCM_039895 [Theobroma cacao]
          Length = 206

 Score = 87.0 bits (214), Expect = 6e-15
 Identities = 53/124 (42%), Positives = 71/124 (57%), Gaps = 3/124 (2%)
 Frame = +3

Query: 309 RIVRWIPPEAPWVKLNTDGSFDAASESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELS 485
           +++ W  P     KLN DGS   A ++ AGGGL+ DH G L+  FS      ++ + +L 
Sbjct: 41  KLISWHKPLIGEFKLNADGSSKDAFQNAAGGGLLRDHTGNLIFGFSENFGPANLLQAKLM 100

Query: 486 ALLHGLDLAVTFS-SHIWIEMDAAIVT-LLTSGKHGSADIRHLMTRIRLRLQGIQVRFSH 659
           AL  GL L + ++ S IWIEMDA IV  ++  G  GS   R+L+  IR  L G   RFSH
Sbjct: 101 ALHRGLFLCIEYNISSIWIEMDAKIVVQMIHEGHQGSYQTRYLLAFIRKCLSGFTFRFSH 160

Query: 660 IHRE 671
           IHRE
Sbjct: 161 IHRE 164


>ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobroma cacao]
           gi|508716897|gb|EOY08794.1| Uncharacterized protein
           TCM_023960 [Theobroma cacao]
          Length = 303

 Score = 84.0 bits (206), Expect = 5e-14
 Identities = 65/226 (28%), Positives = 93/226 (41%), Gaps = 3/226 (1%)
 Frame = +3

Query: 3   HAHTSTDIAHRLGLWRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIW 182
           + H   ++ H L  W +S       HI  L+  LI WF+  ERN  KH  +    + +IW
Sbjct: 81  YVHNPQNVLHILHPWYYSGDYVKPGHIRILLPLLIMWFLWVERNDAKHKELKMYPNRVIW 140

Query: 183 QVKHHLHTLVTARKLLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTD 362
           ++   L                                                 +L  D
Sbjct: 141 RIMRMLR------------------------------------------------QLYQD 152

Query: 363 GSFDAASESVAGGGLVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIW 536
           GS   A ++ A GG++ DH   ++  F       S  + EL AL  GL L   ++ S +W
Sbjct: 153 GSSKEAFQNAASGGVLRDHTSTMIFGFFENFGPYSSIQAELMALHRGLLLCNEYNISRVW 212

Query: 537 IEMDA-AIVTLLTSGKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
           IEMDA AIV +L  G  GS+  R+L++ I   L GI  R SHIHR+
Sbjct: 213 IEMDAKAIVQMLHKGHKGSSRTRYLLSSIHQCLSGISYRISHIHRQ 258


>ref|XP_007022459.1| RNase H family protein [Theobroma cacao]
           gi|508722087|gb|EOY13984.1| RNase H family protein
           [Theobroma cacao]
          Length = 429

 Score = 79.7 bits (195), Expect = 9e-13
 Identities = 66/212 (31%), Positives = 86/212 (40%), Gaps = 3/212 (1%)
 Frame = +3

Query: 45  WRHSFPRATCTHISFLISCLITWFI*TERNSHKHCGIPFRSSNIIWQVKHHLHTLVTARK 224
           W  S       HI  LI   I WF+  ERN  KH                +L      + 
Sbjct: 204 WLFSSDYTKKGHIHILIPLFIFWFLWVERNDAKH---------------RNLGMYPNRKP 248

Query: 225 LLPVHWQGCHPQVSFMPVAEPVSRSLRSRIVRWIPPEAPWVKLNTDGSFDAASESVAGGG 404
            LP                       + ++  W  P     KLN DG      +S AGG 
Sbjct: 249 SLP-----------------------KPKVFSWQKPLTGEFKLNVDGGSKYDCQSAAGGR 285

Query: 405 LVCDH-GALLGAFSSPVEARSIFEVELSALLHGLDLAVTFS-SHIWIEMDAAIVT-LLTS 575
           L+ DH G L+ +F       +  + EL AL  GL L +  +   +WIEMDA +V  ++  
Sbjct: 286 LLRDHTGTLIFSFVENFGPYNSLQAELMALYRGLLLCIEHNVRRLWIEMDAKVVIQMIHR 345

Query: 576 GKHGSADIRHLMTRIRLRLQGIQVRFSHIHRE 671
           G  GSA IR+L+  IR  L  I  R SHIHRE
Sbjct: 346 GHKGSAQIRYLLASIRKCLSVISFRISHIHRE 377


Top