BLASTX nr result

ID: Gardenia21_contig00014221 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Gardenia21_contig00014221
         (780 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP18110.1| unnamed protein product [Coffea canephora]            103   4e-21
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...    96   8e-18
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    92   2e-17
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    89   1e-16
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    89   4e-16
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...    88   5e-16
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    89   7e-16
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...    88   1e-15
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    87   2e-15
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    85   1e-14
ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobrom...    87   1e-14
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    81   1e-13
ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao...    79   4e-12
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    79   4e-12
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    73   2e-11
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    73   3e-11
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    73   1e-10
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    71   4e-10
ref|XP_007023840.1| Uncharacterized protein TCM_028138 [Theobrom...    70   2e-09
gb|AAT38805.1| hypothetical protein SDM1_47t00008 [Solanum demis...    69   5e-09

>emb|CDP18110.1| unnamed protein product [Coffea canephora]
          Length = 186

 Score =  103 bits (257), Expect(2) = 4e-21
 Identities = 59/157 (37%), Positives = 83/157 (52%), Gaps = 2/157 (1%)
 Frame = -3

Query: 634 WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASFVCPIVKKT 455
           WK+RN ARF   S     +I+ ++ F+ Q+  A+  + A F GD DC +A    P  +  
Sbjct: 30  WKSRNSARFEAGSITPAQVIFRIEEFLDQMGKARAFSRASFAGDRDCPWAGLDGPYKRDK 89

Query: 454 RVMVVRWLKTSHGYFKLNTDGSVFQGMA--GGLLRDSTDVLIFAFDKEVGEVNVLTAESX 281
            V+ V W K S G+ KLNTD SV  G A  GG+LRD    +IFAF KE GE++VL AE+ 
Sbjct: 90  GVVPVSWEKPSLGWVKLNTDASVLHGKAAGGGVLRDHCGRVIFAFYKEFGEMDVLEAEAQ 149

Query: 280 XXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAK 170
                     ++    L  E +S  L HL   + ++K
Sbjct: 150 SLLEGLRMCADRAVGALTVESNSNVLVHLVRSDVVSK 186



 Score = 25.8 bits (55), Expect(2) = 4e-21
 Identities = 8/24 (33%), Positives = 16/24 (66%)
 Frame = -2

Query: 707 FLSVLNLNLNHIRCVLPAIIVWFI 636
           F S   ++  H+R ++P +++WFI
Sbjct: 6   FFSHDRVSTTHVRVLIPLLVLWFI 29


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 95.9 bits (237), Expect(2) = 8e-18
 Identities = 60/195 (30%), Positives = 85/195 (43%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
            W  RN A+          +IW       QL    +L +  ++GD D      F  P  + 
Sbjct: 902  WLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFPPQQH 961

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                ++ W K S G +KLN DGS   G+    GG+LRD T  LIF F + +G  N L AE
Sbjct: 962  ASPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAE 1021

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        E+    L  EMD++A   L  P+    + +      IR+ L S +  L
Sbjct: 1022 LRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRL 1081

Query: 106  NHVFCENNGSADSLA 62
            +H F E N +AD L+
Sbjct: 1082 SHTFREGNKAADYLS 1096



 Score = 22.3 bits (46), Expect(2) = 8e-18
 Identities = 11/32 (34%), Positives = 16/32 (50%)
 Frame = -2

Query: 731 VSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
           VS    A ++S   +   H R +LP  I WF+
Sbjct: 870 VSQIIWAWYVSGDYVRKGHFRVLLPLFICWFL 901


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 92.0 bits (227), Expect(2) = 2e-17
 Identities = 59/195 (30%), Positives = 84/195 (43%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
            W  RN A+       A  +IW       QL    +L +  ++GD D      F     + 
Sbjct: 1898 WLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQH 1957

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                ++ W K S G +KLN DGS   G+    GG+LRD T  LIF F + +G  N L AE
Sbjct: 1958 APPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAE 2017

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        E+    L  EMD++    L  P+    + L      IR+ L S +  L
Sbjct: 2018 LRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRL 2077

Query: 106  NHVFCENNGSADSLA 62
            +H+  E N +AD L+
Sbjct: 2078 SHILREGNQAADYLS 2092



 Score = 25.0 bits (53), Expect(2) = 2e-17
 Identities = 13/44 (29%), Positives = 21/44 (47%)
 Frame = -2

Query: 767  VYVWHVEIELSSVSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
            +Y+W+       VS    A ++S   +   H R +LP  I WF+
Sbjct: 1858 IYIWNPR----HVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFL 1897


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 89.4 bits (220), Expect(2) = 1e-16
 Identities = 58/196 (29%), Positives = 90/196 (45%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASFVCPIVKKT 455
            W  RN A+          ++W +   + QL+   +L++  ++GD D I + +      K 
Sbjct: 649  WLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTD-IASMWGHTFQSKH 707

Query: 454  RV--MVVRWLKTSHGYFKLNTDGSVFQG---MAGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            R    ++ W K   G +KLN DGS   G    +GG+LRD T  LIF F + +G  N L A
Sbjct: 708  RAPPQIIYWRKPFTGEYKLNVDGSSRNGHLAASGGILRDHTGKLIFGFSENIGLCNSLQA 767

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E            E+   NL  EMD++A+  L   +      +      IR  L  ++  
Sbjct: 768  ELRALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYR 827

Query: 109  LNHVFCENNGSADSLA 62
            ++H+F E N +AD LA
Sbjct: 828  ISHIFREGNQAADYLA 843



 Score = 24.6 bits (52), Expect(2) = 1e-16
 Identities = 13/32 (40%), Positives = 16/32 (50%)
 Frame = -2

Query: 731 VSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
           VS    A F S   +   HIR +LP  I WF+
Sbjct: 617 VSQILWAWFFSGDYVKKGHIRSLLPIFICWFL 648


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 88.6 bits (218), Expect(2) = 4e-16
 Identities = 60/196 (30%), Positives = 87/196 (44%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASFVCPIVKKT 455
            W  RN A++         I+W +   + QL    +L +  ++GD D I A +      K 
Sbjct: 1985 WLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTD-IAAMWQYNFQLKL 2043

Query: 454  RV--MVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            R    +V W K S G +KLN DGS   G    +GG+LRD T  LIF F + +G  N L A
Sbjct: 2044 RAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGVLRDHTGKLIFGFSENIGTCNSLQA 2103

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E            E+    L  EMD++A   L   +      +      IR  L S++  
Sbjct: 2104 ELRALLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYR 2163

Query: 109  LNHVFCENNGSADSLA 62
            ++H+  E N  AD L+
Sbjct: 2164 ISHIHREGNQVADFLS 2179



 Score = 23.9 bits (50), Expect(2) = 4e-16
 Identities = 13/32 (40%), Positives = 16/32 (50%)
 Frame = -2

Query: 731  VSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
            VS    A F S   +   HIR +LP  I WF+
Sbjct: 1953 VSHILWAWFYSGDYVKRGHIRTLLPIFICWFL 1984


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
            gi|508710337|gb|EOY02234.1| Uncharacterized protein
            TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 88.2 bits (217), Expect(2) = 5e-16
 Identities = 59/196 (30%), Positives = 88/196 (44%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASFVCPIVKKT 455
            W  RN A+          ++W +   + QL    +L +  ++GD D I A +   +  K 
Sbjct: 697  WLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTD-IAAMWKYNLQLKL 755

Query: 454  RV--MVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            R    +V W K S G +KLN DGS   G    +GG+LRD T  LIF F + +G  N L A
Sbjct: 756  RAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAASGGVLRDHTGKLIFGFSENIGNCNSLQA 815

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E            E+    L  EMD++A+  L   +      +      IR  L S++  
Sbjct: 816  ELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYR 875

Query: 109  LNHVFCENNGSADSLA 62
            ++H+  E N  AD L+
Sbjct: 876  ISHILREGNQVADFLS 891



 Score = 23.9 bits (50), Expect(2) = 5e-16
 Identities = 13/32 (40%), Positives = 16/32 (50%)
 Frame = -2

Query: 731 VSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
           VS    A F S   +   HIR +LP  I WF+
Sbjct: 665 VSHILWAWFYSGDYVKRGHIRTLLPIFICWFL 696


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 89.0 bits (219), Expect(2) = 7e-16
 Identities = 57/214 (26%), Positives = 92/214 (42%), Gaps = 4/214 (1%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
            W  RN A+          +IW +   + QL +  +L +  ++GD D      F  P    
Sbjct: 1721 WLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYC 1780

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
            T   ++ W+K   G +KLN DGS    +    GG+LRD T  L FAF + +G +  L AE
Sbjct: 1781 TSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAE 1840

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        E+  +NL  EMD++    +   +      +      IRL L S +  +
Sbjct: 1841 LHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRI 1900

Query: 106  NHVFCENNGSADSLAVLHLNSDHILSFSAGSSDI 5
            +H++ E N +AD L+        +  FS    ++
Sbjct: 1901 SHIYREGNQAADFLSNKGQTHQSLCVFSEAQGEL 1934



 Score = 22.7 bits (47), Expect(2) = 7e-16
 Identities = 7/14 (50%), Positives = 10/14 (71%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P  I WF+
Sbjct: 1707 HIRILIPLFICWFL 1720


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
            gi|508787491|gb|EOY34747.1| Uncharacterized protein
            TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 88.2 bits (217), Expect(2) = 1e-15
 Identities = 51/195 (26%), Positives = 86/195 (44%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
            W  RN A+       +  ++W++   + QL    +L +  ++GD D      F  P+  +
Sbjct: 784  WLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIR 843

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGMA---GGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                ++ W+K   G +KLN DGS     +   GGLLRD T  L+F F + +G  N L AE
Sbjct: 844  ESPQIIHWVKPVTGEYKLNVDGSSRHNQSAATGGLLRDHTGTLVFGFSENIGPSNSLQAE 903

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        ++    L  EMD++ +  +   +      +      IR  L   +  +
Sbjct: 904  LRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFRI 963

Query: 106  NHVFCENNGSADSLA 62
            +H+F E N +AD L+
Sbjct: 964  SHIFREGNQAADFLS 978



 Score = 22.7 bits (47), Expect(2) = 1e-15
 Identities = 7/14 (50%), Positives = 10/14 (71%)
 Frame = -2

Query: 677 HIRCVLPAIIVWFI 636
           HIR ++P  I WF+
Sbjct: 770 HIRTLIPLFICWFL 783


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 87.4 bits (215), Expect(2) = 2e-15
 Identities = 58/215 (26%), Positives = 92/215 (42%), Gaps = 5/215 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD--CIFASFVCPIVK 461
            W  RN A+       +  ++W++   + QL    VL    ++GDMD   ++     P ++
Sbjct: 1304 WLERNDAKQRHLGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQ 1363

Query: 460  KTRVMVVRWLKTSHGYFKLNTDGSVFQGMA---GGLLRDSTDVLIFAFDKEVGEVNVLTA 290
             T   +  W+K   G  KLN DGS  Q  +   GGLLRD T  L+F F + +G  N L A
Sbjct: 1364 ATP-QIFHWVKLVSGEHKLNVDGSSRQNQSAAIGGLLRDHTGTLVFGFSENIGPSNSLQA 1422

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E            E+    L  EMD++    +   +      +      IR  L   +  
Sbjct: 1423 ELRALLRGLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQYLLASIRKCLSFFSFR 1482

Query: 109  LNHVFCENNGSADSLAVLHLNSDHILSFSAGSSDI 5
            ++H+F E N  AD L+       ++L FS    ++
Sbjct: 1483 ISHIFREGNQVADFLSNKGHTQQNLLVFSEAEGEL 1517



 Score = 23.1 bits (48), Expect(2) = 2e-15
 Identities = 11/33 (33%), Positives = 17/33 (51%)
 Frame = -2

Query: 734  SVSAQFMA*FLSVLNLNLNHIRCVLPAIIVWFI 636
            +VS    A + S   +   HIR ++P  I WF+
Sbjct: 1271 NVSQILWAWYFSGDYVRKGHIRTLIPLFICWFL 1303


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 85.1 bits (209), Expect(2) = 1e-14
 Identities = 54/195 (27%), Positives = 85/195 (43%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
            W  RN A+          +IW +   + QL +  +L +  ++GD D      F  P    
Sbjct: 1478 WLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYC 1537

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                ++ W+K   G +KLN DGS         GG+LRD T  L FAF + +G +  L AE
Sbjct: 1538 QSPQIISWIKPFIGEYKLNVDGSSKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAE 1597

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        E+  +NL  EMD++    +   +      +      IRL L S +  +
Sbjct: 1598 LHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRI 1657

Query: 106  NHVFCENNGSADSLA 62
            +H++ E N +AD L+
Sbjct: 1658 SHIYREGNQAADFLS 1672



 Score = 22.7 bits (47), Expect(2) = 1e-14
 Identities = 7/14 (50%), Positives = 10/14 (71%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P  I WF+
Sbjct: 1464 HIRILIPLFICWFL 1477



 Score = 72.8 bits (177), Expect(2) = 3e-11
 Identities = 52/196 (26%), Positives = 79/196 (40%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFA-SFVCPIVKK 458
            W  RN A+          I+W++   I QL   K L +  ++GD         +   V  
Sbjct: 3272 WVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAP 3331

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQGM----AGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            +   ++ W K S G FKLN DGS    +     GGLLRD T  +IF F +  G  + L A
Sbjct: 3332 SPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQA 3391

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E           ++   + L  EMD+     +                 I   L  ++  
Sbjct: 3392 ELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFR 3451

Query: 109  LNHVFCENNGSADSLA 62
            ++H+F E N +AD L+
Sbjct: 3452 ISHIFREGNQAADHLS 3467



 Score = 23.1 bits (48), Expect(2) = 3e-11
 Identities = 7/14 (50%), Positives = 11/14 (78%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P  I+WF+
Sbjct: 3258 HIRTLVPLFILWFL 3271


>ref|XP_007040946.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
            gi|508778191|gb|EOY25447.1| Uncharacterized protein
            TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 87.4 bits (215), Expect(2) = 1e-14
 Identities = 58/196 (29%), Positives = 89/196 (45%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASFVCPIVKKT 455
            W  RN A+          ++W + + + QL    +L +  ++GD D I A +      K 
Sbjct: 901  WLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTD-IAAMWRYNFQLKQ 959

Query: 454  RV--MVVRWLKTSHGYFKLNTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            R    +V W K   G +KLN DGS   G    +GG+LRD T  LIF F + +G  N L A
Sbjct: 960  RAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAASGGVLRDHTSKLIFCFSENIGTYNSLQA 1019

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E            E+    L  EMD++A+  L   +      +      I+  L S++  
Sbjct: 1020 ELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLESIKKCLNSISYR 1079

Query: 109  LNHVFCENNGSADSLA 62
            ++H+F E N +AD L+
Sbjct: 1080 ISHIFREGNQAADFLS 1095



 Score = 20.4 bits (41), Expect(2) = 1e-14
 Identities = 7/13 (53%), Positives = 9/13 (69%)
 Frame = -2

Query: 674 IRCVLPAIIVWFI 636
           IR +LP  I WF+
Sbjct: 888 IRTLLPIFICWFL 900


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 80.9 bits (198), Expect(2) = 1e-13
 Identities = 53/197 (26%), Positives = 84/197 (42%), Gaps = 6/197 (3%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFASF---VCPIV 464
            W  RN A+       +  ++W++   + QL    +L    ++GD D  FA+      P  
Sbjct: 1724 WLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKD--FATMWGLFSPPK 1781

Query: 463  KKTRVMVVRWLKTSHGYFKLNTDGSVFQGMA---GGLLRDSTDVLIFAFDKEVGEVNVLT 293
             +    ++ W+K   G  KLN DGS  Q      GG+LRD T  L+F F + +G  N L 
Sbjct: 1782 TRAAPQILHWVKPVPGEHKLNVDGSSRQNQTAAIGGVLRDHTGTLVFDFSENIGPSNSLQ 1841

Query: 292  AESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAA 113
            AE            E+    L  EMD++    +   +      +      IR  L   + 
Sbjct: 1842 AELRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSF 1901

Query: 112  NLNHVFCENNGSADSLA 62
             ++H+F E N +AD L+
Sbjct: 1902 RISHIFREGNQAADFLS 1918



 Score = 23.1 bits (48), Expect(2) = 1e-13
 Identities = 9/24 (37%), Positives = 14/24 (58%)
 Frame = -2

Query: 707  FLSVLNLNLNHIRCVLPAIIVWFI 636
            +LS   +   HIR ++P  I WF+
Sbjct: 1700 YLSGDYVRKGHIRILIPLFICWFL 1723


>ref|XP_007023907.1| Ribonuclease H-like protein [Theobroma cacao]
           gi|508779273|gb|EOY26529.1| Ribonuclease H-like protein
           [Theobroma cacao]
          Length = 458

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 4/191 (2%)
 Frame = -3

Query: 634 WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKK 458
           W  RN A+          ++WE    + QL     L +  ++ D D     SF+ P    
Sbjct: 231 WLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDIAAMWSFLFPPKHG 290

Query: 457 TRVMVVRWLKTSHGYFKLNTDGS---VFQGMAGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
           T   ++ W+K   G +KLN DGS        +GGLLRD    L+F F + +G  N L AE
Sbjct: 291 TTPQIIHWVKPFTGEYKLNVDGSSRNCQSATSGGLLRDHIGKLVFGFSENIGRCNSLQAE 350

Query: 286 SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                       E+    L  EMD++ +  +          +      IR  L S++  +
Sbjct: 351 LRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYLLTSIRKGLSSISYRI 410

Query: 106 NHVFCENNGSA 74
            H+F E N +A
Sbjct: 411 LHIFREGNQAA 421


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 56/201 (27%), Positives = 89/201 (44%), Gaps = 5/201 (2%)
 Frame = -3

Query: 649  LFGSFWKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVC 473
            +F   W  RN A+          IIW +   + +L    +L +  ++GD+D  I   F  
Sbjct: 1100 IFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNF 1159

Query: 472  PIVKKTRVMVVRWLKTSHGYFKLNTDGSV---FQGMAGG-LLRDSTDVLIFAFDKEVGEV 305
               ++ R  ++ W+K   G  KLN DGS    FQ  AGG +LRD T  LIF F +  G  
Sbjct: 1160 AQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQ 1219

Query: 304  NVLTAESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALI 125
            N L AE           +E   S +  E+D+  +  +   +    + +      IR  L 
Sbjct: 1220 NSLQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQ 1279

Query: 124  SLAANLNHVFCENNGSADSLA 62
             ++  ++H+  E N +AD L+
Sbjct: 1280 VISVRISHIHREGNQAADFLS 1300


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 73.2 bits (178), Expect(2) = 2e-11
 Identities = 54/195 (27%), Positives = 77/195 (39%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFA-SFVCPIVKK 458
            W  RN A+          ++W V   I QL   + L +  ++GD         +      
Sbjct: 2021 WVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESL 2080

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQG---MAGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                V  W K S G FKLN DGS  Q      GG+LRD    ++F F + +G  N L AE
Sbjct: 2081 APPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQNSLQAE 2140

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        +     L  EMD+I++  L   N      +      +R  L   +   
Sbjct: 2141 LLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRF 2200

Query: 106  NHVFCENNGSADSLA 62
            +H+F E N +AD LA
Sbjct: 2201 SHIFREGNQAADFLA 2215



 Score = 23.1 bits (48), Expect(2) = 2e-11
 Identities = 7/14 (50%), Positives = 11/14 (78%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P  I+WF+
Sbjct: 2007 HIRTLVPLFILWFL 2020


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 73.2 bits (178), Expect(2) = 3e-11
 Identities = 53/195 (27%), Positives = 77/195 (39%), Gaps = 4/195 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFA-SFVCPIVKK 458
            W  RN A+          ++W V   I QL   + L +  ++GD         +      
Sbjct: 680  WVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESL 739

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSV---FQGMAGGLLRDSTDVLIFAFDKEVGEVNVLTAE 287
                V  W K + G FKLN DGS         GG+LRD   V++F F + +G  N L AE
Sbjct: 740  APPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGILRDHAGVMVFGFSENLGIQNSLQAE 799

Query: 286  SXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAANL 107
                        +     L  EMD+I++  L   N      +      +R  L   +   
Sbjct: 800  LLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRF 859

Query: 106  NHVFCENNGSADSLA 62
            +H+F E N +AD LA
Sbjct: 860  SHIFREGNQAADFLA 874



 Score = 23.1 bits (48), Expect(2) = 3e-11
 Identities = 7/14 (50%), Positives = 11/14 (78%)
 Frame = -2

Query: 677 HIRCVLPAIIVWFI 636
           HIR ++P  I+WF+
Sbjct: 666 HIRTLVPLFILWFL 679


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 72.8 bits (177), Expect(2) = 1e-10
 Identities = 58/200 (29%), Positives = 80/200 (40%), Gaps = 9/200 (4%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCI------FASFVC 473
            W  RN A+          I+W +   I QL   + L +  ++GD          F +   
Sbjct: 2019 WVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESL 2078

Query: 472  PIVKKTRVMVVRWLKTSHGYFKLNTDGSVF---QGMAGGLLRDSTDVLIFAFDKEVGEVN 302
            P  K     V  W K S G FKLN DGS         GG+LRD   V++F F + +G  N
Sbjct: 2079 PPPK-----VFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRDHAGVMVFGFSENLGIQN 2133

Query: 301  VLTAESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALIS 122
             L AE            +     L  EMD+ ++  L   N      +      IR  L  
Sbjct: 2134 SLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSH 2193

Query: 121  LAANLNHVFCENNGSADSLA 62
             +  L+H+F E N +AD LA
Sbjct: 2194 FSFRLSHIFREGNQAADFLA 2213



 Score = 21.2 bits (43), Expect(2) = 1e-10
 Identities = 6/14 (42%), Positives = 10/14 (71%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P   +WF+
Sbjct: 2005 HIRTLVPIFTLWFL 2018


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 70.9 bits (172), Expect(2) = 4e-10
 Identities = 51/196 (26%), Positives = 77/196 (39%), Gaps = 5/196 (2%)
 Frame = -3

Query: 634  WKNRNKARFNG*SFAATSIIWEVQSFIFQLVSAKVLNEAHFRGDMDCIFA-SFVCPIVKK 458
            W  RN A+          ++W++   + QL   K L +  ++GD         +      
Sbjct: 1984 WVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAP 2043

Query: 457  TRVMVVRWLKTSHGYFKLNTDGSVFQG----MAGGLLRDSTDVLIFAFDKEVGEVNVLTA 290
            +   ++ WLK S G  KLN DGS          GGLLRD T  +IF F +  G  + L A
Sbjct: 2044 SPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQA 2103

Query: 289  ESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAAN 110
            E           +E   S L  EMD+     +                 I   L  ++  
Sbjct: 2104 ELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFR 2163

Query: 109  LNHVFCENNGSADSLA 62
            ++H+F E N +AD L+
Sbjct: 2164 ISHIFREGNQAADHLS 2179



 Score = 21.2 bits (43), Expect(2) = 4e-10
 Identities = 6/14 (42%), Positives = 10/14 (71%)
 Frame = -2

Query: 677  HIRCVLPAIIVWFI 636
            HIR ++P   +WF+
Sbjct: 1970 HIRTLVPLFTLWFL 1983


>ref|XP_007023840.1| Uncharacterized protein TCM_028138 [Theobroma cacao]
            gi|508779206|gb|EOY26462.1| Uncharacterized protein
            TCM_028138 [Theobroma cacao]
          Length = 861

 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 42/137 (30%), Positives = 61/137 (44%), Gaps = 4/137 (2%)
 Frame = -3

Query: 580  IIWEVQSFIFQLVSAKVLNEAHFRGDMD-CIFASFVCPIVKKTRVMVVRWLKTSHGYFKL 404
            +IW +     QL    +L +  ++GD D         P  +     ++ W K S G +KL
Sbjct: 665  VIWRIMKLCRQLYDGSLLQQWQWKGDTDIAAMLGLSFPPKQHAPPQIIYWKKPSIGEYKL 724

Query: 403  NTDGSVFQGM---AGGLLRDSTDVLIFAFDKEVGEVNVLTAESXXXXXXXXXXLEKGFSN 233
            N DGS   G+   +GG+LRD T  LIF F + +G  N L AE            E+    
Sbjct: 725  NVDGSSRNGLHAASGGVLRDHTGKLIFGFSENIGPCNSLQAELHALLRGFLLCKERHIEK 784

Query: 232  LHAEMDSIALAHLFTPN 182
            L  EMD++    L  P+
Sbjct: 785  LWIEMDALVAIQLIQPS 801


>gb|AAT38805.1| hypothetical protein SDM1_47t00008 [Solanum demissum]
          Length = 1155

 Score = 68.6 bits (166), Expect = 5e-09
 Identities = 45/144 (31%), Positives = 70/144 (48%), Gaps = 2/144 (1%)
 Frame = -3

Query: 466  VKKTRVMVVRWLKTSHGYFKLNTDGSVFQGMAGG--LLRDSTDVLIFAFDKEVGEVNVLT 293
            +    V+ V W+K    + KLNTDGS   G  GG  +LR++   +I AF  ++GE     
Sbjct: 982  INHKNVIRVNWIKPPTMFAKLNTDGSCVNGRCGGGGILRNALGQVIMAFTIKLGEGTSSW 1041

Query: 292  AESXXXXXXXXXXLEKGFSNLHAEMDSIALAHLFTPNALAKWPLCITFRQIRLALISLAA 113
            AE+          +++G + +  E DSI LA   T N    W + I  ++I+  +     
Sbjct: 1042 AEAMSMLHGMQLCIQRGVNMIIGETDSILLAKAITENWSIPWRMYIPVKKIQKMVEEHGF 1101

Query: 112  NLNHVFCENNGSADSLAVLHLNSD 41
             +NH   E N  AD LA + L++D
Sbjct: 1102 IINHCLREANQPADKLASISLSTD 1125


Top