BLASTX nr result

ID: Rehmannia22_contig00021327 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00021327
         (1183 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   164   3e-51
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   164   3e-51
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   160   2e-50
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   166   3e-50
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   165   6e-50
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   163   1e-49
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   159   2e-49
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   165   3e-49
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   154   4e-49
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   158   9e-49
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   156   3e-48
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   148   6e-48
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   155   6e-48
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   156   8e-47
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   152   3e-46
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   151   9e-36
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   109   1e-31
gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]          140   1e-30
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   137   7e-30
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   111   5e-29

>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  164 bits (414), Expect(2) = 3e-51
 Identities = 88/247 (35%), Positives = 138/247 (55%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  ILWF+W+ERN  KH + G    R++ ++   I+ L   K  QK  W+G   
Sbjct: 3258 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQ 3317

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +GI  +         + W+KP +G++KLN+DGS          GG+LRD  G++I 
Sbjct: 3318 IAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIF 3377

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++ F    DS  AE+ ALH+ L L    +  ++WIE D++V +Q+I+    G    ++
Sbjct: 3378 GFSENF-GSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRY 3436

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL+ I + + G  ++ISHIFREGN+ AD L+  G + QN           Q+RG+ R+D+
Sbjct: 3437 LLASIHRCLSGISFRISHIFREGNQAADHLSNQGYTHQNLQ--VISQAEGQLRGILRLDK 3494

Query: 1046 LNLPSFR 1066
            +NL   R
Sbjct: 3495 INLAYVR 3501



 Score = 66.6 bits (161), Expect(2) = 3e-51
 Identities = 32/94 (34%), Positives = 55/94 (58%)
 Frame = +3

Query: 15   NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
            NP +  +    +  + S F+WRL+ + +PV+ K+  +GF LA +C CC+  S ESL+H+ 
Sbjct: 3156 NPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCK--SEESLMHVM 3213

Query: 195  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
              N  +++VW +FA +    + +   IN  IS+W
Sbjct: 3214 WDNPVANQVWSYFAKVFQIHIINPCTINHIISAW 3247



 Score =  150 bits (380), Expect(2) = 2e-46
 Identities = 85/230 (36%), Positives = 125/230 (54%), Gaps = 1/230 (0%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP  I WF+WLERN  KH   G    RVI R+   +  L +  L ++  WKG  +
Sbjct: 1464 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTD 1523

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +G  +     Q    + W KP +G+YKLN+DGS     +  G GGVLRD  G +  
Sbjct: 1524 IATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAF 1582

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
            AF++     P S  AE+ AL + L L +  +   +WIE D+ V +Q++  +  G    ++
Sbjct: 1583 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1641

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNF-VQFFADDFP 1012
            LL  IR  +  F ++ISHI+REGN+ AD L+  G + Q+  V   A +FP
Sbjct: 1642 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVVSEAQEFP 1691



 Score = 63.5 bits (153), Expect(2) = 2e-46
 Identities = 29/80 (36%), Positives = 47/80 (58%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            S+S F+WR++ N +PV+ ++ D+G  LA KC CC   S ESL+H+   N  + +VW  FA
Sbjct: 1376 SISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCR--SEESLIHVLWENPVAKQVWNFFA 1433

Query: 237  NMLHFSLPDTENINTFISSW 296
                  +   ++I+  I +W
Sbjct: 1434 KSFQIYVSKPKHISQIIWAW 1453


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  164 bits (414), Expect(2) = 3e-51
 Identities = 89/244 (36%), Positives = 140/244 (57%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP  I WF+WLERN  KH   G  + RV+ ++   +  L+   L +K  WKG  +
Sbjct: 770  HIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTD 829

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +AA +G    + + +    +HW KP  G+YKLN+DGS     S    GG+LRD  G ++ 
Sbjct: 830  IAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQS-AATGGLLRDHTGTLVF 888

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL + L L +  +  K+WIE D+ V++Q+I  +  G    ++
Sbjct: 889  GFSEN-IGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRY 947

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL+ IRK +  F ++ISHIFREGN+ AD L+  G + QN       +   ++ G+ ++D+
Sbjct: 948  LLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQNLQ--VISEAQGKLHGMLKLDR 1005

Query: 1046 LNLP 1057
            LNLP
Sbjct: 1006 LNLP 1009



 Score = 66.6 bits (161), Expect(2) = 3e-51
 Identities = 28/80 (35%), Positives = 51/80 (63%)
 Frame = +3

Query: 57  SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
           ++S F+WR++ N +PV+ +L ++GF LA KC CC   S ESL+H+   N  + +VW  FA
Sbjct: 682 TISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCN--SEESLIHVLWDNPVAKQVWNFFA 739

Query: 237 NMLHFSLPDTENINTFISSW 296
           +    ++ + ++++  I +W
Sbjct: 740 DFFQINISNPQHVSQIIWAW 759


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  160 bits (404), Expect(2) = 2e-50
 Identities = 90/249 (36%), Positives = 139/249 (55%), Gaps = 2/249 (0%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P   LWF+W+ERN  KH + G    RV+ ++   ++ L   K  QK  W+G   
Sbjct: 1970 HIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQ 2029

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS--KNPISSCGGIGGVLRDWQGNV 679
            +A  +GI  +         + W KP +G+ KLN+DGS   NP S+ G  GG+LRD  G++
Sbjct: 2030 IAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAG--GGLLRDHTGSM 2087

Query: 680  ILAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSY 859
            I  F++ F    DS  AE+ ALH+ L L    +  ++WIE D++V +Q+I     G    
Sbjct: 2088 IFGFSENF-GPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRT 2146

Query: 860  QHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARV 1039
            ++LL+ I + + G  ++ISHIFREGN+ AD L+  G + QN           Q+RG+ R+
Sbjct: 2147 RYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHTHQNLQ--VISQAEGQLRGILRL 2204

Query: 1040 DQLNLPSFR 1066
            +++NL   R
Sbjct: 2205 EKINLAYVR 2213



 Score = 67.8 bits (164), Expect(2) = 2e-50
 Identities = 32/94 (34%), Positives = 55/94 (58%)
 Frame = +3

Query: 15   NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
            NP+F  +    +  + S F+WRL+ + +PV+ K+  +GF LA +C CC+  S ESL+H+ 
Sbjct: 1868 NPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCK--SEESLMHVM 1925

Query: 195  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
              N  +++VW +FA +    + +   IN  I +W
Sbjct: 1926 WKNPVANQVWSYFAKVFQIQIINPCTINQIICAW 1959


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  166 bits (420), Expect(2) = 3e-50
 Identities = 95/250 (38%), Positives = 134/250 (53%), Gaps = 1/250 (0%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            H   ++P  I WF+WLERN  KH   G    RVI R   H   L    L Q+  WKG  +
Sbjct: 888  HFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 947

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682
            +AA  G  F          ++W KP +G+YKLN+DGS +N + +    GGVLRD  G +I
Sbjct: 948  IAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 1005

Query: 683  LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862
              F++    C +S  AE+ AL + L L +     K+WIE D+   +QLI  +  G +  +
Sbjct: 1006 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIR 1064

Query: 863  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042
            +LL  IR  +  F +++SH FREGNK AD L+  G   QN   F   +   Q+ G+ ++D
Sbjct: 1065 YLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 1122

Query: 1043 QLNLPSFRTR 1072
            +LNLP  R R
Sbjct: 1123 RLNLPYVRFR 1132



 Score = 60.8 bits (146), Expect(2) = 3e-50
 Identities = 26/80 (32%), Positives = 47/80 (58%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            S+S F+W+ + N +PV+ ++ ++G  LA KC CC   S ESL+H+   N  + +VW  FA
Sbjct: 800  SISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCN--SEESLIHVLWENPVAKQVWNFFA 857

Query: 237  NMLHFSLPDTENINTFISSW 296
             +    + +  +++  I +W
Sbjct: 858  KLFQIYILNPRHVSQIIWAW 877


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  165 bits (418), Expect(2) = 6e-50
 Identities = 94/248 (37%), Positives = 135/248 (54%), Gaps = 1/248 (0%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            H   ++P  I WF+WLERN  KH   G  A RVI R   H   L    L Q+  WKG  +
Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 1943

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682
            +A   G  F          ++W KP +G+YKLN+DGS +N + +    GGVLRD  G +I
Sbjct: 1944 IATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 2001

Query: 683  LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862
              F++    C +S  AE+ AL + L L +     K+WIE D+ V +QLI  +  G ++ +
Sbjct: 2002 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLR 2060

Query: 863  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042
            +LL  IR  +  F +++SHI REGN+ AD L+  G   QN   F   +   Q+ G+ ++D
Sbjct: 2061 YLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 2118

Query: 1043 QLNLPSFR 1066
            +LNLP  R
Sbjct: 2119 RLNLPYVR 2126



 Score = 60.5 bits (145), Expect(2) = 6e-50
 Identities = 26/80 (32%), Positives = 47/80 (58%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            S+S F+W+ + N +PV+ ++ ++G  LA KC CC   S ESL+H+   N  + +VW  FA
Sbjct: 1796 SISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCN--SEESLIHVLWENPVAKQVWNFFA 1853

Query: 237  NMLHFSLPDTENINTFISSW 296
             +    + +  +++  I +W
Sbjct: 1854 QLFQIYIWNPRHVSQIIWAW 1873


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  163 bits (412), Expect(2) = 1e-49
 Identities = 91/247 (36%), Positives = 136/247 (55%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  I WF+WLERN  KH   G    RV+ R+   +  L    L Q+  WKG  +
Sbjct: 683  HIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTD 742

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +AA +    ++ +      V+W KP  G+YKLN+DGS          GGVLRD  G +I 
Sbjct: 743  IAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 801

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++   +C +S  AE+ AL + L L +     ++WIE D+  ++QLI H+  G    ++
Sbjct: 802  GFSENIGNC-NSLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRY 860

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IRK +    ++ISHI REGN+VAD L+  G + QN   F   +   ++ G+ ++D+
Sbjct: 861  LLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVF--TEAQGKLHGMLKLDR 918

Query: 1046 LNLPSFR 1066
            LNLP  R
Sbjct: 919  LNLPYVR 925



 Score = 62.0 bits (149), Expect(2) = 1e-49
 Identities = 27/80 (33%), Positives = 49/80 (61%)
 Frame = +3

Query: 57  SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
           S+S F+WR + N +PV+ ++ ++G  LA KC CC   S ESL+H+   N+ + +VW  FA
Sbjct: 595 SISFFIWRALNNWIPVELRMKEKGIHLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFA 652

Query: 237 NMLHFSLPDTENINTFISSW 296
           N     + + ++++  + +W
Sbjct: 653 NFFQIYIFNPQHVSHILWAW 672


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  159 bits (402), Expect(2) = 2e-49
 Identities = 90/249 (36%), Positives = 136/249 (54%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP  I WF+WLERN  KH   G    RVI R+   +  L +  L ++  WKG  +
Sbjct: 1707 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTD 1766

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +G  F          ++W KP +G+YKLN+DGS     +  G GGVLRD  G +  
Sbjct: 1767 IATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG-GGVLRDHTGKLAF 1825

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
            AF++     P S  AE+ AL + L L +  +   +WIE D+ V +Q++  +  G    ++
Sbjct: 1826 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1884

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IR  +  F ++ISHI+REGN+ AD L+  G + Q+   F   +   ++ G+ ++D+
Sbjct: 1885 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF--SEAQGELIGILKLDK 1942

Query: 1046 LNLPSFRTR 1072
            LNLP  R R
Sbjct: 1943 LNLPYVRFR 1951



 Score = 65.1 bits (157), Expect(2) = 2e-49
 Identities = 31/94 (32%), Positives = 51/94 (54%)
 Frame = +3

Query: 15   NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
            N +F  +    +  S+S F+WR++ N +PV+ ++ D+G  LA KC CC   S ESL+H+ 
Sbjct: 1605 NALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCR--SEESLIHVL 1662

Query: 195  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
              N  + +VW  FA      +    +I+  I +W
Sbjct: 1663 WENPVATQVWFFFAKSFQIYVSKPNHISQIIWAW 1696


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  165 bits (418), Expect(2) = 3e-49
 Identities = 91/247 (36%), Positives = 137/247 (55%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  I WF+WLERN  K+   G +  R++ R+   +  LK   L Q+  WKG  +
Sbjct: 1971 HIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTD 2030

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +AA +   F++ +      V+W KP  G+YKLN+DGS          GGVLRD  G +I 
Sbjct: 2031 IAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 2089

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++    C +S  AE+ AL + L L +     K+WIE D+   +QL+ H+  G    ++
Sbjct: 2090 GFSENIGTC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRY 2148

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IRK +    ++ISHI REGN+VAD L+  G + QN   F   +   ++ G+ ++D+
Sbjct: 2149 LLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVF--TEAQGKLHGMLKLDR 2206

Query: 1046 LNLPSFR 1066
            LNLP  R
Sbjct: 2207 LNLPYVR 2213



 Score = 58.2 bits (139), Expect(2) = 3e-49
 Identities = 26/80 (32%), Positives = 47/80 (58%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            S+S F+WR + N +PV+ ++  +G  LA KC CC   S ESL+H+   N+ + +VW  FA
Sbjct: 1883 SISFFIWRALNNWIPVELRMKGKGIHLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFA 1940

Query: 237  NMLHFSLPDTENINTFISSW 296
                  + + ++++  + +W
Sbjct: 1941 KFFQIYVLNPKHVSHILWAW 1960


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  154 bits (390), Expect(2) = 4e-49
 Identities = 87/247 (35%), Positives = 138/247 (55%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP  I WF+WLERN  KH   G  + RV+ ++   +  L+   L +   WKG  +
Sbjct: 1710 HIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKD 1769

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
             A  +G++           +HW KP  G++KLN+DGS    +    IGGVLRD  G ++ 
Sbjct: 1770 FATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQ-NQTAAIGGVLRDHTGTLVF 1828

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL + L L +  +  K+W+E D+ V +Q+I  +  G    ++
Sbjct: 1829 DFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRY 1887

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL+ IRK +  F ++ISHIFREGN+ AD L+  G + Q+   F   +   ++ G+ ++D+
Sbjct: 1888 LLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVF--TEAQGKLYGMLKLDR 1945

Query: 1046 LNLPSFR 1066
            LNLP  R
Sbjct: 1946 LNLPYVR 1952



 Score = 68.6 bits (166), Expect(2) = 4e-49
 Identities = 31/80 (38%), Positives = 48/80 (60%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            S+S F+WR+  N +PVD +L ++GF LA KC CC   S ESL+H+   N  + +VW  FA
Sbjct: 1622 SISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCN--SEESLIHVLWDNPIAKQVWNFFA 1679

Query: 237  NMLHFSLPDTENINTFISSW 296
            N     +   +N++  + +W
Sbjct: 1680 NSFQIYISKPQNVSQILWTW 1699


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  158 bits (400), Expect(2) = 9e-49
 Identities = 87/247 (35%), Positives = 135/247 (54%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  ILWF+W+ERN  KH + G    RV+ RV   I  L   +   K  WKG   
Sbjct: 2007 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 2066

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +GI F+   +       WHKP LG++KLN+DGS     +  G GG+LRD  G ++ 
Sbjct: 2067 IAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVF 2125

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2126 GFSEN-LGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 2184

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            L+  +R+ +  F ++ SHIFREGN+ AD LA  G   QN   F       ++RG+  +DQ
Sbjct: 2185 LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLCLDQ 2242

Query: 1046 LNLPSFR 1066
             + P  R
Sbjct: 2243 TSFPYVR 2249



 Score = 63.5 bits (153), Expect(2) = 9e-49
 Identities = 30/94 (31%), Positives = 53/94 (56%)
 Frame = +3

Query: 15   NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
            NP+F  +    +  + S F+WRL+ + +PV+ K+  +G  LA +C CC+  S ES++H+ 
Sbjct: 1905 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCK--SEESIMHVM 1962

Query: 195  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
              N  + +VW +FA +    + +   IN  I +W
Sbjct: 1963 WDNPVAMQVWNYFAKLFQILIINPCTINQIIGAW 1996


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  156 bits (395), Expect(2) = 3e-48
 Identities = 86/247 (34%), Positives = 136/247 (55%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P   LWF+W+ERN  KH + G    R++ R+   I  L   +   K  WKG   
Sbjct: 2005 HIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQ 2064

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +GI F+   +       WHKP +G++KLN+DGS     +  G GGVLRD  G ++ 
Sbjct: 2065 IAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVF 2123

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2124 GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRY 2182

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IR+ +  F +++SHIFREGN+ AD LA  G   Q+           ++RG+ R+DQ
Sbjct: 2183 LLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQ--GKLRGMLRLDQ 2240

Query: 1046 LNLPSFR 1066
             +LP  R
Sbjct: 2241 TSLPYVR 2247



 Score = 63.5 bits (153), Expect(2) = 3e-48
 Identities = 29/94 (30%), Positives = 54/94 (57%)
 Frame = +3

Query: 15   NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
            NP+F  +    +  ++S F+WRL+ + +PV+ K+  +GF LA +C CC+  S ES++H+ 
Sbjct: 1903 NPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCK--SEESIMHVM 1960

Query: 195  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
              N  + +VW +F+      + +   IN  + +W
Sbjct: 1961 WDNPVATQVWNYFSKFFQILVINPCTINQILGAW 1994


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  148 bits (373), Expect(2) = 6e-48
 Identities = 83/223 (37%), Positives = 125/223 (56%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP  I WF+WLERN  K    G  + RV+ ++   +  L+   + +   WKG  +
Sbjct: 1290 HIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMD 1349

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +AA +G  F   +       HW K   G++KLN+DGS     S   IGG+LRD  G ++ 
Sbjct: 1350 IAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA-IGGLLRDHTGTLVF 1408

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL + L L +  +  K+WIE D+ V +Q+I  +  G    Q+
Sbjct: 1409 GFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQY 1467

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 994
            LL+ IRK +  F ++ISHIFREGN+VAD L+  G + QN + F
Sbjct: 1468 LLASIRKCLSFFSFRISHIFREGNQVADFLSNKGHTQQNLLVF 1510



 Score = 71.2 bits (173), Expect(2) = 6e-48
 Identities = 35/92 (38%), Positives = 54/92 (58%), Gaps = 4/92 (4%)
 Frame = +3

Query: 33   LCSHF----LTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLH 200
            LCS F    +  S+S F+WR+  N +PVD +L D+GF LA KC CC   S E+L+H+   
Sbjct: 1190 LCSLFWHKSIPLSISFFLWRVFHNWIPVDLRLKDKGFHLASKCACCN--SEETLIHVLWD 1247

Query: 201  NTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
            N  + +VW  FAN     + + +N++  + +W
Sbjct: 1248 NPVAKQVWNFFANFFQIYVSNPQNVSQILWAW 1279



 Score =  106 bits (265), Expect = 2e-20
 Identities = 58/168 (34%), Positives = 95/168 (56%)
 Frame = +2

Query: 563  VHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILAFADGFMDCPDSTYAEISA 742
            ++W +P +G++KLN+DG           GGV RD    +I  F++ F    +ST AE+ A
Sbjct: 1536 IYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPY-NSTQAELMA 1594

Query: 743  LHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRKQMEGFDWKISHI 922
            LH+ L L    +  ++WIE D++ ++Q++     G    Q+LLS I + + G  ++ISHI
Sbjct: 1595 LHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISHI 1654

Query: 923  FREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1066
             RE N+ AD L+  G + Q+   F   +   ++RG+ R+D+ NLP  R
Sbjct: 1655 HRESNQAADYLSNQGHTHQSLQVFSKAE--GELRGMIRLDKSNLPYVR 1700


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  155 bits (392), Expect(2) = 6e-48
 Identities = 86/247 (34%), Positives = 134/247 (54%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  ILWF+W+ERN  KH + G    RV+ RV   I  L   +   K  WKG   
Sbjct: 666  HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 725

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +GI  +   +       WHKP  G++KLN+DGS     +  G GG+LRD  G ++ 
Sbjct: 726  IAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVF 784

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 785  GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 843

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            L+  +R+ +  F ++ SHIFREGN+ AD LA  G   QN   F       ++RG+ R+DQ
Sbjct: 844  LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLRLDQ 901

Query: 1046 LNLPSFR 1066
             + P  R
Sbjct: 902  TSFPYVR 908



 Score = 63.9 bits (154), Expect(2) = 6e-48
 Identities = 30/94 (31%), Positives = 53/94 (56%)
 Frame = +3

Query: 15  NPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLF 194
           NP+F  +    +  + S F+WRL+ + +PV+ K+  +G  LA +C CC+  S ES++H+ 
Sbjct: 564 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCK--SEESIMHVM 621

Query: 195 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 296
             N  + +VW +FA +    + +   IN  I +W
Sbjct: 622 WDNPVAMQVWNYFAKLFQICIINPCTINQIIGAW 655


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  156 bits (394), Expect(2) = 8e-47
 Identities = 89/247 (36%), Positives = 132/247 (53%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  ++P  I WF+WLERN  KH     +  RV+ R+   +  L    L  +  WKG  +
Sbjct: 635  HIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTD 694

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A+ +G  F+         ++W KP  G+YKLN+DGS          GG+LRD  G +I 
Sbjct: 695  IASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRN-GHLAASGGILRDHTGKLIF 753

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++    C +S  AE+ AL + L L +      +WIE D+  ++QLI H+  G    ++
Sbjct: 754  GFSENIGLC-NSLQAELRALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRY 812

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IRK +    ++ISHIFREGN+ AD LA  G S QN       +   ++ G+ ++D+
Sbjct: 813  LLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLC--VITEAQGELHGMLKLDR 870

Query: 1046 LNLPSFR 1066
            LNLP  R
Sbjct: 871  LNLPYVR 877



 Score = 59.3 bits (142), Expect(2) = 8e-47
 Identities = 25/80 (31%), Positives = 47/80 (58%)
 Frame = +3

Query: 57  SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
           S+S F+WR + N +PV+ ++ ++G  LA KC CC   S ESL+H+   N+ + +VW  F 
Sbjct: 547 SISFFLWRALNNWIPVELRMKEKGIQLASKCVCCN--SEESLMHVLWGNSVAKQVWAFFG 604

Query: 237 NMLHFSLPDTENINTFISSW 296
                 + + ++++  + +W
Sbjct: 605 KFFQIYVLNPQHVSQILWAW 624


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  152 bits (384), Expect(2) = 3e-46
 Identities = 91/245 (37%), Positives = 137/245 (55%), Gaps = 1/245 (0%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +I   I WF+W+ERN  KH   G    R+I R+   +  L    L  K  WKG  +
Sbjct: 1091 HIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLD 1150

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 682
            +A  +G  F      +   ++W KP +G+ KLN+DGS K+   +  G GGVLRD  GN+I
Sbjct: 1151 IAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAG-GGVLRDHTGNLI 1209

Query: 683  LAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQ 862
              F++ F    +S  AE+ ALH+ L L    +  ++WIE D+QV++Q+I ++  G +  Q
Sbjct: 1210 FGFSENF-GYQNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQ 1268

Query: 863  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1042
            +LL  IRK ++    +ISHI REGN+ AD L+K G + QN   F   +   ++RG   V+
Sbjct: 1269 YLLESIRKCLQVISVRISHIHREGNQAADFLSKHGHTHQNLHVF--TEAQGELRGRTLVN 1326

Query: 1043 QLNLP 1057
            ++  P
Sbjct: 1327 RVEHP 1331



 Score = 61.2 bits (147), Expect(2) = 3e-46
 Identities = 27/80 (33%), Positives = 48/80 (60%)
 Frame = +3

Query: 57   SMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFA 236
            ++S F+WR + N LPV+ ++  +G  LA KC CC+  S ESL+H+   +  + +VW +F+
Sbjct: 1003 TVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCK--SEESLLHVLWESPVAQQVWNYFS 1060

Query: 237  NMLHFSLPDTENINTFISSW 296
                  + + +NI   ++SW
Sbjct: 1061 KFFQIYVHNPQNILQILNSW 1080


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  151 bits (381), Expect(2) = 9e-36
 Identities = 86/229 (37%), Positives = 125/229 (54%), Gaps = 2/229 (0%)
 Frame = +2

Query: 314  AHTAHISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWK 493
            A    I  ++P  I WF+WLERN  KH   G    RV+ R+   +  L+   L Q+  WK
Sbjct: 883  AKQGRIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWK 942

Query: 494  GFGNVAASFGIYFRISVVQKCIP--VHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDW 667
            G  ++AA +   F++   Q+  P  V+W KP  G+YKLN+DGS          GGVLRD 
Sbjct: 943  GDTDIAAMWRYNFQLK--QRAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAAS-GGVLRDH 999

Query: 668  QGNVILAFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATG 847
               +I  F++  +   +S  AE+ ALH+ L L +     K+WIE D+  ++QLI H+  G
Sbjct: 1000 TSKLIFCFSEN-IGTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKG 1058

Query: 848  KWSYQHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 994
                ++LL  I+K +    ++ISHIFREGN+ AD L+  G + QN   F
Sbjct: 1059 SHDIRYLLESIKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVF 1107



 Score = 27.3 bits (59), Expect(2) = 9e-36
 Identities = 14/39 (35%), Positives = 21/39 (53%)
 Frame = +3

Query: 90  NRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNT 206
           N L +   + ++G  L  KC CC   S ESL+H+   N+
Sbjct: 845 NTLALSFGIEEKGIHLVSKCVCCN--SEESLMHVLWGNS 881


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  109 bits (273), Expect(2) = 1e-31
 Identities = 76/242 (31%), Positives = 112/242 (46%), Gaps = 2/242 (0%)
 Frame = +2

Query: 353  ILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN--VAASFGI 526
            ILW+IW  RN+ + +S+ FS   V   V  HI    SS+L          +  +  SFG 
Sbjct: 636  ILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQA--SSRLATGHMHNTIHDLCILKSFGA 693

Query: 527  YFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILAFADGFM 706
              R   + + + V WH P +G  K+N DG+       GG G V R ++G  + AFA   +
Sbjct: 694  CCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFRYYKGQFVGAFAS-HI 752

Query: 707  DCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRK 886
            D P S  A++  +  A+ L     +  +W+E D   +L  I   +   W  +        
Sbjct: 753  DIPSSIAAKVMVVITAIELAWVRDWKHVWLEVDFSTVLDYIRSPSLVPWQLRVRWLNCLY 812

Query: 887  QMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1066
            ++    +K SHIFREGN+VAD LA  G S    V +  D  P  +      D L +P+FR
Sbjct: 813  RISTMTFKSSHIFREGNRVADALANHGTSMSEEVWW--DVPPSFILSYYERDLLGMPNFR 870

Query: 1067 TR 1072
             R
Sbjct: 871  FR 872



 Score = 55.1 bits (131), Expect(2) = 1e-31
 Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
 Frame = +3

Query: 6   SPTNPIFPPLCSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLV 185
           SP  P   PL S F+ P MS+  W++++  +     L  RG +L  +C  C + S ESL 
Sbjct: 522 SPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRRGVALVSRCEFCGN-STESLD 580

Query: 186 HLFLHNTHSHKVWMHFANMLHFSL-PDT 266
           H+FLH + +  VW HF  +    L P+T
Sbjct: 581 HIFLHCSFAASVWNHFIYIFEIGLVPNT 608


>gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]
          Length = 458

 Score =  140 bits (353), Expect = 1e-30
 Identities = 83/243 (34%), Positives = 126/243 (51%)
 Frame = +2

Query: 329  ISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGNV 508
            IS +IP  I WF+WLERN  KH   G    RV+      +  L      ++  WK   ++
Sbjct: 218  ISALIPLFICWFLWLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDI 277

Query: 509  AASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILA 688
            AA +   F          +HW KP  G+YKLN+DGS     S    GG+LRD  G ++  
Sbjct: 278  AAMWSFLFPPKHGTTPQIIHWVKPFTGEYKLNVDGSSRNCQSATS-GGLLRDHIGKLVFG 336

Query: 689  FADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHL 868
            F++    C +S  AE+ AL + L L +     ++WIE D+ V++Q+I     G    ++L
Sbjct: 337  FSENIGRC-NSLQAELRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYL 395

Query: 869  LSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQL 1048
            L+ IRK +    ++I HIFREGN+ A  L+  G + QN       +   ++ G+ ++D+L
Sbjct: 396  LTSIRKGLSSISYRILHIFREGNQAAYFLSNQGYTHQNLC--LITEAQGELHGMLKLDRL 453

Query: 1049 NLP 1057
            NLP
Sbjct: 454  NLP 456


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  137 bits (346), Expect = 7e-30
 Identities = 82/247 (33%), Positives = 128/247 (51%)
 Frame = +2

Query: 326  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 505
            HI  +IP   LWF+W+ERN  KH + G                    +   +  WKG   
Sbjct: 2143 HIRTLIPIFTLWFLWVERNDAKHRNLG--------------------QQLLEWQWKGDKQ 2182

Query: 506  VAASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 685
            +A  +GI F+   +       WHKP  G++KLN+DGS     +  G GGVLRD  G +I 
Sbjct: 2183 IAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIF 2241

Query: 686  AFADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQH 865
             F++  +   +S  AE+ AL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2242 GFSEN-LGIQNSLKAELLALYRGLILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRY 2300

Query: 866  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1045
            LL  IR+ +  F ++++HIFREGN+ AD LA  G   Q+           ++RG+ R+DQ
Sbjct: 2301 LLGSIRQLLSHFSFRLTHIFREGNQAADFLANRGHEHQSLQVITVAQ--GKLRGMLRLDQ 2358

Query: 1046 LNLPSFR 1066
             +LP  R
Sbjct: 2359 TSLPYVR 2365


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  111 bits (277), Expect(2) = 5e-29
 Identities = 77/252 (30%), Positives = 126/252 (50%), Gaps = 5/252 (1%)
 Frame = +2

Query: 338  IIPCVILWFIWLERNKNKHESKGFSAYRV---IGRVEHHIYLLKSSKLFQKSTWKGFGNV 508
            I+P  I W +W  R   K+  K  S YRV   I +    +  +    +  +++W    N+
Sbjct: 1203 ILPNFICWNLWKNRCAVKYGLKNSSIYRVQYGIFKNIMQVITIVFPSIPWQTSWNNLINI 1262

Query: 509  AASFGIYFRISVVQKCIPVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILA 688
                  +++I +V+      W+KPDLG+YKLN DGS    S   G GG+LRD QG +I A
Sbjct: 1263 VEQCKQHYKILIVK------WNKPDLGKYKLNTDGSALQNSGKIGGGGILRDNQGKIIYA 1316

Query: 689  FADGFMDCPDSTYAEISALHKALSLIEALSFHKIWIETDSQVLLQLISHNATGKWSYQHL 868
            F+  F     + +AEI A    L   E   + KI +E DS++L   I+ N    W Y+ L
Sbjct: 1317 FSLPF-GFGTNNFAEIKAALHGLDWCEQHGYKKIELEVDSKLLCNWINSNINIPWRYEEL 1375

Query: 869  LSKIRKQMEGFD-WKISHIFREGNKVADGLAKLGCSSQNFVQFFAD-DFPRQVRGLARVD 1042
            + +I + +   D ++  HI+RE N  AD L+K   + +   +F+        +RG   ++
Sbjct: 1376 IQQIHQIIRKMDQFQCHHIYREANCTADLLSKWSHNLEILQKFYTTRQLKEPIRGSYLLE 1435

Query: 1043 QLNLPSFRTRTI 1078
            ++ + +FR R +
Sbjct: 1436 KMGVQNFRRRKL 1447



 Score = 44.7 bits (104), Expect(2) = 5e-29
 Identities = 19/89 (21%), Positives = 46/89 (51%)
 Frame = +3

Query: 60   MSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSAESLVHLFLHNTHSHKVWMHFAN 239
            +S F+WR ++ +LP ++ L   G +L+  C+CC +   + + H+ ++   +  +W  +++
Sbjct: 1111 VSFFIWRALRGKLPTNENLQRIGKNLS-DCYCCYNKGKDDINHILINGNFAKYIWKIYSS 1169

Query: 240  MLHFSLPDTENINTFISSWKNFTPLHTLH 326
             +   LP    +   +  W+N    + +H
Sbjct: 1170 AVGV-LPINTTLRDLLLQWRNQQYTNEVH 1197


Top