BLASTX nr result

ID: Rehmannia25_contig00021866 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00021866
         (1200 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   164   2e-53
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   160   1e-52
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   162   2e-52
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   165   2e-51
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   162   2e-51
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   164   3e-51
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   157   6e-51
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   159   1e-50
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   154   1e-50
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   164   2e-50
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   157   4e-50
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   155   6e-50
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   147   2e-48
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   155   4e-48
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   153   3e-47
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   150   2e-35
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   110   2e-33
gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]          139   2e-30
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   138   5e-30
ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258...   109   8e-30

>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  164 bits (416), Expect(2) = 2e-53
 Identities = 88/247 (35%), Positives = 139/247 (56%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  ILWF+W+ERN  KH + G    R++ ++   I+ L   K  QK  W+G   
Sbjct: 3258 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQ 3317

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +GI  +         + W+KP +G++KLN+DGS          GG+LRD  G++I 
Sbjct: 3318 IAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIF 3377

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++ F    DS  AE++ALH+ L L    +  ++WIE D++V +Q+I+    G    ++
Sbjct: 3378 GFSENF-GSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRY 3436

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL+ I + + G  ++ISHIFREGN+ AD L+  G + QN           Q+RG+ R+D+
Sbjct: 3437 LLASIHRCLSGISFRISHIFREGNQAADHLSNQGYTHQNLQ--VISQAEGQLRGILRLDK 3494

Query: 1052 LNLPSFR 1072
            +NL   R
Sbjct: 3495 INLAYVR 3501



 Score = 73.2 bits (178), Expect(2) = 2e-53
 Identities = 33/100 (33%), Positives = 59/100 (59%)
 Frame = +3

Query: 3    KDTSPTNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVE 182
            ++    NP ++ +W   +  + S F+WRL+ + +PV+ K+  +GF LA +C CC+S   E
Sbjct: 3150 RERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSE--E 3207

Query: 183  SLVHLFLHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
            SL+H+   N  +++VW +FA +    + +   IN  IS+W
Sbjct: 3208 SLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISAW 3247



 Score =  149 bits (375), Expect(2) = 5e-47
 Identities = 84/230 (36%), Positives = 124/230 (53%), Gaps = 1/230 (0%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP  I WF+WLERN  KH   G    RVI R+   +  L +  L ++  WKG  +
Sbjct: 1464 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTD 1523

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +G  +     Q    + W KP +G+YKLN+DGS     +  G GGVLRD  G +  
Sbjct: 1524 IATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAF 1582

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++     P S  AE+ AL + L L +  +   +WIE D+ V +Q++  +  G    ++
Sbjct: 1583 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1641

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNF-VQFFADDFP 1018
            LL  IR  +  F ++ISHI+REGN+ AD L+  G + Q+  V   A +FP
Sbjct: 1642 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVVSEAQEFP 1691



 Score = 67.4 bits (163), Expect(2) = 5e-47
 Identities = 32/94 (34%), Positives = 52/94 (55%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            N + S  W   +  S+S F+WR++ N +PV+ ++ D+G  LA KC CC S   ESL+H+ 
Sbjct: 1362 NALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE--ESLIHVL 1419

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW  FA      +   ++I+  I +W
Sbjct: 1420 WENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAW 1453


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  160 bits (406), Expect(2) = 1e-52
 Identities = 90/249 (36%), Positives = 140/249 (56%), Gaps = 2/249 (0%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P   LWF+W+ERN  KH + G    RV+ ++   ++ L   K  QK  W+G   
Sbjct: 1970 HIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQ 2029

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGS--KNPISSCGGIGGVLRDWQGNV 685
            +A  +GI  +         + W KP +G+ KLN+DGS   NP S+ G  GG+LRD  G++
Sbjct: 2030 IAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAAG--GGLLRDHTGSM 2087

Query: 686  ILVFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSY 865
            I  F++ F    DS  AE++ALH+ L L    +  ++WIE D++V +Q+I     G    
Sbjct: 2088 IFGFSENF-GPQDSLQAELMALHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRT 2146

Query: 866  QHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARV 1045
            ++LL+ I + + G  ++ISHIFREGN+ AD L+  G + QN           Q+RG+ R+
Sbjct: 2147 RYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHTHQNLQ--VISQAEGQLRGILRL 2204

Query: 1046 DQLNLPSFR 1072
            +++NL   R
Sbjct: 2205 EKINLAYVR 2213



 Score = 73.9 bits (180), Expect(2) = 1e-52
 Identities = 33/94 (35%), Positives = 57/94 (60%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            NP+F+ +W   +  + S F+WRL+ + +PV+ K+  +GF LA +C CC+S   ESL+H+ 
Sbjct: 1868 NPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSE--ESLMHVM 1925

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  +++VW +FA +    + +   IN  I +W
Sbjct: 1926 WKNPVANQVWSYFAKVFQIQIINPCTINQIICAW 1959


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  162 bits (411), Expect(2) = 2e-52
 Identities = 89/244 (36%), Positives = 140/244 (57%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP  I WF+WLERN  KH   G  + RV+ ++   +  L+   L +K  WKG  +
Sbjct: 770  HIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTD 829

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +AA +G    + + +    +HW KP  G+YKLN+DGS     S    GG+LRD  G ++ 
Sbjct: 830  IAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQS-AATGGLLRDHTGTLVF 888

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+ AL + L L +  +  K+WIE D+ V++Q+I  +  G    ++
Sbjct: 889  GFSEN-IGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRY 947

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL+ IRK +  F ++ISHIFREGN+ AD L+  G + QN       +   ++ G+ ++D+
Sbjct: 948  LLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHTHQNLQ--VISEAQGKLHGMLKLDR 1005

Query: 1052 LNLP 1063
            LNLP
Sbjct: 1006 LNLP 1009



 Score = 71.2 bits (173), Expect(2) = 2e-52
 Identities = 31/94 (32%), Positives = 57/94 (60%)
 Frame = +3

Query: 21  NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
           N + S +W   +  ++S F+WR++ N +PV+ +L ++GF LA KC CC S   ESL+H+ 
Sbjct: 668 NTLCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCNSE--ESLIHVL 725

Query: 201 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
             N  + +VW  FA+    ++ + ++++  I +W
Sbjct: 726 WDNPVAKQVWNFFADFFQINISNPQHVSQIIWAW 759


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  165 bits (417), Expect(2) = 2e-51
 Identities = 95/250 (38%), Positives = 134/250 (53%), Gaps = 1/250 (0%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            H   ++P  I WF+WLERN  KH   G    RVI R   H   L    L Q+  WKG  +
Sbjct: 888  HFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 947

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 688
            +AA  G  F          ++W KP +G+YKLN+DGS +N + +    GGVLRD  G +I
Sbjct: 948  IAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 1005

Query: 689  LVFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQ 868
              F++    C +S  AE+ AL + L L +     K+WIE D+   +QLI  +  G +  +
Sbjct: 1006 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIR 1064

Query: 869  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1048
            +LL  IR  +  F +++SH FREGNK AD L+  G   QN   F   +   Q+ G+ ++D
Sbjct: 1065 YLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 1122

Query: 1049 QLNLPSFRTR 1078
            +LNLP  R R
Sbjct: 1123 RLNLPYVRFR 1132



 Score = 66.2 bits (160), Expect(2) = 2e-51
 Identities = 29/95 (30%), Positives = 54/95 (56%)
 Frame = +3

Query: 18   TNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHL 197
            +N + S +W   +  S+S F+W+ + N +PV+ ++ ++G  LA KC CC S   ESL+H+
Sbjct: 785  SNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSE--ESLIHV 842

Query: 198  FLHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
               N  + +VW  FA +    + +  +++  I +W
Sbjct: 843  LWENPVAKQVWNFFAKLFQIYILNPRHVSQIIWAW 877


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  162 bits (409), Expect(2) = 2e-51
 Identities = 91/247 (36%), Positives = 136/247 (55%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  I WF+WLERN  KH   G    RV+ R+   +  L    L Q+  WKG  +
Sbjct: 683  HIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTD 742

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +AA +    ++ +      V+W KP  G+YKLN+DGS          GGVLRD  G +I 
Sbjct: 743  IAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 801

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++   +C +S  AE+ AL + L L +     ++WIE D+  ++QLI H+  G    ++
Sbjct: 802  GFSENIGNC-NSLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRY 860

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IRK +    ++ISHI REGN+VAD L+  G + QN   F   +   ++ G+ ++D+
Sbjct: 861  LLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVF--TEAQGKLHGMLKLDR 918

Query: 1052 LNLPSFR 1072
            LNLP  R
Sbjct: 919  LNLPYVR 925



 Score = 68.9 bits (167), Expect(2) = 2e-51
 Identities = 31/96 (32%), Positives = 56/96 (58%)
 Frame = +3

Query: 15  PTNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVH 194
           P N + S +W   +  S+S F+WR + N +PV+ ++ ++G  LA KC CC S   ESL+H
Sbjct: 579 PHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCCNSE--ESLMH 636

Query: 195 LFLHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
           +   N+ + +VW  FAN     + + ++++  + +W
Sbjct: 637 VLWGNSVAKQVWAFFANFFQIYIFNPQHVSHILWAW 672


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  164 bits (415), Expect(2) = 3e-51
 Identities = 94/248 (37%), Positives = 135/248 (54%), Gaps = 1/248 (0%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            H   ++P  I WF+WLERN  KH   G  A RVI R   H   L    L Q+  WKG  +
Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTD 1943

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 688
            +A   G  F          ++W KP +G+YKLN+DGS +N + +    GGVLRD  G +I
Sbjct: 1944 IATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAA--TGGVLRDHTGKLI 2001

Query: 689  LVFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQ 868
              F++    C +S  AE+ AL + L L +     K+WIE D+ V +QLI  +  G ++ +
Sbjct: 2002 FGFSENIGPC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLR 2060

Query: 869  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1048
            +LL  IR  +  F +++SHI REGN+ AD L+  G   QN   F   +   Q+ G+ ++D
Sbjct: 2061 YLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLD 2118

Query: 1049 QLNLPSFR 1072
            +LNLP  R
Sbjct: 2119 RLNLPYVR 2126



 Score = 65.9 bits (159), Expect(2) = 3e-51
 Identities = 29/95 (30%), Positives = 54/95 (56%)
 Frame = +3

Query: 18   TNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHL 197
            +N + S +W   +  S+S F+W+ + N +PV+ ++ ++G  LA KC CC S   ESL+H+
Sbjct: 1781 SNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSE--ESLIHV 1838

Query: 198  FLHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
               N  + +VW  FA +    + +  +++  I +W
Sbjct: 1839 LWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAW 1873


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  157 bits (397), Expect(2) = 6e-51
 Identities = 89/249 (35%), Positives = 135/249 (54%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP  I WF+WLERN  KH   G    RVI R+   +  L +  L ++  WKG  +
Sbjct: 1707 HIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTD 1766

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +G  F          ++W KP +G+YKLN+DGS     +  G GGVLRD  G +  
Sbjct: 1767 IATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG-GGVLRDHTGKLAF 1825

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++     P S  AE+ AL + L L +  +   +WIE D+ V +Q++  +  G    ++
Sbjct: 1826 AFSENLGPLP-SLQAELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRY 1884

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IR  +  F ++ISHI+REGN+ AD L+  G + Q+   F   +   ++ G+ ++D+
Sbjct: 1885 LLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF--SEAQGELIGILKLDK 1942

Query: 1052 LNLPSFRTR 1078
            LNLP  R R
Sbjct: 1943 LNLPYVRFR 1951



 Score = 72.0 bits (175), Expect(2) = 6e-51
 Identities = 33/94 (35%), Positives = 53/94 (56%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            N +FS +W   +  S+S F+WR++ N +PV+ ++ D+G  LA KC CC S   ESL+H+ 
Sbjct: 1605 NALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSE--ESLIHVL 1662

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW  FA      +    +I+  I +W
Sbjct: 1663 WENPVATQVWFFFAKSFQIYVSKPNHISQIIWAW 1696


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  159 bits (401), Expect(2) = 1e-50
 Identities = 88/247 (35%), Positives = 136/247 (55%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  ILWF+W+ERN  KH + G    RV+ RV   I  L   +   K  WKG   
Sbjct: 2007 HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 2066

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +GI F+   +       WHKP LG++KLN+DGS     +  G GG+LRD  G ++ 
Sbjct: 2067 IAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVF 2125

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+LAL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2126 GFSEN-LGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 2184

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            L+  +R+ +  F ++ SHIFREGN+ AD LA  G   QN   F       ++RG+  +DQ
Sbjct: 2185 LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLCLDQ 2242

Query: 1052 LNLPSFR 1072
             + P  R
Sbjct: 2243 TSFPYVR 2249



 Score = 69.7 bits (169), Expect(2) = 1e-50
 Identities = 31/94 (32%), Positives = 55/94 (58%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            NP+F+ +W   +  + S F+WRL+ + +PV+ K+  +G  LA +C CC+S   ES++H+ 
Sbjct: 1905 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE--ESIMHVM 1962

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW +FA +    + +   IN  I +W
Sbjct: 1963 WDNPVAMQVWNYFAKLFQILIINPCTINQIIGAW 1996


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  154 bits (389), Expect(2) = 1e-50
 Identities = 87/247 (35%), Positives = 138/247 (55%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP  I WF+WLERN  KH   G  + RV+ ++   +  L+   L +   WKG  +
Sbjct: 1710 HIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKD 1769

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
             A  +G++           +HW KP  G++KLN+DGS    +    IGGVLRD  G ++ 
Sbjct: 1770 FATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQ-NQTAAIGGVLRDHTGTLVF 1828

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+ AL + L L +  +  K+W+E D+ V +Q+I  +  G    ++
Sbjct: 1829 DFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRY 1887

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL+ IRK +  F ++ISHIFREGN+ AD L+  G + Q+   F   +   ++ G+ ++D+
Sbjct: 1888 LLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVF--TEAQGKLYGMLKLDR 1945

Query: 1052 LNLPSFR 1072
            LNLP  R
Sbjct: 1946 LNLPYVR 1952



 Score = 73.9 bits (180), Expect(2) = 1e-50
 Identities = 35/94 (37%), Positives = 54/94 (57%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            N + S LW   +  S+S F+WR+  N +PVD +L ++GF LA KC CC S   ESL+H+ 
Sbjct: 1608 NVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCNSE--ESLIHVL 1665

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW  FAN     +   +N++  + +W
Sbjct: 1666 WDNPIAKQVWNFFANSFQIYISKPQNVSQILWTW 1699


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  164 bits (415), Expect(2) = 2e-50
 Identities = 91/247 (36%), Positives = 137/247 (55%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  I WF+WLERN  K+   G +  R++ R+   +  LK   L Q+  WKG  +
Sbjct: 1971 HIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTD 2030

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +AA +   F++ +      V+W KP  G+YKLN+DGS          GGVLRD  G +I 
Sbjct: 2031 IAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSRH-GQHAASGGVLRDHTGKLIF 2089

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++    C +S  AE+ AL + L L +     K+WIE D+   +QL+ H+  G    ++
Sbjct: 2090 GFSENIGTC-NSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRY 2148

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IRK +    ++ISHI REGN+VAD L+  G + QN   F   +   ++ G+ ++D+
Sbjct: 2149 LLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVF--TEAQGKLHGMLKLDR 2206

Query: 1052 LNLPSFR 1072
            LNLP  R
Sbjct: 2207 LNLPYVR 2213



 Score = 63.2 bits (152), Expect(2) = 2e-50
 Identities = 29/94 (30%), Positives = 53/94 (56%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            N + S +W   +  S+S F+WR + N +PV+ ++  +G  LA KC CC S   ESL+H+ 
Sbjct: 1869 NTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCCNSE--ESLMHVL 1926

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N+ + +VW  FA      + + ++++  + +W
Sbjct: 1927 WGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWAW 1960


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  157 bits (396), Expect(2) = 4e-50
 Identities = 87/247 (35%), Positives = 137/247 (55%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P   LWF+W+ERN  KH + G    R++ R+   I  L   +   K  WKG   
Sbjct: 2005 HIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQ 2064

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +GI F+   +       WHKP +G++KLN+DGS     +  G GGVLRD  G ++ 
Sbjct: 2065 IAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVF 2123

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+LAL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2124 GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRY 2182

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IR+ +  F +++SHIFREGN+ AD LA  G   Q+           ++RG+ R+DQ
Sbjct: 2183 LLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQ--GKLRGMLRLDQ 2240

Query: 1052 LNLPSFR 1072
             +LP  R
Sbjct: 2241 TSLPYVR 2247



 Score = 69.7 bits (169), Expect(2) = 4e-50
 Identities = 30/94 (31%), Positives = 56/94 (59%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            NP+F+ +W   +  ++S F+WRL+ + +PV+ K+  +GF LA +C CC+S   ES++H+ 
Sbjct: 1903 NPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKSE--ESIMHVM 1960

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW +F+      + +   IN  + +W
Sbjct: 1961 WDNPVATQVWNYFSKFFQILVINPCTINQILGAW 1994


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  155 bits (393), Expect(2) = 6e-50
 Identities = 87/247 (35%), Positives = 135/247 (54%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  ILWF+W+ERN  KH + G    RV+ RV   I  L   +   K  WKG   
Sbjct: 666  HIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQ 725

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +GI  +   +       WHKP  G++KLN+DGS     +  G GG+LRD  G ++ 
Sbjct: 726  IAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVF 784

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+LAL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 785  GFSEN-LGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRY 843

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            L+  +R+ +  F ++ SHIFREGN+ AD LA  G   QN   F       ++RG+ R+DQ
Sbjct: 844  LMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLRLDQ 901

Query: 1052 LNLPSFR 1072
             + P  R
Sbjct: 902  TSFPYVR 908



 Score = 70.1 bits (170), Expect(2) = 6e-50
 Identities = 31/94 (32%), Positives = 55/94 (58%)
 Frame = +3

Query: 21  NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
           NP+F+ +W   +  + S F+WRL+ + +PV+ K+  +G  LA +C CC+S   ES++H+ 
Sbjct: 564 NPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE--ESIMHVM 621

Query: 201 LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
             N  + +VW +FA +    + +   IN  I +W
Sbjct: 622 WDNPVAMQVWNYFAKLFQICIINPCTINQIIGAW 655


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  147 bits (370), Expect(2) = 2e-48
 Identities = 83/223 (37%), Positives = 125/223 (56%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP  I WF+WLERN  K    G  + RV+ ++   +  L+   + +   WKG  +
Sbjct: 1290 HIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMD 1349

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +AA +G  F   +       HW K   G++KLN+DGS     S   IGG+LRD  G ++ 
Sbjct: 1350 IAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA-IGGLLRDHTGTLVF 1408

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+ AL + L L +  +  K+WIE D+ V +Q+I  +  G    Q+
Sbjct: 1409 GFSEN-IGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQY 1467

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 1000
            LL+ IRK +  F ++ISHIFREGN+VAD L+  G + QN + F
Sbjct: 1468 LLASIRKCLSFFSFRISHIFREGNQVADFLSNKGHTQQNLLVF 1510



 Score = 73.9 bits (180), Expect(2) = 2e-48
 Identities = 34/94 (36%), Positives = 54/94 (57%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            N + S  W   +  S+S F+WR+  N +PVD +L D+GF LA KC CC S   E+L+H+ 
Sbjct: 1188 NVLCSLFWHKSIPLSISFFLWRVFHNWIPVDLRLKDKGFHLASKCACCNSE--ETLIHVL 1245

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N  + +VW  FAN     + + +N++  + +W
Sbjct: 1246 WDNPVAKQVWNFFANFFQIYVSNPQNVSQILWAW 1279



 Score =  106 bits (264), Expect = 2e-20
 Identities = 58/168 (34%), Positives = 96/168 (57%)
 Frame = +2

Query: 569  VHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILVFADGFMDCPDSTYAEILA 748
            ++W +P +G++KLN+DG           GGV RD    +I  F++ F    +ST AE++A
Sbjct: 1536 IYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPY-NSTQAELMA 1594

Query: 749  LHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRKQMEGFDWKISHI 928
            LH+ L L    +  ++WIE D++ ++Q++     G    Q+LLS I + + G  ++ISHI
Sbjct: 1595 LHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISHI 1654

Query: 929  FREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1072
             RE N+ AD L+  G + Q+   F   +   ++RG+ R+D+ NLP  R
Sbjct: 1655 HRESNQAADYLSNQGHTHQSLQVFSKAE--GELRGMIRLDKSNLPYVR 1700


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  155 bits (391), Expect(2) = 4e-48
 Identities = 89/247 (36%), Positives = 132/247 (53%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  ++P  I WF+WLERN  KH     +  RV+ R+   +  L    L  +  WKG  +
Sbjct: 635  HIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTD 694

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A+ +G  F+         ++W KP  G+YKLN+DGS          GG+LRD  G +I 
Sbjct: 695  IASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRN-GHLAASGGILRDHTGKLIF 753

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++    C +S  AE+ AL + L L +      +WIE D+  ++QLI H+  G    ++
Sbjct: 754  GFSENIGLC-NSLQAELRALLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRY 812

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IRK +    ++ISHIFREGN+ AD LA  G S QN       +   ++ G+ ++D+
Sbjct: 813  LLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLC--VITEAQGELHGMLKLDR 870

Query: 1052 LNLPSFR 1072
            LNLP  R
Sbjct: 871  LNLPYVR 877



 Score = 64.7 bits (156), Expect(2) = 4e-48
 Identities = 28/95 (29%), Positives = 54/95 (56%)
 Frame = +3

Query: 18  TNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHL 197
           +N + S +W   +  S+S F+WR + N +PV+ ++ ++G  LA KC CC S   ESL+H+
Sbjct: 532 SNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCCNSE--ESLMHV 589

Query: 198 FLHNTHSHKVWMHFANMLHFSLPDTENINTFISSW 302
              N+ + +VW  F       + + ++++  + +W
Sbjct: 590 LWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWAW 624


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  153 bits (386), Expect(2) = 3e-47
 Identities = 92/245 (37%), Positives = 138/245 (56%), Gaps = 1/245 (0%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +I   I WF+W+ERN  KH   G    R+I R+   +  L    L  K  WKG  +
Sbjct: 1091 HIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLD 1150

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGS-KNPISSCGGIGGVLRDWQGNVI 688
            +A  +G  F      +   ++W KP +G+ KLN+DGS K+   +  G GGVLRD  GN+I
Sbjct: 1151 IAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAG-GGVLRDHTGNLI 1209

Query: 689  LVFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQ 868
              F++ F    +S  AE+LALH+ L L    +  ++WIE D+QV++Q+I ++  G +  Q
Sbjct: 1210 FGFSENF-GYQNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQ 1268

Query: 869  HLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVD 1048
            +LL  IRK ++    +ISHI REGN+ AD L+K G + QN   F   +   ++RG   V+
Sbjct: 1269 YLLESIRKCLQVISVRISHIHREGNQAADFLSKHGHTHQNLHVF--TEAQGELRGRTLVN 1326

Query: 1049 QLNLP 1063
            ++  P
Sbjct: 1327 RVEHP 1331



 Score = 63.9 bits (154), Expect(2) = 3e-47
 Identities = 28/88 (31%), Positives = 51/88 (57%)
 Frame = +3

Query: 39   LWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLFLHNTHS 218
            +W   +  ++S F+WR + N LPV+ ++  +G  LA KC CC+S   ESL+H+   +  +
Sbjct: 995  IWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSE--ESLLHVLWESPVA 1052

Query: 219  HKVWMHFANMLHFSLPDTENINTFISSW 302
             +VW +F+      + + +NI   ++SW
Sbjct: 1053 QQVWNYFSKFFQIYVHNPQNILQILNSW 1080


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  150 bits (379), Expect(2) = 2e-35
 Identities = 84/227 (37%), Positives = 122/227 (53%)
 Frame = +2

Query: 320  AHTAHISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWK 499
            A    I  ++P  I WF+WLERN  KH   G    RV+ R+   +  L+   L Q+  WK
Sbjct: 883  AKQGRIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWK 942

Query: 500  GFGNVAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQG 679
            G  ++AA +   F++        V+W KP  G+YKLN+DGS          GGVLRD   
Sbjct: 943  GDTDIAAMWRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAAS-GGVLRDHTS 1001

Query: 680  NVILVFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKW 859
             +I  F++  +   +S  AE+ ALH+ L L +     K+WIE D+  ++QLI H+  G  
Sbjct: 1002 KLIFCFSEN-IGTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSH 1060

Query: 860  SYQHLLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQF 1000
              ++LL  I+K +    ++ISHIFREGN+ AD L+  G + QN   F
Sbjct: 1061 DIRYLLESIKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVF 1107



 Score = 27.3 bits (59), Expect(2) = 2e-35
 Identities = 14/39 (35%), Positives = 21/39 (53%)
 Frame = +3

Query: 96  NRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLFLHNT 212
           N L +   + ++G  L  KC CC S   ESL+H+   N+
Sbjct: 845 NTLALSFGIEEKGIHLVSKCVCCNSE--ESLMHVLWGNS 881


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  110 bits (274), Expect(2) = 2e-33
 Identities = 75/242 (30%), Positives = 112/242 (46%), Gaps = 2/242 (0%)
 Frame = +2

Query: 359  ILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN--VAASFGI 532
            ILW+IW  RN+ + +S+ FS   V   V  HI    SS+L          +  +  SFG 
Sbjct: 636  ILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQA--SSRLATGHMHNTIHDLCILKSFGA 693

Query: 533  YFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILVFADGFM 712
              R   + + + V WH P +G  K+N DG+       GG G V R ++G  +  FA   +
Sbjct: 694  CCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGIGGFGAVFRYYKGQFVGAFAS-HI 752

Query: 713  DCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQHLLSKIRK 892
            D P S  A+++ +  A+ L     +  +W+E D   +L  I   +   W  +        
Sbjct: 753  DIPSSIAAKVMVVITAIELAWVRDWKHVWLEVDFSTVLDYIRSPSLVPWQLRVRWLNCLY 812

Query: 893  QMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQLNLPSFR 1072
            ++    +K SHIFREGN+VAD LA  G S    V +  D  P  +      D L +P+FR
Sbjct: 813  RISTMTFKSSHIFREGNRVADALANHGTSMSEEVWW--DVPPSFILSYYERDLLGMPNFR 870

Query: 1073 TR 1078
             R
Sbjct: 871  FR 872



 Score = 60.8 bits (146), Expect(2) = 2e-33
 Identities = 33/91 (36%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
 Frame = +3

Query: 3   KDTSPTNPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVE 182
           +  SP  P   PLWS F+ P MS+  W++++  +     L  RG +L  +C  C + S E
Sbjct: 519 QQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYHLLQRRGVALVSRCEFCGN-STE 577

Query: 183 SLVHLFLHNTHSHKVWMHFANMLHFSL-PDT 272
           SL H+FLH + +  VW HF  +    L P+T
Sbjct: 578 SLDHIFLHCSFAASVWNHFIYIFEIGLVPNT 608


>gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]
          Length = 458

 Score =  139 bits (351), Expect = 2e-30
 Identities = 83/243 (34%), Positives = 126/243 (51%)
 Frame = +2

Query: 335  ISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGNV 514
            IS +IP  I WF+WLERN  KH   G    RV+      +  L      ++  WK   ++
Sbjct: 218  ISALIPLFICWFLWLERNDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDI 277

Query: 515  AASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILV 694
            AA +   F          +HW KP  G+YKLN+DGS     S    GG+LRD  G ++  
Sbjct: 278  AAMWSFLFPPKHGTTPQIIHWVKPFTGEYKLNVDGSSRNCQSATS-GGLLRDHIGKLVFG 336

Query: 695  FADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQHL 874
            F++    C +S  AE+ AL + L L +     ++WIE D+ V++Q+I     G    ++L
Sbjct: 337  FSENIGRC-NSLQAELRALLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYL 395

Query: 875  LSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQL 1054
            L+ IRK +    ++I HIFREGN+ A  L+  G + QN       +   ++ G+ ++D+L
Sbjct: 396  LTSIRKGLSSISYRILHIFREGNQAAYFLSNQGYTHQNLC--LITEAQGELHGMLKLDRL 453

Query: 1055 NLP 1063
            NLP
Sbjct: 454  NLP 456


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  138 bits (347), Expect = 5e-30
 Identities = 83/247 (33%), Positives = 129/247 (52%)
 Frame = +2

Query: 332  HISFIIPCVILWFIWLERNKNKHESKGFSAYRVIGRVEHHIYLLKSSKLFQKSTWKGFGN 511
            HI  +IP   LWF+W+ERN  KH + G                    +   +  WKG   
Sbjct: 2143 HIRTLIPIFTLWFLWVERNDAKHRNLG--------------------QQLLEWQWKGDKQ 2182

Query: 512  VAASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVIL 691
            +A  +GI F+   +       WHKP  G++KLN+DGS     +  G GGVLRD  G +I 
Sbjct: 2183 IAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIF 2241

Query: 692  VFADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQH 871
             F++  +   +S  AE+LAL++ L L    +  ++WIE D+  +++L+  N  G  + ++
Sbjct: 2242 GFSEN-LGIQNSLKAELLALYRGLILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRY 2300

Query: 872  LLSKIRKQMEGFDWKISHIFREGNKVADGLAKLGCSSQNFVQFFADDFPRQVRGLARVDQ 1051
            LL  IR+ +  F ++++HIFREGN+ AD LA  G   Q+           ++RG+ R+DQ
Sbjct: 2301 LLGSIRQLLSHFSFRLTHIFREGNQAADFLANRGHEHQSLQVITVAQ--GKLRGMLRLDQ 2358

Query: 1052 LNLPSFR 1072
             +LP  R
Sbjct: 2359 TSLPYVR 2365


>ref|XP_004242524.1| PREDICTED: uncharacterized protein LOC101258077 [Solanum
            lycopersicum]
          Length = 1454

 Score =  109 bits (273), Expect(2) = 8e-30
 Identities = 76/252 (30%), Positives = 125/252 (49%), Gaps = 5/252 (1%)
 Frame = +2

Query: 344  IIPCVILWFIWLERNKNKHESKGFSAYRV---IGRVEHHIYLLKSSKLFQKSTWKGFGNV 514
            I+P  I W +W  R   K+  K  S YRV   I +    +  +    +  +++W    N+
Sbjct: 1203 ILPNFICWNLWKNRCAVKYGLKNSSIYRVQYGIFKNIMQVITIVFPSIPWQTSWNNLINI 1262

Query: 515  AASFGIYFRISVVQKCIHVHWHKPDLGQYKLNIDGSKNPISSCGGIGGVLRDWQGNVILV 694
                  +++I +V+      W+KPDLG+YKLN DGS    S   G GG+LRD QG +I  
Sbjct: 1263 VEQCKQHYKILIVK------WNKPDLGKYKLNTDGSALQNSGKIGGGGILRDNQGKIIYA 1316

Query: 695  FADGFMDCPDSTYAEILALHKALSLIEASSFHKIWIETDSQVLLQLISHNATGKWSYQHL 874
            F+  F     + +AEI A    L   E   + KI +E DS++L   I+ N    W Y+ L
Sbjct: 1317 FSLPF-GFGTNNFAEIKAALHGLDWCEQHGYKKIELEVDSKLLCNWINSNINIPWRYEEL 1375

Query: 875  LSKIRKQMEGFD-WKISHIFREGNKVADGLAKLGCSSQNFVQFFAD-DFPRQVRGLARVD 1048
            + +I + +   D ++  HI+RE N  AD L+K   + +   +F+        +RG   ++
Sbjct: 1376 IQQIHQIIRKMDQFQCHHIYREANCTADLLSKWSHNLEILQKFYTTRQLKEPIRGSYLLE 1435

Query: 1049 QLNLPSFRTRTI 1084
            ++ + +FR R +
Sbjct: 1436 KMGVQNFRRRKL 1447



 Score = 48.9 bits (115), Expect(2) = 8e-30
 Identities = 22/104 (21%), Positives = 53/104 (50%)
 Frame = +3

Query: 21   NPIFSPLWSHFLTPSMSIFMWRLIKNRLPVDQKLIDRGFSLAFKCWCCESPSVESLVHLF 200
            +PI + +W   +   +S F+WR ++ +LP ++ L   G +L+  C+CC +   + + H+ 
Sbjct: 1096 DPINNIIWHKQIPFKVSFFIWRALRGKLPTNENLQRIGKNLS-DCYCCYNKGKDDINHIL 1154

Query: 201  LHNTHSHKVWMHFANMLHFSLPDTENINTFISSWKNFTPLHTLH 332
            ++   +  +W  +++ +   LP    +   +  W+N    + +H
Sbjct: 1155 INGNFAKYIWKIYSSAVGV-LPINTTLRDLLLQWRNQQYTNEVH 1197

