BLASTX nr result

ID: Rehmannia23_contig00009220 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00009220
         (1299 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   223   2e-55
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   217   7e-54
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   217   1e-53
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   217   1e-53
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   213   1e-52
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   212   2e-52
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   212   3e-52
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   212   3e-52
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   211   4e-52
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   211   4e-52
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   211   7e-52
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   208   5e-51
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   206   1e-50
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   200   9e-49
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   194   9e-47
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   176   2e-41
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...   158   5e-36
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...   155   3e-35
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   154   6e-35
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   152   4e-34

>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  223 bits (567), Expect = 2e-55
 Identities = 138/430 (32%), Positives = 206/430 (47%), Gaps = 3/430 (0%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L  +     N + +V    F++ D WD+E+L   +      +       
Sbjct: 1515 FWHDCWMGDQPLATLCPSFHNDMSHV--HKFYNGDVWDIEKLSSCLPTSLVDEILQIPFD 1572

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  +SNG FSL SA+  +     P  +F+  W+  +  S S FLW
Sbjct: 1573 R-------SQEDVAYWALTSNGDFSLWSAWEAIRQRQTPNALFSLIWHRSIPLSISFFLW 1625

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R++ N +PV+ +++ +GI LASKCVCC              EE+L H+   N    +VW 
Sbjct: 1626 RVLNNWIPVELRMKDKGIHLASKCVCC------------RSEESLIHVLWENPVATQVWF 1673

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA   +  +    HIS  +  W  +    +  HI +L+P  + W++W ERN  +H +  
Sbjct: 1674 FFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMG 1733

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R+I  +   +  L    L +   WKG  D AT  G +F     +    I W +P  
Sbjct: 1734 MYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFI 1793

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETAL 225
               KLN+DGS KS+L AAG GG++R+H     +AFS   G +P  +  LH    AL   L
Sbjct: 1794 GEYKLNVDGSSKSNLNAAG-GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGL 1848

Query: 224  TYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGN 45
                 ++I +LWIE D+L+   MV     G    RY L  I   L +  +RI+HI+REGN
Sbjct: 1849 LLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGN 1908

Query: 44   KVADFLASLG 15
            + ADFL++ G
Sbjct: 1909 QAADFLSNKG 1918


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  217 bits (553), Expect = 7e-54
 Identities = 131/427 (30%), Positives = 211/427 (49%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L       +N +  V    F+  D+WD+++L++ +      +       
Sbjct: 1779 FWHDCWMGDQPLVISFPSFRNDMSFV--HKFYKGDSWDVDKLRLFLPVNLIYEILLIPFD 1836

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    T  D   W  +SNG+FS  SA+ T+    +   + +  W+  +  S S F+W
Sbjct: 1837 R-------TQQDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLIWHRSIPLSISFFIW 1889

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R + N +PV+ +++ +GI LASKCVCC            + EE+L H+   N+   +VW 
Sbjct: 1890 RALNNWIPVELRMKGKGIHLASKCVCC------------NSEESLMHVLWGNSVAKQVWA 1937

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA + +  + + +H+S  L  W  +    ++ HI  LLP  + W++W ERN  ++ ++ 
Sbjct: 1938 FFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSG 1997

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
             +  RI+  +   ++ L    L Q   WKG  D A      F+   R+    + W++PS+
Sbjct: 1998 LNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPST 2057

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS + H   A  GG++R+H    I+ FS  I      L AE  AL   L   
Sbjct: 2058 GEYKLNVDGSSR-HGQHAASGGVLRDHTGKLIFGFSENIGTCNS-LQAELRALLRGLLLC 2115

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
              + I+ LWIE D+L    ++ +   G    RY L  I   L++  +RI+HIHREGN+VA
Sbjct: 2116 KERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVA 2175

Query: 35   DFLASLG 15
            DFL++ G
Sbjct: 2176 DFLSNEG 2182


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  217 bits (552), Expect = 1e-53
 Identities = 129/427 (30%), Positives = 204/427 (47%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  N  L       +N +    +  F++ DNWD+  L++ +      +       
Sbjct: 578  FWHDCWMGNKPLVTSFPSFRNDM--TFVHKFYNGDNWDVNTLKLYLPMNLIDEILQIPFD 635

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  +S+G+FS  SA+  V    +P  + +  W+  +  + S FLW
Sbjct: 636  R-------SQDDIAYWALTSDGEFSTWSAWEAVRQRQSPNTLCSFIWHKSIPLTISFFLW 688

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R++ N +PV+ +L+++G  LASKCVCC            + EE+L H+   N    +VW 
Sbjct: 689  RVLNNWIPVELRLKEKGFHLASKCVCC------------NSEESLIHVLWDNPVAKQVWN 736

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA + +  + + +H+S  +  W  +    +K HI  L+P  + W++W ERN  +H +  
Sbjct: 737  FFADFFQINISNPQHVSQIIWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLG 796

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  +   ++ L    L +   WKG  D A   G       R     I W +P +
Sbjct: 797  MYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVT 856

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS + H  +A  GG++R+H    ++ FS  I  S   L AE  AL   L   
Sbjct: 857  GEYKLNVDGSSR-HNQSAATGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLC 914

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
              ++I+ LWIE D+L++  M+     G    RY L  I   L    FRI+HI REGN+ A
Sbjct: 915  KDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFRISHIFREGNQAA 974

Query: 35   DFLASLG 15
            DFL++ G
Sbjct: 975  DFLSNKG 981


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  217 bits (552), Expect = 1e-53
 Identities = 133/427 (31%), Positives = 208/427 (48%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W   + L    Q   + +  V  DFF +N++W++E+L+ V+  E   +       
Sbjct: 1815 FWHDCWMGEAPLISSNQEFTSSMVQVC-DFF-TNNSWNIEKLKTVLQQEVVDEIAKIPID 1872

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                     + D   W P+ NG FS  SA+  +       PVF   W+  +  + S FLW
Sbjct: 1873 TM-------NKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 1925

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ K++ +G+ LAS+C CC              EE++ H+   N   M+VW 
Sbjct: 1926 RLLHDWIPVELKMKSKGLQLASRCRCC------------KSEESIMHVMWDNPVAMQVWN 1973

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
            +FA   +  + +   I+  +  W  +    +  HI  L+P  +LW++W ERN  +H N  
Sbjct: 1974 YFAKLFQILIINPCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLG 2033

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  V   IQ LS  +      WKG    A   G+ F+  S +      W +PS 
Sbjct: 2034 MYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSL 2093

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS K    AAG GGI+R+H  + ++ FS  +  ++  L AE +AL   L   
Sbjct: 2094 GEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVFGFSENL-GTQNSLQAELLALYRGLILC 2151

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
               +I  LWIE D++ +  ++     G  + RY +V +   L +  FR +HI REGN+ A
Sbjct: 2152 RDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAA 2211

Query: 35   DFLASLG 15
            DFLA+ G
Sbjct: 2212 DFLANRG 2218


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  213 bits (543), Expect = 1e-52
 Identities = 132/430 (30%), Positives = 211/430 (49%), Gaps = 1/430 (0%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  N  L       +N +    +  F++ D WD+++L+  +      +       
Sbjct: 443  FWHDCWMGNQPLVMSFPSLRNDMS--LVHNFYNGDTWDVDKLKAYLPMNLIDEILLIPFN 500

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    T  D   W  +SNG+F+  SA+ T+    +   + +  W+  +  S S FLW
Sbjct: 501  R-------TQQDVAYWTLTSNGEFATWSAWETIRQRKSSNALCSFIWHRSIPLSISFFLW 553

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R + N +PV+ +++++GI LASKCVCC            + EE+L H+   N+   +VW 
Sbjct: 554  RALNNWIPVELRMKEKGIQLASKCVCC------------NSEESLMHVLWGNSVAKQVWA 601

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             F  + +  + + +H+S  L  W  +    +K HI  LLP  + W++W ERN  +H +  
Sbjct: 602  FFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTR 661

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
             +  R++  +   ++ L    L     WKG  D A+  G  F+   R+    I W++P +
Sbjct: 662  LNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFT 721

Query: 395  PWIKLNIDGSYKS-HLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTY 219
               KLN+DGS ++ HL A+  GGI+R+H    I+ FS  I      L AE  AL   L  
Sbjct: 722  GEYKLNVDGSSRNGHLAAS--GGILRDHTGKLIFGFSENIGLCNS-LQAELRALLRGLLL 778

Query: 218  SYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKV 39
               + I++LWIE D+L +  ++ +   G    RY L  I   L    +RI+HI REGN+ 
Sbjct: 779  CKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQA 838

Query: 38   ADFLASLGLS 9
            AD+LA+ G S
Sbjct: 839  ADYLANEGHS 848


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  212 bits (540), Expect = 2e-52
 Identities = 133/427 (31%), Positives = 202/427 (47%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W   + L    Q  +  L  V +  F+ N++WD+E+L+ V+  E   +       
Sbjct: 1813 FWHDCWMGETPLTSSNQ--ELSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPID 1870

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                       D   W P+ NG+FS  SA+  +       PVF   W+  +  + S FLW
Sbjct: 1871 AMSK-------DEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKTVPLTISFFLW 1923

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ K++ +G  LAS+C CC              EE++ H+   N    +VW 
Sbjct: 1924 RLLHDWIPVELKMKSKGFQLASRCRCC------------KSEESIMHVMWDNPVATQVWN 1971

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
            +F+ + +  + +   I+  L  W  +    +  HI  L+P   LW++W ERN  +H N  
Sbjct: 1972 YFSKFFQILVINPCTINQILGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLG 2031

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                RI+  +   IQ LS  +      WKG    A   G+ F+  S        W +PS 
Sbjct: 2032 MYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSI 2091

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS K    AAG GG++R+H    ++ FS  +   +  L AE +AL   L   
Sbjct: 2092 GEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILC 2149

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
               +I  LWIE D+  +  ++     G  + RY LV I   L +  FR++HI REGN+ A
Sbjct: 2150 RDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAA 2209

Query: 35   DFLASLG 15
            DFLA+ G
Sbjct: 2210 DFLANRG 2216


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  212 bits (539), Expect = 3e-52
 Identities = 133/430 (30%), Positives = 202/430 (46%), Gaps = 3/430 (0%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L  +     N + +V    F++ D WD+ +L   +      +       
Sbjct: 1272 FWHDCWMGDQPLATLFPSFHNDMSHV--HKFYNGDEWDIVKLNSYLPTSLVDEILQIPFD 1329

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  +SNG+FS  SA+  +     P  + +  W+  +  S S FLW
Sbjct: 1330 R-------SQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLSFNWHRSIPLSISFFLW 1382

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R++ N +PV+ +++ +GI LASKCVCC              EE+L H+   N    +VW 
Sbjct: 1383 RVLNNWIPVELRMKDKGIHLASKCVCC------------RSEESLIHVLWENPVAKQVWN 1430

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA   +  +   +HIS  +  W  +    +  HI +L+P  + W++W ERN  +H +  
Sbjct: 1431 FFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMG 1490

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R+I  +   +  L    L +   WKG  D AT  G ++          I W +P  
Sbjct: 1491 MYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFI 1550

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETAL 225
               KLN+DGS KS   AAG GG++R+H     +AFS   G +P  +  LH    AL   L
Sbjct: 1551 GEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGL 1605

Query: 224  TYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGN 45
                 ++I +LWIE D+L+   MV     G    RY L  I   L +  +RI+HI+REGN
Sbjct: 1606 LLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGN 1665

Query: 44   KVADFLASLG 15
            + ADFL++ G
Sbjct: 1666 QAADFLSNKG 1675



 Score =  211 bits (536), Expect = 7e-52
 Identities = 129/427 (30%), Positives = 198/427 (46%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W     L  VI+  +       +  F+ N++WD+E+L+ V+  E   +       
Sbjct: 3066 FWHDCWMGEEPL--VIRNQEFASSMAQVSDFFLNNSWDIEKLKSVLQQEVVEEIAKIPIN 3123

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W P+ NG FS  SA+          P +   W+  +  + S FLW
Sbjct: 3124 A-------SSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTYNYIWHKSVPLTTSFFLW 3176

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ K++ +G  LAS+C CC              EE+L H+   N    +VW 
Sbjct: 3177 RLLHDWVPVELKMKSKGFQLASRCRCC------------KSEESLMHVMWDNPVANQVWS 3224

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
            +FA   +  + +   I+  +S W  +   ++  HI  L+P  +LW++W ERN  +H N  
Sbjct: 3225 YFAKVFQIHIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLG 3284

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                RI+  +   I  L + K  Q   W+G    A   G+  +  + S    + W +PS 
Sbjct: 3285 MYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSI 3344

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS K +L  A  GG++R+H    I+ FS     S+  L AE +AL   L   
Sbjct: 3345 GEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENF-GSQDSLQAELMALHRGLLLC 3403

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
               ++  LWIE D+ +   M+     G    RY L  I   L    FRI+HI REGN+ A
Sbjct: 3404 IDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAA 3463

Query: 35   DFLASLG 15
            D L++ G
Sbjct: 3464 DHLSNQG 3470


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  212 bits (539), Expect = 3e-52
 Identities = 132/430 (30%), Positives = 205/430 (47%), Gaps = 3/430 (0%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L       +N + +     F++ D WD+++L+  +      +       
Sbjct: 1692 FWHDCWMGDKPLAASFPEFQNDMSHGY--HFYNGDTWDVDKLRSFLPTILVEEILQVPFD 1749

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  +SNG FS  SA+  +        + +  W+  +  S S FLW
Sbjct: 1750 K-------SREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSFIWHRSIPLSISFFLW 1802

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            + + N +PV+ +++++GI LASKCVCC            + EE+L H+   N    +VW 
Sbjct: 1803 KTLHNWIPVELRMKEKGIQLASKCVCC------------NSEESLIHVLWENPVAKQVWN 1850

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA   +  + +  H+S  +  W  +    +K H  VLLP  + W++W ERN  +H +  
Sbjct: 1851 FFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTG 1910

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R+I     H + L    L Q   WKG  D AT LG  F     +    I W++PS 
Sbjct: 1911 LYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSI 1970

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETAL 225
               KLN+DGS ++ L AA  GG++R+H    I+ FS  I    GP   L AE  AL   L
Sbjct: 1971 GEYKLNVDGSSRNGLHAA-TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGL 2025

Query: 224  TYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGN 45
                 + I+ LWIE D+L+   ++     G ++ RY L  I   L +  +R++HI REGN
Sbjct: 2026 LLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHILREGN 2085

Query: 44   KVADFLASLG 15
            + AD+L++ G
Sbjct: 2086 QAADYLSNEG 2095


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  211 bits (538), Expect = 4e-52
 Identities = 130/427 (30%), Positives = 206/427 (48%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L       +N +  V    F++  NWD+++L + +      +       
Sbjct: 1518 FWHDCWMGDQPLVTSFPHFRNDMSTV--HNFFNGHNWDVDKLNLYLPMNLVDEILQIPID 1575

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  +SNG+FS  SA+  +    +P  + +  W+  +  S S FLW
Sbjct: 1576 R-------SQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSLLWHKSIPLSISFFLW 1628

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R+  N +PVD +L+++G  LASKC+CC            + EE+L H+   N    +VW 
Sbjct: 1629 RVFHNWIPVDIRLKEKGFHLASKCICC------------NSEESLIHVLWDNPIAKQVWN 1676

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA   +  +   +++S  L  W  +    +K HI +L+P  + W++W ERN  +H +  
Sbjct: 1677 FFANSFQIYISKPQNVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLG 1736

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  +   ++ L    L +   WKG  D AT  GL     +R+    + W +P  
Sbjct: 1737 MYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVP 1796

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS + +  AA IGG++R+H    ++ FS  I  S   L AE  AL   L   
Sbjct: 1797 GEHKLNVDGSSRQNQTAA-IGGVLRDHTGTLVFDFSENIGPSNS-LQAELRALLRGLLLC 1854

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
              ++I+ LW+E D+L+   M+     G    RY L  I   L+   FRI+HI REGN+ A
Sbjct: 1855 KERNIEKLWVEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAA 1914

Query: 35   DFLASLG 15
            DFL++ G
Sbjct: 1915 DFLSNKG 1921


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  211 bits (538), Expect = 4e-52
 Identities = 128/427 (29%), Positives = 208/427 (48%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  +  L       +N +    +  F+  D+WD+++L++ +      +       
Sbjct: 491  FWHDCWMGDQPLVISFPSFRNDMS--LVHKFYKGDSWDVDKLRLFLPVNLVDEILLIPFD 548

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    T  D   W  +SNG+FS  SA+ T+        + +  W+  +  S S F+W
Sbjct: 549  R-------TQQDVAYWILTSNGEFSTRSAWETIRKRQPHNTLGSLIWHRSIPLSISFFIW 601

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            R + N +PV+ +++++GI LASKCVCC            + EE+L H+   N+   +VW 
Sbjct: 602  RALNNWIPVELRMKEKGIHLASKCVCC------------NSEESLMHVLWGNSVAKQVWA 649

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
             FA + +  + + +H+S  L  W  +    ++ HI  LLP  + W++W ERN  +H  + 
Sbjct: 650  FFANFFQIYIFNPQHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSG 709

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  +   ++ L    L Q   WKG  D A       +   R+    + W++PS+
Sbjct: 710  LYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPST 769

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS + H   A  GG++R+H    I+ FS  I      L AE  AL   L   
Sbjct: 770  GEYKLNVDGSSR-HGQHAASGGVLRDHTGKLIFGFSENIGNCNS-LQAELRALLRGLLLC 827

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
              + I+ LWIE D+L +  ++ +   G    RY L  I   L++  +RI+HI REGN+VA
Sbjct: 828  KERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVA 887

Query: 35   DFLASLG 15
            DFL++ G
Sbjct: 888  DFLSNEG 894


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  211 bits (536), Expect = 7e-52
 Identities = 131/427 (30%), Positives = 206/427 (48%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W  ++ L    Q   + +  V  DFF +N +W++E+L+ V+  E   +       
Sbjct: 474  FWHDCWMGDAPLISSNQEFTSSMVQVC-DFFMNN-SWNVEKLKTVLQQEVVDEIAKIPID 531

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                       D   W P+ NG FS  SA+  +       PVF   W+  +  + S FLW
Sbjct: 532  TMSK-------DEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWHKTVPLTTSFFLW 584

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ K++ +G+ LAS+C CC              EE++ H+   N   M+VW 
Sbjct: 585  RLLHDWIPVELKMKSKGLQLASRCRCC------------KSEESIMHVMWDNPVAMQVWN 632

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
            +FA   +  + +   I+  +  W ++    +  HI  L+P  +LW++W ERN  +H N  
Sbjct: 633  YFAKLFQICIINPCTINQIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLG 692

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  V   IQ LS  +      WKG    A   G+  +  S +      W +P++
Sbjct: 693  MYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTT 752

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS K    AAG GGI+R+H    ++ FS  +   +  L AE +AL   L   
Sbjct: 753  GEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILC 810

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
               +I  LWIE D++ +  ++     G  + RY +V +   L +  FR +HI REGN+ A
Sbjct: 811  RDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAA 870

Query: 35   DFLASLG 15
            DFLA+ G
Sbjct: 871  DFLANRG 877


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  208 bits (529), Expect = 5e-51
 Identities = 123/431 (28%), Positives = 207/431 (48%), Gaps = 4/431 (0%)
 Frame = -2

Query: 1295 FWHDIWF-ENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXX 1119
            FWHD W  +  L+N      ++ ++   +++F+++D WD+++L+  +      +      
Sbjct: 899  FWHDAWMGDEPLVNSFPSFSQSMMK---VNYFFNDDAWDVDKLKTFIPNAIVEEILKIPI 955

Query: 1118 XXXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFL 939
                        D   W  ++NG FS+ SA+  +        V    W+  +  + S FL
Sbjct: 956  SREKE-------DIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIWHKSIPLTVSFFL 1008

Query: 938  WRLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVW 759
            WR + N LPV+ +++ +GI LASKC+CC              EE+L H+   +    +VW
Sbjct: 1009 WRTLHNWLPVEVRMKAKGIQLASKCLCC------------KSEESLLHVLWESPVAQQVW 1056

Query: 758  MHFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENA 579
             +F+ + +  + + ++I   L+ W  +    +  HI  L+   + W++W ERN  +H + 
Sbjct: 1057 NYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDL 1116

Query: 578  SFSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPS 399
                 RII  +   ++ L +  L     WKG +D A   G  F +  ++R   I W +P 
Sbjct: 1117 GMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPL 1176

Query: 398  SPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETA 228
               +KLN+DGS K     A  GG++R+H  + I+ FS   GY    +  L AE +AL   
Sbjct: 1177 IGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGY----QNSLQAELLALHRG 1232

Query: 227  LTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREG 48
            L      ++  +WIE D+ ++  M+ N   G +  +Y L  I   L     RI+HIHREG
Sbjct: 1233 LCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREG 1292

Query: 47   NKVADFLASLG 15
            N+ ADFL+  G
Sbjct: 1293 NQAADFLSKHG 1303


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  206 bits (525), Expect = 1e-50
 Identities = 129/428 (30%), Positives = 202/428 (47%), Gaps = 1/428 (0%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W     L +  Q   + +  V+ DFF +N +W++E+L+ V+  E   +       
Sbjct: 1778 FWHDCWMGEEPLVNRNQAFASSMAQVS-DFFLNN-SWNVEKLKTVLQQEVVEEIVKIPID 1835

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                    +  D   W  + NG FS  SA+  + N     PVF   W+  +  + S FLW
Sbjct: 1836 T-------SSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNFIWHKSVPLTTSFFLW 1888

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ K++ +G  LAS+C CC              EE+L H+   N    +VW 
Sbjct: 1889 RLLHDWIPVELKMKTKGFQLASRCRCC------------KSEESLMHVMWKNPVANQVWS 1936

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
            +FA   +  + +   I+  +  W  +   ++  HI  L+P   LW++W ERN  +H N  
Sbjct: 1937 YFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLG 1996

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                R++  +   +  L + K  Q   W+G    A   G+  +  + S    + W +PS 
Sbjct: 1997 MYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSI 2056

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFS-GYIPRSEGPLHAESVALETALTY 219
              +KLN+DGS K +  +A  GG++R+H    I+ FS  + P+    L AE +AL   L  
Sbjct: 2057 GELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDS--LQAELMALHRGLLL 2114

Query: 218  SYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKV 39
                +I  LWIE D+ +   M+     G    RY L  I   L    FRI+HI REGN+ 
Sbjct: 2115 CIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQA 2174

Query: 38   ADFLASLG 15
            AD L++ G
Sbjct: 2175 ADHLSNQG 2182


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  200 bits (509), Expect = 9e-49
 Identities = 127/411 (30%), Positives = 194/411 (47%), Gaps = 3/411 (0%)
 Frame = -2

Query: 1238 KNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXXXXXXXXSWTHTDSMKWKPS 1059
            KN + +V    F++ D WD+++L+  +      +               +  D   W  +
Sbjct: 715  KNDMSHVY--HFYNGDTWDVDKLKSFLPTVLVEEILQVPFDK-------SREDVAYWTLT 765

Query: 1058 SNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLWRLIQNRLPVDTKLQKRGIS 879
            SNG FS  SA   +        + +  W+  +  S S FLW+ + N +PV+ +++++GI 
Sbjct: 766  SNGDFSTRSAGEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQ 825

Query: 878  LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 699
            LASKCVCC            + EE+L H+   N    +VW  FA   +  + +  H+S  
Sbjct: 826  LASKCVCC------------NSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQI 873

Query: 698  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 519
            +  W  +    +K H  VLLP  + W++W ERN  +H +      R+I     H + L  
Sbjct: 874  IWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYD 933

Query: 518  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSSPWIKLNIDGSYKSHLGAAG 339
              L Q   WKG  D A  LG  F     +    I W++PS    KLN+DGS ++ L AA 
Sbjct: 934  GSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA- 992

Query: 338  IGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETALTYSYTQSIDHLWIETDSLI 168
             GG++R+H    I+ FS  I    GP   L AE  AL   L     + I+ LWIE D+L 
Sbjct: 993  TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALA 1048

Query: 167  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 15
               ++     G +  RY L  I   L +  +R++H  REGNK AD+L++ G
Sbjct: 1049 AIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEG 1099


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  194 bits (492), Expect = 9e-47
 Identities = 116/356 (32%), Positives = 178/356 (50%)
 Frame = -2

Query: 1082 DSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLWRLIQNRLPVDT 903
            D   W  +SNG+FS  SA+  +    +P  + + FW+  +  S S FLWR+  N +PVD 
Sbjct: 1160 DIAYWALTSNGEFSTWSAWEALRLRQSPNVLCSLFWHKSIPLSISFFLWRVFHNWIPVDL 1219

Query: 902  KLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLP 723
            +L+ +G  LASKC CC            + EETL H+   N    +VW  FA + +  + 
Sbjct: 1220 RLKDKGFHLASKCACC------------NSEETLIHVLWDNPVAKQVWNFFANFFQIYVS 1267

Query: 722  HTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVE 543
            + +++S  L  W  +    +K HI  L+P  + W++W ERN  +  +      R++  + 
Sbjct: 1268 NPQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIM 1327

Query: 542  NHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSSPWIKLNIDGSY 363
              ++ L    + +   WKG +D A   G  F    ++      W +  S   KLN+DGS 
Sbjct: 1328 KLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSS 1387

Query: 362  KSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIE 183
            + +  AA IGG++R+H    ++ FS  I  S   L AE  AL   L     ++I+ LWIE
Sbjct: 1388 RQNQSAA-IGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLCKERNIEKLWIE 1445

Query: 182  TDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 15
             D+L+   M+     G    +Y L  I   L    FRI+HI REGN+VADFL++ G
Sbjct: 1446 MDALVAIQMIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKG 1501



 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 50/152 (32%), Positives = 74/152 (48%), Gaps = 3/152 (1%)
 Frame = -2

Query: 461  GLQFRRASRSRNTPILWQQPSSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGY 282
            GL++ + S      I W +P     KLN+DG  K     A  GG+ R+H    I+ FS  
Sbjct: 1522 GLRYEQDSHGHPKIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFS-- 1579

Query: 281  IPRSEGPLH---AESVALETALTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSL 111
               + GP +   AE +AL   L      +I  +WIE D+  +  M+     G+   +Y L
Sbjct: 1580 --ENFGPYNSTQAELMALHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLL 1637

Query: 110  VQIANRLHNKHFRITHIHREGNKVADFLASLG 15
              I   L    +RI+HIHRE N+ AD+L++ G
Sbjct: 1638 SFICQCLSGISYRISHIHRESNQAADYLSNQG 1669


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  176 bits (446), Expect = 2e-41
 Identities = 125/427 (29%), Positives = 183/427 (42%)
 Frame = -2

Query: 1295 FWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXX 1116
            FWHD W   + L  +    +  L  V +  F+ N++WD+E+L+ V+  E   +       
Sbjct: 1985 FWHDCWMGETPL--ISSNHEFSLSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPID 2042

Query: 1115 XXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLW 936
                       D   W P+ NG+FS  SA+  +       PVF   W+  +  + S FLW
Sbjct: 2043 AMSK-------DEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNFIWHKAIPLTTSFFLW 2095

Query: 935  RLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWM 756
            RL+ + +PV+ +++ +G  LAS+C CC              EE++ H+            
Sbjct: 2096 RLLHDWIPVELRMKSKGFQLASRCRCC------------RSEESIIHVM----------- 2132

Query: 755  HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENAS 576
                                  W N   + Q  HI  L+P   LW++W ERN  +H N  
Sbjct: 2133 ----------------------WDNPVAV-QPGHIRTLIPIFTLWFLWVERNDAKHRNLG 2169

Query: 575  FSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSS 396
                          QLL          WKG    A   G+ F+  S        W +PS+
Sbjct: 2170 Q-------------QLLE-------WQWKGDKQIAQEWGITFQAKSLPPPKVFCWHKPSN 2209

Query: 395  PWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYS 216
               KLN+DGS K    AAG GG++R+H    I+ FS  +   +  L AE +AL   L   
Sbjct: 2210 GEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIFGFSENLG-IQNSLKAELLALYRGLILC 2267

Query: 215  YTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVA 36
               +I  LWIE D+  +  ++     G  + RY L  I   L +  FR+THI REGN+ A
Sbjct: 2268 RDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRYLLGSIRQLLSHFSFRLTHIFREGNQAA 2327

Query: 35   DFLASLG 15
            DFLA+ G
Sbjct: 2328 DFLANRG 2334


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 364

 Score =  158 bits (399), Expect = 5e-36
 Identities = 101/360 (28%), Positives = 171/360 (47%), Gaps = 1/360 (0%)
 Frame = -2

Query: 1085 TDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLWRLIQNRLPVD 906
            +D + W P S+G+ S   A+  +             W+  + P  S+  W++++ R+  +
Sbjct: 2    SDKLIWVPLSSGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSE 61

Query: 905  TKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRC-T 729
              LQ+RGI+LAS+CV C               E+L H+FL  +    +W + A       
Sbjct: 62   DLLQRRGIALASRCVLC-----------GRDGESLPHIFLTCSFAASLWNNRAGLFELGC 110

Query: 728  LPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKT 549
            LP      + L ++       Q   I ++     LW+IWK RN  RH+N +     + + 
Sbjct: 111  LPQN---LVDLLYYGGVGRSHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQL 167

Query: 548  VENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSSPWIKLNIDG 369
            +  H++  SK  L  + N    +    + GL  R     R T + W  P   WIK+N DG
Sbjct: 168  IMGHVKTASKLALGCMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDG 227

Query: 368  SYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLW 189
            +++   G +G GGI R+     + AF+  +      + AE +A+  A+  ++ +  +H+W
Sbjct: 228  AWQKTTGKSGYGGIFRDFHGSFLGAFASNL-EILNSVDAEVMAVIQAIELAWVRDWEHIW 286

Query: 188  IETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGLS 9
            +E DS+I+ N + +     W  R       +R+   +FR +HI REGN+VAD LA++GLS
Sbjct: 287  LEVDSIIVLNFLQDPHLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLS 346


>ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            tuberosum]
          Length = 885

 Score =  155 bits (392), Expect = 3e-35
 Identities = 111/432 (25%), Positives = 198/432 (45%), Gaps = 1/432 (0%)
 Frame = -2

Query: 1298 SFWHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXX 1119
            SFW D W +   L  + +  K   E V +  F + + WD E+L   +  E          
Sbjct: 425  SFWFDNWTKQGALYHIEENAKE--EEVEVKEFCTGEGWDKEKLLQNLSLEMTDHIMENIS 482

Query: 1118 XXXXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFL 939
                        D + W  ++ G F++ SA+    N    +      WN  L    + F+
Sbjct: 483  PPNTLFG----NDVVWWMANAQGIFTVKSAWQITRNKQEVRRDCEVIWNKELPFKINFFM 538

Query: 938  WRLIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVW 759
            WR+ + R+  D  L+K  I++ S+C CC              EET++HLF       K+W
Sbjct: 539  WRVWKRRIATDDNLKKMRINIVSRCWCC----------DRKKEETMTHLFPTAPITYKLW 588

Query: 758  MHFAAWVRCTLPHTEHISIFLSFWKN-TTPLAQKNHITVLLPCLVLWYIWKERNHCRHEN 582
             +FA +    +       + +S+WK+  TP  Q   I   +P +++W +WK RN  +H++
Sbjct: 589  RYFAHFAGINIDGMHLQQLIISWWKHEATPKLQG--IYKAIPAIIMWTLWKRRNALKHDS 646

Query: 581  ASFSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQP 402
             S S  R+++ V   ++ + K++   + N +       +   Q++R  +     + W+ P
Sbjct: 647  -SISWERMVEMVIEVVRKMVKSQFPWIKNMRWTWQAIIQRLNQYKR--KIHVLRVTWKPP 703

Query: 401  SSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALT 222
               ++K N DG+ + + G +  G  IR+ + D I+A +  I  +   + AE+VA+ TAL 
Sbjct: 704  DDHYVKSNTDGACRGNPGLSSFGFCIRDDKGDLIYAKAKGIGIATN-MEAETVAILTALR 762

Query: 221  YSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNK 42
                + +  + IETDSL L  ++   +   W     + +I   +     +ITHI REGN 
Sbjct: 763  ECSNRKMQKVIIETDSLSLKKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFREGNS 822

Query: 41   VADFLASLGLST 6
            +AD LA++ + +
Sbjct: 823  LADSLANIAIES 834


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  154 bits (390), Expect = 6e-35
 Identities = 96/359 (26%), Positives = 164/359 (45%), Gaps = 1/359 (0%)
 Frame = -2

Query: 1082 DSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLWRLIQNRLPVDT 903
            D + W+ SS G+ +   A+  +       P     W+  + P  S+  W++++  +    
Sbjct: 497  DKLIWQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRGTVISYH 556

Query: 902  KLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTL- 726
             LQ+RG++L S+C  C            +  E+L H+FLH +    VW HF       L 
Sbjct: 557  LLQRRGVALVSRCEFC-----------GNSTESLDHIFLHCSFAASVWNHFIYIFEIGLV 605

Query: 725  PHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTV 546
            P+T      L    + +P  Q   + ++    +LWYIW  RN  R ++ +FS   + + V
Sbjct: 606  PNTIAEVFSLGLAMDRSP--QLKELWLICFTSILWYIWHARNQIRFDSRTFSVAGVCRLV 663

Query: 545  ENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSSPWIKLNIDGS 366
              HIQ  S+     +HN    +      G   R     R   ++W  PS  WIK+N DG+
Sbjct: 664  SRHIQASSRLATGHMHNTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGA 723

Query: 365  YKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWI 186
            +K   G  G G + R ++   + AF+ +I      + A+ + + TA+  ++ +   H+W+
Sbjct: 724  WKHEEGIGGFGAVFRYYKGQFVGAFASHIDIPSS-IAAKVMVVITAIELAWVRDWKHVWL 782

Query: 185  ETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGLS 9
            E D   + + + +     W  R   +    R+    F+ +HI REGN+VAD LA+ G S
Sbjct: 783  EVDFSTVLDYIRSPSLVPWQLRVRWLNCLYRISTMTFKSSHIFREGNRVADALANHGTS 841


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  152 bits (383), Expect = 4e-34
 Identities = 114/426 (26%), Positives = 182/426 (42%)
 Frame = -2

Query: 1292 WHDIWFENSLLNDVIQCDKNGLENVAIDFFWSNDNWDLERLQIVVGPEWAAKXXXXXXXX 1113
            WHD W  +  L       +N + +V    F+  D+WD+++L++ +      +        
Sbjct: 756  WHDCWMGDQPLVISFPSFRNDMSSV--HKFYKGDSWDVDKLRLFLPVNLINEILPIPFDR 813

Query: 1112 XXXXXSWTHTDSMKWKPSSNGKFSLSSAYATVHNVPNPQPVFAEFWNSCLTPSASIFLWR 933
                   T  D   W  +SNG+FS  SA+ T+             W S            
Sbjct: 814  -------TQQDVAYWTLTSNGEFSTWSAWETIRQ-----------WQS------------ 843

Query: 932  LIQNRLPVDTKLQKRGISLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMH 753
               N L +   ++++GI L SKCVCC            + EE+L H+             
Sbjct: 844  --HNTLALSFGIEEKGIHLVSKCVCC------------NSEESLMHVL------------ 877

Query: 752  FAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASF 573
                                 W N+  +A++  I  LLP  + W++W ERN  +H ++  
Sbjct: 878  ---------------------WGNS--VAKQGRIRTLLPIFICWFLWLERNDAKHRHSGL 914

Query: 572  SHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQQPSSP 393
               R++  +   ++ L    L Q   WKG  D A      F+   R+    + W++P + 
Sbjct: 915  YTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDIAAMWRYNFQLKQRAPPQIVYWRKPFTG 974

Query: 392  WIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSY 213
              KLN+DGS ++   AA  GG++R+H    I+ FS  I  +   L AE  AL   L    
Sbjct: 975  EYKLNVDGSSRNGQHAAS-GGVLRDHTSKLIFCFSENI-GTYNSLQAELRALHRGLLLCK 1032

Query: 212  TQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVAD 33
             + I+ LWIE D+L +  ++ +   G    RY L  I   L++  +RI+HI REGN+ AD
Sbjct: 1033 ERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLESIKKCLNSISYRISHIFREGNQAAD 1092

Query: 32   FLASLG 15
            FL++ G
Sbjct: 1093 FLSNEG 1098


Top