BLASTX nr result

ID: Rehmannia22_contig00027300 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00027300
         (982 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   168   3e-39
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   166   1e-38
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   164   5e-38
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   164   5e-38
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   162   1e-37
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   161   3e-37
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   161   4e-37
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   158   3e-36
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   157   6e-36
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   155   2e-35
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   155   2e-35
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   155   2e-35
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   152   1e-34
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   149   2e-33
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   144   4e-32
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   128   4e-27
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...   127   9e-27
ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A...   126   1e-26
ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A...   124   4e-26
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   124   6e-26

>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  168 bits (425), Expect = 3e-39
 Identities = 108/324 (33%), Positives = 156/324 (48%), Gaps = 3/324 (0%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N    +VW  FA   +  + +  H+S  
Sbjct: 1822 LASKCVCC------------NSEESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQI 1869

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +K H  VLLP  + W++W ERN  +H +      R+I     H + L  
Sbjct: 1870 IWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYD 1929

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L Q   WKG  D AT LG  F     +    I W+KPS    KLN+DGS ++ L AA 
Sbjct: 1930 GSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLHAA- 1988

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETALTYSYTQSIDHLWIETDSLI 715
             GG++R+H    I+ FS  I    GP   L AE  AL   L     + I+ LWIE D+L+
Sbjct: 1989 TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALV 2044

Query: 716  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895
               ++     G ++ RY L  I   L +  +R++HI REGN+ AD+L++ G        +
Sbjct: 2045 AIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHILREGNQAADYLSNEGHKHQNLCVF 2104

Query: 896  TAISLPHSAKGIARLDQLEIPSFR 967
            T         G+ +LD+L +P  R
Sbjct: 2105 T--EAQGQLHGMLKLDRLNLPYVR 2126


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  166 bits (421), Expect = 1e-38
 Identities = 104/321 (32%), Positives = 160/321 (49%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N+   +VW  FA + +  + + +H+S  
Sbjct: 1909 LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHI 1956

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    ++ HI  LLP  + W++W ERN  ++ ++  +  RI+  +   ++ L  
Sbjct: 1957 LWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRHSGLNTDRIVWRIMKLLRQLKD 2016

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L Q   WKG  D A      F+   R+    + W+KPS+   KLN+DGS + H   A 
Sbjct: 2017 GSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYKLNVDGSSR-HGQHAA 2075

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GG++R+H    I+ FS  I      L AE  AL   L     + I+ LWIE D+L    
Sbjct: 2076 SGGVLRDHTGKLIFGFSENIGTCNS-LQAELRALLRGLLLCKERHIEKLWIEMDALAAIQ 2134

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            ++ +   G    RY L  I   L++  +RI+HIHREGN+VADFL++ G +      +T  
Sbjct: 2135 LLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQVADFLSNEGHNHQNLHVFT-- 2192

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                   G+ +LD+L +P  R
Sbjct: 2193 EAQGKLHGMLKLDRLNLPYVR 2213


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  164 bits (415), Expect = 5e-38
 Identities = 108/324 (33%), Positives = 155/324 (47%), Gaps = 3/324 (0%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC              EE+L H+   N    +VW  FA   +  +    HIS  
Sbjct: 1645 LASKCVCC------------RSEESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHISQI 1692

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +  HI +L+P  + W++W ERN  +H +      R+I  +   +  L  
Sbjct: 1693 IWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYA 1752

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L +   WKG  D AT  G +F     +    I W KP     KLN+DGS KS+L AAG
Sbjct: 1753 GSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG 1812

Query: 545  IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715
             GG++R+H     +AFS   G +P  +  LH    AL   L     ++I +LWIE D+L+
Sbjct: 1813 -GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGLLLCKERNITNLWIEMDALV 1867

Query: 716  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895
               MV     G    RY L  I   L +  +RI+HI+REGN+ ADFL++ G +      +
Sbjct: 1868 AVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKGQTHQSLCVF 1927

Query: 896  TAISLPHSAKGIARLDQLEIPSFR 967
            +         GI +LD+L +P  R
Sbjct: 1928 S--EAQGELIGILKLDKLNLPYVR 1949


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  164 bits (415), Expect = 5e-38
 Identities = 107/324 (33%), Positives = 152/324 (46%), Gaps = 3/324 (0%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N    +VW  FA   +  + +  H+S  
Sbjct: 826  LASKCVCC------------NSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQI 873

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +K H  VLLP  + W++W ERN  +H +      R+I     H + L  
Sbjct: 874  IWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYD 933

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L Q   WKG  D A  LG  F     +    I W+KPS    KLN+DGS ++ L AA 
Sbjct: 934  GSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAA- 992

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGP---LHAESVALETALTYSYTQSIDHLWIETDSLI 715
             GG++R+H    I+ FS  I    GP   L AE  AL   L     + I+ LWIE D+L 
Sbjct: 993  TGGVLRDHTGKLIFGFSENI----GPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALA 1048

Query: 716  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKY 895
               ++     G +  RY L  I   L +  +R++H  REGNK AD+L++ G        +
Sbjct: 1049 AIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF 1108

Query: 896  TAISLPHSAKGIARLDQLEIPSFR 967
            T         G+ +LD+L +P  R
Sbjct: 1109 T--EAQGQLHGMLKLDRLNLPYVR 1130


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  162 bits (411), Expect = 1e-37
 Identities = 102/322 (31%), Positives = 158/322 (49%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N+   +VW  FA + +  + + +H+S  
Sbjct: 621  LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHI 668

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    ++ HI  LLP  + W++W ERN  +H  +     R++  +   ++ L  
Sbjct: 669  LWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHD 728

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L Q   WKG  D A       +   R+    + W+KPS+   KLN+DGS + H   A 
Sbjct: 729  GSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYKLNVDGSSR-HGQHAA 787

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GG++R+H    I+ FS  I      L AE  AL   L     + I+ LWIE D+L +  
Sbjct: 788  SGGVLRDHTGKLIFGFSENIGNCNS-LQAELRALLRGLLLCKERHIEQLWIEMDALAVIQ 846

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            ++ +   G    RY L  I   L++  +RI+HI REGN+VADFL++ G +      +T  
Sbjct: 847  LIPHSQKGSHDIRYLLESIRKCLNSISYRISHILREGNQVADFLSNEGHNHQNLRVFT-- 904

Query: 905  SLPHSAKGIARLDQLEIPSFRI 970
                   G+ +LD+L +P  R+
Sbjct: 905  EAQGKLHGMLKLDRLNLPYVRL 926


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  161 bits (408), Expect = 3e-37
 Identities = 100/318 (31%), Positives = 150/318 (47%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N    +VW  FA + +  + + +H+S  
Sbjct: 708  LASKCVCC------------NSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQI 755

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +K HI  L+P  + W++W ERN  +H +      R++  +   ++ L  
Sbjct: 756  IWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQD 815

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L +   WKG  D A   G       R     I W KP +   KLN+DGS + H  +A 
Sbjct: 816  GSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYKLNVDGSSR-HNQSAA 874

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GG++R+H    ++ FS  I  S   L AE  AL   L     ++I+ LWIE D+L++  
Sbjct: 875  TGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQ 933

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            M+     G    RY L  I   L    FRI+HI REGN+ ADFL++ G +          
Sbjct: 934  MIQQSKKGSHDIRYLLASIRKCLSFFSFRISHIFREGNQAADFLSNKGHT--HQNLQVIS 991

Query: 905  SLPHSAKGIARLDQLEIP 958
                   G+ +LD+L +P
Sbjct: 992  EAQGKLHGMLKLDRLNLP 1009


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  161 bits (407), Expect = 4e-37
 Identities = 105/322 (32%), Positives = 158/322 (49%), Gaps = 1/322 (0%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC            + EE+L H+   N+   +VW  F  + +  + + +H+S  
Sbjct: 573  LASKCVCC------------NSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQI 620

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    +K HI  LLP  + W++W ERN  +H +   +  R++  +   ++ L  
Sbjct: 621  LWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRHTRLNPDRVVWRIMKLLRQLLD 680

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKS-HLGAA 541
              L     WKG  D A+  G  F+   R+    I W+KP +   KLN+DGS ++ HL A+
Sbjct: 681  GSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNVDGSSRNGHLAAS 740

Query: 542  GIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILC 721
              GGI+R+H    I+ FS  I      L AE  AL   L     + I++LWIE D+L + 
Sbjct: 741  --GGILRDHTGKLIFGFSENIGLCNS-LQAELRALLRGLLLCKERHIENLWIEMDALAVI 797

Query: 722  NMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTA 901
             ++ +   G    RY L  I   L    +RI+HI REGN+ AD+LA+ G S       T 
Sbjct: 798  QLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIFREGNQAADYLANEGHSHQNLCVIT- 856

Query: 902  ISLPHSAKGIARLDQLEIPSFR 967
                    G+ +LD+L +P  R
Sbjct: 857  -EAQGELHGMLKLDRLNLPYVR 877


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  158 bits (400), Expect = 3e-36
 Identities = 102/321 (31%), Positives = 156/321 (48%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKC+CC            + EE+L H+   N    +VW  FA   +  +   +++S  
Sbjct: 1648 LASKCICC------------NSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQI 1695

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    +K HI +L+P  + W++W ERN  +H +      R++  +   ++ L  
Sbjct: 1696 LWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRHLGMYSDRVVWKIMKLLRQLQD 1755

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L +   WKG  D AT  GL     +R+    + W KP     KLN+DGS + +  AA 
Sbjct: 1756 GYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHKLNVDGSSRQNQTAA- 1814

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
            IGG++R+H    ++ FS  I  S   L AE  AL   L     ++I+ LW+E D+L+   
Sbjct: 1815 IGGVLRDHTGTLVFDFSENIGPSNS-LQAELRALLRGLLLCKERNIEKLWVEMDALVAIQ 1873

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            M+     G    RY L  I   L+   FRI+HI REGN+ ADFL++ G +      +T  
Sbjct: 1874 MIQQSQKGSHDIRYLLASIRKYLNFFSFRISHIFREGNQAADFLSNKGHTHQSLHVFT-- 1931

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                   G+ +LD+L +P  R
Sbjct: 1932 EAQGKLYGMLKLDRLNLPYVR 1952


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  157 bits (397), Expect = 6e-36
 Identities = 101/321 (31%), Positives = 151/321 (47%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LAS+C CC              EE++ H+   N   M+VW +FA   +  + +   I+  
Sbjct: 1945 LASRCRCC------------KSEESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQI 1992

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +  HI  L+P  +LW++W ERN  +H N      R++  V   IQ LS 
Sbjct: 1993 IGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 2052

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
             +      WKG    A   G+ F+  S +      W KPS    KLN+DGS K    AAG
Sbjct: 2053 GQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG 2112

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GGI+R+H  + ++ FS  +  ++  L AE +AL   L      +I  LWIE D++ +  
Sbjct: 2113 -GGILRDHAGEMVFGFSENL-GTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIR 2170

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            ++     G  + RY +V +   L +  FR +HI REGN+ ADFLA+ G        +T  
Sbjct: 2171 LLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVA 2230

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                  +G+  LDQ   P  R
Sbjct: 2231 Q--GKLRGMLCLDQTSFPYVR 2249


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  155 bits (393), Expect = 2e-35
 Identities = 100/321 (31%), Positives = 151/321 (47%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LAS+C CC              EE++ H+   N   M+VW +FA   +  + +   I+  
Sbjct: 604  LASRCRCC------------KSEESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQI 651

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W ++    +  HI  L+P  +LW++W ERN  +H N      R++  V   IQ LS 
Sbjct: 652  IGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSL 711

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
             +      WKG    A   G+  +  S +      W KP++   KLN+DGS K    AAG
Sbjct: 712  GQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG 771

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GGI+R+H    ++ FS  +   +  L AE +AL   L      +I  LWIE D++ +  
Sbjct: 772  -GGILRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIR 829

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            ++     G  + RY +V +   L +  FR +HI REGN+ ADFLA+ G        +T  
Sbjct: 830  LLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVA 889

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                  +G+ RLDQ   P  R
Sbjct: 890  Q--GKLRGMLRLDQTSFPYVR 908


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  155 bits (393), Expect = 2e-35
 Identities = 99/291 (34%), Positives = 140/291 (48%), Gaps = 3/291 (1%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKCVCC              EE+L H+   N    +VW  FA   +  +   +HIS  
Sbjct: 1402 LASKCVCC------------RSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQI 1449

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +    +  HI +L+P  + W++W ERN  +H +      R+I  +   +  L  
Sbjct: 1450 IWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHA 1509

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L +   WKG  D AT  G ++          I W KP     KLN+DGS KS   AAG
Sbjct: 1510 GSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG 1569

Query: 545  IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715
             GG++R+H     +AFS   G +P  +  LH    AL   L     ++I +LWIE D+L+
Sbjct: 1570 -GGVLRDHTGKLAFAFSENLGPLPSLQAELH----ALLRGLLLCKERNITNLWIEMDALV 1624

Query: 716  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868
               MV     G    RY L  I   L +  +RI+HI+REGN+ ADFL++ G
Sbjct: 1625 AVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIYREGNQAADFLSNKG 1675



 Score =  154 bits (390), Expect = 4e-35
 Identities = 99/321 (30%), Positives = 148/321 (46%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LAS+C CC              EE+L H+   N    +VW +FA   +  + +   I+  
Sbjct: 3196 LASRCRCC------------KSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHI 3243

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +S W  +   ++  HI  L+P  +LW++W ERN  +H N      RI+  +   I  L +
Sbjct: 3244 ISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQ 3303

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
             K  Q   W+G    A   G+  +  + S    + W KPS    KLN+DGS K +L  A 
Sbjct: 3304 GKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAA 3363

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GG++R+H    I+ FS     S+  L AE +AL   L      ++  LWIE D+ +   
Sbjct: 3364 GGGLLRDHTGSMIFGFSENF-GSQDSLQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQ 3422

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            M+     G    RY L  I   L    FRI+HI REGN+ AD L++ G++          
Sbjct: 3423 MINEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGYT--HQNLQVIS 3480

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                  +GI RLD++ +   R
Sbjct: 3481 QAEGQLRGILRLDKINLAYVR 3501


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  155 bits (392), Expect = 2e-35
 Identities = 91/291 (31%), Positives = 142/291 (48%), Gaps = 3/291 (1%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKC+CC              EE+L H+   +    +VW +F+ + +  + + ++I   
Sbjct: 1029 LASKCLCC------------KSEESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQI 1076

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L+ W  +    +  HI  L+   + W++W ERN  +H +      RII  +   ++ L +
Sbjct: 1077 LNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQ 1136

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              L     WKG +D A   G  F +  ++R   I W KP    +KLN+DGS K     A 
Sbjct: 1137 GGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAA 1196

Query: 545  IGGIIRNHE*DTIWAFS---GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLI 715
             GG++R+H  + I+ FS   GY    +  L AE +AL   L      ++  +WIE D+ +
Sbjct: 1197 GGGVLRDHTGNLIFGFSENFGY----QNSLQAELLALHRGLCLCMEYNVSRVWIEVDAQV 1252

Query: 716  LCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868
            +  M+ N   G +  +Y L  I   L     RI+HIHREGN+ ADFL+  G
Sbjct: 1253 VIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQAADFLSKHG 1303


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  152 bits (385), Expect = 1e-34
 Identities = 102/321 (31%), Positives = 148/321 (46%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LAS+C CC              EE++ H+   N    +VW +F+ + +  + +   I+  
Sbjct: 1943 LASRCRCC------------KSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQI 1990

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    +  HI  L+P   LW++W ERN  +H N      RI+  +   IQ LS 
Sbjct: 1991 LGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSL 2050

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
             +      WKG    A   G+ F+  S        W KPS    KLN+DGS K    AAG
Sbjct: 2051 GQQLLKWQWKGDKQIAQEWGITFQAESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG 2110

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
             GG++R+H    ++ FS  +   +  L AE +AL   L      +I  LWIE D+  +  
Sbjct: 2111 -GGVLRDHAGVMVFGFSENL-GIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAASVIR 2168

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAI 904
            ++     G  + RY LV I   L +  FR++HI REGN+ ADFLA+ G         T  
Sbjct: 2169 LLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQAADFLANRGHEHQSLQVVTVA 2228

Query: 905  SLPHSAKGIARLDQLEIPSFR 967
                  +G+ RLDQ  +P  R
Sbjct: 2229 Q--GKLRGMLRLDQTSLPYVR 2247


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  149 bits (375), Expect = 2e-33
 Identities = 93/288 (32%), Positives = 140/288 (48%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LASKC CC            + EETL H+   N    +VW  FA + +  + + +++S  
Sbjct: 1228 LASKCACC------------NSEETLIHVLWDNPVAKQVWNFFANFFQIYVSNPQNVSQI 1275

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            L  W  +    +K HI  L+P  + W++W ERN  +  +      R++  +   ++ L  
Sbjct: 1276 LWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMKLLRQLQD 1335

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
              + +   WKG +D A   G  F    ++      W K  S   KLN+DGS + +  AA 
Sbjct: 1336 GYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA- 1394

Query: 545  IGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCN 724
            IGG++R+H    ++ FS  I  S   L AE  AL   L     ++I+ LWIE D+L+   
Sbjct: 1395 IGGLLRDHTGTLVFGFSENIGPSNS-LQAELRALLRGLLLCKERNIEKLWIEMDALVAIQ 1453

Query: 725  MVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLG 868
            M+     G    +Y L  I   L    FRI+HI REGN+VADFL++ G
Sbjct: 1454 MIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFREGNQVADFLSNKG 1501



 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 56/185 (30%), Positives = 87/185 (47%), Gaps = 3/185 (1%)
 Frame = +2

Query: 422  GLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGY 601
            GL++ + S      I W +P     KLN+DG  K     A  GG+ R+H    I+ FS  
Sbjct: 1522 GLRYEQDSHGHPKIIYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFS-- 1579

Query: 602  IPRSEGPLH---AESVALETALTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSL 772
               + GP +   AE +AL   L      +I  +WIE D+  +  M+     G+   +Y L
Sbjct: 1580 --ENFGPYNSTQAELMALHRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLL 1637

Query: 773  VQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAISLPHSAKGIARLDQLE 952
              I   L    +RI+HIHRE N+ AD+L++ G +      ++        +G+ RLD+  
Sbjct: 1638 SFICQCLSGISYRISHIHRESNQAADYLSNQGHTHQSLQVFS--KAEGELRGMIRLDKSN 1695

Query: 953  IPSFR 967
            +P  R
Sbjct: 1696 LPYVR 1700


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  144 bits (364), Expect = 4e-32
 Identities = 95/322 (29%), Positives = 147/322 (45%), Gaps = 1/322 (0%)
 Frame = +2

Query: 5    LASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISIF 184
            LAS+C CC              EE+L H+   N    +VW +FA   +  + +   I+  
Sbjct: 1908 LASRCRCC------------KSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQI 1955

Query: 185  LSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSK 364
            +  W  +   ++  HI  L+P   LW++W ERN  +H N      R++  +   +  L +
Sbjct: 1956 ICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRNLGMYPNRVVWKILKLLHQLFQ 2015

Query: 365  AKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAG 544
             K  Q   W+G    A   G+  +  + S    + W KPS   +KLN+DGS K +  +A 
Sbjct: 2016 GKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELKLNVDGSCKHNPQSAA 2075

Query: 545  IGGIIRNHE*DTIWAFS-GYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILC 721
             GG++R+H    I+ FS  + P+    L AE +AL   L      +I  LWIE D+ +  
Sbjct: 2076 GGGLLRDHTGSMIFGFSENFGPQDS--LQAELMALHRGLLLCIEHNISRLWIEMDAKVAV 2133

Query: 722  NMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTA 901
             M+     G    RY L  I   L    FRI+HI REGN+ AD L++ G +         
Sbjct: 2134 QMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQAADHLSNQGHT--HQNLQVI 2191

Query: 902  ISLPHSAKGIARLDQLEIPSFR 967
                   +GI RL+++ +   R
Sbjct: 2192 SQAEGQLRGILRLEKINLAYVR 2213


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  128 bits (321), Expect = 4e-27
 Identities = 86/289 (29%), Positives = 141/289 (48%), Gaps = 5/289 (1%)
 Frame = +2

Query: 74   ETLSHLFLHNTQVMKVWM-----HFAAWVRCTLPHTEHISIFLSFWKNTTPLAQKNHITV 238
            ET+     HNT  +   +     H  +  +C   ++E  S+    W N+  +A++  I  
Sbjct: 836  ETIRQWQSHNTLALSFGIEEKGIHLVS--KCVCCNSEE-SLMHVLWGNS--VAKQGRIRT 890

Query: 239  LLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLLSKAKLFQVHNWKGCIDTATR 418
            LLP  + W++W ERN  +H ++     R++  +   ++ L    L Q   WKG  D A  
Sbjct: 891  LLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDIAAM 950

Query: 419  LGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGAAGIGGIIRNHE*DTIWAFSG 598
                F+   R+    + W+KP +   KLN+DGS ++   AA  GG++R+H    I+ FS 
Sbjct: 951  WRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGSSRNGQHAAS-GGVLRDHTSKLIFCFSE 1009

Query: 599  YIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLILCNMVANKFPGHWSCRYSLVQ 778
             I  +   L AE  AL   L     + I+ LWIE D+L +  ++ +   G    RY L  
Sbjct: 1010 NI-GTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLES 1068

Query: 779  IANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYTAISLPHSAK 925
            I   L++  +RI+HI REGN+ ADFL++ G +      +T    P +++
Sbjct: 1069 IKKCLNSISYRISHIFREGNQAADFLSNEGHNHQNLRVFTKAQGPPNSE 1117


>ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            tuberosum]
          Length = 885

 Score =  127 bits (318), Expect = 9e-27
 Identities = 87/325 (26%), Positives = 159/325 (48%), Gaps = 2/325 (0%)
 Frame = +2

Query: 2    SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTLPHTEHISI 181
            ++ S+C CC              EET++HLF       K+W +FA +    +       +
Sbjct: 558  NIVSRCWCC----------DRKKEETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQL 607

Query: 182  FLSFWKN-TTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358
             +S+WK+  TP  Q   I   +P +++W +WK RN  +H++ S S  R+++ V   ++ +
Sbjct: 608  IISWWKHEATPKLQG--IYKAIPAIIMWTLWKRRNALKHDS-SISWERMVEMVIEVVRKM 664

Query: 359  SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538
             K++   + N +       +   Q++R  +     + W+ P   ++K N DG+ + + G 
Sbjct: 665  VKSQFPWIKNMRWTWQAIIQRLNQYKR--KIHVLRVTWKPPDDHYVKSNTDGACRGNPGL 722

Query: 539  AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718
            +  G  IR+ + D I+A +  I  +   + AE+VA+ TAL     + +  + IETDSL L
Sbjct: 723  SSFGFCIRDDKGDLIYAKAKGIGIATN-MEAETVAILTALRECSNRKMQKVIIETDSLSL 781

Query: 719  CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFSTLGSTKYT 898
              ++   +   W     + +I   +     +ITHI REGN +AD LA++   +    +Y+
Sbjct: 782  KKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFREGNSLADSLANIAIESQAEHQYS 841

Query: 899  AI-SLPHSAKGIARLDQLEIPSFRI 970
                LP   + I  +D+ +IP+ RI
Sbjct: 842  CFQELPLKERRILNIDKAQIPTLRI 866


>ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 364

 Score =  126 bits (317), Expect = 1e-26
 Identities = 83/292 (28%), Positives = 137/292 (46%), Gaps = 1/292 (0%)
 Frame = +2

Query: 2   SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRC-TLPHTEHIS 178
           +LAS+CV C               E+L H+FL  +    +W + A       LP      
Sbjct: 70  ALASRCVLC-----------GRDGESLPHIFLTCSFAASLWNNRAGLFELGCLPQN---L 115

Query: 179 IFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358
           + L ++       Q   I ++     LW+IWK RN  RH+N +     + + +  H++  
Sbjct: 116 VDLLYYGGVGRSHQLKEIWLICYTTTLWFIWKARNKMRHDNCTIVVDAVRQLIMGHVKTA 175

Query: 359 SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538
           SK  L  + N    +    + GL  R     R T + W  P   WIK+N DG+++   G 
Sbjct: 176 SKLALGCMSNSLTELRVLKKFGLLCRPHRAPRITEVNWHPPLFGWIKVNTDGAWQKTTGK 235

Query: 539 AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718
           +G GGI R+     + AF+  +      + AE +A+  A+  ++ +  +H+W+E DS+I+
Sbjct: 236 SGYGGIFRDFHGSFLGAFASNL-EILNSVDAEVMAVIQAIELAWVRDWEHIWLEVDSIIV 294

Query: 719 CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFS 874
            N + +     W  R       +R+   +FR +HI REGN+VAD LA++G S
Sbjct: 295 LNFLQDPHLVPWRLRVGWGNFLHRISQMNFRSSHIFREGNQVADALANMGLS 346


>ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
            vesca subsp. vesca]
          Length = 872

 Score =  124 bits (312), Expect = 4e-26
 Identities = 81/292 (27%), Positives = 133/292 (45%), Gaps = 1/292 (0%)
 Frame = +2

Query: 2    SLASKCVCCLPPSLVHYTDSSSHEETLSHLFLHNTQVMKVWMHFAAWVRCTL-PHTEHIS 178
            +L S+C  C            +  E+L H+FLH +    VW HF       L P+T    
Sbjct: 564  ALVSRCEFC-----------GNSTESLDHIFLHCSFAASVWNHFIYIFEIGLVPNTIAEV 612

Query: 179  IFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRIIKTVENHIQLL 358
              L    + +P  Q   + ++    +LWYIW  RN  R ++ +FS   + + V  HIQ  
Sbjct: 613  FSLGLAMDRSP--QLKELWLICFTSILWYIWHARNQIRFDSRTFSVAGVCRLVSRHIQAS 670

Query: 359  SKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLNIDGSYKSHLGA 538
            S+     +HN    +      G   R     R   ++W  PS  WIK+N DG++K   G 
Sbjct: 671  SRLATGHMHNTIHDLCILKSFGACCRSRRIPRMVEVIWHPPSIGWIKINSDGAWKHEEGI 730

Query: 539  AGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSIDHLWIETDSLIL 718
             G G + R ++   + AF+ +I      + A+ + + TA+  ++ +   H+W+E D   +
Sbjct: 731  GGFGAVFRYYKGQFVGAFASHIDIPSS-IAAKVMVVITAIELAWVRDWKHVWLEVDFSTV 789

Query: 719  CNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASLGFS 874
             + + +     W  R   +    R+    F+ +HI REGN+VAD LA+ G S
Sbjct: 790  LDYIRSPSLVPWQLRVRWLNCLYRISTMTFKSSHIFREGNRVADALANHGTS 841


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  124 bits (311), Expect = 6e-26
 Identities = 92/274 (33%), Positives = 122/274 (44%)
 Frame = +2

Query: 146  RCTLPHTEHISIFLSFWKNTTPLAQKNHITVLLPCLVLWYIWKERNHCRHENASFSHIRI 325
            RC    +E  SI    W N   + Q  HI  L+P   LW++W ERN  +H N        
Sbjct: 2118 RCRCCRSEE-SIIHVMWDNPVAV-QPGHIRTLIPIFTLWFLWVERNDAKHRNLGQ----- 2170

Query: 326  IKTVENHIQLLSKAKLFQVHNWKGCIDTATRLGLQFRRASRSRNTPILWQKPSSPWIKLN 505
                    QLL          WKG    A   G+ F+  S        W KPS+   KLN
Sbjct: 2171 --------QLLE-------WQWKGDKQIAQEWGITFQAKSLPPPKVFCWHKPSNGEFKLN 2215

Query: 506  IDGSYKSHLGAAGIGGIIRNHE*DTIWAFSGYIPRSEGPLHAESVALETALTYSYTQSID 685
            +DGS K    AAG GG++R+H    I+ FS  +   +  L AE +AL   L      +I 
Sbjct: 2216 VDGSAKLSQNAAG-GGVLRDHAGVMIFGFSENLG-IQNSLKAELLALYRGLILCRDYNIR 2273

Query: 686  HLWIETDSLILCNMVANKFPGHWSCRYSLVQIANRLHNKHFRITHIHREGNKVADFLASL 865
             LWIE D+  +  ++     G  + RY L  I   L +  FR+THI REGN+ ADFLA+ 
Sbjct: 2274 RLWIEMDATSVIRLLQGNHRGPHAIRYLLGSIRQLLSHFSFRLTHIFREGNQAADFLANR 2333

Query: 866  GFSTLGSTKYTAISLPHSAKGIARLDQLEIPSFR 967
            G         T        +G+ RLDQ  +P  R
Sbjct: 2334 GHEHQSLQVITVAQ--GKLRGMLRLDQTSLPYVR 2365

