BLASTX nr result

ID: Rehmannia32_contig00004292 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00004292
         (1968 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020550230.1| uncharacterized protein LOC105163879 isoform...   717   0.0  
ref|XP_011080690.1| uncharacterized protein LOC105163879 isoform...   717   0.0  
gb|PIN04698.1| Glucan 1,3-beta-glucosidase [Handroanthus impetig...   709   0.0  
ref|XP_012837600.1| PREDICTED: lysosomal beta glucosidase-like [...   699   0.0  
ref|XP_022847074.1| uncharacterized protein LOC111369698 [Olea e...   677   0.0  
ref|XP_022893219.1| uncharacterized protein LOC111407782 isoform...   670   0.0  
ref|XP_022893218.1| uncharacterized protein LOC111407782 isoform...   670   0.0  
emb|CDP09157.1| unnamed protein product [Coffea canephora]            659   0.0  
gb|EOY33800.1| Glycosyl hydrolase family protein [Theobroma cacao]    639   0.0  
ref|XP_007016181.2| PREDICTED: beta-glucosidase BoGH3B [Theobrom...   639   0.0  
gb|EOY33795.1| Glycosyl hydrolase family protein isoform 2 [Theo...   632   0.0  
gb|ARU79083.1| beta-glucosidase 5 GH3 family [Camellia sinensis]      634   0.0  
ref|XP_017983610.1| PREDICTED: beta-glucosidase BoGH3B [Theobrom...   632   0.0  
ref|XP_012445665.1| PREDICTED: lysosomal beta glucosidase-like [...   632   0.0  
ref|XP_002313393.1| glycosyl hydrolase family 3 family protein [...   629   0.0  
gb|PNT19511.1| hypothetical protein POPTR_009G042800v3 [Populus ...   629   0.0  
gb|EOY33796.1| Glycosyl hydrolase family protein isoform 3 [Theo...   627   0.0  
ref|XP_017648269.1| PREDICTED: beta-glucosidase BoGH3B-like [Gos...   627   0.0  
ref|XP_021277936.1| uncharacterized protein LOC110411900 [Herran...   626   0.0  
ref|XP_019261677.1| PREDICTED: uncharacterized protein LOC109239...   625   0.0  

>ref|XP_020550230.1| uncharacterized protein LOC105163879 isoform X2 [Sesamum indicum]
          Length = 494

 Score =  717 bits (1851), Expect = 0.0
 Identities = 354/444 (79%), Positives = 383/444 (86%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHP GYPFLAGRKNVLASAKHFVGD   
Sbjct: 50   RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPIGYPFLAGRKNVLASAKHFVGDGGT 109

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDCL+QGVCTVMASYSSWNG+++HT+ FLLTEVLKN
Sbjct: 110  ENGTNEGNTIASYDDLERIHMAPYLDCLAQGVCTVMASYSSWNGKRMHTNDFLLTEVLKN 169

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRLY PHGSNYR+SILSTVNAGIDMVMVPFR+           E+
Sbjct: 170  KLGFKGFVISDWEALDRLYVPHGSNYRESILSTVNAGIDMVMVPFRFELFLDEFLSLVES 229

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEISMARIDDAVERILRVKF+AGVFE PL+DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 230  GEISMARIDDAVERILRVKFIAGVFEDPLSDRSLLDLVGCKAHRELAREAVRKSLVLLKN 289

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPKRP LPLD KAKRILVAGTHADNLGYQC                   IL+AIKEV+
Sbjct: 290  GKDPKRPLLPLDKKAKRILVAGTHADNLGYQCGGWTITWEGTTGRITEGTTILEAIKEVM 349

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
            D+KTEV+YELNP+PET S +DF+FA+VAVGEGPYVE+GGDDP LKIPFNGSEL SLVADR
Sbjct: 350  DDKTEVIYELNPTPETFSGQDFSFAIVAVGEGPYVETGGDDPELKIPFNGSELASLVADR 409

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTL+ILV+GRPL+LEP LLEK+D LVVAWLPGTEG+GITDVIFGDY F GRLPMTWFRS
Sbjct: 410  VPTLMILVTGRPLILEPSLLEKLDGLVVAWLPGTEGKGITDVIFGDYAFHGRLPMTWFRS 469

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP++ GENSSDPLFP+GFGLT
Sbjct: 470  VDQLPVHAGENSSDPLFPYGFGLT 493


>ref|XP_011080690.1| uncharacterized protein LOC105163879 isoform X1 [Sesamum indicum]
          Length = 599

 Score =  717 bits (1851), Expect = 0.0
 Identities = 354/444 (79%), Positives = 383/444 (86%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHP GYPFLAGRKNVLASAKHFVGD   
Sbjct: 155  RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPIGYPFLAGRKNVLASAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDCL+QGVCTVMASYSSWNG+++HT+ FLLTEVLKN
Sbjct: 215  ENGTNEGNTIASYDDLERIHMAPYLDCLAQGVCTVMASYSSWNGKRMHTNDFLLTEVLKN 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRLY PHGSNYR+SILSTVNAGIDMVMVPFR+           E+
Sbjct: 275  KLGFKGFVISDWEALDRLYVPHGSNYRESILSTVNAGIDMVMVPFRFELFLDEFLSLVES 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEISMARIDDAVERILRVKF+AGVFE PL+DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 335  GEISMARIDDAVERILRVKFIAGVFEDPLSDRSLLDLVGCKAHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPKRP LPLD KAKRILVAGTHADNLGYQC                   IL+AIKEV+
Sbjct: 395  GKDPKRPLLPLDKKAKRILVAGTHADNLGYQCGGWTITWEGTTGRITEGTTILEAIKEVM 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
            D+KTEV+YELNP+PET S +DF+FA+VAVGEGPYVE+GGDDP LKIPFNGSEL SLVADR
Sbjct: 455  DDKTEVIYELNPTPETFSGQDFSFAIVAVGEGPYVETGGDDPELKIPFNGSELASLVADR 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTL+ILV+GRPL+LEP LLEK+D LVVAWLPGTEG+GITDVIFGDY F GRLPMTWFRS
Sbjct: 515  VPTLMILVTGRPLILEPSLLEKLDGLVVAWLPGTEGKGITDVIFGDYAFHGRLPMTWFRS 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP++ GENSSDPLFP+GFGLT
Sbjct: 575  VDQLPVHAGENSSDPLFPYGFGLT 598


>gb|PIN04698.1| Glucan 1,3-beta-glucosidase [Handroanthus impetiginosus]
          Length = 599

 Score =  709 bits (1829), Expect = 0.0
 Identities = 355/444 (79%), Positives = 379/444 (85%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGD   
Sbjct: 155  RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      TSYDDLER HMAPYLDCLSQGVCTVMASYSSWNGRKLHT++FL+TEVLKN
Sbjct: 215  ENGINEGNTITSYDDLERIHMAPYLDCLSQGVCTVMASYSSWNGRKLHTNHFLITEVLKN 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGF GFVISDWEALDRL+APHGSNYRQSIL +VNAGIDMVMVPFRY           E+
Sbjct: 275  KLGFMGFVISDWEALDRLFAPHGSNYRQSILLSVNAGIDMVMVPFRYELFLEEFLALVES 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M+RIDDAVERILRVKFV+GVFEYPLTDRSLLDVVGCK HRELAREAVRKSLVLLKN
Sbjct: 335  GEIPMSRIDDAVERILRVKFVSGVFEYPLTDRSLLDVVGCKAHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+ KR  LPLD KAKRILVAGTHADNLGYQC                   ILDAIKEVV
Sbjct: 395  GKEQKRTLLPLDKKAKRILVAGTHADNLGYQCGGWTITWEGTTGRITEGTTILDAIKEVV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
            D +TEVVYELNP+PET S +DF++A+VAVGE PYVESGGDDP LKIPFNG+ELV +VADR
Sbjct: 455  DNETEVVYELNPTPETFSGQDFSYAIVAVGEAPYVESGGDDPELKIPFNGTELVKIVADR 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTL ILV+GRPLVLEP LLEKI+ALVVAWLPGTEGRGITDVIFGDY F GRLPMTWFR+
Sbjct: 515  VPTLAILVTGRPLVLEPSLLEKIEALVVAWLPGTEGRGITDVIFGDYAFHGRLPMTWFRT 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP++   NS+DPLFPFGFGLT
Sbjct: 575  VDQLPVHAEGNSTDPLFPFGFGLT 598


>ref|XP_012837600.1| PREDICTED: lysosomal beta glucosidase-like [Erythranthe guttata]
 gb|EYU45973.1| hypothetical protein MIMGU_mgv1a003210mg [Erythranthe guttata]
          Length = 600

 Score =  699 bits (1805), Expect = 0.0
 Identities = 345/444 (77%), Positives = 380/444 (85%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESY E+TEIVRKMTS+VTGLQGQPPEGH KGYPF+AGR NVLASAKHFVGD   
Sbjct: 156  RWGRCYESYGEDTEIVRKMTSIVTGLQGQPPEGHLKGYPFVAGRNNVLASAKHFVGDGGT 215

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      TSYDDLER H+APYLDC+SQGVCTVMASYSSWNG+KLHTD+FLLTE+LK 
Sbjct: 216  ENGTNEGNTITSYDDLERIHLAPYLDCISQGVCTVMASYSSWNGKKLHTDHFLLTELLKK 275

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGF GFVISDWEALDRLY+PHGSNYR+SILSTVNAGIDMVMVPFRY           E+
Sbjct: 276  KLGFMGFVISDWEALDRLYSPHGSNYRESILSTVNAGIDMVMVPFRYELFLEEFLSLAES 335

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEISMARIDDAVERILRVKFV+GVFE+P+ DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 336  GEISMARIDDAVERILRVKFVSGVFEHPMADRSLLDLVGCKAHRELAREAVRKSLVLLKN 395

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+P LPL+ KAKRILVAGTHADNLGYQC                   +L+AIKE+V
Sbjct: 396  GKDPKKPLLPLNKKAKRILVAGTHADNLGYQCGGWTISWEGTSGKITEGTTMLEAIKEMV 455

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
            D  TEVVYE NPSPET S E+F+FA+VAVGEGPYVESGGDDP LKIPFNG+EL SLVAD+
Sbjct: 456  DHNTEVVYEQNPSPETFSGEEFSFAIVAVGEGPYVESGGDDPELKIPFNGAELASLVADK 515

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTLVIL++GRPLV+EP LLEKI+ALVVAWLPG+EGRGITDVIFGDY F+G+LPMTWFRS
Sbjct: 516  VPTLVILITGRPLVVEPSLLEKIEALVVAWLPGSEGRGITDVIFGDYPFQGKLPMTWFRS 575

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP++ GENS DPLFPFGFGLT
Sbjct: 576  VDQLPVHSGENSLDPLFPFGFGLT 599


>ref|XP_022847074.1| uncharacterized protein LOC111369698 [Olea europaea var. sylvestris]
          Length = 607

 Score =  677 bits (1748), Expect = 0.0
 Identities = 336/445 (75%), Positives = 371/445 (83%), Gaps = 1/445 (0%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+TE+VRKMT+LVTGLQGQPPEGHP+GYPFL GRK V+A AKHFVGD   
Sbjct: 155  RWGRCYESYSEDTEVVRKMTTLVTGLQGQPPEGHPQGYPFLGGRKKVIACAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       +YDDLE  HMAPYLDC+SQGVCTVMASYSSWNG KLH+D FL+TE+LK+
Sbjct: 215  DNGTNEGNTIITYDDLEGIHMAPYLDCISQGVCTVMASYSSWNGSKLHSDRFLITEILKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRLY PHGSNYRQSILST+NAGIDMVMVPFR+           E+
Sbjct: 275  KLGFKGFVISDWEALDRLYVPHGSNYRQSILSTINAGIDMVMVPFRFELFLEEFLSLAES 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI +ARIDDAVERILRVKFVAG+FEYP  DRSLLDVVGCKPHRELAREAVRKSLVLLKN
Sbjct: 335  GEIPLARIDDAVERILRVKFVAGLFEYPRGDRSLLDVVGCKPHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKD K PFLPLD  AKRILVAGTHAD+LGY C                   ILDAIKEVV
Sbjct: 395  GKDQKIPFLPLDKNAKRILVAGTHADDLGYLCGGWTATWEGTSGRITDGTTILDAIKEVV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
              +TEV YE NPS ET S +DF++AV+AVGE PYVE+GGDDP LKIPFNG+ELVSLVADR
Sbjct: 455  GSETEVTYEQNPSQETFSGQDFSYAVIAVGEAPYVETGGDDPELKIPFNGAELVSLVADR 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTLVIL++GRPLVLEPWLLEKIDALVVAWLPG+E +GITDVIFGDY F GRLPMTWF+S
Sbjct: 515  VPTLVILITGRPLVLEPWLLEKIDALVVAWLPGSECQGITDVIFGDYTFHGRLPMTWFKS 574

Query: 706  VDQLPINVG-ENSSDPLFPFGFGLT 635
            VDQLP+++G ++S DPLFPFGFGLT
Sbjct: 575  VDQLPLHIGKQDSYDPLFPFGFGLT 599


>ref|XP_022893219.1| uncharacterized protein LOC111407782 isoform X2 [Olea europaea var.
            sylvestris]
          Length = 540

 Score =  670 bits (1729), Expect = 0.0
 Identities = 332/446 (74%), Positives = 369/446 (82%), Gaps = 1/446 (0%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+TE+VRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNV+A AKHFVGD   
Sbjct: 83   RWGRCYESYSEDTEVVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVIACAKHFVGDGGT 142

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDC+SQGVCT+MASYSSWNG KLH  +FLLTE+LK+
Sbjct: 143  DYGTNEGDTIISYDDLERIHMAPYLDCISQGVCTIMASYSSWNGIKLHAHHFLLTEILKD 202

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRL  P GSNYRQ I+S VNAGIDMVMVPFR+           E+
Sbjct: 203  KLGFKGFVISDWEALDRLCVPRGSNYRQCIMSAVNAGIDMVMVPFRFELFLGEFLSLVES 262

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M+RIDDAVERILRVKFVAG+FE+PL+DRSLLDVV CKPHRELAR AVRKSLVLLKN
Sbjct: 263  GEIPMSRIDDAVERILRVKFVAGIFEHPLSDRSLLDVVRCKPHRELARAAVRKSLVLLKN 322

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKD K PFLPLD   KRILVAGTHAD+LGYQC                   ILDAIKEVV
Sbjct: 323  GKDQKIPFLPLDKNTKRILVAGTHADDLGYQCGGWTATWEGKSGRITDGTTILDAIKEVV 382

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
              +TEV YEL PSPET + ++F++AVVAVGE PYV++GGDDP LKIP NG+ELVS VAD+
Sbjct: 383  GSETEVTYELIPSPETFAGQNFSYAVVAVGEAPYVQTGGDDPELKIPLNGAELVSSVADQ 442

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTLVIL++GRPLVLEPWLLEKIDALVVAWLPG+EG+GITDVIFGDYGF GRLP TWF+S
Sbjct: 443  VPTLVILITGRPLVLEPWLLEKIDALVVAWLPGSEGQGITDVIFGDYGFHGRLPATWFKS 502

Query: 706  VDQLPINVG-ENSSDPLFPFGFGLTY 632
            VDQLP+++G ++S DPLFPFGFGL Y
Sbjct: 503  VDQLPLHIGNQDSYDPLFPFGFGLNY 528


>ref|XP_022893218.1| uncharacterized protein LOC111407782 isoform X1 [Olea europaea var.
            sylvestris]
          Length = 612

 Score =  670 bits (1729), Expect = 0.0
 Identities = 332/446 (74%), Positives = 369/446 (82%), Gaps = 1/446 (0%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+TE+VRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNV+A AKHFVGD   
Sbjct: 155  RWGRCYESYSEDTEVVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVIACAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDC+SQGVCT+MASYSSWNG KLH  +FLLTE+LK+
Sbjct: 215  DYGTNEGDTIISYDDLERIHMAPYLDCISQGVCTIMASYSSWNGIKLHAHHFLLTEILKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRL  P GSNYRQ I+S VNAGIDMVMVPFR+           E+
Sbjct: 275  KLGFKGFVISDWEALDRLCVPRGSNYRQCIMSAVNAGIDMVMVPFRFELFLGEFLSLVES 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M+RIDDAVERILRVKFVAG+FE+PL+DRSLLDVV CKPHRELAR AVRKSLVLLKN
Sbjct: 335  GEIPMSRIDDAVERILRVKFVAGIFEHPLSDRSLLDVVRCKPHRELARAAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKD K PFLPLD   KRILVAGTHAD+LGYQC                   ILDAIKEVV
Sbjct: 395  GKDQKIPFLPLDKNTKRILVAGTHADDLGYQCGGWTATWEGKSGRITDGTTILDAIKEVV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
              +TEV YEL PSPET + ++F++AVVAVGE PYV++GGDDP LKIP NG+ELVS VAD+
Sbjct: 455  GSETEVTYELIPSPETFAGQNFSYAVVAVGEAPYVQTGGDDPELKIPLNGAELVSSVADQ 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTLVIL++GRPLVLEPWLLEKIDALVVAWLPG+EG+GITDVIFGDYGF GRLP TWF+S
Sbjct: 515  VPTLVILITGRPLVLEPWLLEKIDALVVAWLPGSEGQGITDVIFGDYGFHGRLPATWFKS 574

Query: 706  VDQLPINVG-ENSSDPLFPFGFGLTY 632
            VDQLP+++G ++S DPLFPFGFGL Y
Sbjct: 575  VDQLPLHIGNQDSYDPLFPFGFGLNY 600


>emb|CDP09157.1| unnamed protein product [Coffea canephora]
          Length = 604

 Score =  659 bits (1699), Expect = 0.0
 Identities = 320/444 (72%), Positives = 361/444 (81%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGR YESY E+TE+VRK + LVTGLQGQPP GHP GYPFLAGRKNV+ASAKHFVGD   
Sbjct: 155  RWGRYYESYGEDTELVRKFSCLVTGLQGQPPAGHPNGYPFLAGRKNVMASAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       +Y++LER HMAPYLDCLSQGVCTVM SYSSWNG +LHTD+FLLT+VLK 
Sbjct: 215  DKGINEGNTILAYEELERIHMAPYLDCLSQGVCTVMVSYSSWNGSRLHTDHFLLTKVLKE 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKG VISDWEALDRLY PHGSNYRQSILSTVNAGIDMVMVPFRY           ++
Sbjct: 275  KLGFKGLVISDWEALDRLYHPHGSNYRQSILSTVNAGIDMVMVPFRYELFLEELLSLVQS 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M RI+D+VERILRVKFVAG+FE+P TDRSLL++VG KPHRELAREAVRKSLVLLKN
Sbjct: 335  GEIPMDRINDSVERILRVKFVAGLFEHPFTDRSLLELVGSKPHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+PFLPLD KAKR+LV G HAD+LGYQC                   ILDAIKE V
Sbjct: 395  GKDPKKPFLPLDRKAKRVLVTGVHADDLGYQCGGWTCTWTGTSGRITIGTTILDAIKEAV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
               TEV+YE NPSPET + E+F+FA+VAVGE PYVE+GGDDPVLKIPFNG EL+S VADR
Sbjct: 455  GSNTEVIYEKNPSPETFTSEEFSFAIVAVGESPYVETGGDDPVLKIPFNGDELISTVADR 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PT+VIL+SGRPLVLEP  LEK++A + AWLPGTEGRGITDV+FGDY F GRLP+TWF+S
Sbjct: 515  VPTVVILISGRPLVLEPSTLEKVEAFIAAWLPGTEGRGITDVLFGDYAFHGRLPVTWFKS 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP+++  NS DPLFP G+GLT
Sbjct: 575  VDQLPMHIESNSYDPLFPLGYGLT 598


>gb|EOY33800.1| Glycosyl hydrolase family protein [Theobroma cacao]
          Length = 606

 Score =  639 bits (1648), Expect = 0.0
 Identities = 312/444 (70%), Positives = 359/444 (80%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYES+SE+T IVRKMTS++TGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 161  RWGRCYESFSEDTNIVRKMTSIITGLQGQPPSGHPKGYPFVAGRDNVIACAKHFVGDGGT 220

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      +SYDDLER HMAPYLDCL+QGV TVMASYSSWNG KLH  +FLLT++LK+
Sbjct: 221  DKGINEGNTVSSYDDLERIHMAPYLDCLNQGVSTVMASYSSWNGCKLHAHHFLLTDILKD 280

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDW+ALDRL  P GSNYR  + + +NAGIDMVMVP RY           E+
Sbjct: 281  KLGFKGFVISDWKALDRLSEPRGSNYRHCVSTAINAGIDMVMVPHRYKQFIEDLTSLVES 340

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M+RIDDAVERILRVKFVAG+FEYP +DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 341  GEIQMSRIDDAVERILRVKFVAGLFEYPFSDRSLLDMVGCKLHRELAREAVRKSLVLLKN 400

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P +PFLPLD  A+RILVAGTHAD+LGYQC                   ILDA +EVV
Sbjct: 401  GKNPGKPFLPLDKNARRILVAGTHADDLGYQCGGWTRYWQGSSGRITIGTTILDAFREVV 460

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             EKTEV+Y+  PSP++ +R++F+FA+VAVGE PY ES GD+  L IPFNGSEL+S VA+R
Sbjct: 461  GEKTEVIYDKYPSPDSFARQNFSFAIVAVGEEPYAESVGDNSELIIPFNGSELISSVAER 520

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEK+DAL+ AWLPGTEGRGITDV++GDY FEGRLPMTWFR+
Sbjct: 521  IPTLVILISGRPLVIEPWLLEKVDALIAAWLPGTEGRGITDVVYGDYEFEGRLPMTWFRA 580

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            + QLPIN  +NS DPLFP GFGLT
Sbjct: 581  IKQLPINSEDNSCDPLFPLGFGLT 604


>ref|XP_007016181.2| PREDICTED: beta-glucosidase BoGH3B [Theobroma cacao]
          Length = 606

 Score =  639 bits (1647), Expect = 0.0
 Identities = 311/444 (70%), Positives = 360/444 (81%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYES+SE+T IVRKMTS++TGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 161  RWGRCYESFSEDTNIVRKMTSIITGLQGQPPSGHPKGYPFVAGRDNVIACAKHFVGDGGT 220

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      +SYDDLER HMAPYLDCL++GV TVMASYSSWNG KLH  +FLLT++LK+
Sbjct: 221  DKGTNEGNTVSSYDDLERIHMAPYLDCLNEGVSTVMASYSSWNGCKLHAHHFLLTDILKD 280

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDW+ALDRL  P GSNYR  + + +NAGIDMVMVP RY           E+
Sbjct: 281  KLGFKGFVISDWKALDRLSEPKGSNYRHCVYTAINAGIDMVMVPHRYKQFIEDLTSLVES 340

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M+RIDDAVERILRVKFVAG+FEYP +DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 341  GEIQMSRIDDAVERILRVKFVAGLFEYPFSDRSLLDMVGCKLHRELAREAVRKSLVLLKN 400

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P +PFLPLD  A+RILVAGTHAD+LGYQC                   ILDA++EVV
Sbjct: 401  GKNPGKPFLPLDKNARRILVAGTHADDLGYQCGGWTRYWQGSSGRITIGTTILDALREVV 460

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             EKTEV+Y+  PSP++ +R++F+FA+VAVGE PY ES GD+  L IPFNGSEL+S VA+R
Sbjct: 461  GEKTEVIYDKYPSPDSFARQNFSFAIVAVGEEPYAESVGDNSELIIPFNGSELISSVAER 520

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEK+DAL+ AWLPGTEGRGITDV++GDY FEGRLPMTWFR+
Sbjct: 521  IPTLVILISGRPLVIEPWLLEKVDALIAAWLPGTEGRGITDVVYGDYEFEGRLPMTWFRA 580

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            + QLPIN  +NS DPLFP GFGLT
Sbjct: 581  IKQLPINSEDNSCDPLFPLGFGLT 604


>gb|EOY33795.1| Glycosyl hydrolase family protein isoform 2 [Theobroma cacao]
          Length = 534

 Score =  632 bits (1629), Expect = 0.0
 Identities = 307/444 (69%), Positives = 356/444 (80%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T  VRKMTS+VTGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 83   RWGRCYESYSEDTNSVRKMTSIVTGLQGQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGT 142

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDC+SQGV T+MAS+SSWNGRKLH D+FLLTE+LK+
Sbjct: 143  EKGINEGNTILSYDDLERIHMAPYLDCISQGVSTIMASFSSWNGRKLHADHFLLTEILKD 202

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALD+L  P GSN R  I S VNAGIDMVMVPF+Y           E+
Sbjct: 203  KLGFKGFVISDWEALDQLCEPQGSNNRYCISSAVNAGIDMVMVPFKYKQFVEDLAFLVES 262

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GE+ M+RIDDAVERILRVKFV+G+FE+P +DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 263  GEVQMSRIDDAVERILRVKFVSGLFEHPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKN 322

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P+ PFLPLD  AKRILVAGTHAD+LGYQC                   ILDAI+E V
Sbjct: 323  GKNPENPFLPLDKNAKRILVAGTHADDLGYQCGGWTGTWHGCSGRITIGTTILDAIREAV 382

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             +KTEV+Y+  PSP++L+ ++F+FA+V VGE PY E+ GD+  L IPFNGS+++S VAD+
Sbjct: 383  GDKTEVIYDQYPSPDSLAGKNFSFAIVVVGEPPYAETLGDNAELVIPFNGSDIISSVADK 442

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTL IL+SGRPLVLEPWLLEK+DALV AW PG+EG G+TDV+FGD+ FEGRLPMTWFRS
Sbjct: 443  IPTLAILISGRPLVLEPWLLEKVDALVAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRS 502

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            ++QLP+N G NS DPLFP GFGLT
Sbjct: 503  INQLPMNAGHNSYDPLFPLGFGLT 526


>gb|ARU79083.1| beta-glucosidase 5 GH3 family [Camellia sinensis]
          Length = 616

 Score =  634 bits (1636), Expect = 0.0
 Identities = 309/443 (69%), Positives = 356/443 (80%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYE Y E+TEIVRKMT++V+GLQGQPPEGHPKGYPFLAGR  V+A AKHFVGD   
Sbjct: 165  RWGRCYECYGEDTEIVRKMTTIVSGLQGQPPEGHPKGYPFLAGRDKVVACAKHFVGDGGT 224

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMA YLDC+SQGVCTVMAS+SSWNG K+H+ +FLLT++LK+
Sbjct: 225  DKGINEGNTLASYDDLERIHMAAYLDCISQGVCTVMASFSSWNGTKMHSHHFLLTQILKD 284

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDW+ALD+L  PHGSNYR  I S VNAGIDMVMVP +Y           E+
Sbjct: 285  KLGFKGFVISDWQALDKLSDPHGSNYRNCISSAVNAGIDMVMVPLKYELFLEDILNLVES 344

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI MARIDDAVERILRVKFVAG+FEYP+ D+SLLD VGCK HRELAREAVRKSLVLLKN
Sbjct: 345  GEIPMARIDDAVERILRVKFVAGLFEYPMADKSLLDTVGCKMHRELAREAVRKSLVLLKN 404

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+PFLPLD   K+ILVAGTHAD+LGYQC                   ILDAIKE V
Sbjct: 405  GKDPKKPFLPLDRNCKKILVAGTHADDLGYQCGGWTFNWSGTSGRITIGTTILDAIKEAV 464

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             +KTE++YE NPSP+T + +DF+FAVVAVGE PYVE GG DP L IPFNG+EL+S VA+R
Sbjct: 465  GDKTELIYEQNPSPDTFTGQDFSFAVVAVGESPYVEDGGGDPELIIPFNGAELISSVAER 524

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PTL IL+SGRP+ L+P LLEKID L+ AWLPG+EG GITDVIFGD+ F+GRLP+TWF+S
Sbjct: 525  VPTLAILISGRPVTLKPELLEKIDGLIAAWLPGSEGGGITDVIFGDHEFQGRLPVTWFKS 584

Query: 706  VDQLPINVGENSSDPLFPFGFGL 638
            V+QLP++VGE+S DPLFP GFGL
Sbjct: 585  VEQLPMHVGEDSYDPLFPLGFGL 607


>ref|XP_017983610.1| PREDICTED: beta-glucosidase BoGH3B [Theobroma cacao]
 gb|EOY33794.1| Glycosyl hydrolase family protein isoform 1 [Theobroma cacao]
          Length = 606

 Score =  632 bits (1629), Expect = 0.0
 Identities = 307/444 (69%), Positives = 356/444 (80%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T  VRKMTS+VTGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 155  RWGRCYESYSEDTNSVRKMTSIVTGLQGQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDC+SQGV T+MAS+SSWNGRKLH D+FLLTE+LK+
Sbjct: 215  EKGINEGNTILSYDDLERIHMAPYLDCISQGVSTIMASFSSWNGRKLHADHFLLTEILKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALD+L  P GSN R  I S VNAGIDMVMVPF+Y           E+
Sbjct: 275  KLGFKGFVISDWEALDQLCEPQGSNNRYCISSAVNAGIDMVMVPFKYKQFVEDLAFLVES 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GE+ M+RIDDAVERILRVKFV+G+FE+P +DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 335  GEVQMSRIDDAVERILRVKFVSGLFEHPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P+ PFLPLD  AKRILVAGTHAD+LGYQC                   ILDAI+E V
Sbjct: 395  GKNPENPFLPLDKNAKRILVAGTHADDLGYQCGGWTGTWHGCSGRITIGTTILDAIREAV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             +KTEV+Y+  PSP++L+ ++F+FA+V VGE PY E+ GD+  L IPFNGS+++S VAD+
Sbjct: 455  GDKTEVIYDQYPSPDSLAGKNFSFAIVVVGEPPYAETLGDNAELVIPFNGSDIISSVADK 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTL IL+SGRPLVLEPWLLEK+DALV AW PG+EG G+TDV+FGD+ FEGRLPMTWFRS
Sbjct: 515  IPTLAILISGRPLVLEPWLLEKVDALVAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRS 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            ++QLP+N G NS DPLFP GFGLT
Sbjct: 575  INQLPMNAGHNSYDPLFPLGFGLT 598


>ref|XP_012445665.1| PREDICTED: lysosomal beta glucosidase-like [Gossypium raimondii]
 gb|KJB55773.1| hypothetical protein B456_009G093600 [Gossypium raimondii]
          Length = 614

 Score =  632 bits (1629), Expect = 0.0
 Identities = 307/445 (68%), Positives = 354/445 (79%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYES+SE+T IVRKMTS++TGLQGQPP GH KGYPF+AGR NV+A AKHFVGD   
Sbjct: 162  RWGRCYESFSEDTNIVRKMTSIITGLQGQPPVGHSKGYPFVAGRYNVIACAKHFVGDGGT 221

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      +SYD+LE  HMAPYLDCL +GV TVMASYSSWNG KLH  +FLLTE+LK 
Sbjct: 222  EKGINEGNTISSYDELESIHMAPYLDCLYKGVSTVMASYSSWNGCKLHAHHFLLTEILKG 281

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGF+ISDW+ALDRL  P GSNYR+ + + +NAGIDMVMVP+RY           E+
Sbjct: 282  KLGFKGFLISDWKALDRLSEPRGSNYRRCVYTAINAGIDMVMVPYRYKQFIEDLISLVES 341

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M RIDDAVERILRVKFVAG+FEYP +DRSLLD +GCK HRELAREAVRKSLVLLKN
Sbjct: 342  GEIQMTRIDDAVERILRVKFVAGLFEYPFSDRSLLDTIGCKLHRELAREAVRKSLVLLKN 401

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P +PFLPL+  A+R+L+AGTHA+NLGYQC                   ILDA +EV+
Sbjct: 402  GKNPGKPFLPLEKNAERVLIAGTHANNLGYQCGGWTRYWQGSSGRITTGTTILDAFREVM 461

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             EKTEV+YE  PSP TLS ++F+FA+V VGE PY ES GD+  L IP NGSEL+S +ADR
Sbjct: 462  GEKTEVIYEKYPSPNTLSGQNFSFAIVGVGEEPYAESAGDNSELVIPLNGSELISTIADR 521

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEK+DALV AWLPGTEGRGITDV+FGDY FEGRLPMTWFR+
Sbjct: 522  IPTLVILISGRPLVIEPWLLEKMDALVAAWLPGTEGRGITDVVFGDYEFEGRLPMTWFRT 581

Query: 706  VDQLPINVGENSSDPLFPFGFGLTY 632
             ++LPIN G+NS DPLFP GFGLTY
Sbjct: 582  TEELPINKGDNSCDPLFPLGFGLTY 606


>ref|XP_002313393.1| glycosyl hydrolase family 3 family protein [Populus trichocarpa]
 gb|PNT19510.1| hypothetical protein POPTR_009G042800v3 [Populus trichocarpa]
          Length = 603

 Score =  629 bits (1621), Expect = 0.0
 Identities = 308/444 (69%), Positives = 351/444 (79%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T IVR+M S+VTGLQGQPPEGHP GYPFLAGR NV+A AKHFVGD   
Sbjct: 152  RWGRCYESYSEDTNIVREMASIVTGLQGQPPEGHPNGYPFLAGRNNVIACAKHFVGDGGT 211

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SY+DLER HMAPYLDC+SQGV T+M SYSSWNGR+LH  +FLLTEVLK+
Sbjct: 212  HKGLNEGDTILSYEDLERIHMAPYLDCISQGVGTIMVSYSSWNGRQLHAHHFLLTEVLKD 271

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRL  P GSNYR+ + + VNAG DMVMV  ++           E+
Sbjct: 272  KLGFKGFVISDWEALDRLSKPLGSNYRRCVSTAVNAGTDMVMVGQKHREFMKDLIFLAES 331

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M RIDDAVERILRVKFVAG+FEYP  DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 332  GEIPMTRIDDAVERILRVKFVAGLFEYPFADRSLLDIVGCKLHRELAREAVRKSLVLLKN 391

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+P LPLD  AK+ILVAGTHADNLGYQC                   ILDAIKE +
Sbjct: 392  GKDPKKPLLPLDRSAKKILVAGTHADNLGYQCGGWTIAWNGMSGRITIGTTILDAIKEAI 451

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             E+TEV+YE  PSP+TL+ +DF+FA+VAVGE PY E  GD+  L IPFNG++++S VAD+
Sbjct: 452  GEETEVIYEKIPSPDTLASQDFSFAIVAVGEDPYAEFTGDNSELAIPFNGADIISSVADK 511

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEKID L+ AWLPGTEG GITDVIFGDY F GRLP+TWFR 
Sbjct: 512  IPTLVILISGRPLVIEPWLLEKIDGLIAAWLPGTEGEGITDVIFGDYDFSGRLPVTWFRK 571

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            V+QLP+N+ +NS +PLFP GFGLT
Sbjct: 572  VEQLPMNLRDNSEEPLFPLGFGLT 595


>gb|PNT19511.1| hypothetical protein POPTR_009G042800v3 [Populus trichocarpa]
          Length = 609

 Score =  629 bits (1621), Expect = 0.0
 Identities = 308/444 (69%), Positives = 351/444 (79%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T IVR+M S+VTGLQGQPPEGHP GYPFLAGR NV+A AKHFVGD   
Sbjct: 158  RWGRCYESYSEDTNIVREMASIVTGLQGQPPEGHPNGYPFLAGRNNVIACAKHFVGDGGT 217

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SY+DLER HMAPYLDC+SQGV T+M SYSSWNGR+LH  +FLLTEVLK+
Sbjct: 218  HKGLNEGDTILSYEDLERIHMAPYLDCISQGVGTIMVSYSSWNGRQLHAHHFLLTEVLKD 277

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRL  P GSNYR+ + + VNAG DMVMV  ++           E+
Sbjct: 278  KLGFKGFVISDWEALDRLSKPLGSNYRRCVSTAVNAGTDMVMVGQKHREFMKDLIFLAES 337

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M RIDDAVERILRVKFVAG+FEYP  DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 338  GEIPMTRIDDAVERILRVKFVAGLFEYPFADRSLLDIVGCKLHRELAREAVRKSLVLLKN 397

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+P LPLD  AK+ILVAGTHADNLGYQC                   ILDAIKE +
Sbjct: 398  GKDPKKPLLPLDRSAKKILVAGTHADNLGYQCGGWTIAWNGMSGRITIGTTILDAIKEAI 457

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             E+TEV+YE  PSP+TL+ +DF+FA+VAVGE PY E  GD+  L IPFNG++++S VAD+
Sbjct: 458  GEETEVIYEKIPSPDTLASQDFSFAIVAVGEDPYAEFTGDNSELAIPFNGADIISSVADK 517

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEKID L+ AWLPGTEG GITDVIFGDY F GRLP+TWFR 
Sbjct: 518  IPTLVILISGRPLVIEPWLLEKIDGLIAAWLPGTEGEGITDVIFGDYDFSGRLPVTWFRK 577

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            V+QLP+N+ +NS +PLFP GFGLT
Sbjct: 578  VEQLPMNLRDNSEEPLFPLGFGLT 601


>gb|EOY33796.1| Glycosyl hydrolase family protein isoform 3 [Theobroma cacao]
          Length = 607

 Score =  627 bits (1617), Expect = 0.0
 Identities = 307/445 (68%), Positives = 356/445 (80%), Gaps = 1/445 (0%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T  VRKMTS+VTGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 155  RWGRCYESYSEDTNSVRKMTSIVTGLQGQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER HMAPYLDC+SQGV T+MAS+SSWNGRKLH D+FLLTE+LK+
Sbjct: 215  EKGINEGNTILSYDDLERIHMAPYLDCISQGVSTIMASFSSWNGRKLHADHFLLTEILKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDM-VMVPFRYXXXXXXXXXXXE 1430
            KLGFKGFVISDWEALD+L  P GSN R  I S VNAGIDM VMVPF+Y           E
Sbjct: 275  KLGFKGFVISDWEALDQLCEPQGSNNRYCISSAVNAGIDMVVMVPFKYKQFVEDLAFLVE 334

Query: 1429 TGEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLK 1250
            +GE+ M+RIDDAVERILRVKFV+G+FE+P +DRSLLD+VGCK HRELAREAVRKSLVLLK
Sbjct: 335  SGEVQMSRIDDAVERILRVKFVSGLFEHPFSDRSLLDIVGCKLHRELAREAVRKSLVLLK 394

Query: 1249 NGKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEV 1070
            NGK+P+ PFLPLD  AKRILVAGTHAD+LGYQC                   ILDAI+E 
Sbjct: 395  NGKNPENPFLPLDKNAKRILVAGTHADDLGYQCGGWTGTWHGCSGRITIGTTILDAIREA 454

Query: 1069 VDEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVAD 890
            V +KTEV+Y+  PSP++L+ ++F+FA+V VGE PY E+ GD+  L IPFNGS+++S VAD
Sbjct: 455  VGDKTEVIYDQYPSPDSLAGKNFSFAIVVVGEPPYAETLGDNAELVIPFNGSDIISSVAD 514

Query: 889  RIPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFR 710
            +IPTL IL+SGRPLVLEPWLLEK+DALV AW PG+EG G+TDV+FGD+ FEGRLPMTWFR
Sbjct: 515  KIPTLAILISGRPLVLEPWLLEKVDALVAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFR 574

Query: 709  SVDQLPINVGENSSDPLFPFGFGLT 635
            S++QLP+N G NS DPLFP GFGLT
Sbjct: 575  SINQLPMNAGHNSYDPLFPLGFGLT 599


>ref|XP_017648269.1| PREDICTED: beta-glucosidase BoGH3B-like [Gossypium arboreum]
          Length = 613

 Score =  627 bits (1617), Expect = 0.0
 Identities = 305/445 (68%), Positives = 352/445 (79%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYES+SE+T IVRKMTS++TGLQGQPP GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 162  RWGRCYESFSEDTNIVRKMTSIITGLQGQPPVGHPKGYPFVAGRYNVIACAKHFVGDGGT 221

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                      +SYD+LE  HMAPYLDCL +GV TVMASYSSWN  KLH  +FLLTE+LK 
Sbjct: 222  EKGINEGNTISSYDELESIHMAPYLDCLYKGVSTVMASYSSWNECKLHAHHFLLTEILKG 281

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGF+ISDW+ALDRL  P GSNYR+ + + +NAGIDMVMVP+RY           E+
Sbjct: 282  KLGFKGFLISDWKALDRLSEPRGSNYRRCVYTAINAGIDMVMVPYRYKQFIEDLISLVES 341

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEI M RIDDAVERILRVKFVAG+FEYP +DRSLLD +GCK HRELAREAVRKSLVLLKN
Sbjct: 342  GEIQMTRIDDAVERILRVKFVAGLFEYPFSDRSLLDTIGCKLHRELAREAVRKSLVLLKN 401

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P +PFLPL+  A+R+L+AGTHA+NLGYQC                   ILDA +EV+
Sbjct: 402  GKNPGKPFLPLEKNAERVLIAGTHANNLGYQCGGWTRYWQGSSGRITTGTTILDAFREVM 461

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             EKT+V+YE  PSP TLS ++F+FA+V VGE PY ES GD+  L IP NGSEL+S +ADR
Sbjct: 462  GEKTDVIYEKYPSPNTLSGQNFSFAIVGVGEEPYAESAGDNSELVIPLNGSELISTIADR 521

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTLVIL+SGRPLV+EPWLLEKIDALV AWLPGTE RGITDV+FGDY FEGRLPMTWFR+
Sbjct: 522  IPTLVILISGRPLVIEPWLLEKIDALVAAWLPGTEARGITDVVFGDYEFEGRLPMTWFRT 581

Query: 706  VDQLPINVGENSSDPLFPFGFGLTY 632
             ++LPIN G+NS DPLFP  FGLTY
Sbjct: 582  TEELPINKGDNSCDPLFPLAFGLTY 606


>ref|XP_021277936.1| uncharacterized protein LOC110411900 [Herrania umbratica]
          Length = 606

 Score =  626 bits (1614), Expect = 0.0
 Identities = 305/444 (68%), Positives = 354/444 (79%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGRCYESYSE+T IVRKM S+VTGLQGQP  GHPKGYPF+AGR NV+A AKHFVGD   
Sbjct: 155  RWGRCYESYSEDTNIVRKMASIVTGLQGQPLVGHPKGYPFVAGRNNVIACAKHFVGDGGT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       SYDDLER H+APYLDC+SQGV T+MASYSSWNGRKLH D+FLLTE+LK+
Sbjct: 215  EKGINEGNTILSYDDLERIHLAPYLDCISQGVSTIMASYSSWNGRKLHADHFLLTEILKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKGFVISDWEALDRL  P GSN R  I   VNAGIDMVMVPF+Y           ET
Sbjct: 275  KLGFKGFVISDWEALDRLCEPRGSNNRYCISRAVNAGIDMVMVPFKYKQFMEDLAFLVET 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GE+ M+RIDDAVERILRVKFV+G+FE+P +DRSLLD+VGCK HRELAREAVRKSLVLLKN
Sbjct: 335  GEVQMSRIDDAVERILRVKFVSGLFEHPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GK+P+ PFLPLD  AKRILVAGTHAD+LGYQC                   ILDAI+E V
Sbjct: 395  GKNPENPFLPLDKNAKRILVAGTHADDLGYQCGGWTGTWHGGSGRITIGTTILDAIREAV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
             +KTEV+Y+L PSP++L+ ++F+FA+V VGE PY E+ GD+  L IPFNGS+++S VAD+
Sbjct: 455  GDKTEVIYDLYPSPDSLAGKNFSFAIVVVGEPPYAETLGDNAELVIPFNGSDIISSVADK 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            IPTL IL+SGRPLVLEPWLLEK+DALV AW PG+EG G+TDV+FGD+ FEG+LPMTWFRS
Sbjct: 515  IPTLAILISGRPLVLEPWLLEKVDALVAAWFPGSEGGGVTDVVFGDFEFEGQLPMTWFRS 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            ++QLP++ G NS  PLFP GFGLT
Sbjct: 575  INQLPMHAGHNSYGPLFPLGFGLT 598


>ref|XP_019261677.1| PREDICTED: uncharacterized protein LOC109239553 [Nicotiana attenuata]
 gb|OIT38332.1| putative beta-d-xylosidase 2 [Nicotiana attenuata]
          Length = 598

 Score =  625 bits (1613), Expect = 0.0
 Identities = 301/444 (67%), Positives = 361/444 (81%)
 Frame = -3

Query: 1966 RWGRCYESYSENTEIVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHFVGDXXX 1787
            RWGR YESYSE+TE+VRKMTSLVTGLQGQPPEGHP GYP++AGR  V+ASAKHFVGD   
Sbjct: 155  RWGRFYESYSEDTEVVRKMTSLVTGLQGQPPEGHPNGYPYVAGRNYVMASAKHFVGDGAT 214

Query: 1786 XXXXXXXXXXTSYDDLERTHMAPYLDCLSQGVCTVMASYSSWNGRKLHTDYFLLTEVLKN 1607
                       S+D++   H+APY+DC++QGVCTVMASYSSWNG K+HT  +LLTEVLK+
Sbjct: 215  ENGTNEGNTIASHDEMFNIHLAPYIDCIAQGVCTVMASYSSWNGDKMHTHRYLLTEVLKD 274

Query: 1606 KLGFKGFVISDWEALDRLYAPHGSNYRQSILSTVNAGIDMVMVPFRYXXXXXXXXXXXET 1427
            KLGFKG VI+DWEALDRL  PHGS+YRQSI ST+NAGIDMVMVPFRY           ++
Sbjct: 275  KLGFKGLVITDWEALDRLTDPHGSDYRQSIKSTINAGIDMVMVPFRYELFLEELLSLVKS 334

Query: 1426 GEISMARIDDAVERILRVKFVAGVFEYPLTDRSLLDVVGCKPHRELAREAVRKSLVLLKN 1247
            GEISMARIDDAVERILRVKFVAG+FE+P TDRSL+ +VGC+ HR++A EAVRKSLVLLKN
Sbjct: 335  GEISMARIDDAVERILRVKFVAGLFEHPFTDRSLIKLVGCEAHRDVACEAVRKSLVLLKN 394

Query: 1246 GKDPKRPFLPLDNKAKRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXILDAIKEVV 1067
            GKDPK+PFLPLD  AK+ILVAGTHAD+LGYQC                   IL+A+++VV
Sbjct: 395  GKDPKKPFLPLDRNAKKILVAGTHADDLGYQCGGWTATWTGESGRITVGTTILEALRKVV 454

Query: 1066 DEKTEVVYELNPSPETLSREDFAFAVVAVGEGPYVESGGDDPVLKIPFNGSELVSLVADR 887
              +TE+V+E NP+PET + +DF+FA+VA+GEGPY E+GGDDP LKIPFNG+E+ + VADR
Sbjct: 455  GNETEIVFEPNPTPETFANQDFSFAIVAIGEGPYCETGGDDPELKIPFNGTEIATFVADR 514

Query: 886  IPTLVILVSGRPLVLEPWLLEKIDALVVAWLPGTEGRGITDVIFGDYGFEGRLPMTWFRS 707
            +PT+ IL+SGRP+VLEP LLEK+DA V AWLPGTEG GITDV+FGDY F+G+LP+TWF++
Sbjct: 515  VPTVTILISGRPMVLEPSLLEKVDAFVAAWLPGTEGTGITDVLFGDYPFQGKLPVTWFKT 574

Query: 706  VDQLPINVGENSSDPLFPFGFGLT 635
            VDQLP++   N SDPLFPFGFGLT
Sbjct: 575  VDQLPMHARGN-SDPLFPFGFGLT 597

