BLASTX nr result
ID: Mentha25_contig00034155
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00034155
         (1952 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU45973.1| hypothetical protein MIMGU_mgv1a003210mg [Mimulus...    964   0.0
ref|XP_004241177.1| PREDICTED: lysosomal beta glucosidase-like [...    881   0.0
ref|XP_006350824.1| PREDICTED: lysosomal beta glucosidase-like [...    870   0.0
ref|XP_007016175.1| Glycosyl hydrolase family protein isoform 1 ...    852   0.0
ref|XP_002279757.1| PREDICTED: periplasmic beta-glucosidase-like...    850   0.0
ref|XP_007016177.1| Glycosyl hydrolase family protein isoform 3 ...    847   0.0
emb|CAN81230.1| hypothetical protein VITISV_033665 [Vitis vinifera]    833   0.0
ref|XP_004150625.1| PREDICTED: lysosomal beta glucosidase-like [...    832   0.0
ref|XP_004169524.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal be...    830   0.0
ref|XP_006425018.1| hypothetical protein CICLE_v10028048mg [Citr...    829   0.0
ref|XP_002525596.1| hydrolase, hydrolyzing O-glycosyl compounds,...    827   0.0
ref|XP_007016181.1| Glycosyl hydrolase family protein [Theobroma...    826   0.0
ref|XP_004150629.1| PREDICTED: lysosomal beta glucosidase-like [...    826   0.0
ref|XP_004240394.1| PREDICTED: lysosomal beta glucosidase-like [...    825   0.0
ref|XP_007207129.1| hypothetical protein PRUPE_ppa003012mg [Prun...    823   0.0
ref|XP_006404423.1| hypothetical protein EUTSA_v10010212mg [Eutr...    822   0.0
ref|XP_002313393.1| glycosyl hydrolase family 3 family protein [...    821   0.0
ref|XP_006290760.1| hypothetical protein CARUB_v10016864mg [Caps...    818   0.0
ref|XP_004294237.1| PREDICTED: lysosomal beta glucosidase-like [...
818 0.0 ref|NP_190284.1| glycosyl hydrolase family protein [Arabidopsis ... 816 0.0 >gb|EYU45973.1| hypothetical protein MIMGU_mgv1a003210mg [Mimulus guttatus] Length = 600 Score = 964 bits (2491), Expect = 0.0 Identities = 471/600 (78%), Positives = 522/600 (87%), Gaps = 1/600 (0%) Frame = +3 Query: 3 MDCV-YKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGS 179 M+CV YK NAP+EARVKDLLSRMTLLEKIGQMTQIERSVA+ S IKDRFIGSVLSGGGS Sbjct: 1 MECVIYKTLNAPIEARVKDLLSRMTLLEKIGQMTQIERSVATPSVIKDRFIGSVLSGGGS 60 Query: 180 KPFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATR 359 KPFENAKSADWA+MIDG Q GALETRLGIPI YG DAVHGNNNVYGTTIFPHN+GLGATR Sbjct: 61 KPFENAKSADWADMIDGLQKGALETRLGIPIIYGTDAVHGNNNVYGTTIFPHNIGLGATR 120 Query: 360 DADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTG 539 DADL R+IGEVTA+EVRASGA YAFAPCVAV+RDPRWGR YESY EDTE+VRKMTS+VTG Sbjct: 121 DADLARRIGEVTALEVRASGAQYAFAPCVAVSRDPRWGRCYESYGEDTEIVRKMTSIVTG 180 Query: 540 LQGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYL 719 LQGQPPEGH KGYPF+AGR NVLASAKH+VGDGGTE G NEGNTITSY+DLERIHLAPYL Sbjct: 181 LQGQPPEGHLKGYPFVAGRNNVLASAKHFVGDGGTENGTNEGNTITSYDDLERIHLAPYL 240 Query: 720 DCLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSN 899 DC+SQGVC+VMASYSSWNG+KLHTDHFLLTE+LK KLGFMGF+ISDWEALDRLY+PHGSN Sbjct: 241 DCISQGVCTVMASYSSWNGKKLHTDHFLLTELLKKKLGFMGFVISDWEALDRLYSPHGSN 300 Query: 900 YRQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLF 1079 YR+SIL T+NAGIDMVMVPFRYELFL+EFLSL ESGEI MARIDDAVERILRVKF++G+F Sbjct: 301 YRESILSTVNAGIDMVMVPFRYELFLEEFLSLAESGEISMARIDDAVERILRVKFVSGVF 360 Query: 1080 EYPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHA 1259 E+P+ D SLLDLVGCKAHRELAREAVR+SLV RILVAGTHA Sbjct: 361 EHPMADRSLLDLVGCKAHRELAREAVRKSLVLLKNGKDPKKPLLPLNKKAKRILVAGTHA 420 Query: 1260 DNLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYA 1439 DNLGYQC L+AIKE+V+ EV YE NP+ ETFSG+++S+A Sbjct: 421 DNLGYQCGGWTISWEGTSGKITEGTTMLEAIKEMVDHNTEVVYEQNPSPETFSGEEFSFA 480 Query: 1440 
IVVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDA 1619 IV VGEGPYVE+GGDD ELKIPFNG EL SLVA+KVPTL+IL+TGRPLV+EP LLEKI+A Sbjct: 481 IVAVGEGPYVESGGDDPELKIPFNGAELASLVADKVPTLVILITGRPLVVEPSLLEKIEA 540 Query: 1620 LVVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGLKC 1799 LVVAWLPGSEG GITDVI+GDY F+G+LP+TWF+SVD+LP+H+GENS +PLFPFGFGL C Sbjct: 541 LVVAWLPGSEGRGITDVIFGDYPFQGKLPMTWFRSVDQLPVHSGENSLDPLFPFGFGLTC 600 >ref|XP_004241177.1| PREDICTED: lysosomal beta glucosidase-like [Solanum lycopersicum] Length = 598 Score = 881 bits (2276), Expect = 0.0 Identities = 418/597 (70%), Positives = 500/597 (83%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MDC+YKDPNA +E RVKDLLS+MT+ EKIGQMTQIER+VA+ SAI+DR IGSVLSGGGS+ Sbjct: 1 MDCIYKDPNAAIEERVKDLLSKMTVEEKIGQMTQIERAVANPSAIRDRCIGSVLSGGGSR 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFENA+S DWA MIDGFQ GA+E+RLGIPIFYG DA+HGNNNV+G TIFPHN+GLGATRD Sbjct: 61 PFENAESGDWANMIDGFQKGAVESRLGIPIFYGTDAIHGNNNVWGATIFPHNIGLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL R+IGEVTA+E RA G+ YAFAPC+AV +DPRWGR YESYSEDTEVVRKMTSLV+GL Sbjct: 121 ADLVRRIGEVTALETRACGSQYAFAPCIAVAKDPRWGRFYESYSEDTEVVRKMTSLVSGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QGQPPEGHP GYP+++GR +V+ASAKH+VGDG TE G NEGNTI S++D+ IHLAPY+D Sbjct: 181 QGQPPEGHPYGYPYVSGRNSVMASAKHFVGDGATENGTNEGNTIASHDDMFNIHLAPYID 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C++QGVC+VMASYSSWNG K+H+ +LLTEVLK KLGF G +I+DWEAL+RL PH ++Y Sbjct: 241 CIAQGVCTVMASYSSWNGDKMHSHRYLLTEVLKEKLGFKGLLITDWEALERLTDPHDADY 300 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 RQS+ LTINAGIDMVMVPFRYELFL++ LSLVESGEIPM RIDDAVERILRVKF+AGLFE Sbjct: 301 RQSVKLTINAGIDMVMVPFRYELFLEQLLSLVESGEIPMTRIDDAVERILRVKFVAGLFE 360 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 +P TD SL+DLVGCKAHRELAREAVR+SLV 
+ILVAGTHAD Sbjct: 361 HPFTDRSLIDLVGCKAHRELAREAVRKSLVLLKNGKDPKKPFLPLDKTAKKILVAGTHAD 420 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC +DAI+E++ DK E+ +E NPT ETF+G+D+S+AI Sbjct: 421 DLGYQCGGWTATWTGLSGRITVGTTIMDAIREMLGDKTEIVFEPNPTAETFAGEDFSFAI 480 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 V +GEGPY ETGGDD ELKIPFNGTE+ + VA++VPT+ IL++GRP+V+EP LLEK+DA Sbjct: 481 VAIGEGPYCETGGDDPELKIPFNGTEIATFVADRVPTVTILISGRPMVIEPPLLEKVDAF 540 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 V AWLPG+EG+GITDV++GDY F+G+LPVTWFK+VD++P+H NS+ PLFPFGFGL Sbjct: 541 VAAWLPGTEGDGITDVLFGDYPFQGKLPVTWFKTVDQIPMHVHGNSN-PLFPFGFGL 596 >ref|XP_006350824.1| PREDICTED: lysosomal beta glucosidase-like [Solanum tuberosum] Length = 598 Score = 870 bits (2248), Expect = 0.0 Identities = 415/597 (69%), Positives = 495/597 (82%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MDC+YKDPNA +E RVKDLLS+MT+ EKIGQ+TQIER+VA+ SAI+DR IGSVLSGGGS+ Sbjct: 1 MDCIYKDPNAAIEERVKDLLSKMTVEEKIGQITQIERAVANPSAIRDRCIGSVLSGGGSR 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFENA+SADWA MIDGFQ GA+E+RLGIPIFYG DA+HGNNNV+G TIFPHN+GLGATRD Sbjct: 61 PFENAESADWANMIDGFQKGAVESRLGIPIFYGTDAIHGNNNVWGATIFPHNIGLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL R+IG VTA+E RA G+ YAFAPCVAV +DPRWGR YESYSEDTEVVRKMTSLV+GL Sbjct: 121 ADLVRRIGVVTALETRACGSQYAFAPCVAVAKDPRWGRFYESYSEDTEVVRKMTSLVSGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QGQPPEGHP GYP++AGR +V+ASAKH+VGDG TE G NEGNTI S++D+ IHLAPY+D Sbjct: 181 QGQPPEGHPYGYPYVAGRNSVMASAKHFVGDGATENGTNEGNTIASHDDMFNIHLAPYID 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C++QGVC+VMASYSSWNG K+H+ +LLTEVLK KLGF G +I+DWEAL+RL PH ++Y Sbjct: 241 CIAQGVCTVMASYSSWNGDKMHSHRYLLTEVLKEKLGFKGLLITDWEALERLTDPHDADY 300 Query: 903 
RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 RQS+ TINAGIDMVMVPFRYELFL++ SLVESGEIP+ RIDDAVERILRVKF+ GLFE Sbjct: 301 RQSVKSTINAGIDMVMVPFRYELFLEQLQSLVESGEIPLTRIDDAVERILRVKFVTGLFE 360 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 +P TD SL+DLVGCKAHRELAREAVR+SLV +ILVAGTHAD Sbjct: 361 HPFTDRSLIDLVGCKAHRELAREAVRKSLVLLKNGKDPKKPFLPLDKTAKKILVAGTHAD 420 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC +DAI+E++ DK E+ +E NPT ETF+ QD+S+AI Sbjct: 421 DLGYQCGGWTATWTGLSGRITVGTTIMDAIREMLGDKTEIVFEPNPTAETFASQDFSFAI 480 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 V +GEGPY ETGGDD ELKIPFNGTE+ + VA++VPT+ IL++GRP+V+EP LLEK+DA Sbjct: 481 VAIGEGPYCETGGDDPELKIPFNGTEIATFVADRVPTVTILISGRPMVIEPPLLEKVDAF 540 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 V AWLPG+EG GITDV++GDY F+G+LPVTWFK+VD++P+H NS+ PLFPFGFGL Sbjct: 541 VAAWLPGTEGAGITDVLFGDYPFQGKLPVTWFKTVDQIPMHVHGNSN-PLFPFGFGL 596 >ref|XP_007016175.1| Glycosyl hydrolase family protein isoform 1 [Theobroma cacao] gi|508786538|gb|EOY33794.1| Glycosyl hydrolase family protein isoform 1 [Theobroma cacao] Length = 606 Score = 852 bits (2201), Expect = 0.0 Identities = 410/599 (68%), Positives = 485/599 (80%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MDCVYK+PNAP+E RVKDLLSRMTL EKIGQMTQIER VA SA+KD IGS+LS GGS Sbjct: 1 MDCVYKNPNAPIEDRVKDLLSRMTLQEKIGQMTQIERRVADPSALKDFSIGSILSAGGSG 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFENA S+DWA+M+D FQ ALE+RLGIP+ YG+DAVHGNN+VYG TIFPHNVGLGATRD Sbjct: 61 PFENALSSDWADMVDRFQQAALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL ++IG TA+EVRASG Y FAPCV V RDPRWGR YESYSEDT VRKMTS+VTGL Sbjct: 121 ADLAQRIGTATALEVRASGIQYTFAPCVTVCRDPRWGRCYESYSEDTNSVRKMTSIVTGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 
722 QGQPP GHPKGYPF+AGR NV+A AKH+VGDGGTEKGINEGNTI SY+DLERIH+APYLD Sbjct: 181 QGQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGTEKGINEGNTILSYDDLERIHMAPYLD 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C+SQGV ++MAS+SSWNGRKLH DHFLLTE+LK+KLGF GF+ISDWEALD+L P GSN Sbjct: 241 CISQGVSTIMASFSSWNGRKLHADHFLLTEILKDKLGFKGFVISDWEALDQLCEPQGSNN 300 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 R I +NAGIDMVMVPF+Y+ F+++ LVESGE+ M+RIDDAVERILRVKF++GLFE Sbjct: 301 RYCISSAVNAGIDMVMVPFKYKQFVEDLAFLVESGEVQMSRIDDAVERILRVKFVSGLFE 360 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 +P +D SLLD+VGCK HRELAREAVR+SLV RILVAGTHAD Sbjct: 361 HPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKNGKNPENPFLPLDKNAKRILVAGTHAD 420 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC LDAI+E V DK EV Y+ P+ ++ +G+++S+AI Sbjct: 421 DLGYQCGGWTGTWHGCSGRITIGTTILDAIREAVGDKTEVIYDQYPSPDSLAGKNFSFAI 480 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 VVVGE PY ET GD+ EL IPFNG++++S VA+K+PTL IL++GRPLVLEP LLEK+DAL Sbjct: 481 VVVGEPPYAETLGDNAELVIPFNGSDIISSVADKIPTLAILISGRPLVLEPWLLEKVDAL 540 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGLKC 1799 V AW PGSEG G+TDV++GD+ FEGRLP+TWF+S+++LP++AG NS +PLFP GFGL C Sbjct: 541 VAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRSINQLPMNAGHNSYDPLFPLGFGLTC 599 >ref|XP_002279757.1| PREDICTED: periplasmic beta-glucosidase-like [Vitis vinifera] Length = 720 Score = 850 bits (2197), Expect = 0.0 Identities = 410/599 (68%), Positives = 478/599 (79%), Gaps = 2/599 (0%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MDC+YKDPN P+EAR+KDLLSRMTL EK GQMTQIER VA+ S +KD IGS+LS GGS Sbjct: 113 MDCIYKDPNQPIEARIKDLLSRMTLKEKAGQMTQIERRVATPSVLKDLSIGSILSAGGSG 172 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PF+ A SADWA+M+DGFQ ALE+RLGIP+ YG+DAVHGNN++YG TIFPHNVGLGATRD Sbjct: 173 
PFDKALSADWADMVDGFQQSALESRLGIPLLYGIDAVHGNNSIYGATIFPHNVGLGATRD 232 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL ++IG TA+EVRASG HY FAPCVAV RDPRWGR YESYS DT +VRKMTS++TGL Sbjct: 233 ADLAQRIGVATALEVRASGIHYTFAPCVAVCRDPRWGRCYESYSSDTNIVRKMTSVITGL 292 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QG+PP GHPKGYPF+AGR NV+A AKH+VGDGGT+KG NEGNTI SYEDLERIH+ PY D Sbjct: 293 QGKPPPGHPKGYPFVAGRHNVVACAKHFVGDGGTDKGENEGNTILSYEDLERIHMTPYPD 352 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYA--PHGS 896 C+SQGV +VMASYSSWNG +LH FLL++VLK+K+GF GF+ISDWE LDRL PHGS Sbjct: 353 CISQGVATVMASYSSWNGTQLHAHRFLLSDVLKDKMGFKGFLISDWEGLDRLSKPNPHGS 412 Query: 897 NYRQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGL 1076 NYR SI +N GIDMVMVPFRY FL++ + LVESGEIPM RIDDAVERILRVK +AGL Sbjct: 413 NYRTSICTAVNTGIDMVMVPFRYAKFLEDLIDLVESGEIPMTRIDDAVERILRVKLVAGL 472 Query: 1077 FEYPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTH 1256 FEYP +D SLLD VGCK HR+LAREAVR+SLV R+LVAG+H Sbjct: 473 FEYPYSDRSLLDTVGCKLHRDLAREAVRKSLVLLKNGKDQKKPFLPLDRKAKRVLVAGSH 532 Query: 1257 ADNLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSY 1436 AD+LGYQC LDAI+E V DK EV YE NP+ TF GQD+SY Sbjct: 533 ADDLGYQCGGWTATWHGASGRITIGTTVLDAIREAVGDKTEVIYEQNPSPATFEGQDFSY 592 Query: 1437 AIVVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKID 1616 AIVVVGE PY E GD+ EL IPFN +++SLVA+++PTL+IL++GRPLVLEP +LEK+D Sbjct: 593 AIVVVGEDPYAEHTGDNSELIIPFNANDVISLVADRIPTLVILISGRPLVLEPWILEKMD 652 Query: 1617 ALVVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AL+ AWLPGSEG GITDV++GDY FEGRLPVTWFKSV++LP+H +NS +PLFPFGFGL Sbjct: 653 ALIAAWLPGSEGGGITDVVFGDYDFEGRLPVTWFKSVEQLPMHPEDNSYDPLFPFGFGL 711 >ref|XP_007016177.1| Glycosyl hydrolase family protein isoform 3 [Theobroma cacao] gi|508786540|gb|EOY33796.1| Glycosyl hydrolase family protein isoform 3 [Theobroma cacao] Length = 607 Score = 847 bits (2189), Expect = 0.0 Identities = 410/600 (68%), Positives = 485/600 
(80%), Gaps = 1/600 (0%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MDCVYK+PNAP+E RVKDLLSRMTL EKIGQMTQIER VA SA+KD IGS+LS GGS Sbjct: 1 MDCVYKNPNAPIEDRVKDLLSRMTLQEKIGQMTQIERRVADPSALKDFSIGSILSAGGSG 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFENA S+DWA+M+D FQ ALE+RLGIP+ YG+DAVHGNN+VYG TIFPHNVGLGATRD Sbjct: 61 PFENALSSDWADMVDRFQQAALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL ++IG TA+EVRASG Y FAPCV V RDPRWGR YESYSEDT VRKMTS+VTGL Sbjct: 121 ADLAQRIGTATALEVRASGIQYTFAPCVTVCRDPRWGRCYESYSEDTNSVRKMTSIVTGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QGQPP GHPKGYPF+AGR NV+A AKH+VGDGGTEKGINEGNTI SY+DLERIH+APYLD Sbjct: 181 QGQPPVGHPKGYPFVAGRNNVIACAKHFVGDGGTEKGINEGNTILSYDDLERIHMAPYLD 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C+SQGV ++MAS+SSWNGRKLH DHFLLTE+LK+KLGF GF+ISDWEALD+L P GSN Sbjct: 241 CISQGVSTIMASFSSWNGRKLHADHFLLTEILKDKLGFKGFVISDWEALDQLCEPQGSNN 300 Query: 903 RQSILLTINAGIDM-VMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLF 1079 R I +NAGIDM VMVPF+Y+ F+++ LVESGE+ M+RIDDAVERILRVKF++GLF Sbjct: 301 RYCISSAVNAGIDMVVMVPFKYKQFVEDLAFLVESGEVQMSRIDDAVERILRVKFVSGLF 360 Query: 1080 EYPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHA 1259 E+P +D SLLD+VGCK HRELAREAVR+SLV RILVAGTHA Sbjct: 361 EHPFSDRSLLDIVGCKLHRELAREAVRKSLVLLKNGKNPENPFLPLDKNAKRILVAGTHA 420 Query: 1260 DNLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYA 1439 D+LGYQC LDAI+E V DK EV Y+ P+ ++ +G+++S+A Sbjct: 421 DDLGYQCGGWTGTWHGCSGRITIGTTILDAIREAVGDKTEVIYDQYPSPDSLAGKNFSFA 480 Query: 1440 IVVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDA 1619 IVVVGE PY ET GD+ EL IPFNG++++S VA+K+PTL IL++GRPLVLEP LLEK+DA Sbjct: 481 IVVVGEPPYAETLGDNAELVIPFNGSDIISSVADKIPTLAILISGRPLVLEPWLLEKVDA 540 Query: 1620 LVVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGLKC 1799 LV AW PGSEG 
G+TDV++GD+ FEGRLP+TWF+S+++LP++AG NS +PLFP GFGL C Sbjct: 541 LVAAWFPGSEGGGVTDVVFGDFEFEGRLPMTWFRSINQLPMNAGHNSYDPLFPLGFGLTC 600 >emb|CAN81230.1| hypothetical protein VITISV_033665 [Vitis vinifera] Length = 639 Score = 833 bits (2152), Expect = 0.0 Identities = 409/630 (64%), Positives = 478/630 (75%), Gaps = 33/630 (5%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDR------------ 146 MDC+YKDPN P+EAR+KDLLSRMTL EK GQMTQIER VA+ S +KD Sbjct: 1 MDCIYKDPNQPIEARIKDLLSRMTLKEKAGQMTQIERRVATPSVLKDLSIGTIHLIMQYA 60 Query: 147 -------------------FIGSVLSGGGSKPFENAKSADWAEMIDGFQTGALETRLGIP 269 F GS+LS GGS PF+ A SADWA+M+DGFQ ALE+RLGIP Sbjct: 61 LMDCVLLCIFFIQLVVLILFSGSILSAGGSGPFDKALSADWADMVDGFQKSALESRLGIP 120 Query: 270 IFYGVDAVHGNNNVYGTTIFPHNVGLGATRDADLTRKIGEVTAIEVRASGAHYAFAPCVA 449 + YG+DAVHGNN++YG TIFPHNVGLGATRDADL ++IG TA+EVRASG HY FAPCVA Sbjct: 121 LLYGIDAVHGNNSIYGATIFPHNVGLGATRDADLAQRIGVATALEVRASGIHYTFAPCVA 180 Query: 450 VTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQGQPPEGHPKGYPFLAGRKNVLASAKHYV 629 V RDPRWGR YES S DT +VRKMTS++TGLQG+PP GHPKGYPF+AGR NV+A AKH+V Sbjct: 181 VCRDPRWGRCYESXSSDTNIVRKMTSVITGLQGKPPPGHPKGYPFVAGRHNVVACAKHFV 240 Query: 630 GDGGTEKGINEGNTITSYEDLERIHLAPYLDCLSQGVCSVMASYSSWNGRKLHTDHFLLT 809 GDGGT+KG NEGNTI SYEDLERIH+ PY DC+SQGV +VMASYSSWNG +LH FLL+ Sbjct: 241 GDGGTDKGENEGNTILSYEDLERIHMTPYPDCISQGVATVMASYSSWNGTQLHAHRFLLS 300 Query: 810 EVLKNKLGFMGFIISDWEALDRLYA--PHGSNYRQSILLTINAGIDMVMVPFRYELFLDE 983 +VLK+K+GF GF+ISDWE LDRL PHGSNYR SI +N GIDMVMVPFRY FL++ Sbjct: 301 DVLKDKMGFKGFLISDWEGLDRLSKPNPHGSNYRTSICTAVNTGIDMVMVPFRYAKFLED 360 Query: 984 FLSLVESGEIPMARIDDAVERILRVKFIAGLFEYPLTDLSLLDLVGCKAHRELAREAVRR 1163 + LVESGEIPM RIDDAVERILRVKF+AGLFEYP +D SLLD VGCK HR+LAREAVR+ Sbjct: 361 LIDLVESGEIPMTRIDDAVERILRVKFVAGLFEYPYSDRSLLDTVGCKLHRDLAREAVRK 420 Query: 1164 SLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNLGYQCXXXXXXXXXXXXXXXXXXXXL 1343 SLV R+LVAG+HAD+LGYQC L Sbjct: 421 SLVLLKNGKDQKKPFLPLDRKAKRVLVAGSHADDLGYQCGGWTATWHGASGRITIGTTVL 480 Query: 1344 
DAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIVVVGEGPYVETGGDDLELKIPFNGTEL 1523 DAI+E V DK EV YE NP+ TF GQD+SYAIVVVGE PY E GD+ EL IPFN ++ Sbjct: 481 DAIREAVGDKTEVIYEQNPSPATFEGQDFSYAIVVVGEDPYAEHTGDNSELIIPFNANDV 540 Query: 1524 VSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALVVAWLPGSEGNGITDVIYGDYAFEGRL 1703 +SLVA+++PTL+IL++GRPLVLEP +LEK+DAL+ AWLPGSEG G+TDV++GDY FEGRL Sbjct: 541 ISLVADRIPTLVILISGRPLVLEPWILEKMDALIAAWLPGSEGGGMTDVVFGDYDFEGRL 600 Query: 1704 PVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 PVTWFKSV++LP+H +NS +PLFPFGFGL Sbjct: 601 PVTWFKSVEQLPMHPEDNSYDPLFPFGFGL 630 >ref|XP_004150625.1| PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus] Length = 609 Score = 832 bits (2149), Expect = 0.0 Identities = 402/596 (67%), Positives = 475/596 (79%) Frame = +3 Query: 6 DCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKP 185 DCVYK+ +AP+E R+KDLLSRMTL EKIGQMTQIER+VA+ SA+ D IGSVL+ GGS P Sbjct: 5 DCVYKNSSAPIEVRIKDLLSRMTLREKIGQMTQIERTVATPSALGDFAIGSVLNAGGSAP 64 Query: 186 FENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDA 365 F A S+DWA+MID FQ+ A+++RLGIPI YG DAVHGNNNVYG TIFPHNVGLGATRDA Sbjct: 65 FRGALSSDWADMIDRFQSWAIQSRLGIPIIYGSDAVHGNNNVYGATIFPHNVGLGATRDA 124 Query: 366 DLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQ 545 DL R+IG VTA+EVRASG HYAFAPCVAV+RDPRWGR YESYSEDTEVVRKMT LV GLQ Sbjct: 125 DLVRRIGTVTALEVRASGIHYAFAPCVAVSRDPRWGRCYESYSEDTEVVRKMTCLVEGLQ 184 Query: 546 GQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDC 725 G+PP G+PKGYPF+AGR NV+A AKH+VGDGGT+KG+NEGNTI SY++LERIH+APYLDC Sbjct: 185 GKPPTGYPKGYPFVAGRNNVIACAKHFVGDGGTDKGLNEGNTIASYDELERIHMAPYLDC 244 Query: 726 LSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYR 905 ++QGV +VMASYSSWNGR LH DHFLLT++LKNKLGF GF+ISDW+ LDRL P GSNYR Sbjct: 245 IAQGVSTVMASYSSWNGRPLHADHFLLTQILKNKLGFKGFVISDWQGLDRLSRPRGSNYR 304 Query: 906 QSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEY 1085 I +NAGIDMVMVP RYE F+ + L LVESGEIPM RIDDAVERILRVKF++G+FE+ Sbjct: 305 LCISAAVNAGIDMVMVPLRYEQFIKDLLFLVESGEIPMTRIDDAVERILRVKFVSGVFEH 
364 Query: 1086 PLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADN 1265 P +D SLLD+VGCK HR+LAREAVR+SLV +ILVAG+HAD+ Sbjct: 365 PFSDRSLLDVVGCKIHRDLAREAVRKSLVLLKNGKDPTKPFLPLDMKAKKILVAGSHADD 424 Query: 1266 LGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIV 1445 LGYQC LDAIKE V D+ EV YE NP+ T + QD S+AIV Sbjct: 425 LGYQCGGWTISWDGMTGRITIGTTILDAIKEAVGDQTEVIYEQNPSAATLNDQDISFAIV 484 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 +GE PY E GDD +L IPFNG ++V VA K+PTL+ILV+GRPL+LEP ++E +AL+ Sbjct: 485 AIGESPYAEFTGDDSKLVIPFNGNDIVKAVAGKMPTLVILVSGRPLILEPTVMENAEALI 544 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPGSEG+GITDVI+GDY F GRLP+TWF++V++LP+HA N + LFPFGFGL Sbjct: 545 AAWLPGSEGSGITDVIFGDYDFTGRLPITWFRTVEQLPVHAENNLQDSLFPFGFGL 600 >ref|XP_004169524.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal beta glucosidase-like [Cucumis sativus] Length = 609 Score = 830 bits (2145), Expect = 0.0 Identities = 402/596 (67%), Positives = 474/596 (79%) Frame = +3 Query: 6 DCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKP 185 DCVYK+ +AP+E R+KDLLSRMTL EKIGQMTQIER+VA+ SA+ D IGSVL+ GGS P Sbjct: 5 DCVYKNSSAPIEVRIKDLLSRMTLREKIGQMTQIERTVATPSALGDFAIGSVLNAGGSAP 64 Query: 186 FENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDA 365 F A S+DWA+MID FQ+ A+++RLGIPI YG DAVHGNNNVYG TIFPHNVGLGATRDA Sbjct: 65 FRGALSSDWADMIDRFQSWAIQSRLGIPIIYGSDAVHGNNNVYGATIFPHNVGLGATRDA 124 Query: 366 DLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQ 545 DL R+IG VTA+EVRASG HYAFAPCVAV+RDPRWGR YESYSEDTEVVRKMT LV GLQ Sbjct: 125 DLVRRIGTVTALEVRASGIHYAFAPCVAVSRDPRWGRCYESYSEDTEVVRKMTCLVEGLQ 184 Query: 546 GQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDC 725 G+PP G+PKGYPF+AGR NV+A AKH+VGDGGT+KG+NEGNTI SY++LERIH+APYLDC Sbjct: 185 GKPPTGYPKGYPFVAGRNNVIACAKHFVGDGGTDKGLNEGNTIASYDELERIHMAPYLDC 244 Query: 726 LSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYR 905 ++QGV +VMASYSSWNGR LH DHFLLT++LK KLGF GF+ISDW+ LDRL P 
GSNYR Sbjct: 245 IAQGVSTVMASYSSWNGRPLHADHFLLTQILKXKLGFKGFVISDWQGLDRLSRPRGSNYR 304 Query: 906 QSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEY 1085 I +NAGIDMVMVP RYE F+ + L LVESGEIPM RIDDAVERILRVKF++G+FE+ Sbjct: 305 LCISAAVNAGIDMVMVPLRYEQFIKDLLFLVESGEIPMTRIDDAVERILRVKFVSGVFEH 364 Query: 1086 PLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADN 1265 P +D SLLD+VGCK HR+LAREAVR+SLV +ILVAG+HAD+ Sbjct: 365 PFSDRSLLDVVGCKIHRDLAREAVRKSLVLLKNGKDPTKPFLPLDMKAKKILVAGSHADD 424 Query: 1266 LGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIV 1445 LGYQC LDAIKE V D+ EV YE NP+ T + QD S+AIV Sbjct: 425 LGYQCGGWTISWDGMTGRITIGTTILDAIKEAVGDQTEVIYEQNPSAATLNDQDISFAIV 484 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 +GE PY E GDD +L IPFNG ++V VA K+PTL+ILV+GRPL+LEP ++E +AL+ Sbjct: 485 AIGESPYAEFTGDDSKLVIPFNGNDIVKAVAGKMPTLVILVSGRPLILEPTVMENAEALI 544 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPGSEG+GITDVI+GDY F GRLP+TWF++V++LP+HA N E LFPFGFGL Sbjct: 545 AAWLPGSEGSGITDVIFGDYDFTGRLPITWFRTVEQLPVHAENNLQESLFPFGFGL 600 >ref|XP_006425018.1| hypothetical protein CICLE_v10028048mg [Citrus clementina] gi|568870591|ref|XP_006488483.1| PREDICTED: lysosomal beta glucosidase-like [Citrus sinensis] gi|557526952|gb|ESR38258.1| hypothetical protein CICLE_v10028048mg [Citrus clementina] Length = 606 Score = 829 bits (2142), Expect = 0.0 Identities = 399/597 (66%), Positives = 475/597 (79%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 M+ +Y++PNA VE R+KDLLSRMTL EKIGQMTQIER VA+ S +KD IGS+LS GGS Sbjct: 1 MESIYRNPNAHVEDRIKDLLSRMTLKEKIGQMTQIERGVATPSVLKDLSIGSILSSGGSM 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 P NA SA WA+M+DGFQ AL +RLGIP+ YG+DAVHGNN+VYG TIFPHNV LGATRD Sbjct: 61 PSVNALSAGWADMVDGFQKAALASRLGIPLIYGIDAVHGNNSVYGATIFPHNVNLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 DL R+IG TA+EVRASG HY FAPCVAV +DPRWGR 
YESYSEDTE+VRKMTS+V+GL Sbjct: 121 GDLARRIGVATALEVRASGIHYTFAPCVAVGKDPRWGRYYESYSEDTEIVRKMTSIVSGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QG+PP+ HPKGYP++AGR NV+A AKH+VGDGGTE+GINEGNTI++Y+DLE+IH+APYLD Sbjct: 181 QGRPPKEHPKGYPYVAGRNNVIACAKHFVGDGGTERGINEGNTISTYDDLEKIHMAPYLD 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C+SQGVC++MASYSSWNGRKLH DHFLLTEVLKNKLGF GF+ISDWE LDRL PHGSNY Sbjct: 241 CISQGVCTIMASYSSWNGRKLHADHFLLTEVLKNKLGFKGFVISDWEGLDRLSQPHGSNY 300 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 R I +NAGIDMVMVP R++ F ++ LVESG++PM+RIDDAVERILRVKF+AGLFE Sbjct: 301 RYCISTAVNAGIDMVMVPHRFDQFFEDLTYLVESGKVPMSRIDDAVERILRVKFVAGLFE 360 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 YP +D SLL++VGCK HRELAREAVR+SLV RILV GTHAD Sbjct: 361 YPFSDKSLLNIVGCKLHRELAREAVRKSLVLLKNGKKPEKPFLPLDRNAKRILVVGTHAD 420 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC L+A+KE V D+ EV YE P+ +TF D+S+AI Sbjct: 421 DLGYQCGGWTKTWFGMSGKITIGTTILEAVKEAVGDETEVIYEKYPSPDTFVAGDFSFAI 480 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 VGE PY ET GD+ EL IP NG +++SLVAE++PTL ILV+GRPLVLEPQLLEK DAL Sbjct: 481 AAVGEEPYAETLGDNSELIIPLNGGDVISLVAERIPTLAILVSGRPLVLEPQLLEKADAL 540 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 V AWLPGSEG+GI DV++GD+ F GRLPVTW++SV +LP++ +N+ +PLFP GFGL Sbjct: 541 VAAWLPGSEGSGIADVVFGDHDFTGRLPVTWYRSVQRLPMNVADNTYDPLFPLGFGL 597 >ref|XP_002525596.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus communis] gi|223535032|gb|EEF36714.1| hydrolase, hydrolyzing O-glycosyl compounds, putative [Ricinus communis] Length = 603 Score = 827 bits (2137), Expect = 0.0 Identities = 403/595 (67%), Positives = 468/595 (78%) Frame = +3 Query: 9 CVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKPF 188 C+YKDPN+PVE RVKDL+SRMTL EKI QMTQIER AS ++D +GS+LS GGS PF Sbjct: 4 
CIYKDPNSPVEDRVKDLISRMTLKEKIAQMTQIERRAASPHYLRDFGVGSLLSVGGSTPF 63 Query: 189 ENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDAD 368 ENA S+DWA+MIDG+Q ALE+RLGIPI YG+DAVHGNNNVYG TIFPHNVGLGATRDAD Sbjct: 64 ENALSSDWADMIDGYQKLALESRLGIPIMYGIDAVHGNNNVYGATIFPHNVGLGATRDAD 123 Query: 369 LTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQG 548 L R+IG TA+EVRASG HY FAPCVAV+RDPRWGR YESY EDT VVRKMTS+VTGLQG Sbjct: 124 LIRRIGVATALEVRASGIHYTFAPCVAVSRDPRWGRCYESYGEDTNVVRKMTSIVTGLQG 183 Query: 549 QPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDCL 728 +PPEGHP GYPF+AGR NV+A AKH+VGDGGT+KG+NEGNTI SYEDLE IH+ PYLDC+ Sbjct: 184 KPPEGHPNGYPFIAGRNNVIACAKHFVGDGGTDKGLNEGNTILSYEDLEGIHMTPYLDCI 243 Query: 729 SQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYRQ 908 SQGVC++MASYSSWNGRKLH DHFLLTE+LK+KLGF G +ISDWE L+RL P GSNYR Sbjct: 244 SQGVCTIMASYSSWNGRKLHADHFLLTEILKDKLGFQGIVISDWEGLNRLSQPLGSNYRH 303 Query: 909 SILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEYP 1088 I INAGIDMVMV ++E F++E + L ESGEI +ARIDDAVERILRVK +AGLFEYP Sbjct: 304 CISSAINAGIDMVMVGHKHEEFVEELMFLAESGEITIARIDDAVERILRVKLVAGLFEYP 363 Query: 1089 LTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNL 1268 D LLDLVGCK HRELAREAVR+SLV +ILVAGTHADNL Sbjct: 364 FADRYLLDLVGCKLHRELAREAVRKSLVLLKNGKDPKKPFLPLDKNAKKILVAGTHADNL 423 Query: 1269 GYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIVV 1448 GYQC LDAIK V + EV +E NP+ +T + QD+SYAIV Sbjct: 424 GYQCGGWTKSWDGMSGRITIGTTILDAIKNTVGENTEVIFEENPSPDTLASQDFSYAIVA 483 Query: 1449 VGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALVV 1628 VGEGPY E GD+ EL IPFNG ++S +A+++PTL IL++GRPLVLE LLEK+ A V Sbjct: 484 VGEGPYAEFTGDNSELVIPFNGMGVISSIADRIPTLAILISGRPLVLEASLLEKVYAFVA 543 Query: 1629 AWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPG+EG G+ DVI+GDY F+G+LPVTWFKSV++LP++ G NS +PLFPFGFGL Sbjct: 544 AWLPGTEGAGVADVIFGDYEFKGKLPVTWFKSVEQLPMNYGANSYDPLFPFGFGL 598 >ref|XP_007016181.1| Glycosyl hydrolase family protein [Theobroma cacao] 
gi|508786544|gb|EOY33800.1| Glycosyl hydrolase family protein [Theobroma cacao] Length = 606 Score = 826 bits (2134), Expect = 0.0 Identities = 393/599 (65%), Positives = 478/599 (79%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 M CVYK+PNAP+E R+K+L+S MTL EKIGQMTQIE VA+ S ++D IGS++SGGG Sbjct: 7 MSCVYKNPNAPIEDRIKNLVSGMTLQEKIGQMTQIELCVATPSDVRDLSIGSMISGGGKP 66 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 P E A +DWA+ +D FQ AL++RLGIP+ YG+DAVHGNN YG TIFPHN+GLGATRD Sbjct: 67 PLEKATPSDWADTLDRFQQAALDSRLGIPLIYGIDAVHGNNRFYGATIFPHNIGLGATRD 126 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL ++IG A+EVRASG H+ FAPCVAV RDPRWGR YES+SEDT +VRKMTS++TGL Sbjct: 127 ADLAQRIGAAVALEVRASGIHFNFAPCVAVCRDPRWGRCYESFSEDTNIVRKMTSIITGL 186 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QGQPP GHPKGYPF+AGR NV+A AKH+VGDGGT+KGINEGNT++SY+DLERIH+APYLD Sbjct: 187 QGQPPSGHPKGYPFVAGRDNVIACAKHFVGDGGTDKGINEGNTVSSYDDLERIHMAPYLD 246 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 CL+QGV +VMASYSSWNG KLH HFLLT++LK+KLGF GF+ISDW+ALDRL P GSNY Sbjct: 247 CLNQGVSTVMASYSSWNGCKLHAHHFLLTDILKDKLGFKGFVISDWKALDRLSEPRGSNY 306 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 R + INAGIDMVMVP RY+ F+++ SLVESGEI M+RIDDAVERILRVKF+AGLFE Sbjct: 307 RHCVSTAINAGIDMVMVPHRYKQFIEDLTSLVESGEIQMSRIDDAVERILRVKFVAGLFE 366 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 YP +D SLLD+VGCK HRELAREAVR+SLV RILVAGTHAD Sbjct: 367 YPFSDRSLLDMVGCKLHRELAREAVRKSLVLLKNGKNPGKPFLPLDKNARRILVAGTHAD 426 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC LDA +EVV +K EV Y+ P+ ++F+ Q++S+AI Sbjct: 427 DLGYQCGGWTRYWQGSSGRITIGTTILDAFREVVGEKTEVIYDKYPSPDSFARQNFSFAI 486 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 V VGE PY E+ GD+ EL IPFNG+EL+S VAE++PTL+IL++GRPLV+EP LLEK+DAL Sbjct: 487 
VAVGEEPYAESVGDNSELIIPFNGSELISSVAERIPTLVILISGRPLVIEPWLLEKVDAL 546 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGLKC 1799 + AWLPG+EG GITDV+YGDY FEGRLP+TWF+++ +LPI++ +NS +PLFP GFGL C Sbjct: 547 IAAWLPGTEGRGITDVVYGDYEFEGRLPMTWFRAIKQLPINSEDNSCDPLFPLGFGLTC 605 >ref|XP_004150629.1| PREDICTED: lysosomal beta glucosidase-like [Cucumis sativus] Length = 611 Score = 826 bits (2134), Expect = 0.0 Identities = 399/597 (66%), Positives = 478/597 (80%), Gaps = 1/597 (0%) Frame = +3 Query: 6 DCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKP 185 DC+Y++P A +E R+KDLLSRM+L EKIGQMTQIERSV + SA+ D +GSVLSGG + P Sbjct: 6 DCMYRNPGAAIEDRIKDLLSRMSLREKIGQMTQIERSVVTPSALTDLAVGSVLSGGDNPP 65 Query: 186 FENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDA 365 F+ A S DWA+M+DGFQ+ AL++RLGIPI YG+DAVHG++NVYG TIFPHNVGLGATRD Sbjct: 66 FDKAMSLDWADMVDGFQSLALQSRLGIPIIYGIDAVHGSSNVYGATIFPHNVGLGATRDG 125 Query: 366 DLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQ 545 L R+IG VTA+EVRASG HYAFAPC+AV+RDPRWGR YESYSE TEVVRKMTSLV GLQ Sbjct: 126 KLVRRIGTVTALEVRASGVHYAFAPCLAVSRDPRWGRCYESYSEHTEVVRKMTSLVEGLQ 185 Query: 546 GQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNT-ITSYEDLERIHLAPYLD 722 G+PPEG+PKGYPF+AGR NV+A AKH+VGDGGT+KG+NEGNT I SY++LERIH+APYLD Sbjct: 186 GKPPEGYPKGYPFVAGRNNVIACAKHFVGDGGTDKGLNEGNTIIDSYDELERIHIAPYLD 245 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C++QG+ +VMASYSSWNG LHT HFLLT+VLK KLGF GF+ISDWEALDRL P GSNY Sbjct: 246 CIAQGLSTVMASYSSWNGNPLHTHHFLLTQVLKEKLGFKGFVISDWEALDRLSNPRGSNY 305 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 R I +NAGIDMVMVPFRYE F+ + LSLVESGEIP+ARIDDAVERILRVKF+AGLFE Sbjct: 306 RSCICTAVNAGIDMVMVPFRYEEFIKDLLSLVESGEIPIARIDDAVERILRVKFVAGLFE 365 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 +P +D SL+D+VGCK HR+LAREAVR+SLV +ILVAG+HAD Sbjct: 366 HPFSDRSLIDVVGCKIHRDLAREAVRKSLVLLRNGKDPMKPFLPLDRKAKKILVAGSHAD 425 Query: 1263 
NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC LDAIKE V D+ +V YE NP+ T + QD S+AI Sbjct: 426 DLGYQCGGWTISWNGSTGRTTVGTTILDAIKEAVGDQTKVIYEQNPSAVTLNDQDISFAI 485 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 V +GE PY E+ GD+ +L IPFNG E+V VA K+PTL+IL++GRPLVLEP ++E ++AL Sbjct: 486 VAIGESPYAESAGDNSKLIIPFNGNEIVKAVAGKIPTLVILISGRPLVLEPTVIENVEAL 545 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 + AWLPG+EGNGITDVI+GDY F GRLPVTWFK+V++LP+HA N + LFPFGFGL Sbjct: 546 IAAWLPGTEGNGITDVIFGDYDFTGRLPVTWFKTVEQLPVHAENNLQDSLFPFGFGL 602 >ref|XP_004240394.1| PREDICTED: lysosomal beta glucosidase-like [Solanum lycopersicum] Length = 604 Score = 825 bits (2130), Expect = 0.0 Identities = 403/597 (67%), Positives = 471/597 (78%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 MD VYK+P+A +E RVKDLLSRMTL EKIGQMTQIERSVA+ S I D IGS+LS GGS Sbjct: 1 MDFVYKNPSALIEERVKDLLSRMTLEEKIGQMTQIERSVATPSVITDLSIGSILSVGGSA 60 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFE+A S WA+M+DGFQ ALE+RLGIP+ YGVDA+HGNNNVYG T+FP NVGLGATRD Sbjct: 61 PFEDAPSEAWADMVDGFQKAALESRLGIPLLYGVDAIHGNNNVYGATVFPQNVGLGATRD 120 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL +KIG VTA+EVRA G +Y FAPCVAV RDPRWGR YESY EDTE++RKMTS+VTGL Sbjct: 121 ADLVQKIGIVTALEVRACGINYTFAPCVAVCRDPRWGRCYESYGEDTELIRKMTSIVTGL 180 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QGQPP G+P+ YPFLAGR V+A AKH+VGDGGT++GINEGNTI+SYEDLERIH+ PY+D Sbjct: 181 QGQPPPGYPQNYPFLAGRDKVVACAKHFVGDGGTDRGINEGNTISSYEDLERIHIPPYID 240 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C+SQGVC+VMASYS WNG LH+ HFLLTEVLK KLGF GF+ISD E +DR + PHGSNY Sbjct: 241 CISQGVCTVMASYSKWNGSHLHSSHFLLTEVLKGKLGFKGFVISDSEGIDRFFHPHGSNY 300 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 QSIL INAGIDMVMVPFRY+LFLD LVESG IPM RIDDAVERILRVKF++G FE Sbjct: 
301 DQSILAAINAGIDMVMVPFRYQLFLDHLKYLVESGNIPMTRIDDAVERILRVKFVSGAFE 360 Query: 1083 YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 PL+D SLLD VGC HRELAREAVR+SLV RILVAG HAD Sbjct: 361 NPLSDRSLLDTVGCHQHRELAREAVRKSLVLLKNGKDVTKPFLPLDRKAKRILVAGKHAD 420 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LG+QC L+AIK+ V + E+ YE NP+ +TF+ QD+SY I Sbjct: 421 DLGFQCGGWTKTWEGMGGRITIGTTILEAIKDAVGGETELVYEENPSPDTFASQDFSYCI 480 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 VVVGE PY E+GGD +L+IP G EL+SLVA++VPTL+IL++GRPL +EP +LEK+DA Sbjct: 481 VVVGEPPYCESGGDSQDLRIPLGGEELISLVADRVPTLVILISGRPLHIEPSILEKMDAF 540 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 V AWLPG+EG GITDVI+GD+ F G LP+TWFKSVD+LP+H +NS EPLFPFG+GL Sbjct: 541 VAAWLPGTEGTGITDVIFGDFEFHGTLPMTWFKSVDQLPLHQEQNSYEPLFPFGYGL 597 >ref|XP_007207129.1| hypothetical protein PRUPE_ppa003012mg [Prunus persica] gi|462402771|gb|EMJ08328.1| hypothetical protein PRUPE_ppa003012mg [Prunus persica] Length = 612 Score = 823 bits (2125), Expect = 0.0 Identities = 396/596 (66%), Positives = 477/596 (80%) Frame = +3 Query: 6 DCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKP 185 +C+Y++PN PVEARVKDLLSRMTL EK+GQMTQIER V++ AI+D IGSVLS GGS P Sbjct: 8 NCIYRNPNEPVEARVKDLLSRMTLKEKVGQMTQIERRVSTPDAIRDFSIGSVLSAGGSVP 67 Query: 186 FENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDA 365 FE A S+DWA+M+DGFQ ALE+RLGIP+ YG+DAVHGNN+VYG TIFPHNVGLGATRDA Sbjct: 68 FEKALSSDWADMVDGFQRSALESRLGIPLIYGIDAVHGNNSVYGATIFPHNVGLGATRDA 127 Query: 366 DLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQ 545 DL ++IG TA+EVRASG HY FAPCVAV RDPRWGR YESYSEDTE+VRKMTS+VTGLQ Sbjct: 128 DLVKRIGAATALEVRASGIHYTFAPCVAVCRDPRWGRCYESYSEDTEIVRKMTSIVTGLQ 187 Query: 546 GQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDC 725 GQPP+G+PKGYPF+ GR N +A AKH+VGDGGT KG+NEGNTI+SY+DLERIH+APYL+C Sbjct: 188 GQPPQGYPKGYPFVLGRNNTIACAKHFVGDGGTHKGLNEGNTISSYDDLERIHMAPYLNC 247 
Query: 726 LSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYR 905 +S GV +VMASYSSWNG KLH D FLLTE+LK+KLGF GF+ISDWEALD+L P G++YR Sbjct: 248 ISDGVSTVMASYSSWNGSKLHADRFLLTEILKDKLGFKGFVISDWEALDQLCEPRGADYR 307 Query: 906 QSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEY 1085 I +NAGIDMVMVPFRYE F+ + + LVE G I M+RIDDAVERILRVKF++GLFE+ Sbjct: 308 FCISSAVNAGIDMVMVPFRYEQFVKDLVYLVEHGNISMSRIDDAVERILRVKFVSGLFEH 367 Query: 1086 PLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADN 1265 P +D SLLD+VGCK HR+LAREAVR+SLV RILVAGTHAD+ Sbjct: 368 PFSDRSLLDMVGCKLHRDLAREAVRKSLVLLKNGKDSRKPFLPLDRKAKRILVAGTHADD 427 Query: 1266 LGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIV 1445 LGYQC L+AI++ V D E+ YE P+ +T + +D S+AIV Sbjct: 428 LGYQCGGWTATWDGRSGRITTGTTVLEAIQKAVGDDTEIIYEQYPSADTLAREDISFAIV 487 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 VGEGPY E GD+LEL IPFNGT+++S VA+++PTL+IL++GRPL LEP LLEK+DALV Sbjct: 488 AVGEGPYAEFRGDNLELAIPFNGTDVISSVADRLPTLVILISGRPLTLEPWLLEKMDALV 547 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPGSEG GI DVI+GDY FEG LPV+WFK V++LP++A +NS +PL+P G+GL Sbjct: 548 AAWLPGSEGEGIADVIFGDYDFEGLLPVSWFKRVEQLPMNALDNSYDPLYPLGYGL 603 >ref|XP_006404423.1| hypothetical protein EUTSA_v10010212mg [Eutrema salsugineum] gi|557105542|gb|ESQ45876.1| hypothetical protein EUTSA_v10010212mg [Eutrema salsugineum] Length = 613 Score = 822 bits (2124), Expect = 0.0 Identities = 405/596 (67%), Positives = 473/596 (79%), Gaps = 1/596 (0%) Frame = +3 Query: 9 CVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKPF 188 CVYK+P+APVEARV+DLLS MTL EKIGQMTQIER+VAS +AI D FIGSVL+ GGS PF Sbjct: 8 CVYKNPDAPVEARVQDLLSHMTLPEKIGQMTQIERTVASPAAITDFFIGSVLNAGGSVPF 67 Query: 189 ENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDAD 368 E+AKS+DWA+MIDGFQ AL +RLGIPI YG DAVHGNNNVYG T+FPHN+GLGATRDAD Sbjct: 68 EDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDAD 127 Query: 369 
LTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQG 548 L R+IG TA+EVRASG H+AFAPCVAV RDPRWGRSYESY ED +V +MTSLV+GLQG Sbjct: 128 LVRRIGAATALEVRASGVHWAFAPCVAVLRDPRWGRSYESYGEDAGLVCEMTSLVSGLQG 187 Query: 549 QPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDCL 728 PPE HP GYPF+AGR NV+A AKH+VGDGGT+KGINEGNTITSYEDLE+IH+ PYL+CL Sbjct: 188 VPPEEHPNGYPFVAGRNNVVACAKHFVGDGGTDKGINEGNTITSYEDLEKIHIPPYLNCL 247 Query: 729 SQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYRQ 908 +QGV +VMASYSSWNG +LH D+FLLTEVLK KLGF GF+ISDWE LDRL P GSNYR Sbjct: 248 AQGVSTVMASYSSWNGSRLHADYFLLTEVLKEKLGFKGFVISDWEGLDRLSEPQGSNYRN 307 Query: 909 SILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEYP 1088 I L +NAG+DMVMVPF+YE F+ + SLVESGEI MAR++DAVERILRVKF+AGLFE+P Sbjct: 308 CIKLAVNAGVDMVMVPFKYEKFIQDMTSLVESGEILMARVNDAVERILRVKFVAGLFEHP 367 Query: 1089 LTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNL 1268 LTD SLL VGC+ HRE+AREAVR+SLV RILV G HAD+L Sbjct: 368 LTDRSLLGTVGCEKHREVAREAVRKSLVLLKKGENADKPFLPLDRNAKRILVTGPHADDL 427 Query: 1269 GYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETF-SGQDYSYAIV 1445 GYQC LDAIK V DK EV YE P+ ET S + +SYAIV Sbjct: 428 GYQCGGWTKTWFGLSGKITIGTTLLDAIKAAVGDKTEVIYEKTPSKETLASSEGFSYAIV 487 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 VGE PY ET GD EL IPFNG+++V+ VAE++PTL+IL++GRP+VLEP +LEK +ALV Sbjct: 488 AVGESPYAETIGDSSELIIPFNGSDIVTTVAERIPTLVILISGRPVVLEPAVLEKTEALV 547 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPG+EG G+ DVI+GDY FEG+LPV+WFK+V+ LP++A NS +PLFP GFGL Sbjct: 548 AAWLPGTEGQGMADVIFGDYDFEGKLPVSWFKTVEHLPLNAHANSYDPLFPLGFGL 603 >ref|XP_002313393.1| glycosyl hydrolase family 3 family protein [Populus trichocarpa] gi|222849801|gb|EEE87348.1| glycosyl hydrolase family 3 family protein [Populus trichocarpa] Length = 603 Score = 821 bits (2120), Expect = 0.0 Identities = 397/597 (66%), Positives = 466/597 (78%) Frame = +3 Query: 9 
CVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKPF 188 C+YKDPN+P+EARVKDLLSRMTL EK+ QMTQIERS+ D +GSV++ GGS PF Sbjct: 6 CIYKDPNSPIEARVKDLLSRMTLKEKVAQMTQIERSLV------DYLVGSVMNAGGSAPF 59 Query: 189 ENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDAD 368 NAKS+DWA+M+D FQ AL++RLGIPI YG+DAVHGNN VYGTTIFPHNVGLGATRDAD Sbjct: 60 PNAKSSDWADMVDWFQKLALQSRLGIPIIYGIDAVHGNNGVYGTTIFPHNVGLGATRDAD 119 Query: 369 LTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQG 548 L R+IG TA+EVRA G Y FAPCVAV RDPRWGR YESYSEDT +VR+M S+VTGLQG Sbjct: 120 LVRRIGVATALEVRACGIQYTFAPCVAVCRDPRWGRCYESYSEDTNIVREMASIVTGLQG 179 Query: 549 QPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDCL 728 QPPEGHP GYPFLAGR NV+A AKH+VGDGGT KG+NEG+TI SYEDLERIH+APYLDC+ Sbjct: 180 QPPEGHPNGYPFLAGRNNVIACAKHFVGDGGTHKGLNEGDTILSYEDLERIHMAPYLDCI 239 Query: 729 SQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYRQ 908 SQGV ++M SYSSWNGR+LH HFLLTEVLK+KLGF GF+ISDWEALDRL P GSNYR+ Sbjct: 240 SQGVGTIMVSYSSWNGRQLHAHHFLLTEVLKDKLGFKGFVISDWEALDRLSKPLGSNYRR 299 Query: 909 SILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEYP 1088 + +NAG DMVMV ++ F+ + + L ESGEIPM RIDDAVERILRVKF+AGLFEYP Sbjct: 300 CVSTAVNAGTDMVMVGQKHREFMKDLIFLAESGEIPMTRIDDAVERILRVKFVAGLFEYP 359 Query: 1089 LTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNL 1268 D SLLD+VGCK HRELAREAVR+SLV +ILVAGTHADNL Sbjct: 360 FADRSLLDIVGCKLHRELAREAVRKSLVLLKNGKDPKKPLLPLDRSAKKILVAGTHADNL 419 Query: 1269 GYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAIVV 1448 GYQC LDAIKE + ++ EV YE P+ +T + QD+S+AIV Sbjct: 420 GYQCGGWTIAWNGMSGRITIGTTILDAIKEAIGEETEVIYEKIPSPDTLASQDFSFAIVA 479 Query: 1449 VGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALVV 1628 VGE PY E GD+ EL IPFNG +++S VA+K+PTL+IL++GRPLV+EP LLEKID L+ Sbjct: 480 VGEDPYAEFTGDNSELAIPFNGADIISSVADKIPTLVILISGRPLVIEPWLLEKIDGLIA 539 Query: 1629 AWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGLKC 1799 AWLPG+EG GITDVI+GDY F GRLPVTWF+ V++LP++ +NS EPLFP GFGL C Sbjct: 540 
AWLPGTEGEGITDVIFGDYDFSGRLPVTWFRKVEQLPMNLRDNSEEPLFPLGFGLTC 596 >ref|XP_006290760.1| hypothetical protein CARUB_v10016864mg [Capsella rubella] gi|482559467|gb|EOA23658.1| hypothetical protein CARUB_v10016864mg [Capsella rubella] Length = 609 Score = 818 bits (2114), Expect = 0.0 Identities = 403/596 (67%), Positives = 468/596 (78%), Gaps = 1/596 (0%) Frame = +3 Query: 9 CVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKPF 188 CVYK+ +APVEARVKDLLSRMTL EKIGQMTQIER+VAS AI D FIGSVL+ GGS PF Sbjct: 9 CVYKNRDAPVEARVKDLLSRMTLPEKIGQMTQIERNVASPVAITDSFIGSVLNAGGSAPF 68 Query: 189 ENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDAD 368 E+AKS+DWA+MIDGFQ AL +RLGIPI YG DAVHGNNNVYG T+FPHN+GLGATRDAD Sbjct: 69 EDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDAD 128 Query: 369 LTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQG 548 L R+IG TA+EVRASG H+AF+PCVAV RDPRWGR YESY ED +V +MTSLV+GLQG Sbjct: 129 LVRRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPGLVSEMTSLVSGLQG 188 Query: 549 QPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDCL 728 PPE HP GYPF+AGR NV+A AKH+VGDGGT+KGINEGNTI SYEDLERIH+ PYL CL Sbjct: 189 VPPEEHPNGYPFVAGRNNVVACAKHFVGDGGTDKGINEGNTIASYEDLERIHITPYLKCL 248 Query: 729 SQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYRQ 908 SQGV +VMASYSSWNG +LH D FLLTE+LK+KLGF GF++SDWE LDRL P GSNYR Sbjct: 249 SQGVSTVMASYSSWNGTQLHADRFLLTEILKDKLGFKGFLVSDWEGLDRLSKPQGSNYRN 308 Query: 909 SILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEYP 1088 I ++NAGIDMVMVP +Y+ F+ + LVESGEIPM R++DAVERILRVKF+AGLFE+P Sbjct: 309 CIKTSVNAGIDMVMVPLKYDQFIQDMTDLVESGEIPMDRVNDAVERILRVKFVAGLFEHP 368 Query: 1089 LTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNL 1268 LTD SLL VGCK HRELA+EAVR+SLV RILV GTHAD+L Sbjct: 369 LTDRSLLRTVGCKEHRELAQEAVRKSLVLLKNGKNADKPFLPLDRKAKRILVTGTHADDL 428 Query: 1269 GYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETF-SGQDYSYAIV 1445 GYQC LDAIK VV DK EV YE NP+ ET S + +SYAIV Sbjct: 429 
GYQCGGWTKTWFGLSGRITIGTTLLDAIKAVVGDKTEVIYEKNPSKETLASSEGFSYAIV 488 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 VGE PY E GD+ EL IPFNG+++V+ VAEK+PTL++L++GRP+VLE ++LEK DALV Sbjct: 489 AVGEPPYAEATGDNSELTIPFNGSDIVTAVAEKIPTLVVLISGRPVVLEQRVLEKTDALV 548 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPG+EG G+ DVI+GDY FEG+LPV+WFK V+ LP+ A NS +PLFPFGF L Sbjct: 549 AAWLPGTEGQGMADVIFGDYDFEGKLPVSWFKRVEHLPLDANANSYDPLFPFGFSL 604 >ref|XP_004294237.1| PREDICTED: lysosomal beta glucosidase-like [Fragaria vesca subsp. vesca] Length = 626 Score = 818 bits (2112), Expect = 0.0 Identities = 393/597 (65%), Positives = 473/597 (79%) Frame = +3 Query: 3 MDCVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSK 182 ++C+Y++PN P+EARVKDLLSRMTL EK+GQMTQIER VA+ SAIKD IGSVLS GGS Sbjct: 7 LNCIYRNPNEPIEARVKDLLSRMTLKEKVGQMTQIERRVATPSAIKDFHIGSVLSAGGSG 66 Query: 183 PFENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRD 362 PFENA SADWA+M+DGFQ A+ETRL IP+ YG+DAVHGNNNVYG TIFPHNVGLGATRD Sbjct: 67 PFENAVSADWADMVDGFQRSAMETRLRIPMVYGIDAVHGNNNVYGATIFPHNVGLGATRD 126 Query: 363 ADLTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGL 542 ADL +IG TA+EVRASG Y FAPCVAV RDPRWGR YESYSEDTE+VRKMTS+++GL Sbjct: 127 ADLAERIGVATALEVRASGIQYTFAPCVAVCRDPRWGRCYESYSEDTEIVRKMTSIISGL 186 Query: 543 QGQPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLD 722 QG+PP+GH GYPF+ GR + +A AKH+VGDGGT KG+NEGNTI+SY+DLERIH+APYLD Sbjct: 187 QGKPPQGHENGYPFVMGRNSTIACAKHFVGDGGTHKGLNEGNTISSYDDLERIHMAPYLD 246 Query: 723 CLSQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNY 902 C+SQGV +VMAS+SSWNG KLH D FLLTEVLKNKLGF GF+ISDW ALD++ P GSN Sbjct: 247 CISQGVSTVMASFSSWNGSKLHADRFLLTEVLKNKLGFKGFVISDWAALDKICDPPGSNN 306 Query: 903 RQSILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFE 1082 R IL INAGIDMVMVPF+++ F+++ + LV+ GEIPM+RIDDAVERILRVKF AGLFE Sbjct: 307 RFCILSGINAGIDMVMVPFKFQQFVEDLVYLVKHGEIPMSRIDDAVERILRVKFAAGLFE 366 Query: 1083 
YPLTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHAD 1262 YP D SLLD+VGCK HR+LAREAVR+SLV RILV+GTHAD Sbjct: 367 YPFADRSLLDIVGCKPHRDLAREAVRKSLVLLKNGKDPKKPFLPLDRKAKRILVSGTHAD 426 Query: 1263 NLGYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETFSGQDYSYAI 1442 +LGYQC L+AIK+ V ++ E+ YE +P+ +T + QD S+AI Sbjct: 427 DLGYQCGGWTATWDGKSGRITVGTTILEAIKKAVGEETEIIYEPHPSTDTLARQDISFAI 486 Query: 1443 VVVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDAL 1622 V VGE PY E GD +L IPFNG +++S VA+++PTL IL++GRPL LEP LL+K+DA Sbjct: 487 VAVGEDPYAEFTGDRADLVIPFNGPDIISSVADRIPTLAILISGRPLTLEPSLLKKMDAF 546 Query: 1623 VVAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 + AWLPGSEG GITDVI+GDY FEG+LPVTWF+S+ +LP++ G NS +PL+P GFGL Sbjct: 547 IAAWLPGSEGEGITDVIFGDYDFEGKLPVTWFRSIKQLPLNFGSNSYDPLYPLGFGL 603 >ref|NP_190284.1| glycosyl hydrolase family protein [Arabidopsis thaliana] gi|6522581|emb|CAB61946.1| beta-D-glucan exohydrolase-like protein [Arabidopsis thaliana] gi|17065280|gb|AAL32794.1| beta-D-glucan exohydrolase-like protein [Arabidopsis thaliana] gi|20259996|gb|AAM13345.1| beta-D-glucan exohydrolase-like protein [Arabidopsis thaliana] gi|20260350|gb|AAM13073.1| beta-D-glucan exohydrolase-like protein [Arabidopsis thaliana] gi|30725406|gb|AAP37725.1| At3g47000 [Arabidopsis thaliana] gi|332644709|gb|AEE78230.1| glycosyl hydrolase family protein [Arabidopsis thaliana] Length = 608 Score = 816 bits (2108), Expect = 0.0 Identities = 402/596 (67%), Positives = 467/596 (78%), Gaps = 1/596 (0%) Frame = +3 Query: 9 CVYKDPNAPVEARVKDLLSRMTLLEKIGQMTQIERSVASASAIKDRFIGSVLSGGGSKPF 188 CVYK+ +APVEARVKDLLSRMTL EKIGQMTQIER VAS SA D FIGSVL+ GGS PF Sbjct: 8 CVYKNGDAPVEARVKDLLSRMTLPEKIGQMTQIERRVASPSAFTDFFIGSVLNAGGSVPF 67 Query: 189 ENAKSADWAEMIDGFQTGALETRLGIPIFYGVDAVHGNNNVYGTTIFPHNVGLGATRDAD 368 E+AKS+DWA+MIDGFQ AL +RLGIPI YG DAVHGNNNVYG T+FPHN+GLGATRDAD Sbjct: 68 EDAKSSDWADMIDGFQRSALASRLGIPIIYGTDAVHGNNNVYGATVFPHNIGLGATRDAD 127 Query: 369 
LTRKIGEVTAIEVRASGAHYAFAPCVAVTRDPRWGRSYESYSEDTEVVRKMTSLVTGLQG 548 L R+IG TA+EVRASG H+AF+PCVAV RDPRWGR YESY ED E+V +MTSLV+GLQG Sbjct: 128 LVRRIGAATALEVRASGVHWAFSPCVAVLRDPRWGRCYESYGEDPELVCEMTSLVSGLQG 187 Query: 549 QPPEGHPKGYPFLAGRKNVLASAKHYVGDGGTEKGINEGNTITSYEDLERIHLAPYLDCL 728 PPE HP GYPF+AGR NV+A KH+VGDGGT+KGINEGNTI SYE+LE+IH+ PYL CL Sbjct: 188 VPPEEHPNGYPFVAGRNNVVACVKHFVGDGGTDKGINEGNTIASYEELEKIHIPPYLKCL 247 Query: 729 SQGVCSVMASYSSWNGRKLHTDHFLLTEVLKNKLGFMGFIISDWEALDRLYAPHGSNYRQ 908 +QGV +VMASYSSWNG +LH D FLLTE+LK KLGF GF++SDWE LDRL P GSNYR Sbjct: 248 AQGVSTVMASYSSWNGTRLHADRFLLTEILKEKLGFKGFLVSDWEGLDRLSEPQGSNYRY 307 Query: 909 SILLTINAGIDMVMVPFRYELFLDEFLSLVESGEIPMARIDDAVERILRVKFIAGLFEYP 1088 I +NAGIDMVMVPF+YE F+ + LVESGEIPMARI+DAVERILRVKF+AGLF +P Sbjct: 308 CIKTAVNAGIDMVMVPFKYEQFIQDMTDLVESGEIPMARINDAVERILRVKFVAGLFGHP 367 Query: 1089 LTDLSLLDLVGCKAHRELAREAVRRSLVXXXXXXXXXXXXXXXXXXXXRILVAGTHADNL 1268 LTD SLL VGCK HRELA+EAVR+SLV RILV GTHAD+L Sbjct: 368 LTDRSLLPTVGCKEHRELAQEAVRKSLVLLKSGKNADKPFLPLDRNAKRILVTGTHADDL 427 Query: 1269 GYQCXXXXXXXXXXXXXXXXXXXXLDAIKEVVEDKAEVTYELNPTLETF-SGQDYSYAIV 1445 GYQC LDAIKE V D+ EV YE P+ ET S + +SYAIV Sbjct: 428 GYQCGGWTKTWFGLSGRITIGTTLLDAIKEAVGDETEVIYEKTPSKETLASSEGFSYAIV 487 Query: 1446 VVGEGPYVETGGDDLELKIPFNGTELVSLVAEKVPTLMILVTGRPLVLEPQLLEKIDALV 1625 VGE PY ET GD+ EL+IPFNGT++V+ VAE +PTL+IL++GRP+VLEP +LEK +ALV Sbjct: 488 AVGEPPYAETMGDNSELRIPFNGTDIVTAVAEIIPTLVILISGRPVVLEPTVLEKTEALV 547 Query: 1626 VAWLPGSEGNGITDVIYGDYAFEGRLPVTWFKSVDKLPIHAGENSSEPLFPFGFGL 1793 AWLPG+EG G+ DV++GDY F+G+LPV+WFK V+ LP+ A NS +PLFPFGFGL Sbjct: 548 AAWLPGTEGQGVADVVFGDYDFKGKLPVSWFKHVEHLPLDAHANSYDPLFPFGFGL 603